我读到的注释似乎是这样说的:给定所有封闭频繁项集的集合及其支持计数,可以获得任何频繁项集中的支持计数。
A frequent itemset is called closed if no larger itemset properly contains it and has the same support count.
试图证明这一点,但无法解决。
以下是一些与关联规则挖掘相关的定义链接:
我读到的注释似乎是这样说的:给定所有封闭频繁项集的集合及其支持计数,可以获得任何频繁项集中的支持计数。
A frequent itemset is called closed if no larger itemset properly contains it and has the same support count.
试图证明这一点,但无法解决。
以下是一些与关联规则挖掘相关的定义链接:
封闭项集X是不包含在具有相同支持的另一项集中的项集。
所有项目集Y1、Y2、Y3。。被包括在X中并且具有相同支持的YN被称为在相同的等价类中。它们不是封闭项集,因为它们包含在具有相同支持(X)的较大项集中。
现在让我们假设你有所有频繁闭项集C的集合,并且你想知道项集F的支持。
你需要做的很简单。您需要将F与所有频繁闭合项集进行比较。你必须找到最小的闭项集W,使得F包含在W中。那么F的支持就是W的支持。
如果你想了解更多关于封闭项目集的详细信息,我建议你阅读Pasquier的论文:
如果你想要一些挖掘封闭项集的算法源代码,你可以查看我的Java项目:
http://www.philippe-fournier-viger.com/spmf/
它提供AprioriClose和DCI_Closed。
You know that no set can have higher support than its subsets... so the support of any given itemset equals the support of the most frequent superset: sup(x) = max{y.support | y is superset of x and y is in the closed frequent itemsets}
存在一种算法来产生对所有项集的支持,给定闭频繁项集及其支持:
kmax = size of largest closed itemset
Fmax = closed frequent itemsets of size kmax
for k = kmax downto 1 do
Fk = {f | f immediate subset of f in Fk+1 or f is closed | |f|=k}
for every f in Fk do
if f is not closed
f.support = max{f .support | f in Fk+1 , f is a superset of f}
endif
endfor
endfor
来源:http://www.cs.helsinki.fi/group/bioinfo/teaching/dami_s10/dami_lecture4.pdf
If I have an algorithm which is comprised of (let s say) three sub-algorithms, all with different O() characteristics, e.g.: algorithm A: O(n) algorithm B: O(log(n)) algorithm C: O(n log(n)) How do ...
I m using Electro in Lua for some 3D simulations, and I m running in to something of a mathematical/algorithmic/physics snag. I m trying to figure out how I would find the "spin" of a sphere of a ...
There s a (relatively) new sort on the block called Timsort. It s been used as Python s list.sort, and is now going to be the new Array.sort in Java 7. There s some documentation and a tiny Wikipedia ...
As a learning experience I recently tried implementing Quicksort with 3 way partitioning in C#. Apart from needing to add an extra range check on the left/right variables before the recursive call, ...
Given a 10 digit Telephone Number, we have to print all possible strings created from that. The mapping of the numbers is the one as exactly on a phone s keypad. i.e. for 1,0-> No Letter for 2->...
I have a directed graph and my problem is to enumerate all the minimal (cycles that cannot be constructed as the union of other cycles) directed cycles of this graph. This is different from what the ...
Given an array of integers arr = [5, 6, 1]. When we construct a BST with this input in the same order, we will have "5" as root, "6" as the right child and "1" as left child. Now if our input is ...
I was trying to speed up a certain routine in an application, and my profiler, AQTime, identified one method in particular as a bottleneck. The method has been with us for years, and is part of a "...