Académique Documents
Professionnel Documents
Culture Documents
Like for example look at the lattice below to identify the maximal itemsets
In this lattice, if we closely observe the red border it selectively highlights three
itemsets in blue which are maximal frequent itemsets. This can be easily seen as
none of the supersets of the blue color itemsets are above the border.
However, one problem associated with maximal frequent itemsets is that even
though we know that all its subsets are frequent we do not know their supports.
Actual support information is very important for these itemsets when we are
deriving the association rules. So now we shift our goal and try to find out all such
frequent itemsets that have the same support as their subsets. This forms the
notion of a closed frequent itemset.
Now to understand the notion of closed frequent itemsets lets look at the
example below.
Itemset Support
TID Items {A} 4 Itemset Support
{B} 5
1 {A,B} {C} 3
{A,B,C} 2
2 {B,C,D} {D} 4 {A,B,D} 3
3 {A,B,C,D} {A,B} 4 {A,C,D} 2
{A,C} 2
4 {A,B,D} {A,D} 3
{B,C,D} 3
5 {A,B,C,D} {B,C} 3 {A,B,C,D} 2
{B,D} 4
{C,D} 3
In the above process we have derived the set of frequent itemsets from the
transactions. Now let’s represent the itemset lattice and identify the closed and
frequent itemsets. The minimum support value here in the current example is set
to 2. The labels in red for each node in the lattice represent the set of transactions
where the element occurs. (support)
We have provided labeling for few of the nodes which can be verified by checking
once. C,D,E are closed because one can observe that none of the supersets have
the same support but there does exist a frequent superset hence it’s not maximal.
But for CE and DE one can observe that both the properties are satisfied and
hence it comes under the category of both Closed and Maximal.