Académique Documents
Professionnel Documents
Culture Documents
Mamata Jenamani
Professor
Department of Industrial & Systems Engineering
Association based recommendation system
• A variation of collaborative filtering
• Recommending the items that can be purchased
with the items that users have purchased in the past
or have shown interest to purchase
– Co-occurrences of items that the users frequently
preferred to purchase/view together
• Information used
– Unary rating
• Type of recommendation decision
– Prediction
– Top-N recommendations
• Personalized
Introduction to frequent pattern analysis
• Frequent pattern: a pattern (a set of items, subsequences,
substructures, etc.) that occurs frequently in a data set
• Frequent pattern analysis is the basis of association rule
mining
• Motivation: Finding inherent regularities in data
– What products were often purchased together?
– What are the subsequent purchases after buying a PC?
– What kinds of DNA are sensitive to this new drug?
– Can we automatically classify web documents?
• Applications
– Basket data analysis, cross-marketing, catalog design, sale campaign
analysis, Web log (click stream) analysis, and DNA sequence analysis.
Basic Concepts: Frequent Patterns
and Association Rules
Transaction-id Items bought
10 A, B, D Itemset X = {x1, …, xk}
20 A, C, D Find all the rules X Y with
30 A, D, E minimum support and confidence
40 B, E, F
support, s, probability that a
50 B, C, D, E, F
transaction contains X Y
Customer Customer confidence, c, conditional
buys both buys B
probability that a transaction
having X also contains Y
Let supmin = 50%, confmin = 50%
Freq. Pat.: {A:3, B:3, D:4, E:3, AD:3}
Customer
Association rules:
buys A
A D (60%, 100%)
D A (60%, 75%)
Interestingness measures
• Generate association
rules 20 B, C, E
30 A, B, C, E
40 B, E
Solution Supmin = 2 (50%)
Itemset sup
Database TDB Itemset sup
C1 {A} 2 L1 {A} 2
Tid Items {B} 3
{B} 3
10 A, C, D {C} 3
20 B, C, E
1st scan {C} 3
{D} 1
{E} 3
30 A, B, C, E {E} 3
40 B, E
C2 Itemset sup C2 Itemset
Itemset sup {A, B} 1
L2 2nd scan {A, B}
{A, C} 2 {A, C} 2
{B, C} 2 {A, E} 1 {A, C}
{B, E} 3 {B, C} 2 {A, E}
{C, E} 2 {B, E} 3 {B, C}
{C, E} 2 {B, E}
Itemset sup {C, E}
{A, B, C} 1
3rd scan L3 Itemset sup
C3 {A, B, C, E} 1 {B, C, E} 2
{A, C, E} 1
{B, C, E } 2
Solution