Association Rule
Goal: Find the most frequently appearing X=(X1,...,Xp). This problem can be viewed as to the problem finding the frequent subsets (v1,...vL),vj⊂X, such that the probability density P(vl) evaluated at each of those values is relative large.
In most cases Xj∈{0,1}, where it is referred to as "market basket" analysis. For observation i, each variable Xj is assigned one of two values; xij=1 if the jth item is purchased. In this setting of the goal, X=vl will nearly always be too small for reliable estimation. Thus we need to modify our goal as following way.
Modified Goal: Instead of seeking values x where P(x)is large, We seeks regions of the Xspace with high probability content relative to their size or support. Then, the modified goal is to find subsets of variable s1,...,sp such that the probability of each of the variables is relative large.
The intersection part is called a conjunctive rule. The subsets sj are interval for quantitative Xj.
K⊂{1,...,P},P=∑j=1p∣Sj∣. ∣Sj∣ is the number of distinct values attainable by Xj. K is called an item set.
Market Basket Analysis
Last updated