Mining Negative Association Rules

14
2006/9/7 1 Mining Negative Association Rules Xiaohui Yuan, Bill P. Buc kles Zhaoshan Yuan Jian Zhang ISCC 2002

description

Mining Negative Association Rules. Xiaohui Yuan, Bill P. Buckles Zhaoshan Yuan Jian Zhang ISCC 2002. Outline. Motivation Problem define Algorithm Conclusion & Thought. Motivation. conf (age < 30 →coupe ) = 0.3/0.4 =75% conf (age > 30 → not buy coupe) = 0.5/0.6 = 83.3%. Problem. - PowerPoint PPT Presentation

Transcript of Mining Negative Association Rules

2006/9/7 1

Mining Negative Association Rules

Xiaohui Yuan, Bill P. Buckles

Zhaoshan Yuan

Jian Zhang

ISCC 2002

2006/9/7 2

Outline

Motivation Problem define Algorithm Conclusion & Thought

2006/9/7 3

Motivation

conf(age < 30 →coupe ) = 0.3/0.4 =75% conf(age > 30 →not buy coupe) = 0.5/0.6 =

83.3%

2006/9/7 4

Problem

The difficulty for mining negative rulesCan’t simply pick threshold values for support

and confidence.Thousands of items are included in the

transaction records.

2006/9/7 5

Positive association rule, supp() and conf()

( )supp X Y

( )( )

( )

P X Yconf X Y

P X

, , , and = 0X Y X I Y I X Y

2006/9/7 6

Negative association rule, supp() and conf()

( ) 1 ( )conf X Y conf X Y

( ) ( ) ( )supp X Y supp Y supp Y X

( ) ( ) ( )supp X Y supp X supp X Y

( )( ) (1 ( ))

1 ( )

P Yconf X Y conf Y X

P X

( ), , , and 0X Y X Y X I Y I X Y

2006/9/7 7

Sibling relationships

Called LOS and denote as[i1,i2…..,im][IBM Aptiva , Compaq]Extend [IBM Aptiva , Compaq, Notebook]

2006/9/7 8

LOS

Locality of Similarity (LOS)Similarity assumption

Sibling rule If the item set X’ = {i1, i2,….ik,…,im} is the same as X ex

cept item ik is substituted for ih.

Rule r : X→Y and Rule r’: X’→Y

2006/9/7 9

Discover Negitive Rule

If rule r’: X’→Y is not support ,it may exist negative rule.

Salience measure (distance betewwn conf level)

E(): estimated conf

( ') E( ( '))SM conf r conf r

2006/9/7 10

Condictions

for qualify a negative rule

1. there must exist a large deviation between the estimated and actual confidence.

2. the support and confidence are greater than the minima required.

2006/9/7 11

Pruning

Equivalent or similar pair

Exam:

Negative rule and positive rule are couple

Exam:” female →BuyHat “ and “﹁male→BuyHat”

, andX Y Y X

2006/9/7 12

Algorithm

2006/9/7 13

Conclusion

It’s not depend on numbers of transactions Complexity O(P x L)

P : number of positive ruleL : average size of LOS

2006/9/7 14

Thought

Estimate from postive to find negative cost , how ?