Context-Aware Clustering

17
1 Context-Aware Clustering Junsong Yuan and Ying Wu EECS Dept., Northwestern University

description

Context-Aware Clustering. Junsong Yuan and Ying Wu EECS Dept., Northwestern University. Contextual pattern and co-occurrences. ?. Spatial contexts provide useful cues for clustering . K-means revisit. EM Update. Binary label indicator. Assumption: data samples are independent. - PowerPoint PPT Presentation

Transcript of Context-Aware Clustering

Page 1: Context-Aware Clustering

1

Context-Aware Clustering

Junsong Yuan and Ying WuEECS Dept., Northwestern University

Page 2: Context-Aware Clustering

2

Contextual pattern and co-occurrences

?

Spatial contexts provide useful cues for clustering

Page 3: Context-Aware Clustering

3

K-means revisit

Assumption: data samples are independent

Binary label indicator

Limitation: contextual information of spatial dependency is not considered in clustering data samples

EM Update

Page 4: Context-Aware Clustering

4

Clustering higher-level patterns

• Regularized k-means

Distortion in original feature space

Distortion in hamming spacecharactering contextual patterns

Same as traditionalK-means clustering

Regularization term due tocontextual patterns

Not a smooth term!

Page 5: Context-Aware Clustering

5

Chicken and Egg Problem• Hamming distance in clustering contextual patterns

• Matrix form

• Cannot minimize J1 and J2 separately !

J1 is coupled with J2

Page 6: Context-Aware Clustering

6

Decoupling

Fix Update

Fix Update

Page 7: Context-Aware Clustering

7

Nested-EM solution

NestedE-step

M-stepUpdate and separately

the nested-EM algorithm can converge in finite steps.

Theorem of convergence

Page 8: Context-Aware Clustering

8

Simulation results (feature space)

Page 9: Context-Aware Clustering

9

Simulation results (spatial space)

Page 10: Context-Aware Clustering

10

Page 11: Context-Aware Clustering

11

K-m

eansInitialization

Page 12: Context-Aware Clustering

12

1st roundFinal Phrases

Page 13: Context-Aware Clustering

13

Page 14: Context-Aware Clustering

14

Page 15: Context-Aware Clustering

15

Page 16: Context-Aware Clustering

16

Multiple-feature clustering• Dataset: handwritten numerical (‘0’-‘9’) from UCI data set

– Each digit has three different types of features– Contextual pattern corresponds to compositional feature

• Different types of features serve as contexts of each other– Clustering each type of features into 10 “words”– Clustering 10 “phrases” based on a word-lexicon of size 3x10

Page 17: Context-Aware Clustering

17

Conclusion• A context-aware clustering formulation proposed

– Targets on higher-level compositional patterns in terms of co-occurrences

– Discovered contextual patterns can feed back to improve the primitive feature clustering

• An efficient nested-EM solution which is guaranteed to converge in finite steps

• Successful applications in image pattern discovery and multiple-feature clustering– Can be applied to other general clustering problems