Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works...

18
Product Review Summarization Ly Duy Khang

Transcript of Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works...

Page 1: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

Product Review Summarization

Ly Duy Khang

Page 2: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

Outline

1. Motivation2. Problem statement3. Related works4. Baseline5. Discussion

Page 3: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

1. Motivation (1)

• A rapid expansion of e-commerce, where more and more products are sold via online portals (Amazon, eBay … )

• Online product reviews thus become an important resource:– Customers to share and find opinions about

products easily– Producers to get certain degrees of feedback

Page 4: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

1. Motivation (2)

Page 5: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

2. Problem statement

• Given a set of reviews of a product, produce an abstractive summary that captures users’ opinions about that product

Page 6: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

3. Related works (1)

• Single-document summarization– Extractive-based approach• Sentence score + ranking• Machine learning technique

– Abstractive-based approach• Template• Concept hierarchy

Page 7: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

3. Related works (2)

• Multi-document summarization– Extractive-based approach• Sentence score + ranking + MMR + Ordering

– Abstractive-based approach• Template• Concept hierarchy• Sentence fusion with paraphrasing rules

Page 8: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

3. Related works (3)

• Sentiment analysis– Reviews polarity classification– PROS/ CONS identification– Mining review opinions• Identify product facets• Identify opinion orientation on the facet

Page 9: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

4. Baseline (1)

• Extractive based summary• An integration between Liu et. al. (2004) and

NUS - DUC 2005

Page 10: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

4. Baseline (2)

Page 11: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

4. Baseline (3)

• Product facets identification– Association rule mining

• Each transaction consists of nouns/noun phrases from single sentence

• The frequent itemsets are the candidate product facets

– Redundancy pruning• Removing redundant facets that contain only single words.

(e.g. life -> battery life)

– Compactness pruning• Removing meaningless facets that contain multiple words

Page 12: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

4. Baseline (4)

• Sentiment classification– WordNet to grow seed lists of (+) and (-) ADJ– ADJ share the same orientation as their synonyms

and opposite orientation as their antonyms

Page 13: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

4. Baseline (5)

• Reviews labeling with facets and polarity– The unit of labeling is sentence– The summation of all these polarities yields the

polarity of the whole sentence

Page 14: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

4. Baseline (6)

• Summary generation– Sentences are clustered based on their labeling– For each facet, we produce a summary• Sentences are scored based on concept link similarity• MMR ranks the sentences

Page 15: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

5. Discussion (1)

• Evaluation

– We plan to carry on human evaluation.

Page 16: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

5. Discussion (2)

• In the baseline,– Inherit all problems of extractive-based summary– The unit of sentence is too coarse-grained– Relationship between facets are not addressed

Page 17: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

References[1] V. Hatzivassiloglou, J. L. Klavans, M. L. Holcombe, R. Barzilay, M. Y. Kan,

and K. R. Mckeown. SimFinder: A Flexible Clustering Tool for Summarization. Machine Learning, 1999.

[2] R. Barzilay, K. R. Mckeown, and M. Elhadad. Information fusion in the context of multidocument summarization. Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, page 550-557, 1999.

[3] I. Mani and M. T. Maybury. Advances in automatic text summarization. 1999.

[4] R. Mooney and G. DeJong. Learning schemata for natural language processing. Strategied for Natural Lanaguage Processing, pages 146 - 176.

[5] E. Hovy and C. Lin. Automated text summarization in SUMMARIST. Advances in Automatic Text Summarization, 94, 1999.

Page 18: Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.

[6] M. Hu and B. Liu. Mining and summarizing customer reviews. Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, page 168-177, 2004.

[7] M. Hu and B. Liu. Mining opinion features in customer reviews. Proceedings of the National Conference on Articial Intelligence, page 755760, 2004.

[8] S. Ye, L. Qiu, T. S. Chua, and M. Y. Kan. NUS at DUC 2005: Understanding Documents via Concept Links. Document Understanding Conference (DUC05), 2005.

[9[ X. Ding, B. Liu, and P. S. Yu. A holistic lexicon-based approach to opinion mining Proceedings of the international conference on Web search and web data mining – WSDM '08, page 231, 2008.