1
Some interesting directions in Automatic Summarization
Annie Louis
CIS 430
12/02/08
2
Today’s lecture
Multi-strategy summarization
Is one method enough?
Performance Confidence Estimation
Would be nice to have an indication of expected system performance on an input
Evaluation without human models
Can we come up with cheap and fast evaluation measures?
Beyond generic summarization
Query focused, updates, blogs, meetings, speech..
3
Relevant papers:
Lacatusu et al. LCC's GISTexter at DUC 2006: Multi-Strategy Multi-Document Summarization. In Proceedings of the Document Understanding Workshop (DUC 2006).
McKeown et al. Columbia Multi-Document Summarization: Approach and Evaluation. In Proceedings of the Document Understanding Conference (DUC 2001), 2001.
Nenkova et al. Can You Summarize This? Identifying Correlates of Input Difficulty for Multi-Document Summarization. In Proceedings of ACL-08: HLT.
4
More about DUC 2002 data…
/project/cis/nlp/tools/Summarization_Data/Inputs2002
Newswire texts
There are 3 categories of inputs!
5
DUC 2002 input categories
Single event – 30 inputs
E.g. d061 – Hurricane Gilbert. Same place, roughly same time, same actions
Multiple distinct events – 15 inputs
E.g. d064 – Opening of McDonald's in Russia, Canada, South Korea... Different places, different times, different agents
Biographies – 15 inputs
E.g. d065 – Dan Quayle, Bush's nominee for vice president. One person, one event, background info, events from the past
Do you think a single method will do well for all?
6
Tf-idf summary - d061
Hurricane Gilbert Heads Toward Dominican Coast . Tropical Storm Gilbert formed in the eastern Caribbean and strengthened into a
hurricane Saturday night. Gilbert Reaches Jamaican Capital With 110 Mph Winds . Hurricane warnings were posted for the Cayman Islands, Cuba and Haiti. Hurricane Hits Jamaica With 115 mph Winds; Communications. Gilbert reached Jamaica after skirting southern Puerto Rico, Haiti and the
Dominican Republic. Gilbert was moving west-northwest at 15 mph and winds had decreased to 125
mph. What Makes Gilbert So Strong? With PM-Hurricane Gilbert, Bjt . Hurricane Gilbert Heading for Jamaica With 100 MPH Winds . Tropical Storm Gilbert
7
Tf-idf summary - d064
First McDonald's to Open in Communist Country . Police Keep Crowds From Crashing First McDonald's . McDonald's and Genex contribute $1 million each for the flagship
restaurant. A Bolshoi Mac Attack in Moscow as First McDonald's Opens . McDonald's Opens First Restaurant in China . McDonald's hopes to open a restaurant in Beijing later. The 500-seat McDonald's restaurant in a three-story building is
operated by McDonald's Restaurant Shenzhen Ltd., a wholly owned subsidiary of McDonald's Hong Kong.
McDonald's Hong Kong is a 50-50 joint venture with McDonald's in the United States.
McDonald's officials say it is not a question that
8
Tf-idf summary - d065
Tucker was fascinated by the idea, Quayle said. But Dan Quayle's got experience, too. Quayle's Triumph Quickly Tarnished . Quayle's Biography Inflates State Job; Quayle Concedes Error . Her statement was released by the Quayle campaign. But he would go no further in describing what assignments he
would give Quayle. ``I will be a very close adviser to the president,'' Quayle said. ``You're never going to see Dan Quayle telling tales out of It was everything Quayle had hoped for. Quayle had said very little and he had said it very well. There are windows into the workings of the
9
Multi-strategy summarization
Multiple summarization modules within a single system
Better than a single method
How to employ a multi-strategy system?
Use all methods, produce multiple summaries, choose the best
Use a router and summarize with only one specific method
10
Produce multiple summaries and choose – LCC GISTexter
Task – query focused summarization
The query is decomposed by 3 methods
Sent to a QA system and a multi-document summarizer
6 different summaries
Select the best summary
Textual entailment + pyramid scoring
11
Route to a specific module – Columbia's multi-document summarizer
Features classify an input as:
Single event
Biography
Loosely connected documents
The result of the classification routes the input to one of 3 different summarizers
12
Features - Single Event
To identify:
Time span between publication dates < 80 days
More than 50% of documents published on the same day
To summarize:
Exploit redundancy, cluster similar sentences into themes
Rank themes based on size, similarity, and the ranking of contained sentences by lexical chains
Select phrases from each theme
Generate sentences
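The identification features above can be sketched as a small routing check. The thresholds (80-day span, 50% same-day publication) are the ones on the slide; combining them with a logical OR is an illustrative assumption:

```python
from datetime import date

def looks_like_single_event(pub_dates, max_span_days=80, same_day_frac=0.5):
    """Heuristic single-event check over publication dates.

    The thresholds (< 80-day span, > 50% same-day publication) come from
    the slide; treating either condition as sufficient (OR) is an
    illustrative assumption, not the Columbia system's exact rule."""
    span = (max(pub_dates) - min(pub_dates)).days
    if span < max_span_days:
        return True
    # Fraction of documents sharing the most common publication date
    most_common = max(pub_dates.count(d) for d in set(pub_dates))
    return most_common / len(pub_dates) > same_day_frac
```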
13
Features - Biographies
To identify:
Frequency of the most frequent capitalized word > X (compensating for named entities)
Frequency of personal pronouns > Y
To summarize:
Is the target individual mentioned in the sentence?
Is another individual found in the sentence?
Position of the most prominent capitalized word in the sentence
14
Features – Weakly related documents
To identify:
Neither single event nor biographical
To summarize:
Words likely to be used in first paragraphs, i.e. important words – learnt from corpus analysis
Verb specificity
Semantic themes – WordNet concepts
Positional and length features
More weight to recent articles
Downweight sentences with pronouns
15
Characterizing/classifying inputs
Important if you want to route to a specialized summarizer
Classification can be made along several lines:
Theme of input – Columbia's summarizer
Scientific/news articles
Long/short documents
News articles about events/editorials
Difficult/easy??
16
Input difficulty and Performance Confidence Estimation
Some inputs are more difficult than others
– Most summarizers produce poor summaries for these inputs
17
Input to summarizer
Some inputs are easier than others!
Average system scores obtained on different inputs for 100-word summaries:
mean 0.55
min 0.07
max 1.65
Data: DUC 2001; score range 0-4
18
Input difficulty & content coverage scores
Content coverage score: extent of coverage of important content
Poor content selection -> low score
If most summaries for an input get low scores, most systems could not identify important content: a "difficult input"
19
Multi-document inputs were from 5 categories, each a set of documents describing...
Single event: the Exxon Valdez oil spill
Subject: mad cow disease
Biographical: Elizabeth Taylor
Multiple distinct events: different occasions of police misconduct
Opinion: views of the senate, public, congress, lawyers etc. on the decision by the senate to count illegal aliens in the 1990 census
Single task – generic summarization
Did system performance vary with DUC 2001 input categories?
Cohesive / “On topic” Inputs
Non Cohesive/ “Multiple facets” Inputs
20
Input type influenced the scores obtained
Biographical, single event, and subject inputs
are easier to summarize than
multiple distinct events and opinions
21
Cohesive inputs are easier to summarize
Cohesive: biographical, single event, subject
Non-cohesive: multiple distinct events, opinions
Scores for cohesive inputs are significantly* higher than those for non-cohesive inputs at 100, 200 and 400 words
* One-sided t-tests, 95% significance level
22
Inputs can be easy or difficult =>
Better summarizers ~ different methods to summarize different inputs: multi-strategy
Enhancing user experience ~ a system can flag summaries likely to be poor in content: low system confidence on difficult inputs
23
First step..
What characterizes difficult inputs? Find useful features
Can we identify difficult inputs with high accuracy? Classification task – difficult vs. easy
24
Features – Simple length-based
Smaller inputs ~ less loss of information ~ better summaries
Number of sentences ~ information to be captured in the summary
Vocabulary size ~ number of unique words
25
Features – Word distributions in the input
% of words used only once ~ lexical repetition
less repetition of content ~ difficult inputs
Type-token ratio ~ lexical variation in the input
fewer types ~ easy inputs
Entropy of the input
~ descriptive words ~ high probabilities ~ low entropy ~ easy
H(X) = - Σ_{i=1..n} p(w_i) log2 p(w_i)
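The entropy feature can be sketched in a few lines over the input's unigram distribution:

```python
import math
from collections import Counter

def input_entropy(words):
    """H(X) = -sum_i p(w_i) * log2 p(w_i) over the input's unigrams.

    A few dominant, descriptive words give high probabilities and thus
    low entropy, which the slides associate with easy inputs."""
    counts = Counter(words)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())
```

For example, an input of two equally frequent words has an entropy of exactly 1 bit.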
26
Features – Document similarity and relatedness
Documents with overlapping content ~ easy input
Pair-wise cosine overlap (average, min, max) ~ similarity of the documents
High cosine overlaps ~ overlapping content ~ easy to summarize
cos(v1, v2) = (v1 . v2) / (|v1| |v2|)
v1, v2 – tf-idf weights of the content words of the 2 documents
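The pair-wise overlap is plain cosine similarity over sparse word vectors; this sketch takes word -> weight dicts (the slides use tf-idf weights of content words, but any weighting can be plugged in):

```python
import math

def cosine(v1, v2):
    """cos(v1, v2) = (v1 . v2) / (|v1| |v2|) for sparse word -> weight dicts.

    The slides weight content words by tf-idf; the weighting scheme is
    supplied through the dict values."""
    dot = sum(w * v2.get(t, 0.0) for t, w in v1.items())
    n1 = math.sqrt(sum(w * w for w in v1.values()))
    n2 = math.sqrt(sum(w * w for w in v2.values()))
    return dot / (n1 * n2) if n1 and n2 else 0.0
```

Averaging cosine() over all document pairs in an input gives the average variant of the feature; min and max follow the same way.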
27
Features – Document similarity and relatedness
Tightly bound by topic ~ easy input
KL divergence ~ distance from a large collection of random documents
Difference between 2 language models: input & random collection
Greater divergence ~ input is unlike random documents ~ tightly bound input
KLdivergence = Σ_w p_inp(w) log2 ( p_inp(w) / p_coll(w) )
coll – all documents from all tasks of DUC 2001 to 2006
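A sketch of the KL feature over two word -> probability dicts; the collection model is assumed to be smoothed so every input word has nonzero probability:

```python
import math

def kl_divergence(p_inp, p_coll):
    """KL(input || collection) = sum_w p_inp(w) * log2(p_inp(w) / p_coll(w)).

    Both arguments are word -> probability dicts; p_coll is assumed to be
    smoothed so every input word has nonzero collection probability.
    Larger values mean the input is unlike the random document pool,
    i.e. tightly bound by its topic."""
    return sum(p * math.log2(p / p_coll[w]) for w, p in p_inp.items() if p > 0)
```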
28
Features – Log likelihood ratio based
More topic terms, similar topic terms ~ topic-oriented, easy input
Number of topic signature terms
Percentage of topic signatures in the vocabulary ~ controls for the length of the input
Pair-wise topic signature overlap (average, min, max) ~ similarity between the topic vectors of different documents
~ cosine overlap with reduced & specific vectors
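The signature overlap is cosine similarity restricted to the topic signature terms. In this sketch the signature set is assumed to be precomputed elsewhere (e.g. with a log-likelihood ratio test):

```python
import math

def signature_overlap(doc1, doc2, signatures):
    """Cosine overlap computed only over topic signature terms.

    doc1 and doc2 are word -> count dicts; `signatures` is a precomputed
    set of topic signature terms (obtained separately, e.g. via the
    log-likelihood ratio test). Restricting the vectors to signature
    terms gives the "reduced & specific" vectors of the slide."""
    v1 = {t: doc1.get(t, 0.0) for t in signatures}
    v2 = {t: doc2.get(t, 0.0) for t in signatures}
    dot = sum(v1[t] * v2[t] for t in signatures)
    n1 = math.sqrt(sum(x * x for x in v1.values()))
    n2 = math.sqrt(sum(x * x for x in v2.values()))
    return dot / (n1 * n2) if n1 and n2 else 0.0
```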
29
What makes some inputs easy?
Easy inputs have:
smaller vocabulary
smaller entropy
greater divergence from a random collection
higher % of topic signatures in the vocabulary
higher avg cosine and topic overlap
30
Input difficulty hypothesis for systems
Indicator of an input's difficulty: average system coverage score. Difficult if most systems select poor content
Defining the difficulty of inputs: 2 classes, above/below the "mean average system score"
> mean score – easy
< mean score – difficult
Equal classes
31
Classification Results
Baseline performance: 50%
Test set: DUC 2002-04
10-fold cross-validation on 192 observations
Precision and recall of difficult inputs:
Accuracy 69.27%
Precision 0.696
Recall 0.674
32
Summary Evaluation without Human Models
Current evaluation measures – recap:
Content coverage
Pyramid
Responsiveness
ROUGE
* My work with Ani
33
Need for cheap, fast measures
All current evaluations require human effortHuman summaries (content overlap, pyramid, rouge)Manual marking of summaries (responsiveness)
Human summaries are biasedseveral summaries for the same input are needed to
remove bias (Pyramid, ROUGE)
Can we come up with cheaper evaluation techniques that will produce the same rankings for systems as human evaluations ?
34
Compare with input – no human models
Estimate the closeness of a summary to the input
The closer a summary is to the input, the better its content should be
How do we verify this?
Design some features that reflect how close a summary is to the input
Rank summaries based on the value of each feature
Compare the obtained rankings to rankings given by humans
Similar rankings (high correlation) – you have succeeded
35
What features should we use?
We want to know how well a summary reflects the input's content.
Guesses?
36
Features - Divergence between input and summary
Smaller divergence ~ better summary
KL divergence input – summary
KL divergence summary – input
Jensen-Shannon divergence:
JS(Inp || Summ) = H( (Inp + Summ) / 2 ) - [ H(Inp) + H(Summ) ] / 2
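A sketch of Jensen-Shannon divergence, JS(Inp || Summ) = H((Inp + Summ)/2) - (H(Inp) + H(Summ))/2, over word -> probability dicts:

```python
import math

def jensen_shannon(p, q):
    """JS(P || Q) = H((P + Q)/2) - (H(P) + H(Q))/2, entropies in bits.

    p and q are word -> probability dicts; words missing from one
    distribution are treated as probability 0. Smaller divergence
    between input and summary ~ better summary."""
    def entropy(dist):
        return -sum(v * math.log2(v) for v in dist.values() if v > 0)
    words = set(p) | set(q)
    mid = {w: (p.get(w, 0.0) + q.get(w, 0.0)) / 2 for w in words}
    return entropy(mid) - (entropy(p) + entropy(q)) / 2
```

Unlike KL, JS is symmetric and bounded between 0 (identical distributions) and 1 bit (disjoint vocabularies).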
37
Features – Use of topic words from the input
More topic words ~ better summary
% of summary composed of topic words
% of input's topic words carried over to the summary
38
Features – Similarity between input and summary
More similar to the input ~ better summary
Cosine similarity input – summary words
Cosine similarity input’s topic signatures – summary words
39
Features - Summary Probability
Higher likelihood of summary given input ~ better summary
Unigram summary probability:
p_Inp(w1)^n1 * p_Inp(w2)^n2 * ... * p_Inp(wr)^nr
Multinomial summary probability:
[ N! / (n1! n2! ... nr!) ] * p_Inp(w1)^n1 * p_Inp(w2)^n2 * ... * p_Inp(wr)^nr
where
r – size of the summary vocabulary
ni – count of word wi in the summary
n1 + n2 + ... + nr = N, the summary size
p_Inp(wi) – probability of wi under the input's language model
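A sketch of both probabilities, computed in log space to avoid underflow for realistic summary lengths; p_inp is a word -> probability dict for the input's unigram model:

```python
import math
from collections import Counter

def log_unigram_prob(summary_words, p_inp):
    """log2 of p_inp(w1)^n1 * ... * p_inp(wr)^nr, where ni is the count
    of word wi in the summary and p_inp the input's unigram model."""
    counts = Counter(summary_words)
    return sum(n * math.log2(p_inp[w]) for w, n in counts.items())

def log_multinomial_prob(summary_words, p_inp):
    """Adds the multinomial coefficient N! / (n1! ... nr!), counting all
    orderings of the summary's bag of words (N = summary size)."""
    counts = Counter(summary_words)
    log_coef = math.log2(math.factorial(len(summary_words))) - sum(
        math.log2(math.factorial(n)) for n in counts.values())
    return log_coef + log_unigram_prob(summary_words, p_inp)
```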
40
Analysis of features
The value of the feature is used as the score for the summary
Average the feature values for a particular system over all inputs
Compare to the average human score
Spearman (rank) correlation
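The comparison step can be sketched with the no-ties Spearman formula, rho = 1 - 6 * sum(d_i^2) / (n * (n^2 - 1)), applied to one score per system:

```python
def spearman(xs, ys):
    """Spearman rank correlation between two score lists (no-ties form):
    rho = 1 - 6 * sum(d_i^2) / (n * (n^2 - 1)),
    where d_i is the difference between the ranks of xs[i] and ys[i]."""
    def ranks(vals):
        order = sorted(range(len(vals)), key=lambda i: vals[i])
        r = [0] * len(vals)
        for rank, i in enumerate(order):
            r[i] = rank
        return r
    rx, ry = ranks(xs), ranks(ys)
    n = len(xs)
    d2 = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))
```

Here xs would be the per-system feature averages and ys the per-system human scores; values near +1 or -1 mean the feature ranks systems the way the human evaluation does.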
41
Results
Feature  Pyramid  Responsiveness
JSD  -0.8803  -0.7364
% input's topic words in summary  -0.8741  -0.8249
KL div summary-input  -0.7629  -0.6941
Cosine overlap  0.7117  0.6469
% of summary that is topic words  0.7115  0.6015
KL div input-summary  -0.6875  -0.5850
Unigram summary prob.  -0.1879  -0.1006
Multinomial summary prob.  0.2224  0.2353
TAC 2008 Query focused summarization
48 inputs, 57 systems
42
Evaluation without human models
Comparison with the input correlates well with human judgements
Cheap, fast, unbiased
No human effort needed
43
Other summarization tasks of interest
Update summaries
The user has read a set of documents A
Produce a summary of updates from a set B of documents published later in time
Query focused
A topic statement is given to focus content selection
44
Other summarization tasks of interest
Blog/opinion summarization
Mine opinions, good/bad product reviews etc.
Meeting/speech summarization
How would you summarize a brainstorming session?
45
What you have learnt today..
How simple features you already know can be put to use for interesting applications
Beyond a simple sentence-extractor engine, customizing for inputs/user/task-setting is important
There are a lot of interesting tasks in summarization and language processing