Flexibility counts more than precision

10
David McDonald BBN Technologies [email protected] http://alum.mit.edu/www/ davidmcdonald/ Flexibility Flexibility counts more than precision counts more than precision

description

Flexibility counts more than precision. David McDonald BBN Technologies [email protected] http://alum.mit.edu/www/davidmcdonald/. $$$. Any serious effort depends on funding (e.g. RAGS) Participating in an unfunded shared task isn’t reasonable (for me) - PowerPoint PPT Presentation

Transcript of Flexibility counts more than precision

Page 1: Flexibility  counts more than precision

David McDonaldBBN Technologies

[email protected]://alum.mit.edu/www/davidmcdonald/

Flexibility Flexibility counts more than precisioncounts more than precision

Page 2: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 2

$$$$$$

• Any serious effort depends on funding (e.g. RAGS)

• Participating in an unfunded shared task isn’t reasonable (for me)

• Contributing resources and discussions on a Wiki is fine– Mumble-86: TAG-based surface realizer

• But BBN is a law firm:– We have to bill hours

Page 3: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 3

The price of entryThe price of entry

• A generator’s output should be as good as a person’s– Always grammatical– Fits the discourse context.– Fluent and cohesive

• Pronouns, reduced NPs, conjunction reduction, rich repertoire of clause combinations, etc.

• Real sources – Doing real work for real (commercial) systems

Page 4: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 4

Variation is essentialVariation is essential

• Even read text will vary (bedtime stories)

• People rarely phrase their content the same way every time.

• A generator that didn’t vary its output would be unnatural.– Synthetic characters for game-based training

(NPCs) have to be realistic

Page 5: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 5

These have the same “content”These have the same “content”

• “Apple profits surge on iPod sales”» Clickable title next to graphic on first page

• “Apple reports a 78% jump in quarterly profits thanks to strong Christmas sales of its iPod digital music player”

» Summary below the graphic

• “Apple has reported a 78% surge in profits for the three months to 30 December, boosted by strong Christmas sales of its iPod digital music player”

» Same content in the full article

BBC web page: 1/17/07

Page 6: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 6

Shared data: real texts & contextsShared data: real texts & contexts

• Start with actual texts that are judged to have the same ‘content’– Varying in purpose, level of detail, randomly, …

• Parse the texts back far enough that the same source could produce all the variations

• Every player can use their favorite representation

• It’s a natural task in its own right

Page 7: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 7

““Killer app”Killer app”

• Regenerating content in different styles for different purposes– Initiatives in ‘deep’ NLU: “Machine Reading”,

“Learning by Reading”

• E.g. Rough-cut intelligence analyst’s reports– Read the raw texts of hundreds of news sources,

military movement messages, field reports, … – Provide tailored summaries (hypertexts)– Gets a handle on the intelligence overload

Page 8: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 8

Questions?

Comments?

[email protected]

Page 9: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 9

backupbackup

Page 10: Flexibility  counts more than precision

April 20th, 2007 ST&CE NLG workshop 10

Flexible natural language generation Flexible natural language generation using linguistic templatesusing linguistic templates

• Provide easy authoring of fluent prose while keeping simplicity of variable substitution seen in XSLT string templates.

• Explicit linguistic form enables optimizations, provides structure needed for synthetic speech.

• Automatically build the templates by parsing ex. with reversible grammar.

• Schematic variation given the context.

“X was in the room” Parser

x=past/be/in-room(x)

R

R (Alice) & R (Bob) NLG

Alice and Bob were in the room.Alice was in the room and so was Bob.Alice was in the room and Bob was there too.Alice and Bob were both there.Where were Alice and Bob? In the room.