1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output...

75
What is NLG? Input • Formal representation of some information (linguistic or non-linguistic) Output • Single sentences or texts (reports, explanations, instructions, etc.) Resources drawn upon • Context of situation • World and domain knowledge • Domain communication knowledge • Linguistic knowledge
  • date post

    20-Jan-2016
  • Category

    Documents

  • view

    215
  • download

    0

Transcript of 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output...

Page 1: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

1

What is NLG?

Input• Formal representation of some information (linguistic or

non-linguistic)

Output• Single sentences or texts (reports, explanations,

instructions, etc.)

Resources drawn upon• Context of situation• World and domain knowledge• Domain communication knowledge• Linguistic knowledge

Page 2: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

2

NLG in Text Summarization

On Monday, GreenChip Solutions made an acquisition offer to BuyOut Inc., a St. Louis-based plastic tree manufacturer that had tremendous success in equipping American households with pink plastic oak trees.

GreenChip offered to acquire the plastic tree

manufacturer BuyOut.

( ... ( ... )

( ... )) Analysis

Generation

Page 3: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

3

NLG in Machine Translation

SL-TextSL-Text TL-TextTL-Text

Analysis Generation

Transfer

Interlingua

Page 4: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

4

NLG in Dialogue Systems

When does the train leave?

Analysis

Dialoguemanager

Generation“At eleven p.m.,from platform four”

Speech synth.

Speech recog.

Page 5: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

5

NLG as a self-contained task

Data/knowledge Base

(...

(...)

(....))Texts

Intermed. Repr. 1 Intermed. Repr. 2

Page 6: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

6

NLG Tasks

• Content Determination

• Document Structure Planning

• Sentence (Micro) Planning– Lexicalization– Referring Expression Determination– Aggregation– Syntactic Structure Determination

• Surface Realization

Page 7: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

7

Content Determination Strategies

• Data driven strategy

• Document structure driven strategies

– Text plan (schema) driven strategy

– Discourse relation driven strategy

• Combined (data and structure driven) strategy

Page 8: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

8

Data Driven Content Determination

Based on:• Formal representation of data

• Context-dependent and domain-specific content selection rules

Strategy:

• Determine what data to communicate in the text according to messages or selection rules

Page 9: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

9 Data Driven Content Determination (an Example, Input data)

<station> Stuttgart-Mitte </station>

<substance> ozone </substance><mseries>

<meas>

<time> 06:00 </time>

<value> 20 </value>

</meas>

<meas>

<time> 06:30 </time>

<value> 33 </value>

</mseries>

Page 10: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

10 Data Driven Content Determination (an Example, Selection Rules)

IF (hourly average value of substance X > 25)

THEN select substance X for realization

IF (value at x:30) > 1.1(value at x:00)

AND(value at x:00 > 100)

THEN select value at x:00 for realizationAND

select value at x:30 for realization

ELSE select average value of x:00 and x:30

Page 11: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

11 Document structure driven content determination

Basic idea:

When generating a text we must ensure that information elements in the text are related to each other so as to

achieve that the text is coherent.

So why not select the content according to a coherent plan or following rhetorical relations that must be recognizable between data elements?

Two common approaches:

1. Text plan (schema) driven strategy

2. Discourse relation driven strategy

Page 12: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

12

Schemata

Introduced by K. McKeown (1985)(also known as Generic Structure Potential)

Observations:

• Specific text types often reveal typical structures

• A structure gives rise to a recursive document plan, a schema, which

consists of less complex subschemata or elementary elements.

• A schema ensures the coherence of the text that is built according to it

• Schemata can be compiled in terms of text grammars

Page 13: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

13

Schema Example

(NextDayGlobalWeatherForecastSchema:

CloudInfo

PrecipitationInfo

CurrentPrecipitation

PrecipitationProgression

WindInfo

TemperatureInfo

EarlyTemperature

HighestTemperature)

Page 14: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

14

Application of Schemata

SubstanceConcentrationSchema:

(CurrentConcentration <cc-sruleset>

ReferenceEvaluation

RefConcentration <rc-sruleset>

CurrentRefCompare <crc-sruleset>

ConcentrIntervalAssociation <cia-sruleset>

CompThreshold <ct-sruleset>

LegalInfo <li-sruleset1>

RegionEvaluationLowestConcentr <lc-sruleset1>HighestConcentr <hc-sruleset1>)

Page 15: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

15 Text Structure Driven Content Determination (Example, sel.rules)

<cc-sruleset>

((measure –time-> timepoint);(timepoint –hour-> ?h;?h := (get NOW INPUT))

(measure –substance-> ?s;?s := (get `substance INPUT))

(measure –value-> ?v;?v := (get `value INPUT))

(measure –location-> ?l;?l := (get `location INPUT))

...)

Page 16: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

16 Evaluation of Schema Based Content Determination

Pros:• Relatively easy to establish for a well-restricted domain• The selected information elements form a coherent text

plan which adequately reflects the structure of the texts of the domain

• Computationally efficient

Cons:• Domain-specific• Hardly allow for variation

Page 17: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

17 Discourse Relation Driven Content Determination

Based on:

• Formal representation of underlying information elements

• Discourse relations between information elements

• Rules for navigation along the discourse relations

• Heuristics for relation sequences for a given text type

Strategy:

• Collect the data or information elements travelling along the

discourse relations and using the heuristics

Page 18: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

18

Rhetorical Structure Theory

Introduced by W. Mann & S. Thompson (1987)

Observations:

• Between text elements (sentences, paragraphs, ...) „rhetorical

relations“ hold.

• Rhetorical relations (besides other elements) make the text

coherent.

• Rhetorical relations can be classified with respect to their

function.

• For a specific domain, a sequence of rhetorical relations in a

text can be precompiled.

Page 19: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

19

RST-relation example (1)

1. Heavy rain and thunderstorms in North Spain and on the Balearic Islands.

2. In other parts of Spain, still hot, dry weather with temperatures up to 35 degrees Celcius.

CONTRAST

Symmetric (multiple nuclei) Relation:

Page 20: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

20

RST-relation example (2)

2. In Cadiz, the thermometer might rise as high as 40 degrees.

1. In other parts of Spain, still hot, dry weather with temperatures up to 35 degrees Celcius.

ELABORATION

Asymmetric (nucleus-satellite) Relation:

Page 21: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

21

RST-based Content Determination

• Motto: choosing what to say and deciding how to structure

it cannot be divided

• Text planner by Moore and Paris 1993: Map

communicative goals via linguistic goals to language

• Each time alternative strategies for a (sub-) goal are

considered, new content can be selected

• Example: When the goal is to convince the reader of proposition P,

and the system reckons the reader is unlikely to believe P, check the

knowledge base for evidence supporting P, and verbalize it

Page 22: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

22

RST-based Content Determination (2)

• Model of mental states and communicative goals, e.g.:

– (know ?agent (ref ?description))

– (bel ?agent (?predicate ?e1 ?e2))

• Example: plan operator for MOTIVATION from Moore/Paris:

– EFFECT: (MOTIVATION ?act ?goal)

– CONSTRAINTS: (AND (STEP ?act ?goal)

– (GOAL ?hearer ?goal))

– NUCLEUS: (BEL ?hearer (STEP ?act ?goal))

– SATELLITES: NIL

• Moore/Paris text planner works by top-down hierarchical

expansion; alternative: bottom-up planning, e.g. (Marcu 1997)

Page 23: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

23 Evaluation of RST-based Content Determination

Pros:• The selected information elements form a coherent text plan

• Flexible production of text plans of variable size

• Allows for explicit reasoning about the reader‘s beliefs

Cons:• Usually, an information element in the data/knowledge base is

involved in discourse relations between several other information elements: constraints for selecting one path must be available

• Formalizing (all) RST relations is difficult

• Needs a sophisticated planning mechanism

• Computationally expensive

Page 24: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

24

Sentence (Micro) Planning

Goal:

To map a text plan into a sequence of sentence or phrase plans (with lexical items already determined)

Tasks:• Lexicalization• Referring Expression Determination• Aggregation• Syntactic Structure Determination

Page 25: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

25

Lexicalization (1)

Lexicalization is the process of mapping semantic entities onto lexical items.

• Aspects of lexicalization:

- Single open-class words (nouns, verbs, adjectives, adverbs)

- Function words (prepositions) that belong to the subcategorization frames of open-class words

- Discourse markers- Idiosyncratic word combinations (collocations)

Page 26: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

26

Lexicalization (2)

Lexicalization is guided by:

• Semantics of the entities to be mapped

• Communicative (textual) constraints of the domain

and previous discourse

• Pragmatics

– Basic-level preferences

– Argumentative intent

– User model: expertise, vocabulary

Page 27: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

27

Lexicalization (3): Stylistic Features

• Formality: motion picture - movie - flick

• Euphemism: washroom; ethnic cleansing

• Slant: gentleman - man - jerk

• Archaic: apothecary; albeit

• Floridity: house - habitation

• Abstractness: unemployed - out of work

• Force: big - monstrous

Page 28: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

28

Lexicalization Variations, Examples

1. The temperature dropped from 30 on Tuesday to 15 degrees C on Wednesday.

vs.

With 23 degrees C, the temperature on Wednesday was lower than on Tuesday. On Tuesday 30 degrees were measured.

vs.

On Tuesday, the thermometer read 30 degrees C. On Wednesday, it was much cooler.

Page 29: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

29

Lexicalization Strategies

• Constrain source entities until only one lexical option is

available

• Match parts of the source structure with parts of lexical

items

• If source items are indexed or labeled with lexical items:

choose one according to constraints that are either explicitly

available or are derived from the context

Page 30: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

30 Lexicalization Strategies (Constraining source entities)

(eat

:agent Freddy)IF :agent IS `human´

THEN essen

ELSE fressen

(cause

:causer Freddy

:causee Freddy

:caused: die)

IF :agent = :patient

AND

:agent IS `human´

THEN „commit suicide“

ELSE kill

Page 31: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

31 Lexicalization Strategies (Matching parts)

Example from MOOSE (Stede 1999):

pour1

tom1

coolant1

path1 radiator1

“into”

Tom poured coolant into the radiator. Tom schüttete Kühlmittel in den Kühler.

CAUSER

OBJECT

PATH DESTINATION

DIRECTION

Page 32: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

32 Lexicalization Strategies (Equating source and lexical entities)

Animal

Water Animal

Fish

Shark

Mammal

Cetacean

Dangerous Fish

Sand Shark

Dolphin

Tiger Shark

FN (Reiter)

Page 33: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

33 Lexicalization Strategies (Indexing)

`lecture´ => LECTURE

• Information available in the lexicon:

TALK, PRESENTATION, ...

[to] lecture

give [ART ~]

deliver [ART ~]

attend [ART ~]

follow [ART ~]

prepare [ART ~]

...

• Also (possibly) available: Paraphrasing rules

Page 34: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

34

Aggregation

Aggregation is the process of entity grouping at various levels of processing with the goal to avoid redundancy.

Types of aggregation:

• Conceptual aggregation

• Lexical aggregation

• Syntactic aggregation

Page 35: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

35

Aggregation (Some Examples, 1)

Conceptual aggregation:1. Heavy rain is expected in Zuffenhausen.

2. Heavy rain is expected in Cannstatt3. Heavy rain is expected in Vaihingen

1.-3. Heavy rain is expected in Metropolitan Stuttgart.

Lexical aggregation:1. From 9 am to 11 am the ozone concentration fell.2. Then the ozone concentration rose.3. Then the ozone concentration fell.4. Then the ozone concentration rose

1.-4. From 9 pm on the ozone concentration varied.

Page 36: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

36

Aggregation (Some Examples, 2)

Syntactic aggregation:

• Referential aggregation

1. The employment rate among women fell.

2. The employment rate among men rose.

1.+2. The employment rate among women fell while that among men rose.

• Elision

1. The employment rate among women rose.

2. The employment rate among men rose.

1.+2. The employment rate rose.

Page 37: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

37

Aggregation, Rule Examples

(x / process

:agent ?A

...)

AND(x / process

:agent ?B

...)

(x / process

:agent (c /conj:arg (?A ?

B))...)

Page 38: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

38

Choice of Referring Expressions

The process of determining how to identify entities known from the extralinguistic context and entities introduced in the previous discourse.

Types of referring expressions:

• Noun Definiteness/Deixis

• Pronominalization

• Elision

• Direct lexical references

• Indirect lexical references

Page 39: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

39

Referring Expressions Examples

1. John saw a small boy. The boy was crying.

2. John saw a small boy. He was crying ...3. John saw a small boy. The poor kid was crying

4. The comments are not restricted to classic AI, but are appropriately applied to theoretical linguistics as well.

5. Today‘s lecture is on Agent Technology. The lecturer is a visiting professor from the UCLA.

Page 40: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

40

Referring Expressions , Rule Example

IF (X is denotation of a transformation

ANDProp.focus mentioned in last sentence

ANDResultative Noun (RN) available for X)

THEN IF (RN unique)THEN CHOOSE RN

ELSE ...Put the batter into the oven. Remove the cake in two hours.

Page 41: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

41

Syntacticization (1)

Syntacticization is the process of choosing the most appropriate syntactic construction for a message.

Options to be chosen from:

• Sequence of sentences vs. Coordination vs. Subordination:

The Black Forest station is located in the woods. At this station, an ozone concentration of 259 g/m3 has been measured.

vs.

At the Black Forest station, which is located in the woods, an ozone concentration of 259 g/m3 has been measured.

Page 42: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

42

Syntacticization (2)

• Sentence vs. Nominal Phrase:

Tomorrow, it is cloudy with sunny periods and patchy drizzle ending in the afternoon.

vs.

Tomorrow, clouds with sunny periods and patchy drizzle till the afternoon.

Page 43: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

43

Interdependency in Microplanning

Problematic:

• Nearly all microplanning tasks are intertwined with each other, i.e., the realization of one depends on the realization of the other and vice versa.

• Theoretically still unclear which phenomenon belongs to which task.

• Theoretically still not entirely clear whether to treat microplanning as a set of different tasks.

Page 44: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

44

Interdependency in Microplanning

1. Today‘s lecture is on Agent Technology. The lecturer is a visiting professor from the UCLA.

2. The topic of today‘s lecture is Agent Technology. It is given by a visiting professor from the UCLA.

3. A visiting professor from the UCLA gives today a lecture on Agent Technology.

4. Today‘s lecture, which is on Agent Technology, is given by a visiting professor from the UCLA.

Page 45: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

45

Surface realization (1)

Goal:

To realize a sentence/phrase plan as a sentence/phrase at the surface

Tasks:

• Syntactic realization

• Morphologization

Page 46: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

46

Surface Realization Input (1)

(c / creative-material-action:tense past:lex construct-past:passivization-q passivization:actee (h / object

:lex house:multiplicity-q unitary:singularity-q singular:identifiability-q identifiable)

:relations (i / in:range (l / two-d-location

:lex forest:mult…-q unitary:singularity-q singular:ident…-q identifiable)))

Page 47: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

47

Surface Realization Input (2)

(construct :tense past:voice passive-subjectival-> (house

:number singular:article def)

-prep.objectival-> (in-objectival-> (forest

:number singular :article def)))

Page 48: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

48

Modularization of generation tasks

Content Selection + Text Structuring: Text Planner

Microplanning: Sentence Planner

+ Grammar

Surface Realization: Grammar

OR

Content Selection + Text Structuring: Text Planner

Lexicalization: Lex. Chooser

Syntacticization + Surface Real.: Grammar

Page 49: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

49

Modularization of generation tasks (cont.)

OR

Content Selection + Text Structuring: (Text) Planner

Lexicalization: (Text) PlannerSyntacticization: (Text) Planner

Surface Real.: Grammar

OR

All tasks dealt with in one module

Page 50: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

50

Architecture Issues

Main Types of NLG-System architectures:

• Pipeline Architecture

• Iterative Architecture

• (Quasi or partially) Parallel Architectures

- Communication of separate modules via a common information

space (e.g. blackboard)

- Incremental providing of information by individual modules or of

the input (interleaved architecture)

- No separate modules (integrated architecture)

Page 51: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

51

A Standard Pipeline Architecture

Text planner

Sentence planner

Grammar

Page 52: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

52

A Possible Iterative Architecture

Formulator

Grammar

Textplanner

Page 53: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

53

A Possible Blackboard Architecture

AdministratorKnowl. Sources

Blackboard(s)

Discourse Str.

Content Deter.

Lexicalization

...

Page 54: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

54

Level of Abstraction of Input Structures

The level of abstraction of the input structures is generator-specific and varies extremely from case to case. An input structure may be:

• an export from a data base

• an excerpt from a KL-ONE-like knowledge base

• a Conceptual Graph structure

• a semantic structure

• a syntactic structure

Page 55: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

55

Example of a Concrete Input Structure

(c / creative-material-action:tense past:lex construct-past:passivization-q passivization:actee (h / object

:lex house:multiplicity-q unitary:singularity-q singular:identifiability-q identifiable)

:relations (i / in:range (l / two-d-location

:lex forest:mult…-q unitary:singularity-q singular:ident…-q identifiable)))

Page 56: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

56

Example of an Abstract Input Structure

<station> Stuttgart-Mitte </station>

<substance> ozone </substance>

<mseries>

<meas>

<time> 06:00 </time>

<value> 20 </value>

</meas>

<meas>

<time> 06:30 </time>

<value> 33 </value>

</mseries>

Page 57: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

57

Generation Techniques

Depending on the scale of variation and complexity of texts required, several generation techniques are available:

• Canned text

• Templates

• Full fledged generation

• Combination of the above techniques

Page 58: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

58

Templates, Examples

1. Cloudy with sunny periods in <location>. The temperature is expected to rise to <number> degrees C.

2. In <location>, the ozone concentration reached <number> µg/m3.

3. <user-name> was logged in for <duration> hours.

4. The unemployment rate among men for the month of <month> <decreased/increased/remained stable>.

Page 59: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

59

Full fledged generation

...

subst. time value

SO2

NO2

19.11.99:18:00

... ... ...

9

7819.11.99:18:00

dimen.

µg/m3

...

station

Berlin

(measure

station: Berlin,

substance:SO2,

time: 19.11.99:18:00,

value:200,

dimension: µg/m3)

On 19.11. at 6pm the SO2 concentration reached 200 µg/m3 in Berlin.

Page 60: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

60

Technique of text production

• A paragraph or sentence never changes – its appearance being triggered by specific input data or being obligatory.

Canned Text is appropriate • Only a few variations of sentence and/or phrase structures

are available to communicate a specific information; within a sentence/phrase structure a few arguments may change.

Templates are appropriate• The information to be communicated may vary and the

sentence structures that express depend on the discourse structure progression

Full fledged generation is appropriate

Page 61: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

61

Page 62: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

62

Page 63: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

63

Multimodal Generation

Text presentations and graphical presentations have differing strengths and weaknesses. Their combination can achieve powerful synergies.

However, simply placing textual and graphical information together is no guarantee that one view is supportive of another.If the perspective on the data taken in a graphic and that taken in a text have no relation, then the result is incoherence rather than synergy.

Page 64: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

64

Multimodal Generation: Coherence

Multimodal generation is a goal-directed activity, i.e.,

when generating a multimodal document

• the author pursues certain comm. goals

Intentional Structure of the document

• the author chooses an organization of the information that supports its comm. goals

Discourse Structure of the document

Page 65: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

65

Intentional Structure of a Document

The Intentional Structure of a document is a hierarchy of Acts that ensure that the goal(s) is/are achieved

1. At each level of the hierarchy, at least one main act must be specified

2. A main act may be supported by subsidiary acts

3. The system must keep track of the beliefs it has and the facts it knows about

Page 66: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

66

Discourse Structure of a Document

The Discourse Structure of a document is a hierarchy of coherence relations – as, e.g., specified in RST.

Examples of RST-Relations:

ContrastElaborationMotivationEnablementBackground

Page 67: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

67

RST-relation example (1)

1. Heavy rain and thunderstorms in North Spain and on the Balearic Islands.

2. In other parts of Spain, still hot, dry weather with temperatures up to 35 degrees Celcius.

CONTRAST

Symmetric (multiple nuclei) Relation:

Page 68: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

68

RST-relation example (2)

2. In Cadiz, the thermometer might rise as high as 40 degrees.

1. In other parts of Spain, still hot, dry weather with temperatures up to 35 degrees Celcius.

ELABORATION

Asymmetric (nucleus-satellite) Relation:

Page 69: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

69

Cohesive Links between Doc. Elements

multimodal referring expressions

vs.

crossmode referring expressions

A multimodal referring expression refers to a world object via a combination of at least two media. Each mode conveys some discriminating attributes of the object.

A crossmode referring expression refers to a document part in a different presentation mode.

Page 70: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

70

Planning the content and the structure

Communicative Structure + Discourse Structure

Textplanning in (monomodal) Text Generation

Planning mechanisms for multimodal documents can and should be derived from the text planning

mechanisms

!!!RST-like Text Planning!!!

Page 71: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

71

RST-based Content Determination (2)

• Model of mental states and communicative goals, e.g.:

– (know ?agent (ref ?description))

– (bel ?agent (?predicate ?e1 ?e2))

• Example: plan operator for MOTIVATION from Moore/Paris:

– EFFECT: (MOTIVATION ?act ?goal)

– CONSTRAINTS: (AND (STEP ?act ?goal)

– (GOAL ?hearer ?goal))

– NUCLEUS: (BEL ?hearer (STEP ?act ?goal))

– SATELLITES: NIL

• Moore/Paris text planner works by top-down hierarchical

expansion; alternative: bottom-up planning, e.g. (Marcu 1997)

Page 72: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

72

WIP Planning Strategies

Introduce an object by showing a picture of it:

Header: (Introduce System User ?object Graphics)

Effect: (BMB System user (Isa ?object ?concept)

Applicability Conditions:

(Bel System (Isa ?object ?concept)

Main Acts:

(S-Depict System User ?object ?pic-obj ?picture)

Subsidiary Acts:

(Label System User ?object ?medium)

(Provide-Background System User ?object ?pic-obj ?picture Gr..)

Page 73: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

73

WIP Planning Strategies

Provide Background:

Header: (Provide-Background System User ?x ?px ?picture Graphics)

Effect: (BMB System user (Encodes ?px ?x ?picture)

Applicability Conditions:

(And (Bel System (Encodes ?px ?x ?picture))

(Bel System (Perceptually-Access-p User ?x))

(Bel System (Part-of ?x ?z))

Main Acts:

(S-Depict System User ?z ?pz ?picture)

Subsidiary Acts:

(Achieve System (BMB System User (Encodes ?pz ?z ?picture

?medium)

Page 74: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

74

WIP Planning Strategies

Establish a coreferential link:

Header: (Establish-coref System User ?r1 ?r2 Graphics)

Effect: (BMB System user (Coref ?r1 ?r2)

Applicability Conditions:

(And (BMB System User (Encodes ?spec1 ?r1))

(BMB System User (Text-Obj ?spec1 ?r1))

(BMB System User (Encodes ?spec2 ?r2))

(BMB System User (Pic-Obj ?spec2 ?r2)))

Main Acts:

(S-Annotate System User ?spec1 ?spec2 ?picture)

Page 75: 1 What is NLG? Input Formal representation of some information (linguistic or non-linguistic) Output Single sentences or texts (reports, explanations,

75

WIP Planner

1. The user posts the goal to be achieved.

2. The planner identifies the potentially applicable strategies by searching the strategy library for all strategies whose effect field matches the goal.

3. For each strategy found, the conditions are checked.

4. Select one of the applicable strategies (e.g., depending on the preference given to a specific mode).

5. Place the strategy in the corresponding plan node

6. If the strategy has subsidiary act strategies, expand the first; otherwise go to the nearest non-expanded strategy