Minimally Supervised Event Causality Identification

41
Minimally Supervised Event Causality Identification Quang Do, Yee Seng, and Dan Roth University of Illinois at Urbana-Champaign 1 EMNLP-2011

description

Minimally Supervised Event Causality Identification. Quang Do, Yee Seng, and Dan Roth University of Illinois at Urbana-Champaign. EMNLP-2011. Event Causality. The police arrested him because he killed someone. Event Causality. The police arrested him because he killed someone. - PowerPoint PPT Presentation

Transcript of Minimally Supervised Event Causality Identification

Page 1: Minimally Supervised  Event Causality Identification

Minimally Supervised Event Causality Identification

Quang Do, Yee Seng, and Dan Roth

University of Illinois at Urbana-Champaign

1

EMNLP-2011

Page 2: Minimally Supervised  Event Causality Identification

2

The police arrested him because he killed someone.

Event Causality

Page 3: Minimally Supervised  Event Causality Identification

3

The police arrested him because he killed someone.

event trigger event trigger

Event Causality

Page 4: Minimally Supervised  Event Causality Identification

Event Causality

4

The police arrested him because he killed someone.

causality

event trigger event trigger

• We identify causality between event pairs, but not the direction

Page 5: Minimally Supervised  Event Causality Identification

Event Causality

5

The police arrested him because he killed someone.

calculate causality association: co-occurrence counts, pointwise mutual information (PMI)…

Page 6: Minimally Supervised  Event Causality Identification

Event Causality

6

The police arrested him because he killed someone.

contingency discourse relation

connective

Page 7: Minimally Supervised  Event Causality Identification

Event Causality

Identify multiple cues to jointly identify event causality: Distributional association scores discourse relation predictions

7

The police arrested him because he killed someone.

discourse relation prediction

distributional association scoreDistributional

Discourse

Page 8: Minimally Supervised  Event Causality Identification

Cause-Effect Association (CEA) andDiscourse Relations

We define an event e as: p(a1, a2, …, an):

8

association between event predicates

association between the predicate of an event and the arguments of the other event

association between event arguments

[ … … … ] connective [ … … … ]eeee

• A connective is associated with two text spans• Training on the Penn Discourse Treebank (PDTB), we developed a system

that predicts the discourse relations of expressed by the connectives

Distributional

Discourse

Page 9: Minimally Supervised  Event Causality Identification

Event Definition

We define an event e as: p(a1, a2, …, an): predicate p: the event trigger word

a1, a2, …, an: arguments associated with e

Examples: Verbs: “… he killed someone …” Nominals: “… the attack by the troops …”

9

Page 10: Minimally Supervised  Event Causality Identification

Contributions (Event Causality)

We identify causality between event pairs in context: verb-verb, verb-noun, noun-noun triggered event pairs (prior work usually focus on just verb triggers)

A minimally supervised approach to detect event causality using distributional similarity methods

Leverage the interactions between event causality prediction and discourse relations prediction

10

Page 11: Minimally Supervised  Event Causality Identification

Overview (Event Causality)

Event causality: Interaction between event causality and discourse relations Event predicates: verbs, nominals

Cause-Effect Association (CEA) Discourse and Causality:

Discourse relations Constraints for joint inference with CEA

Experiments: Settings Evaluation Analysis

Conclusion

11

Page 12: Minimally Supervised  Event Causality Identification

Overview (Event Causality)

Event causality: Interaction between event causality and discourse relations Event predicates: verbs, nominals

Cause-Effect Association (CEA) Discourse and Causality:

Discourse relations Constraints for joint inference

Experiments: Settings Evaluation Analysis

Conclusion

12

Page 13: Minimally Supervised  Event Causality Identification

Cause-Effect Association (CEA)

13

The police arrested him because he killed someone.

CEA: prediction of whether two events are causally related

Page 14: Minimally Supervised  Event Causality Identification

Cause-Effect Association (CEA)

We define an event e as: p(a1, a2, …, an): predicate p: the event trigger word (e.g.: arrested, killed)

a1, a2, …, an: arguments associated with e

14

association between event predicates

association between the predicate of an event and the arguments of the other event

association between event arguments

Page 15: Minimally Supervised  Event Causality Identification

Predicate-Predicate Association

15

Page 16: Minimally Supervised  Event Causality Identification

Predicate-Predicate Association

16

D: total number of documents in the collectionN: number of documents that p occurs in

Page 17: Minimally Supervised  Event Causality Identification

Predicate-Predicate Association

17

awards event pairs that are closer together in the texts (in terms of num# of sentences apart), while penalizing event pairs that are further apart

Page 18: Minimally Supervised  Event Causality Identification

Predicate-Predicate Association

18

takes into account whether predicates (events) pi and pj appear most frequently with each other

Page 19: Minimally Supervised  Event Causality Identification

Predicate-Predicate Association

19

takes into account whether predicates (events) pi and pj appear most frequently with each other

ui will be maximized if there is no other predicate pk (as compared to pj) having a higher co-occurrence probability with pi

Page 20: Minimally Supervised  Event Causality Identification

Predicate-Argument Association

20

Pair up the predicates and arguments across events, calculate the PMI for each link, then average them

Page 21: Minimally Supervised  Event Causality Identification

Argument-Argument Association

21

calculate the PMI for each possible pairings of the arguments (across the two events), then average them

Page 22: Minimally Supervised  Event Causality Identification

Cause-Effect Association (CEA)

22

The police arrested him because he killed someone.

CEA score: predicts whether the two events are causally related

Page 23: Minimally Supervised  Event Causality Identification

Overview (Event Causality)

Event causality: Interaction between event causality and discourse relations Event predicates: verbs, nominals

Cause-Effect Association (CEA) Discourse and Causality:

Discourse relations Constraints for joint inference with CEA

Experiments: Settings Evaluation Analysis

Conclusion

23

Page 24: Minimally Supervised  Event Causality Identification

Discourse and Causality Interaction

24

[ … … … ] connective [ … … … ]eeee

Interaction between:•Discourse relation evoked by the connective c•Relations between ep (event pairs that crosses the two text spans)

causal? not-causal?

Page 25: Minimally Supervised  Event Causality Identification

Penn Discourse Treebank (PDTB) Relations

Discourse relations: Comparison:

Concession, Contrast, Pragmatic-concession, Pragmatic-contrast Contingency:

Cause, Condition, Pragmatic-cause, Pragmatic-condition Expansion:

Alternative, Conjunction, Exception, Instantiation, List, Restatement Temporal:

Asynchronous, Synchronous

25

Page 26: Minimally Supervised  Event Causality Identification

Discourse Relations

Comparison: Highlights differences between the situations described in the text spans:

Negative evidence for causality

Contingency: The situation described in one text span causally influences the situation in the other:

Positive evidence

26

Contrast: [According to the survey, x% of Chinese Internet users prefer Google] whereas [y% prefer Baidu].

Cause: [The first priority is search and rescue] because [many people are trapped under the rubble].

Page 27: Minimally Supervised  Event Causality Identification

Discourse Relations

Expansion: Providing additional information, illustrating alternative situations, etc.:

Negative evidence, except for Conjunction (which connects arbitrary pieces of text spans)

Temporal:

Temporal precedence of the (cause) event over the (effect) event is a necessary, but not sufficient requisite for causality

27

Conjunction: [Over the past decade, x women were killed] and [y went missing].

Synchrony: [He was sitting at his home] when [the whole world started to shake].

Page 28: Minimally Supervised  Event Causality Identification

Discourse and Causality Interaction

28

[ … … … ] connective [ … … … ]eeee

Cause, Condition

ei ejAt least one (crossing) ep is causal

1

ei ejCause, Condition, Temporal, Asynchronous, Synchrony, Conjunction

If we have a (crossing) ep which is causal2

Comparison, Concession, Contrast, Pragmatic-concession, Pragmatic-contrast, Expansion, Alternative, Exception, Instantiation, List, Restatement

ei ejNo (crossing) ep is casual

3

Page 29: Minimally Supervised  Event Causality Identification

Joint Inference: Discourse & Distributional Causality

Objective function:

29

Cc Ldr EPep LererepepERdrccDR

DR ER

yersLxdrsLArg ,, )()(max

Probability that connective c is predicted with discourse relation dr

CEA prediction that event pair ep takes on the causal or not-causal label er

discourse relation indicator variable

event pair causality indicator variable

Page 30: Minimally Supervised  Event Causality Identification

Constraints

30

If the connective is predicted with a “Cause” discourse relation, then the CEA system should predict that at least one of the (crossing) event pair is causally related

"","","", causalepcausalepCausec jiyyx

Cause, Condition

ei ejAt least one (crossing) ep is causal

1

[ … … … ] connective [ … … … ]eeee

Page 31: Minimally Supervised  Event Causality Identification

Constraints

31

If a (crossing) event pair is predicted by CEA as causally related, then the associated connective should be predicted as having discourse relation; “Cause”, “Condition”, …, “Conjunction”

"","","","", nConjunctiocConditioncCauseccausalep xxxy

[ … … … ] connective [ … … … ]eeee

ei ejCause, Condition, Temporal, Asynchronous, Synchrony, Conjunction

If we have a (crossing) ep which is causal2

Page 32: Minimally Supervised  Event Causality Identification

Constraints

32

If the connective is predicted with discourse relation “Comparison”, “Concession”, …, “Restatement”; no (crossing) event pair is causally related

{“Comparison”,”Concession”…}

[ … … … ] connective [ … … … ]eeee

Comparison, Concession, Contrast, Pragmatic-concession, Pragmatic-contrast, Expansion, Alternative, Exception, Instantiation, List, Restatement

ei ejNo (crossing) ep is casual

3

Page 33: Minimally Supervised  Event Causality Identification

Overview (Event Causality)

Event causality: Interaction between event causality and discourse relations Event predicates: verbs, nominals

Cause-Effect Association (CEA) Discourse and Causality:

Discourse relations Constraints for joint inference

Experiments: Settings Evaluation Analysis

Conclusion

33

Page 34: Minimally Supervised  Event Causality Identification

Experimental Settings

To collect the distributional statistics for measuring CEA: 760K documents in the English Gigaword corpus

25 CNN documents from first three months of 2010: 20 documents for evaluation 5 documents for development

34

Page 35: Minimally Supervised  Event Causality Identification

Annotation for Causal Event Pairs

Annotation guidelines: The Cause event should temporally precede the Effect event;

the Effect event occurs because the Cause event occurs

35

Page 36: Minimally Supervised  Event Causality Identification

Annotation for Causal Event Pairs

36

Si-1

Si

Si+1

C (causality)

R (relatedness)

R (relatedness)

Drawing links between event predicates: Event arguments are not annotated, but annotators are free to look at

the entire document text Annotators are not restricted to a fixed sentence window size

Document

Page 37: Minimally Supervised  Event Causality Identification

Annotation for Causal Event Pairs

Annotators overlap on 10 evaluation documents. Agreement ratio: 0.67 for C+R 0.58 for C

37

# relations Eval Dev

C 414 71

C+R 492 92

Page 38: Minimally Supervised  Event Causality Identification

Performance on Extracting Causality

38

Page 39: Minimally Supervised  Event Causality Identification

Performance on Extracting Causality and Relatedness

39

Page 40: Minimally Supervised  Event Causality Identification

Analysis of CEA mistakes

50 (randomly selected) false-positives (precision errors): 56%: CEA assigns a high score to event pairs that are not

causal 22%: involves events containing pronouns (“he”, “it”, etc.) as

arguments

50 false-negatives (recall errors): 23%: CEA assigns a low score to causal event pairs 19%: involving nominal predicates that are not in our list of event

evoking noun types 17%: involving nominal predicates without any argument (less

information for CEA) 15%: involves events containing pronouns as arguments

40

Page 41: Minimally Supervised  Event Causality Identification

Conclusion (Event Causality)

Developed a minimally supervised approach to identify event causality

Use distributional scores and discourse relations to jointly identify event causality

41