SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS:...

47
Cross-Level Semantic Similarity MultiJEDI ERC 259234 SemEval 2014 Task-3

Transcript of SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS:...

Page 1: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Cross-Level Semantic Similarity

MultiJEDI ERC 259234

SemEval 2014 Task-3

Page 2: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Semantic Similarity

Page 3: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Semantic Similarity

Mostly focused on similar types of lexical items

Page 4: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Semantic Similarity

What if we have different types of inputs?

Page 5: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

CLSS: Cross-Level Semantic Similarity A new type of similarity task

Page 6: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

CLSS: Cross-Level Semantic Similarity

A new type of similarity task

Page 7: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

CLSS: Comparison Types

Paragraph to Sentence

Page 8: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

CLSS: Comparison Types

Sentence to Phrase

Paragraph to Sentence

Page 9: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

CLSS: Comparison Types

Sentence to Phrase

Paragraph to Sentence

Phrase to Word

Page 10: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

CLSS: Comparison Types

Sentence to Phrase

Paragraph to Sentence

Word to Sense

Phrase to Word

Page 11: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Task Data

Training set Test set

4000 pairs in total

Page 12: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Task Data

A wide range of domains and text styles

Page 13: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

word-to-sense pairs Word to Sense

Page 14: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

word-to-sense pairs Word to Sense

Page 15: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

word-to-sense pairs Word to Sense

Page 16: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

word-to-sense pairs Word to Sense

Page 17: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Rating Scale

Page 18: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

Page 19: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

larger side

Page 20: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

larger side

Page 21: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

2

4

0

1

3

larger side

Page 22: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

2

4

0

1

3

larger side

Page 23: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

2

4

0

1

3

larger side smaller side

Page 24: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

2

4

0

1

3

larger side smaller side

Page 25: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

2

4

0

1

3

Page 26: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

2

4

0

1

3

Page 27: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Crafting an idealized similarity distribution

2

4

0

1

3

Page 28: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Test and Training data IAA

Training (all) Training (unadjudicated) Test (all) Test (unadjudicated)

Kri

ppendorf

f’s α

Paragraph-Sentence Sentence-Phrase Phrase-Word Word-Sense

Page 29: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

The annotation procedure produces a

balanced rating distribution

Page 30: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Experimental Setup

The quick brown fox

The brown fox was quick

The quick brown fox

The brown foxes were quick

Baslines:

Page 31: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Experimental Setup

The quick brown fox

The brown fox was quick

The quick brown fox

The brown foxes were quick

Baslines:

Evaluation Measure:

Page 32: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Number of participants

Paragraph-Sentence Sentence-Phrase Phrase-Word Word-Sense

Page 33: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

0 1 2 3 4

Meerkat Mafia pw*

SimCompass run1

ECNU run1

UNAL-NLP run2

SemantiKLUE run1

GST Baseline

LCS Baseline

Gold

paragraph-sentence sentence-phrase phrase-word word-sense

Top 5 Systems and Baselines

Page 34: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

0 1 2 3 4

Meerkat Mafia pw*

SimCompass run1

ECNU run1

UNAL-NLP run2

SemantiKLUE run1

GST Baseline

LCS Baseline

Gold

paragraph-sentence sentence-phrase phrase-word word-sense

Top 5 Systems and Baselines

Page 35: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

0 0.75 1.5 2.25 3

Meerkat Mafia pw*

SimCompass run1

ECNU run1

UNAL-NLP run2

SemantiKLUE run1

GST Baseline

LCS Baseline

paragraph-sentence sentence-phrase phrase-word word-sense

Where do the baselines stand?

Page 36: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

0 0.75 1.5 2.25 3

Meerkat Mafia pw*

SimCompass run1

ECNU run1

UNAL-NLP run2

SemantiKLUE run1

GST Baseline

LCS Baseline

paragraph-sentence sentence-phrase phrase-word word-sense

Where do the baselines stand?

Page 37: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

0 0.75 1.5 2.25 3

Meerkat Mafia pw*

SimCompass run1

ECNU run1

UNAL-NLP run2

SemantiKLUE run1

GST Baseline

LCS Baseline

paragraph-sentence sentence-phrase phrase-word word-sense

Where do the baselines stand?

Page 38: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Correlation per genre paragraph-to-sentence

Page 39: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Correlation per genre paragraph-to-sentence

Page 40: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Correlation per genre paragraph-to-sentence

Page 41: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Correlation per genre phrase-to-word

Page 42: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Correlation per genre phrase-to-word

Page 43: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

What makes the task difficult?

Page 44: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Handling OOV words

and novel usages

Page 45: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Dealing with social media text

Page 46: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

CLSS: Cross-Level Semantic Similarity

Similarity of different types of lexical items

High-quality dataset: 4000 pairs for four comparison types

38 systems from 19 teams

Page 47: SemEval 2014 Task-3 Cross-Level Semantic Similarity · Dealing with social media text . CLSS: Cross-Level Semantic Similarity Similarity of different types of lexical items High-quality

Thank you!

David Jurgens

Mohammad Taher Pilehvar

Roberto Navigli

MultiJEDI ERC 259234