A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model...

56
A Syntax-based Statistical Machine Translation Model Alexander Friedl, Georg Teichtmeister 4.12.2006

Transcript of A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model...

Page 1: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-basedStatistical MachineTranslation Model

Alexander Friedl, Georg Teichtmeister 4.12.2006

Page 2: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Introduction

� The model

� Experiment

� Conclusion

Page 3: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Statistical Translation Model (STM):

- mathematical model

- statistical modelling of human-languagetranslation

- parameters estimated with training corpus

Page 4: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� First steps

- IBM 1998: string-to-string word-basedtranslation

- completely independent process

Page 5: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Question: Why not word-based translation?

- no structural or synactic aspects

- how to handle different word order?

Page 6: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Solution:

A Syntax-based Statistical Translation Model

Page 7: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Input: parse tree(syntactic parser)

� Output: string

Page 8: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Three operations on each node of the tree:

- reorder

- inserting

- translating

Page 9: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Reorder

- different word order

- English vs. Japanese

Page 10: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Word-insertion

- e.g. capture linguistic differnces in specifyingsyntactic cases

Page 11: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Translation

- Translate leaf words into the destination language

Page 12: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Output

Page 13: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Introduction

� The model

� Experiment

� Conclusion

Page 14: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� English parse tree into a noisy channel

� Output should be a Japanese sentence

� Stochastical operations an each nodes

Page 15: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Reorder

- N! possiblereorderings for N children

- probability given bythe r-table

Page 16: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Word-insertion

- left, right oder nowhere

- probability given by the n-table

Page 17: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Translation

- dependent only on the word itself

- Probability given by the t-table

Page 18: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Total probability

- product of the single operation probabilities

� Tables

- English-Japanese training corpus

Page 19: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Formal Transcription

Input:

English parse tree ε (in nodes ε1, ε2, … εn)

Output:

French sentence f (in words f1, f2, … fm )

Page 20: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Probability getting f for ε

Str(Θ(ε )) is a sequence of leaf words

Page 21: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� P(Θ|ε ): Probability of a particular set of RVsin a parse tree

Page 22: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� RVs θi = <νi,ρi,τi> independent on each other, but dependent on features of εi

Page 23: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Probability of getting a French sentence fgiven an English parse tree ε

Page 24: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Automatic Estimation of model parameters

- update parameter to maximize the likelihood of the training corpus

Page 25: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Algorithm

1. Initialize probability tables2. Reset counters3. For each iteration the number of events

are counted and weighted by theprobability of events

4. Parameter re-estimated by the counts

Page 26: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Introduction

� The model

� Experiment

� Conclusion

Page 27: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Experiment with small English-Japanesecorpus

- 2121 translation sentence pairs

- taggers build the English parse trees

� Comparison with IBM 5

Page 28: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Evaluation

- generate the most probable alignment ofthe training corpus (Viterbi)

- average score of the first 50 sentences

Page 29: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

0 pointsAlignment wrong

0,5 pointNot sure

1 pointAlignment okay

00,431IBM Model 5

100,582Our Model

Perfect sentencesAverage score

Page 30: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Page 31: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Perplexity

- Our Model: 15,79

- IBM Model 5: 9,84

Page 32: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Introduction

� The model

� Experiment

� Conclusion

Page 33: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

� Syntax-based translation model

� Statistical modelling the translation process

� Syntactic information for languages withdifferent word order

� Better alignment results in an experiment

Page 34: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Outline – 2nd Part

� Problems of Tree to String� Clone Operation� Tree to Tree

� Phrasal Translation� Translation System

Page 35: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Problem of Tree to String Model

� Not all re-orderings of terminal nodes are possible

� Constrains syntactic correspondence between native and foreign language

Page 36: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

The Clone operation

� Insert a copy of a subtree at any point in the tree

� Delete original node

Clone Delete

Page 37: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

When to clone?

� Should a clone be inserted as child of

� Decide which node should be cloned

� Probability of cloning is independent of previous cloning operations

)|( jins cloneP ε

∑==

kkmakeclone

imakecloneiclone P

PcloneP

)(

)()1|(

εεε

Page 38: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Example

Page 39: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Tree – to - Tree

� Syntactic trees for foreign and native language

� Output tree instead of output string� Additional tree transformations

� Single source node → Two target nodes� Two source nodes → Single target node

� New model: )|( ab TTP

Page 40: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Building the output tree

� At each level at the output tree:�Choose a elementary tree

�Align the children of the the elementary tree

� Translate the leaves

Page 41: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Elementary tree

� Means that two nodes can grouped together� Example:

� Nodes A and B are considered a elementary tree

))(|( aaaelem childrentP εε ⇒

Page 42: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Alignment of children� All children of an elementary tree are aligned at

once according to:

� Insertions and Deletions are also done in this step

))(|( aaalign tchildrenP ⇒εα

Page 43: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Tree – to – Tree Clone

� Same reordering problems as Tree – to –String

� Cloning will now added to alignment step

))(|( aains tchildrencloneP ⇒∈ εα

∑=∈

kkmakeclone

imakecloneiclone P

PcloneP

)(

)()|(

εεαε

Page 44: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Parameter comparison

Page 45: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Experiments

� Data from Korean – English corpus (Military domain)

� Korean suffixes often carry meaning�This suffixes became leaves in the syntax tree

�Vocabulary was reduced from 10059 to 3279

� Average Korean: 13 words and 21 tokens� Average English: 16 words

Page 46: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Results

Word pairings by System (A) and Gold Standard(G)

||||

||21

GA

GAAER

+∩−=

Page 47: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Phrasal Translation

� 1 to 1 word translation not perfect� Compound nouns

�German vs English

� Idiomatic phrases� “To kick the bucket” vs “Den Löffel abgeben”

Page 48: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Model

� 1 to 1 Model:

� 1 to N with fertility µ:

� N to N:

)|()|( eftt =Ττ

∏=

==Τl

iil efteleffftt

121 )|()|()|..()|( µτ

∏=

==Φl

imim

ml

eeefteeel

eeeffftph

12121

2121

)..|()..|(

)..|..()|(

µ

φ

Page 49: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Incorporation into TM�

� R: feature forreordering

� N: feature for insertion� ρ: Reorder operation

� υ: Insert operation

)|()|()1()|()|( iiiiiiii NnRrphPii

υρλφλεθ ΦΦ −+Φ=

Page 50: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

From Models to the Translation

� Task: Translate from Foreign to English

� Reformulation

)|( FEP

)()|()|( EPEFPFEP =

Page 51: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Language Model

� Probability of an English Sentence P(E)� N-gram LM in original Implementation� No syntactic Information used� Improvement: Immediate-head parsing for

language models

Page 52: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Immediate-head parsing

� English Sentence� Non-lexical PCFG → Large Parse Forest� Pruning of the Large Parse Forest

�Which edges have high probability of beingcorrect?

� Evaluation of pruned Parse with a lexicalPCFG

Page 53: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Decoder

� Decoder works in reverse direction to TM

� Find most probable syntactic tree E from a Sentence F

� Basic idea: “Translate“ Parse tree using TM to the foreign language

� Parse the foreign sentence

� “Translate” back to English� And check LM

Page 54: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Conclusion� Improvements to Syntactic Translation Model

� Tree to String Clone� Tree to Tree� Tree to Tree Clone� Phrasal Translation

� Brief overview over Translation Process

Page 55: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

References

� A Syntax based Statistical Translation ModelKenji Yamada and Kevin Knight, Proceedings of the Conference of the Association for Computational Linguistics 2001

� Loosely Tree-based Alignment for Machine TranslationDaniel Gildea, In Proceedings of the 41st Annual Meeting of the Association of Computational Linguistics (ACL-03), Supporo, Japan, 2003.

� Syntax baed Language Models for Statistical Machine TranslationEugene Charniak, Kevin Knight and Kenji Yamada, In MT Summit IX. Intl. Assoc. for Machine Translation, 2003

� A Decoder for Syntax-Based Statistical MT Kenji Ymada and Kevin Knight, Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL-02) , 2002

Page 56: A Syntax-based Statistical Machine Translation ModelA Syntax-based Statistical Translation Model References A Syntax based Statistical Translation Model Kenji Yamada and Kevin Knight,

A Syntax-based Statistical Translation Model

Thank you for your attention!