Administrivia CS388: Natural Language Processing Lecture...
Transcript of Administrivia CS388: Natural Language Processing Lecture...
-
CS388:NaturalLanguageProcessing
GregDurre8
Lecture25:Mul
-
Morphology
Whatismorphology?‣ Studyofhowwordsform
‣ Derivaestrangement(n)
become(v)=>unbecoming(adj)
Ibecome/shebecomes
‣ Inflecinflammable
‣ Mostlyappliestoverbsandnouns
MorphologicalInflec
-
NounInflec
-
Morphologically-RichLanguages
‣ Greatresourcesforchallengingyourassump
-
Predic
-
MorphemeSegmentaun+becom+ing—weshouldbeabletorecognizethesecommonpiecesandsplitthemoff
‣ Howdowedothis?
MorphemeSegmenta
-
Cross-LingualTagging
‣ LabelingPOSdatasetsisexpensive‣ Canwetransferannota
-
Cross-LingualParsing
McDonaldetal.(2011)
‣ NowthatwecanPOStagotherlanguages,canweparsethemtoo?
‣ Directtransfer:trainaparseroverPOSsequencesinonelanguage,thenapplyittoanotherlanguage
Iliketomatoes
PRONVERBNOUN
JelesaimePRONPRONVERB
Ilikethem
PRONVERBPRON
Parsertrained toaccepttag input
VERBistheheadofPRONandNOUN
parsenew data
train
Cross-LingualParsing
McDonaldetal.(2011)
‣ Mul
-
Mul
-
MulHindi(Devanagari).Transferswelldespitedifferentalphabets!
‣ Japanese=>English:differentscriptandverydifferentsyntax
Mul