Statistical Language Modeling (SLM); Computational Linguistics (CL)

1
1 Carnegie Mellon Statistical Language Modeling (SLM); Computational Linguistics (CL) Surface (s) and hidden (h) components of language The p(s,h) function Statistical language modeling: estimating p(s) Distribution of words, sentences, documents Computational linguistics / NLP: estimating p(h|s) Classification vs. Regression vs. Density estimation The source-channel model (aka, a Bayes classifier for everything) SLM used as prior: speech, translation, spelling correction, OCR,... SLM used as likelihood: document classification,... [Probability: prior, posterior, Bayes' theorem, Bayes classifier] ) , Pr( h s

description

Statistical Language Modeling (SLM); Computational Linguistics (CL). Surface (s) and hidden (h) components of language The p(s,h) function Statistical language modeling: estimating p(s) Distribution of words, sentences, documents Computational linguistics / NLP: estimating p(h|s) - PowerPoint PPT Presentation

Transcript of Statistical Language Modeling (SLM); Computational Linguistics (CL)

Page 1: Statistical Language Modeling (SLM); Computational Linguistics (CL)

1

CarnegieMellon

Statistical Language Modeling (SLM); Computational Linguistics (CL)

Surface (s) and hidden (h) components of language The p(s,h) function Statistical language modeling: estimating p(s)

Distribution of words, sentences, documents Computational linguistics / NLP: estimating p(h|s) Classification vs. Regression vs. Density estimation The source-channel model (aka, a Bayes classifier for

everything) SLM used as prior: speech, translation, spelling correction,

OCR,... SLM used as likelihood: document classification,... [Probability: prior, posterior, Bayes' theorem, Bayes classifier]

),Pr( hs