Research Infrastructures for Linguistics
Transcript of Research Infrastructures for Linguistics
Panel: NLP Helps Linguistics ACL 2010, Uppsala
Research Infrastructures for Linguistics
Erhard Hinrichs
University of Tübingen
Panel: NLP Helps Linguistics ACL 2010, Uppsala
Corpus Studies: Now and Then
• Hinrichs (1981): Temporal Anaphora in English
• Hinrichs and Lau (2008): „In contrast“: A Complex Discourse Connective
Panel: NLP Helps Linguistics ACL 2010, Uppsala
What is Needed?
• Easy Availability of Language Resources
• Easy to Use to Tools
... Ideally in a common research infrastructure
Panel: NLP Helps Linguistics ACL 2010, Uppsala
CLARIN- Common Language Resources and Technology Infrastructure -
Panel: NLP Helps Linguistics ACL 2010, Uppsala
WebLicht- A Service Oriented Architecture -
Web 2.0 Application for
Tool Chaining
and Execution
Repository
Stuttgart Tübingen Berlin Leipzig Finland
Standard-conformant
Text Corpus Encoding
Stuttgart Tübingen Leipzig
Romania
Panel: NLP Helps Linguistics ACL 2010, Uppsala
Dynamics of Language- Dialectal Variation -
WARD Clustering of Bulgarian Dialects
Panel: NLP Helps Linguistics ACL 2010, Uppsala
Computational Dialectometry
• Uses unsupervised machine learning methods to compute dialect continua
• Replaces traditional Isogloss methods of dialectology
• Utilizes an underlying eScience infrastructure of electronic language resources, analysis- and visualization tools