Research Infrastructures for Linguistics

9
Panel: NLP Helps Linguistics ACL 2010, Uppsala Research Infrastructures for Linguistics Erhard Hinrichs University of Tübingen

Transcript of Research Infrastructures for Linguistics

Panel: NLP Helps Linguistics ACL 2010, Uppsala

Research Infrastructures for Linguistics

Erhard Hinrichs

University of Tübingen

Panel: NLP Helps Linguistics ACL 2010, Uppsala

Corpus Studies: Now and Then

• Hinrichs (1981): Temporal Anaphora in English

• Hinrichs and Lau (2008): „In contrast“: A Complex Discourse Connective

Panel: NLP Helps Linguistics ACL 2010, Uppsala

What is Needed?

• Easy Availability of Language Resources

• Easy to Use to Tools

... Ideally in a common research infrastructure

Panel: NLP Helps Linguistics ACL 2010, Uppsala

CLARIN- Common Language Resources and Technology Infrastructure -

Panel: NLP Helps Linguistics ACL 2010, Uppsala

Virtual Language Observatory

Panel: NLP Helps Linguistics ACL 2010, Uppsala

WebLicht- A Service Oriented Architecture -

Web 2.0 Application for

Tool Chaining

and Execution

Repository

Stuttgart Tübingen Berlin Leipzig Finland

Standard-conformant

Text Corpus Encoding

Stuttgart Tübingen Leipzig

Romania

Panel: NLP Helps Linguistics ACL 2010, Uppsala

Dynamics of Language- Dialectal Variation -

WARD Clustering of Bulgarian Dialects

Panel: NLP Helps Linguistics ACL 2010, Uppsala

Computational Dialectometry

• Uses unsupervised machine learning methods to compute dialect continua

• Replaces traditional Isogloss methods of dialectology

• Utilizes an underlying eScience infrastructure of electronic language resources, analysis- and visualization tools

Panel: NLP Helps Linguistics ACL 2010, Uppsala

Bundling Isoglosses: An Example from Swabian