Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory...

110
Outline The Database as a Concept and Tool The Database model and its Evolution Database Applications Day 1: Introduction to Database Theory and Design Database Theory and Design Tyler Peterson International Summer School on Language Documentation and Description Leiden University Centre for Linguistics, Leiden November 26, 2011 Database Theory and Design Day 1: Introduction to Database Theory and Design

Transcript of Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory...

Page 1: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Day 1: Introduction to Database Theory andDesign

Database Theory and DesignTyler Peterson

International Summer School on Language Documentation and DescriptionLeiden University Centre for Linguistics, Leiden

November 26, 2011

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 2: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

My Details:

Tyler Petersonoffice: LUCL

Van Wijkplaats 4Room 205a

telephone: 071-5272059email: [email protected]

[email protected] (for Google docs)

office hours: Most afternoons until 18:00

I Please fill out the short survey, and don’t hesitate to contactme!

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 3: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.

I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 4: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.

I Different commercial and free database programs: advantages andlimitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 5: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 6: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 7: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 8: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 9: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 10: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.

I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 11: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological Processes

I Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 12: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for the Course:

I Databases: what they are, what linguists can use them for.I Databases vs. spreadsheets. Types of databases.I Different commercial and free database programs: advantages and

limitations.

I To familiarize you with the concepts in database theory anddesign.

I Theory: Terminology used in database theory; Understanding thedesign features of a database (entities and attributes).

I Design: Assessing your goals; planning a database on paper(Entity-Relationship diagrams); Best practices.

I Looking at the practical implemetation of an analyticaldatabase in MS Access.

I Cross tabulation: a front-line analytical tool.I Trigger–Target Database for Phonological ProcessesI Programma de Fonologia Experimental e Historica

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 13: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes,the applications.

I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 14: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes,the applications.

I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 15: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes,the applications.

I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 16: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes,the applications.

I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 17: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

5 Day Plan:

I Day 1: Introduction to databases: the types, the purposes,the applications.

I Day 2: Principles of Database Theory and Design

I Day 3: Relational Databases I

I Day 4: Relational Databases II, examples and implementation

I Day 5: Implementation

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 18: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Goals for Today:

The Database as a Concept and ToolUnderstanding what a database isUsing databases in linguistics

The Database model and its EvolutionThe ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Database ApplicationsChoosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 19: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as a concept: A structured collection of data,or structured information:

I Index cards in a shoebox.I A table in a spreadsheet.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 20: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as a concept: A structured collection of data,or structured information:

I Index cards in a shoebox.

I A table in a spreadsheet.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 21: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as a concept: A structured collection of data,or structured information:

I Index cards in a shoebox.I A table in a spreadsheet.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 22: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

Word Gloss Gram. Morph.hon ‘fish’ N ROOTsmax ‘bear, meat’ N ROOTalgyax ‘language’ N ROOTs ’m-algyax Gitksan N STEMs ’m- ‘true’ A PREFIXsiipxw ‘sick, ill’ A ROOTwii- ’nakw ‘tall’ A STEMwii- ‘long’ A PREFIX’nakw DISTAL ROOT’nakw EVIDENTIAL ROOTx- ‘consume’ V PREFIXiixwt ‘fish’ V ROOTwitxw ‘arrive’ V ROOTbakw ‘arrive’ V ROOTlitsxxw ‘read’ V ROOT=hl common noun Det. ENCLITIC=t proper noun Det. ENCLITIC=tip plural noun Det. ENCLITIC- ’y 1sg Agr. SUFFIX-n 2sg Agr. SUFFIX-t 3 Agr. SUFFIX

Table: Structured Information: a Gitksan (Tsimshianic) word list

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 23: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as an application, or a kind of ‘processor’:

I Different types of processors:

I Word processor: processes words (!)I Spreadsheet: processes financial, numerical and statistical

information.I Database program: processes structured information.

I The digital presentation of structured information through anapplication: MS Access; OpenOffice Calc; FileMaker Pro;MySQL with a PHP server; etc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 24: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as an application, or a kind of ‘processor’:I Different types of processors:

I Word processor: processes words (!)I Spreadsheet: processes financial, numerical and statistical

information.I Database program: processes structured information.

I The digital presentation of structured information through anapplication: MS Access; OpenOffice Calc; FileMaker Pro;MySQL with a PHP server; etc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 25: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as an application, or a kind of ‘processor’:I Different types of processors:

I Word processor: processes words (!)

I Spreadsheet: processes financial, numerical and statisticalinformation.

I Database program: processes structured information.

I The digital presentation of structured information through anapplication: MS Access; OpenOffice Calc; FileMaker Pro;MySQL with a PHP server; etc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 26: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as an application, or a kind of ‘processor’:I Different types of processors:

I Word processor: processes words (!)I Spreadsheet: processes financial, numerical and statistical

information.

I Database program: processes structured information.

I The digital presentation of structured information through anapplication: MS Access; OpenOffice Calc; FileMaker Pro;MySQL with a PHP server; etc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 27: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as an application, or a kind of ‘processor’:I Different types of processors:

I Word processor: processes words (!)I Spreadsheet: processes financial, numerical and statistical

information.I Database program: processes structured information.

I The digital presentation of structured information through anapplication: MS Access; OpenOffice Calc; FileMaker Pro;MySQL with a PHP server; etc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 28: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I The database as an application, or a kind of ‘processor’:I Different types of processors:

I Word processor: processes words (!)I Spreadsheet: processes financial, numerical and statistical

information.I Database program: processes structured information.

I The digital presentation of structured information through anapplication: MS Access; OpenOffice Calc; FileMaker Pro;MySQL with a PHP server; etc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 29: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I Spreadsheets are actually a kind of database: both organizeinformation into tables.

I The primary differences between a spreadsheet and database:different types of queries.

I Spreadsheets use functions to ask questions of numbers. “What’sthe average daily rainfall for the first six months of this year?”

I Databases uses functions to ask questions about structuredinformation: “Do we have any books on designing databases in ourlibrary? If so, on which shelves are they located?”

I Retrieval, and presentation: Today’s database applications aredesigned to retrieve and present data through queries throughspecially designed forms, within a database application, or onthe web.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 30: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I Spreadsheets are actually a kind of database: both organizeinformation into tables.

I The primary differences between a spreadsheet and database:different types of queries.

I Spreadsheets use functions to ask questions of numbers. “What’sthe average daily rainfall for the first six months of this year?”

I Databases uses functions to ask questions about structuredinformation: “Do we have any books on designing databases in ourlibrary? If so, on which shelves are they located?”

I Retrieval, and presentation: Today’s database applications aredesigned to retrieve and present data through queries throughspecially designed forms, within a database application, or onthe web.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 31: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I Spreadsheets are actually a kind of database: both organizeinformation into tables.

I The primary differences between a spreadsheet and database:different types of queries.

I Spreadsheets use functions to ask questions of numbers. “What’sthe average daily rainfall for the first six months of this year?”

I Databases uses functions to ask questions about structuredinformation: “Do we have any books on designing databases in ourlibrary? If so, on which shelves are they located?”

I Retrieval, and presentation: Today’s database applications aredesigned to retrieve and present data through queries throughspecially designed forms, within a database application, or onthe web.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 32: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I Spreadsheets are actually a kind of database: both organizeinformation into tables.

I The primary differences between a spreadsheet and database:different types of queries.

I Spreadsheets use functions to ask questions of numbers. “What’sthe average daily rainfall for the first six months of this year?”

I Databases uses functions to ask questions about structuredinformation: “Do we have any books on designing databases in ourlibrary? If so, on which shelves are they located?”

I Retrieval, and presentation: Today’s database applications aredesigned to retrieve and present data through queries throughspecially designed forms, within a database application, or onthe web.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 33: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What is a database?

I Spreadsheets are actually a kind of database: both organizeinformation into tables.

I The primary differences between a spreadsheet and database:different types of queries.

I Spreadsheets use functions to ask questions of numbers. “What’sthe average daily rainfall for the first six months of this year?”

I Databases uses functions to ask questions about structuredinformation: “Do we have any books on designing databases in ourlibrary? If so, on which shelves are they located?”

I Retrieval, and presentation: Today’s database applications aredesigned to retrieve and present data through queries throughspecially designed forms, within a database application, or onthe web.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 34: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:

I Archiving massive amounts of language/linguistic data.I Lexicography and dictionary making.I Enables collaboration through client-server applications over a

network.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.I Tools for verifying and evaluating contrasting empirical and

theoretical claims.I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 35: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:I Archiving massive amounts of language/linguistic data.

I Lexicography and dictionary making.I Enables collaboration through client-server applications over a

network.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.I Tools for verifying and evaluating contrasting empirical and

theoretical claims.I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 36: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:I Archiving massive amounts of language/linguistic data.I Lexicography and dictionary making.

I Enables collaboration through client-server applications over anetwork.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.I Tools for verifying and evaluating contrasting empirical and

theoretical claims.I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 37: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:I Archiving massive amounts of language/linguistic data.I Lexicography and dictionary making.I Enables collaboration through client-server applications over a

network.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.I Tools for verifying and evaluating contrasting empirical and

theoretical claims.I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 38: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:I Archiving massive amounts of language/linguistic data.I Lexicography and dictionary making.I Enables collaboration through client-server applications over a

network.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.I Tools for verifying and evaluating contrasting empirical and

theoretical claims.I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 39: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:I Archiving massive amounts of language/linguistic data.I Lexicography and dictionary making.I Enables collaboration through client-server applications over a

network.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.

I Tools for verifying and evaluating contrasting empirical andtheoretical claims.

I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 40: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:I Archiving massive amounts of language/linguistic data.I Lexicography and dictionary making.I Enables collaboration through client-server applications over a

network.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.I Tools for verifying and evaluating contrasting empirical and

theoretical claims.

I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 41: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:I Archiving massive amounts of language/linguistic data.I Lexicography and dictionary making.I Enables collaboration through client-server applications over a

network.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.I Tools for verifying and evaluating contrasting empirical and

theoretical claims.I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 42: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

Why use databases in linguistics?

I Linguistic research is a data-rich enterprise:I Archiving massive amounts of language/linguistic data.I Lexicography and dictionary making.I Enables collaboration through client-server applications over a

network.

I Database applications are particularily well-suited to linguisticresearch (cf. Nerbonne 1997; Everaert et al 2009):

I Cross-linguistic and typological research.I Tools for verifying and evaluating contrasting empirical and

theoretical claims.I Specialized queries that can yield new insights into data.

I Consistency and integrity: imposing a structure oninformation can help reduce inaccuracies and redundancies.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 43: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What can databases be used for in linguistics?

I Two broad types of databases in linguistics:

I A Linguistic database: contains data from language research(i.e. words, phonemes, grammatical categories, fundamentalfrequencies, etc.)

I A Metalinguistic database: contains data about languageresearch (i.e. names of speakers, locations, recording details,etc.)

I Both are concieved, designed and implemented using the sameprinciples.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 44: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What can databases be used for in linguistics?

I Two broad types of databases in linguistics:I A Linguistic database: contains data from language research

(i.e. words, phonemes, grammatical categories, fundamentalfrequencies, etc.)

I A Metalinguistic database: contains data about languageresearch (i.e. names of speakers, locations, recording details,etc.)

I Both are concieved, designed and implemented using the sameprinciples.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 45: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What can databases be used for in linguistics?

I Two broad types of databases in linguistics:I A Linguistic database: contains data from language research

(i.e. words, phonemes, grammatical categories, fundamentalfrequencies, etc.)

I A Metalinguistic database: contains data about languageresearch (i.e. names of speakers, locations, recording details,etc.)

I Both are concieved, designed and implemented using the sameprinciples.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 46: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Understanding what a database isUsing databases in linguistics

What can databases be used for in linguistics?

I Two broad types of databases in linguistics:I A Linguistic database: contains data from language research

(i.e. words, phonemes, grammatical categories, fundamentalfrequencies, etc.)

I A Metalinguistic database: contains data about languageresearch (i.e. names of speakers, locations, recording details,etc.)

I Both are concieved, designed and implemented using the sameprinciples.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 47: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

A common starting point: the ‘Flat’ database

Word Gloss Gram. Morph.hon ‘fish’ N ROOTsmax ‘bear, meat’ N ROOTalgyax ‘language’ N ROOTs ’m-algyax Gitksan N STEMs ’m- ‘true’ A PREFIXsiipxw ‘sick, ill’ A ROOTwii- ’nakw ‘tall’ A STEMwii- ‘long’ A PREFIX’nakw DISTAL ROOT’nakw EVIDENTIAL ROOTx- ‘consume’ V PREFIXiixwt ‘fish’ V ROOTwitxw ‘arrive’ V ROOTbakw ‘arrive’ V ROOTlitsxxw ‘read’ V ROOT=hl common noun Det. ENCLITIC=t proper noun Det. ENCLITIC=tip plural noun Det. ENCLITIC- ’y 1sg Agr. SUFFIX-n 2sg Agr. SUFFIX-t 3 Agr. SUFFIX

Table: A ‘Flat’ Database of a Gitksan (Tsimshianic) word list

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 48: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement;Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) orspreadsheet (MS Excel).

I One record in a paper form = One row (“record”) incomputerized table of data.

I Adequate for a simple applications with not a lot of data orfeatures (i.e. categories).

I Generating word lists.I Basic searches.

I A ‘flat’ database.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 49: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement;Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) orspreadsheet (MS Excel).

I One record in a paper form = One row (“record”) incomputerized table of data.

I Adequate for a simple applications with not a lot of data orfeatures (i.e. categories).

I Generating word lists.I Basic searches.

I A ‘flat’ database.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 50: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement;Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) orspreadsheet (MS Excel).

I One record in a paper form = One row (“record”) incomputerized table of data.

I Adequate for a simple applications with not a lot of data orfeatures (i.e. categories).

I Generating word lists.I Basic searches.

I A ‘flat’ database.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 51: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement;Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) orspreadsheet (MS Excel).

I One record in a paper form = One row (“record”) incomputerized table of data.

I Adequate for a simple applications with not a lot of data orfeatures (i.e. categories).

I Generating word lists.I Basic searches.

I A ‘flat’ database.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 52: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement;Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) orspreadsheet (MS Excel).

I One record in a paper form = One row (“record”) incomputerized table of data.

I Adequate for a simple applications with not a lot of data orfeatures (i.e. categories).

I Generating word lists.I Basic searches.

I A ‘flat’ database.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 53: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

A common starting point: the ‘Flat’ database

I Language data in field notes, a numbered arrangement;Possibly transferred onto cards.

I Enter language data into a word processor (MS Word) orspreadsheet (MS Excel).

I One record in a paper form = One row (“record”) incomputerized table of data.

I Adequate for a simple applications with not a lot of data orfeatures (i.e. categories).

I Generating word lists.I Basic searches.

I A ‘flat’ database.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 54: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Limitations of a Flat database

I You find you need more out of your data:

I Inflexible.I Difficult to expand.

I Many redundant data entries

I Identifying and eliminating incorrect entries.I Inconsistency.I Unmanageable file size (difficult to transfer), and potential

memory problems.

I Can become overwhelming complex, and unstable along withthe burden of maintaining the database.

I For language data: can obscure potentially meaningfulimplications, relationships and generalizations.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 55: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Limitations of a Flat database

I You find you need more out of your data:I Inflexible.I Difficult to expand.

I Many redundant data entries

I Identifying and eliminating incorrect entries.I Inconsistency.I Unmanageable file size (difficult to transfer), and potential

memory problems.

I Can become overwhelming complex, and unstable along withthe burden of maintaining the database.

I For language data: can obscure potentially meaningfulimplications, relationships and generalizations.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 56: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Limitations of a Flat database

I You find you need more out of your data:I Inflexible.I Difficult to expand.

I Many redundant data entries

I Identifying and eliminating incorrect entries.I Inconsistency.I Unmanageable file size (difficult to transfer), and potential

memory problems.

I Can become overwhelming complex, and unstable along withthe burden of maintaining the database.

I For language data: can obscure potentially meaningfulimplications, relationships and generalizations.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 57: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Limitations of a Flat database

I You find you need more out of your data:I Inflexible.I Difficult to expand.

I Many redundant data entriesI Identifying and eliminating incorrect entries.I Inconsistency.I Unmanageable file size (difficult to transfer), and potential

memory problems.

I Can become overwhelming complex, and unstable along withthe burden of maintaining the database.

I For language data: can obscure potentially meaningfulimplications, relationships and generalizations.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 58: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Limitations of a Flat database

I You find you need more out of your data:I Inflexible.I Difficult to expand.

I Many redundant data entriesI Identifying and eliminating incorrect entries.I Inconsistency.I Unmanageable file size (difficult to transfer), and potential

memory problems.

I Can become overwhelming complex, and unstable along withthe burden of maintaining the database.

I For language data: can obscure potentially meaningfulimplications, relationships and generalizations.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 59: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Limitations of a Flat database

I You find you need more out of your data:I Inflexible.I Difficult to expand.

I Many redundant data entriesI Identifying and eliminating incorrect entries.I Inconsistency.I Unmanageable file size (difficult to transfer), and potential

memory problems.

I Can become overwhelming complex, and unstable along withthe burden of maintaining the database.

I For language data: can obscure potentially meaningfulimplications, relationships and generalizations.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 60: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Limitations of a Flat database cont.

Word Gloss Gram. Morph.hon ‘fish’ N ROOTsmax ‘bear, meat’ N ROOTalgyax ‘language’ N ROOTs ’m-algyax Gitksan N STEMs ’m- ‘true’ A PREFIXsiipxw ‘sick, ill’ A ROOTwii- ’nakw ‘tall’ A STEMwii- ‘long’ A PREFIX’nakw DISTAL ROOT’nakw EVIDENTIAL ROOTx- ‘consume’ V PREFIXiixwt ‘fish’ V ROOTwitxw ‘arrive’ V ROOTbakw ‘arrive’ V ROOTlitsxxw ‘read’ V ROOT=hl common noun Det. ENCLITIC=t proper noun Det. ENCLITIC=tip plural noun Det. ENCLITIC- ’y 1sg Agr. SUFFIX-n 2sg Agr. SUFFIX-t 3 Agr. SUFFIX

Table: A ‘Flat’ Database of a Gitksan (Tsimshianic) word list

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 61: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

The Solution:

I Separate the flat database into two interacting systems:

I. Database Management System (DBMS)

II. An application to interact with the DBMS.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 62: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

The Solution:

I Separate the flat database into two interacting systems:

I. Database Management System (DBMS)

II. An application to interact with the DBMS.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 63: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

The Solution:

I Separate the flat database into two interacting systems:

I. Database Management System (DBMS)

II. An application to interact with the DBMS.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 64: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

I. Database Management System (DBMS)

I Keeps data in small, unique chunks

I Efficient storageI Maintains ‘just enough’ redundancy

I Principle focus: handling data

I Handles physical details of storing data efficientlyI Delivers & manipulates data for applicationsI Security and stability

I Several “industrial-strength” DBMS:

I OracleI Microsoft SQL Server

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 65: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

I. Database Management System (DBMS)

I Keeps data in small, unique chunksI Efficient storageI Maintains ‘just enough’ redundancy

I Principle focus: handling data

I Handles physical details of storing data efficientlyI Delivers & manipulates data for applicationsI Security and stability

I Several “industrial-strength” DBMS:

I OracleI Microsoft SQL Server

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 66: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

I. Database Management System (DBMS)

I Keeps data in small, unique chunksI Efficient storageI Maintains ‘just enough’ redundancy

I Principle focus: handling data

I Handles physical details of storing data efficientlyI Delivers & manipulates data for applicationsI Security and stability

I Several “industrial-strength” DBMS:

I OracleI Microsoft SQL Server

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 67: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

I. Database Management System (DBMS)

I Keeps data in small, unique chunksI Efficient storageI Maintains ‘just enough’ redundancy

I Principle focus: handling dataI Handles physical details of storing data efficientlyI Delivers & manipulates data for applicationsI Security and stability

I Several “industrial-strength” DBMS:

I OracleI Microsoft SQL Server

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 68: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

I. Database Management System (DBMS)

I Keeps data in small, unique chunksI Efficient storageI Maintains ‘just enough’ redundancy

I Principle focus: handling dataI Handles physical details of storing data efficientlyI Delivers & manipulates data for applicationsI Security and stability

I Several “industrial-strength” DBMS:

I OracleI Microsoft SQL Server

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 69: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

I. Database Management System (DBMS)

I Keeps data in small, unique chunksI Efficient storageI Maintains ‘just enough’ redundancy

I Principle focus: handling dataI Handles physical details of storing data efficientlyI Delivers & manipulates data for applicationsI Security and stability

I Several “industrial-strength” DBMS:I OracleI Microsoft SQL Server

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 70: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:

I The DBMS stores data and responds to queries – we don’t interactwith it directly.

I DBMSs are used with a “client” application: MS Access, FileMakerPro etc. These create a graphical user interface to interact with thedata through forms and reports.

I A language to query data from a DBMS:

I Structured Query Language (SQL): a standardized language thatuses user-defined functions to query the data.

I Generates reports in form of a table or pivot table.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 71: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:I The DBMS stores data and responds to queries – we don’t interact

with it directly.

I DBMSs are used with a “client” application: MS Access, FileMakerPro etc. These create a graphical user interface to interact with thedata through forms and reports.

I A language to query data from a DBMS:

I Structured Query Language (SQL): a standardized language thatuses user-defined functions to query the data.

I Generates reports in form of a table or pivot table.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 72: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:I The DBMS stores data and responds to queries – we don’t interact

with it directly.I DBMSs are used with a “client” application: MS Access, FileMaker

Pro etc. These create a graphical user interface to interact with thedata through forms and reports.

I A language to query data from a DBMS:

I Structured Query Language (SQL): a standardized language thatuses user-defined functions to query the data.

I Generates reports in form of a table or pivot table.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 73: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:I The DBMS stores data and responds to queries – we don’t interact

with it directly.I DBMSs are used with a “client” application: MS Access, FileMaker

Pro etc. These create a graphical user interface to interact with thedata through forms and reports.

I A language to query data from a DBMS:

I Structured Query Language (SQL): a standardized language thatuses user-defined functions to query the data.

I Generates reports in form of a table or pivot table.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 74: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:I The DBMS stores data and responds to queries – we don’t interact

with it directly.I DBMSs are used with a “client” application: MS Access, FileMaker

Pro etc. These create a graphical user interface to interact with thedata through forms and reports.

I A language to query data from a DBMS:I Structured Query Language (SQL): a standardized language that

uses user-defined functions to query the data.

I Generates reports in form of a table or pivot table.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 75: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

II. Applications that interact with the DBMS

I A program to retrieve data from a DBMS:I The DBMS stores data and responds to queries – we don’t interact

with it directly.I DBMSs are used with a “client” application: MS Access, FileMaker

Pro etc. These create a graphical user interface to interact with thedata through forms and reports.

I A language to query data from a DBMS:I Structured Query Language (SQL): a standardized language that

uses user-defined functions to query the data.I Generates reports in form of a table or pivot table.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 76: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Four types of database models:

I A Flat databaseI Made of a single table, or “file”.I Each row corresponds to some object (e.g., a language) being

described, and each column represents a property (attribute), suchas name, location, or word order etc..

I A Relational database

I Consists of several tables (relations) linked to each other.

I A Hierarchical database

I Not as a table but as a tree structure, similar to folders andsubfolders in an operating system: each unit “belongs” to somelarger unit, and contains smaller units.

I An Object-Oriented database database

I Data are modeled as objects of various types that share or inheritproperties according to their type

I For example, a database about word classes could let objects of thetype transitive verb inherit properties of the type verb.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 77: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Four types of database models:

I A Flat databaseI Made of a single table, or “file”.I Each row corresponds to some object (e.g., a language) being

described, and each column represents a property (attribute), suchas name, location, or word order etc..

I A Relational databaseI Consists of several tables (relations) linked to each other.

I A Hierarchical database

I Not as a table but as a tree structure, similar to folders andsubfolders in an operating system: each unit “belongs” to somelarger unit, and contains smaller units.

I An Object-Oriented database database

I Data are modeled as objects of various types that share or inheritproperties according to their type

I For example, a database about word classes could let objects of thetype transitive verb inherit properties of the type verb.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 78: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Four types of database models:

I A Flat databaseI Made of a single table, or “file”.I Each row corresponds to some object (e.g., a language) being

described, and each column represents a property (attribute), suchas name, location, or word order etc..

I A Relational databaseI Consists of several tables (relations) linked to each other.

I A Hierarchical databaseI Not as a table but as a tree structure, similar to folders and

subfolders in an operating system: each unit “belongs” to somelarger unit, and contains smaller units.

I An Object-Oriented database database

I Data are modeled as objects of various types that share or inheritproperties according to their type

I For example, a database about word classes could let objects of thetype transitive verb inherit properties of the type verb.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 79: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

The ‘Flat’ databaseThe Database Management System (DBMS)Types of Databases Models

Four types of database models:

I A Flat databaseI Made of a single table, or “file”.I Each row corresponds to some object (e.g., a language) being

described, and each column represents a property (attribute), suchas name, location, or word order etc..

I A Relational databaseI Consists of several tables (relations) linked to each other.

I A Hierarchical databaseI Not as a table but as a tree structure, similar to folders and

subfolders in an operating system: each unit “belongs” to somelarger unit, and contains smaller units.

I An Object-Oriented database databaseI Data are modeled as objects of various types that share or inherit

properties according to their typeI For example, a database about word classes could let objects of the

type transitive verb inherit properties of the type verb.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 80: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Two types of database applications:

I Stand-alone desktop databases: MS Access

I The network database: WordNet

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 81: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Two types of database applications:

I Stand-alone desktop databases: MS Access

I The network database: WordNet

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 82: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for boththe database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up,sent by email, etc.

I The ony requirement is a desktop computer with the databaseapplication; software is easy to install or already present, andit is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 83: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for boththe database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up,sent by email, etc.

I The ony requirement is a desktop computer with the databaseapplication; software is easy to install or already present, andit is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 84: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for boththe database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up,sent by email, etc.

I The ony requirement is a desktop computer with the databaseapplication; software is easy to install or already present, andit is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 85: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for boththe database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up,sent by email, etc.

I The ony requirement is a desktop computer with the databaseapplication; software is easy to install or already present, andit is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 86: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for boththe database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up,sent by email, etc.

I The ony requirement is a desktop computer with the databaseapplication; software is easy to install or already present, andit is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 87: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for boththe database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up,sent by email, etc.

I The ony requirement is a desktop computer with the databaseapplication; software is easy to install or already present, andit is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 88: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Stand-alone desktop databases

I Suitable for the one-person research project.

I A stand-alone software with a graphical user interface for boththe database configuration, and to create forms and queries.

I Many tasks are automated; customizable templates.

I Everything fits in one file or folder, and can be backed up,sent by email, etc.

I The ony requirement is a desktop computer with the databaseapplication; software is easy to install or already present, andit is not necessary to set up a server.

I Internet collaboration possible but not required.

I MS Access, FileMaker Pro, OpenOffice Calc.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 89: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Network databases

I Ideal when multiple people must collaborate on data entry.

I A modular system of three parts:

I A web-based interface (i.e. a web browser)I A server. (running PHP to manage the queries and generate the

web pages)I The database. (MySQL)

I Most of the same functions with stand-alone databases can beused in network databases.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 90: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Network databases

I Ideal when multiple people must collaborate on data entry.I A modular system of three parts:

I A web-based interface (i.e. a web browser)I A server. (running PHP to manage the queries and generate the

web pages)I The database. (MySQL)

I Most of the same functions with stand-alone databases can beused in network databases.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 91: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Network databases

I Ideal when multiple people must collaborate on data entry.I A modular system of three parts:

I A web-based interface (i.e. a web browser)I A server. (running PHP to manage the queries and generate the

web pages)I The database. (MySQL)

I Most of the same functions with stand-alone databases can beused in network databases.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 92: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Network databases

I Ideal when multiple people must collaborate on data entry.I A modular system of three parts:

I A web-based interface (i.e. a web browser)I A server. (running PHP to manage the queries and generate the

web pages)I The database. (MySQL)

I Most of the same functions with stand-alone databases can beused in network databases.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 93: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Comparing Pros and Cons

I Stand-alone databasesI Pros: Can be implemented quickly and easily.I Cons: Can be expensive and proprietary.

I Network databases

I Pros: Free, with more or less the same functionality as astand-alone, proprietary database.

I Cons: Extensive computer knowledge required (i.e. setting up aserver, making the connections, knowledge of HTML)

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 94: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Comparing Pros and Cons

I Stand-alone databasesI Pros: Can be implemented quickly and easily.I Cons: Can be expensive and proprietary.

I Network databasesI Pros: Free, with more or less the same functionality as a

stand-alone, proprietary database.I Cons: Extensive computer knowledge required (i.e. setting up a

server, making the connections, knowledge of HTML)

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 95: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Criteria

I General:I Who produced the software, which platforms the software runs on?

Is other software needed?I Is it easy to use? Is it well-supported/documented? Cost?

I Technical:

I Ability to import and export data (i.e. text, XML files).I Are the pre-defined and/or user-defined options helpful? Can they

be easily modified?I Is the application scalable?I Is it relational?

I Linguistic:

I Unicode compatibility, special character input methods, and theease of character input.

I Ability to handle texts and texts, interlinearized material.I Allows you to follow the best practices for archiving linguistic data

(i.e. XML, E-MELD emeld.org).

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 96: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Criteria

I General:I Who produced the software, which platforms the software runs on?

Is other software needed?I Is it easy to use? Is it well-supported/documented? Cost?

I Technical:I Ability to import and export data (i.e. text, XML files).I Are the pre-defined and/or user-defined options helpful? Can they

be easily modified?I Is the application scalable?I Is it relational?

I Linguistic:

I Unicode compatibility, special character input methods, and theease of character input.

I Ability to handle texts and texts, interlinearized material.I Allows you to follow the best practices for archiving linguistic data

(i.e. XML, E-MELD emeld.org).

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 97: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Criteria

I General:I Who produced the software, which platforms the software runs on?

Is other software needed?I Is it easy to use? Is it well-supported/documented? Cost?

I Technical:I Ability to import and export data (i.e. text, XML files).I Are the pre-defined and/or user-defined options helpful? Can they

be easily modified?I Is the application scalable?I Is it relational?

I Linguistic:I Unicode compatibility, special character input methods, and the

ease of character input.I Ability to handle texts and texts, interlinearized material.I Allows you to follow the best practices for archiving linguistic data

(i.e. XML, E-MELD emeld.org).

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 98: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

Databases designed for linguistics

I Stand-alone: SIL Shoebox 5.0 with Toolbox 1.2I Runs on both Windows and Mac. Proprietary, but not too

expensive.I Not very well supported, problems exporting XML files.I A native environment for text interlinearization and analysis.I Uses filter-type searches, not structured queries.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 99: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Stand-alone, relational databases:

I MS Access: powerful and customizable form and query tools.Proprietary and not cheap.

I FileMaker Pro: also with customizable form and query tools.Proprietary and not cheap.

I OpenOffice Calc: less features than Access or FileMaker, but hasthe same core functionality. Open source (free), but somewhatunstable.

I Network: MySQL (http://www.mysql.com/); Apache serverwith PHP; Google Chrome – all free.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 100: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Stand-alone, relational databases:I MS Access: powerful and customizable form and query tools.

Proprietary and not cheap.

I FileMaker Pro: also with customizable form and query tools.Proprietary and not cheap.

I OpenOffice Calc: less features than Access or FileMaker, but hasthe same core functionality. Open source (free), but somewhatunstable.

I Network: MySQL (http://www.mysql.com/); Apache serverwith PHP; Google Chrome – all free.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 101: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Stand-alone, relational databases:I MS Access: powerful and customizable form and query tools.

Proprietary and not cheap.I FileMaker Pro: also with customizable form and query tools.

Proprietary and not cheap.

I OpenOffice Calc: less features than Access or FileMaker, but hasthe same core functionality. Open source (free), but somewhatunstable.

I Network: MySQL (http://www.mysql.com/); Apache serverwith PHP; Google Chrome – all free.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 102: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Stand-alone, relational databases:I MS Access: powerful and customizable form and query tools.

Proprietary and not cheap.I FileMaker Pro: also with customizable form and query tools.

Proprietary and not cheap.I OpenOffice Calc: less features than Access or FileMaker, but has

the same core functionality. Open source (free), but somewhatunstable.

I Network: MySQL (http://www.mysql.com/); Apache serverwith PHP; Google Chrome – all free.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 103: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Stand-alone, relational databases:I MS Access: powerful and customizable form and query tools.

Proprietary and not cheap.I FileMaker Pro: also with customizable form and query tools.

Proprietary and not cheap.I OpenOffice Calc: less features than Access or FileMaker, but has

the same core functionality. Open source (free), but somewhatunstable.

I Network: MySQL (http://www.mysql.com/); Apache serverwith PHP; Google Chrome – all free.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 104: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Access, FileMaker Pro, and Calc are all suitable databases forlinguistic analysis: they are all relational and can handle SQLqueries.

I Some more pros/cons and comparisons:

I Access, FileMaker Pro, and Calc do not handle texts well.I Access is more constrained than FileMaker or Calc – less possibility

for introducing errors or inconsistencies.I FileMaker and Calc are suited to smaller, less-complex projects.I All are XML compatible and network ready.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 105: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Access, FileMaker Pro, and Calc are all suitable databases forlinguistic analysis: they are all relational and can handle SQLqueries.

I Some more pros/cons and comparisons:

I Access, FileMaker Pro, and Calc do not handle texts well.I Access is more constrained than FileMaker or Calc – less possibility

for introducing errors or inconsistencies.I FileMaker and Calc are suited to smaller, less-complex projects.I All are XML compatible and network ready.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 106: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Access, FileMaker Pro, and Calc are all suitable databases forlinguistic analysis: they are all relational and can handle SQLqueries.

I Some more pros/cons and comparisons:I Access, FileMaker Pro, and Calc do not handle texts well.

I Access is more constrained than FileMaker or Calc – less possibilityfor introducing errors or inconsistencies.

I FileMaker and Calc are suited to smaller, less-complex projects.I All are XML compatible and network ready.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 107: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Access, FileMaker Pro, and Calc are all suitable databases forlinguistic analysis: they are all relational and can handle SQLqueries.

I Some more pros/cons and comparisons:I Access, FileMaker Pro, and Calc do not handle texts well.I Access is more constrained than FileMaker or Calc – less possibility

for introducing errors or inconsistencies.

I FileMaker and Calc are suited to smaller, less-complex projects.I All are XML compatible and network ready.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 108: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Access, FileMaker Pro, and Calc are all suitable databases forlinguistic analysis: they are all relational and can handle SQLqueries.

I Some more pros/cons and comparisons:I Access, FileMaker Pro, and Calc do not handle texts well.I Access is more constrained than FileMaker or Calc – less possibility

for introducing errors or inconsistencies.I FileMaker and Calc are suited to smaller, less-complex projects.

I All are XML compatible and network ready.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 109: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

The Players

I Access, FileMaker Pro, and Calc are all suitable databases forlinguistic analysis: they are all relational and can handle SQLqueries.

I Some more pros/cons and comparisons:I Access, FileMaker Pro, and Calc do not handle texts well.I Access is more constrained than FileMaker or Calc – less possibility

for introducing errors or inconsistencies.I FileMaker and Calc are suited to smaller, less-complex projects.I All are XML compatible and network ready.

Database Theory and Design Day 1: Introduction to Database Theory and Design

Page 110: Day 1: Introduction to Database Theory and Designtrpete13/materials/Day_1.pdf · Database Theory and Design Day 1: Introduction to Database Theory and Design. Outline The Database

OutlineThe Database as a Concept and Tool

The Database model and its EvolutionDatabase Applications

Choosing the right Database ApplicationLinguistic DBMS and InterfacesNon-Linguistic DBMS and InterfacesReferences and Suggested Readings

I There are countless resources on the web on database design,theory, and implementation.

I Specific references on linguistic databases:I Ferrara, M. & Moran, S. 2004. Review of DBMS for Linguistic

Purposes. Proceedings of E-MELD 2004. Online publication, athttp://www.linguistlist.org/emeld/workshop/2004/proceedings.html.

I Nerbonne, John. 1998. Linguistic Databases, CSLI, Stanford.I Everaert, Musgrave, Dimitriadis (eds) 2009. The Use of Databases

in Cross-Linguistic Studies. Empirical Approaches to LanguageTypology (EALT) 41. Mouton de Gruyter.

Database Theory and Design Day 1: Introduction to Database Theory and Design