The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California,...

23
The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley

Transcript of The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California,...

Page 1: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

The Berkeley linguistic archives

Leanne Hinton and Andrew GarrettUniversity of California, Berkeley

Page 2: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

The Berkeley linguistic archives

1. The archives

2. Digitization

3. Access & rights

Page 3: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

California's linguistic diversity

Compare color density: Californiavs. elsewhere in N. America

More, and more genealogically diverse, native languages than any comparablysized area in the western hemisphere

Page 4: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

California's linguistic diversity A "tribal group" here may

correspond to a language or to a set of (not always mutually intelligible)

languages.

Page 5: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

• Berkeley has the largest university archive of native language documentation in the U.S. It is one of the five major U.S. linguistic archives (with the ANLC, APS, NAA, and SIL).

• Material from after 1950 (and a bit of earlier material)– Survey of California and Other Indian Languages: paper material– Berkeley Language Center: audio recordings

• Material from before 1950 (and some later material)– Bancroft Library: paper material– Hearst Museum of Anthropology: audio recordings

Four linguistic archives at Berkeley

Page 6: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

The Survey

Berkeley archives: The archives • Digitization • Access & rights

The Survey of California and Other Indian Languages

Page 7: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

• Collection: 1,942 separate cataloged items

• Various formats– field notebooks– file slips– loose sheets– unpublished manuscripts & typescripts– microfilm– dried botanical specimens

Survey of California and Other Indian Languages: Archive contents

Page 8: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Inside a Survey cabinet

Berkeley archives: The archives • Digitization • Access & rights

A typical file cabinet in the Survey

Page 9: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

A. M. Halpern,Pomoan language(s), around 1940

Berkeley archives: The archives • Digitization • Access & rights

Page 10: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

A major "misplaced" ms:J. P. Harrington,Chochenyo (Ohlone),1920s (several 100 pp.)

This was the only significant documentation of the now long-dead native language of the East (S.F.) Bay.

(It doesn't belong to us; we'll return it to the Smithsonian sometime soon, I hope.)

Page 11: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

• Collection: 206 separate audio series• A representative series

– LA 113– Wintu language– Collected by Alice (Schlichter) Shepherd in 1975-80– 51 hours 19 minutes– Segmented into 450 distinct items, e.g. Segment 4, "Bear

Song and explanation"

• Most series are shorter than LA 113, some are longer

Berkeley Language Center: Archive contents

Page 12: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Bancroft Library

Berkeley archives: The archives • Digitization • Access & rights

Page 13: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

• Felipe Arroyo de la Cuesta– Notebook documenting Mutsun and other California languages,

1810-1819, 172 pp.– Manuscript vocabulary and grammar of Mutsun (Ohlone), 1815, 94

pp.

• A. L. Kroeber Papers– 38 boxes + 21 cartons + 14 vols. + 8 oversize folders = c. 44 linear

feet– linguistic field notes: 6 boxes + 5 cartons + 6 vols. + 3 oversize

folders

• Department of Anthropology records (field notes of many scholars over many years, esp. early in the 20th century)

• More recently: field notes and papers of Mary Haas, Murray B. Emeneau, etc.

Bancroft Library: A few of the important language documentation collections

Page 14: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

Bancroft Library archives

Page 15: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

• Collection: 112 pre-1931 audio series + 437 further recordings collected after 1931

• A representative series (typical length)– series 63– Central Sierra Miwok language– collected by E. W. Gifford in 1913 (on wax cylinders)– contains 32 individual items, ranging in duration from 21

seconds to 31 minutes– Contents: "Myth of how fire was obtained", "Story of the

Grizzly Bear and the Deer", etc.

Hearst Museum of Anthropology:Linguistic archives

Page 16: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

Hearst Museum, parts of the sound & film collection

Page 17: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

Digitization of the Berkeley archives

• Berkeley Language Center (post-1950 audio)– funded by NEH Preservation & Access grant, 2004-2006

– about half-way through digitization of the full collection

– all digital audio conforms to archival digital audio standards

– functioning web interface

• Survey of California and Other Indian Languages (post-1950 paper)– exploratory grant from the Salus Mundi Foundation

– we are just beginning (3-4 collections done as of now)

– all images conform to archival digital image standards

– no web interface yet

• Hearst Museum (pre-1950 audio)– digitization discussions just beginning

• Bancroft Library (pre-1950 paper)

Page 18: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley Language Center

Recordings whose boxes have a blue label have beendigitized.

Berkeley archives: The archives • Digitization • Access & rights

Page 19: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Segmentation and indexingWorking on the digitization project

Berkeley archives: The archives • Digitization • Access & rights

Page 20: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley Language Center digitization: CDs archived at Berkeley and in Davis; web-served from hard disk

Berkeley archives: The archives • Digitization • Access & rights

Page 21: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

QuickTime™ and aTIFF (LZW) decompressor

are needed to see this picture.

Berkeley archives: The archives • Digitization • Access & rights

Serrano language recordings made by Wick Miller in 1959

Page 22: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

Survey digitization work

Page 23: The Berkeley linguistic archives Leanne Hinton and Andrew Garrett University of California, Berkeley.

Berkeley archives: The archives • Digitization • Access & rights

Access issues

• Berkeley Language Center (post-1950 audio)– all access subject to conditions imposed by donors (linguists)

– most collections are freely accessible in person or (if the digitization is done) on the web

– in a few complex cases access rights are disputed and are still being negotiated (indigenous communities disagreeing, or linguists disagreeing with heritage communities); these are interesting but confidential

• Survey of California and Other Indian Languages (post-1950 paper)– free public access in person (no digitized images online yet)

– 90% of users are members of native communities

– for new donations, donors are asked to stipulate access conditions

• Hearst Museum and Bancroft Library (pre-1950 collections)– free public access in person (no digitization yet)

– collections are treated as archived papers of the donor