Grf corpus project training 1
Uploaded by david-lazar
RA TRAINING DAY
GRF Corpus project
Sign in to the project
Get your user account and log in to https://grfcorpus.teamworkpm.net/
Get the software
Download the software from http://tla.mpi.nl/tools/tla-tools/elan/ or from the project page.
ELAN working environment
An ELAN project consists of 2 files: an .etf file and a source audio file.
Download 2 files from Teamwork:
1) your personal audio file, as per your task
2) the standard .etf template file
Create your new project
File -> New: select the wav/mp3 audio file and the .etf template file.
The annotation work consists of 2 parts:
1) segmentation
2) transcription
Segmentation 1
Options -> segmentation mode
Listen first. The recording contains different participants.
Segmentation 2
Start with Speaker1 - Sentence tier
Segment each speaker separately.
Fine-tune the boundaries.
Delete, move, merge and split segments as needed.
Transcription 1
Options -> transcription mode
Select Speech
Transcription 2
Listen and type
Transcription 3
This phase:
1st copy of segmentation
Options -> Annotation mode
Tier -> Create annotations on dependent tiers
Speech -> JyutPing, Translation
More transcription
Use this or transcription view to enter text.
For JyutPing transcription use the website: http://hktv.cc/hp/cantonesetojyutping/
Pay attention to spaces.
Tokenizing
Tier -> Tokenize tiers: JyutPing -> Words
Adjust segments while pressing Alt
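Why spaces matter for tokenizing can be illustrated with a minimal Python sketch. It assumes that tokens are split on whitespace; the function names and the sample syllables are made up for illustration and are not part of ELAN or the project.

```python
def tokenize_jyutping(annotation: str) -> list[str]:
    """Split one JyutPing annotation into word tokens on whitespace."""
    return annotation.split()


def has_spacing_problem(annotation: str) -> bool:
    """Flag leading/trailing spaces and runs of more than one space."""
    return annotation != " ".join(annotation.split())


# Correct spacing: one token per word.
print(tokenize_jyutping("nei5 hou2 maa3"))    # ['nei5', 'hou2', 'maa3']
# A missing space silently merges two words into one token.
print(tokenize_jyutping("nei5hou2 maa3"))     # ['nei5hou2', 'maa3']
# A doubled space is worth cleaning up before tokenizing.
print(has_spacing_problem("nei5  hou2 maa3"))  # True
```

A missing space cannot be detected automatically in this sketch, so it is still worth rereading the JyutPing tier before tokenizing.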
2nd copy of segmentation
Tier -> Create annotations on dependent tiers
Words -> English Gloss, IPA, Language
The Language tier has a Controlled Vocabulary: E, C, P, ?
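The controlled vocabulary can be thought of as a fixed set of allowed values. The following Python sketch checks a list of Language values against the four codes listed above; the function name and the sample values are hypothetical, and the check is not an ELAN feature.

```python
# The four codes listed for the Language tier's controlled vocabulary.
ALLOWED_LANGUAGE_CODES = {"E", "C", "P", "?"}


def invalid_language_values(values: list[str]) -> list[tuple[int, str]]:
    """Return (position, value) for every value outside the vocabulary."""
    return [
        (index, value)
        for index, value in enumerate(values)
        if value not in ALLOWED_LANGUAGE_CODES
    ]


# Hypothetical values: a lowercase code and an empty annotation are flagged.
print(invalid_language_values(["E", "C", "e", "", "?"]))  # [(2, 'e'), (3, '')]
```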
Last 2 Tiers
Code switching types: in Annotation mode, select a section with your mouse and double-click, then choose an option.
Translation: use Annotation mode or Transcription mode; Ctrl+Enter or Configure Verbal Unit Tier.
More participants
Recreate the tier structure for each participant.
Tier -> Add new participant -> OK
Take a break and repeat the whole transcription process.
Save your work often.
Try using a mouse.
Finish
Upload the saved .eaf file to Teamwork and set the task to complete.
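Before uploading, it can help to check that no annotation was left empty. An .eaf file is XML, so a rough check is possible with the Python standard library. The sketch below is only an illustration: the sample document is a hand-written, heavily reduced stand-in for a real .eaf file, and the tier IDs and the function name are assumptions rather than the project's actual template.

```python
import xml.etree.ElementTree as ET

# Hand-written, reduced stand-in for an .eaf file (not a complete document).
SAMPLE_EAF = """<ANNOTATION_DOCUMENT>
  <TIER TIER_ID="Speech" LINGUISTIC_TYPE_REF="default">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1"
          TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>hello</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
  <TIER TIER_ID="JyutPing" PARENT_REF="Speech" LINGUISTIC_TYPE_REF="dep">
    <ANNOTATION>
      <REF_ANNOTATION ANNOTATION_ID="a2" ANNOTATION_REF="a1">
        <ANNOTATION_VALUE></ANNOTATION_VALUE>
      </REF_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>"""


def count_empty_annotations(root: ET.Element) -> dict[str, int]:
    """Map each tier ID to the number of annotations with no text."""
    counts = {}
    for tier in root.iter("TIER"):
        empty = 0
        for value in tier.iter("ANNOTATION_VALUE"):
            # Treat missing text and whitespace-only text as empty.
            if not (value.text or "").strip():
                empty += 1
        counts[tier.get("TIER_ID")] = empty
    return counts


root = ET.fromstring(SAMPLE_EAF)
print(count_empty_annotations(root))  # {'Speech': 0, 'JyutPing': 1}
```

For a saved file, the root element could be obtained with `ET.parse(path).getroot()`, where `path` points to your own .eaf file.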