Burst Spectrum as a Cue to Stop Consonant Voicing
![Page 1: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/1.jpg)
Burst Spectrum as a Cue to Stop Consonant Voicing
English Production and Perception Results
Eleanor Chodroff and Colin Wilson Johns Hopkins University
![Page 2: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/2.jpg)
Cues to stop consonant voicing
§ voice onset time
§ F1 onset
§ F1 transition
§ F0 contour
§ relative amplitude of aspiration
§ following vowel duration
§ spectral shape of the burst: lower frequencies for voiced stops
Summerfield and Haggard (1977), Lisker (1978), Repp (1979), Lisker (1986)
![Page 3: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/3.jpg)
Background: Production
Halle, Hughes, and Radley (1957)
“Since most of our lax [voiced] stops were pronounced with vocal-cord vibration, their spectra contained a strong low-frequency component… The lax stops also show a significant drop in level in the high frequencies. This high-frequency loss is a consequence of the lower pressure associated with the production of lax stops and is therefore a crucial cue for this class of stops.”
![Page 4: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/4.jpg)
Background: Production
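| Place | Contrast | Voiceless (Hz) | Voiced (Hz) | Δ (Hz) | Source |
|---|---|---|---|---|---|
| labials | /p/ vs. /b/ | 1910 | 1163 | 747 | v |
| coronals | /t/ vs. /d/ | 3600 | 3300 | 300 | + |
| coronals | /t/ vs. /d/ | 5649 | 5225 | 424 | v |
| coronals | /t/ vs. /d/ | 4900 | 4400 | 500 | w |
| dorsals | /k/ vs. /g/ | 1940 | 1910 | 30 | + |
| dorsals | /k/ vs. /g/ | 2261 | 2268 | -7 | v |

+ = Zue (1976) using peak frequency
v = Parikh and Loizou (2005) using peak frequency
w = Sundara (2005) using mean frequency (CoG)
see also Van Alphen and Smits (2004), Vicenik (2010), Kirkham (2011)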
![Page 5: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/5.jpg)
Production study: laboratory and TIMIT experiments
![Page 6: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/6.jpg)
Laboratory Production: Methods
methods adapted from Forrest et al. (1988), Jongman et al. (2000), Sundara (2005)
/p,t,k,b,d,g/ × /i,ɪ,e,ɛ,æ,ʌ,ɑ,ɔ,o,u/ × /t/
N=18 (4 male)
resampled at 16 kHz
pre-emphasized above 1000 Hz
high-pass filtered at 200 Hz
segmented from transient to voicing
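One common reading of "pre-emphasized above 1000 Hz" is a first-order difference filter whose coefficient is derived from the cutoff frequency, as in Praat's pre-emphasis filter. Whether the authors used exactly this filter is an assumption. The function name `pre_emphasize` and its parameters are illustrative only, and resampling and high-pass filtering are not shown.

```python
import numpy as np

def pre_emphasize(x, fs=16000, from_hz=1000.0):
    """First-order pre-emphasis: y[n] = x[n] - alpha * x[n-1].

    alpha = exp(-2*pi*from_hz/fs) follows the Praat-style definition;
    applying it to the slide's 1000 Hz / 16 kHz settings is an assumption.
    """
    x = np.asarray(x, dtype=float)
    alpha = np.exp(-2.0 * np.pi * from_hz / fs)
    y = x.copy()
    # The first sample is left unchanged because it has no predecessor.
    y[1:] = x[1:] - alpha * x[:-1]
    return y
```

With these settings alpha is about 0.675, so slowly varying (low-frequency) content is attenuated while rapidly alternating (high-frequency) content is boosted.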
![Page 7: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/7.jpg)
Laboratory Production: Measurement
analysis as in Forrest et al. (1988), Hanson and Stevens (2003), Flemming (2007)
§ Computed 64-point FFT for 7 consecutive 3ms Hamming windows, shifted by 1ms
§ 7 PSDs averaged to give a smoothed spectrum
§ Center of Gravity (CoG) calculated from smoothed spectrum: amplitude-weighted mean frequency
$\mathrm{CoG} = f_1\,p(1) + \dots + f_{32}\,p(32)$
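The measurement steps above can be sketched as follows. This is a minimal illustration, not the authors' script. It assumes a 16 kHz signal (so a 3 ms window is 48 samples, zero-padded to 64 points), that the 32 bins are the non-DC bins from 250 Hz to 8000 Hz, and that p(k) is the averaged power in bin k normalized to sum to 1. The function name `burst_cog` is hypothetical.

```python
import numpy as np

def burst_cog(x, fs=16000, n_fft=64, win_ms=3.0, hop_ms=1.0, n_win=7):
    """Center of gravity (Hz) of a burst from an averaged short-time spectrum."""
    x = np.asarray(x, dtype=float)
    win_len = int(round(fs * win_ms / 1000.0))   # 48 samples at 16 kHz
    hop = int(round(fs * hop_ms / 1000.0))       # 16 samples at 16 kHz
    needed = win_len + (n_win - 1) * hop
    if len(x) < needed:
        raise ValueError(f"need at least {needed} samples, got {len(x)}")

    window = np.hamming(win_len)
    psds = []
    for i in range(n_win):
        frame = x[i * hop : i * hop + win_len] * window
        spectrum = np.fft.rfft(frame, n=n_fft)   # zero-pads to 64 points
        psds.append(np.abs(spectrum) ** 2)
    smoothed = np.mean(psds, axis=0)             # average of the 7 PSDs

    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)   # 0, 250, ..., 8000 Hz
    f, power = freqs[1:], smoothed[1:]           # assumption: drop the DC bin
    p = power / power.sum()                      # assumption: normalized power weights
    return float(np.sum(f * p))
```

For example, `burst_cog(np.sin(2 * np.pi * 2000 * np.arange(160) / 16000))` should return a value close to 2000 Hz, since nearly all the energy falls in the bins around 2000 Hz.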
![Page 8: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/8.jpg)
Laboratory Production: Results
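[Figure: bar plot of mean CoG (Hz, axis 1000 to 5000) by place (lab, cor, dor) and voicing (vcl, vcd). Bar labels: labial vcl 3318, vcd 2833; coronal vcl 4967, vcd 4664; dorsal vcl 3450, vcd 3521. Two voicing contrasts are marked *.]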
![Page 9: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/9.jpg)
Laboratory Production: Analysis
Mixed-effects linear regression
Fixed effects sum-coded and maximal random effect structure
voice × place × gender
§ voice: βvoice = 122, p < .01
§ place: βlabial = -633, p < .001; βcoronal = 916, p < .001
§ gender: βgender = 86, p < .01
Significant interactions examined with post-hoc comparisons

| | labial | coronal | dorsal |
|---|---|---|---|
| male | βvoice = 224, p < .001 | βvoice = 224, p < .05 | n.s. |
| female | βvoice = 253, p < .001 | n.s. | n.s. |

Crucially, the pattern of significance remains the same when tokens with glottal pulses near the release are excluded.
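To illustrate how sum-coded coefficients relate to cell means, the sketch below fits an ordinary least-squares model to the six mean CoG values shown on the Laboratory Production: Results slide. It is deliberately simplified and is not the authors' model: it has no random effects, no gender term, and no interaction, and the ±1 coding scheme is an assumption.

```python
import numpy as np

# Mean CoG (Hz) per place and voicing, taken from the laboratory results slide.
CELL_MEANS = {
    ("labial", "vcl"): 3318, ("labial", "vcd"): 2833,
    ("coronal", "vcl"): 4967, ("coronal", "vcd"): 4664,
    ("dorsal", "vcl"): 3450, ("dorsal", "vcd"): 3521,
}

def sum_coded_fit(cells):
    """OLS on cell means with sum (deviation) coding; illustration only."""
    labial_code = {"labial": 1.0, "coronal": 0.0, "dorsal": -1.0}
    coronal_code = {"labial": 0.0, "coronal": 1.0, "dorsal": -1.0}
    rows, y = [], []
    for (place, voice), mean in cells.items():
        v = 1.0 if voice == "vcl" else -1.0   # assumed coding: voiceless = +1
        rows.append([1.0, v, labial_code[place], coronal_code[place]])
        y.append(float(mean))
    coef, *_ = np.linalg.lstsq(np.array(rows), np.array(y), rcond=None)
    return dict(zip(["intercept", "voice", "labial", "coronal"], coef))

betas = sum_coded_fit(CELL_MEANS)
# Under +1/-1 coding, the voice coefficient is half the average
# voiceless-minus-voiced difference across places: 119.5 Hz here.
print({name: round(value, 1) for name, value in betas.items()})
```

Under this coding the intercept is the grand mean and each coefficient is a deviation from it, so a voice coefficient of about 120 Hz corresponds to an average voiceless-minus-voiced difference of about 240 Hz. The reported mixed-effects estimates differ somewhat because that model also includes gender, interactions, and random effects.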
![Page 10: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/10.jpg)
TIMIT: Methods
630 different AE speakers
Word-initial, pre-vocalic /p, t, k, b, d, g/
Words with high token freq. removed (too, to, do, carry, dark)

| Phoneme | Tokens | Phoneme | Tokens |
|---|---|---|---|
| /p/ | 661 | /b/ | 668 |
| /t/ | 579 | /d/ | 547 |
| /k/ | 1179 | /g/ | 415 |
Byrd (1993), Keating et al. (1993)
![Page 11: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/11.jpg)
TIMIT: Results
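[Figure: bar plot of mean CoG (Hz, axis 1000 to 5000) by place (lab, cor, dor) and voicing (vcl, vcd). Bar labels: labial vcl 3704, vcd 2672; coronal vcl 4550, vcd 3743; dorsal vcl 3155, vcd 2941. Significance markers, in order: *, *, (*).]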
![Page 12: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/12.jpg)
TIMIT: Analysis
Mixed-effects linear regression
Fixed effects sum-coded and maximal random effect structure
voice × place × gender
§ voice: βvoice = 320, p < .001
§ place: βlabial = -314, p < .001; βcoronal = 762, p < .001
§ gender: βgender = 205, p < .001
Significant interactions examined with post-hoc comparisons

| | labial | coronal | dorsal |
|---|---|---|---|
| male | βvoice = 555, p < .001 | βvoice = 460, p < .001 | (βvoice = 112, p < .001) |
| female | βvoice = 396, p < .001 | βvoice = 280, p < .001 | (βvoice = 113, p < .05) |

Crucially, the pattern of significance remains the same, except for the dorsals, when tokens with glottal pulses near the release are excluded.
![Page 13: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/13.jpg)
Perception study: laboratory and Mechanical Turk experiments
![Page 14: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/14.jpg)
Background: Perception
Trading relation between burst and VOT
Keating (1979) Nittrouer (1999)
Caldwell and Nittrouer (2013)
/t/-burst VOT continuum /d/-burst VOT continuum
![Page 15: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/15.jpg)
Laboratory Perception: Stimuli
Labial Continua /bæt/-/pæt/
VOT (ms): 10, 17, 24, 31, 38, 45, 52
p burst: CoG: 3494 Hz, Dur: 10 ms
b burst: CoG: 1513 Hz, Dur: 10 ms
Keating (1979), Ganong (1980), Andruski et al. (1994)
![Page 16: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/16.jpg)
Laboratory Perception: Stimuli
Coronal Continua /dat/-/tat/
VOT (ms): 10, 17, 24, 31, 38, 45, 52
t burst: CoG: 5424 Hz, Dur: 10 ms
d burst: CoG: 3601 Hz, Dur: 10 ms
Keating (1979), Ganong (1980), Andruski et al. (1994)
![Page 17: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/17.jpg)
Laboratory Perception: Methods and analysis
Two-alternative forced choice identification
Differences verified with logistic mixed-effects analysis with maximal random effect structures
Goodness rating
Differences verified with linear mixed-effects analysis with maximal random effect structures
Order of labial and coronal conditions counterbalanced
Within condition: 8 blocks of 14 stimuli in random order
Massaro and Cohen (1983), Hallé and Best (2007)
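The block structure can be sketched as below. This assumes that the 14 stimuli in a condition are the 2 bursts crossed with the 7 VOT steps listed on the stimulus slides (10 to 52 ms); the slides do not state this explicitly. The names `make_blocks`, `BURSTS`, and `VOT_STEPS_MS` are hypothetical.

```python
import itertools
import random

VOT_STEPS_MS = [10, 17, 24, 31, 38, 45, 52]   # from the stimulus slides
BURSTS = ["voiceless", "voiced"]              # e.g. p burst vs. b burst

def make_blocks(n_blocks=8, seed=0):
    """Return n_blocks lists, each a fresh random order of all 14 stimuli."""
    rng = random.Random(seed)
    stimuli = list(itertools.product(BURSTS, VOT_STEPS_MS))  # assumed 2 x 7 design
    blocks = []
    for _ in range(n_blocks):
        block = stimuli[:]          # copy so every block contains every stimulus
        rng.shuffle(block)
        blocks.append(block)
    return blocks

blocks = make_blocks()
n_trials = sum(len(block) for block in blocks)  # 8 blocks x 14 stimuli = 112
```

Under this assumption each listener hears every burst-by-VOT combination 8 times per condition.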
![Page 18: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/18.jpg)
Laboratory Perception: Results
[Figure: labials. Proportion /p/ response (0.00 to 1.00) by VOT (ms), plotted separately for the p burst and the b burst.]
βburst = .54, p < .001, N=16
![Page 19: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/19.jpg)
Laboratory Perception: Results
[Figure: labials, N=16. Standardized rating (−4 to 3) by VOT (ms; 10, 17, 24, 31, 38, 45, 52), panels B and P, plotted separately for the p burst and the b burst.]
![Page 20: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/20.jpg)
Laboratory Perception: Results
[Figure: coronals. Proportion /t/ response (0.00 to 1.00) by VOT (ms), plotted separately for the t burst and the d burst.]
βburst = .85, p < .001, N=16
![Page 21: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/21.jpg)
Laboratory Perception: Results
[Figure: coronals, N=16. Standardized rating (−4 to 3) by VOT (ms; 10, 17, 24, 31, 38, 45, 52), panels D and T, plotted separately for the t burst and the d burst.]
![Page 22: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/22.jpg)
Mechanical Turk: Methods
Crowdsourcing service increasingly used in psycholinguistics and phonetic studies
Greater diversity in participant population and listening conditions (noise!)
Labials: 12 headphones, 3 external speakers, 1 internal speakers
Coronals: 9 headphones, 4 external speakers, 3 internal speakers
Kleinschmidt and Jaeger (2012), Eskenazi et al. (2013)
![Page 23: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/23.jpg)
Mechanical Turk: Results
[Figure: labials. Proportion /p/ response (0.00 to 1.00) by VOT (ms), plotted separately for the p burst and the b burst.]
βburst = .46, p < .001, N=16
![Page 24: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/24.jpg)
Mechanical Turk: Results
[Figure: coronals. Proportion /t/ response (0.00 to 1.00) by VOT (ms), plotted separately for the t burst and the d burst.]
βburst = .60, p < .001, N=16
![Page 25: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/25.jpg)
Summary and Implications
Spectral shape of the burst is a cue to anterior stop consonant voicing
§ Higher CoG for voiceless labials and coronals
§ Spectral shape influences voicing identification
![Page 26: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/26.jpg)
Summary and Implications
§ Place and voice perception are interdependent
§ Cues to phonetic distinctions at burst landmark
§ Early cue to voicing and incremental perception
Repp (1978), Allopenna et al. (1998), Benkí (2001), Stevens (2002), McMurray et al. (2008a)
![Page 27: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/27.jpg)
Thank you!
![Page 28: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/28.jpg)
Production: Results by Gender
[Figure: two panels, laboratory (top) and TIMIT (bottom). CoG (Hz, 0 to 6000) by place (lab, cor, dor) and gender (female, male).]
![Page 29: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/29.jpg)
Production: Results by Vowel
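| Vowel | Labial /p/ | Labial /b/ | Coronal /t/ | Coronal /d/ | Dorsal /k/ | Dorsal /g/ |
|---|---|---|---|---|---|---|
| /i/ | 3582 | 3473 | 5278 | 5024 | 4031 | 4108 |
| /ɪ/ | 3452 | 3107 | -- | 4999 | 3893 | 3882 |
| /e/ | 3447 | 2822 | 5253 | 4849 | 3868 | 3884 |
| /ɛ/ | 3196 | 2587 | 5192 | 4777 | 3796 | 3840 |
| /æ/ | 3142 | 2507 | 5161 | 4627 | 3725 | 3816 |
| /ʌ/ | 3329 | 2694 | 5079 | 4682 | 3190 | 3291 |
| /u/ | 3597 | 3247 | 4734 | 4534 | 3270 | 3172 |
| /o/ | 3250 | 2830 | 4635 | 4400 | 3115 | 3170 |
| /ɔ/ | 3150 | 2508 | 4697 | 4244 | 2713 | 2868 |
| /ɑ/ | 3123 | 2575 | 4712 | 4460 | 2953 | 3254 |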
![Page 30: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/30.jpg)
Mechanical Turk: Results
[Figure: labials, N=16. Standardized rating (−4 to 3) by VOT (ms; 10, 17, 24, 31, 38, 45, 52), panels B and P, plotted separately for the p burst and the b burst.]
![Page 31: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/31.jpg)
Mechanical Turk: Results
[Figure: coronals, N=16. Standardized rating (−4 to 3) by VOT (ms; 10, 17, 24, 31, 38, 45, 52), panels D and T, plotted separately for the t burst and the d burst.]
![Page 32: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/32.jpg)
TIMIT: Spectral Shape
[Figure: TIMIT. PSD (dB/Hz, 10 to 20) by frequency bin (Hz, 0 to 8000), panels lab, cor, dor, plotted separately by voice (vcl, vcd).]
![Page 33: Burst Spectrum as a Cue to Stop Consonant Voicing](https://reader033.fdocuments.in/reader033/viewer/2022060520/6296368ad913630a3e048df3/html5/thumbnails/33.jpg)
Background: Production
| Study | Language | Measure | /p/ | /b/ | /t/ | /d/ | /k/ | /g/ |
|---|---|---|---|---|---|---|---|---|
| Zue 1976 | Am. English | Peak | -- | -- | 3600 | 3300 | 1940 | 1910 |
| Parikh and Loizou 2005 | Am. English | Peak | 1910 | 1163 | 5649 | 5225 | 2261 | 2268 |
| Sundara 2005 | Ca. English | CoG | -- | -- | 4900 | 4400 | -- | -- |
| Kirkham 2011 | Br. English | CoG | -- | -- | 5220 | 4888 | -- | -- |
| Van Alphen and Smits 2004 | Dutch | CoG | 1160 | 830 | 3540 | 2140 | -- | -- |
| Sundara 2005 | Ca. French | CoG | -- | -- | 3800 | 3000 | -- | -- |
| Vicenik 2010 | Georgian | CoG | 4000 | 3200 | 5300 | 4600 | 3100 | 3100 |

CoG = Center of Gravity (mean frequency)