X-ray study of articulation and formant frequencies in two ... · PDF fileDept. for Speech,...
Transcript of X-ray study of articulation and formant frequencies in two ... · PDF fileDept. for Speech,...
Dept. for Speech, Music and Hearing
Quarterly Progress andStatus Report
X-ray study of articulationand formant frequencies in
two female singersJohansson, C. and Sundberg, J. and
Wilbrand, H.
journal: STL-QPSRvolume: 23number: 4year: 1982pages: 117-134
http://www.speech.kth.se/qpsr
However, the questions of (1) the details of articulation and (2) indi-
vidual differences among singers w i t h respect to articulation were left
open. The aim of the present investigation was to supplement sorne arti-
culatory data from soprano and an al to singer, and also to attempt to
make an acoustical evaluation of these data.
Experiment
The subjects were two well known professional Swedish singers, one
alto who sings a t the Stockholm Opera and one soprano wlx, earns her life
as a concert singer.
The entire vocal tract including the contours of the lips, the hard
and soft palate, the back pharynx wall, the tongue, and the glottis were
reproduced by conventional radiography. The technique and equipment
thereby used were identical to that described i n a parallel investiga-
tion of "kolning", (see Johnson & al, this issue of SYL-QPSK).
Both singers sang the vowels [u , a , i] a t three fundamental
frequencies, about 150, 300, and 600 H z for the alto, and about 230,
470, and 950 H z for the soprano. Also, radiographs were taken when the
singers sustained the same three vowels i n a normal speech mode. Final-
ly, one picture was taken during rest w i t h the subject holding a ruler
i n front of her lips, so as to obtain the scale factor.
From these radiographs area functions were derived, and the fannant
frequencies associated with these area functions were determined by
means of an acoustical model of the vocal tract. The procedure thereby
used w i l l be accounted for later.
Results . A: Physiological data
Several physiological data are available i n a lateral rtidiograph of
the vocal tract. In the present study only such articulatory variables
have been examined that are known to be acoustically relevant.
The vertical position of the larynx was measured by means of the
template shown i n Fig. 1, where 0 represents tile midpoint of the upLxr
glottis contour during rest. Adapting this template on the other radio-
graphs caused no problems, as bob\ singers kept tlleir heads i n the same
way when they sang the various notes as during rest.
Fig. 2 shows the data obtained. There is a clear difference between
the subjects. For the soprano's lowest pitches, the larynx position i s
lower than what i s observed in the spoken vowels. FIowever, she raises
I I I I I I 1 I - SOPRANO [a]
- - Ci Y
-
- cu'l -
- -
- - 2
I I I I I I I I I I .
200 400 600 800 1000
FUNDAMENTAL FREQUENCY (Hz)
200 400 600 800 1000
FUNDAMENTAL FREQUENCY ( Hz)
- I I I I I I I I -
Fig. 2. TIE vertical position of the larynx as function of fundamental frequency i n the two subjects. Vowel symbols within brackets pertain t o spoken vuwels, other symbols refer to sung -1s. I
ALTO
- Ci 1
-
- - la3 - - cu'l - -
- m
- z I I I I I I I I I I ,
Fig. 4. Schematical i l lus t ra t ion of the measurerrent of the ver t ica l dis- tance between the upper and low- er l i p , 1, and of the retraction of the G u t h comers, - 1.
. I I I I I SOPRANO -
i a - U
- a - -
a - - - I
I I - -
- Cil - -
lul 3
I I I I I
- JAW OPENING (mm) I I I I I ALTO E 40 -
JAW OPENING ( m m ) Fig. 5. The distance between the contours of the upper and lower l i p
a s function of the jaw opening. Vowel symbols within brackers denote spoken vowels, other syinbols r e fe r t o sung vowels.
STL-QPSR 4/1982
act iv i ty than the a l to ; the alto's data points are a l l close t o one
single curve, while the mouth corners seem to be actively retracted i n
the soprano's r i l l again except for the highest pitch. In the case of
the alto and in the case of the soprano's lower furdatnental frequencies
the three vowels show different values of mouth corner retraction, while
these differences almost disappear in the soprano's highest fundamental
frequency . The tongue contours related t o the lower jaw are shown in Fig. 7.
The tongue contours are similar t o those of the spoken vowels in the
vowels sung a t the lower fundamental frequencies. Also, bt31 in the al to
and in the soprano, the trend i l lus t r a t ed in Fig. 7a can be observed
that the tongue shapes are slightly neutralized, as fundamental f r q e n -
cy is increased. A t tlle soprano's highest pitch the tongue contours for
a l l three vowels are similar to the tongue contour of the spken[a], as
can be seen in Fig. 7b. Thus, while the a l t o uses a t l eas t two d i s t inc t
tongue shapes for the three vowels a t her highest fundamentdl frequency,
the soprano uses practically identical tongue shapes for a l l vowels when
she sings the highest note.
B: Acoustical data
Area functions were derived from the radiographs using the pro-
cedure described previously (Lindblm & Sundbercj, 1971) and schematical-
ly illustrated in Fig. 8. First, p i n t s along the mid-line i n tlie mid-
sag i t t a l plane of the vocal t r a c t were determined. These points were
derived as mid-points between the two intersections of the wall curltours
withthelinesofgridofa semi-polar coordinate s y s t e ~ . This system was anchored on contours of the hard palate and the cervical vertebrates,
and was the same for a l l pictures. Then, these mid-pints were smoothly
joined and the resulting curve was regarded as a vocal tract laid-line.
The distance between the vocal tract wall contours nonnal to this mid-
line was determined a t .5 cm interval-s along the midline. The r;.sulting
mid-sagittal distances were then converted t o cross-sectional area by
means of two distance-to-area plots , one for the pharynx, and one for
the mouth cavity. Lacking better data, the first-mentioned curve was the
same as that used in a previous investigation (Sundberg, 1969; the
possibilites of in~proving this procedure by means of recent development
in radiology w i l l be explored in a future investigation). The curve
to the mouth cavity was derived from direct measurements on a
plaster cast of tile subjects' hard palates. Thus, for each rddicxjraph an
FIRST FORMANT FREQUENCY ( Hz) FIRST FORMANT FREQUENCY (Hz)
STL-QPSR 4/1982
of the f i r s t formant would imply tha t the fundamental was higher than
the f i r s t formant.
With r i s i n g fundamental f reyuency , t h e second formant frequency
r i ses i n the back vowels [ a ] and [ u ] and f a l l s i n the. front vowel [i] i n
both s ingers . In t h e case of t h e soprano, t h i s leacls t o the s i t u a t i o n
tha t the second fornant frequency is pract ical ly identical for a l l three
vowels a t the top pitch.
The frequency of t h e t h i r d formant seems r a t h e r unaffected by
pitch. The fourth formant frequency data seem ur~systematic, even though
they look s t r i k i n g l y s i m i l a r t o t h e larynx he igh t d a t a i n t h e case of
the a l t o .
D i s c u s s i o n and c o n s l u s i o n s
With respec t t o the f i r s t and second formant frequencies of the
various vowels sung a t the different fundamental frequencies, our data
are i n good agreement with the resul t s from the previous investigation
of a soprano s inger (Sundberg, 1975). This i s i n t e r e s t i n g , given both
the uncertainty of and the substantial method differences *tween the
two investigations. E.loreover, the subject used i n the previous invest-
igation was not identical with any of the subjects in the present study.
w a r d i n g the th i rd and fourth formants, the resul ts do not agree w i t h
those of t h e previous study. However, t h e d a t a concerning these for-
mants cannot be very accurate i n the present investigation. The reason
for t h i s is tha t w i t h r is ing frequency the formants become increasingly
sensitive t o de ta i l s of the area function. For t h i s reason these data
do not m e r i t any interpretation. W e conclude tha t our data on the artic-
ulation and on the t w o lowest formant frequencies are re l iable and that
they possess a certain degree of generality.
The ar t iculatory maneuvres underlying t h e tuning of formant £re-
quencies involve jaw opening, larynx height, and tongue st lap . The rnost
important one would be the jaw opening, a s both subjects were found to
change this articulatory parameter w i t h pitch; also, s imilar findings
have been reported i n previous s t u d i e s of female s ingers (OndrZckovb,
1969; Sundberg, 1975). l i e po in t of tuning the f i r s t formant t o the
v i c i n i t y of the fundamental i n high pi tched singing is a gain i n SPL
which nay be q u i t e considerable (cf. , Sundberg, 19132). In high pi tched
singing it impl ies t h e need of r a i s i n g t h e f i r s t formant t o very high
values.
The f inding t h a t t h e soprano r a i s e s he r larynx wit11 increas ing
relevance in this connection. Also, typical differences in the higher
formants are likely, which have not been revealed in the present invest-
igation (see Dmitriev & Kiselev, 1979).
References
Askenfelt, A. & Sundberg, J. (1981): "Larynx height and voice source. A relationship?", STL-QPSR 2-3/1981, 23-36.
Dmitriev, L. & Kiselev, .A. (1979) : "Relationship between the formant structure of different types of singing voices and the dimensions of the supraglottal cavities", Folia Phoniatrica - 3 1, 238-241.
Fant, G. (1960:) Acoustic Theory of Speech Production, Mouton, ra raven- hage . Fransson, F. & Jansson, E. (1973) : "The STL-Ionophone : Transducer properties and construction", J. Acoust. Soc. Am. - 58, 910-915.
Lindblom, B. & Sundberg, J. (197 1) : "Acoustical consequences of lip, tongue, jaw, and larynx movement, J. Acoust. Soc. Am. - 50, 1166-1179.
Ondrfiovk J. (1969): "Some remarks on the analysis of sung vowels, pp. 407-418 in Study of Souncls XIV, Phon. Soc. of Japan.
Shipp T. & Izdebski, K. (1975): "Vocal frequency and vertical larynx positioning by nonsingers and singers", J. Acoust. Soc. Am. - 58, 1104- 1106.
Sundberg, J. (1975): "Formant technique in a professional female sing- er", Acustica 32, 89-96. - Sundberg, J. (1982): "Perception of singing", pp. 59-98 in Psychology of Music, (D. Deutsch, ed.), Academic Press, New York.
Sundberg, J. & Gauffin, J. (1982) : "Amplitude of the voice source fundamental and the intelligibility of super pitch vowels", pp. 223-228 in The Representation of Speech in the Peripheral Auditory System, (R. Carlson & B. Granstrom, eds. ) , Elsevier Biomedical Press, Amsterdam.