Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier...

19
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Electroglottograph and contact microphone for measuring vocal pitch Askenfelt, A. and Gauffin, J. and Kitzing, P. and Sundberg, J. journal: STL-QPSR volume: 18 number: 4 year: 1977 pages: 013-021 http://www.speech.kth.se/qpsr

Transcript of Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier...

Page 1: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

Dept. for Speech, Music and Hearing

Quarterly Progress andStatus Report

Electroglottograph andcontact microphone for

measuring vocal pitchAskenfelt, A. and Gauffin, J. and Kitzing, P.

and Sundberg, J.

journal: STL-QPSRvolume: 18number: 4year: 1977pages: 013-021

http://www.speech.kth.se/qpsr

Page 2: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the
Page 3: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the
Page 4: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

STL-QPSR 4/1977

when the ca r r ie r frequency i s comparatively low (in the region below

1 MHZ) and the current from the electroglottograph i s too weak.

An accelerometer microphone, henceforth called contact microphone,

which is fastened to the neck below the larynx records a signal related

to the sound in the trachea. The wave-form i s reasonably independent

of the articulation because of the high glottal impedance. Moreover,

the trachea resonances a r e constant in frequency and more damped

than those of the vocal tract. As a sequel of this, the wa've-form of a

contact microphone signal i s suitable for fundamental frequency meas-

urements and has therefore often been used for this purpose.

As demonstrated above there a r e two different methods for obtaining

a signal suited for measurement of the fundamental frequency in speech.

The purpose of the present investigation was to evaluate the reliability

of such measurements based on signals from an electroglottograph

and from a contact microphone. .

Equipment

The elec troglottograph used was constructed by Rundkvist & Kitzing.

It has a conventional design (Le Cluse 1977) with a rather low car r ie r

frequency of 200 kHz and an electrode voltage of . 5 V. It contains a

bistable circuit which i s triggered by the signal from the electrodes.

The resulting square-wave signal which has a constant amplitude and

the same fundamental frequency a s the electroglottogram i s available

a t one of the outputs of the device.

The contact microphone (Special Instrument type BC-2) i s a simple

accelerometer containing a piezo-electric cheramic disc a s the pick-up

unit. The disc is enclosed in a metal container of 15 mm in diameter

and 5 mm in thickness. It weighs about 20 grams. The contact micro-

phone i s run with an amplifier with high input impedance (5 M ;;), which

is built into the connection wire in order to minimize hum.

The fundamental frequency detector (Fonema type 00063) i s of the

dual peak sensing type (cf. Larsson 1977). The positive and negative

peaks in the input signal wave-form trigger a bistable circuit, so that

a sq&re-wave output i s obtained having the fundamental frequency of

the vibrations sensed by the contact microphone.

Page 5: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

STL-QPSR 4/1977

Experiment

Simultaneous recordings were made with the two devices described.

The electroglottograph electrodes were attached to a fork held against

the speaker' s thyroid cartilage. The contact microphone was taped

to the neck about 2 cm below the cricoid cartilage. The square-wave

signal from the electroglottograph and the signal from the contact micro-

phone were recorded on separate channels on tape. The signal from

an ordinary microphone located 20 cm in front of the speaker' s mouth

was recorded on a third channel on the same tape, cf. Fig. 11-A- I .

Analysis

A computer program was developed for the analysis. The input

signal is the square-wave signal from the fundamental frequency de-

tector. A s the square-wave signal from the electroglottograph was

distorted by the tape recorder, i t had to be restored by means of a flip-

flop circuit, before i t was sent into the computer. The rectified and

smoothed amplitude of the contact microphone signal was simultaneous-

ly fed to the computer on a separate line. For the purpose of inspecting

the results of the measurement, the program plots the fundamental f re-

quency a s a function of time together with the amplitude signal men-

tioned, cf. Fig. 11-A-2.

When no signal o r noise i s fed into a pitch detector it generates a

more or l ess random output. Therefore, it is essential to exclude the

parts of the signal corresponding to unvoiced sounds and pauses. As

the amplitude of the contact microphone signal drops substantially when

voicing stops, this amplitude was used a s a gating criterion. The ope-

rator sets an amplitude threshold, and all parts of the fundamental

frequency curve, which a r e associated with intensities below this thres-

hold, a r e eliminated from all subsequent calculations performed by the

computer.

For the fundamental frequency curve a non-linear smoothing algo-

rithm i s available (Rabiner et a1 1975). The algorithm uses running

medians for the smoothing process. The resulting fundamental fre- - quency value, fn, of an N-point median smoother, where N i s an odd

integer number, represents the centermost value of a sequence of the

N adjacent samples:

Page 6: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

Fig. I1 -A- I. Schematic dia~ram of the data- recording procedure.

Page 7: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

- I I NO AMPLITUDE THRES.HOLD I

AMPLITUDE THRESHOLD= -20 dB

1

Fig. 11-A-2. Fundamental frequency and amplitude curves plotted by the computer. The dotted line in the lowest curve represents the amplitude threshold set by the operator.

Page 8: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

STL-QPSR 4/1977

Running medians have the property of preserving sharp discontinuities

with durations exceeding a cr i t ical value, which depends on the size

N of the median smoother. Discontinuities with shorter durations a r e

eliminated. This smoothing i s useful for eliminating some types of

e r r o r s in the fundamental frequency curve, cf. Fig. 11-A-3. W h e n not

otherwise stated, i t was not used in the data presented below.

The fundamental frequency values a r e assor ted in frequency c l a s ses

1 HA wide. The resulting fundamental frequency histogram is plotted

on the computer screen. By means of a cu r so r the operator defines

l im 1 and l im 2 being the lower and upper l imits of the relevant f r c - - quency range. Histogram contours may be smoothed by an algorithm.

If so, each point Fn along the histogram contour is rrlodified according

to:

This smoothing can be repeated severa l t imes .

The computer program a l so calculates some stat is t ics and presents

the resu l t s in a new plot, cf. Fig. 11-A-4. The statist ical values are :

(1) the ar i thmetic mean of the distribution (MFo),

(2) the mode of the distribution (F,,,),

(3) the ratio between the par t s of the distribution lying left and right of the mode (BIAS),

(4) the total sample t ime defined a s the sum of the period t imes considered in the calculations (TIME).

Also, a tr iangular approximation of the distribution is presented.

I t s top i s identical with the mode, and the a r e a s on both sides equal the

corresponding a r e a s under the his togram contour.

Fundamental frequency his tograms have been found useful for phoni-

a t r ica l purposes (Kitzing et a1 1975; Fr i tze l l et a1 1977). Therefore it

is relevant to study the comparability of such his tograms when derived

f rom electroglottograph and contact microphone recordings made simul-

taneously. The ent i re mater ia l of 13 voices was thus compared using

Page 9: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

AMPLITUDE (dB) FUNDAMENTAL FREQUENCY (Hz)

Page 10: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

Fmax = 162 HZ TIME .= 37 .SEC BIQS = 1.16

FREQUENCY (Hz)

Fig. 11-A-4. Fundamental frequency histogram plotted by the computer. The vertical dotted lines are set by the operator and limit the frequency range used in the calculations.

Page 11: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

STL-QPSR 4/1977

the computer 'program described above. Thereby, special care was

taken so a s to base the histograms on exactly the same sample of

speech sounds. This was realized by using the amplitude of the con-

tact microphone signal a s the gating criterion in both cases, and the

lim 1 and lim 2 were set at identical frequencies in both analyses.

All histograms were smoothed twice, cf. Eq. ( 2 ) .

Results

Table 11-A-I shows the results obtained f rom these measurements.

As regards the average of the distribution there is an almost perfect

agreement between the methods. The average difference between the I

values based on contact microphone and electroglottograph recordings

is not greater than 1.0 Hz (SD = .82 Hz). Eventhough thi s value i s very

small, most differences a r e positive indicating that the mean obtained

from the contact microphone i s slightly higher.

As regards the modes of the histograms greater differences a r e

observed a s seen in the same table. This occurs especially in histo-

grams which exhibit several high peaks of almost the same amplitude.

Then a small difference in the measurements may easily make another

peak the highest one. Even i f the average difference i s as small a s

- - 7 Hz (SD = 5.96 Hz) the value of this measure can be questioned.

Probably a more smoothed histogram would yield a more useful value 1

of the distribution.

The sample time shows very small differences. This i s another

indication that the differences in the data obtained by the two methods

a r e very small. The BIAS measure, on the other hand, suffers from

the discrepancies in the mode values because the BIAS i s strongly in-

fluenced by a change of the mode. Still, the average discrepancy i s

a s small a s . 16 (SD = .70). If future research will support the assump-

tion that the skewness of the distribution i s a relevant measure, i t

seems that a more sophisticated measure of the skewness should be

used.

Di s cus sion

As mentioned above there a r e ,some differences in the signals re-

corded by a contact microphone and an electroglottograph. As mentioned

Page 12: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the
Page 13: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

STL-QPSR 4/1977 19.

the contact microphone senses the vibrations i s the wall of the neck.

These vibrations reflect variations of the subglottic pressure accom--

plished by the vibrating vocal folds acting a s a sound generator. Such

pressure variations will be generated even in cases of incomplete glot-

tal closure, such a s in leaky phonation (Kitzing & Lofqvist 1975). On

the other hand, the amplitude of the electroglottograph signal seems

to be strongly dependent on whether or not the vocal folds make con-

tact, cf. Fig. 11-A-5a, b, and c. This dissimilarity will .cause differ-

ences in fundamental frequency data obtained from the two devices.

Another difference i s that the electroglottograph cannot be used on

all subjects, a s mentioned previously. 4- thick layer of subcutaneous

tissue in the neck may decrease the signal amplitude substantially.

This is not true in the case of the contact microphone. Three subjects,

which could not be measured with the electroglottograph in our experi-

ment, produced seemingly reliable fundamental frequency data when

measured by means of the contact microphone. (It should be stressed,

however, that the ratio i0:3 i s probably& representative for the

percentage of voices that cannot be measured with an electroglottograph.

In selecting the subjects we attempted to choose voices which could

be expected to cause problems for the devices tested. )

A third difference i s that the amplitude of the electroglottograph

signal reaches i t s full value sooner than that of the contact micro-

phone in voicing onsets, cf. Fig. 11-A- 5d. The fundamental frequency

meter used i s somewhat sensitive to amplitude variations and this can

be a cause to some e r r o r s in the measurements. By using a more

sophisticated fundamental frequency detector these e r r o r s may be

avoided.

A fourth difference i s that the electroglottograph wave-form i s al-

most independent of the articulation. At least i t was generally very

easy to process.with the fundamental frequency detector. The contact

microphone wave-form slightly depends on the articulation because of

. the finite glottal impedance. In other words, the subglottic sound i s

influenced by the supraglottic sound. The resulting changes in wave-

form often cause e r ro r s , cf. Fig. 11-A-6. Most such e r r o r s can be

eliminated with the running median smoother, a s can be seen in the

same figure.

Page 14: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

Leaky Norma t

. . !

I

. ,

CM ' . Normal . Falsetto ;: ' .. , . : , -. . . : ! : . . . .... . . . . I : . : ! ! ' . " ' : ! .: . . . . . . . . . 1 ! ! t S . I * l ! ,.&., a , , , , , . ? * ! : , > ! a : , ? . ! , . * ,!, . , ~ . , , . , . ! 7 , 4 l : . : . - : S , . , - t . . ; ; : j ! . . . .3 . . ...........,.. . . . ;'!,',.;;.;;[;;., ; , / : : ; 'a !

. . . . . . . . . . . , ... 3 . . . . :... . . . . . . . : . . . _._ . . j

....... ,.. , , ,.; ;.;,[ ; , ; , i : ; J : ' , .. I . . . ..,: ... ;,, . . . . . : . . . . : . . . . _ . .:. .:., ... . . . . . a ;, , I . !;',:, 1 * , , i i 3 : ! ; ! : / / ; ' : , i ; i , t i ; $ ; ; i : ? ; ; ; ; ; . ' . : . . . . . . . . . . . -

~ . ~ : ~ ~ ~ ~ : ~ ~ ~ ; i ~ l : : : i : ; . , i ; i ; ; i ; , : ~ ~ i , , ~ i ! ~ ; j ; i ; l , , ~ ..::.,, : '. J

Fig. 11-A-5. Comparisons between contact microphone signals (CM) and electroglottograph signals (ElG) for different types of phonation.

1

Page 15: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the
Page 16: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

CONTACT MICROPHONE

h

N x w

> U z w CONTACT MICROPHONE, SMOOTHED 3 200- I W I I I CX: I

I I LL 100- 7 I I A a k- I

I z so- w I Z

ELECTROGLOTTOGRAPH I I I

.J 6

11 I 100,

so, I

TIME (sec)

Fig. 11-A- 6b. Fundamental frequency curves derived from the contact microphone signal and the electroglottograph together with the contact microphone signal amplitude.

Page 17: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

STL-QPSR 4/1977

A fifth difference between the two devices i s 'their behavior in

cases of vocal fry. Here, the vocal folds do not close regularly, so

that the electroglottograph sometimes gives lower frequency readings

than the contact microphone, cf. Fig. 11-A-7. However, for very

low-frequency vocal fry phonation the electroglottograph may not al-

ways register correctly the instances of glottal closure. This i s be-

cause the electroglottograph i s also sensitive to larynx movements

and changes in the constriction of the larynx. These movements super-

. impose low-frequency signals on the glottogram and even i f a HP-iilter

i s used e r r o r s may occur in triggering the circuit delivering the

square -wave output.

Summari zing one may say that there a r e several reasons for ex-

pecting slightly differing results in fundamental frequency data obtained

from an electroglottograph and from a contact microphone: the electro-

glottograph normally depicts the frequency of glottal closures, while

a contact microphone also mi r ro r s the sound generated by the vocal

fold vibrations. Moreover, the variability in the contact microphone

amplitude and wave-form cause spurious e r r o r s in the fundamental

frequency measurement. Therefore, for the purpose of the examining

fundamental frequency events in speech, an electroglottograph should

be tr ied in the f i rs t place.

Conclusions

Both equipments examined in the present investigation have strong

and weak points. As for the electroglottograph it rarely fails when it

functions at all: those voices that can be measured a r e generally

measured accurately. It records the glottal closures rather than the

sound generated by the vibrating vocal folds. The contact microphone,

on the other hand, can probably be used on all subjects, even in cases

when the electroglottograph fails. The output signal should be LP-

filtered in order to provide a signal suited for fundamental frequency

measurements. The signal mi r ro r s the sound generated by the vocal

fold vibrations. There i s no exact agreement between the fundamental

frequency data obtained from the two devices. This i s a consequence

mainly of the fact that variations in the contact microphone signal

Page 18: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the

AMPLITUDE (dB) FUNDAMENTAL FSEQUENCY (Hz)

Page 19: Electroglottograph and contact microphone for measuring ... · STL-QPSR 4/1977 when the carrier frequency is comparatively low (in the region below 1 MHZ) and the current from the