Download - Behrman Chapter 5, 6

Transcript
Page 1: Behrman Chapter 5, 6

Behrman Chapter 5, 6

Place less emphasis on…

• Minor anatomical landmarks and features

• Extrinsic muscles of the larynx

• Blood supply to the larynx

• Central motor control of larynx

• Peripheral Sensory control of larynx

• Stress-Strain Properties of Vocal Folds

Page 2: Behrman Chapter 5, 6

Laryngeal Activity in Speech/Song

• Sound source to excite the vocal tract– Voice– Whisper

• Prosody– Fundamental frequency (F0) variation– Amplitude variation

• Realization of phonetic goals– Voicing– Devoicing– Glottal frication (//, //)– Glottal stop (//)– Aspiration

• Para-linguistic and extra-linguistic roles– Transmit affect– Speaker identity

Page 3: Behrman Chapter 5, 6

The vocal fold through life…

• Newborns– No layered structure of LP– LP loose and pliable

• Children– Vocal ligament appears 1-4

yrs– 3-layered LP is not clear

until 15 yrs

• Old age– Superficial layer becomes

edematous & thicker– Thinning of intermediate

layer and thickening of deep layer

– Changes in LP more pronounced in men

– Muscle atrophy

Page 4: Behrman Chapter 5, 6

The Glottal Cycle

Page 6: Behrman Chapter 5, 6

Myoelastic Aerodynamic Theory of Phonation

Necessary and Sufficient Conditions

• Vocal Folds are adducted (Adduction)

• Vocal Folds are tensed (Longitudinal Tension)

• Presence of Aerodynamic pressures

Page 7: Behrman Chapter 5, 6

2-mass model

Lower part of vocal fold

Upper part of vocal fold

Mechanical coupling stiffness

TA muscle

Coupling between

mucosa & muscle

Page 8: Behrman Chapter 5, 6

•VF adducted & tensed → myoelastic pressure (Pme )•Glottis is closed•subglottal air pressure (Psg) ↑ •Psg ~ 8-10 cm H20, Psg > Pme

•L and R M1 separate•Transglottal airflow (Utg) = 0

As M1 separates, M2 follows due to

mechanical coupling stiffnessPsg > Pme

glottis begins to openPsg > Patm therefore Utg > 0

Page 9: Behrman Chapter 5, 6

Utg ↑ ↑ since glottal aperature << tracheal circumference

Utg ↑ Ptg ↓ due to

Bernoulli effectPressure drop across the glottis

Bernoulli’s Law

P + ½ U2 = K

where

P = air pressure

= air density

U = air velocity

Page 10: Behrman Chapter 5, 6

Utg ↑ Ptg ↓ due to Bernoulli effect

Plus “other” aerodynamic effects

Ptg < Pme

M1 returns to midlineM2 follows M1 due to

mechanical coupling stiffness

Utg = 0

Pattern repeats 100-200 times a second

Page 11: Behrman Chapter 5, 6
Page 12: Behrman Chapter 5, 6

Limitations of this simple model

Page 13: Behrman Chapter 5, 6

The Glottal Cycle

Page 14: Behrman Chapter 5, 6

Sound pressure wave

Time

Inst

anta

neou

sso

und

pres

sure

Page 15: Behrman Chapter 5, 6

Phonation is actually quasi-periodic

• Complex Periodic– vocal fold oscillation

• Aperiodic– Broad frequency noise embedded in signal– Non-periodic vocal fold oscillation– Asymmetry of vocal fold oscillation – Air turbulence

• Voicing vs. whispering

Page 16: Behrman Chapter 5, 6

Glottal Aerodynamics

• Volume Velocity

• Driving Pressure

• Phonation Threshold Pressure– Initiate phonation– Sustain phonation

• Laryngeal Airway Resistance

Page 17: Behrman Chapter 5, 6

Measuring Glottal Behavior

• Videolaryngoscopy – Stroboscopy– High speed video

Page 18: Behrman Chapter 5, 6

Photoglottography (PGG)

Time

illum

inat

ion

Page 19: Behrman Chapter 5, 6

Electroglottography (EGG)• Human tissue = conductor • Air: conductor• Electrodes placed on each

side of thyroid lamina• high frequency, low current

signal is passed between them

• VF contact = impedance• VF contact = impedance

Page 20: Behrman Chapter 5, 6

Electroglottogram

Page 21: Behrman Chapter 5, 6

Glottal Airflow (volume velocity)

• Instantaneous airflow is measured as it leaves the mouth

• Looks similar to a pressure waveform

• Can be inverse filtered to remove effects of vocal tract

• Resultant is an estimate of the airflow at the glottis

Page 22: Behrman Chapter 5, 6

Flow Glottogram

Page 23: Behrman Chapter 5, 6

Synchronous plots

Sound pressure waveform(at mouth)

Flow glottogram(inverse filtered mask signal)

Photoglottogram

Electroglottogram

Page 24: Behrman Chapter 5, 6

F0 Control

• Anatomical factorsMales ↑ VF mass and length = ↓ Fo

Females ↓ VF mass and length = ↑ Fo

• Subglottal pressure adjustment – show example↑ Psg = ↑ Fo

• Laryngeal and vocal fold adjustments↑ CT activity = ↑ Fo

TA activity = ↑ Fo or ↓ Fo

• Extralaryngeal adjustments↑ height of larynx = ↑ Fo

Page 25: Behrman Chapter 5, 6

Fundamental Frequency (F0)

Average F0

• speaking fundamental frequency (SFF)

• Correlate of pitch

• Infants– ~350-500 Hz

• Boys & girls (3-10) – ~ 270-300 Hz

• Young adult females– ~ 220 Hz

• Young adult males– ~ 120 Hz

Older females: F0 ↓

Older males: F0 ↑

F0 variability• F0 varies due to

– Syllabic & emphatic stress– Syntactic and semantic factors– Phonetics factors (in some

languages) • Provides a melody (prosody)

• Measures– F0 Standard deviation

• ~2-4 semitones for normal speakers

– F0 Range

Page 26: Behrman Chapter 5, 6

Maximum Phonational Frequency Range

• highest possible F0 - lowest possible F0

• Not a speech measure

• measured in Hz, semitones or octaves

• Males ~ 80-700 Hz1

• Females ~135-1000 Hz1

• 3 octaves often considered normal

1Baken (1987)

Page 27: Behrman Chapter 5, 6

Fundamental Frequency (F0) Control

• Ways to measure F0

– Time domain vs. frequency domain– Manual vs. automated measurement– Specific Approaches

• Peak picking• Zero crossing• Autocorrelation• The cepstrum & cepstral analysis

Page 28: Behrman Chapter 5, 6

Autocorrelation

Data Correlation

+ 1.0

+ 0.1

- 0.82

+ 0.92

Page 29: Behrman Chapter 5, 6

Cepstrum

Page 30: Behrman Chapter 5, 6
Page 31: Behrman Chapter 5, 6

Amplitude Control

• Subglottal pressure adjustment↑ Psg = ↑ sound pressure

• Laryngeal and vocal fold adjustments↑ medial compression = ↑ sound pressure

• Supralaryngeal adjustments

Page 32: Behrman Chapter 5, 6

Measuring Amplitude

• Pressure

• Intensity

• Decibel Scale

Page 33: Behrman Chapter 5, 6

Sound Pressure Level (SPL)

Average SPL• Correlate of loudness• conversation:

• ~ 65-80 dBSPL

SPL Variability SPL to mark stress• Contributes to prosody• Measure

– Standard deviation for neutral reading material:

• ~ 10 dBSPL

Page 34: Behrman Chapter 5, 6

Dynamic Range

• Amplitude analogue to maximum phonational frequency range

• ~50 – 115 dB SPL

Page 35: Behrman Chapter 5, 6

Vocal Quality

• no clear acoustic correlates like pitch and loudness

• However, terms have invaded our vocabulary that suggest distinct categories of voice quality

Common Terms• Breathy• Tense/strained• Rough• Hoarse

Page 36: Behrman Chapter 5, 6

Are there features in the acoustic signal that correlate with these

quality descriptors?

Page 37: Behrman Chapter 5, 6

BreathinessPerceptual Description• Audible air escape in the voice

Physiologic Factors• Diminished or absent closed phase• Increased airflow

Potential Acoustic Consequences• Change in harmonic (periodic) energy

– Sharper harmonic roll off• Change in aperiodic energy

– Increased level of aperiodic energy (i.e. noise), particularly in the high frequencies

Page 38: Behrman Chapter 5, 6

harmonics (signal)-to-noise-ratio (SNR/HNR)

• harmonic/noise amplitude HNR

– Relatively more signal– Indicative of a normality

HNR– Relatively more noise– Indicative of disorder

• Normative values depend on method of calculation

• “normal” HNR ~ 15

Page 39: Behrman Chapter 5, 6

Harmonic peak

Noise ‘floor’

Noise ‘floor’

Frequency

Am

plitude

Harmonic peak

Page 40: Behrman Chapter 5, 6

From Hillenbrand et al. (1996)

First harmonic amplitude

Page 41: Behrman Chapter 5, 6

Prominent Cepstral Peak

Page 42: Behrman Chapter 5, 6

Spectral Tilt: Voice Source

Page 43: Behrman Chapter 5, 6

Spectral Tilt: Radiated Sound

Page 44: Behrman Chapter 5, 6

Peak/average amplitude ratio

Page 45: Behrman Chapter 5, 6

From Hillenbrand et al. (1996)

Page 46: Behrman Chapter 5, 6

WMU Graduate Students

Page 47: Behrman Chapter 5, 6

Tense/Pressed/Effortful/Strained Voice

Perceptual Description• Sense of effort in production

Physiologic Factors• Longer closed phase• Reduced airflow

Potential Acoustic consequences• Change in harmonic (periodic) energy

– Flatter harmonic roll off

Page 48: Behrman Chapter 5, 6

Pressed

Breathy

Spectral Tilt

Page 49: Behrman Chapter 5, 6

Acoustic Basis of Vocal Effort

100.000000 200.000000 300.000000 400.000000 500.000000

effort

100.000000

200.000000

300.000000

400.000000

500.000000

Reg

ress

ion

Ad

just

ed (

Pre

ss)

Pre

dic

ted

V

alu

eDependent Variable: effort

Scatterplot

F0 + RMS + Open Quotient

Perc

epti

on o

f E

ffor

t

Tasko, Parker & Hillenbrand (2008)

Page 50: Behrman Chapter 5, 6
Page 51: Behrman Chapter 5, 6

Roughness

• Perceptual Description– Perceived cycle-to-cycle variability in voice

• Physiologic Factors– Vocal folds vibrate, but in an irregular way

• Potential Acoustic Consequences– Cycle-to-cycle variations F0 and amplitude– Elevated jitter– Elevated shimmer

Page 52: Behrman Chapter 5, 6

Period/frequency & amplitude variability

• Jitter: variability in the period of each successive cycle of vibration

• Shimmer: variability in the amplitude of each successive cycle of vibration

Page 53: Behrman Chapter 5, 6

Jitter and Shimmer

Sources of jitter and shimmer• Small structural asymmetries

of vocal folds• “material” on the vocal folds

(e.g. mucus)• Biomechanical events, such as

raising/lowering the larynx in the neck

• Small variations in tracheal pressures

• “Bodily” events – system noise

Measuring jitter and shimmer• Variability in measurement

approaches• Variability in how measures are

reported• Jitter

– Typically reported as % or msec– Normal ~ 0.2 - 1%

• Shimmer– Can be % or dB– Norms not well established

Page 54: Behrman Chapter 5, 6

Vocal Register

What is a vocal register?

Page 55: Behrman Chapter 5, 6

Vocal Registers

Pulse (Glottal fry)– 30-80 Hz, mean ~ 60 Hz– Closed phase very long (90 % cycle)– May see biphasic pattern of vibration (open,

close a bit, open and close completely)– Low subglottal pressure (2 cm water)– Energy dies out over the course of a cycle so

parts of the cycle has very little energy– Hear each individual cycle

Page 56: Behrman Chapter 5, 6

Vocal Registers

Modal– VF are relatively short and thick – Reduced VF stiffness– Large amplitude of vibration– Possesses a clear closed phase– The result is a voice that is relatively loud and

low in pitch – Average values cited refer to modal register

Page 57: Behrman Chapter 5, 6

Vocal Registers

Falsetto– 500-1100 Hz (275-600 Hz males)– VF are relatively long and thin– Increased VF stiffness– Small amplitude of vibration– Vibration less complex– Incomplete closure (no closed phase)– The result is a voice that is high in pitch