How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive...

41
How our brains perceive categories of sounds

Transcript of How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive...

Page 1: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

How our brainsperceive categoriesof sounds

Page 2: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

/kæt/

Task of the listener: Mapping sound to meaning

Requires converting the rawsignal into the kinds of soundrepresentations that connect tomeaning.

• What are thosesoundrepresentationslike?

Page 3: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Stored sound representations must abstract away from a huge amountof physical variance in the signal

• Loudness• Pitch• Gender• Speaker identity• Accent and other variance in pronunciation• Etc…

Task of the listener: Mapping sound to meaning

Page 4: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Categorization inside and outside language

What makes a chair a chair?

What makes a ‘t’ a ‘t’?• For the speaker?• For the listener?

Narrowing down the question:• What makes a ‘t’ a ‘t’, as opposed to a ‘d’?

Page 5: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Distinctive featurest+ consonantal- sonorant- syllabic

- nasal- continuant- lateral

- voiced

- round+ anterior

Major classfeatures

Mannerfeatures

Laryngealfeatures

Place ofarticulationfeatures

d+ consonantal- sonorant- syllabic

- nasal- continuant- lateral

+ voiced

- round+ anterior

Page 6: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Voiced [+ voiced] sounds are

produced with the vocalfolds close together, insuch as way that airpassing through themcauses them to vibrate.

Touch your larynx whilepronoucing [z/] vs. [s] tofeel that [z] is voicedand [s] voiceless.

Human vocal tract

Page 7: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Voicing and stop consonants Stop consonants: p, t, k, b, d, g

• Produced by causing a complete closure of the vocaltract and then releasing it.

Stops cannot be heard until the closure is released.

All vowels are voiced.

When the stop is released quickly, and the voicing of thevowel starts quickly, the stop is considered [+ voiced].

The difference between [d] and [t] is that the onset ofvoicing of the following vowel takes longer for [t] than for[d].

Page 8: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Acoustically, voiced and voiceless stops differ in their

Voice Onset Time (VOT)

VOT: The time it takes for the voicing ofthe vowel to start.

Page 9: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

VOT continuum for ta-da

60 msec

Englishboundary

Page 10: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Categorical perception of t-d Eimas & Corbitt: (1973):

Perception ofsynthesized soundswith VOTs increasing inequal steps from 0 to80ms.

Listeners to judgewhether they hear a [da]or [ta].

Page 11: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

/d/ category /t/ category

Within category discrimination is poor. How do we learn to group sounds in this way? What is it

good for?

Page 12: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

/d/ category /t/ category

A change withinthese VOTs …

… or withinthese VOTs

is never associated with a change in meaning in English. Sobeing able to discriminate between 10 and 20ms VOTs or 60and 70ms VOTs won’t help you understand English.

Page 13: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Short VOT Long VOT

boundary

“doll” “tall”Different words

Could not bedifferent words

Page 14: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Phoneme A category of sounds.

A mental abstraction over a group of sounds withinwhich interchanging one token for another can neverinduce a change in meaning.

Since going from /t/ to /d/ in English can induce achange in meaning, /t/ and /d/ are distinct phonemes inEnglish.

Page 15: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

/kæt/

Requires converting the raw signalinto the kinds of soundrepresentations that connect tomeaning.

• What are those soundrepresentations like?

• Sequences ofphonemes!

Page 16: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

What do we know about the brain bases ofphonological categorization?

Page 17: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

An auditory evoked neuromagnetic field

Roberts, Ferrari, Stufflebeam & Poeppel, 2000, J Clin Neurophysiol, Vol 17, No 2, 2000

M100 field pattern

Page 18: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

M100 (N1 in ERP terms)

Generated in auditory cortex bilaterally.

Affected by: Intensity

higher intensity → shorter latency, larger amplitude

Frequency higher frequency → shorter latency

Phonetic identity e.g., shorter latencies for a than for u (explainable in

spectral terms)

Function of M100 activity in speech processingunclear.

Page 19: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Auditory Mismatch Field

A change detector. Not part of the basic processing of sounds.

Page 20: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Auditory Mismatch Field

Elicited in auditory cortex at ~180ms post stimulus onsetby deviant stimuli in an oddball paradigm.

XXXYXXXXYXXXXYXXX

X = “standard”

Y = “deviant”

Page 21: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Auditory Mismatch Field

A tool for investigating what counts as a changefor auditory cortex.

Has been heavily used in the study of the neuralbases of categorical perception.

Page 22: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Sams et al. 1985

Mismatch field as a function of frequency change

Page 23: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

The Mismatch Field andPhonemic Categorization

Would crossing a phonemic boundary elicit amismatch field?

If yes, this would tell us that auditory cortex has access to phonemic

categories

that category information is accessed by 180ms.

Page 24: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Voice Onset Time (VOT)

60 msec

Page 25: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

T45 ms

T64 ms T

56 ms

D00 ms

D05 ms

D15 ms

D00 ms

D00 ms

D00 ms

D05 ms

D16 ms

D18 ms

D16 ms

D24 ms

D24 ms

D24 ms

Acoustic representation

Phillips et al. (2000, Journal of Cognitive Neuroscience)

standards

deviants

perceptual boundary

Page 26: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

T45 ms

T64 ms

T56 ms

D00 ms

D05 ms

D15 ms

D00 ms

D00 ms

D00 ms

D05 ms

D16 ms

D18 ms

D16 ms

D24 ms

D24 ms

D24 ms

Phonological representation

Phillips et al. (2000, Journal of Cognitive Neuroscience)

perceptual boundary

Page 27: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

T45 ms

T64 ms

T56 ms

D00 ms

D05 ms

D15 ms

D00 ms

D00 ms

D00 ms

D05 ms

D16 ms

D18 ms

D16 ms

D24 ms

D24 ms

D24 ms

If the auditory cortex is sensitive to phonologicalcategories, deviant Ts should elicit a Mismatch Field.

Phillips et al. (2000, Journal of Cognitive Neuroscience)

perceptual boundary

Page 28: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

T45 ms

T64 ms T

56 ms

D00 ms

D05 ms

D15 ms

D00 ms

D00 ms

D00 ms

D05 ms

D16 ms

D18 ms

D16 ms

D24 ms

D24 ms

D24 ms

What should happen if we keep everything else thesame, but lift all the VOTs by 20ms?

perceptual boundary

Page 29: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

T65 ms

T84 ms T

76 ms

D20 ms

D25 ms

T35 ms

D20 ms

D20 ms

D20 ms

D25 ms

T36 ms

D38 ms

T36 ms

T44 ms

T44 ms

T44 ms

standards

deviants

The deviants should not elicit a Mismatch Field --they are not perceived as different from the standards.

perceptual boundary

Page 30: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Phillips et al. (2000, Journal of Cognitive Neuroscience)

Experiment 1: standards fall on one side of a perceptualboundary, the deviants on the other.

Experiment 2: the physical distance between standards anddeviants is as in Exp. 1, but now the standards and deviants areno longer differentiated by a perceptual boundary.

If a Mismatch Field elicited in Exp 1 is driven by the categorydifference between standards and deviants, it shoulddissappear in Exp 2.

Page 31: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Phillips et al., 2000, Journal of Cognitive Neuroscience

Page 32: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Phillips et al. (2000, Journal of Cognitive Neuroscience)

A Mismatch Field is elicited only when standards fall on oneside of a perceptual boundary, the deviants on the other.

Auditory cortex has access to phonological categoryinformation at 180ms (shortly after the M100).

In this example, category membership depended on voicing. Different stop consonant have different perceptual boundaries

for voicing. If the mismatch response is sensitive to voicing, we should be

able elicit it just for that feature, even across phonemiccategories.

Page 33: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Multiple places of articulation

pæ tæ tæ kæ dæ pæ kæ tæ pæ kæ bæ tæ pæ kæ gæ

(Phillips, Pellathy & Marantz 2000)

Page 34: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Multiple places of articulation

pæ tæ tæ kæ dæ pæ kæ tæ pæ kæ bæ tæ pæ kæ gæ – – – – [+voi] – – – – – [+voi] – – – [+voi]

• Voiceless phonemes are in many-to-one ratio with[+voice] phonemes.

• No other many-to-one ratios in this sequence

(Phillips, Pellathy & Marantz 2000)

Page 35: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Multiple places of articulation

pæ tæ tæ kæ dæ pæ kæ tæ pæ kæ bæ tæ pæ kæ gæ – – – – [+voi] – – – – – [+voi] – – – [+voi]

(Phillips, Pellathy & Marantz 2000)

Page 36: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Stimuli

Page 37: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Stimuli

Page 38: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Stimuli

Page 39: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Mismatch response elicited by voicing

Page 40: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Left laterality of language related Mismatch Field

Nonlinguistic sounds elicit a bilateral Mismatch Field. I.e., both auditory cortices are “surprised.”

Crossing a phoneme boundary in an oddball taskelicits a Mismatch Field in the left hemisphere only. The left hemisphere auditory cortex is specialized for

phoneme perception.

Page 41: How our brains perceive categories of sounds · Phillips et al. (2000, Journal of Cognitive Neuroscience) A Mismatch Field is elicited only when standards fall on one side of a perceptual

Mismatch Summary

By 180ms, auditory cortex has extracted from theinput the relevant features needed for categorizingsounds into phonemes.

Since the Mismatch field is a surprise response, thenecessary computational steps needed forcategorization must occur prior to 180ms. Howexactly this happens, we don’t currently know.