Iowa State University Developmental Robotics Laboratory Unsupervised Segmentation of Audio Speech...

Iowa State UniversityDevelopmental Robotics Laboratory

Unsupervised Segmentation of Audio Speech using the Voting Experts Algorithm

Matthew Miller, Alexander StoytchevDevelopmental Robotics Lab

Department of Electrical and Computer Engineering Iowa State University

mamille@cs.iastate.edu, alexs@iastate.eduwww.cs.iastate.edu/~mamille/

Language: A Grand Challenge• A working example• Automatically acquires

language• Well studied

Statistical Learning Experiments

• Saffran et. al. (1996): 8-month-olds can segment speech.

Artificial Language:tupiro golabu bedaku padoti

Language: tu pi ro go la bu be da kuTransition Prob: 1.0 1.0 .25 1.0 1.0 .25 1.0 1.0 ...

Acclimate

Novel Word

• Hypothesis: Infants use local minima in single syllable transition probabilities to segment speech streams.

Voting Experts

• An algorithm for unsupervised segmentation• Key Idea: Natural “chunks” have:

– Low Internal Information– High Boundary Entropy

itwasabrightcolddayinaprilandtheclockswere

))"log(Pr(")"(" brightbrightI

)"(")"(" rightcIbrightI

Voting Experts

• An algorithm for unsupervised segmentation• Key Idea: Natural “chunks” have:

– Low Internal Information– High Boundary Entropy

itwasabrightcolddayinaprilandtheclockswere

)"(")"|"Pr()"("

brightIbrightbrightE

)"(")"(" brighEbrightE

VE Implementation (Cohen 2006)

1. Build an n-gram trie from text.2. Slide a window along the text sequence3. Two experts vote how to break the window

1. One minimizes internal info2. Other maximizes boundary entropy

i t w a s a b r i g h t c o l d d a y i n a p r i lWindow

windowts

)]()([min ,)"(")"(" abrigIasI

i t w a s a b r i g h t c o l d d a y i n a p r i lWindow

windowts

)]([max )"("asaE

4. Break at vote peaks

i t w a s a b r i g h t c o l d d a y i n a p r i l

i | t | w | a | s | a | b | r | i | g | h | t | c | o | l | d0

VE Results• Results are surprisingly good on text

– Especially giving its simplicity– Accuracy and Hit rate about 75%

• Seems to capture something about the nature of “chunks”

• Can we use this algorithm to segment real audio?

It was a br igh t

Acoustic Model

• Cluster spectral features using a GGSOM

Acoustic Model

• Cluster spectral features using a GGSOM• Collapse state sequence

Acoustic Model

• Cluster spectral features using a GGSOM• Collapse state sequence• Run VE to get breaks

Experiments and Results• Used the model to segment “1984”

– CD 1 of audio book (40 mins)– Chosen for length, consistency– Evaluation: Human graders

New Experiments• Trained on infant datasets

• Tested on manually generated keys

Stream A:tupiro golabu bedaku padoti

Stream B:dapiku tilado pagotu burobi

Train Train

Test Test

Acoustic Model A

Acoustic Model B

VE Model A

VE Model B

New Experiments• Trained on infant datasets

• Tested on manually generated keys

Stream A:tupiro golabu bedaku padoti

Stream B:dapiku tilado pagotu burobi

Test TestTes

t Test

Acoustic Model A

Acoustic Model B

VE Model A

VE Model B

Results• Experiment 1

– Accuracy: 50% on all induced breaks– Hit Rate: 75% of word breaks– Significantly better than chance

• Experiment 2– Accuracy: 16% on all induced breaks– Hit Rate: 1% of word breaks– Worse than chance– 18 breaks, 3 correct

Conclusions and Future Work• VE Model can be used to segment audio

• Can reproduce the results of Infant studies

• May model part of the human chunking mechanism

• Have built more sophisticated acoustic models– Better results (nearly perfect)

Thank You• www.cs.iastate.edu/~mamille/

Iowa State University Developmental Robotics Laboratory Unsupervised Segmentation of Audio Speech...

Documents

Transcript of Iowa State University Developmental Robotics Laboratory Unsupervised Segmentation of Audio Speech...

Unsupervised segmentation of natural images via lossy data ...sli.ics.uci.edu/pmwiki/uploads/Classes-2008F/Main/texturemerge.pdf · Unsupervised segmentation of natural images via

Unsupervised Segmentation of Hyperspectral Images Using 3D ...

Unsupervised Segmentation of Collagen Fiber Distribution ...manirban/journalPub/TR_JC_JBS_2010.pdf · Unsupervised Segmentation of Collagen Fiber Distribution in Different Stages

1 Unsupervised Segmentation of Synthetic Aperture Radar ...dclausi/Papers/Clausi and Deng - PRRS 2004 - MRF Segmentation of...1 Unsupervised Segmentation of Synthetic Aperture Radar

Unsupervised Segmentation of Color-Texture … Unsupervised Segmentation of Color-Texture Regions in Images and Video Yining Deng and B. S. Manjunath Abstract A new method for unsupervised

Learning Unsupervised Video Object Segmentation Through ......Learning Unsupervised Video Object Segmentation through Visual Attention Wenguan Wang ∗1,2, Hongmei Song ∗1, Shuyang

Unsupervised Object Segmentation in Video by Efficient Selection …openaccess.thecvf.com/content_ICCV_2017/papers/Haller... · 2017-10-20 · Unsupervised object segmentation in

Unsupervised Learning (Examples)bejar/apren/docum/trans/09-clusterej-eng.pdf · Outline 1 Iris 2 Voting Records 3 Mushroom 4 Image Segmentation Javier B ejar Unsupervised Learning

Unsupervised Acute Intracranial Hemorrhage Segmentation ...

Unsupervised Video Object Segmentation for Deep ...€¦ · Unsupervised Video Object Segmentation for Deep Reinforcement Learning Machine Learning and Data Analytics Symposium Doha,

Voting Experts: An Unsupervised Algorithm for Segmenting ...heeringa/publications/voting-experts.pdf · Voting Experts: An Unsupervised Algorithm for Segmenting Sequences Paul Cohen1,

Mostly-Unsupervised Statistical Segmentation of Japanese ... · PDF fileMostly-Unsupervised Statistical Segmentation of Japanese Kanji ... the terms in various technical dictionaries

Unsupervised Deconvolution-Segmentation of Textured Imagemp71/slides/jfgiovannelli.pdf · 2017. 7. 18. · Unsupervised Deconvolution-Segmentation of Textured Image Bayesian approach:

Towards Automatic Unsupervised Segmentation of Music ...moco17.movementcomputing.org/wp-content/uploads/2017/12/...Towards Automatic Unsupervised Segmentation of Music-Induced Arm

Using Unsupervised Morphological Segmentation to Improve ...uu.diva-portal.org/smash/get/diva2:1221345/FULLTEXT01.pdfUsing Unsupervised Morphological Segmentation to Improve Dependency

Mostly-Unsupervised Statistical Segmentation of Japanese Kanji ...

Unsupervised Segmentation of Color-Texture

Unsupervised Object Segmentation by Redrawingwebia.lip6.fr/~chenm/paper/...Redrawing__poster_.pdf · Unsupervised Object Segmentation by Redrawing MickaëlChen1,ThierryArtières2,3

Unsupervised Morpheme Segmentation and Morphology ...users.ics.aalto.fi/mcreutz/papers/Creutz05tr.pdf · Unsupervised Morpheme Segmentation and Morphology Induction ... Morpheme Segmentation

UNSUPERVISED SIGNAL SEGMENTATION BASED ON ... - …