Presenter : Pei- ning Chen NTNU CSIE SLP Lab

18
Error approximation and minimum phone error acoustic model estimation Matthew Gibson and Thomas Hain Presenter : Pei-ning Chen NTNU CSIE SLP Lab Audio, Speech, and Language Processing, IEEE Transactions

description

Error approximation and minimum phone error acoustic model estimation Matthew Gibson and Thomas Hain. Audio, Speech, and Language Processing, IEEE Transactions . Presenter : Pei- ning Chen NTNU CSIE SLP Lab. Outline . Introduction Minimum Phone Error Theory Error Approximation - PowerPoint PPT Presentation

Transcript of Presenter : Pei- ning Chen NTNU CSIE SLP Lab

Page 1: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Error approximation and minimum phone erroracoustic model estimation

Matthew Gibson and Thomas Hain

Presenter : Pei-ning ChenNTNU CSIE SLP Lab

Audio, Speech, and Language Processing, IEEE Transactions

Page 2: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Outline • Introduction • Minimum Phone Error Theory• Error Approximation• Limitation of Baseline Approximation Error• Alternative Error Approximations• Experiments• Error Approximation Analysis• Summary and Future Work

Page 3: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Introduction • Acoustic models estimated using the MPE

technique have displayed significant classification performance improvements over ML-estimated models.

• This paper introduces a novel error approximation method and demonstrates how it addresses limitations of a previously used technique, and the method is found to yield significant performance improvements when deployed for MPE acoustic model estimation.

Page 4: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

MPE

• The MPE criterion

• : Levenshtein distance

R

r

rMNr

N

WwMPE wwLowp

RR

N1111 ˆ,,|1

1

rMN wwL 11 ˆ,

Page 5: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Error Approximation

• Alignment-based error approximation:

different z and q if,1

label same z and q if,21max

zqezqe

qAz

label reference goverlappin : z

q with overlaps which z of proportion theis, zqe

Page 6: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

• A substitution example:

Page 7: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

• Swap the reference and the hypothesis:

Page 8: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

• A insertion example:

Page 9: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

• A deletion example:

Page 10: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Limitations of baseline

Page 11: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Frame Error Normalisation

• With deletion

Page 12: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

• With insertion

Page 13: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Using Multiple Reference Alignments

• MSNFR and AMSNFR

Page 14: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Analysis • S : substitution, I : insertion, D : deletion

Page 15: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

• Reference with silence

Page 16: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Evaluation results• Unsmoothed

Page 17: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

I-smoothing

Page 18: Presenter : Pei- ning  Chen NTNU CSIE SLP Lab

Summary and Future work

• Significant improvements over the previously introduced error approximation when the symmetrically normalised frame error approximation is deployed for MPE acoustic parameter re-estimation.

• Future work should compare use of the approximate methods introduced in this paper with lattice manipulation approaches and the minimum phone frame error.