Signal Analysis Using Autoregressive Models of Amplitude Modulation

Signal Analysis Using Autoregressive Models of

Amplitude Modulation

Sriram Ganapathy

Advisor - Hynek Hermansky11-18-2011

Overview

Introduction AR Model of Hilbert Envelopes FDLP and its Properties Applications Summary

Overview


Introduction Sub-band speech and audio signals - product of

smooth modulation with a fine carrier









=



=

>
>
>



Non-Unique

AM

FM



Non-Unique

AM

FM

119909 (119905 )=119898 (119905 )lowast cos 120596119900119905+120593 (119905 )

Desired Properties of AM Linearity

Continuity

Harmonicity

120572 119909 (119905 )120572119898 (119905 )

119909 (119905 )+120575 119909 (119905 )119898 (119905 )+120575119898 (119905 )

cos (120596119900119905)1

Uniquely satisfied by the analytic signal

Desired Properties of AM

119909 (119905) H 119895+iquest iquestoriquest 119898(119905)

120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)

- analytic signal ndash Hilbert envelope

However the Hilbert transform filter is infinitely long and can cause artifacts for finite length signals

Need for modeling the Hilbert envelope without explicit computation of the Hilbert transform


Overview


Overview


Discrete-time analytic signal

AR Model of Hilbert Envelopes

119883 119886 [119896] =

Signal with zero mean in time and frequency domain for n = 0hellipN-1

0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =

Let - even-symmetrized version of

0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]DCT zero-padded with

N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal

Zero-paddedDCT sequence

We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

Spectrum ofHilbert env for

even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

Hilb env ofeven-symm

signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP

LP in Time and Frequency

Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP

FDLPLinear prediction on the cosine transform of the signal

Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech

FDLP for Speech Representation

DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram

Conventional Approaches


Time

Freq

Time

Freq

FDLP versus Mel Spectrogram

FDLP

Mel

SriramGanapathySamuelThomasandHHermanskyldquoComparison of Modulation Frequency Features for Speech RecognitionICASSP2010

Overview


Overview


Resolution of FDLP Analysis

FDLP

Mel

Sig

FDLP Env


FDLP

Mel

Res = (Critical Width)-1

Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel

Properties of FDLP Analysis

FDLP

Mel

bull Summarizing the gross temporal variation with a few parametersndash Model order of FDLP controls the degree of

smoothness ndash AR model captures perceptually important high energy

regions of the signal

bull Suppressing reverberation artifactsndash Reverberation is a long-term convolutive distortion

bull Analysis in long-term windows and narrow sub-bands


FDLP

Mel






Reverberation

When speech is corrupted with convolutive distortion like room reverberation

CleanSpeech

RoomResponse = Revb

Speech

Reverberation


In the long-term DFT domain this translates

CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation


In the DFT domain this translates to a multiplication In the sub-band

Reverberation



In narrow bands is slowly varying

FDLP envelope of band using all-pole parameters hellip is given by

When the sub-band signal is multiplied by the gain is modified

Normalization to convolutive distortions is achieved by reconstructing the FDLP envelope with

Gain Normalization in FDLP

Gain Normalization in FDLPWithout gain norm

With gain norm

SThomasSGanapathyandHHermanskyldquoRecognition of Reverberant Speech Using FDLPIEEESignalProcLetters2008

Overview


Overview


Outline of Applications

Sub-bandDecomposition FDLP

GainNorm

Quant

Short-term Features for Speaker amp

Speech Recog

Modulation Features for Phoneme Recog

Wide-band Speech amp Audio Coding

Input

Signal

SGanapathySThomasPMotlicekandHHermanskyldquoApplications of Signal Analysis Using Autoregressive Models of Amplitude ModulationIEEEWASPAAOct2009

FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Envelopes in each band are integrated along time (25 ms with a shift of 10 ms)

Integration in frequency axis to convert to mel scale

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Sub-band energies are converted to cepstral coefficients by applying log and DCT along frequency axis

Delta and acceleration coefficients are appended to obtain 39 dim feat similar to conventional MFCC feat

Speech Recognition

TIDIGITS Database (8 kHz) Clean training data test data can be clean or naturally

reverberated HMM-GMM system

Whole-word HMM models trained on clean speech Performance in terms of word error rate (WER)

Features PLP features with cepstral mean subtraction (CMS) Long term log spectral subtraction (LTLSS) [Gelbart 2002] FDLP short-term (FDLP-S) features ndash 39 dim

Speech Recognition


Clean Reverb0

10

20PLP-CMSLTLSSFDLP-SW

ER

Speaker Verification NIST 2008 Speaker recognition evaluation (SRE)

Has telephone speech and far-field speech

GMM-UBM system Trained on a large set of development speakers Adapted on the enrollment data from the target speaker Nuisance attribute projection (NAP) on supervectors Detection cost function (DCF) = 099 Pfa + 01 Pmiss

Features with warping [Pelecanos 2001] Mel Frequency Cepstral Coefficients (MFCCs) FDLP short-term (FDLP-S) features

Tel Mic Cross domain

10

20

30

MFCCFDLP-SD

CF

Speaker Verification

SGanapathyJPelecanosandMOmarldquoFeature Normalization for Speaker Verification in Room ReverberationICASSP2011



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Feature Extraction

Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Static compression is a logarithm ndash reduce the huge dynamic range in the in the sub-band envelope


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Dynamic compression is implemented by dynamic compression loops consisting of dividers and low pass filters [Kollmeier 1999]


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Compressed sub-band envelopes are DCT transformed to obtain modulation frequency components

14 static and dynamic modulation spectra (0-35 Hz) with 17 sub-bands gets a feature of 476 dim

Phoneme Recognition

TIMIT Database (8 kHz) Clean training data test data can be clean additive noise

reverberated or telephone channel Multi-layer perceptron (MLP) based system

MLPs estimate phoneme posteriors Hidden Markov model (HMM) ndash MLP hybrid model Performance in phoneme error rate (PER)

Features Perceptual linear prediction (PLP) - 9 frame context Advanced ETSI standard [ETSI2002] ndash 9 frame context FDLP modulation (FDLP-M) features ndash 476 dim

SGanapathySThomasandHHermanskyldquoTemporal Envelope Compensation for Robust Phoneme Recognition Using Modulation SpectrumJASA2010

Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output

SriramGanapathyPetrMotlicekandHHermanskyldquoAutoregressive Modeling of Hilbert Envelopes for Wide-band Audio CodingAES124thConventionAudioEngineeringSocietyMay2008

Subjective Evaluations

SGanapathyPMotlicekandHHermanskyldquoAR Models of Amplitude Modulation in Audio CompressionIEEETransactionsonAudioSpeechandLanguageProc2010

Series10

20

40

60

80

100

Hidden RefLPF7kMP3FDLPAAC

Overview


Overview


Summary

Employing AR modeling for estimating amplitude modulations

Long-term temporal analysis of signals forms an efficient alternative to conventional short-term spectrum

Provides AM-FM decomposition in sub-bands and acts as unified model for speech and audio signals

Summary




Summary




Our Contributions

Simple mathematical analysis for AR model of Hilbert envelopes

Investigating the resolution properties of FDLP

Gain normalization of FDLP Envelopes

Our Contributions




Our Contributions




Our Contributions

Short-term feature extraction using FDLP ndashImprovements in reverb speech recog

Modulation feature extraction ndash Phoneme recognition in noisy speech

Speech and audio codec development using AM-FM signals from FDLP

Our Contributions




Our Contributions




Acknowledgements Lab Buddies ndash Samuel Thomas Sivaram Garimella

Padmanbhan Rajan Harish Mallidi Vijay Peddinti Thomas Janu Aren Jansen

Idiap personnel ndash Petr Motlicek Joel Pinto Mathew Doss

IBM personnel ndash Jason Pelecanos Mohamed Omar

Others ndash Xinhui Zhou Daniel Romero Marios Athineos David Gelbart Harinath Garudadri

Thank You

Evidences

Physiological evidences - Spectro-temporal receptive fields [Shamma etal 2001]

Psycho-physical evidences - Perceptual importance of modulation frequencies

[Drullman et al 1994] Syllable recognition from temporal modulations with

minimal spectral cues [Shannon et al 1995]

Evidences





Applications

Modulation spectra has been used in the past Speech intelligibility [Houtgast et al 1980] RASTA processing [Hermansky et al 1994] Speech recognition [Kingsbury et al 1998] AM-FM decomposition [Kumaresan et al 1999] Sound texture modeling [Athineos et al 2003] Sound source separation [King et al 2010]

Linear Prediction ndash Time Domain

Current sample expressed as a linear combination of past samples

nn-1n-2n-3

a3a2

a1



Model parameters are solved by minimizing the residual sum of squares

AR model of Power Spectrum

Filter interpretation [Makhoul 1975]

From Parsevalrsquos theorem


By definition Let

Thus parameters are solved by minimizing


Solution of the linear prediction yields an all-pole model of the power spectrum

Numerator G denotes the gain of AR model (equal to minimum residual sum of squares)




AR model of power spectrum

Hilbert Envelope - Definition Analytic signal is the sum of the signal and its

quadrature component

where H denotes the Hilbert transform

Hilbert envelope is the squared magnitude of the analytic signal

Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition

a Signalb Hilb Envc FDLP Envd AM compe FM comp

Spectrogram Comparison

FDLP


PLP

Dealing with Convolutive Distortions

Cepstral mean subtraction (CMS) long-term log spectral subtraction (LTLSS) amp gain normalization CMS assumes distortion in neighboring frames

to be similar ndash suppresses short-term artifacts

Long-term subtraction deals with reverberation assuming over the same response over a window of long-term frames [Gelbart 2002]

Gain normalization deals with short and long term distortions within a single long-term frame
















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features

SriramGanapathySamuelThomasandHHermanskyldquoModulation Frequency Features for Phoneme Recognition in Noisy SpeechJASAExpressLetters2009

a Signalb Hilb Envc FDLP Envd Log compe Dyn comp

Noise Compensation in FDLP

Critical-band

Window

LinearPred

Signal

+Noise

DCT IDFT | |2 DFTFDLP

Env

When speech is corrupted with additive noise

y The noise component is additive in the non-parametric Hilbert envelope

domain (assuming the signal and noise are uncorrelated)


Critical-band

Window

InputDCT IDFT | |2 DFTWiener

Filtering

VAD

Voice activity detector (VAD) provides information about the non-speech regions which are used for estimating the temporal envelope of the noise

Noise subtraction tries to subtract the estimate the noise envelope from the noisy speech envelope


SGanapathySThomasandHHermanskyldquoTemporal Envelope Subtraction for Robust Speech Recognition using Modulation SpectrumIEEEASRU2009

Conventional signal analysis ndash starts with the estimation of short-term spectrum (10-40 ms)

Introduction

Time

Freq

uenc

y

Introduction


Spectrum is sampled at a preset rate before further modelingprocessing stages

Contextual information is typically processed with time-series models such as HMM

Introduction




Signal Analysis Using Autoregressive Models of Amplitude Modula

Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)


Desired Properties of AM (2)


Overview (3)

Overview (4)


AR Model of Hilbert Envelopes (2)













LP in Time and Frequency (2)

FDLP

FDLP (2)

FDLP (3)

FDLP (4)


FDLP for Speech Representation (2)







Overview (5)

Overview (6)


Resolution of FDLP Analysis (2)



Properties of FDLP Analysis (2)

Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)


Gain Normalization in FDLP (2)

Overview (7)

Overview (8)


Short-term Features

Short-term Features (2)



Speech Recognition

Speech Recognition (2)


Speaker Verification (2)

Outline of Applications (2)

Modulation Features


Modulation Feature Extraction (2)



Phoneme Recognition

Phoneme Recognition (2)


Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions

Our Contributions (2)





Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications


Linear Prediction ndash Time Domain (2)


AR model of Power Spectrum (2)




Hilbert Envelope - Definition

Hilbert Envelope

Duality


AM-FM Decomposition



Dealing with Convolutive Distortions (2)



Feature Comparison


Modulation Features (2)


Noise Compensation in FDLP (2)


Introduction (9)

Introduction (10)

Introduction (11)

Overview


Overview












=



=

>
>
>



Non-Unique

AM

FM



Non-Unique

AM

FM

119909 (119905 )=119898 (119905 )lowast cos 120596119900119905+120593 (119905 )


Continuity

Harmonicity

120572 119909 (119905 )120572119898 (119905 )

119909 (119905 )+120575 119909 (119905 )119898 (119905 )+120575119898 (119905 )

cos (120596119900119905)1




120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)





Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Overview












=



=

>
>
>



Non-Unique

AM

FM



Non-Unique

AM

FM

119909 (119905 )=119898 (119905 )lowast cos 120596119900119905+120593 (119905 )


Continuity

Harmonicity

120572 119909 (119905 )120572119898 (119905 )

119909 (119905 )+120575 119909 (119905 )119898 (119905 )+120575119898 (119905 )

cos (120596119900119905)1




120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)





Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)











=



=

>
>
>



Non-Unique

AM

FM



Non-Unique

AM

FM

119909 (119905 )=119898 (119905 )lowast cos 120596119900119905+120593 (119905 )


Continuity

Harmonicity

120572 119909 (119905 )120572119898 (119905 )

119909 (119905 )+120575 119909 (119905 )119898 (119905 )+120575119898 (119905 )

cos (120596119900119905)1




120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)





Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



=

>
>
>



Non-Unique

AM

FM



Non-Unique

AM

FM

119909 (119905 )=119898 (119905 )lowast cos 120596119900119905+120593 (119905 )


Continuity

Harmonicity

120572 119909 (119905 )120572119898 (119905 )

119909 (119905 )+120575 119909 (119905 )119898 (119905 )+120575119898 (119905 )

cos (120596119900119905)1




120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)





Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



Non-Unique

AM

FM



Non-Unique

AM

FM

119909 (119905 )=119898 (119905 )lowast cos 120596119900119905+120593 (119905 )


Continuity

Harmonicity

120572 119909 (119905 )120572119898 (119905 )

119909 (119905 )+120575 119909 (119905 )119898 (119905 )+120575119898 (119905 )

cos (120596119900119905)1




120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)





Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



Non-Unique

AM

FM

119909 (119905 )=119898 (119905 )lowast cos 120596119900119905+120593 (119905 )


Continuity

Harmonicity

120572 119909 (119905 )120572119898 (119905 )

119909 (119905 )+120575 119909 (119905 )119898 (119905 )+120575119898 (119905 )

cos (120596119900119905)1




120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)





Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Continuity

Harmonicity

120572 119909 (119905 )120572119898 (119905 )

119909 (119905 )+120575 119909 (119905 )119898 (119905 )+120575119898 (119905 )

cos (120596119900119905)1




120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)





Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)




120596 (119905)120596119900+120593 (119905)

119889119889119905

- Hilbert transform

119909119886 (119905)





Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)




Overview


Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Overview




119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



119883 119886 [119896] =


0 for k geN 22 119883 [119896] for k lt N 2

119883 [119896] 119883 119886 [119896]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ]119910 [119896] =4119877119890119883 [119896 ] klt N

N-point DCT



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N


N-zeros

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



119876119886 [119896] =


0 k geN2119876[119896 ] k lt N

119876 [119896 ]=2119877119890 119883 [119896 ] Zero-padded DCT

119910 [119896]=4119877119890119883 [119896 ] k lt N0 k geN

119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

analytic signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

SignalSpectrum

PowerSpectrum

Auto-corr

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]

Spectrum ofeven-sym

Analytic Signal


We have shown -


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


even-sym signal

Auto-correlation of

DCT sequence


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]


signal

Auto-corr ofDCT

F


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


119876119886 [119896 ] = F 119902119886 [119899 ] = [119896 ]We have shown -

F iquest119902119886 [119899 ]or2 =119903119910 [120591 ]

AR model ofHilb env

Auto-corr ofDCT

LP


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Duality

TimePowerSpec

LP


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Duality

TimePowerSpec

LP

Duality

DCTHilbEnv

LP


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Hilb Env

FDLP Env

Speech


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Hilb Env

FDLP Env

DCT

LP


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Hilb Env

DCT

LP


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Hilb Env

FDLP Env

Speech


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


DCT


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


DCT


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


DCT

LP


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


DCT

LP


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


DCTLP

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

FDLP Spectrogram


Time

Freq

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

FDLP Spectrogram



Time

Freq

Time

Freq


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


FDLP

Mel


Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Overview


Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Overview



FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


FDLP

Mel

Sig

FDLP Env


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


FDLP

Mel


Sig

FDLP Env

Sig

FDLP Env


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


FDLP

Mel


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


FDLP

Mel







FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


FDLP

Mel






Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Reverberation


CleanSpeech

RoomResponse = Revb

Speech

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Reverberation



CleanSpeech

RoomResponse = Revb

Speech

xCleanDFT

ResponseDFT = Revb

DFT

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Reverberation



Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Reverberation









With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)






With gain norm


Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Overview


Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Overview




GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



GainNorm

Quant


Speech Recog



Input

Signal


FM

AM

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Short-term Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Short-term Features

Sub-band

Window

InputDCT FDLP Gain

Norm

Log+

DCT

FeatEnergyInt



Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Speech Recognition





Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Speech Recognition


Clean Reverb0

10


ER






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)






10

20

30

MFCCFDLP-SD

CF





GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Modulation Features


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat



Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Phoneme Recognition






Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Clean Add Noise

Reverb Tel 30

45

60

75

PLP-9ETSI-9FDLP-M

PE

R

Phoneme Recognition



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Audio Coding


GainNorm

Quant


Speech Recog



Input

Signal

FM

AM

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Audio Coding

QMFAnalysis

FDLPInput

1

32

MDCT

Env

Carr

Q

Q

Q-1

Q-1 IMDCT

MulQMF

Synthesis

1

32

Output




Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



Series10

20

40

60

80

100


Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Overview


Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Overview


Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Summary




Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Summary




Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Summary




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Our Contributions




Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Our Contributions




Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Our Contributions




Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Our Contributions




Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Our Contributions









Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)






Thank You

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Evidences





Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Evidences





Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Applications




nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)



nn-1n-2n-3

a3a2

a1








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)








By definition Let













Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)












Hilbert Envelope

c FDLP Env

b Hilb Env

a Speech

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Duality

LP

FDLP


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


AM-FM Decomposition



FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


FDLP


PLP





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)





















Feature Comparison


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Critical-band

Window

InputDCT FDLP

Static

Dynamic

DCT

DCT

200ms

Sub-bandFeat

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Modulation Features




Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Critical-band

Window

LinearPred

Signal

+Noise


Env





Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)


Critical-band

Window


Filtering

VAD






Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)




Introduction

Time

Freq

uenc

y

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Introduction




Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Introduction





Overview

Overview (2)

Introduction

Introduction (2)

Introduction (3)

Introduction (4)

Introduction (5)

Introduction (6)

Introduction (7)

Introduction (8)




Overview (3)

Overview (4)
















FDLP

FDLP (2)

FDLP (3)

FDLP (4)









Overview (5)

Overview (6)






Reverberation

Reverberation (2)

Reverberation (3)

Reverberation (4)



Overview (7)

Overview (8)


Short-term Features




Speech Recognition





Modulation Features





Phoneme Recognition



Audio Coding

Audio Coding (2)


Overview (9)

Overview (10)

Summary

Summary (2)

Summary (3)

Our Contributions






Acknowledgements

Slide 92

Evidences

Evidences (2)

Applications









Hilbert Envelope

Duality


AM-FM Decomposition






Feature Comparison






Introduction (9)

Introduction (10)

Introduction (11)

Signal Analysis Using Autoregressive Models of Amplitude Modulation

Documents

Transcript of Signal Analysis Using Autoregressive Models of Amplitude Modulation