1.INTRODUCTION The use of the adaptive codebook (ACB) in CELP-like speech coders allows the...

1
1.INTRODUCTION The use of the adaptive codebook (ACB) in CELP-like speech coders allows the achievement of high quality speech, especially for voiced segments. The problem The dependency of the ACB on the history of the signal makes it vulnerable to frame erasures. When a frame is lost, it affects not only the corresponding segment, but also the following frames due to error propagation, and the decoder may take several frames to recover. The proposed solution To reduce the error propagation when a frame is lost, we propose to lessen the inter-frame dependency by limiting the energy of the ACB contribution to the synthesized speech. The ACB gain by itself is not limited . 2. EXCITATION SEARCH IN THE AMR-WB CODEC The standard search for the excitation parameters is done sequentially. It consists first in finding the gain and the delay for the ACB, then the shape (pulse positions) for the innovative codebook (ICB). Finally, the ACB and ICB gains are jointly quantized. The procedure may be summarized as follows : Perform the LP analysis on the speech signal, extract the residual signal and build the target. Search for the ACB parameters that match the target (in an analysis-by-synthesis loop), according to the error criterion : Update the target by subtracting the ACB contribution. Search for the ICB parameters using the updated target. Jointly quantize the ACB and ICB gains (quadratic 4. EXPERIMENTAL RESULTS Constrained encoding improves the recovery time of the decoder after a frame erasure 6. CONSTRAINED JOINT QUANTIZATION OF THE ACB AND ICB GAINS The ACB contribution can be controlled more precisely during the gains quantization process, since at this stage both ACB and ICB excitation shapes are known. The constrained optimal solution can be found analytically using the Kuhn-Tucker conditions. 7. CONCLUSION Limiting the ACB contribution, significantly improves the recovery time of the decoder after a frame loss. In clear channel, the quality degradation due to the constraint is very small. The modified coder is completely interoperable with the standard, it requires no extra bits and adds no extra delay. Although the ACB contribution can be controlled more precisely at the gains quantization stage, modifying only the target for the ICB search is the simplest and most effective approach. INCREASING THE ROBUSTNESS OF CELP-BASED CODERS INCREASING THE ROBUSTNESS OF CELP-BASED CODERS BY CONSTRAINED OPTIMIZATION BY CONSTRAINED OPTIMIZATION 2 2 2 2 2 ) ( ˆ ) ( ˆ ) ( ˆ th n c c p n p T p R n y g n y g n y g E E R domain perceptual in the ons contributi ICB and ACB scaled) - (non the : , gains ICB and ACB quantized : ˆ , ˆ c c p y y g g searc parameters ICB for the target new the : 2 x ) ( ) ( ) ( 0 1 2 n y g n x n x gain ACB the : domain perceptual in the on contributi ACB scaled) - (non the : domain perceptual in the signal target the : domain perceptual in the error quadaratic the : 0 1 1 g y x D n n y g n x D 2 0 1 1 ) ( ) ( a) original signal b) synthesized signal using the standard encoding c) synthesized signal using the constrained encoding d) error signal between a) and b) e) error signal between a) and c) 3. CONSTRAINED SEARCH ALGORITHM If the ACB energy contribution is high, a fraction of this contribution is forced to be left in the new target. The ACB contribution is estimated by comparing, in the perceptual domain, the energy due to the ACB excitation to the target energy. A value of R th1 as low as 0.55 preserves the overall quality in clear channel while significantly improving the robustness against frame erasures. 5. SUBJECTIVE TESTS Two subjective tests were carried out in order to compare the standard and modified codecs: in the clear channel condition (no frame erasures), and with a simulated frame loss rate of 10% . Test features : 20 sentences for each condition. 12 listeners participated in the tests. Listening using binaural headphones. AMR-WB in mode 2 (12.65 kb/s). Audio files on http://www.gel. usherb .ca/ standard codecmodified codec standard codecmodified codec NO PREFERENCE IN CLEAR CHANNEL University of Sherbrooke University of Sherbrooke Sherbrooke, Quebec, Canada Sherbrooke, Quebec, Canada Mohamed Chibani, Mohamed Chibani, Philippe Gournay and Roch Philippe Gournay and Roch Lefebvre Lefebvre STRONG PREFERENCE WITH A FRAME LOSS RATE OF 10% The quadratic error relative to the quantized ACB and ICB gains D eterm ine the AC B param eters (the gain and the delay) 1 1 th R R yes no U pdate the targetforthe IC B param eters search Evaluate the AC B contriubtion n n x p n x n y g E E R ) ( ) ( 2 1 2 2 0 1 n n n y n y n x g ) ( ) ( ) ( 2 1 0 n n th n y n x R g ) ( ) ( 2 2 1 1 0

Transcript of 1.INTRODUCTION The use of the adaptive codebook (ACB) in CELP-like speech coders allows the...

Page 1: 1.INTRODUCTION The use of the adaptive codebook (ACB) in CELP-like speech coders allows the achievement of high quality speech, especially for voiced segments.

1.INTRODUCTION

The use of the adaptive codebook (ACB) in CELP-like speech coders allows the achievement of high quality speech, especially for voiced segments.

The problem

The dependency of the ACB on the history of the signal makes it vulnerable to frame erasures. When a frame is lost, it affects not only the corresponding segment, but also the following frames due to error propagation, and the decoder may take several frames to recover.

The proposed solution

To reduce the error propagation when a frame is lost, we propose to lessen the inter-frame dependency by limiting the energy of the ACB contribution to the synthesized speech. The ACB gain by itself is not limited.

2. EXCITATION SEARCH IN THE AMR-WB CODEC

The standard search for the excitation parameters is done sequentially. It consists first in finding the gain and the delay for the ACB, then the shape (pulse positions) for the innovative codebook (ICB). Finally, the ACB and ICB gains are jointly quantized. The procedure may be summarized as follows :

Perform the LP analysis on the speech signal, extract the residual signal and build the target.

Search for the ACB parameters that match the target (in an analysis-by-synthesis loop), according to the error criterion :

Update the target by subtracting the ACB contribution.

Search for the ICB parameters using the updated target.

Jointly quantize the ACB and ICB gains (quadratic error minimization in the perceptual domain).

4. EXPERIMENTAL RESULTS

Constrained encoding improves the recovery time of the decoder after a frame erasure

6. CONSTRAINED JOINT QUANTIZATION OF THE ACB AND ICB GAINS

The ACB contribution can be controlled more precisely during the gains quantization process, since at this stage both ACB and ICB excitation shapes are known.

The constrained optimal solution can be found analytically using the Kuhn-Tucker conditions.

7. CONCLUSION

Limiting the ACB contribution, significantly improves the recovery time of the decoder after a frame loss.

In clear channel, the quality degradation due to the constraint is very small.

The modified coder is completely interoperable with the standard, it requires no extra bits and adds no extra delay.

Although the ACB contribution can be controlled more precisely at the gains quantization stage, modifying only the target for the ICB search is the simplest and most effective approach.

INCREASING THE ROBUSTNESS OF CELP-BASED INCREASING THE ROBUSTNESS OF CELP-BASED CODERS BY CONSTRAINED OPTIMIZATIONCODERS BY CONSTRAINED OPTIMIZATION

22

22

2)(ˆ)(ˆ

)(ˆ

th

nccp

np

T

p Rnygnyg

nyg

E

ER

domain perceptual in the

onscontributi ICB and ACB scaled)-(non the:,

gains ICB and ACB quantized:ˆ,ˆ

c

cp

yy

gg

search parameters ICB for the target new the:2x

)()()( 012 nygnxnx

gain ACB the:

domain perceptual in theon contributi ACB scaled)-(non the:

domain perceptual in the signal target the:

domain perceptual in theerror quadaratic the:

0

1

1

g

y

x

D

n

nygnxD 2

011 )()(

a) original signal

b) synthesized signal using the standard encoding

c) synthesized signal using the constrained encoding

d) error signal between a) and b)

e) error signal between a) and c)

3. CONSTRAINED SEARCH ALGORITHM

If the ACB energy contribution is high, a fraction of this contribution is forced to be left in the new target. The ACB contribution is estimated by comparing, in the perceptual domain, the energy due to the ACB excitation to the target energy.

A value of Rth1 as low as 0.55 preserves the overall quality in clear channel while significantly improving the robustness against frame erasures.

5. SUBJECTIVE TESTS

Two subjective tests were carried out in order to compare the standard and modified codecs: in the clear channel condition (no frame erasures), and with a simulated frame loss rate of 10%.

Test features :

• 20 sentences for each condition.

• 12 listeners participated in the tests.

• Listening using binaural headphones.

• AMR-WB in mode 2 (12.65 kb/s).

Audio files on http://www.gel.usherb.ca/gournay/

standard codec modified codecstandard codec modified codec

NO PREFERENCE IN CLEAR CHANNEL

University of SherbrookeUniversity of SherbrookeSherbrooke, Quebec, CanadaSherbrooke, Quebec, Canada

Mohamed Chibani, Mohamed Chibani, Philippe Gournay and Philippe Gournay and

Roch LefebvreRoch Lefebvre

STRONG PREFERENCE WITH A FRAME LOSS RATE OF 10%

The quadratic error relative to the quantized ACB and ICB gains

Determine the ACB parameters (the gain and the delay)

11 thRR yesno

Update the target for the ICB parameters search

Evaluate the ACB contriubtion

n

n

x

p

nx

nyg

E

ER

)(

)(

2

1

22

0

1

n

n

ny

nynxg

)(

)()(

2

1

0

n

nth

ny

nxRg

)(

)(

2

2

1

10