Analysis-by-Synthesis Codingcs.haifa.ac.il/~nimrod/Compression/Speech/S4ABYS2004.pdf · A-by-S...

Analysis-by-Synthesis Coding

Nimrod PelegUpdate: Jan 2004

A-by-S Coding

• RELP, APC, ATC and SBC have sufficient quality at rates of 9.6-16Kbps

• Two main reasons:– coded speech is not analyzed to check

efficiency (No distortion control)– Errors accumulated from previous frames are

not taken into account in current frame (errors propagate since no “reset” in loop

Analysis-by-Synthesis LP Coders• Analysis-by-Synthesis coders use close-loop for

the excitation sequence determination• An optimization process determines an

excitation sequence which minimizes a measure of the difference between input and coded speech

• A weighting function is chosen to optimize for human ear

• A better quality achieved for 4.8-9.6Kbps

General A-by-S Encoder

WeightingFilter

IndexSelection

PitchSynthesisFilter

Input Speech

LPCSynthesisFilter

Error Signal

Error Minimization

Selected OptimumExcitationIndex

Time-Varying Filter

A-by-S Operation• Initialize filters (low-level random noise)• Determine LPC coeff. on a frame of samples• Divide the frame to subframes, for each:

– calculate pitch parameters (delay & scaling)– Pitch and LPC filters are grouped together. The

cascaded filter is used to determine the best secondary excitation, to minimize error between original and synthesized speech

• Final speech is generated by passing optimum secondary excitation through cascaded filter

General A-by-S Decoder

Selected OptimumExcitation

IndexPitch

SynthesisFilter

LPCSynthesis

Filter

OutputSpeech

Time-Varying Filter

Long-term Short-term

Short-Term Predictor

• Models the short-term correlation in the speech : Spectral envelope

• It is time varying to reflect changes in speech spectrum

• Adaptation rates: 20-30mSec• Filter order: 8-12

Long-Term Predictor

• Models the long-term correlation in the speech: Pitch period

• Higher adaptation rates: 5-10mSec• LTP can be omitted (as in MPLPC)• Both LTP and STP have buffer memory of

previous frame that provide smoothing effect to distortion (caused by block oriented analysis)

Error Minimization • Minimizing perceptual error by selecting filter(s)

parameters such that MSE between original and reconstructed signal is minimized

• This results in a flat spectrum of quantization error, (better results can be obtained by exploiting the auditory masking of human hearing)

• All the process done in transmitter only !

Excitation Generator

• Includes all information missing in the model parameters (residual signal)

• Several methods for it:– CELP (Code Excited LP)– SELP ( Self Excited LP)– MPLPC (Multi-Pulse LP)– RPE-LPC (Regular-Pulse LPC)– MELP (Mixed Excitation LP)

Excitation Types

Time-Varying

Filter

Vector 1Vector 2

Vector N

0 ... ..L-1

RPE-LPC

OutputSpeech

• Excitation vector chosen from a set of N possible stochastic vectors (+gain)

• The vector that produces lowest error is the desired excitation sequence

• Only the index needed to be sent to receiver• Many schemes to codebook design, among

them: unit variance white Gaussian random noise (Atal)

Speech Synthesis Model

+ +White

GaussianInnovation

Fine structure (Pitch)

Spectral Envelope (Formants)

Long termPredictor

Short termPredictor

‘Speech’

CELP Basic Scheme

Long termPredictor

Short termPredictor

StochasticCodeBook

Original Speech

PerceptualWeighting

Filter (W(z))SquareAverage

PerceptualError

Optional Improvements

• Adaptive CodeBook (closed loop pitch prediction)

• Low-Delay (LD-CELP): Backward adaptive CELP Coder:– Linear prediction of order 50– Rate: 16Kbps (Toll quality)

Vector Sum Excitation LP (VSELP)Orthogonal vectors create a basis for linear

combinations of white Gaussian noise

Adaptive CodeBook

ExcitationCodeBook 1

ExcitationCodeBook 2

Original Speech

PerceptualError

Rate: 8Kbps

MPE-LPC• The first A-by-S scheme (Multi Pulse Excitation)• Excitation is specified by a small set of pulses

with different amplitudes located at non-uniformly spaced intervals

• The only a-priory decision to make: The number of pulses required per block (usually about 1 per 10 samples, or 8 pulses for 10mSec)

• Locations and amplitudes should be sent

RPE-LPC

• regular Pulse Excitation LPC• Pulses are equally spaced, and positions are

specified by the first pulse position• This prevents the need for position

information• Used in GSM

MPE Vs. RPE

GSM Encoder

AnalysisRPE

InverseRPE

Long termpredictor

Transmitted

SideInformation

EncodedData

Similar Scheme to classic DPCM

GSM Decoder

+Inverse RPE

LPCSynthesis

Recon.speech

EncodedData

Performance and Complexity (after Spanias,1994)

Technique Rate(bps) MOS MIPSPCM (G.711) 64K 4.3-4.4 0.01ADPCM (G.726) 16K-40K 4.1-4.2 ~2LD-CELP (G.728) 16K 4.0-4.2 ~19RPE-LTP (GSM) 13k 3.47 6VSELP (IS-54) 8k 3.45 13.5CELP (DoD1016) 4.8k 3.2 16LPC-10e (DoD1015) 2.4k 2.3 7CS-ACELP (G.729/729a) 8k 4.2 17 / 10CS-ACELP (G.723) 5.3k/6.3k 3.5/4.0 14.6 / 16(Conjugate-Structure Algebraic CELP)

MELP 2.4Kbps 2.9* 20

* Better in Noisy environment !

WeightingFilter

IndexSelection

ExcitationCodeBook

SpectralCodeBook

Input Speech

SynthesisFilter

• Excitation signal is derived from the past history of the coded excitation function itself

• When the best vector is found, it is fed back with oldest samples discarded

• Its effectively a CELP coder with an adaptive codebook, containing no codebook...

MELP Principles

• Mixed Excitation• Aperiodic Pulses• Adaptive Spectral Enhancement• Pulse Dispersion• Fourier Magnitudes

MELP Encoder/Decoder

ShapingFilter

P(n)pulse train

W(n)white noise

Bandpass voicing strength

PositionJitter

AperiodicFlag

ShapingFilter

LPCFilter

GainAdaptiveSpectral

Enhancme

PulseDispersion

Filter

Synthesizedspeech

LPC Analysis

Speech

VoicingAperiodic &Analysis

Pitch Analysis

Fourier MagnitudeAnalysis

25 bit

4+1 bit

Quantization

Gain Analysis

Encoder Decoder

MELP Coder

Scheme

LPCANALYZER

SPEECHINPUT

PITCHDETECTION

VOICED/UNVOICEDDECISIONS

10 LPC COEFFICIENTS

V/UV FOR EACH BAND

APERIODIC FLAG

CALCULATETHE RESIDUAL FFT

FINDING10 FOURIER

MAGNITUDES

10 FOURIERMAGNITUDES

Analysis-by-Synthesis Codingcs.haifa.ac.il/~nimrod/Compression/Speech/S4ABYS2004.pdf · A-by-S...

Documents

Transcript of Analysis-by-Synthesis Codingcs.haifa.ac.il/~nimrod/Compression/Speech/S4ABYS2004.pdf · A-by-S...

Team 7 Personal Investment Michael Dellosa Andrew Nimrod.

NIMROD Simulations of a DIII-D Plasma Disruption

RA'S (URAŠ') OFFSPRING - NIMROD AND THE PHARAOHSenochcalendars.webs.com/Nimrods Family.pdf · son was Nimrod, who was known as Osiris in Egypt, ... King Nimrod and his brother Tammuz,

2020-2021 Student Handbook · Any RELP/RELG course (3)2 Credits from RELH, RELP, RELT, or RELB (6)2 2-The department recommends RELP 485, RELP 330, RELH 316, and RELH 317 to meet

Moving from Web-based collaboration to the mobile arena - Mobile, Task-Oriented Solutions Nimrod Geva Product Group Manager, KWizCom nimrod@kwizcom.com.

SharePoint Wiki Plus and Amdocs Pedia case study Nimrod Geva nimrod@kwizcom.com KWizCom.

SharePoint Enterprise Aggregation Caching Feature Product Overview Nimrod Geva Product Group Manager, KWizCom nimrod@kwizcom.com.

RELP (2019) 7(2): 232-254 DOI: 10.30486/relp.2018.663453relp.khuisf.ac.ir/article_663453_fcd94f5a8ce7fec8eddc02e576759ecc.pdf · RELP (2019) 7(2): 232-254 / 233 1. Introduction Over

Talent42 2014 Nimrod Hoofien Facebook Recruiting

Green DayTabs - Nimrod

Nimrod - The Genius Mastermind

NIMROD Applications for MHD Control · NIMROD Applications for MHD Control Carl Sovinec University of Wisconsin-Madison for the NIMROD Team () 16th Workshop on MHD Stability Control

Nimrod: MongoDB Shell in NodeJS (JSConfUY 2015)

SharePoint List Custom Actions Feature Product Overview Nimrod Geva Product Group Manager, KWizCom nimrod@kwizcom.com.

Entertainment System DV 7200 Instruction Manual€¦ · Audio Code MP3 WMA Bit Rate 16kbps - 320kbps CBR/VBR 16kbps - 320kbps Sampling Rate ... 7. To play tracks at random During

Nimrod & NetSolve Sathish Vadhiyar. Nimrod Sources/Credits: Nimrod web site & papers.

Historical Nimrod

The Nimrod Review - Official Documents

Nimrod Sea Assets Nimrod Sea Assets Limited Limited ... · Limited Interim Financial Report ... COMPANY OVERVIEW Nimrod Sea Assets ... Financial Statements of the Company for the

NIMROD/ORION & THE MYSTERY OF BABYLON