Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… ·...

68
Adaptive SP & Machine Intelligence Complex and Multi-D Learning Systems Danilo Mandic room 813, ext: 46271 Department of Electrical and Electronic Engineering Imperial College London, UK [email protected], URL: www.commsp.ee.ic.ac.uk/mandic c D. P. Mandic Adaptive Signal Processing & Machine Intelligence 1

Transcript of Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… ·...

Page 1: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Adaptive SP & Machine IntelligenceComplex and Multi-D Learning Systems

Danilo Mandic

room 813, ext: 46271

Department of Electrical and Electronic Engineering

Imperial College London, [email protected], URL: www.commsp.ee.ic.ac.uk/∼mandic

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 1

Page 2: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Outline:Basically, to de-mystify multidimensional adaptive estimation

◦ Multidimensional and multichannel sensors – a unified approach toadaptive filtering of such signals

◦ Circularity – a unique signature of bivariate signals # second ordercircularity (properness) # augmented complex statistics

◦ Duality between the processing in R2 and C (isomorphism)

◦ The issue of complex gradient # CR calculus

◦ Covariance, pseudocovariance, and widely linear models

◦ Complex Wiener filter, complex least mean square (CLMS) andaugmented CLMS (ACLMS)

◦ Magnitude-only, phase-only, and least mean magnitude phase (LMMP)approaches

◦ Multivariate adaptive filters (any number of data channels)

◦ Applications: communications, radar and sonar, target tracking,renewable energy, smart grid

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 2

Page 3: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

We can use multidimensional filters within our usualadaptive filtering configurations!

System Identification Noise Cancellation

Input

Adaptive

Filter

Unknown

System Output

d(k)x(k)

_

e(k)

y(k)

Σ

(k)s(k) (k)o+N

N1

_

Reference input

Adaptive

Filter

Primary input

+

d(k)

x(k)

e(k)

y(k)

DelayFilter

y(k)

d(k)

+

_

Σ

x(k) Adaptive

e(k)

x(k)

Unknown

System

Adaptive

Filter

Delay

y(k)

+

_

Σ

d(k)

e(k)

Adaptive Prediction Inverse System Modelling

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 3

Page 4: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Multidimensional adaptive signal processing:Applications

Renewable Energy Body motion sensor Wearable technologies

2D and 3D anemometers 3D - position, gyroscope, speed Biomechanics

control of wind turbine gait, biometrics virtual reality

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 4

Page 5: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Wind sensors - 2D and 3D anemometers

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 5

Page 6: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Why modelling in C?

◦ Complex signals by design (communications, analytic signals, equivalentbaseband represenation to eliminate spectral redundancy)

◦ By convenience of representation (radar, sonar, wind field)

◦ Problem: Different algebra (no ordering - operator “≤” makes nosense!), and the notion of pdf has to be induced

◦ Problem: Special form of nonlinearity (the only continuouslydifferentiable function in C is a constant (Liouville theorem)

◦ Solution: Special statistics – augmented complex statistics (started inmathematics in 1992)

◦ We can differentiate between several kinds of noises (doubly whitecircular with various distributions nr ⊥ ni & σ2

nr = σ2ni

, doubly whitenoncircular nr ⊥ ni & σ2

nr > σ2ni

, noncircular noise)

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 6

Page 7: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

What is the right basis for real world data?Back to Dennis Gabor # but careful: Bedrossian and Nutall theorems

Consider

◦ Amplitude modulated signal x(t) = m(t) cos(ω0t) → m(t) # envelope

◦ Phase modulated signal x(t) = a cos(Φ(t)) → Φ(t) # phase

Problem: there is an infinite number of pairs [a(t),Φ(t)] s.t.m(t) cos(ω0t) = a cos(Φ(t))

Solution: an analytic transform z(t) = x(t) + H(x(t)

)= a(t)eΦ(t)

Remark#1: z(t) cannot be real, as F(z(t)

)= 0 for ω < 0

Remark#2: Hilbert transform (analytic signal) makes it possible toassociate a unique pair

[a(t),Φ(t)

]to any real x(t) = <

{a(t)eΦ(t)

}

Remark#3: For x(t) = a(t) cos Φ(t) ⇒ H{x(t)

}= a(t) sin Φ(t)

Remark#4: From instantaneous phase Φ(t) → instantaneousfrequency

f(t) = dΦ(t)/dt

so we have an excellent resolution and do not depend on stationarity

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 7

Page 8: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Standard adaptive filtering algorithms in CConsidered straightforward extensions of the corresponding algorithms in R– replace the vector transpose by the Hermitian transpose:

◦ CovarianceC = E{xxT} C = E{zzH}

◦ Autoregressive model and Wiener solution

w = R−1p w∗ = R−1p

◦ Least mean square (LMS) ; complex LMS [Widrow at al. 1975]

w(k + 1) = w(k) + µe(k)x(k) w(k + 1) = w(k) + µe(k)z∗(k)

◦ Real time recurrent learning (RTRL) for recurrent neural networks(RNN) [Williams & Zipser 1989] ; complex RTRL [Goh & Mandic2004]

w(k + 1) = w(k) + µe(k)Π(k) w(k + 1) = w(k) + µe(k)Π∗(k)

This is, however, valid only for circular complex random processes

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 8

Page 9: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Property of division algebras # complex (non)circularityNoncircularity of the wind distribution v(k) = |v(k)|eΦ(k)

Deterministic vs. Stochastic natureLinear vs. Nonlinear nature

Determinism

Nonlinearity

Linearity

Chaos

ARMA

(a)

(b)

(c)

? ?

?

?

? ??

?

NARMA

Stochasticity

Change in signal modality can indicatee.g. health hazard (fMRI, HRV)

Real world signals are denoted by ’???’

◦ ∃ a unique signature of complex signals?

◦ # degree of noncircularity

N

E

Wind

speed

Wind

direction

5

10

15

20

30

210

60

240

90

270

120

300

150

330

180 0

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 9

Page 10: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Circular vs noncircular complex random variablesCircularity = Rotation invariant distribution p(ρ, θ) = p(ρ, θ − φ)

circular variable = construct z = ρ cos(θ) + jρ sin(θ), θ ∼ U [0, 2π), ρ ∼ any pdf

Gaussian circular

−2 0 2−3

−2

−1

0

1

2

3

ρ cosθ

ρ s

inθ

|ρ| = 0

−2 0 2−3

−2

−1

0

1

2

3

ρ cosθ

ρ s

inθ

|ρ| = 0.8

Gaussian uncorrelated noncircular

Uniform circular

−2 0 2−3

−2

−1

0

1

2

3

ρ cosθ

ρ s

inθ

|ρ| = 0

−2 0 2−3

−2

−1

0

1

2

3

ρ cosθ

ρ s

inθ

|ρ| = 0.8

Gaussian correlated noncircular

◦ Covariance of z isc = E{zz∗} = E{|z2|}

◦ Pseudocovariance isp = E{zzT} = E{z2}

◦ Circularity quotient:ρ = p

c

◦ Circularity coefficient:|ρ| = |p|

c

Circularity quotientand coefficient quantifythe degree of non-circularity

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 10

Page 11: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Some observations

Although a formalism similar to that used for real valued adaptive filterscan also be used for complex valued adaptive filters, notice that in thiscase the cost function J(k) = 1

2|e(k)|2 = 12e(k)e∗(k) = 1

2(e2x + e2

y) is areal valued function of complex variable.

◦ Standard complex differentiability is based on the Cauchy–Riemann(C–R) equations and imposes a stringent structure on complexholomorphic functions;

◦ Cost functions are real functions of complex variable, that is J : C 7→ R,and so they are not differentiable in the complex sense, theCauchy–Riemann equations do not apply, and we need to developalternative, more general and relaxed, ways of calculating their gradients(CR-calculus);

◦ It is also desired that these generalised gradients are formally equivalentto standard complex gradients (their generic extensions) when appliedto holomorphic (analytic) functions.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 11

Page 12: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Complex Wiener filterNotice that we have used: (xHy)∗ = yHx

The Wiener solution can be obtained by minimising

J(w) = E[e(k)e∗(k)

]= E

[(d(k)−wHx(k)

)(d(k)−wHx(k)

)∗]

J(w) = E[|d(k)|2

]−wHE

[x(k)d∗(k)

]−wTE

[x∗(k)d(k)

]+ wHE

[x(k)xH(k)

]w

= σ2d −wHp− pHw + wHRw → ∇w∗J(w) = Rw − p = 0

The optimum solution wo and the minimum mean square error Jmin

wo = argminwJ(w) = R−1p Jmin = J(wo) = σ2

d − pHR−1p

Also, J(w) can be expressed as

J(w) = Jmin + (w −wo)H

R (w −wo) = Jmin + vHRv by noticing thatd(k) = wH

o x(k) + q(k) and e(k) = (wo−w(k))Hxk + q(k) = vH(k)x(k),where q(k) ∼ N (0, σ2

q).

Notice that J(w) is quadratic in w and has a global minimum forw = wo, with Jmin = σ2

q .

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 12

Page 13: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Derivative of the cost function 12e(k)e∗(k) and CLMS

As C-derivatives are not defined for real functions of complex variable

R− der:∂

∂z=

1

2

[∂

∂x− ∂

∂y

]R∗ − der:

∂z∗=

1

2

[∂

∂x+

∂y

]

and the gradient

∇wJ =∂J(e, e∗)

∂w=

[∂J(e, e∗)

∂w1, . . . ,

∂J(e, e∗)

∂wN

]T= 2

∂J

∂w∗=

∂J

∂wr+

∂J

∂wi︸ ︷︷ ︸pseudogradient

The standard Complex Least Mean Square (CLMS) (Widrow et al. 1975)

y(k) = wH(k)x(k)

e(k) = d(k)− y(k) = d(k)−wH(k)x(k) e∗(k) = d∗(k)− xH(k)w(k)

and ∇wJ = ∇w∗J (conjugate gradient direction)

w(k + 1) = w(k)− µ ∂e(k)

∂w∗(k)e∗(k) = w(k) + µe∗(k)x(k)

Thus, no need for tedious computations # the CLMS is derived in one line

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 13

Page 14: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Which derivative to we choose to compute the gradient?R-derivative vs. R∗-derivative?

Simulation for the CLMS derived using R-der. and R∗-der. (wo = 3)

0 100 200 300 400 5000

0.5

1

1.5

2

2.5

3

3.5

4Step−Size, µ = 0.01

Time Instant, n

Wei

ght E

stim

ate

0 100 200 300 400 5000

0.5

1

1.5

2

2.5

3

3.5

4Step−Size, µ = 0.1

Time Instant, n

Wei

ght E

stim

ate

∇w*

Gradient

∇w

Gradient

∇w*

Gradient

∇w

Gradient

...but R−der. leads to instability

Both R−der. & R*−derfollow the same trajectory...

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 14

Page 15: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Orthogonality principle & alternative forms for the CLMS

Consider the expected value for the CLMS update

E[w(k + 1)

]= E

[w(k)

]+ µE

[e(k)x∗(k)

]

It has converged when w(k + 1) = w(k) = w(∞) and thus the weightupdate ∆w(k) = µE[e(k)x∗(k)] = 0.

This is achieved for E[d∗(k)x(k)

]−E

[x(k)xH(k)

]w(k) = 0⇔ Rwo = p

that is, for the Wiener solution, wo = R−1p.

The condition E[e(k)x∗(k)] = 0 is called the “orthogonality condition”and states that the output error of the filter and the tap input vector areorthogonal (e ⊥ x) when the filter has converged to the optimal solution.

The following formulations for the CLMS produce identical results:

y(k) = xT (k)w(k) = wT (k)x(k) → w(k + 1) = w(k) + µe(k)x∗(k)

y(k) = wH(k)x(k) = xT (k)w∗(k) → w(k + 1) = w(k) + µe∗(k)x(k)

y(k) = xH(k)w(k) = wT (k)x∗(k) → w(k + 1) = w(k) + µe(k)x(k)

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 15

Page 16: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Types of convergence of CLMS

It is of greater interest, however, to analyse the evolution of the weights intime. As with any other estimation problem, we need to analyse the “bias”and “variance” of the estimator, that is:

◦ Convergence in the mean, to ascertain whether w(k)→ wo whenk →∞;

◦ Convergence in the mean square, in order to establish whether thevariance of the weight error vector v(k) = w(k)−wo(k) approachesJmin as k →∞.

The analysis of convergence of linear adaptive filters is mademathematically tractable if we use so called independence assumptions,such that the filter coefficients are

~ statistically independent of the data currently in filter memory, and

~ {d(l), x(l)} is independent of {d(k), x(k)} for k 6= l.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 16

Page 17: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Convergence of CLMS in the Mean

Assume d(k) = xH(k)wo + q(k), q(k) ∼ N (0, σ2q). Then

e(k) = xH(k)wo + q(k)− xH(k)w(k)

w(k + 1) = w(k) + µx(k)xHwo − µx(k)xH(k)w(k) + µq(k)x(k)

Subtract wo from both sides, and define the weight error vector

v(k)def= w(k)−wo

v(k + 1) = v(k)− µx(k)xH(k)v(k) + µq(k)x(k)

Applying the statistical expectation operator E{·} gives the mean weighterror recursion

E[v(k + 1)] =(I− µE[x(k)xH(k)]

)E[v(k)] + µE[q(k)x(k)]︸ ︷︷ ︸

=0

=⇒ E[v(k + 1)] =(I− µR

)E[v(k)]

Challenge: Unless the correlation matrix R is diagonal, there will becross–coupling between the coefficients of the weight error vector.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 17

Page 18: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Convergence of CLMS in the Mean (contd.)

Solution: Since the eigenvalue decomposition of R = QΛQH, therotation of the weight error vector v(k) by the eigenvector matrix Q, that

is, v′(k)def= QHE{v(k)}, decouples the evolution of its coefficients.

Proof: Pre-multiply both sides of mean weight vector error recursion by QH

QHE[v(k + 1)]︸ ︷︷ ︸v′(k+1)

= QH(I− µQΛQH

)E[v(k)] =

(I− µΛ

)QHE[v(k)]︸ ︷︷ ︸

v′(k)

Since the eigenvector matrix is unitary, i.e. QHQ = I, the modes ofconvergence are

v′(k + 1) = (I− µΛ) v′(k)

This recursion will be stable if the diagonal elements of (I− µΛ) satisfy

|1− µλi| < 1, for i = 1, . . . , N =⇒ 0 < µ <2

λmax<

2

tr[R]

Since tr[R] = NE{|x(k)|2}, an easier to estimate bound 0 < µ < 2NE[|x(k)|2|].

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 18

Page 19: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Convergence of CLMS in the Mean Square

As the filter coefficients converge in the mean, they fluctuate around theiroptimum values wo. As a result, the mean square error ξ(k) = E[|e(k)|2]exceeds the minimum mean square error Jmin by an amount referred to asthe excess mean square error, denoted by ξEMSE(k), that is

ξ(k) = Jmin+ξEMSE(k)Jmin=σ2

d=⇒ ξ(k) = σ2q+E

[vH(k)Rv(k)

]= σ2

q+tr[RK(k)]

We have used the identity E[vH(k)Rv(k)] = tr[RK(k)] = tr[K(k)R],where K(k) = E[v(k)vH(k)].

The excess mean square error depends on second order statisticalproperties of d(k),x(k),w(k), e(k). The plot showing time evolution ofthe mean square error is called the learning curve.

For convergence in the mean square the misadjustment

M =ξEMSE(∞)

ξmin=ξEMSE(∞)

σ2q

≈ 1

2µσ2

xN

must be bounded and positive, that is, (1− 12µtr[R] > 0), and therefore

0 < µ < 2/tr[R] ↔ 0 < µ < 2/(σ2xN).

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 19

Page 20: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Does Circularity Influence Estimation in C?Voltage Sag: A magnitude and/or phase imbalance

◦ For balanced systems, v(k) = A(k)eωk∆T → circular trajectory.

◦ Unbalanced systems, v(k) = A(k)eωk∆T +B(k)e−ωk∆T areinfluenced by the “conjugate” component.

◦ We need the complex conjugate when the modelling the signal.

Circularity Diagram

Real Part

Imag

inar

y P

art

Balanced

Type CSag

Type DSag

Strictly linear model yields biased estimateswhen system is unbalanced

0 0.1 0.2 0.3 0.445

50

55

Time (sec)

Freq

uenc

y (H

z)

Old MethodSmartFrequency

True Frequency

Strictly Linear Model

Type C Sag Type D SagBalanced

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 20

Page 21: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

What are we doing wrong # the Widely Linear Model

Consider the MSE estimator of a signal y in terms of another observation x

y = E[y|x]

For zero mean, jointly normal y and x, the solution is

y = hTx

In standard MSE in the complex domain y = hHx, however

yr = E[yr|xr, xi] & yi = E[yi|xr, xi]thus y = E[yr|xr, xi] + E[yi|xr, xi]

Upon employing the identities xr = (x+ x∗)/2 and xi = (x− x∗)/2y = E[yr|x, x∗] + E[yi|x, x∗]

and thus arrive at the widely linear estimator for general complex signals

y = hTx + gTx∗

We can now process general (noncircular) complex signals!

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 21

Page 22: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Widely linear autoregressive modelling in CStandard AR model of order n is given by

z(k) = a1z(k − 1) + · · ·+ anz(k − n) + q(k) = aTz(k) + q(k),

Using the Yule-Walker equations the AR coefficients are found from

a∗ = C−1c

a∗1a∗2...a∗n

=

c(0) c∗(1) . . . c∗(n− 1)c(1) c(0) . . . c∗(n− 2)

... ... . . . ...c(n− 1) c(n− 2) . . . c(0)

−1

c(1)c(2)

...c(n)

where c = [c(1), c(2), . . . , c(n)]T is the time shifted correlation vector.

Widely linear model Widely linear normal equations

y(k) = hT (k)x(k) + gT (k)x∗(k) + q(k)

[h∗

g∗

]=

[C PP∗ C∗

]−1 [cp∗

]

where h and g are coefficient vectors and x the regressor vector.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 22

Page 23: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

This is a rigorous way to model general complex signals!

Circularity for Ikeda map AR model of Ikeda signal

0 0.2 0.4 0.6 0.8 1 1.2 1.4−1.2

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

ℜ{z(t)}

ℑ {

z(t

) }

−20 −10 0 10 200

0.2

0.4

0.6

0.8

1

Covariance

Pseudocovariance

−20 −10 0 10 200

0.2

0.4

0.6

0.8

1

Covariance

Pseudocovariance

−20 −10 0 10 200

0.2

0.4

0.6

0.8

1

Covariance

Pseudocovariance

Covariances: Original Ikeda Widely linear AR of Ikeda

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 23

Page 24: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

The Augmented (widely linear) CLMS (ACLMS)

Widely linear model y(k) = hT(k)z(k) + gT(k)z∗(k)

h(k + 1) = h(k)− µ∇h∗J ⇒ ∇h∗J = −e(k)x∗(k)

g(k + 1) = g(k)− µ∇g∗J ⇒ ∇g∗J = −e(k)x(k)

Therefore, the ACLMS update

h(k + 1) = h(k) + µe(k)x∗(k)

g(k + 1) = g(k) + µe(k)x(k)

or in a more compact form (using augmented input and weight vectors)

wa(k + 1) = wa(k) + ηea(k)xa∗(k)

where η = µh = µg, wa(k) = [hT (k), gT (k)]T , xa(k) = [xT (k), xH(k)]T ,

ea(k) = d(k)− xaT(k)wa(k)

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 24

Page 25: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Performance of the ACLMS

Evaluated for both second order circular (proper) and improper signals.

0 500 100010

−2

10−1

100

101

102

103

Epoch number

Rp

Batch training

ar4, CLMS and ACLMS

wind, CLMS and ACLMS

lorenz, CLMS

lorenz, ACLMS

ikeda, ACLMS

ikeda, CLMS

0 20 40 60 80

−0.5

−0.4

−0.3

−0.2

−0.1

0

0.1

0.2

0.3

0.4

0.5

Samples

Rea

l Par

t

Online learning (Lorenz attractor)

original

CLMS

ACLMS

The ACLMS outperforms CLMS for second order noncircular signals.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 25

Page 26: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Wind modelling: Performance vs dynamics vs circularity

Data recorded in an urban environment over one day

0 1 2 3 4 5

x 106

0

0.5

1

1.5

2

2.5

3

3.5

4

4.5

Win

d m

agnitude

Sample number

high

low

medium

(a) Modulus of complex windover one day

1 2 100

2

4

6

8

10

12

14

16

18

Moving Average window size (s)

Pre

dic

tion G

ain

Rp (

dB

)

high

medium

low

(b) CLMS vs ACLMS for different wind regimes.CLMS - black, ACLMS - blue

Different wind regimes different dynamics,

v(k) = |v(k)|eΦ(k), |v| - speed, Φ - direction

Different dynamics different circularity properties impact of ACLMS

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 26

Page 27: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Real bivariate or complex: Isomorphism between C and R2

(also serves as a basis for the CR calculus)

z → za ↔[zz∗

]=

[1 1 −

] [xy

]

whereas in the case of complex–valued signals, we have

z → za ↔[

zz∗

]=

[I II − I

] [xy

]

For convenience, the “augmented” complex vector v ∈C2N×1 can be introduced as

v = [z1, z∗1, . . . , zN , z

∗N ]T

v = Aw, w = [x1, y1, . . . , xN,yN ]T

where matrix A = diag(J, . . . ,J) ∈ C2N×2N is blockdiagonal and transforms the composite real vector winto the augmented complex vector v.

2

jy

x

y

x

C

R

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 27

Page 28: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Duality between bivariate real and complex filtersDesired signal: d(k) = dr(k) + di(k), Input: x(k) = xr(k) + xi(k),

The bivariate (dual channel) real filter:[dr(k)

di(k)

]=

[aT (k) bT (k)cT (k) dT (k)

] [xr(k)xi(k)

]

The strictly linear model d(k) = wH(k)x(k) (w(k) = wr(k) + wi(k)) inthe CLMS is highly restrictive since it imposes the condition

a(k) = d(k) = wr(k) and b(k) = −c(k) = wi(k)

Augmented CLMS: d(k) = hH(k)x(k) + gH(k)x∗(k)[dr(k)

di(k)

]=

[(hr(k) + gr(k))T (hi(k)− gi(k))T

−(hi(k) + gi(k))T (hr(k)− gr(k))T

] [xr(k)xi(k)

]

Sufficient degrees of freedom to model general complex signals!The bound on the step-size which preserves convergence: 0 < µ < 2

λmaxThe bivariate and augmented complex correlation matrices related as

Ca = E[zazaH] = E[AωωTAH

]= AWAH

It then follows that λa = 2λω.

RACLMS and DCRLMS converge at the same speed when µr = 2µaclms.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 28

Page 29: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Duality: Simulations

−4 −3 −2 −1 0 1 2 3 4−4

−3

−2

−1

0

1

2

3

4

ℜ{z(t)}

ℑ{z

(t)}

0 0.2 0.4 0.6 0.8 1 1.2 1.4−1.2

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

ℜ{z(t)}

ℑ {

z(t

) }

−100 −50 0 50 1000

5000

10000

Lag

Covariance of the AR(4) signal

−100 −50 0 50 1000

5000

10000

Lag

Pseudocovariance of the AR(4) signal

−100 −50 0 50 1000

5000

Lag

Covariance of the Ikeda signal

−100 −50 0 50 1000

5000

Lag

Pseudocovariance of the Ikeda signal

AR(4) and Ikeda signal Covariances and pseudocovariances

Rp = 10 logσ2z

σ2e

Algorithm AR4 Ikeda Wind

Rp for DCRLMS 5.8423 3.9733 13.2604

Rp for CLMS 6.6380 2.4278 14.2941

Rp for ACLMS 6.6096 4.0330 14.8926

Rp for DCRLMS (double µ) 6.6096 4.0330 14.8926

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 29

Page 30: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Duality simulations: Dependence on the parameters(observe the possibility for better tuning using the complex ACLMS)

DCRLMS Lorenz (left) and wind signal (right)

00.1

0.20.3

0.40.5

00.1

0.20.3

0.4

18

20

22

24

26

28

30

µy

µx

Pre

dict

ion

gain

[dB

]

00.1

0.20.3

0.4

00.1

0.20.3

0.4

2

3

4

5

6

7

8

µy

µx

Pre

dict

ion

gain

[dB

]

ACLMS Lorenz (left) and wind signal (right)

0

0.1

0.2

0.3

0.4

0

0.1

0.2

0.3

0.420

22

24

26

28

30

32

µg

µh

Pre

dict

ion

gain

[dB

]

0

0.1

0.2

0.3

0.4

0

0.1

0.2

0.3

0.45

6

7

8

9

µg

µh

Pre

dict

ion

gain

[dB

]

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 30

Page 31: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Part 2: Advanced learning algorithmsand applications of

Complex-Valued Adaptive Filters

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 31

Page 32: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Application: Circularity trackerRelationship between a complex variable and its conjugate

Consider the problem of using a zero-mean r.v. z ∈ C to estimate itscomplex conjugate, that is

z∗ = w∗z

find an estimate of w that minimizes

JMSE = E{|e|2} = E{|z∗ − z∗|2}

The Wiener solution

wopt = c−1r =E{z2}E{|z|2} =

p

c

→ Circularity Quotient!!!

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 32

Page 33: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Real time tracker of complex circularity

Idea: Use an adaptive filter to estimate the complex conjugate of asignal using the original signal as the input.

Since the complex least mean square (CLMS) [Widrow 1975] estimates theWiener solution, we can configure it with input signal zk and the desiredsignal z∗k.

The one-tap filter weight is the estimate of the circularity quotient ρ.

Adaptive filtering configuration

zk

(·)∗

Σ

z∗

k

z∗

k

ek

w∗

k

CLMS algorithm

z∗k = w∗kzk

ek = z∗k − z∗kwk+1 = wk + µe∗kzk

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 33

Page 34: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Real time tracker of complex circularitySimulations on synthetic white Gaussian data

The real and imaginary parts of the

evolution of the CLMS weights when

tracking circularity.

0 500 1000 1500 2000 2500 3000

0

0.5

1

Real Part of the Circularity Quotient

Sample, k

EstimatedTrue

0 500 1000 1500 2000 2500 3000−0.5

0

0.5

1

Sample, k

Imaginary Part of the Circularity Quotient

EstimatedTrue

Synthetic signal was generated by

concatenating three segments of zero-mean

white Gaussian signals, zi,k, with different

properties, where

zi,k = xi,k + jyi,k zi ∼ N (0, c, pi)

i = {1, 2, 3}

Each segment:

Sample, k c pi |ρi|1− 1000 1 0.8j 0.8

1001− 2000 1 0.6 + 0.4j 0.72

2001− 3000 1 0 0

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 34

Page 35: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Real time tracker of complex circularitySimulations on wind sensor data

The circularity diagram of wind speeds in

the low, medium and high dynamic regimes.

−3 −2 −1 0 1 2 3

−3

−2

−1

0

1

2

3

Wind speed (East−West)

Win

d sp

eed

(N

orth

−S

outh

)

Circularity Diagram of Wind Signal

HighMediumLow

Wind Regimes

Low

Medium

High

The estimate of circularity coefficient for

wind signals in the low, medium and

high dynamic regimes using the proposed

algorithm.

0 5000 10000 150000

0.5

1

Circularity Coefficient of Wind Signal

Circ

ular

ity c

oeffi

cien

t

Sample, k

High

Low Medium

Wind signal modelled as wind a complex

number

s = sE + jsN

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 35

Page 36: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Circular vs. Noncircular: Fast convergence vs. accuracyBest of both worlds # a Hybrid Filter

Virtues of Convex Combination (λ ∈ [0, 1])

yλx + (1− )yλx

Convexity ⇒ existence and uniqueness of solution ©

filter 2

filter 1

x(k)

d(k)

∑e(k)

y(k)∑

-

-

-λ(k)

1− λ(k)

e1(k)

y1(k)

y2(k)

e2(k)

w1(k)

w2(k)

Let Filter1 be trained by CLMS and Filter2 by ACLMS

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 36

Page 37: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Adaptation of the mixing parameter λ

To preserve their inherent characteristics, subfilters, Filter1 and Filter2updated based on their own errors:

◦ Linear eclms(k)

◦ Widely linear eaclms(k)

The convex mixing parameter λ is updated based on based on

J(k) =1

2e(k)e∗(k) =

1

2|e(k)|2 ∇λJ(k)λ=λ(k) = e(k)

∂e∗(k)

∂λ(k)+e∗(k)

∂e(k)

∂λ(k)

and the stochastic gradient based update of λ becomes

λ(k + 1) = λ(k) + µλ

[e(k)

(yaclms(k)− yclms(k)

)∗

+ e∗(k)(yclms(k)− yaclms(k)

)]

We must ensure that the value of λ(k) belongs to 0 ≤ λ(k) ≤ 1.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 37

Page 38: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Performance of hybrid filters – prediction settingconsider an LMS/GNGD hybrid – GNGD is fast, LMS with small µ has good M

Hybrid attempts to follow the subfilter with better performance.If one of the subfilters diverges, hybrid filters still converges.

0 1000 2000 3000 4000 5000 6000 7000 800010

−6

10−4

10−2

100

102

Samples

MS

E (db)

LMS, µ = 0.01

LMS, µ = 1.8 Hybrid Filter

0 200 400 600 800 1000−6

−4

−2

0

2

4

Number of iteration

MS

E in

dB

LMS

GNGDHybrid Filter

Learn. curves for pred.: Left # linear signal Right # nonlinear signal

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 38

Page 39: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

The hybrid CLMS ↔ ACLMS filter (prediction setting)

Left: circular AR(4) process Right: Noncircular Ikeda map

0 1000 2000 3000 4000 5000−20

−19

−18

−17

−16

−15

−14

−13

Number of iterations

10*l

og

10(e

2)

CLMS

Hybrid

ACLMS

0 500 1000 1500 2000 2500 3000−16

−15

−14

−13

−12

−11

−10

−9

−8

Number of iterations (k)

10

*lo

g10e

2

ACLMSCLMS

Hybrid filter

◦ The CLMS has half the number of parameters of ALMS◦ Hence, it initially converges faster for all test signals

~ Both filters perform similarly for proper data in terms of the steady state~ ACLMS has superior steady state properties for the improper Ikeda map~ Hybrid filter: both fast covergence and excellent steady state properties!

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 39

Page 40: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

A continuum between phase-only and magnitude-onlycost functions (when does phase error matter?)

y − filter output

updatephase only

d(k)y(k−1)

ylmp

y

ycma

clms

magnitude only update

real axis0

ima

gin

ary

axis

1

1

d − teaching signal

So, when does the (complex)phase error matter to the overallperformance?

• Answer #1: Constant ModulusChannel Estimator (CMCE)[Rupp 1998] can estimate time-varying channels better if phaseerror is ignored.

• Answer #2: Least Mean Phase(LMP) [Tarighat/Sayed 2004]can estimate complex symbolsbetter if phase error term is addedto update.

• Answer #3: Least MeanMagnitude Phase (LMMP)[Douglas/Mandic ICASSP 2011]employs a combined magnitude-and phase-based criterion bestperformance with stable behavior.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 40

Page 41: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

On phase-only cost functions

The least mean phase (LMP) learning is based on the cost function

Jp(k) =(∠dk − ∠yk

)2

where e(k) = ∠dk − ∠y(k), and y(k) = wH(k)x(k).

Then,

Jp(k) =(∠dk − ∠yk

)2=[∠dk − arctan

=(yk)

<(yk)

]2=[∠dk − arctan

=(wHk xk)

<(wHk xk)

]2

and Jp,min is achieved for wk = wopt, to give

Jp,min = Jp(wopt) =[∠dk − arctan

=(wHoptxk)

<(wHoptxk)

]2=[∠dk − arctan

=(αwHoptxk)

<(αwHoptxk)

]2

Therefore Jp(wopt) = Jp(αwopt), for any α and there is no uniqueminimum.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 41

Page 42: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Decomposing the squared error cost function

Decompose the well-known squared error cost function:

JLMS = |e(k)|2 = |d(k)− y(k)|2

= |d(k)|2 + |y(k)|2 − [d(k)y∗(k) + d(k)y∗(k)]

= (|d(k)| − |y(k)|)2+ 2|d(k)||y(k)| − [d(k)y∗(k) + d(k)y∗(k)]

= (|d(k)| − |y(k)|)2

︸ ︷︷ ︸Magnitude Error

+ 2|d(k)||y(k)| [1− cos(θd − θy)]︸ ︷︷ ︸Phase Error

The Least Mean-Magnitude Phase (LMMP) algorithm can be derived as

w(k + 1) = w(k)− µm∇w∗JMag − µp∇w∗JPhase

= w(k) + µm (|d(k)| − |y(k)|) yk|yk|

x∗k + µp

(d(k)− |d(k)| yk|yk|

)x∗k

If µp = µm, the LMMP simplifies into the CLMS!

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 42

Page 43: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Example: Channel equalisation in digital communications

Equaliser

≈1/Hc(f){s[n]} {s[n-D]}∑Channel

Hc(f)

v[n] :

channel

noise

transmitted

signal

H fH fc

c

( )( )

⋅ =

11

2 Channel equalization is a simple way of mitigating the detrimental effects caused by a

frequency-selective and/or dispersive communication link between sender and receiver.

2 During the training phase of channel equalization, a digital signal s[n] that is known to

both the transmitter and receiver is sent by the transmitter to the receiver

2 The received signal x[n] contains two signals: the signal s[n] filtered by the channel

impulse response, and an unknown broadband noise signal v[n]

2 The goal is to filter x[n] to remove the inter-symbol interference (ISI) caused by the

dispersive channel and to minimize the effect of the additive noise v[n]

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 43

Page 44: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Example: Digital communications # continued

−4 −2 0 2 4−4

−3

−2

−1

0

1

2

3

4

Re{s(n)}

Im{s

(n)}

Input signal constellation

0 5 10 15 20−1

0

1

2

3

4

5x 10

8

Samples

Impulse Response

Am

plit

ud

e

Transmission channel

−40 −20 0 20 40−40

−30

−20

−10

0

10

20

30

40

Re{x[n]}

Im{x

[n]}

Received signal x[n]

−5 0 5−5

−4

−3

−2

−1

0

1

2

3

4

5

Re{y[n]}

Im{y

[n]}

Equalized signal y[n]

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 44

Page 45: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Beamforming example

• Beamforming Model: ULA, λ/2 spacing

• Sources: BPSK, BPSK, QPSK, QPSK

• Angle of Arrival: −45◦, 8◦,−13◦, 30◦

• Number of Antenna Elements: 3

• Algorthims: ACLMS and CCLMS

• Desired Signal: d(k) = s∗p(k), 1 ≤ p ≤ 4

• Step Size: µ = 0.0001.

Compare: Convergence Rates, Steady State MSE

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 45

Page 46: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Output signal constellations

CCLMS ACLMS

ACLMS has lower MSE because of non-circular binary sources

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 46

Page 47: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Signal constellations: channel equalizationChannel: 3-tap with frequency offset Source phases: time-varying

Performance meas.: average inter-symbol interfer., ISI (not dependent on phase)

Moral :

Signal phase uncertainty

can harm channel

amplitude and phase

estimation performance,

unless it is mitigated within

the algorithm itself.

LMMP in red addresses this

uncertainty explicitly.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 47

Page 48: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Case Study: Frequency estimation in smart gridSources of frequency deviation

Transmission: well-modelleddistribution side is not

138kV

L

G

L

LL

G

L

120V

120V

available: f, V, I, L

3−5MVA

local distributionhigh voltage transmission

poorly modelled (no sensors)well modelled

transformerpole

substation

46kV

substation

GΣ L

Σ G

Σ L

GΣLΣ

G

L

L

G

L L

L

C3

C2C1

M

M

M

100%

GsubLsub

50%

30%

20%

12kV

circuit (local grid)Σ

Block diagram of power grid Nodal estimation

◦ Dual character of load/supply f+ for G > L and f− for G < L

◦ Harmonics and freq. drifts from loads with nonlinear V − I properties

◦ Transient stability issues cause inaccurate frequency estimates, alsoswitching on/off the shunt capacitors in reactive power compensation

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 48

Page 49: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

The three-phase power system and αβ transformation

Three-phase system where Va(k), Vb(k), Vc(k) are the peak values.

va(k) = Va(k)cos(ωk4T + φ)

vb(k) = Vb(k)cos(ωk4T + φ− 2π

3)

vc(k) = Vc(k)cos(ωk4T + φ+2π

3)

The αβ transform - a complex signal which carries the same information

v0

vα(k)vβ(k)

=

√2

3

√2

2

√2

2

√2

21 −1

2 −12

0√

32 −

√3

2

va(k)vb(k)vc(k)

For balanced systems v0 = 0, and thus the complex Clarke’s voltage

v(k) = vα(k) + vβ(k) = A(k)e(ωk∆T+φ) +B(k)e−(ωk∆T+φ)

A(k) =√

6(Va(k)+Vb(k)+Vc(k))

6 and B(k) =√

6(2Va(k)−Vb(k)−Vc(k))

12 −√

2(Vb(k)−Vc(k))

4

Clearly, unbalanced systems are noncircular!

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 49

Page 50: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Noncircularity in unbalanced voltage conditions

◦ Balanced system: Va(k) = Vb(k) = Vc(k), A(k) = const, B(k) = 0,and v(k) is on a circle

◦ Unbalanced system: Va(k), Vb(k), Vc(k) are not identical~ A(k) is no longer constant, B(k) 6= 0~ v(k) is not on a circle → a degree of noncircularity

−1 −0.5 0 0.5 1

−1

−0.5

0

0.5

1

ℜ{⋅}

ℑ{⋅

}

balanced and unbalanced system

Some types of voltage sags

Type A Type C Type D

−1 1

−1

1

ℜ{⋅}

ℑ{⋅

}Type A

−1 1

−1

1

ℜ{⋅}ℑ

{⋅}

Type C

−1 1

−1

1

ℜ{⋅}

ℑ{⋅

}

Type D

Their circularity diagrams

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 50

Page 51: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Simulations: Several successive sags

−1

0

1

Typ

e C

sa

g

−1

0

1

Typ

e D

sa

g

0 0.02 0.04 0.06 0.08 0.1−2

0

2

Time (sec)

Typ

e D

sa

g+

Ha

rmo

nic

s

0 0.1 0.2 0.3 0.4 0.545

46

47

48

49

50

51

52

Time (sec)

Fre

quency (

Hz)

CLMS

ACLMS

Phase voltages for different sags Linear and widely linear frequency estimation

◦ Initially, the power system (50Hz) was operating normally and bothCLMS and ACLMS converged to 50Hz

◦ The widely linear ACLMS had advantage in the subsequent type C andD sags and under harmonics

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 51

Page 52: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Using the widely linear model for frequency estimation

The widely linear model is able to estimate the frequency for both circular(balanced) and noncircular (unbalanced) voltages.

0 0.1 0.2 0.3 0.445

50

55

Time (sec)

Freq

uenc

y (H

z)

Old MethodSmartFrequency

True Frequency

Strictly Linear ModelWidely Linear Model

Type C Sag Type D SagBalanced

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 52

Page 53: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Introduction to Distributed Adaptive FiltersMultiple adaptive filters collaborating in a network

Node i

Neighborhood of Node i

Connection

Sparse Dense−28

−27

−26

−25

−24

−23

Avera

ge M

SE

(dB

)

Average MSE Across 6 Nodes

Connection Level

D−ACLMS

The Diffusion ACLMS at node iInput: zi(k) = [xTi (k),xHi (k)]T ,Weights: wi(k) = [hTi (k),gTi (k)]T

yi(k) = wHi (k)zi(k)

ei(k) = di(k)− yi(k)

ψi(k + 1) = wi(k) + µie∗i (k)zi(k)

wi(k + 1) =

N∑

`=1

a`iψ`(k + 1)

The weighting coefficients satisfyN∑

`=1

a`i = 1 =⇒ aii = 1−∑

6=i

a`i

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 53

Page 54: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Performance of the Diffusion-CLMS andDiffusion-ACLMS in the smart grid

Frequency estimate at a randomly selected

node with multiple frequency events

between 0.1 s and 0.5 s

0 0.1 0.2 0.3 0.4 0.547

48

49

50

51

52

Time (sec)

Freq

uenc

y (H

z)

SNR: 60dBSNR: 25dBReference

0.12 0.14 0.16 0.18 0.2

48

48.5

49

49.5

50

Freq

uenc

y (H

z)

Time (sec)0.29 0.3 0.31 0.32 0.33

48

48.5

49

49.5

50

Freq

uenc

y (H

z)

Time (sec)

D-ACLMSACLMS

The average MSE of the frequency estimate

of D-ACLMS and D-CLMS under a Type D

sag.

30 35 40 45 50 55 60 65 70−40

−20

0

20

Avera

ge

MS

E(d

B)

Average MSE Across 6 Nodes

SNR (dB)

D−CLMS

D−ACLMS

1 2 3 4 5 6−30

−25

−20

−15

MS

E(d

B)

Node

Individual MSE at Each Node

ACLMS (No diffusion) D−ACLMS

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 54

Page 55: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Conclusions

◦ Adaptive processing of noncircular complex signals

◦ These arise e.g. due to the nonlinearity of transievers (I/Q imbalancemitigation), multipath, in some modulation schemes (BPSK, GSMK,PAM, offset-QPSK) widely used in practical communications systems

◦ Standard solutions assume second order circularity of signal distributionsand are inadequate when the signals are observed through nonlinearsensors, mixtures of sources, or noise model which is not doubly white

◦ This is achieved based on augmented complex statistics and widelylinear modelling

◦ The complex LMS (CLMS) and augmented CLMS (ACLMS) introduced

◦ Convergence of CLMS and ACLMS – from the booklet provided(Chapter 6)

◦ This promises enhanced practical solutions in a variety of applications(interference supression, DoA estimation, blind estimation)

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 55

Page 56: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

A comprehensive account of widely linear modelling

D.Mandic and V. Goh, “Complex Valued Nonlinear Adaptive FIlters:Noncircularity, Widely Linear and Neural Models”, Wiley 2009.

◦ Unified approach to the design of complex

valued adaptive filters and neural networks◦ Augmented learning algorithms based on

widely linear models◦ Suitable for processing both second order

circular (proper) and noncircular (improper)

complex signals◦ ACLMS, augmented Kalman filters,

augmented CRTRL, linear and nonlinar IIR

filters◦ Adaptive stepsizes, dynamical range

reduction, collaborative adaptive filters,

statitical tests for the validity of complex

representations

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 56

Page 57: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Some back–up material

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 57

Page 58: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Appendix: Transform domain signal processingUsually complex valued (e.g. Fourier Domain)

Frequency domain adaptive filtering

Adaptive Filter ∑x[n]

Transformation

Function

e[n]

d[n]

Transformation

Function

Inverse

Transformation

functionyout

-+

A transform can be applied to the inputs

of an adaptive filter in order to maximise

the performance of the LMS algorithm, i.e.

a white input with equal eigenvalues.

Various techniques can be used such as

Lattice Filters, the FFT which requires

the Complex LMS algorithm, the Discrete

Cosine or Wavelet transforms, or sub-band

filters.

0 5 10 15 20−1

−0.5

0

0.5

1

ACF

Correlation lag

Co

rre

latio

n

0 0.2 0.4 0.6 0.8 1−15

−10

−5

0

5

10

Normalized Frequency (×π rad/sample)Po

we

r/fr

eq

ue

ncy (

dB

/ra

d/s

am

ple

)

Burg Power Spectral Density Estimate

0 5 10 15 200

0.5

1

ACF

Correlation lag

Co

rre

latio

n

0 20 40 60 80 100−5

0

5

Sample Number

Sig

na

l va

lue

s

x[n] = 0.8*x[n−1] + w[n]

0 0.2 0.4 0.6 0.8 1−15

−10

−5

0

5

10

15

Normalized Frequency (×π rad/sample)Po

we

r/fr

eq

ue

ncy (

dB

/ra

d/s

am

ple

)

Burg Power Spectral Density Estimate

0 20 40 60 80 100−4

−2

0

2

4

Sample Number

Sig

na

l va

lue

s

x[n] = −0.8*x[n−1] + w[n]

a < 0 → highpass, a > 0 → lowpass

x(k) = a1x(k − 1) + w(k)

Highpass signal: fast changing intime # however, smooth spectrum

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 58

Page 59: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Appendix: The CLMS algorithmStep-by-step derivation

Consider a complex valued FIR filter.

The weight update equation for a real valued filter is

w(k + 1) = w(k) + ηe(k)x(k)

Now, the weights and errors and teaching signal and input are complexvalued.

Hence

e(k) = er(k) + ei(k)

d(k) = dr(k) + di(k)

x(k) = xr(k) + xi(k)

w(k) = wr(k) + wi(k)

y(k) = xT (k)w(k)

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 59

Page 60: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Appendix: The CLMS cost function

The complex LMS should simultaneously adapt the real and imaginarypart, minimising in some sense both er(k) and ei(k), with respect to theaverage total error power, given by

E [e(k)e∗(k)] =1

2E[e2r(k) + e2

i (k)]

=1

2E[e2r(k)

]+

1

2E[e2i (k)

]

Since the two components of the error are in quadrature relative to eachother, they cannot be minimised independently.

The derivation of the complex LMS is similar to the derivation fo theoriginal LMS, except that the rules of complex algebra must be observed.

Notice that(xT (k)w(k)

)∗= (x∗(k))

Tw∗(k), e.g.

x× w = [(xr + xi)(wr + wi)] = [xrwr − xiwi + (xrwi + xiwr)] Afterconjugation we have [xrwr − xiwi − (xrwi + xiwr)]. On the other handx∗w∗ = [(xr − xi)(wr − wi] = [xrwr − xiwi − (xrwi + xiwr)],

which is the same as when we conjugate the whole expression.

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 60

Page 61: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Appendix: The CLMS Derivation

e∗(k) = d∗(k)−(xT (k)

)∗w∗(k)

Therefore, for a GD adaptation we have

w(k + 1) = w(k)− 1

2η∇w (e(k)e∗(k))

where ∇(e(k)e∗(k)) = ∇r(e(k)e∗(k)) + ∇i(e(k)e∗(k)).

Now,

∇r(e(k)e∗(k)) =

∂(e(k)e∗(k))∂w1r

∂(e(k)e∗(k))∂w2r...

∂(e(k)e∗(k))∂wNr

= e(k)∇r(e∗(k)) + e∗(k)∇r(e(k))

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 61

Page 62: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Appendix: The CLMS derivation – contd.

Notice that ( in a simplified way)

e(k) = d(k)− x(k)w(k) = d(k)−− [xr(k)wr(k)− xi(k)wi(k) + (xr(k)wi(k) + xi(k)wr(k))]

e∗(k) = d(k)− [xr(k)wr(k) + xi(k)wi(k)− (xr(k)wi(k) + xi(k)wr(k))]

The partial derivatives wrt to wr and wi are

∂e(k)

∂wr(k)= −[xr(k) + xi(k)] = −x(k)

∂e(k)

∂wi(k)= −[−xi(k) + xr(k)] = −x(k)

∂e∗(k)

∂wr(k)= −[xr(k)− xi(k)] = −x∗(k)

∂e∗(k)

∂wi(k)= −[xi(k)− xr(k)] = −[−x∗(k)]

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 62

Page 63: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Appendix: The CLMS Derivation – Complex Gradients

The instantaneous gradient with respect to its real and imaginarycomponent becomes

∇r(e(k)e∗(k)) = e(k)∇r(e∗(k)) + e∗(k)∇r(e(k))

= e(k)(−x∗(k)) + e∗(k)(−x(k))

∇i(e(k)e∗(k)) = e(k)∇i(e∗(k)) + e∗(k)∇i(e(k))

= e(k) (x∗(k)) + e∗(k) (−x(k))

Now, applying the method of steepest descent, to the real and imaginarypart of the weights we have

wr(k + 1) = wr(k)− 1

2η∇r (e(k)e∗(k))

wi(k + 1) = wi(k)− 1

2η∇i (e(k)e∗(k))

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 63

Page 64: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Appendix: The CLMS Derivation – Filter Update

Having in mind that w(k + 1) = wr(k + 1) + wi(k + 1), we have

w(k + 1) = w(k)− 1

2η [∇r (e(k)e∗(k)) + ∇i (e(k)e∗(k))]

If the gradients are now substituted in the above equation, we have

∇r (e(k)e∗(k)) + ∇i (e(k)e∗(k)) =

e(k)(−x∗(k)) + e∗(k)(−x(k)) + [e(k) (x∗(k)) + e∗(k) (−x(k))] =

−erx∗ − eix∗ − erx + eix− erx∗ − eix∗ + erx− eix =

−2erx∗ − 2eix

∗ = −2e(k)x∗(k)

Therefore, the complex form of the LMS algorithm is given by

w(k + 1) = w(k) + ηe(k)x∗(k)

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 64

Page 65: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Notes

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 65

Page 66: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Notes

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 66

Page 67: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Notes

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 67

Page 68: Adaptive SP & Machine Intelligence Complex and Multi-D ...mandic/SE_ASP_LN/ASP_MI_Lecture_6_C… · Circularity quotient: ˆ= p c Circularity coe cient: jˆj= jpj c Circularity quotient

Notes

c© D. P. Mandic Adaptive Signal Processing & Machine Intelligence 68