Post on 25-Mar-2019
Signal Processing 1Linear Operators
Univ.-Prof.,Dr.-Ing. Markus RuppWS 17/18
Th 14:00-15:30EI3A, Fr 8:45-10:00EI4
LVA 389.166
Last change: 11.1.2018
Resume: Sampling revisited Now remember the following:
Thus, by selecting p(t) we can select the space that fits our original signal best!
2Univ.-Prof. Dr.-Ing.
Markus Rupp
( )( )
( )
( )nBtpctf
nBtptfc
nBtctf
nBttfc
cnTfdtnBttf
n
n
n
n
n
−=
−=
−=
−=
==−
∑
∑
∫
2)(
)2(),(: todgeneralize be can
2sinc)(
2sinc),(
)(2sinc)(
Resume Question: What would be the best
sampling/interpolation for the following signal (linear piecewise spline, lego):
3Univ.-Prof. Dr.-Ing.
Markus Rupp
nT (n+1)T (n+2)T (n+3)T (n+4)T
Resume Answer: rect(t,T) Question: does the sampling time
matter?
4Univ.-Prof. Dr.-Ing.
Markus Rupp
t0 T
Resume Sampling
leads to the following reconstruction
5Univ.-Prof. Dr.-Ing.
Markus Rupp
nT (n+1)T (n+2)T (n+3)T (n+4)T
nT (n+1)T (n+2)T (n+3)T (n+4)T
6Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume: Vector spaces Definition 2.10: A linear vector space S over a set of
scalars T (C,R,Q,Z,N,B) is a set of (objects) vectors together with an additive „+“ and a scalar multiplicative „.“ operation, satisfying the following properties:
1) S is a group under addition. 2)
3) W.r.t. the multiplicative operation there exists an Identity (One) and a Zero element:
yaxayxaxbxaxbaxabxbaSxa
SyxTba
+=++=+=∈
∈
)()(,)()(,
:,, have we in and everyFor
00;1 == xxx
7Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume: Groups Definition 2.11 (Group): A set S for which a binary
operation * (operation w.r.t two elements of S) is defined, is called a group if the following holds: 1) for each a,b in S it holds that: (a*b) in S 2) there exists an identity element e in S, so that for every
element a in S: e*a=a*e=a. 3) for each element a in S there exists an inverse element b in
S, so that: a*b=b*a=e. 4) The binary operation * is associative, i.e.: (a*b)*c=a*(b*c)
The group is denominated by (S, *). If, furthermore, for each pair a,b in S it holds that a*b=b*a
(commutativity), then the group is called commutative or Abelian (Ger.: Abelsch).
8Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume: Rings Definition 2.12 (Ring): A set S for which two
binary operations + and * are defined, is called a ring if the following holds: (S,+) is a commutative group (Abelian) The operation * is associative. Distributivity holds w.r.t. +: a*(b+c)=a*b+a*c,
(a+b)*c=a*c+b*c. A ring is denominated by (S,+, *). Note: for * there does not need to be an identity
or inverse element. If there exists additionally the inverse element to
*, then it is called Skew Field (Ger.: Schiefkörper).
11Univ.-Prof. Dr.-Ing.
Markus Rupp
Learning GoalsLinear Operators (4Units, Chapters 4-7) Linear Transformations, Functionals (Ch 4.1) Null- and other spaces (Ch 4.5) Orthog. Subspaces, Matrix Rank (Ch 4.7) Projections (Ch 4.8-4.9) Factorization/Decomposition
Eigenvalue-decomp. Hermitian mat. (Ch 5.1-5.2,6.1-6.3) Filter design based on eigenfilters (Ch 6.9)
Subspace techniques: PHD,MUSIC,ESPRIT (Ch 6.10-6.11) Singular value decomposition SVD (Ch 7) condition number (Ch 4.10) MIMO transmission, blind source separation
Need Abelian group w.r.t +
12Univ.-Prof. Dr.-Ing.
Markus Rupp
And need distributivitythus ring!!!
Linearity Definition 4.1: A transformation A:XY in which
X and Y are vector spaces, defined over a ring is called linear, if for each x1,x2 from X and scalars α1,α2 from R we have:
A[α1 x1+α2x2]= A[α2x2+α1 x1]=α1A[x1] +α2A[x2]
Examples for linear operators are matrices, sampling, derivatives and convolutional integrals (functionals).
13Univ.-Prof. Dr.-Ing.
Markus Rupp
Linearity Note, in maths it is even more precise:
A transformation A:XY in which X and Y are vector spaces, defined over a ring are called linear, if for each x1,x2 from X and scalars α1,α2over a number field (Ger.: Zahlenkörper) K we have :
A[α1 x1+α2x2]=α1A[x1] +α2A[x2]
However, we (engineers) restrict K for our functions to the set of real numbers!
14Univ.-Prof. Dr.-Ing.
Markus Rupp
Linearity Examples for linear operators are:
Example 4.1 A complex valued number z from C is formed by a vector x from R2:
A[z] = x =[real(z),imag(z)]T
Example 4.2 A quadruple s=[s1,s2,s3,s4] from C4 is mapped onto a 4x4 matrix from C4x4 :
[ ]1 2 3 4* * * *2 1 4 3* * * *3 4 1 2
4 3 2 1
s s s ss s s s
A ss s s ss s s s
− − = − −
15Univ.-Prof. Dr.-Ing.
Markus Rupp
Linear Operators Examples for linear operators are:
Example 4.3 Let a continuous function g(t) from C[0,1] being sampled at fix time points 0<t1<t2<...<tn<1:A[g(t)]=[g(t1), g(t2),..., g(tn)] in Rn
Example 4.4 A function f:XR(C) that maps from a vector space X onto real (complex) numbers is called a functional. If it is linear, then it is called a linear functional:
∫
∫
∫
∞
∞−
−=
−=
=
dttjtxxf
dttgtxxf
dttxT
xf
b
a
T
)exp()()(
)()()(
)(1)(
3
2
01
ω
τ
16Univ.-Prof. Dr.-Ing.
Markus Rupp
Linear Operators Example 4.5: Consider the causal sequence xk;
k=0,1,2,.. The mapping of the sequence onto a sum
is a linear operator. Example 4.6: Consider the Hermitian operator
H[.], that transposes a matrix and additionally builds the conjugate complex value of all elements. (Ger.: adjungierte Matrix, Engl.: adjoint matrix)
∑=
=k
llk xs
0
[ ] [ ]( ) RAAAABB
AHABAHABHHH
HH
∈+=+=+
====
2,1221122112211
222111
;
;
αααααααFrench mathematician Charles Hermite (24.12.1822 – 14.1.1901)
17Univ.-Prof. Dr.-Ing.
Markus Rupp
Linear Operators Definition 4.2: Linear Functional: f:XR,
f(ax+by)=af(x)+bf(y).
Remark 1: all inner products over functions can be interpreted as linear functionals.
Remark 2: all continuous, linear functionals in the Hilbert space can be described by inner products (Riesz‘ Theorem).
)(),()()()(2 tgtxdttgtxxfb
a
=−= ∫ τ
18Univ.-Prof. Dr.-Ing.
Markus Rupp
Linear Operators Note that all functional units of a MIMO
transmission can be considered as linear!
Sample
0fc
STCoding
Modu-latorg(t) gk
s1(t)
Modu-lator 0fc
s2(t)
Let the modulation alphabet not be restricted to fixed signal points!
19Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators We already introduced vector norms and showed
that they can be also used for matrices as induced norms.
This can be extended towards linear operators:
Definition 4.3: Is the norm of an operator finite, then we call this operator bounded (Ger.: beschränkt): ||A[.]||p<M<oo, or, ||A[x]||p<M||x||p
[ ][ ]
[ ]px
ppx
p
pxpp
xAxxA
xxA
AAp 100,
supsupsup. =≠≠ =
=== ind
20Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators Theorem 4.1: A linear operator A:XY is
bounded, i.e., , if and only if it is continuous, i.e., for some finite and positive M and L.
Proof: Let‘s assume A is bounded, then we find that
But this is identical to the condition for continuity.
Starting with continuity we can thus also conclude boundedness!
[ ] xMxA ≤[ ] [ ] xdLxdxAxA ≤+−
[ ] xdMxdA ≤
[ ] [ ] xdLxdxAxA ≤+−[ ] =xdA
[ ] [ ] =+− xdxAxA
21Univ.-Prof. Dr.-Ing.
Markus Rupp
Inner Products Lemma 4.1: Inner products are continuous. I.e. if xnx
is true in an inner product space S, then <xn,y> <x,y> for y from S.
Proof: If xn converges, it must also be bounded, thus
Then, we have:∞<≤ Mxn
.,,,0
,,,
yxyxxx
yxxyxxyxyx
nn
nnn
towards converges then Since →−
−≤−=−
22Univ.-Prof. Dr.-Ing.
Markus Rupp
Continuity of Functionals With such technique we can also show
the following: Let f(x)=<x,g(x)> be a functional, then
f(x) is continuous if g(x) is bounded:
.,)(,)(,0
,,,
gxxfgxxfxx
gxxgxxgxgx
nnn
nnn
==→−
−≤−=−
towards convergesthus Since
23Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators Such properties are also important for the
inverses of linear operators.
Notation: If we concatenate an operator n times:
A[A[…]]=An[.]
A0[.]=I[.] is the identity operator A-1[A[.]]=I[.] defines the inverse of an operator.
24Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators Theorem 4.2: Let ||.|| be an operator norm
satisfying the submultiplicative property and A[.]:XX a linear operator with ||A[.]||<1. Then (I-A)-1 exists and:
( )
( )∑
∑∞
=
−
∞
=
−
−=
=−
0
1
0
1
i
i
i
i
AIA
AAI
25Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators Proof: Let ||A[.]||<1. If I-A is singular then
there is at least one vector x unequal to 0 so that (I-A)[x]=0. Thus we also have x=A[x] and
In this case we must have which is a contradiction. I-A is not singular !
By successive multiplication we have:
[ ] [ ].AxxAx ≤=
( )( ) kk AIAAAIAI −=++++− −12 ..
[ ] 1. ≥A
26Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators Since
and ||A[.]||<1 it must be true that
And therefore also:
Note that A[.] must be square XX !
kk AA ≤
0lim =∞→k
k A
( ) IAAIi
i =
− ∑
∞
=0
Submultiplicative Property
27Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators Example 4.7: Consider the following
operator:
Obviously, this operator is bounded:
Thus, its inverse must exist in the form:
=
2/2/
*yx
yx
A
2/1[.]2/12/2/
* =⇒
=
=
Ayx
yx
yx
A
( ) ∑∞
=
− =−0
1
i
iAAI
28Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators Still Example 4.7: We thus have to show that
=
=
=
=
−
=
−⇒
=
=
+
++
12*
1212
2
22
3*
332
**0
2/2/
;2/2/
2/2/
;4/4/
2/2/
)(;2/2/
;
k
kk
k
kk
yx
yx
Ayx
yx
A
yx
yx
Ayx
yx
A
yyx
yx
AIyx
yx
Ayx
yx
A
( ) ( )
=
−
=
− ∑∑
∞
=
∞
=
−
ba
ba
AIAba
Aba
AIi
i
i
i
00
1 or
29Univ.-Prof. Dr.-Ing.
Markus Rupp
Bounded Operators Still Example 4.7: We thus have to show that
=
−
+
−
=
−
+
=
−
+
−
=
+
=
+
=
∑
∑∑
∑∑∑
=
=
+
=
=
+
==
ba
bba
bba
bba
A
yx
yx
yx
yx
yx
yx
yx
Ayx
Ayx
A
i
i
i
i
i
i
i
i
i
i
i
i
2/2/
32
2/2/
34
2/2/
32
34
4/1112/1
4/111
2/12/1
***0
**
*0
12
0
2
0
12
0
2
0
( )
=
−
=
− ∑∑
∞
=
∞
= ba
bba
Aba
AIAi
i
i
i
2/2/*
00
30Univ.-Prof. Dr.-Ing.
Markus Rupp
Remark Since most of the linear operators are being used
in form of matrices, we will mainly deal with matrices in the following, thus A[x]=Ax.
Most of the shown properties, in particular the projection properties are not limited to matrices!
31Univ.-Prof. Dr.-Ing.
Markus Rupp
Null- and other Spaces Definition 4.4: The vector space, spanned
by the columns of a matrix A=[a1,a2,..,an]:XY is called its column space or range (Ger.: Spaltenraum von A):
The second row is the more general form and describes the column space of a linear operator.
[ ]( ) ( ) [ ][ ] XxyxAYy
aaaAaaaAR nn
∈=∈===
for :,...,,,...,span. 2121
32Univ.-Prof. Dr.-Ing.
Markus Rupp
Null- and other Spaces Definition 4.5: The vector space spanned by the
(conjugate complex) rows of a matrix A=[b1
T;b2T;...;bn
T]:XY, is called row space of A (Ger.: Zeilenraum) or column space of the adjoint operator A*[.]:
Note: the Hermitian of a matrix is a special form of the adjoint (Ger.: adjungierter) linear operator: A*[x]=AHx.
( ) ( )
[ ] YyxyAXx
b
bb
AbbbAR
H
Tn
T
T
n
∈=∈=
==
for :
;...;;span[.] 2
1
**2
*1
*
Adjoint Operator Definition 4.6: Consider matrix A, then
AH is called the Adjoint matrix = Hermitian of a matrix.
Now consider linear operator A[]:
A*[] is called the adjoint operator.
33Univ.-Prof. Dr.-Ing.
Markus Rupp
[ ] ( )[ ] [ ]yAxyAadjxyxA
yAxyxA H
*,,,
,,
==
=
Adjoint Operator
34Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.8: Adjoint operator
( ) [ ]nn
n
n
n
nn
yAntrecytx
dtyx(t)dttxx(t)
tx
*
*21
21
*
)(ˆ
?)(ˆ
),(ˆ
=−=
=
∑
∫∑∫
∞
−∞=
+
−
∞
−∞=
∞
∞−
:Answer
that such there is rec(t)
-1/2 1/2
Self-Adjoint In some cases the operators are self-
adjoint Definition 4.7: A self adjoint
operator satisfies:
This is given for all Hermitian matrices.
35Univ.-Prof. Dr.-Ing.
Markus Rupp
yAxyAxyxA H ,,, ==
36Univ.-Prof. Dr.-Ing.
Markus Rupp
Null- and other Spaces Definition 4.8: The vector space defined
by the solutions A[x]=0 of a linear operator A[.]:XY is called nullspace N(A) of A[.] (Ger.: Nullraum). It is also called kernel (Ger.: Kern) of the operator: ker(A)
Definition 4.9: The vector space defined by the solutions A*[y]=0 of a linear operator A[.]:XY is called nullspace N(A*) of A* or left nullspace (Ger.: linker Nullraum): ker(A*).
37Univ.-Prof. Dr.-Ing.
Markus Rupp
Null- and other Spaces Example 4.9: Let A be a linear matrix operator
with:
Then the column space and nullspace of A are given by:
=
000101001
A
=
=
010
span)(;010
,011
span)( ANAR
38Univ.-Prof. Dr.-Ing.
Markus Rupp
Null- and other Spaces
Still Example 4.9: The row space (column space of the adjoint matrix) and the left nullspace are given by:
=→
=
010000011
000101001
HAA
( ) ( )
=
=
100
span;101
,001
span HH ANAR
Kernel
39Univ.-Prof. Dr.-Ing.
Markus Rupp
Null- and other Spaces Example 4.10: Consider the convolution:
The nullspace of the linear operator L consists of all functions x(t) which convolved with h(t) result in zero.
In the Fourier domain these are the functions X(jω) that have no overlap with H(jω). Thus:
( ) ∫ −=t
dthxtxL0
)()()( τττ
0)()(|)()( ≡= ωω jXjHtxLN
Null and other Spaces
40Univ.-Prof. Dr.-Ing.
Markus Rupp
[5,2,8]T defines N(AH)
41Univ.-Prof. Dr.-Ing.
Markus Rupp
Null- and other Spaces Let vector b be from the column space of A. Then
we have: A linear combination of the columns of A must be exactly b: Ax=b.
Given Ax=b, There is exactly one solution if b is in the column space
of A and the columns are linearly independent. There is no solution if b is not in the column space of A. There is an infinite amount of solutions if b is in the
column space of A, and its columns are not linearly independent.
Proofs follow later...
42Univ.-Prof. Dr.-Ing.
Markus Rupp
Null- and other Spaces Based on these definitions we can already state
the following relations for linear operatorsA:XY:
( )
( ) YANXANXAR
YAR
⊂
⊂⊂
⊂
*
*
)(
)([ ]
YyXx
yxA
∈∈
=
43Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.11 Hands Free Telephone (Freisprechtelefon)
Far end speaker
localspeaker+ echo of far endspeaker
44Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.11: System Identification
The essential problem of a hands free telephone is a system identification. Such a problem can be described in form of a system of linear equations.
Far endspeaker
Localspeaker
45Univ.-Prof. Dr.-Ing.
Markus Rupp
System Identifikationwith random signals
Assume a (at least WSS) signal at the input of an LTI system with impulse response w(τ). The correlation of input and output signal is:
w(τ)x(t) y(t)
xy
xyxx )()()(
rwR
rdttwtr
xx =
=−∫ ττ
46Univ.-Prof. Dr.-Ing.
Markus Rupp
System Identifikationwith random signals Not knowing the correlation terms, the linear system of
equations can be approximated by observations:
In order to solve a system of order m (dim(w)=m), n must be at least m. In practice, often n=2m.
xk must be persistent exciting!
[ ] [ ]
=
=
=
=
∑∑
∑∑
=
−
=
==
n
k kn
kTk
n
k k
nR
n
kTk
Tk
xx
nnw
nw
n
EwE
rwR
xx
1
1
1
1
)(
1
11
11
yxxx
yxxx
yxxx
kk
kk
kkk
xy
47Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.11 Consider now the problem of stereo transmission
of a source S (far end speaker):
Note: gk(i) are the various paths to the
microphones.
s gk(1)
gk(2)
xk(1)
xk(2)
M F
48Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.11 With our vector notation we find the following
relations:
[ ][ ]
2,1;
...
,...,,
,...,,
)()(
221
21
11
)(1
)(1
)(0
)(
)(1
)(1
)()(
==
=
=
=
=
+−+−
−−
+−−
−
+−−
igSx
S
ss
sssss
S
gggg
xxxx
ik
ik
Tk
LkLk
kk
Lkkk
k
TiL
iii
TiLk
ik
ik
ik
Hankel matrix
49Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.11 Consider the following relation:
Construct the vector xkT=[xk
T(1),xkT(2)].
Note, the following ACF matrix is singular!
)()()()(
)()()()()()(
ijTk
ik
jT
jk
iTjTk
iTjiTk
gxgSg
gSggSggx
==
==
∑∑==
==
n
k kT
kkT
k
kT
kkT
kTk
n
kkxx SggSSggS
SggSSggS
nxx
nnR
1)2()2()1()2(
)2()1()1()1(
1
11)(
50Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.11 Consider the following vector for arbitrary values
α unequal to zero:
For this vector we have:
Obviously, u is in the nullspace of Rxx(n).
[ ])1()2( , TTT ggu −= α
01)( )1(
)2(
1)2()2()1()2(
)2()1()1()1(
=
−
= ∑
= gg
SggSSggS
SggSSggS
nunR
n
k kT
kkT
k
kT
kkT
kxx
51Univ.-Prof. Dr.-Ing.
Markus Rupp
ResumeLinear operators are defined on a…Group…Ring…linear vector space .
52Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume Rings Definition 2.12 (Ring): A set S for which two
binary operations + and * are defined, is called a ring if the following holds: (S,+) is a commutative group (Abelian) The operation * is associative. Distributivity holds w.r.t. +: a(b+c)=ab+ac, (a+b)c=ac+bc.
A ring is denominated by (S,+, *). Note: for * there does not need to be an identity
or inverse element. If there exists additionally the inverse element to
*, then it is called Skew Field (Ger.: Schiefkörper).
53Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume
Boundedness of the operators requires
continuity and
vice versa
Theorem 4.1: A linear operator A:XY is bounded if and only if it is continuous.
57Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume Consider the following CDMA transmission in which
each of the four users has a different code from:(as in UMTS)
The receiver observes a linear combination of all four signals:
−−−−−−
=
1111111111111111
A
−−−−−−
=
−−
+
−
−+
−−
+
=
44
33
22
11
44332211
1111111111111111
1111
1111
1111
1111
shshshsh
shshshshr
58Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume Design an LS filter that selects only the first user
(single-user detector):
( )
11
44
33
22
11
1
1111
1111111111111111
1111111111111111
41
1111111111111111
41
1111
hs
shshshsh
rF
AAAAFA
LS
HHLS
=
−−−−−−
=
==→
=−
59Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume Then we can apply classical techniques:
1. Matched filter (in this case equivalent to LS filter):
2. Elimination of channel influence:
[ ] [ ] 1111
1111
1,1,1,1411,1,1,1
41 hshsrFLS =
=
[ ] 11
1,1,1,141
ˆ1 srFh LS =
Resume Consider an operator on continuous
functions that maps them to be even:
Is the operator self-adjoint, i.e.,
60Univ.-Prof. Dr.-Ing.
Markus Rupp
2)()()]([ xfxfxfP −+
=
)]([),()()],([ xgPxfxgxfP =
Resume Yes, we find
What is the projection onto its orthogonal complement?
odd functions
61Univ.-Prof. Dr.-Ing.
Markus Rupp
)(),()(),()()]([),(
)(),()()(),()()],([)()()();()()(
xgxfxgxfxfxgPxf
xgxfxgxgxfxgxfPxgxgxgxfxfxf
eeeoe
eeoee
oeoe
=+=
=+=
+=+=
2)()(
2)()(
2)()()]([ xfxfxfxfxfxfxfPI −−
=−+
−+
=−
62Univ.-Prof. Dr.-Ing.
Markus Rupp
Remember: Null- and other SpacesWe can already state the following relations for linear operatorsA:XY:
( )
( ) YANXANXAR
YAR
H
H
⊂
⊂⊂
⊂
)(
)(
YyXx
yxA
∈∈
=
63Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume
( )
( ) YANXANXAR
YAR
H
H
⊂
⊂⊂
⊂
)(
)(
Consider system of linear equations:Ax=y; x from X and yfrom Y:
yxxA =
=
000000100110111
Adjoint OperatorAH: AHz=x<Ax,z>=<x,AHz>
65Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume Let
Which values has got?
−
−=
145321
A
( )
( ) ==
==
)(
)(
ANAR
ANAR
H
H( )
( )
=
−
−=
=
−
=
111
span)(145
321
span
00
span42
51
span)(
ANAR
ANAR
H
H
66Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume
( )
( )A
A
Rttx
AA
xxx
H
H
N : Aof nullspace the in is
not. of space row inthe is thus
: of space (column of AR space column the in solution norm-) (minimum afor Search
:againConsider
A
∈
−
−=
−−
−
∈
+
−=
−=
−
−
111
111
111
;101
;707
145
321
2
;111
101
)
64
145321
3
2
1
67Univ.-Prof. Dr.-Ing.
Markus Rupp
Sumspaces Definition 4.10: Let V and W be linear subspaces,
then the space S=V+W is called inner sumspace consisting of all combinations x=v+w.
Definition 4.11: Let V and W be linear subspaces. The direct sumspace
is constructed by the pairs (v,w) . V+W and are different linear spaces. If V and
W are disjoint, they have the same mathematical properties and are said to be isomorphic
WVT ⊕=
W Vx
WV ⊕
68Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Definition 4.12: Let S be a vector space
and V and W both subspaces of S. V and W are called orthogonal subspaces if for each pair v from V and w from W we have: <v,w>=0.
Definition 4.13: Let V be a subset of a vector space S with inner product. The space of all vectors orthogonal to the vectors in V is called orthogonal complement (Ger.: orthogonaler Komplementärraum) and is denoted:
WV =⊥
W Vx
69Univ.-Prof. Dr.-Ing.
Markus Rupp
Sumspaces Example 4.12:
)1,1,1(),0,1,1(),1,0,1(),1,1,0(),1,0,0(),0,1,0(),0,0,1(),0,0,0()1,1,0,0,0,1(),1,1,0,0,0,0(),1,0,0,0,0,1(),1,0,0,0,0,0(
),0,1,0,0,0,1(),0,1,0,0,0,0(),0,0,0,0,0,1(),0,0,0,0,0,0(
span)1,1,0(),1,0,0(),0,1,0(),0,0,1(),0,0,0(
)1,1,0(),1,0,0(),0,1,0(),0,0,0()0,0,1(),0,0,0(
=
=⊕
=+
=∪
=∪
==
=
⊥
⊥
⊥
⊥
⊥
S
VV
SVVSVV
VVVW
V
70Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Example 4.13: Let be S=GF(2)3. The vectors
v=(1,0,0) and w=(0,0,1) are from S. They span the subspaces V and W:V=span(v)=(0,0,0),(1,0,0)W=span(w)=(0,0,0),(0,0,1)
Both spaces are orthogonal subspaces. The subspace V has the orthogonal complement :
)1,1,0(),1,0,0(),0,1,0(),0,0,0(=⊥V
71Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Which vectors span the orthogonal
complement? Answer:
Note:
( ))1,0,0(),0,1,0(span
)1,1,0(),1,0,0(),0,1,0(),0,0,0(==⊥V
)1,1,1(),1,1,0(),1,0,1(),0,1,1(),1,0,0(),0,1,0(),0,0,1(),0,0,0(
)1,1,0(),1,0,0(),0,1,0(),0,0,1(),0,0,0(
=≠=∪ ⊥
SS
VV
72Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Note: let be v from V and w from W. Assume that V and
are orthogonal complements in S. Then we do not necessarily have:
Typically for such properties we need complete spaces (Cauchy series!)
( )SVVSVVspanSVV
=+
=∪
=∪
⊥
⊥
⊥
WV =⊥
Projections We had already used projections P in
the context of LS: P=P2. Definition 4.14: In an orthogonal
projection its range and its nullspace are orthogonal subspaces, in oblique projections, this is not the case.
73Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.14 Consider the following projection
matrix
Its range and nullspace are given by
74Univ.-Prof. Dr.-Ing.
Markus Rupp
=
100
αA
( ) ( )
−
=
=
α1
;10
spanANspanAR
Example 4.14 Thus, we have in general an oblique
projection Only for α=0, we have an orthogonal
projection
Note that the eigenvalues of projection matrices are either 0 or 1.
75Univ.-Prof. Dr.-Ing.
Markus Rupp
76Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Theorem 4.3: Let be V and W two subspaces of a
vector space S (not necessarily a complete one) with inner product. Then we have:
0,0)6
0)5)4)3)2)1
==
=∩∈
=
⊂⊂
⊂
⊥⊥
⊥
⊥⊥⊥⊥
⊥⊥
⊥⊥
⊥
SS
xVVxVV
VWWVVV
SV
then If
:have we then If
of subspace complete a is
Recall Lemma 4.1: Inner products are
continuous. I.e. if xnx is true in an inner product space S, then <xn,y> <x,y> for every y from S.
77Univ.-Prof. Dr.-Ing.
Markus Rupp
78Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces
Proof (Part 1): Let be xn a series of vectors in with xnx and v from V. Because of the continuity property of the inner product (Lemma 4.1) we have:
Note: The following is not true: This is because the space S is not complete. There can be Cauchy-series in V that are not in .
⊥V
⊥
→
∈⇒
∈==⇒=
Vx
Vvvxvxvx nxxn nfor ;0,,lim0,
⊥⊥= VV
⊥⊥V
SV of subspace complete a is ⊥)1
79Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Theorem 4.4: Let be A:XY a bounded linear
operator between two Hilbert spaces X and Y and R(A) as well as R(AH) complete subspaces. Then we have: 1)
2)
Proof:
( ) ( ) ( ) ( ) ⊥⊥
⊥⊥
==
==
)();(
)(;)(
ANARANAR
ANARANARHH
HH
( )
( ) ⊥⊂⊥
====
∈=∈=∈
)(
00,,,,
)(0
ARANzy
xzAxzxAzy
XxyxAARyzAANz
H
H
HH
therefore and Thus
for with and from thus ,Let
XANYARYyXxyxA ∈∈∈∈= )(,)(,,;
For every z,we find a y, such that…
80Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Further:
Since
( )( ) ( )
( ) ( )H
HH
H
ANAR
ANzzA
zxzAxzxA
ARxAXxARz
YARxAYyxA
⊂
∈=
==
∈∈∈
⊂∈∈=
⊥
⊥
)( :Finally
, thereforeand 0 Thus
andevery for ,0,
, and nowLet
, :have We
( ) ( )( ) ⊥
⊥⊥
=
⊂⊂
)(
)()(
ARAN
ANARARANH
HH
:havemust we
and
XANYARYyXxyxA ∈∈∈∈= )(,)(,,;
No matter what x we select, for all z…
Orthogonal Subspaces With the same technique, we can also
prove that:
Lemma 4.2 R(A*[.])=R(A*[A[.]]) or for matrices: R(AH)=R(AHA)
81Univ.-Prof. Dr.-Ing.
Markus Rupp
82Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Theorem 4.5: (Fredholm’s Alternative Theorem,
Ger.: Fredholmscher Alternativsatz) Let A be a bounded, linear operator. The equation
Ax=bhas at least one solution if and only if <b,v>=0 for every vector v from N(AH): AHv=0. More precisely:
Particular for matrices: the equation Ax=b has (at least) one solution if and only if bHv=0 for each vector v for which AHv=0.
( )HANbARb ⊥⇔∈ )(
Erik Ivar Fredholm (7.4.1866 – 17.8.1927) was a Swedish mathematician.
83Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Proof:
Converse: Consider now that <b,v>=0 if v from N(AH), but Ax=b has no solution. Since b is not from R(A), we assume that b= br+b0 , with br from R(A) and let b0 be orthogonal to the vectors from R(A). Thus we have <Ax,b0>=0 for all x and thus AHb0=0. Moreover, if b0 is in N(AH) and 0=<b,b0>=<br+b0,b0>= <br,b0>+<b0,b0>=<b0,b0> b0=0, b is therefore from R(A).
( )00,,, ====
∈=
xvAxvxAv,b
ANvbxAH
H
Then
. and Let
84Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces We thus have proven the existence but not the
uniqueness of the solution.
Theorem 4.6: The solution of Ax=b is unique if and only if the unique solution of Ax=0 is x=0, thus N(A)=0.
Proof hint: start with A(x+∆x)=b
Orthogonal Subspaces Example 4.15 Fredholm‘s Theorem
85Univ.-Prof. Dr.-Ing.
Markus Rupp
( )
( )
=
=
−
−=
=
00
)(52
41
121
654
321
)(
spanspan
spanspan
ANAR
ANAR
H
H
=
635241
A
;975
== bxA 0
121
, =
−
−b
86Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Remember: The dimension of a vector space
defines the number of linearly independent vectors required to span the space.
Definition 4.15: The rank (Ger.: Rang) of a matrix A is defined by the dimension of its column space (row space).
Example 4.16: Let an mxn matrix A be of rank r:
( ) ( )( )( ) ( )( ) rmANrnAN
rARrARH
H
−=−=
==
dim;)(dimdim;)(dim
remember Definition 2.23: Let T be a Hamel
basis for S. The cardinality of T is the dimension of S, |T|=dim(S). It equals the number of linearly independent vectors, required to span the space S.
87Univ.-Prof. Dr.-Ing.
Markus Rupp
88Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Example 4.17:
( )
( )
=
−=
=
=
===
=
00
span;014
span)(
5205
,241
span;52
,51
span)(
2,3,2;5205241
T
T
ANAN
ARAR
rnmA
89Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Definition 4.16: An mxn matrix is called of full
rank if rank(A)=min(m,n). If a matrix is not of full rank then it is called
rank-deficient. Theorem 4.7: For matrix products AB we have:
( ) ( )( )( )( ) ( )HH
HH
BRABR
ABNAN
ARABRABNBN
⊂
⊂
⊂⊂
)()()()(
90Univ.-Prof. Dr.-Ing.
Markus Rupp
Orthogonal Subspaces Proof: (only part 1, thus )
If Bx=0, then ABx=0. Thus every x from N(B) is also in N(AB).
Note that the 2nd and 4th property leads to the following:
)rank()rank()rank()rank(
BABAAB
≤≤
)()( ABNBN ⊂
We can neverincrease the rank by a matrix product!
How does LS help in solving sets of linear equations? Example 4.18: Consider the following:
an overdetermined set of equations. What happens if we apply LS?
91Univ.-Prof. Dr.-Ing.
Markus Rupp
NMb
b
x
x
aa
aabxA
M
N
MNM
N
>
=
=
1
1
1
111
How does LS help in solving sets of linear equations? Consider the following:
If AHA is of rank N, LS delivers a unique solution, as then AHb is in range of AH!
If rank(AHA)<N, LS cannot solve the problem!Regularisation
92Univ.-Prof. Dr.-Ing.
Markus Rupp
NM
bAxAAbxA
H
NN
H
>
=
=
×
How does LS help in solving sets of linear equations? If rank(AHA)<N, LS cannot solve the
problem!Regularisation
A small positive ε guarantees the set of equations to be solvable.
The solution may be different though!
93Univ.-Prof. Dr.-Ing.
Markus Rupp
( )
0, >>
=+
=
×
ε
ε
NM
bAxIAAbxA
H
NN
H
How does LS help in solving sets of linear equations? Example 4.18: Consider the following:
which we know as underdetermined problem.
94Univ.-Prof. Dr.-Ing.
Markus Rupp
NM
b
b
x
x
aa
aa
bxA
M
N
MNM
N
<
=
=
1
1
1
111
How does LS help in solving sets of linear equations? Underdetermined LS delivers:
If rank(AAH)=M, the solution is the Minimum Norm LS solution.
If rank(AAH)<M, the inverse cannot be computedregularization.
95Univ.-Prof. Dr.-Ing.
Markus Rupp
( )NM
bAAAx
bxAHH
LS
<=
=−1
96Univ.-Prof. Dr.-Ing.
Markus Rupp
Factorisation A linear system of equations Ax=b can be solved in
many different ways. Hereby the numerical precision is in most cases an important factor for the quality of the result.
There are numerous methods for matrix equations that convert the general problem Ax=b in an equivalent problem Bx=c in which the matrix B exhibits particular properties so that the system can be solved easily.
In the following we will shortly present the major ideas without going into the details of each method.
97Univ.-Prof. Dr.-Ing.
Markus Rupp
Factorisation LU: stands for „lower-“ and „upper-triangular“. A=LU can be
solved easier since LUx=b: Ux=c and Lc=b, i.e., two linear systems of equations, that are easy to solve.
Cholesky: a particular solution of LU factorisation for Hermitian, positive-definite matrices A=LU=LLH.In general such matrices can be decomposed into LDLH, UDUH or QDQH. Here, Q is a unitary matrix: QQH=I and D is a diagonal matrix. In case of QDQH it is called eigenvalue decomposition.
QR: A=QR. Here, Q is a unitary matrix: QQH=I and R=U is an upper triangular matrix. A=QR is easier to solve since QRx=b: Rx=c und Qc=bc=QHb, i.e., two sets of equations that are easy to solve.
98Univ.-Prof. Dr.-Ing.
Markus Rupp
Factorisation SVD: Singular Value Decomposition. A=UΣVH with
two unitary matrices U and V and the diagonal matrix Σ.
In the following we treat the eigenvalue decomposition first and we will then show the singular valued decomposition.
99Univ.-Prof. Dr.-Ing.
Markus Rupp
Eigenvalue Decomposition Let A be an m x m matrix from C. Consider the
linear equationAu=λu
or equivalently (A-λI)u=0.
Here the trivial solution u=0 is not of interest but the nullspace of (A-λI).
Particular values λ, generating non-trivial nullspaces, are called eigenvalues. The corresponding vectors u, are called eigenvectors.
100Univ.-Prof. Dr.-Ing.
Markus Rupp
Eigenvalue Decomposition Definition 4.17: The polynomial in λ,
generated by the determinant of (A-λI) is called characteristic polynomial.
The determinant det(A-λI)=0 is called characteristic equation of A.
The roots of the characteristic equation are called eigenvalues. The set with all eigenvalues is called the spectrum of A.
101Univ.-Prof. Dr.-Ing.
Markus Rupp
Eigenvalue Decomposition Example 4.19: Let a linear, time invariant system
be described by the following equation in state space:
Since the matrix inversion of (qI-A) determines the dynamic and stability behavior of the system, so does its determinant det(λI-A).
( )( ) k
k
kk
kkk
xBAqIC
xqH
zCyxBzAz
1
1
−
+
−=
=
=+=
102Univ.-Prof. Dr.-Ing.
Markus Rupp
Eigenvalue Decomposition Lemma 4.3: If the eigenvalues of a matrix A are
all different then the corresponding eigenvectors are all linearly independent.
Proof: We start with m=2 and the opposite: Let’s assume the eigenvectors u1 and u2 are linearly dependent.
Since λ1 and λ2 are different and u1 is not the zero vector, we must have c1 =0.
( ) 00|
0
1211
222121
2221112211
2211
=−=+−
+=+=+
ucucuc
ucucuAcuAcucuc
λλλλ
λλ
103Univ.-Prof. Dr.-Ing.
Markus Rupp
Eigenvalue Decomposition A similar argument leads to c2=0. This proves the
two eigenvectors are linearly independent. For m>2 we consider always the case that two
vectors are linearly dependent and prove the contradiction.
If the eigenvalues are not different, the eigenvectors can be linearly dependent or not. Consider the following matrices:
;4014
;4004
=
= BA
104Univ.-Prof. Dr.-Ing.
Markus Rupp
Eigenvalue Decomposition Consider the decomposition A=UΛU-1, with the
diagonal matrix Λ. Let’s assume first that A has n linearly
independent eigenvectors. Then we have:
[Ax1, Ax2,... Axn]=[λ1u1,λ2u2,... λnun]=AU=UΛ.
Are the eigenvectors linearly independent, then U can be inverted, and we find:
A= UΛU-1
105Univ.-Prof. Dr.-Ing.
Markus Rupp
Eigenvalue Decomposition Remark: Such matrix transformation in which the
eigenvalues are not changed are called similarity transformation (Ger.: Ähnlichkeitstransformation). Two matrices are called similar if they have the same eigenvalues.
Advantage of the transformation:
11
0
1
1
!
)(
−Λ−∞
=
−
−
=
Λ=
Λ==
Λ=
∑
∑∑
UUeUi
Ue
UfUAfAf
UUA
i
iA
i
ii
i
ii
mm
106Univ.-Prof. Dr.-Ing.
Markus Rupp
Jordanform If the eigenvectors are not linearly independent, a
diagonalisation is not possible! However, a close to diagonal form is possible, the so-called Jordan form:
A=TJT-1. Here, matrix J is of blockdiagonal form with the
blocks Ji:
=
i
i
i
iJ
λ
λλ
1
1
107Univ.-Prof. Dr.-Ing.
Markus Rupp
Jordanform Example 4.22: Consider the following
matrix:
It has a single eigenvalue λ=3, and two linearly independent eigenvectors:
=
300030103
B
[ ] [ ] ;0,1,0;0,0,1 21TT uu ==
108Univ.-Prof. Dr.-Ing.
Markus Rupp
Jordanform Still Example 4.22: Thus, the Jordan
form of the matrix B becomes:
We have: B=TJT-1
Since: Bm =TJmT-1
However, Jm is not diagonal or of Jordan form!
=
300030013
)(BJ
=
010110001
T
109Univ.-Prof. Dr.-Ing.
Markus Rupp
Minimal Polynomial Theorem 4.8: (Cayley Hamilton): Each square
matrix satisfies its own characteristic equation.
Definition 4.18: A polynomial f is called anihilating polynomial of a square matrix A if: f(A)=0.
Definition 4.19: The anihilating (monic) polynomial of A with smallest degree is called minimal polynomial of A.
110Univ.-Prof. Dr.-Ing.
Markus Rupp
Minimal Polynomial Example 4.23: Consider the following
matrices
The corresponding minimal polynomials are:
We recognize the relation to the size of the Jordan blocks.
;616
16;
5515
;4
44
321
=
=
= AAA
( ) ( ) ;6)(;5)(;4)( 33
221 −=−=−= xxfxxfxxf
112Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume We considered nullspaces and column spaces:
and concluded that orthogonal complements in Hilbert spaces exhibit favorable properties: Each Hilbert space S can be represented by two
subspaces: a subspace V and its orthogonal complement . Note that this decomposition in the Hilbert space is
truly special as it is a complete space.
( ) ( ) ( ) ( ) ⊥⊥
⊥⊥
==
==
)();(
)(;)(
ANARANAR
ANARANARHH
HH
⊥V
Resume Given A, how do I find its nullspace? Let
113Univ.-Prof. Dr.-Ing.
Markus Rupp
?)()(
8621
4321
)(
84632211
==
=→
=
⊥ ARAN
spanARA
H
Resume Solution: take random vector v and
This is the first vector of the nullspace. How do we get the next (last) vector?
114Univ.-Prof. Dr.-Ing.
Markus Rupp
( )
−
=
−−
−−
=→
=
−= −
00
4.08.0
0001
36.048.048.064.0
2.04.04.08.0
0001
)( 1
zv
vAAAAIz HH
Resume Extend A
115Univ.-Prof. Dr.-Ing.
Markus Rupp
( )
−
−
=
−
=
−−
=→
=
−=
−
=
−
48.064.0
4.08.0
)(
48.064.0
00
0100
36.048.048.064.0
0000
0100
)(
084063
4.0228.011
1
spanAN
zv
vAAAAIz
A
H
HH
116Univ.-Prof. Dr.-Ing.
Markus Rupp
Projections We already found that the overdetermined
LS problem Ax=b can be described by linearly filtering the observation vector:
Let’s assume b can be described by two parts: b=Ay+z. The first part is in the column space R(A) of A and z is in its orthogonal complement, thus in N(AH).
( ) ( )( )( ) ( )HHH
LS
HH
ANbAAAAIbbe
ARbAAAAb
∈−=−=
∈=−
−
1
1
ˆ
ˆ
117Univ.-Prof. Dr.-Ing.
Markus Rupp
Projections Thus we have for the LS estimate:
This also explains why the error eLS and the LS estimate are orthogonal: they are from orthogonal complements.
This also means that the two projections P and I-P decompose a vector into two orthogonal complements (project).
( ) ( )( )( )( ) zzyAAAAAIe
yAzyAAAAAbHH
LS
HH
=+−=
=+=−
−
1
1ˆ
118Univ.-Prof. Dr.-Ing.
Markus Rupp
Projections
s
( ) ( )( )( )( ) zzyAAAAAIe
yAzyAAAAAbHH
LS
HH
=+−=
=+=−
−
1
1ˆ
)(ˆ ARb ∈
( )
( )LS
H
e R A
N A
⊥∈
=
119Univ.-Prof. Dr.-Ing.
Markus Rupp
Example The two signals a and b are to be transmitted. For
this, three-component vectors [u,v,w] are available. If a and b are transmitted in the form [a,b,a+b], then they span a complete subspace V in R3.
Disturbed by additive noise, a vector x=[1,2,4] is received. Which is the closest vector to [1,2,4] in V?
Is it [1,2,3], [1,3,4], [2,2,4] or even different?
120Univ.-Prof. Dr.-Ing.
Markus Rupp
Example Consider the LS solution of the problem The subspace V is spanned by the two vectors:
[1,0,1],[0,1,1]. We can thus make use of the projection property
of the LS solution:
( )T
THH
H
xAAAAv
A
]6666,3,3333,2,333,1[
110101
10
=
=
=
−
121Univ.-Prof. Dr.-Ing.
Markus Rupp
Projections
s]1,1,0[
]1,0,1[],4,2,2[]4,3,1[],3,2,1[
)(ˆ ARb ∈ ]1,1,1[
)(−∈ ⊥AReLS [ ]
[ ]
[ ]1,1,131
1,1,037
1,0,134]4,2,1[
−−
+
=
HH AAAA 1)( −
HH AAAAI 1)( −−
]4,2,1[
Exam Dates
122Univ.-Prof. Dr.-Ing.
Markus Rupp
•31.01 (1 student)
•21.02 (~5 students)
•Xx.03 ??
123Univ.-Prof. Dr.-Ing.
Markus Rupp
Unitary Matrices Definition 4.21:
A matrix with the property UHU=I is called (semi-)unitary (Ger.: unitär).A matrix with UTU=I is called orthogonal.
Note: if U is an nxn matrix, it follows that UH=U-1.
124Univ.-Prof. Dr.-Ing.
Markus Rupp
Hermitian Matrices Definition 4.20:
If A=AT, for A from R, then the matrix A is called symmetric (Ger.: symmetrisch).If A=AH, for A from C, then the matrix A is called Hermitian (Ger.: Hermitesch oder Hermitsch).
Such matrices naturally occur in form of covariance matrices Rxx=E[xxH] or when solving LS problems.
Lemma 4.4: The eigenvalues of Hermitian matrices are real-valued.
Proof: <Au,u>= λ<u, u>=<u,AHu>=<u,Au>= λ*<u, u> λ*=λ.
125Univ.-Prof. Dr.-Ing.
Markus Rupp
Hermitian Matrices Lemma 4.5: The eigenvectors to different eigenvalues of
Hermitian matrices are orthogonal.
Proof: Let λ1 and λ2 be two different eigenvalues with corresponding eigenvectors u1 and u2. Then we have:<Au1, u2>=<u1,AHu2>=<u1,Au2>=<u1,λ2u2>
=λ2<u1, u2> =λ1<u1, u2>thus: (λ2 -λ1)<u1, u2>=0 and therefore <u1, u2>=0.
Lemma 4.6: Every Hermitian nxn matrix A can be diagonalized by a unitary matrix U.The unitary matrix U simply consists of orthonormaleigenvectors.
A=UΛUH=Σ λiui uiH
126Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques Definition 4.22: Let A be an mxm matrix and S a
subspace of R(A). S is called invariant subspace of A if for every x from S there exists an Ax from S.
Example 4.25: Let an nxn matrix A have k (smaller than n) different eigenvalues with the corresponding eigenvectors qi, i=1,2,...,n. Let U=[u1, u2,..., um] and Ui i=1,2,…,k, be the k subsets of eigenvectors, corresponding to the k eigenvalues λi, i=1,2,...,k. The subspaces span(Ui) spanned by the subsets Uiare invariant subspaces of A.
127Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.25 For example consider a 6x6 matrix
with:
65436543
322322
1111
,, and , rseigenvecto threehas , and rseigenvecto twohas
r eigenvecto one has
uuuUuuuuuUuu
uUu
=→=→=→
λλ
λ
128Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.25 Simplified, let
SxuuuAuAxAuuSuux
Auuuu
ARspanSuuA
∈=+=+=∈+=
=
⊂=
2322232
3232
32
322
andin n combinatiolinear any for subspaceinvariant an is ,spanS
rseigenvecto andbelet To));((;
λβλαλβαβα
λλ
Example: Hotel California from Eagles (1976) A= check out hotel x= any hotel customer S= set of all hotel customers
Ax:SR(A) „You can check out any time you like but
you can never leave„
129Univ.-Prof. Dr.-Ing.
Markus Rupp
130Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques Theorem 4.9: Let A be an nxn Hermitian matrix
with k (maximum n) different eigenvalues. Then we have:
The matrices Pi are projection matrices in the (invariant) subspace span(Ui), spanned by the normalized eigenvectors uj.
∑
∑
∑
∈
=
=
=
=
=
ij Uu
Hjji
k
ii
k
iii
uuP
PI
PA
:with
:identity
:iondecomposit spectral
1
1λ
131Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.25 again For example
( ) ( )
321
665544333222111
6543
322
11
and , rseigenvecto threehas and rseigenvecto twohas
r eigenvecto one has
P
HHH
P
HH
P
H uuuuuuuuuuuuA
uuuuu
u
+++++= λλλ
λλλ
132Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques Proof: We already know that Hermitian matrices
can be diagonalized by unitary matrices, thus:
∑∑
∑∑∑
==
===
===
===Λ=
k
ii
n
i
Hii
k
iii
n
i
Hiii
n
i
Hiii
PIuu
Puuuu
1
H
1
111
H
UU
:have weNote
UUA λλλ
133Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques Consider the reaction of a Hermitian
matrix A applied onto an arbitrary vector x:
An operator A can thus be decomposed into smaller (partial) operations called projections.
∑ ∑∑= ∈=
==k
i
x
Uu
Hjji
k
iii xuuxPxA
ij1
subspace theof components various theonto of projection components the
of stretching1
λλ
134Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques Example 4.26: Consider the matrix
with the two eigenvalues λ1=5 and λ2=10 and the corresponding eigenvectors 1 2
1
2
1 2
1 21 1;2 15 5
1 1 1 21 12 2 2 45 5
2 2 4 21 11 1 2 15 5
5 10
T
T
u u
P
P
A P P
− = =
= =
− − − = = − = +
−
−=
6229
A
135Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques Example 4.27: Consider a weakly stationary random process
with Hermitian autocorrelation matrix Rxx. The diagonalization of Rxx leads to:
If considering the eigenvalues one realizes that some can be extremely small, thus do not have much part of the ACF matrix.
They could be neglected, approximating the process. Description of correlation by a few strong eigenvalues:
Karhunen-Loeve description of random processes x.
[ ]
[ ] [ ] Λ===
=
Λ==
UxxUEyyER
:xUyConsider
UUxxER
Hyy
H
Hxx
HH
H
Orthogonal„Decorrelation“
Kari Karhunen (1915–1992) was a Finnish mathematical statistician.Michel Loève (22.1.1907–17.2.1979) was a French-American math- statistician
136Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques Consider the expression
Selecting the various eigenvectors x=un, we obtain the corresponding eigenvalues λn.
xuux
xuuxxPxxAx
k
i Uu
Hjji
H
k
i Uu
Hjj
Hi
k
ii
Hi
H
ij
ij
=
==
∑ ∑
∑ ∑∑
= ∈
= ∈=
1
11
λ
λλ
137Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques
For the largest eigenvector we obtain the largest eigenvalue and so on…
Set x=umax (x=umin)
xuuxxAxk
i Uu
Hjji
HH
ij
= ∑ ∑
= ∈1λ
minmax min;max λλ ==xxxAx
xxxAx
H
H
xH
H
x
( ) ( ) maxmaxmax1
2
maxmaxmaxmaxmaxmaxmaxmax
max1
maxmaxmax
maxmaxuuuuuuuu
uuuuuAu
Huu
HHH
k
i Uu
Hjji
HH
H
ij
λλλ
λ
=
= ∈
===
= ∑ ∑
138Univ.-Prof. Dr.-Ing.
Markus Rupp
Subspace Techniques Definition 4.23: The expression
is called Rayleigh quotient.
Note that such expression only makes sense when applied to Hermitian matrices.
xxxAxxAr H
H
=),(
maxmin ),( λλ ≤≤ xAr
139Univ.-Prof. Dr.-Ing.
Markus Rupp
Example 4.28: Eigenfilter The matched filter (Ger.: Signalangepasstes
Filter) is well known to maximize the signal to noise ratio of deterministic signals.
If, however, the maximal signal to noise ratio of random signals is considered, we speak of an eigenfilter.
140Univ.-Prof. Dr.-Ing.
Markus Rupp
Eigenfilter
For a random signal xk we have: yk=Σhm(xk-m+vk-m)= hTxk+hTvk Received signal power: P= hTE[xkxk
H]h* =hTRxxh*= hHRxxh Noise power: N= hHE[vkvk
H]h =σv2 hHh
Maximize the Signal to noise ratio
The optimal solution can be found based on the largest eigenvector to λmax.
h+xk
vk
yk
2max
2
)(maxmax
v
xx
v
xx
σλ
σR
hhhRh
NP
H
H
hh ==
141Univ.-Prof. Dr.-Ing.
Markus Rupp
Filter Example 4.29 This technique can be used for filter
design. Let us design a linear phase filter with 2N+1 coefficients for which we are given the magnitude (Ger.: Amplitudengang) Hd(ejΩ):
A low pass filter is to design with limit frequencies Ωp,Ωs so that
( )
≤Ω≤ΩΩ≤Ω≤
=Ω
πs
pjd eH
;00;1
( ) ( ) )()cos(0
Ω=Ω== Ω−
=
Ω−ΩΩ−Ω ∑ cbenbeeHeeH TjNN
nn
jNjR
jNj
142Univ.-Prof. Dr.-Ing.
Markus Rupp
Filter Example 4.29 In the stopband (Ger.: Sperrbereich) we have:
where we introduced matrix P:
( ) ( )
( ) ( ) bPbbdccb
deHeHE
TTT
jd
jRS
s
s
=ΩΩΩ=
Ω−=
∫
∫
Ω
Ω
ΩΩ
π
π
π1
2
( ) ( )∫Ω
ΩΩΩ=π
πs
djiPij coscos1
143Univ.-Prof. Dr.-Ing.
Markus Rupp
Filter Example 4.29 In the passband (Ger.: Durchlassbereich) we have
for Ω=0: Hd(ejΩ)=1, or equivalently bT1=1. The error is given by:
1- bTc(Ω)=bT[1-c(Ω)]. The so obtained error energy is to minimize:
The entire filter problem is thus given by:
( )[ ] ( )[ ] bQbbdccbE TTTP
p
=
ΩΩ−Ω−= ∫
Ω
0
111π
( ) ( ) ( )10
11<<
−+==−+=α
ααααα bQbbPbbRbEEJ TTTPS
144Univ.-Prof. Dr.-Ing.
Markus Rupp
Filter Example 4.29
Obviously, there is still freedom in the choice of b. We can restrict this by normalizing in the form of bTb=1.
The filter problem is then given by: Minimize bTRb with constraint bTb=1.
Or, equivalently:
)(min min RbbbRb
T
T
b λ=
( ) ( ) ( )( )QPR
bQbbPbbRbEEJ TTTPS
ααααααα
−+=−+==−+=
111
145Univ.-Prof. Dr.-Ing.
Markus Rupp
Further Subspace Techniques Many modern DSP techniques are based on
subspace methods. The most relevant are:
Pisarenko’s Harmonic Decomposition (PHD) MUSIC ESPRIT
We will have a closer look at them in the following.
unknown RV amplitudes
from C
146Univ.-Prof. Dr.-Ing.
Markus Rupp
Further Subspace Techniques Consider the following signal model:
)(wa)(x1
2 tetp
i
tfji
i += ∑=
π
white noiseunknown σw
2unknown
frequencies
unknown order p
147Univ.-Prof. Dr.-Ing.
Markus Rupp
Further Subspace Techniques We assume amplitudes ai and noise to be random. x(t) becomes a random process.
We sample the signal equidistantly on M (M>p) positions and obtain:
[ ]
[ ] [ ]ww
ww1
2xx
1
)1(2222
axx
wax
,...,,,1
RSPS
RssEER
s
eees
H
p
i
Hiii
H
p
iii
TTMfjTfjTfji
iii
+=
+==
+=
=
∑
∑
=
=
−πππ
Hermitian
Further Subspace Techniques Connection to LS:
which is not exactly the same!
148Univ.-Prof. Dr.-Ing.
Markus Rupp
[ ]
2
2,|
1
)1(2222
min
,...,,,1
aSxa
RSPSR
waSwsax
eees
pSaLS
wwH
xx
p
iii
TTMfjTfjTfji
iii
−=
+=
+=+=
=
∑=
−πππ
149Univ.-Prof. Dr.-Ing.
Markus Rupp
Further Subspace Techniques A bit more detailed:
Note, that S is a Vandermonde matrix.
[ ]
[ ][ ]
[ ]
[ ][ ] ww21
2
22
21
21
wwww1
2xx
...... Rsss
aE
aEaE
sss
RSPSRssaER
Hp
p
p
Hp
i
Hiii
+
=
+=+= ∑=
150Univ.-Prof. Dr.-Ing.
Markus Rupp
Further Subspace Techniques The vector space spans1,..., sp, spanned by S is a
subspace of the signal xk. It is called the signal-subspace.
Setting the noise to zero, we can find only p<M eigenvalues that are different from zero with their corresponding eigenvectors (Ger.: Haupteigenvektoren):
pp
p
i
Hiiit
uuusss
uuR
,...,,span,...,,span 2121
10)(
=
= ∑=
=λ
wxx
Further Subspace Techniques Back to LS:
Alternative method to find σw2 and p!
151Univ.-Prof. Dr.-Ing.
Markus Rupp
[ ]
[ ]2
2,|
21
2
2,|
21
min
,...,,;
min
,...,,
bUxb
uuuUwbUx
aSxa
waSxsssS
pUbLS
p
pSaLS
p
−=
=+=
−=
+=
=
Further Subspace Techniques Complement U:
Find p by sparsity method.
152Univ.-Prof. Dr.-Ing.
Markus Rupp
[ ]
[ ]pbthsbUxb
UUU
bUxb
uuuUwbUx
UbLS
pUbLS
p
=−=
=
−=
=+=
0
2
2|
2
2,|
21
..min
~,
min
,...,,;
153Univ.-Prof. Dr.-Ing.
Markus Rupp
Further Subspace Techniques We thus have the possibility to find the signal
subspace out of Rxx without knowing the various frequencies fi.
In a second step we can determine the unknown frequencies.
If we further assume white noise w(t), we have:
.2wxx ISPSR H σ+=
154Univ.-Prof. Dr.-Ing.
Markus Rupp
Further Subspace Techniques We recognize that noise causes an increase of the
eigenvalues by σw2 without changing the eigenvectors.
Note that next to the p signal-eigenvalues also the M-p remaining eigenvalues now take on the value σw
2 with corresponding eigenvectors up+1,...,uM. These eigenvectors are solely determined by the noise.
We call
the noise-subspace. Note that every (eigen)vector of the signal-subspace is
orthogonal to every (eigen)vector of the noise-subspace. Pisarenko recommends to take just one noise vector: M=p+1.
Mpp uuuN ,...,,span 21 ++=
155Univ.-Prof. Dr.-Ing.
Markus Rupp
Pisarenko‘s Harmonic Decomposition In a first step, the eigenvalues and corresponding
vectors are computed from the ACF matrix Rxx. With them, we can determine p and σw
2. Since the eigenvectors from the noise subspace
are orthogonal to those of the signal-subspace, we conclude that:
siHuk=0; for i=1,2,...,p and k=p+1,p+1,…,M.
This in turn provides (M-p) polynomials for the unknown frequencies.
[ ] 0...212
1
=
+
+
p
HM
Hp
Hp
sss
u
uu
156Univ.-Prof. Dr.-Ing.
Markus Rupp
Pisarenko‘s Harmonic Decomposition
We thus have
Take for example 1.row:
[ ] 0...212
1
=
+
+
p
HM
Hp
Hp
sss
u
uu
[ ]
∑
∑−
=+
−
=+
+
=
=
=
1
0,1
1
0,1
211
0)2exp(
0)2exp(
0...
M
m
mimp
M
mimp
pHp
Tfju
mTfju
sssu
π
π
[ ]TTMfjTfjTfji
iii eees )1(2222 ,...,,,1 −= πππ
u is given
fi can be determined
157Univ.-Prof. Dr.-Ing.
Markus Rupp
Pisarenko‘s Harmonic Decomposition
Have ACF:
Take for example 1.row of ACF matrix:
[ ]
[ ] [ ] ww212
1
21
wwyywwww1
2xx
...... Rsss
p
pp
sss
RRRSPSRssaER
Hp
p
p
Hp
i
Hii
p
i
i
+
=
+=+=+= ∑=
( ) 1,...1,0;2exp
)()(
2w
1
2wyyxx
−=+−=
+=
∑=
MmpTmfj
mrmr
mk
p
kk
m
σδπ
σδ
158Univ.-Prof. Dr.-Ing.
Markus Rupp
Pisarenko‘s Harmonic Decomposition
−−
−
=
+
=
+=+=+=
−−−−
−
−−
−−−
=∑
)0(...)2()1(
)0()1()1(...)1()0(
...111
...1
...
...1...11
xxxxxx
xxxx
xxxxxx
ww
)1(2)1(2
22
22
2
1
)1(2)1(2)1(2
222
wwyywwww1
xx
1
1
1
21
21
rMrMr
rrMrrr
R
ee
eee
p
pp
eee
eee
RRRSPSRsspR
TMfjTMfj
Tfj
TfjTfj
pTMfjTMfjTMfj
TfjTfjTfj
Hp
i
Hiii
p
p
P
P
ππ
π
ππ
πππ
πππ
159Univ.-Prof. Dr.-Ing.
Markus Rupp
Pisarenko‘s Harmonic Decomposition Classic solution: Eventually, we can determine the
powers pi =E[|ai|2].
Since xk=yk+wk rxx(m)=ryy(m)+σw2 δm
With LS we can even find the complex-valued ai. Problematic with this method is the imprecise
determination of the frequencies.
=
)(
)3()2()1(
......
...
...
...
3
2
1
222
32323232
22222222
2222
31
321
321
321
pr
rrr
p
ppp
eee
eeeeeeeeeeee
ppfjpfjpfj
fjfjfjfj
fjfjfjfj
fjfjfjfj
p
p
p
p
xx
xx
xx
xx
πππ
ππππ
ππππ
ππππ
160Univ.-Prof. Dr.-Ing.
Markus Rupp
MUSIC MUSIC=MUltiple SIgnal Classification. Consider the vector:
Computing the expression
It becomes maximal for the desired frequencies. Then we continue as for PHD.
[ ]ik
H
TfMjfjfj
ffMpkufs
eeefs
=+==
= −
und ,..1;0)(
,...,,,1)( )1(2222 πππ
∑+=
= M
pjj
H ufsfP
1
2)(
1)(
161Univ.-Prof. Dr.-Ing.
Markus Rupp
ESPRIT ESPRIT=Estimation of Signal Parameters via
Rotational Invariance Techniques. Let us consider the following two covariance
matrices: [ ] [ ]
( )
=
=Φ
+Φ=+=
== +
01
010
,...,,diag
;
E;E
222
22
1
21
E
eee
ESSPQISPSR
QR
pfjfjfj
HH
Hkk
Hkk
πππ
σσ wwxx
xx xxxx
162Univ.-Prof. Dr.-Ing.
Markus Rupp
ESPRIT As before we determine p and σw
2 by the eigenvalues of Rxx.
We then obtain SPSH=R-σw2I as well as SPΦHSH.
Consider now SP[I-λΦH]SHu=0. The generalized eigenvalues are the desired values
ej2πfi. Generalized eigenvalues are defined by:
SPSHu=λSPΦHSHu
Hint: try Matlab D=eig(A,B).
163Univ.-Prof. Dr.-Ing.
Markus Rupp
ESPRIT Applications
Note that Fourier transform gives: F[h(t-τ)]=H(jω) ejωτ
Thus, all calculations of temporal changes or delays are equivalent to the determination of frequencies. This is for example being used in radar techniques.
AoA (Angle of Arrival) and AoD (Angle of Departure) computation in wireless fields.
Resume What kind of eigenvalues do Hermitian
matrices have? Answer: eigenvalues of Hermitian matrixes are
real-valued. What can be said about eigenvectors to a
pair of distinctly different eigenvalues in Hermitian matrices? Answer: eigenvectors to different eigenvalues
of Hermitian matrices are orthogonal.
164Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume Why can Hermitian matrices A be
diagonalized by unitary matrices U? (Proof of Lemma 4.6) Diagonalizing A in general: S-1AS=Λ Hermitian property:A=AH
Thus: (S-1AS)H=SHAS-1H=Λ=S-1AS Thus: SH=S-1unitary
165Univ.-Prof. Dr.-Ing.
Markus Rupp
167Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume We found that symmetric (Hermitian) matrices
are of particular interest. We can construct them by:
On the other hand, can each Hermitian matrix B be decomposed into B=HHH?
Answer:
( ) BHHBHHB HHHH ==→=
negative!-non are seigenvalue theifonly Thus,
?21
21
HHH HHUUUUB =ΛΛ=Λ=
Resume Example 4.24: Consider now the following
problem: Given a matrix A=HHH. Obviously this matrix can be decomposed into a
product. But what if we have A + γ2 I instead? Solution:
168Univ.-Prof. Dr.-Ing.
Markus Rupp
( )
( ) ( )
HH
H
H
HHH
HHH
UIIUIA
A
UIUUUUUIAHHUUUUA
21
221
22
2
222
?21
21
0. as positive be also willones new the positive, are of seigenvalue all As
γγγ
γ
γγγ
+Λ+Λ=+
>
+Λ=+Λ=+
=ΛΛ=Λ=
Resume Now let us start with
Solution:
169Univ.-Prof. Dr.-Ing.
Markus Rupp
[ ]
??
1111
1111
HAAIBIB
B
=+
+
=
=
2
2 factorize we do Howγ
γ
[ ] [ ]
−
+=
−
−
+
+
=
11
2,
11
21
111
122
11111
22
22
γγ
γγ
A
B
170Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume Example: A beamforming problem leads to the
optimization with symmetric matrices A,B>0:
How is b found?bBbbAb
T
T
bmax
aaaRa
aaaUAUa
bHHbbAb
aUbabHUHbHHb
bAbbBbbAb
T
T
aT
TT
aTT
T
b
TTT
TT
T
bT
T
b
maxmaxmax
maxmax
2/12/1
2/12/1
=ΛΛ
=
Λ=→=→Λ=
=
−−
−
Resume Example 4.30: Consider CoMP
(Coordinated MultiPoint) Problem 1:
A basestation with N antennas serves K>N users with one antenna each.
Is it possible to transmit so that a single user receives maximum signal strength while the others are interfered as little as possible?
171Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume Consider users 1…K:
Simplified view, all elements are in R
For a fixed power ||x||2 User 1 receives:
172Univ.-Prof. Dr.-Ing.
Markus Rupp
xhr
xhr
xhr
TKK
T
T
=
=
=
...22
11
2
2
112
2
2
1
xxhhx
x
r TT
=
Resume Maximizing signal power would be:
We obtain this by the Rayleigh quotient but also, much simpler, by Cauchy Schwarz‘s inequality.
173Univ.-Prof. Dr.-Ing.
Markus Rupp
2
212
2
11
1
max hx
xhhxhx
TT
x α==
Resume Now let us also consider the
interference terms: Minimizing interference power:
174Univ.-Prof. Dr.-Ing.
Markus Rupp
2
2
2
22
2
2
23
2
maxminxH
xxxH
xHx
h
hh
TK
T
T
=→=
Resume Now let‘s maximize signal power and minimize
interference power (leakage) at the same time:
175Univ.-Prof. Dr.-Ing.
Markus Rupp
( ) ( ) 11
111
11
12
21
112
11
21
21
11112
2
2
21
SLR
maxmax
maxmaxmaxSLR
hHHhhHHhBx
hBy
yy
yBhhBy
xBxxhhx
yBxyxB
xBxxhhx
xHHxxhhx
xH
xh
TTTopt
T
opt
T
TT
T
yT
TT
x
T
TT
x
B
TT
TT
x
T
x
−−−
−
−−
−
=→==
=
=
=→=
===
178Univ.-Prof. Dr.-Ing.
Markus Rupp
Resume: Further Subspace Techniques Consider the following signal model:
)(wa)(x1
2 tetp
i
tfji
i += ∑=
π
white noiseunknown σw
2unknown
frequenciesUnknown RV amplitudes
from C
unknown order p
Resume How do we get these values?
1) build Rxx and find Λ and U 2) from Λ deduce p and σw
2.Improve resolution by increasing M
3) compute frequencies PHD: find polynomial zeros MUSIC: find spectral maxima ESPRIT: solve generalized eigenvalue prob.
4) solve linear system for amplitudes.179
Univ.-Prof. Dr.-Ing. Markus Rupp
180Univ.-Prof. Dr.-Ing.
Markus Rupp
Singular Value Decomposition Consider MIMO transmission
Maximum channel capacity:
T
1P
2Ps B s
H
x y
? C obtain to )P B,(T, PT,TRselect to How max iH=
+= =
H
TKRtrace HRH
NSNRIC detlogmax 2)(max
181Univ.-Prof. Dr.-Ing.
Markus Rupp
Singular Value Decomposition Theorem 4.10: Every matrix A from Cmxn can be decomposed in the
following form: A=UΣVH with the unitary matrices U from Cmxm and V from Cnxn as well as the “diagonal matrix“ Σ from Rmxn, with p=min(m,n).
This particular factorization is called Singular Value Decomposition =SVD (Ger.: Singulärwertzerlegung). The diagonal elements of Σ are called singular values (Ger.: Singulärwerte). Singular values are never negative!
Invented by: E. Beltrami (1835-1899), C.Jordan (1838-1921), J.J. Sylvester (1814-1897), E. Schmidt (1876-1959) and H.Weyl (1885-1955)
[ ]Oor D
p
p Σ=
=Σ
=Σ000000
000000
1
1
σ
σσ
σ
Singular Value Decomposition Quick proof: Assume Λ2 is of large size, Λ1 of small All eigenvalues in Λ1 >0
182Univ.-Prof. Dr.-Ing.
Markus Rupp
1
1
1 12 2
1 1 1 1
2 2
112
1 2 2 2
;
;
,
H H H H
U
H H
H
U
B A A A AV V V A AV I
B AA AA U U
U AV U U B UO
− −
−
= = Λ → Λ Λ =
= = Λ
Λ = Λ → = Λ =
rank(B1)=rank(B2)!
183Univ.-Prof. Dr.-Ing.
Markus Rupp
Singular Value Decomposition (lengthy) Proof: Consider the eigenvalue
decomposition of a matrix AHA with A from Cnxn : AHAV= VΛ1
Let the eigenvalues in Λ1 be ordered so that λ1,...,λr>0 and λr+1=...=λn=0.
We can thus construct the following r vectors:
We find that <ui,uj>=δi-j, for i,j=1..r.
rivAui
ii ,...2,1; ==
λ
184Univ.-Prof. Dr.-Ing.
Markus Rupp
Singular Value Decomposition The set u1,..., ur from U1 can be extended with orthonormal
vectors (for example by the Gram Schmidt method). We thus obtain U=[u1,...,ur,...,un]=[U1,U2]: UHU=I.
Obviously, the vectors in U are eigenvectors for AAH from Cmxm : AAHui= AAHAvi/sqrt(λi)=Aλivi/sqrt(λi)=λiui
This is clear for the eigenvalues that are distinct from zero. For the zero eigenvalues the corresponding eigenvectors must come
from the nullspace of AAH: AAH ui =0; i=r+1,…,m Since we have for Hermitian matrices that R(AAH) is the orthogonal
complement to N((AAH)H) =N(AAH), all eigenvectors are orthogonal.
185Univ.-Prof. Dr.-Ing.
Markus Rupp
Singular Value Decomposition We therefore find for UAVH:
i>r: zi=AHui is in the nullspace of A (Azi=0) and in the range of AH.
For AHui=0 we also have vjH AHui = ui
H Avj =0. Thus UHAV=Σ has a diagonal block with the non-
zero elements sqrt(λj), j=1,2,…,r.
00:..1
1:..1
=→=+=
=== −
i
z
iH
jiijHH
ii
jHi
i
uAAmri
vAAvvAuri
λ
δλλ
186Univ.-Prof. Dr.-Ing.
Markus Rupp
Singular Value Decomposition Example 4.31:
Example 4.32: Let B1=AHA. Then: B1V=V Λ1 nxn
= (VΣTUH) (UΣVH) V=VΣTΣ=V Λ1
Also B2=AAH: B2U=UΛ2 =UΣΣT mxm
=Σ==
=Σ==
000
0:2,3
0000
:3,2
2
1
2
1
σσ
σσ
nm
nm
A=UΣVH
187Univ.-Prof. Dr.-Ing.
Markus Rupp
Singular Value Decomposition Thus:
=ΣΣ
=ΣΣ
=Σ==
=ΣΣ
=ΣΣ
=Σ==
0
000
0:2,3
0
0000
:3,2
22
21
22
21
2
1
22
212
2
21
2
1
σσ
σσ
σσ
σσ
σσ
σσ
TT
TT
nm
nm
and
and
188Univ.-Prof. Dr.-Ing.
Markus Rupp
Singular Value Decomposition
Note further that if A is from R, then all matrices (U, Σ, V) are from R.
Spectral decomposition:
A = U Σ V^H = Σ_{i=1}^{p=min(m,n)} σ_i u_i v_i^H = Σ_{i=1}^{r=rank(A)} σ_i u_i v_i^H
Example 4.33: Matrix norms (see the check below):
Frobenius norm: ||A||_F^2 = trace(A A^H) = σ_1^2 + σ_2^2 + ... + σ_p^2
l2 (induced) norm: ||A||_2,ind = σ_max
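A quick numpy check (illustrative only) of both norm identities:

  import numpy as np

  A = np.random.randn(4, 3)
  s = np.linalg.svd(A, compute_uv=False)   # singular values only

  # ||A||_F^2 = trace(A A^H) = sum of squared singular values
  print(np.isclose(np.linalg.norm(A, 'fro')**2, np.sum(s**2)))
  # induced l2 norm = largest singular value
  print(np.isclose(np.linalg.norm(A, 2), s[0]))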
Singular Value Decomposition
Note that next to the described form of the SVD
A = U Σ V^H = [U_1 U_2] [Σ_+ O; O O] [V_1 V_2]^H
there is also another one:
A = U Σ V^H = U_1 Σ_+ V_1^H,
called the thin SVD.
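A small numpy sketch (my illustration; numpy's full_matrices flag selects between the two forms):

  import numpy as np

  A = np.random.randn(6, 2)

  # full SVD: U is 6 x 6, most of it multiplies zero rows of Sigma
  U, s, Vh = np.linalg.svd(A, full_matrices=True)
  # thin SVD: only the first n = 2 columns of U are kept
  U1, s1, Vh1 = np.linalg.svd(A, full_matrices=False)

  print(U.shape, U1.shape)                       # (6, 6) (6, 2)
  print(np.allclose(A, U1 @ np.diag(s1) @ Vh1))  # A = U1 Sigma+ V1^H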
Singular Value Decomposition
What is the consequence of the spectral decomposition?
It allows us to decompose A into a product of two (three) matrices whose size depends on the rank → low-rank compression (see the sketch after the formula below).
A = U Σ V^H = Σ_{i=1}^{p} σ_i u_i v_i^H = Σ_{i=1}^{r} σ_i u_i v_i^H = Ũ Σ̃ Ṽ^H
with Ũ = [u_1, ..., u_r], Ṽ = [v_1, ..., v_r], Σ̃ = diag(σ_1, ..., σ_r).
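A minimal numpy sketch of low-rank compression (my illustration; the target rank k = 10 is an arbitrary assumption):

  import numpy as np

  A = np.random.randn(100, 80)
  U, s, Vh = np.linalg.svd(A, full_matrices=False)

  k = 10                                          # assumed target rank
  A_k = U[:, :k] @ np.diag(s[:k]) @ Vh[:k, :]     # sum of k rank-1 terms

  # storage: k*(m + n + 1) numbers instead of m*n
  print(A_k.shape, np.linalg.norm(A - A_k, 2), s[k])  # l2 error equals sigma_{k+1}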
Singular Value Decomposition
What is the consequence of the spectral decomposition?
It also yields an extremal characterization of σ_max, similar to the Rayleigh quotient:
A = U Σ V^H = Σ_{i=1}^{p=min(m,n)} σ_i u_i v_i^H = Σ_{i=1}^{r=rank(A)} σ_i u_i v_i^H
0 ≤ |x^H A y| / (||x||_2 ||y||_2) ≤ σ_max for all x, y ≠ 0,
with equality on the right for x = u_1, y = v_1.
Singular Value Decomposition
Let B = A A^H be Hermitian. Then we have:
B = A A^H = U D U^H = U Σ V^H V Σ^T U^H = U Σ Σ^T U^H.
In this case the eigenvalues of B = A A^H equal the squares of the singular values of A.
The rank of an arbitrary matrix A is given by the number r of non-zero singular values: rank(A) = r.
Partition U = [U_1 U_2]:
R(A) = {b ∈ C^m : b = A x}
     = {b ∈ C^m : b = U Σ V^H x}
     = {b ∈ C^m : b = U Σ y}          (y = V^H x)
     = {b ∈ C^m : b = U_1 Σ_+ ỹ}      (only the first r rows of Σ are non-zero)
     = {b ∈ C^m : b = U_1 z} = span(U_1)
Singular Value Decomposition
Thus we find a new interpretation of the four fundamental subspaces (verified numerically below):
R(A) = span(U_1)      N(A^H) = span(U_2)
R(A^H) = span(V_1)    N(A) = span(V_2)
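A numpy sketch (my illustration) verifying this interpretation on a random rank-deficient matrix:

  import numpy as np

  # build a 5 x 4 matrix of rank r = 2
  rng = np.random.default_rng(0)
  A = rng.standard_normal((5, 2)) @ rng.standard_normal((2, 4))

  U, s, Vh = np.linalg.svd(A)
  r = np.sum(s > 1e-10)
  U1, U2 = U[:, :r], U[:, r:]
  V1, V2 = Vh[:r, :].conj().T, Vh[r:, :].conj().T

  print(np.allclose(A @ V2, 0))                  # N(A) = span(V2)
  print(np.allclose(A.conj().T @ U2, 0))         # N(A^H) = span(U2)
  # columns of A lie in span(U1): projecting onto U1 changes nothing
  print(np.allclose(U1 @ (U1.conj().T @ A), A))  # R(A) = span(U1)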
Singular Value Decomposition
Consider again the LS problem with m > n observations (overdetermined): b from C^m, x from C^n:
min_x ||A x - b||_2^2 = min_x ||U Σ V^H x - b||_2^2 = min_x ||Σ V^H x - U^H b||_2^2 = min_x̃ ||Σ x̃ - b̃||_2^2
with x̃ = V^H x and b̃ = U^H b, where
Σ x̃ = [diag(σ_1, ..., σ_n); O] [x̃_1, ..., x̃_n]^T and b̃ = [b̃_1, ..., b̃_n, b̃_{n+1}, ..., b̃_m]^T.
Singular Value Decomposition
By the particular structure of Σ, the lower rows n+1..m of b̃ are eliminated: x̃ cannot influence them, so only the first n rows enter the minimization.
Σ = [Σ_+; O] with Σ_+ = diag(σ_1, ..., σ_r, 0, ..., 0),
Σ^# = [diag(1/σ_1, ..., 1/σ_r, 0, ..., 0) O].
Singular Value Decomposition
Thus: x̃ = Σ^# b̃ → x = V Σ^# U^H b. Pseudoinverse: A^# = V Σ^# U^H = (A^H A)^{-1} A^H (for full column rank).
Consider now the LS solution of the overdetermined system (see the sketch below):
x = (A^H A)^{-1} A^H b
  = (V Σ^T Σ V^H)^{-1} V Σ^T U^H b
  = V (Σ^T Σ)^{-1} Σ^T U^H b
  = V Σ^# U^H b,   since (Σ^T Σ)^{-1} Σ^T = Σ^#.
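A short numpy sketch (illustrative; assumes full column rank) comparing the SVD route with the normal equations:

  import numpy as np

  m, n = 8, 3
  A = np.random.randn(m, n)
  b = np.random.randn(m)

  # SVD route: x = V Sigma# U^H b
  U, s, Vh = np.linalg.svd(A, full_matrices=False)
  x_svd = Vh.conj().T @ ((U.conj().T @ b) / s)

  # normal equations: x = (A^H A)^{-1} A^H b
  x_ne = np.linalg.solve(A.T @ A, A.T @ b)

  print(np.allclose(x_svd, x_ne))
  print(np.allclose(x_svd, np.linalg.pinv(A) @ b))  # numpy's pseudoinverse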
Singular Value Decomposition
The LS method thus searches in the reduced observation space C^n for the solution with the smallest norm.
How does this relate to the underdetermined solution? Let us consider now m < n, b from C^m, x from C^n:
Σ x̃ = [diag(σ_1, ..., σ_m) O] [x̃_1, ..., x̃_m, x̃_{m+1}, ..., x̃_n]^T = b̃
In this case the components x̃_{m+1}, ..., x̃_n of the parameter space are eliminated: they have no influence on b̃.
Singular Value Decomposition
Let us thus consider the solution of the underdetermined LS problem:
x = A^H (A A^H)^{-1} b
  = V Σ^T U^H (U Σ Σ^T U^H)^{-1} b
  = V Σ^T (Σ Σ^T)^{-1} U^H b
  = V Σ^# U^H b,   since Σ^T (Σ Σ^T)^{-1} = Σ^#.
The underdetermined LS method also finds a minimum-norm solution, however now in a reduced parameter space (a small check follows below).
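A small numpy check (my illustration) that the underdetermined solution fits exactly and has minimum norm:

  import numpy as np

  m, n = 3, 6                       # underdetermined: fewer equations than unknowns
  A = np.random.randn(m, n)
  b = np.random.randn(m)

  # minimum-norm solution x = A^H (A A^H)^{-1} b = V Sigma# U^H b
  x = A.T @ np.linalg.solve(A @ A.T, b)
  print(np.allclose(A @ x, b))      # exact fit

  x2 = x + np.random.randn(n) * 0.1                      # perturb...
  x2 = x2 - A.T @ np.linalg.solve(A @ A.T, A @ x2 - b)   # ...re-project onto solution set
  print(np.allclose(A @ x2, b), np.linalg.norm(x) <= np.linalg.norm(x2))  # x has min norm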
Example 4.34: Basic MIMO System
(Block diagram: symbols s, scaled by sqrt(P/2), pass the precoder B, the transmit matrix T, the channel H, and the receive filter.)
ŷ = H x + n → ŝ = B H T sqrt(P/2) s + B n
SVD: H = U Σ V^H. Choose T = V, B = U^H:
ŝ = U^H U Σ V^H V sqrt(P/2) s + n' = Σ sqrt(P/2) s + n'
Each symbol stream i thus sees its own scalar gain σ_i with noise n_i (a numerical check follows below).
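A numpy sketch (my illustration; the 4x4 channel is an arbitrary assumption) of the SVD precoding and equalization:

  import numpy as np

  rng = np.random.default_rng(1)
  H = rng.standard_normal((4, 4))          # assumed 4 x 4 MIMO channel

  U, s, Vh = np.linalg.svd(H)
  T = Vh.conj().T                          # precoder T = V
  B = U.conj().T                           # receive filter B = U^H

  # B H T is diagonal: r = rank(H) independent scalar subchannels
  print(np.allclose(B @ H @ T, np.diag(s)))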
Example 4.34: Basic MIMO System
The complicated MIMO system can now be described as r = rank(H) independent subsystems. Maximum channel capacity:
C = max_{R_TT: trace(R_TT) = K} log2 det( I + (SNR/N) H R_TT H^H )
  = max_{P_i: Σ P_i = K} log2 Π_{i=1}^{r} (1 + (SNR/N) σ_i^2 P_i)
  = max_{P_i: Σ P_i = K} Σ_{i=1}^{r} log2(1 + (SNR/N) σ_i^2 P_i)
Waterfilling solution due to Claude Shannon (1948). (See the sketch below.)
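A minimal waterfilling sketch (my illustration; the bisection on the water level and the gains g_i = (SNR/N) σ_i^2 with budget K are assumptions for the demo):

  import numpy as np

  def waterfilling(g, K, iters=60):
      """Maximize sum(log2(1 + g_i * P_i)) s.t. sum(P_i) = K, P_i >= 0.
      Optimal P_i = max(0, mu - 1/g_i) for a water level mu found by bisection."""
      lo, hi = 0.0, K + np.max(1.0 / g)
      for _ in range(iters):
          mu = 0.5 * (lo + hi)
          P = np.maximum(0.0, mu - 1.0 / g)
          lo, hi = (mu, hi) if P.sum() < K else (lo, mu)
      return np.maximum(0.0, 0.5 * (lo + hi) - 1.0 / g)

  g = np.array([2.0, 1.0, 0.2, 0.05])   # assumed subchannel gains (SNR/N) * sigma_i^2
  P = waterfilling(g, K=4.0)
  print(P, P.sum())                     # strong subchannels get more power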
Example 4.35
Consider W ∈ R^{m×n}, m > n; solve: W = W W^T W.
SVD: W = U Σ V^T. We have:
U Σ V^T = (U Σ V^T)(V Σ^T U^T)(U Σ V^T) = U (Σ Σ^T Σ) V^T → Σ = Σ Σ^T Σ
Thus: U, V arbitrary (but unitary); the diagonal entries of Σ satisfy σ = σ^3, i.e. entries -1, 0, 1; since singular values are never negative, only 0 and 1 remain.
Example 4.35
Consider the matrix iteration with W_k of dimension m × n, m > n:
W_{k+1} = W_k + µ W_k (I - W_k^T W_k), with µ > 0 and W_0 = X from R with full rank.
Question: where does W_k (or W_k^T W_k) converge to?
SVD: Let W_k = U Σ V^T. Then:
W_k + µ W_k (I - W_k^T W_k) = U Σ V^T + µ U Σ V^T (I - V Σ^T Σ V^T) = U [Σ + µ Σ (I - Σ^T Σ)] V^T
Example 4.35
W_k + µ W_k (I - W_k^T W_k) = U [Σ + µ Σ (I - Σ^T Σ)] V^T
We can describe W_{k+1} = U Σ_{k+1} V^T similarly: Σ_{k+1} = Σ_k + µ Σ_k (I - Σ_k^T Σ_k).
On its diagonal:
σ_{k+1} = σ_k + µ σ_k (1 - σ_k^2) = σ_k [1 + µ (1 - σ_k^2)]
1 - σ_{k+1} = (1 - σ_k) [1 - µ σ_k (1 + σ_k)]
All σ_k move towards 1 → W_∞ = U I_+ V^T with I_+ = [I; O], so W_∞^T W_∞ = I.
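A numpy sketch (my illustration, using the conservative step size bound derived on the following slide):

  import numpy as np

  rng = np.random.default_rng(2)
  W = rng.standard_normal((6, 3))           # W_0 = X, full rank

  for _ in range(200):
      mu = 2.0 / (0.25 + 2.0 * np.trace(W.T @ W))   # conservative bound from the slides
      W = W + mu * W @ (np.eye(3) - W.T @ W)

  print(np.allclose(W.T @ W, np.eye(3)))    # all singular values driven to 1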
Example 4.35
µ determines the speed of the movement: (1 - σ_{k+1}) = (1 - σ_k) [1 - µ σ_k (1 + σ_k)].
Convergence condition: µ < 2/[σ_k,max + σ_k,max^2].
Note that 0.25 + 2x^2 ≥ x + x^2 for x > 0. Thus µ < 2/[0.25 + 2 σ_k,max^2] suffices.
Since 2/[0.25 + 2 Σ_i σ_k,i^2] < 2/[0.25 + 2 σ_k,max^2], a conservative bound is:
µ < 2/[0.25 + 2 trace(W_k^T W_k)]
Example 4.35
Applications:
Decorrelate a vector process: W_∞^T W_∞ = I
Blind Source Separation
Invert a matrix: solve W_∞^T R^2 W_∞ = I
Root of a matrix: W_∞^T R^{-1} W_∞ = I
Example 4.36: CoMP Problem 2
But does this also work for K < N (K users, N antennas)?
Even if the desired user receives little power?
SLR = max_x |h_1^T x|^2 / ||H_B^T x||^2 = max_x (x^T h_1 h_1^T x) / (x^T H_B H_B^T x)
For x_opt ∈ N(H_B^T): SLR(x_opt) → ∞,
since H_B is not of full rank.
CoMP Problem 2
Apply the SVD on B (B is Hermitian, thus V = U):
B = H_B H_B^T = U Σ U^T with U = [u_1, ..., u_{K-1}, u_K, ..., u_N] and Σ = diag(σ_1^2, ..., σ_{K-1}^2, 0, ..., 0).
Substitute x = U y:
SLR = max_y |h_1^T U y|^2 / (y^T U^T H_B H_B^T U y) = max_y |h_1^T U y|^2 / (y^T Σ y)
For every y of the form y = [0, ..., 0, y_K, ..., y_N]^T the denominator vanishes: here, any such y maximizes the SLR (SLR → ∞).
Among these, find the y that maximizes the signal energy |h_1^T U y|^2: then the desired signal power is maximal, and so is the SNLR (a numerical sketch follows below).
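A numpy sketch (my illustration; h1 and HB are assumed random stand-ins for the desired and interfering channels) of picking, among the nullspace directions of H_B^T, the one with maximal desired signal energy:

  import numpy as np

  rng = np.random.default_rng(3)
  N, K = 6, 3                                # N antennas, K users
  h1 = rng.standard_normal(N)                # desired user's channel
  HB = rng.standard_normal((N, K - 1))       # K-1 interferers' channels

  B = HB @ HB.T                              # Hermitian, rank K-1 < N
  w, U = np.linalg.eigh(B)                   # eigenvalues ascending
  U2 = U[:, : N - (K - 1)]                   # nullspace directions of H_B^T

  # any x in span(U2) nulls the interference; pick the one with max |h1^T x|
  x = U2 @ (U2.T @ h1)
  x /= np.linalg.norm(x)

  print(np.allclose(HB.T @ x, 0))            # zero leakage -> SLR = infinity
  print(abs(h1 @ x))                         # maximal signal energy within span(U2)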
CoMP Problem 2
Alternatively consider the SNLR: this leads back to the original Rayleigh quotient:
SNLR = max_x |h_1^T x|^2 / (||H_B^T x||^2 + σ_v^2 ||x||^2)
Singular Value Decomposition
Applications of the SVD:
Subspace techniques such as PHD, MUSIC and ESPRIT.
Total Least Squares: min ||Ax - b|| with a distorted observation matrix A.
Solution of numerically sensitive problems such as matrix inversion.
Condition Numbers
Example 4.37: Consider a matrix problem in which a few singular values are very small:
Σ = diag(σ_max, ..., ε), ε ≪ σ_max → Σ^{-1} ≠ Σ^#
Inverting the matrix leads to a strong amplification particularly of these small values (by 1/ε), which can lead to numerical errors. It can be better to set these values to zero and compute the pseudoinverse Σ^# instead.
Condition Numbers
Note, however, that setting small singular values to zero also reduces the rank of the matrix. Such methods are thus called Rank Reducing Methods.
A measure that tells the quality of a matrix with respect to its invertibility is its condition number:
κ(A) = ||A||_2 ||A^{-1}||_2
Condition Numbers
The smallest condition number is one! For example, for identity matrices, permutation matrices, and square unitary matrices. Otherwise we have κ > 1.
For regular matrices A we have:
||A||_2 = max_{x ≠ 0} ||A x||_2 / ||x||_2 = σ_max
||A^{-1}||_2 = max_{x ≠ 0} ||A^{-1} x||_2 / ||x||_2 = 1 / min_{x ≠ 0} (||A x||_2 / ||x||_2) = 1/σ_min
κ(A) = σ_max / σ_min   (a quick check follows below)
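A quick numpy check (illustrative) of κ(A) = σ_max/σ_min:

  import numpy as np

  A = np.random.randn(4, 4)
  s = np.linalg.svd(A, compute_uv=False)

  kappa = s[0] / s[-1]                       # sigma_max / sigma_min
  print(np.isclose(kappa, np.linalg.cond(A, 2)))
  print(np.isclose(kappa, np.linalg.norm(A, 2) * np.linalg.norm(np.linalg.inv(A), 2)))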
Condition Numbers
For Hermitian matrices A the condition number is given by the ratio of the magnitudes of their (real-valued) eigenvalues:
κ(A) = |λ_max(A)| / |λ_min(A)|
If A is not Hermitian, we can alternatively compute the square roots of the eigenvalues of A^H A:
κ(A) = sqrt( λ_max(A^H A) / λ_min(A^H A) )
Condition Numbers
Consider the following distorted problem: A(x + ∆x) = b + ∆b. We find that:
||∆x|| / ||x|| ≤ κ(A) ||∆b|| / ||b||
The condition number thus determines how much an error on one side of the equation impacts the other side (see the sketch below).
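A closing numpy sketch (my illustration; the Hilbert matrix is an assumed ill-conditioned example) of how κ(A) bounds the error amplification:

  import numpy as np

  n = 6
  A = 1.0 / (np.arange(n)[:, None] + np.arange(n)[None, :] + 1.0)  # Hilbert matrix, ill-conditioned
  x = np.ones(n)
  b = A @ x

  db = 1e-10 * np.random.randn(n)            # tiny perturbation of the right-hand side
  dx = np.linalg.solve(A, b + db) - x

  rel_out = np.linalg.norm(dx) / np.linalg.norm(x)
  rel_in = np.linalg.norm(db) / np.linalg.norm(b)
  print(rel_out <= np.linalg.cond(A, 2) * rel_in, rel_out / rel_in)  # bound holds; amplification is huge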