Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian...

42
Bayesian Framework EE 645 ZHAO XIN
  • date post

    21-Dec-2015
  • Category

    Documents

  • view

    222
  • download

    0

Transcript of Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian...

Page 1: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Bayesian Framework

EE 645ZHAO XIN

Page 2: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

A Brief Introduction to Bayesian Framework

The Bayesian Philosophy Bayesian Neural Network Some Discussion on

Priors

Page 3: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Bayesian’s Rule

),...,(

)(,...,,...,

)()1(

)()1()()1(

n

nn

xxP

PxxPxxP

Likelihood Prior Distribution Normalizing Constant

Page 4: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Bayesian Prediction

dxxPxP

xxxP

nn

nn

)()1()1(

)()1()1(

,...,

,...,

Page 5: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

dyxyxPxfy

dyxyxPxyP

yxyxxyP

nnnk

nk

nnnn

nnnn

)),(),...,,((),(ˆ

)),(),...,,((),(

)),(),...,,(,(

)()()1()1()1()1(

)()()1()1(1()1(

)()()1()1()1()1(

Page 6: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Hierarchical Model

p

kk

p

dPP

PP

1

1

)(

),...,()(

Page 7: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

An Example Bayesian Network

)(1xf )(2xf

)(1xh )(2xh )(3xh )(4xh

1x 2x 3x

Page 8: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

kkkk

k

iiijjj

jjjkkk

yxfxyP

xwaxh

xhwbxf

)2/))((exp(2

1

)()(

)()(

22

Page 9: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Some Discussion on Priors Priors Converging to Gaussian

Process If the number of Hidden Units is infinite Priors Leads to smooth and Brownian

Functions Fractional Brownian Priors

Priors Converging to Non-Gaussian Stable Process

Page 10: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Bayesian Framework for LS RBF Kernel SVM MUD

Basic Problem and Solution Probabilistic Interpretation of the LS SVM

First Level Inference Second Level Inference Third Level Inference

Basic MUD Model Results and Discussion Summary

Page 11: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Basic Problem for LS SVM

Niebxwy

JJ

ewwebwJ

yRx

yxyxD

iiT

i

Dw

N

ii

Ti

ebw

iK

i

NN

i

,...,1 1])([ S.T.

22),,(min

1,1 , where

,),),...(,(Dataset Given

1

22

,,

11

Page 12: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Basic Solution for LS SVM

N

iiii

jijT

iij

NN

vN

Nv

Tv

bxxKysignxy

xxKxx

eee

yyY

Y

b

I

1

11

1

1

]),([)(

),,()()(

]',...,[],,...,[

],1,...,1[1,]',...,[

0

1

10

Page 13: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

The Formula for SVM

space) dual in thesolution thefind wefact,(In

)(),(ˆ

)(),( and , where

,...,1 ))(),((

)( )(

1,

1

N

iMAPiiiMAP

di

Ki

i

N

jijji

Mappingi

Mappingi

bxxywy

RxxRxx

Niebxxwy

xxxx

Page 14: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

First Level Inference

),log,log(

),log,log,(),log,log,,(

,log,log,,

model and )},{(Given 1

HDP

HbwPHbwDP

HDbwP

HyxD Niii

Page 15: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Some Assumptions of this Level Separable Gaussian Prior for conditional P(w,b) Independent Data Points Gaussian Distributed Errors Variance of b goes to infinite

)2

exp(

)2

exp(2

1)

2exp(

2,log,

2

22

1

2

2

ww

bwwHbwP

T

bb

T

N f

Page 16: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

)2

exp(

)2

exp(2

),log,,(

),log,,,(),log,log,,(

1

2

1

22

1

1

1

N

ii

N

ii

N

ii

N

iii

e

e

HbweP

HbwyxPHbwDP

Page 17: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Result of the First Level

classifier SVM LS Kernel classic of

solution thebe willb bias and wfor weight Estimation

PosterioriA Maximum thefindcan weequation, By this

)),(exp()2

exp()2

exp(

),log,log,,(

21

2 bwJeww

HDbwPN

ii

T

Page 18: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Conditional Distribution of Weight w and Bias b

22

22

2

22

22

2

1

1

1

],[

)2

1exp(

det)2(

1

),log,log,,(

b

J

wb

Jbw

J

w

J

HQ

bbwwg

where

gQgQ

HDbwP

TMAPMAP

T

N

Page 19: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Unbalance Case of 1st LevelIf the means of +1 class and –1 class are not perfectly project to +1 and –1, the

bias term will come. We will introduce 2 new random variables as followed.

)2

exp(2

)2

)ˆ))(),(((exp(

2

),log,,,1,(

22

1

embxxyw

HbwyxeP

di

N

iii

Page 20: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

),log,log,log,,1()1(),log,log,log,,1()1(

),log,log,log,,()(

),log,log,log,,(

))(2

ˆexp()(2

),log,log,,,1(

11

2

2

111

HDyxPyPHDyxPyP

HDyxPyP

HDxyP

m

HbwyxP

Page 21: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Last Solution for First Level

2

1

*

1

1*

1

1

)(

),(1

ˆ

])1(

)1(log

ˆˆ2

ˆˆ),([)(

Ijji

N

iiid

dd

ddN

iiii

xxKyN

mwhere

yP

yP

mm

mmbxxKysignxy

Page 22: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Second Level Inference

d.)distribute uniform separable is )log,(logprior (Assume

),log,log(

)(

)log,(log),log,log(),log,(log

formula following as , and eter hyperparam

toRule Bayes'apply will welevel, In this

HP

HDP

HDP

HPHDPHDP

Page 23: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Result of Second Level Inference

matrix Gram centering of

)( eigenvalue zero-non ofnumber theis and

)()det( where

)),(exp()det(

),log,(log

,

1,

2

iG

Neff

iiG

NeffN

MAPMAP

NN

Neff

NH

bwJH

HDP

f

f

Page 24: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

TNN

N

NT

Neff

iiG

MAPMAP

NM

IMM

MYIMMMYJ

Neff

bwJJ

111

where

1log

2

1 )

11det(log

2

1

)11

(2

1 ),(

log2

1log

2)log(

2

1

),(),(

3,

1,

23,

min

min

Page 25: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Last Solution for Second Level

MYUIDMUYbwEbwE

MYUIDMUYbwE

bwEbwE

bwEN

J

bwEbwENJ

TGNeffGG

TMAPMAPDMAPMAPw

TGNeffGG

TMAPMAPD

N

i iGMAPMAPDMAPMAPw

MAPMAPD

N

iiGMAPMAPDMAPMAPw

11

212

1 ,2

4

1,4

)(2

1 ),(),(

)(2

1 ),(

1

),(),(

),(

)1

log()),(),(()(

If

min

Page 26: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Third Level Inference

d.)distribute uniform is )(prior (Assume

)()()()(

)()()(

formula following as ,parameter model

toRule Bayes'apply will welevel, In this

j

jjj

jj

j

HP

HDPHPHDPDP

HPHDPDHP

H

Page 27: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Some Assumption in this Level

Neff

i iGMAP

iGMAPeff

effD

effD

DD

j

j

jjj

N

HDP

ddHDP

ddHPHDPHDP

1 ,

,

2log

2log

loglog

,

,

11

and 2

,1

2

where, and anceerror variith Gaussian w separable

as edapproxiamt wellbecan ),log,(log Assume

),log,(log

)log,(log),log,log(

Page 28: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Last Solution for Third Level

j

Neff

iiGMAPMAPeffeff

NMAP

ll

DlDl

jMAPMAPj

HDP

N

HDPHDP

themaximize which j ofindex pick the We

)())(1(

,log,log

1,

NeffMAP

loglog

loglog

Page 29: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Some Comments for this Level For Gaussian Kernel machine, the variance

of Gaussian function can represent the model H

It’s impossible to calculate for all the possible model

Luckily, in general, such as in Gaussian Kernel SVM, the performance of classifier is pretty smooth with respect to the varying of model parameter. Therefore, we can just take sample of the model in the area we feel interested.

Page 30: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

A Synchronous CDMA Transmitter

InformatinUser 1

InformatinUser 2

InformationUser K

AWGNChannel

][ ib

SpreadingCoding &

Modulation

SpreadingCoding &

Modulation

SpreadingCoding &

Modulation

)(ty

InformatinUser 1

InformatinUser 2

InformationUser K

AWGNChannel

][ ib

SpreadingCoding &

Modulation

SpreadingCoding &

Modulation

SpreadingCoding &

Modulation

sSynchronou )(ty

Demodulation

Page 31: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

The LS SVM Receiver Diagram

)(ty

Match FilterUser 1

Match FilterUser 2

Match FilterUser K

UserSpace

LS SVMNetwork

),

eter Hyperparam

and ,

(Parameter

b ][ˆ ibk][iY

Page 32: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Results and

Discussions

Page 33: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

First Inference

0 2 4 6 8 10 1210

-4

10-3

10-2

10-1

100

SNR (dB)

PE

R

Performance of First Level Result

Asterisk: Revised LS SVMCircle: LS SVM

7 Users Rho = 0.429 # of Training Pts: 200Ai/A1 = 5

Page 34: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Second Inference

-1 0 1 2 3 4 5100

150

200

250

300

350

log10(C)

J2

Second Reference Plot

Var = 1

Var = 10

Single User AWGN 0 dB Channel

Page 35: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Third Inference (Plot 1)

-1 0 1 2 3 4 5100

150

200

250

300

350

400

log10(C)

J3

Third Inference Plot

0.5 1

5

10

20

Single User AWGN ChannelSNR = 0 dBNo. of Training Pts = 100

Page 36: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Third Inference (Plot 1)

0 1 2 3 4 5 6-1200

-1000

-800

-600

-400

-200

0

200

400

log10(C)

J3

Third Inference Plot

0.1

0.5

1

5

10

Single User AWGN ChannelSNR = 8 dB No. of Training Pts = 100

Page 37: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

A Sample of Parameter Chosen

SNR (dB) Variance 1/C

0 3.98 0.11

2 2.49 0.49

4 3.98 0.28

6 5.01 0.39

8 5.01 0.28

10 10.0 0.34

12 12.6 0.08

Page 38: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Detector Performance

0 2 4 6 8 10 1210

-4

10-3

10-2

10-1

100

SNR (dB)

PE

R

Performance Comparion of Gaussian LS SVM Detector

Circle: Basic Gaussian LS SVM Asterisk: 1st Inference Applied Sqaure: 2nd & 3rd Inference Applied Diamond: MMSE

7 Users Rho = 0.429 Ai/A1 = 5 No. of Training Pts: 200

Page 39: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Some Discussions on this Detector

The first inference does better the performance of LS SVM detector especially in high SNR region by considering the bias term.

The LS SVM detector is very smooth with respect to the varying of those hyper-parameters, which means the adaptive LS SVM will reasonably work well if the channel properties are not varying fast.

The computation for 2nd and 3rd inference are very complex, so it’s not worthwhile to do calculation here. We can choose some approximation formula instead.

Page 40: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Summary of Bayesian Network Pick up a basic neural network. Properly choose the Priors (physically

right and easy for theoretical deduction). Find a reasonable hierarchical framework

(a three-level inference framework is very typical), apply the Bayesian Rule there and find some beneficial assumption to simplify the problem.

Page 41: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Some Comments on Bayesian Framework

It can help us to physically understand a neural network model.

It can theoretically help us to find the way to optimize the parameters and more important those hyper-parameters which can be sometimes impossibly set otherwise.

It even can make up some exist methods in some given problems.

Page 42: Bayesian Framework EE 645 ZHAO XIN. A Brief Introduction to Bayesian Framework The Bayesian Philosophy Bayesian Neural Network Some Discussion on Priors.

Reference Tony V. G., Johan A. K. Suykens, A

Bayesian Framework for Least Square Support Vector Machine Classifiers

N. Cristianini, John S., An Introduction to Support Vector Machine, 2000

Radford M. Neal, Bayesian Learning for Neural Network, 1996

Sergio Verdo, Multiuser Detection