Radial Basis Function Networks Notes © Bill Wilson, 2008 1
Radial Basis Function Networks
Aim • to study radial basis function networks
Reference Haykin, Chapter 7 provides a theoretical treatment.
Another view is found in section 5.3 of Karray & de Silva: Soft Computing and
Intelligent Systems Design, Addison-Wesley, 2004. ISBN 0 321 11617 8
Unfortunately this is not currently in UNSW Library (but is on order).
Keywords radial basis function, receptive field, hybrid learning
Plan • define radial basis function (RBF)
• introduce RBF network
• training (estimating parameters for) RBFNs
• example
• applications
Introduction
• Radial basis function networks are feedforward networks, with some nodes using a different
activation function from the regular sigmoid node in a backprop net – a radial activation function:
[Figure: a feedforward network with inputs x_1, …, x_n, a hidden layer of RBF nodes, and output
units o_1, …, o_m, connected to the hidden layer by weights w_ij.]
• The connections from the input units to the hidden units are fixed at 1.
• Each RBF/hidden node has two parameters, termed the centre v (a vector) and the scalar width
σ. The function computed by the i-th node will be of the form

  g_i(x) = r( ||x − v_i|| / σ_i )
Radial Activation Functions
• Clearly the value of g_i(x) depends on the distance of the input vector x from the centre v_i,
scaled by the width σ_i. All points x that are equally close to v_i will be treated in the same way
by the function g_i(x). Hence the name radial activation function.
• The most widely used radial basis function is the Gaussian function:

  g_i(x) = exp( −||x − v_i||² / (2σ_i²) )
• There is no squashing function in the output neurons: the value from the j-th output unit of our
RBF network, assuming n hidden layer units and r output units, is:

  o_j(x) = Σ_{i=1}^n w_ij g_i(x),  j = 1, …, r
• Another possible radial activation function is

  g_i(x) = 1 / (1 + exp( ||x − v_i||² / σ_i² ))
• Note that in both cases, g_i(x) → 0, and so o_j(x) → 0, as ||x − v_i|| → ∞.
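As a concrete sketch of the network just described, the forward pass with Gaussian RBF nodes can be written in a few lines of numpy (a minimal sketch: the centre, width and weight values below are made-up illustrative numbers, not from these notes):

```python
import numpy as np

def rbf_forward(x, centres, widths, W):
    """Forward pass of an RBF network with Gaussian hidden units.

    x       : input vector, shape (d,)
    centres : the v_i, shape (n, d) -- one centre per hidden node
    widths  : the sigma_i, shape (n,)
    W       : the w_ij, shape (n, r) -- hidden-to-output weights
    Returns o(x), shape (r,): the linear (unsquashed) outputs.
    """
    # g_i(x) = exp(-||x - v_i||^2 / (2 sigma_i^2))
    sq_dist = np.sum((centres - x) ** 2, axis=1)
    g = np.exp(-sq_dist / (2.0 * widths ** 2))
    # o_j(x) = sum_i w_ij g_i(x) -- no squashing at the output
    return g @ W

# Tiny example: 2 inputs, 3 hidden RBF nodes, 1 output (made-up values)
centres = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
widths = np.array([0.5, 0.5, 0.5])
W = np.array([[1.0], [2.0], [3.0]])
print(rbf_forward(np.array([0.0, 0.0]), centres, widths, W))
```

At the input (0, 0) the first node fires at full strength (g_1 = 1) and the other two, at distance 1 from their centres, contribute exp(−2) each.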
“Training” Radial Activation Functions
• Training the RBF involves estimating the best values for the v_i, the σ_i, and the w_ij.
• The standard technique is the hybrid approach, which is a two-stage learning strategy:
1) An unsupervised clustering algorithm is used to find the v_i and σ_i for the RBFs.
There are a number of possible clustering algorithms to try: the random input vector method,
the k-means-based method, the maximum likelihood estimate-based method, the standard
deviations-based method, and the self-organising map method.
2) A supervised algorithm is used to find values for the w_ij.
There is only one layer with weights, so this is not as complicated as with MLPs/backprop.
We’ll describe one way to do this.
Finding Centres and Widths
• The random input vector method involves choosing the locations of centres randomly from the
training data set. All RBFs get the same width

  σ = d_max / √(2m₁),

where d_max is the maximum distance between the chosen centres, and m₁ is the number of
centres.
• An alternative is to give centres differing widths depending on the local data density (broader
width if density is low).
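The shared-width rule σ = d_max / √(2m₁) is easy to compute directly from the chosen centres; a minimal numpy sketch (the three 1-D centres below are arbitrary illustrative values):

```python
import numpy as np

def shared_width(centres):
    """sigma = d_max / sqrt(2 m1): d_max is the largest distance
    between any two chosen centres, m1 the number of centres."""
    m1 = len(centres)
    # all pairwise differences between centres, shape (m1, m1, d)
    diffs = centres[:, None, :] - centres[None, :, :]
    d_max = np.sqrt((diffs ** 2).sum(axis=2)).max()
    return d_max / np.sqrt(2 * m1)

centres = np.array([[0.0], [1.0], [3.0]])  # three 1-D centres
print(shared_width(centres))               # here d_max = 3 and m1 = 3
```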
• The k-means-based method is a clustering algorithm:
1 Initialise: choose small random (but distinct) values for the initial centres t_k(0).
2 Sample: choose a vector x from the input space – we’ll call this x(n).
3 Match: let k(x) be the index of the centre t_k(n) that minimises ||x(n) − t_k(n)||.
4 Update: set t_k(n+1) = t_k(n) + η[x(n) − t_k(n)] for k = k(x), where η is a small
learning-rate parameter; t_k(n+1) = t_k(n) otherwise.
5 Continue: add 1 to n and go to step 2. Continue until no further significant changes.
• The width(s) can then be set in the same way as for the random input vector method.
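Steps 1–5 above can be sketched in numpy as follows (a minimal sketch: the learning rate eta, the fixed iteration budget standing in for the "no further significant changes" test, and the toy data are all illustrative choices, not part of the notes):

```python
import numpy as np

def kmeans_centres(X, m1, eta=0.1, n_iters=1000, seed=0):
    """Sequential k-means as in steps 1-5 above.

    X  : training inputs, shape (N, d)
    m1 : number of centres
    eta: learning rate (illustrative choice)
    Returns the centres t_k, shape (m1, d).
    """
    rng = np.random.default_rng(seed)
    # 1 Initialise: small (distinct, with probability 1) random centres
    t = rng.normal(scale=0.01, size=(m1, X.shape[1]))
    for _ in range(n_iters):
        # 2 Sample: pick an input vector x(n)
        x = X[rng.integers(len(X))]
        # 3 Match: index of the nearest centre
        k = int(np.argmin(np.sum((t - x) ** 2, axis=1)))
        # 4 Update: move only the winning centre towards x(n)
        t[k] += eta * (x - t[k])
        # 5 Continue (a fixed iteration budget stands in for the
        #   "no further significant changes" stopping test)
    return t

# Two well-separated clusters; centres should settle near (0,0) and (5,5)
X = np.vstack([np.zeros((50, 2)), 5.0 * np.ones((50, 2))])
print(np.round(kmeans_centres(X, 2), 2))
```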
Finding Weights
• A range of supervised learning methods can be used to find the weight values. Here is one that is
appropriate for function interpolation, given a set of input vectors x_k and corresponding output
values d_k. In this case, it makes sense to use one RBF node for each pair (x_k, d_k), and the
centres for the RBF nodes are just the x_k, i.e. v_k = x_k.
• Let D be the matrix of desired outputs – i.e. if the function to be interpolated is f : ℝⁿ → ℝᵐ, and
d_k is the desired output for the input x_k, so that f(x_k) = d_k, then D = [d_k], k = 1, …, p, where
p is the number of hidden/RBF nodes. We can also write

  d_kj = f_j(x_k) = Σ_{i=1}^p w_ij g_i(x_k).
• Let g_ki be g_i(x_k), i.e. exp( −||x_k − x_i||² / (2σ_i²) ) if we are using Gaussian RBFs. We can
write G = [g_ki]. Then

  d_kj = f_j(x_k) = Σ_{i=1}^p w_ij g_i(x_k) = Σ_{i=1}^p w_ij g_ki = Σ_{i=1}^p g_ki w_ij = [GW]_kj,

that is, D = GW.
Finding Weights 2
• Since D = GW, if G is invertible, then W = G⁻¹D.
• If G is not invertible (because |G| = 0, or G is non-square, or G is ill-conditioned, i.e. |G| is close
to 0), then we replace G⁻¹ by G⁺, the Moore-Penrose pseudo-inverse of G (also known as the
generalised inverse), which is the unique matrix satisfying:

  (1) G = GG⁺G   (2) G⁺ = G⁺GG⁺   (3)¹ (GG⁺)* = GG⁺   (4) (G⁺G)* = G⁺G

• If the columns of G are linearly independent, then GᵀG is invertible, and then G⁺ = (GᵀG)⁻¹Gᵀ.
Note that then G⁺G = (GᵀG)⁻¹GᵀG = I. So W = G⁺D.

¹ A* is the conjugate transpose of A. This is the same as Aᵀ if A is all real (not complex).
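Putting the interpolation recipe together: numpy's np.linalg.pinv computes the Moore-Penrose pseudo-inverse, so W = G⁺D is one line (a sketch; the target function f(x) = x² and the training points are arbitrary illustrative choices):

```python
import numpy as np

def fit_rbf_weights(X, D, sigma):
    """One Gaussian RBF node per training point (v_k = x_k); W = G+ D."""
    # G[k, i] = g_i(x_k) = exp(-||x_k - x_i||^2 / (2 sigma^2))
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=2)
    G = np.exp(-sq / (2.0 * sigma ** 2))
    # np.linalg.pinv gives the Moore-Penrose pseudo-inverse G+
    return G, np.linalg.pinv(G) @ D

# Interpolate f(x) = x^2 at three points with a common width
X = np.array([[-1.0], [0.0], [1.0]])
D = X ** 2                      # desired outputs d_k = f(x_k)
G, W = fit_rbf_weights(X, D, sigma=1.0)
print(np.allclose(G @ W, D))    # D = GW is reproduced at the training points
```

Here G is square and invertible (distinct centres, Gaussian RBFs), so the pseudo-inverse coincides with G⁻¹ and the fit is exact at the training points.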
Effect of Width on Approximation Performance
• The diagrams on the next slide use the same centres, but three different values of σ – with the
same σ for all RBFs in each graph. The top pair of graphs show σ = 0.5, the middle pair have
σ = 2.1, and the bottom pair have σ = 8.5.
• Inspection shows us that with the small value of σ the approximation is “too wiggly”, for the
large value of σ the approximation is too smooth, and the middle value is “just right”.
• In practice, one would use an unsupervised algorithm to estimate the best value for σ, as
described above.
• The diagram comes from Karray & de Silva, Figure 5.8 a, b & c:
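The qualitative effect can also be reproduced numerically: the sketch below interpolates f(x) = x² at three points and compares the error at an off-training point for a very narrow width and a moderate one (illustrative values only; the notes' figure uses a different target function, from Karray & de Silva):

```python
import numpy as np

# Interpolate f(x) = x^2 at x = -1, 0, 1 with a common width sigma,
# then evaluate the approximation at the off-training point x = 0.5.
X = np.array([[-1.0], [0.0], [1.0]])
D = X ** 2

def interpolant(sigma):
    """Fit the RBF interpolant and return it as a callable."""
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=2)
    G = np.exp(-sq / (2.0 * sigma ** 2))
    W = np.linalg.pinv(G) @ D
    def f(x):
        g = np.exp(-np.sum((x - X) ** 2, axis=1) / (2.0 * sigma ** 2))
        return (g @ W).item()
    return f

true_val = 0.25  # f(0.5)
for sigma in (0.05, 1.0):
    err = abs(interpolant(sigma)(np.array([0.5])) - true_val)
    print(f"sigma = {sigma}: |error at x = 0.5| = {err:.3f}")
```

With σ = 0.05 each RBF is a narrow spike, so the approximation collapses towards 0 between the training points ("too wiggly"); the moderate width gives a much smaller off-training error.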
[Figure 5.8 a, b & c from Karray & de Silva appears here: the approximations for σ = 0.5, 2.1 and 8.5.]
Applications of RBF Networks
• RBF networks have universal approximation capabilities (on compact subsets of ℝⁿ), and they
have efficient training algorithms.
• They have been used for
– control systems
– audio and video signal processing
– pattern recognition
– chaotic time series prediction (e.g. weather forecasting, power load forecasting).
David Broomhead
Professor of Applied Mathematics at UMIST and
works in applied nonlinear dynamical systems
and mathematical biology. Since the early 1980s
he has developed methods for time series
analysis and nonlinear signal processing using
techniques from nonlinear dynamics, including
modelling oculomotor control. In 1989 he was
awarded the John Benjamin Memorial Prize for
his work on radial basis function neural
networks.
D.S. Broomhead and D. Lowe. Multivariable
functional interpolation and adaptive
networks. Complex Systems, 2:321-355. 1988.