
Introduction to Artificial Intelligence (G51IAI)

Dr Matthew Hyde

Neural Networks

More precisely: “Artificial Neural Networks”

Simulating, on a computer, what we understand about neural networks in the brain


Lecture Outline

Recap on perceptrons
Linear Separability
Learning / Training
The Neuron’s Activation Function


Recap from last lecture

A ‘Perceptron’ is a single-layer NN (one neuron)
Inputs can be any number
Weights on the edges
Output can only be 0 or 1

[Diagram: a perceptron with weighted inputs, threshold θ = 6, and an output Z of 0 or 1]

Truth Tables and Linear Separability


AND function and XOR function

AND:
X1 X2  Z
 1  1  1
 1  0  0
 0  1  0
 0  0  0

XOR:
X1 X2  Z
 1  1  0
 1  0  1
 0  1  1
 0  0  0

These are called “truth tables”


AND function and XOR function

The same truth tables, written with T (true) and F (false):

AND:
X1 X2  Z
 T  T  T
 T  F  F
 F  T  F
 F  F  F

XOR:
X1 X2  Z
 T  T  F
 T  F  T
 F  T  T
 F  F  F

These are called “truth tables”

Important!!!

You can represent any truth table graphically, as a diagram

The diagram is 2-dimensional if there are two inputs, and 3-dimensional if there are three inputs

Examples on the board in the lecture, and in the handouts
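For example (an illustration of mine, not from the handouts), each row of a truth table becomes a point whose coordinates are the input values, coloured by the output, following the black/white dot convention used in Handout 3:

```python
# A minimal sketch: turn the AND truth table into labelled 2-D points.
AND = {(1, 1): 1, (1, 0): 0, (0, 1): 0, (0, 0): 0}

for (x1, x2), z in AND.items():
    colour = "black" if z == 1 else "white"
    print(f"point ({x1}, {x2}) -> output {z} -> {colour} dot")
```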


[Diagram: the eight corners of a unit cube, from (0,0,0) to (1,1,1), plotted on X, Y and Z axes, one corner per row of the table]

X Y Z  Output
0 0 0  1
0 0 1  1
0 1 0  0
0 1 1  0
1 0 0  1
1 0 1  0
1 1 0  1
1 1 1  0

3 inputs means 3 dimensions


Linear Separability in 3-dimensions

Instead of a line, the dots are separated by a plane


[Diagram: the AND function’s points separated by a straight line]

• Functions which can be separated in this way are called linearly separable
• Only linearly separable functions can be represented by a perceptron

Minsky & Papert

[Diagrams: the four corners (0,0), (1,0), (0,1), (1,1) plotted for XOR and for AND. A single straight line separates AND’s 1s from its 0s; no line can do this for XOR.]

XOR:
X1 X2  Z
 1  1  0
 1  0  1
 0  1  1
 0  0  0

AND:
X1 X2  Z
 1  1  1
 1  0  0
 0  1  0
 0  0  0

Examples – Handout 3

Linear Separability: fill in the diagrams with the correct dots, black or white, for an output of 1 or 0


How to Train your Perceptron


Simple Networks

AND

[Diagram 1: inputs X and Y, each with weight 1, threshold θ = 1.5]

X Y  Z
1 1  1
1 0  0
0 1  0
0 0  0

[Diagram 2: inputs X and Y, each with weight 1, plus a constant input of -1 with weight 1.5, threshold θ = 0]

Both of these represent the AND function.

It is sometimes convenient to set the threshold to zero, and add a constant negative input
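As a quick check (my code, not the slides’), the two networks above agree on all four inputs:

```python
# Step activation: fire (output 1) when the weighted sum reaches the threshold.
def step(total, theta):
    return 1 if total >= theta else 0

def and_with_threshold(x, y):
    return step(1 * x + 1 * y, theta=1.5)             # weights 1 and 1, theta = 1.5

def and_with_bias_input(x, y):
    return step(1 * x + 1 * y + (-1) * 1.5, theta=0)  # constant -1 input, weight 1.5

for x, y in [(1, 1), (1, 0), (0, 1), (0, 0)]:
    assert and_with_threshold(x, y) == and_with_bias_input(x, y)
    print(x, y, "->", and_with_threshold(x, y))
```

The two are equivalent because x + y >= 1.5 holds exactly when x + y - 1.5 >= 0.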


Training a NN

[Diagram: the AND function’s points on the (0,0)–(1,1) square]

AND:
X1 X2  Z
 1  1  1
 1  0  0
 0  1  0
 0  0  0

Randomly Initialise the Network

We set the weights randomly, because we do not know what we want it to learn.

The weights can change to whatever value is necessary

It is normal to initialise them in the range [-1,1]

Randomly Initialise the Network

[Diagram: input X with weight 0.5, input Y with weight -0.4, constant input -1 with weight 0.3, threshold θ = 0]
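In code, this initialisation might look like the following (a sketch; the slide’s values 0.5, -0.4 and 0.3 are one possible draw):

```python
import random

# Randomly initialise the three weights (w_x, w_y, and the weight on the
# constant -1 input) in the range [-1, 1].
weights = [random.uniform(-1.0, 1.0) for _ in range(3)]
print(weights)   # e.g. [0.5, -0.4, 0.3] as in the slides
```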


Learning

While epoch produces an error
    Present network with next inputs (pattern) from epoch
    Err = T – O
    If Err <> 0 then
        Wj = Wj + LR * Ij * Err
    End If
End While

Get used to this notation!! Make sure that you can reproduce this pseudocode AND understand what all of the terms mean.
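For reference, here is the same rule as a runnable sketch in Python (my transcription, assuming the step activation with threshold 0 and the AND training set shown below):

```python
LR = 0.1                                  # learning rate
weights = [0.5, -0.4, 0.3]                # w_x, w_y, weight on the constant -1 input
# The epoch: four (inputs, target) pairs; the third input is the constant -1.
epoch = [((1, 1, -1), 1), ((1, 0, -1), 0), ((0, 1, -1), 0), ((0, 0, -1), 0)]

def output(inputs):
    weighted_sum = sum(w * i for w, i in zip(weights, inputs))
    return 1 if weighted_sum >= 0 else 0  # threshold theta = 0

error_in_epoch = True
while error_in_epoch:                     # While epoch produces an error
    error_in_epoch = False
    for inputs, target in epoch:          # Present network with next inputs
        err = target - output(inputs)     # Err = T - O
        if err != 0:                      # If Err <> 0 then
            error_in_epoch = True
            for j in range(len(weights)):
                weights[j] += LR * inputs[j] * err   # Wj = Wj + LR * Ij * Err

print("Trained weights:", weights)
```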

Epoch

The ‘epoch’ is the entire training set

The training set is the set of four input and output pairs

INPUT (X Y)   DESIRED OUTPUT (Z)
1 1           1
1 0           0
0 1           0
0 0           0

The learning algorithm

INPUT (X Y)   DESIRED OUTPUT (Z)
1 1           1
1 0           0
0 1           0
0 0           0

Input the first inputs from the training set into the Neural Network. What does the neural network output? Is it what we want it to output? If not, then we work out the error and change some weights.

First training step

Input 1, 1. Desired output is 1. Actual output is 0.

X Y  Z
1 1  1
1 0  0
0 1  0
0 0  0

[Diagram: inputs 1 and 1 with weights 0.5 and -0.4, constant input -1 with weight 0.3, threshold θ = 0]

(1 × 0.5) + (1 × -0.4) + (-1 × 0.3) = 0.5 - 0.4 - 0.3 = -0.2
-0.2 is below the threshold of 0, so the output is 0.
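The same forward pass in code (a sketch):

```python
x, y, bias = 1, 1, -1
weights = (0.5, -0.4, 0.3)
weighted_sum = weights[0] * x + weights[1] * y + weights[2] * bias
print(weighted_sum)                    # -0.2 (up to floating-point rounding)
print(1 if weighted_sum >= 0 else 0)   # 0: below the threshold of 0
```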

First training step

We wanted 1. We got 0. Error = 1 – 0 = 1.

X Y  Z
1 1  1
1 0  0
0 1  0
0 0  0

While epoch produces an error
    Present network with next inputs (pattern) from epoch
    Err = T – O
    If Err <> 0 then
        Wj = Wj + LR * Ij * Err
    End If
End While

If there IS an error, then we change ALL the weights in the network

If there is an error, change ALL the weights

Wj = Wj + ( LR * Ij * Err )

New Weight = Old Weight + (Learning Rate * Input Value * Error)

For the weight on the constant -1 input:
New Weight = 0.3 + (0.1 * -1 * 1) = 0.2

[Diagram: the weight on the constant -1 input changes from 0.3 to 0.2]

If there is an error, change ALL the weights

Wj = Wj + ( LR * Ij * Err )

For the weight on input X (value 1):
New Weight = 0.5 + (0.1 * 1 * 1) = 0.6

[Diagram: the weight on X changes from 0.5 to 0.6; the Y weight is still -0.4, and the weight on the constant -1 input is now 0.2]
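All three weight updates for this step in one place (my sketch; the update for the Y weight, -0.4 → -0.3, follows from the same rule):

```python
LR, err = 0.1, 1                       # learning rate and Err = T - O
old_weights = {"x": 0.5, "y": -0.4, "bias": 0.3}
inputs = {"x": 1, "y": 1, "bias": -1}  # the pattern (1, 1) plus the constant -1

new_weights = {name: old_weights[name] + LR * inputs[name] * err
               for name in old_weights}
print(new_weights)   # approximately {'x': 0.6, 'y': -0.3, 'bias': 0.2}
```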

Effects of the first change

The output was too low (it was 0, but we wanted 1). Every change pushes the weighted sum upwards: the weights on the positive inputs have increased (0.5 → 0.6 and -0.4 → -0.3), and the weight on the constant -1 input has decreased (0.3 → 0.2). The network is trying to ‘correct’ the output gradually.

[Diagram: the network before (weights 0.5, -0.4, 0.3) and after (weights 0.6, -0.3, 0.2) the first training step, threshold θ = 0]

Epoch not finished yet

The ‘epoch’ is the entire training set

We do the same for the other 3 input-output pairs

INPUT (X Y)   DESIRED OUTPUT (Z)
1 1           1
1 0           0
0 1           0
0 0           0

The epoch is now finished

Was there an error for any of the inputs?

If yes, then the network is not trained yet

We do the same for another epoch, from the first inputs again

The epoch is now finished

If there were no errors, then we have the network that we want

It has been trained.

While epoch produces an error
    Present network with next inputs (pattern) from epoch
    Err = T – O
    If Err <> 0 then
        Wj = Wj + LR * Ij * Err
    End If
End While

Effect of the learning rate

Set too high:
The network quickly gets near to what you want.
But, right at the end, it may ‘bounce around’ the correct weights.
It may go too far one way, and then when it tries to compensate it will go too far back the other way.

Wj = Wj + ( LR * Ij * Err )

Effect of the learning rate

Set too high: it may ‘bounce around’ the correct weights.

[Diagram: the AND points on the (0,0)–(1,1) square, with the separating line landing either side of its correct position]

Effect of the learning rate

Set too low:
The network slowly gets near to what you want.
It will eventually converge (for a linearly separable function), but that could take a long time.
When setting the learning rate, you have to strike a balance between speed and effectiveness.

Wj = Wj + LR * Ij * Err
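One way to see the trade-off empirically (a sketch under the same assumptions as the training loop above; exact epoch counts depend on the starting weights):

```python
# Count the epochs needed to train AND for several learning rates,
# assuming the step activation with threshold 0 and a constant -1 input.
def epochs_to_converge(lr, max_epochs=10000):
    w = [0.5, -0.4, 0.3]
    data = [((1, 1, -1), 1), ((1, 0, -1), 0), ((0, 1, -1), 0), ((0, 0, -1), 0)]
    for epoch in range(1, max_epochs + 1):
        had_error = False
        for inputs, target in data:
            out = 1 if sum(wj * ij for wj, ij in zip(w, inputs)) >= 0 else 0
            err = target - out
            if err != 0:
                had_error = True
                w = [wj + lr * ij * err for wj, ij in zip(w, inputs)]
        if not had_error:
            return epoch
    return None   # did not converge within max_epochs

for lr in (0.01, 0.1, 0.5):
    print(lr, epochs_to_converge(lr))   # smaller rates typically need more epochs here
```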

The Neuron’s Activation Function


Expanding the Model of the Neuron: Outputs other than ‘1’

[Diagram, the example from last lecture: inputs X1, X2, X3 feed neurons Y1 and Y2, which feed an output neuron Z; each neuron has its own weights and threshold θ]

Output is 1 or 0. It doesn’t matter how far over the threshold we are.


[Diagram: a neural network controlling a robot’s left wheel speed and right wheel speed]

The speed of the wheels is not just 0 or 1.


Expanding the Model of the Neuron: Outputs other than ‘1’

So far, the neurons have only output a value of 1 when they fire.

If the input sum is greater than the threshold the neuron outputs 1.

In fact, the neurons can output any value that you want.


Modelling a Neuron

aj : Input value (output from unit j)
wj,i : Weight on the link from unit j to unit i
ini : Weighted sum of inputs to unit i
ai : Activation value of unit i
g : Activation function

ini = Σj wj,i × aj
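As a sketch in code, with names mirroring the symbols above:

```python
# a_i = g(in_i), where in_i is the weighted sum of the incoming activations.
def unit_activation(a, w, g):
    in_i = sum(w_ji * a_j for w_ji, a_j in zip(w, a))   # in_i = sum_j w_ji * a_j
    return g(in_i)                                      # a_i = g(in_i)
```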


Activation Functions

Stept(x) = 1 if x >= t, else 0
Sign(x) = +1 if x >= 0, else -1
Sigmoid(x) = 1 / (1 + e^(-x))

aj : Input value (output from unit j)
ini : Weighted sum of inputs to unit i
ai : Activation value of unit i
g : Activation function
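These definitions transcribe directly into code (a sketch):

```python
import math

def step(x, t=0.0):
    return 1 if x >= t else 0          # Step_t(x)

def sign(x):
    return 1 if x >= 0 else -1         # Sign(x)

def sigmoid(x):
    return 1 / (1 + math.exp(-x))      # Sigmoid(x) = 1 / (1 + e^(-x))

# Unlike step and sign, sigmoid gives a smooth output between 0 and 1,
# so a neuron can output values other than just 0 and 1.
print(step(2, 1.5), sign(-0.2), round(sigmoid(0.5), 3))   # 1 -1 0.622
```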


Summary

Linear Separability
Learning Algorithm Pseudocode
Activation functions (threshold, sigmoid, etc.)