Unsupervised and Representation Learning
Davide Bacciu
Dipartimento di Informatica, Università di Pisa, bacciu@di.unipi.it
Applied Brain Science - Computational Neuroscience (CNS)
Part I
Module Information
Objectives
Train computational neuroscience and machine learning specialists capable of:
- understanding application challenges and choosing the right neural-network-based solutions
- designing computational models from biological memory mechanisms
- developing advanced applications using ML solutions
Expected Outcome
Students completing the module are expected to:
- Gain in-depth knowledge of advanced hierarchical neural network architectures
- Learn state-of-the-art unsupervised and representation learning algorithms
- Understand their theory and applications
- Be able to individually read, understand and discuss research works in the field
The course is targeted at:
- Students specializing in:
  - Machine learning and computational intelligence
  - Data mining, data science and information retrieval
  - Robotics, bionics, bioengineering
- Students seeking machine learning theses
Topics
- Synaptic plasticity, memory and learning: associative learning, competitive learning and inhibition
- Associative memory models: Hopfield networks, Boltzmann Machines, Adaptive Resonance Theory
- Representation learning and hierarchical models: biological inspiration (sparse coding, pooling and information processing in the visual cortex); HMAX, CNN, Deep Learning
Schedule
Lectures:
1. Unsupervised and representation learning (3h)
2. Associative Memories I - Hopfield networks (2h)
3. Associative Memories II - Boltzmann Machines (2h)
4. Adaptive Resonance Theory (1h)
5. Representation and Deep learning (2h)

Laboratory activity:
1. Hands-on Lab I (3h)
2. Hands-on Lab II (3h)
3. Hands-on Lab III (3h)
One lesson needs to be accommodated outside the regular calendar: Thursday 20/04, h. 10.30-13.30
Homepage
Reference webpage on Didawiki:
http://didawiki.di.unipi.it/doku.php/bionics-engineering/computational-neuroscience/start
Here you can find:
- Course information
- Lecture slides
- Articles and course materials
You can subscribe to get RSS feeds on page updates
Reference Books
A classical reference book for Computational Neuroscience courses:
P. Dayan and L.F. Abbott, Theoretical Neuroscience, The MIT Press (2001)

An alternative book covering similar topics, freely available online:
W. Gerstner, W.M. Kistler, R. Naud and L. Paninski, Neuronal Dynamics: From Single Neurons to Networks and Models of Cognition, Cambridge University Press (2014)
Part II
Unsupervised and Representation Learning
The Big Picture
- Learning to encode complex/noisy input information in the activations of a neural network (representation learning)
- This requires a mechanism to reconfigure the synaptic response (plasticity)
- A computational approach through bio-inspiration
Reference Model
- A simple computational neuron abstraction: synaptic inputs $u_j$ are integrated to determine the activation $v_i$
- Synaptic weights $w_{ij}$ and activation function $F$
- A neural network represents connected assemblies of computational neurons
- Distributed representation of stimuli
Learning in the Brain
Learning introduces relatively permanent changes in neuron behaviour as a result of experience with stimuli.

Many different learning flavours:
- Perceptual
- Stimulus-response
- Motor
- Relational

Many adaptation mechanisms:
- Habituation
- Sensitization
- Priming
- Conditioning

A common underlying aspect ⇒ synaptic plasticity
Synaptic Plasticity
- Synapses are characterized by their weight $w_{ij}$, which determines the response of postsynaptic neuron $i$ to an action potential from presynaptic neuron $j$
- Assumed fixed so far
- Electrophysiological experiments show that the response (amplitude) is not fixed but can change over time
- Changes of the synaptic strength are called synaptic plasticity
Models of Synaptic Plasticity
Synaptic plasticity depends on a variety of factors:
- Different causes: co-activation, repetition, ...
- Different effects: enhancement, depression
- Different timescales: short-term, long-term, ...

Hebbian: plasticity depends on both presynaptic and postsynaptic activity. How do we define synaptic activity?
- Firing rates
- Spikes (action potentials)
Learning Paradigms
How is synaptic plasticity used as part of a training process to change the neural response?

- Unsupervised learning: the network responds to a series of inputs during training solely on the basis of intrinsic connections and dynamics (principal component analysis, density estimation, representation learning)
- Supervised learning: a desired set of input-output relationships is imposed on the network by a teacher (regression, classification, imitation learning)
- Reinforcement learning: the network response is adjusted through a reward/punishment signal assessing performance on the task
Hebbian Learning
The Organization of Behavior, 1949 (Donald Hebb)
When an axon of cell A is near enough to excite cell B and repeatedly or persistently takes part in firing it, some growth process or metabolic change takes place in one or both cells such that A's efficiency, as one of the cells firing B, is increased.
Neurons that fire together, wire together.
Self-organization of neuron assemblies.
Hebb, Pavlov and his Dog
What happens now every time a bell is rung?
Long Term Potentiation (LTP)
An enhancement of the synaptic response whose effects are long-lasting (e.g. at least 10 minutes).
Firing Rate Neuron (Refresher)
- Relax the constraint of positive firing rates
- Model the steady-state output firing rate as
  $v_\infty = F(\mathbf{w} \cdot \mathbf{u})$
- With a time-dependent input current, the firing-rate dynamics is
  $\tau_r \frac{dv}{dt} = -v + F(\mathbf{w} \cdot \mathbf{u})$
- We refer to the simple linear model
  $\tau_r \frac{dv}{dt} = -v + \sum_{j=1}^{N_u} w_j u_j$
  whose steady state is $v = \mathbf{w} \cdot \mathbf{u}$.
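As a minimal illustration (my own sketch, not from the lecture; the function name and parameter values are illustrative), the linear firing-rate dynamics can be integrated with a forward Euler scheme and checked against the analytical steady state:

```python
import numpy as np

def simulate_firing_rate(w, u, tau_r=10.0, dt=0.1, steps=2000):
    """Forward-Euler integration of tau_r dv/dt = -v + w . u (linear model)."""
    v = 0.0
    for _ in range(steps):
        v += (dt / tau_r) * (-v + w @ u)  # relax toward the steady state
    return v

w = np.array([0.5, -0.2, 0.8])
u = np.array([1.0, 0.5, 0.3])
print(simulate_firing_rate(w, u))  # approaches 0.64
print(w @ u)                       # steady state v = w . u = 0.64
```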
Hebb Rule
The basic Hebb rule is
$\tau_w \frac{d\mathbf{w}}{dt} = v\mathbf{u}$
where $\tau_w$ is a time constant controlling the learning rate. In general, an averaged version is preferred:
$\tau_w \frac{d\mathbf{w}}{dt} = \langle v\mathbf{u} \rangle$
where $\langle \cdot \rangle$ averages over input patterns. Inserting the linear firing-rate model yields the correlation rule
$\tau_w \frac{d\mathbf{w}}{dt} = Q\mathbf{w}$, with $Q = \langle \mathbf{u}\mathbf{u}^T \rangle$ the input correlation matrix.
Implementing Hebbian Learning
Consider:
- an N × M data matrix U (one pattern per row)
- an M-dimensional synaptic weight vector w

Hebb Learning Algorithm
Randomly initialize w in [0, 1]; set ne = 0
do
  1. ne = ne + 1
  2. w_old = w
  3. Shuffle input data U
  4. for i = 1 to N do
     1. v = w · U(i,·)^T
     2. w = w + η · v · U(i,·)
while ‖w − w_old‖ > ε and ne < maxEp
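A runnable NumPy rendering of the algorithm above (a sketch; the learning rate, tolerance and epoch cap are illustrative defaults):

```python
import numpy as np

def hebb_learning(U, eta=0.01, eps=1e-6, max_ep=100):
    """Basic Hebbian learning on an N x M data matrix U (one pattern per row)."""
    N, M = U.shape
    rng = np.random.default_rng()
    w = rng.random(M)                  # randomly initialize w in [0, 1]
    for ne in range(max_ep):
        w_old = w.copy()
        U = rng.permutation(U)         # shuffle the input patterns (rows)
        for i in range(N):
            v = w @ U[i]               # postsynaptic activity v = w . u
            w = w + eta * v * U[i]     # Hebbian update: dw proportional to v u
        if np.linalg.norm(w - w_old) <= eps:
            break                      # weights stabilized
    return w
```

Note that, as discussed below, the basic rule lets the weight norm grow without bound, so in practice the stopping tolerance is rarely met before the epoch cap.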
Hebb Rule and Postsynaptic Depression
Basic Hebbian learning does not account for Long Term Depression (LTD): synaptic strength depresses when presynaptic activity is paired with low postsynaptic activation.

$\tau_w \frac{d\mathbf{w}}{dt} = (v - \theta_v)\mathbf{u}$ (postsynaptic)
$\tau_w \frac{d\mathbf{w}}{dt} = v(\mathbf{u} - \theta_u)$ (presynaptic)

where the thresholds $\theta_v$, $\theta_u$ switch LTD to LTP, e.g. $\theta_v = \langle v \rangle$. When averaged, both are equivalent to the covariance rule
$\tau_w \frac{d\mathbf{w}}{dt} = C\mathbf{w}$, with $C = \langle (\mathbf{u} - \langle\mathbf{u}\rangle)(\mathbf{u} - \langle\mathbf{u}\rangle)^T \rangle$
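As a quick numerical illustration (my sketch; the toy data are arbitrary), the averaged covariance-rule update direction can be computed directly from the data:

```python
import numpy as np

rng = np.random.default_rng(0)
U = rng.standard_normal((500, 3)) * np.array([3.0, 1.0, 0.3])  # toy patterns
C = np.cov(U, rowvar=False)  # C = <(u - <u>)(u - <u>)^T>
w = rng.random(3)
dw = C @ w                   # averaged update direction: tau_w dw/dt = C w
```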
Basic Hebbian Learning is Unstable
- Positive feedback loop where activity and weights mutually reinforce
- Learning is unstable: uncontrolled weight growth. Solution: weight saturation constraints
- Synapses are updated independently: poor selectivity to different inputs. Solution: synaptic competition
Stabilization Strategies
(1) Synaptic normalization
(2) Nonlinear learning rules

Both strategies prevent unbounded weight growth and introduce synaptic competition.
Synaptic normalization
Key idea:
- Add terms to the Hebb rule that depend explicitly on the weights
- Assume a neuron can support only a fixed total amount of synaptic weight

Question: what machine learning practice does this recall?

The normalization constraint can be imposed:
- Rigidly: at every time step
- Dynamically: satisfied only asymptotically, at the end of training

Different normalization constraints and strategies can lead to (consistently) different training outcomes.
Oja Rule (a.k.a. Multiplicative Normalization)
A sum-of-squares normalization term:
$\tau_w \frac{d\mathbf{w}}{dt} = v\mathbf{u} - \alpha v^2 \mathbf{w}$, with $\alpha > 0$

Stability follows because the squared norm of the weight vector $\|\mathbf{w}\|_2^2$ relaxes over time to $1/\alpha$, since
$\tau_w \frac{d\|\mathbf{w}\|_2^2}{dt} = 2v^2(1 - \alpha\|\mathbf{w}\|_2^2)$

Note that it also introduces synaptic competition (why?).
The Oja rule is local and dynamic, but not biologically plausible.
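A discrete-time sketch of the Oja update (my own illustration; α = 1, the learning rate and the epoch count are illustrative choices):

```python
import numpy as np

def oja_learning(U, eta=0.005, alpha=1.0, epochs=20):
    """Oja rule: dw = eta (v u - alpha v^2 w); ||w||^2 relaxes to 1/alpha."""
    rng = np.random.default_rng(0)
    w = rng.random(U.shape[1])
    for _ in range(epochs):
        for u in rng.permutation(U):
            v = w @ u                              # postsynaptic activity
            w += eta * (v * u - alpha * v**2 * w)  # Hebb term plus decay
    return w

# Example usage on toy data
w = oja_learning(np.random.default_rng(1).standard_normal((1000, 3)))
```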
The BCM Rule (Bienenstock, Cooper and Munro, 1982)
A more biologically plausible synaptic update:
$\tau_w \frac{d\mathbf{w}}{dt} = v\mathbf{u}(v - \theta_v)$

A nonlinear learning rule introducing a sliding threshold:
$\tau_\theta \frac{d\theta_v}{dt} = v^2 - \theta_v$

No unbounded weight growth; competition among stimuli.
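A single discrete-time BCM step might look as follows (my sketch; the two learning rates are illustrative):

```python
import numpy as np

def bcm_step(w, u, theta, eta=0.01, eta_theta=0.05):
    """One BCM update: LTD when v < theta, LTP when v > theta."""
    v = w @ u
    w = w + eta * v * u * (v - theta)           # Hebbian term gated by threshold
    theta = theta + eta_theta * (v**2 - theta)  # sliding threshold tracks v^2
    return w, theta
```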
Unsupervised Learning
- A computational view of the effects of Hebbian synaptic plasticity in artificial neural networks
- Focus on training without a teacher signal
- Learning to encode stimuli; cortical maps
- Assess the effect of:
  - Synaptic competition
  - Neuron competition
  - Inhibition
What does the Oja Rule Learn?
For a linear neuron, the Oja rule reads
$\frac{d\mathbf{w}}{dt} = \nu(v\mathbf{u} - v^2\mathbf{w}) = \nu(\mathbf{u}\mathbf{u}^T\mathbf{w} - (\mathbf{w}^T\mathbf{u}\mathbf{u}^T\mathbf{w})\mathbf{w})$

The expected value of the update (averaged over inputs) is
$\frac{d\langle\mathbf{w}\rangle}{dt} = \nu\langle\mathbf{u}\mathbf{u}^T\mathbf{w} - (\mathbf{w}^T\mathbf{u}\mathbf{u}^T\mathbf{w})\mathbf{w}\rangle = \nu(Q\mathbf{w} - (\mathbf{w}^T Q\mathbf{w})\mathbf{w})$

At convergence
$\frac{d\langle\mathbf{w}\rangle}{dt} = 0 = \nu(Q\mathbf{w} - (\mathbf{w}^T Q\mathbf{w})\mathbf{w}) \Leftrightarrow Q\mathbf{w} = (\mathbf{w}^T Q\mathbf{w})\mathbf{w} = \lambda\mathbf{w}$

This is an eigenvalue problem ⇒ the eigenvectors of Q are potential solutions.
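This can be verified numerically: a weight vector trained with the Oja update aligns with the principal eigenvector of Q (my sketch with toy data; all names and constants are mine):

```python
import numpy as np

rng = np.random.default_rng(0)
U = rng.standard_normal((2000, 3)) * np.array([3.0, 1.0, 0.3])  # toy inputs
Q = (U.T @ U) / len(U)                   # correlation matrix Q = <u u^T>

w = rng.random(3)
for u in U:                              # one pass of the Oja update (alpha = 1)
    v = w @ u
    w += 0.005 * (v * u - v**2 * w)

principal = np.linalg.eigh(Q)[1][:, -1]  # eigenvector of the largest eigenvalue
print(abs(w @ principal))                # close to 1: w aligns with it (up to sign)
```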
Hebb, Oja and the PCA
- The fixed point of the Oja rule (and also of the Hebb rule) is the principal eigenvector of Q
- Hebbian learning rotates the weight vector to align it with the principal eigenvector of the input correlation/covariance matrix
- The weight vector in the Hebb rule has unbounded norm, while the Oja rule learns a normalized weight vector
Competitive Learning: Synaptic Competition

- Synaptic competition favours selectivity
- A Hebbian rule with constraints preventing unbounded growth explains orientation selectivity in the primary visual cortex
Competitive Learning: Principal Component Analysis

- Hebbian learning orients the weight vector so that the neuron responds to information in the principal component of the data covariance/correlation
- Can we identify other principal components? How?
- Introduce competition between neurons: different neurons encode different stimuli
- Selectivity and diversity
Competitive Learning with Multiple Neurons
Recurrent connections drive neuron differentiation. The output activity is
$\tau_v \frac{d\mathbf{v}}{dt} = -\mathbf{v} + W\mathbf{u} + M\mathbf{v}$

with a stable fixed point and steady-state output (for $\rho(M) < 1$)
$\mathbf{v} = W\mathbf{u} + M\mathbf{v} \Rightarrow \mathbf{v} = KW\mathbf{u}$, with $K = (I - M)^{-1}$
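Numerically, the steady state can be obtained by inverting I − M (my sketch; W and M are random illustrative matrices chosen so that ρ(M) < 1):

```python
import numpy as np

rng = np.random.default_rng(1)
W = rng.random((4, 3))               # feedforward weights (4 neurons, 3 inputs)
M = 0.2 * rng.random((4, 4))         # recurrent weights, spectral radius < 1
np.fill_diagonal(M, 0.0)             # no self-connections
u = rng.random(3)

K = np.linalg.inv(np.eye(4) - M)     # K = (I - M)^(-1)
v = K @ W @ u                        # steady-state output v = K W u
print(np.allclose(v, W @ u + M @ v)) # True: fixed point of v = Wu + Mv
```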
Competitive Hebbian Learning
A two-step process:
1. Long-range competition (feedforward):
$z_i = \frac{\left(\sum_j W_{ij} u_j\right)^\delta}{\sum_k \left(\sum_j W_{kj} u_j\right)^\delta}$
2. Short-range cooperation between neighbours (recurrent):
$v_i = \sum_{k \in Ne(i)} M_{ik} z_k$

Purely linear units produce little differentiation among neurons, hence the added nonlinearity $\delta$.
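A compact sketch of the two steps (my illustration; rectifying the feedforward drive before exponentiation is an assumption, and the neighbourhood structure Ne(i) is assumed to be encoded by zeros in M):

```python
import numpy as np

def competitive_response(W, M, u, delta=4.0):
    """Two-step competitive response: nonlinear normalization, then cooperation."""
    a = np.maximum(W @ u, 0.0)        # feedforward drive, rectified (assumption)
    z = a**delta / np.sum(a**delta)   # long-range competition: delta > 1 sharpens
    v = M @ z                         # short-range cooperation (M zero outside Ne(i))
    return v
```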
Modeling Causality in Learning
Let’s review Hebb’s hypothesis
When an axon of cell A is near enough to excite cell B and repeatedly or persistently takes part in firing it, some growth process or metabolic change takes place in one or both cells such that A's efficiency, as one of the cells firing B, is increased.

- The interpretation of "takes part in" is the key
- The relative timing between pre-synaptic and post-synaptic activity plays a critical role
- This has been proved experimentally, and is also at the root of Hebb's original interpretation
- Spike-timing-dependent plasticity (STDP)
- Here we use an approximate firing-rate model (in place of a spiking neuron model)
STDP Reprise
- LTP occurs when a post-synaptic spike follows a pre-synaptic spike within 50 ms
- LTD occurs when a pre-synaptic spike follows a post-synaptic action potential within 50 ms

$H(\tau)$ ⇒ rate of synaptic modification when post-synaptic activity is separated from pre-synaptic activity by $\tau$ ms

$\tau_w \frac{dw}{dt} = \int_0^\infty \left( \underbrace{H(\tau)\,v(t)\,u(t-\tau)}_{LTP} + \underbrace{H(-\tau)\,v(t-\tau)\,u(t)}_{LTD} \right) d\tau$
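A discretized sketch of this integral (my own illustration; the exponential window shape, the amplitudes and the time constant are assumptions, not from the lecture):

```python
import numpy as np

def H(tau, A_plus=1.0, A_minus=-1.0, tau_H=20.0):
    """Modification rate: LTP for tau >= 0, LTD for tau < 0 (tau in ms)."""
    return np.where(tau >= 0, A_plus, A_minus) * np.exp(-np.abs(tau) / tau_H)

def weight_change(v, u, t, dt=1.0, T=100):
    """Approximate tau_w dw/dt at time bin t from rate histories v and u.
    Assumes t >= T so that all delayed samples exist."""
    taus = np.arange(1, T)                    # tau = dt, 2dt, ... (in bins)
    ltp = H(taus * dt) * v[t] * u[t - taus]   # post now, pre tau earlier
    ltd = H(-taus * dt) * v[t - taus] * u[t]  # pre now, post tau earlier
    return (np.sum(ltp) + np.sum(ltd)) * dt
```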
Take Home Messages
- Synaptic plasticity is the mechanism underlying all learning schemes
- Hebbian learning:
  - Promotes synapses between co-activated neurons (LTP)
  - Depresses synapses responsible for asynchronous activations (LTD)
  - Leads to principal component analysis, self-organizing maps, visual filters, ...
- Competition is essential to ensure:
  - Selectivity and stability at the synaptic level
  - Diversity between neurons
- Hebbian time-dependent plasticity allows learning sequential patterns
Things We Haven’t Seen
- Anti-Hebbian learning: reducing synaptic strength as a result of co-activation
- Timing-based plasticity and the spiking model
- Supervised Hebbian learning: the perceptron, the delta rule
Next Lecture
- Associative memories: red hammers and priming; learning and recalling associations between stimuli/concepts
- Hopfield networks: an associative memory, a recurrent neural network, an energy-based model