CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural...

32

1 CS 4700: Foundations of Artificial Intelligence Prof. Bart Selman [email protected] Machine Learning: Neural Networks R&N 18.7 Backpropagation

Upload
others
Category

Documents
view
3
download
0

Embed Size (px):

Transcript of CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural...

Page 1: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

1

CS 4700:Foundations of Artificial Intelligence

Prof. Bart [email protected]

Machine Learning:Neural Networks

R&N 18.7Backpropagation

Page 2: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

2

Page 3: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

3

Page 4: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

4

Page 5: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

5

Page 6: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

6

Page 7: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

7

Page 8: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

8

Ok… details backpropagation NOT on exam.

But do work through gradient descent example on homework.

Page 9: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

9

Page 10: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

10

Page 11: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

11

Page 12: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

12

Page 13: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

13

It’s “just” multivariate (or multivariable) calculus.

Page 14: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

14

Optimizationandgradient descent.

Considerx and y ourtwo weightsand f(x,y)the errorsignal.

Multi-layernetwork:compositionof sigmoids;use chain rule.

Note:For descentwe want togo in oppositedirection ofgradient.

Page 15: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

15

Reminder:Gradient descent optimization.(Wikipedia)

Page 16: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

16

Page 17: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

17

Page 18: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

18

Page 19: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

19

So, first we “backpropagate” the errorsignal to get the gradient, and then we update the weights, layer by layer.

That’s why it’s called “backprop learning.”Now changing speech recognition, imagerecognition etc.!

Page 20: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

20

Page 21: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

21

Trace example!

Page 22: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

22

Page 23: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

23

Page 24: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

24

Page 25: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

25

Page 26: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

26

Page 27: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

27

Page 28: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

28

Page 29: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

29

Etc.

Tedious but“just” gradientdescent in theweight space,after all.

Page 30: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

30

Bla bla

Wall Street Journal, 11/24/16

Good news…

Page 31: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

31

Summary J

A tour of AI:I) AI

--- motivation--- overview of the field

II) Problem-Solving–---- problem formulation using state space– (initial state, operators, goal state(s))–--- search techniques--- adversarial search

White white

Page 32: CS 4700: Foundations of Artificial Intelligence...--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent) The field has grown (and continues to grow)

32

Summary, cont. III) Knowledge Representation and Reasoning

--- logical agents--- Boolean satisfiability (SAT) solvers--- first-order logic and inference

V) Learning–--- decision tree learning (info gain)--- reinforcement learning (intro)--- neural networks / deep learning (gradient descent)

The field has grown (and continues to grow) exponentially but youhave now seen a good part!

Thanks & Have a great winter break!!!!

PathNet: Evolution Channels Gradient Descent in Super ... · PathNet: Evolution Channels Gradient Descent in Super ... and SVHN supervised learning classi ca- ... recti ed linear

PathNet: Evolution Channels Gradient Descent in Super ... · PathNet: Evolution Channels Gradient Descent in Super ... and SVHN supervised learning classi ca- ... recti ed linear

Learning Functors using Gradient Descent Gavranovic.pdf · Learning Functors using Gradient Descent ... 2 Categorical Deep Learning Modern deep learning optimization algorithms canbeframedasagradient-basedsearchinsome

Learning Functors using Gradient Descent Gavranovic.pdf · Learning Functors using Gradient Descent ... 2 Categorical Deep Learning Modern deep learning optimization algorithms canbeframedasagradient-basedsearchinsome

Learning Stochastic Optimal Policies via Gradient Descent

Learning Stochastic Optimal Policies via Gradient Descent

Descent and Essential Descent Spectrum of Linear Relations · Descent and Essential Descent Spectrum ... ezzeddine.chafai@ipeis.rnu.tn, ... we study the descent spectrum and the essential

Descent and Essential Descent Spectrum of Linear Relations · Descent and Essential Descent Spectrum ... [email protected], ... we study the descent spectrum and the essential

Gradient Descent Learning and Backpropagationpages.cpsc.ucalgary.ca/~jacob/Courses/Fall2003/CPSC533/Slides/0… · Examples: Perceptron learning, backpropagation. ËReinforcement

Gradient Descent Learning and Backpropagationpages.cpsc.ucalgary.ca/~jacob/Courses/Fall2003/CPSC533/Slides/0… · Examples: Perceptron learning, backpropagation. ËReinforcement

Learning to learn by gradient descent by gradient descent · Learning to learn by gradient descent by gradient descent Marcin Andrychowicz 1, Misha Denil , Sergio Gómez Colmenarejo

Learning to learn by gradient descent by gradient descent · Learning to learn by gradient descent by gradient descent Marcin Andrychowicz 1, Misha Denil , Sergio Gómez Colmenarejo

Improving Gradient Descent Learning in Neural Networksguerzhoy/321/lec/W10/sgd.pdf · 2016. 3. 24. · Improving Gradient Descent Learning in Neural Networks CSC321: Intro to Machine

Improving Gradient Descent Learning in Neural Networksguerzhoy/321/lec/W10/sgd.pdf · 2016. 3. 24. · Improving Gradient Descent Learning in Neural Networks CSC321: Intro to Machine

Learning long-term dependencies with gradient descent is difficult

Learning long-term dependencies with gradient descent is difficult

Perspectives on Stochastic Gradient Descent for Machine ...pillaud/Presentations/Cermics/presentation... · Perspectives on Stochastic Gradient Descent for Machine Learning problems

Perspectives on Stochastic Gradient Descent for Machine ...pillaud/Presentations/Cermics/presentation... · Perspectives on Stochastic Gradient Descent for Machine Learning problems

Package ‘gradDescent’ · Description An implementation of various learning algorithms based on Gradient Descent for deal-ing with regression tasks. The variants of gradient descent

Package ‘gradDescent’ · Description An implementation of various learning algorithms based on Gradient Descent for deal-ing with regression tasks. The variants of gradient descent

Stochastic Gradient Descent Tricks › diglib › lsml › bottou-sgd-tricks-2012.pdfTable 1 illustrates stochastic gradient descent algorithms for a number of classic machine learning

Stochastic Gradient Descent Tricks › diglib › lsml › bottou-sgd-tricks-2012.pdfTable 1 illustrates stochastic gradient descent algorithms for a number of classic machine learning

Dynamics of On-Line Gradient Descent Learning for Multilayer … · 2014. 4. 14. · Dynamics of On-Line Gradient Descent Learning for Multilayer Neural Networks David Saad" Dept.

Dynamics of On-Line Gradient Descent Learning for Multilayer … · 2014. 4. 14. · Dynamics of On-Line Gradient Descent Learning for Multilayer Neural Networks David Saad" Dept.

Intro Logistic Regression Gradient Descent + SGDIntro Logistic Regression Gradient Descent + SGD Machine Learning for Big Data CSE547/STAT548, University of Washington Sham Kakade

Intro Logistic Regression Gradient Descent + SGDIntro Logistic Regression Gradient Descent + SGD Machine Learning for Big Data CSE547/STAT548, University of Washington Sham Kakade

DESCENT PREPARATION MANAGED DESCENT T/D · PDF filepf descent phase pnf descent preparation managed descent iaf t/d 80 nm ~10 minutes a320 -version 05a

DESCENT PREPARATION MANAGED DESCENT T/D · PDF filepf descent phase pnf descent preparation managed descent iaf t/d 80 nm ~10 minutes a320 -version 05a

Probabilistic Line Searches for Stochastic Optimizationthe need to deﬁne a learning rate for stochastic gradient descent. 1 Introduction Stochastic gradient descent (SGD) [1] is

Probabilistic Line Searches for Stochastic Optimizationthe need to deﬁne a learning rate for stochastic gradient descent. 1 Introduction Stochastic gradient descent (SGD) [1] is

Learning to learn by gradient descent by gradient descent · Learning to learn by gradient descent by gradient descent Liyan Jiang July 18, 2019 1 Introduction The general aim of

Learning to learn by gradient descent by gradient descent · Learning to learn by gradient descent by gradient descent Liyan Jiang July 18, 2019 1 Introduction The general aim of

Using Gradient Descent for Optimization and Learning

Using Gradient Descent for Optimization and Learning

Learning to learn by gradient descent by gradient descentpapers.nips.cc/...to...descent-by-gradient-descent.pdf · Learning to learn by gradient descent by gradient descent Marcin

Learning to learn by gradient descent by gradient descentpapers.nips.cc/...to...descent-by-gradient-descent.pdf · Learning to learn by gradient descent by gradient descent Marcin

Deep Learning I: Gradient Descentgd.pdf · Roadmap Intro,model,cost Gradient descent Gradient descent (GD) Gradient descent is a learning algorithm. Given: a hypothesis space (or

Deep Learning I: Gradient Descentgd.pdf · Roadmap Intro,model,cost Gradient descent Gradient descent (GD) Gradient descent is a learning algorithm. Given: a hypothesis space (or

Learning to Learn by Gradient Descent with Rebalancing€¦ · that, for instance, are capable of learning to learn without gradient descent by gradient descent. It should be expected

Learning to Learn by Gradient Descent with Rebalancing€¦ · that, for instance, are capable of learning to learn without gradient descent by gradient descent. It should be expected

Languages

Pages

Legal

Copyright © 2022 FDOCUMENTS