The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines...

55
The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Transcript of The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines...

Page 1: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

The Deep Learning Revolutionrethinking machine learning pipelines

Soumith ChintalaFacebook AI Research

7th April 2015EmergingTech 2015

Page 2: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Getting your attention

● Changed the landscape of:○ Natural language processing○ Speech recognition○ Computer vision○ Robotics○ Modern statistical physics○ Computational Biology○ Digital assistants (Siri, Cortana, etc.)

Page 3: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Traditional Machine Learning

Page 4: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Hand-Crafted Features

SIFT/HOG

SURF

MFCC

Spectrogram

Page 5: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Traditional ML

● Feature Engineering + [your favorite classifier]● Steps:

○ Find a poor sod to think of good features○ Find another unfortunate chap to extract those

features from your data○ Collect all the features and grind them through:

■ Logistic regressor■ Decision trees, random forests, boosted■ Support Vector Machines

Page 6: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Deep learning

● End-to-end learning○ No feature engineering

● Chained cascade of non-linear transforms● General framework

Page 7: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

How to win at life (5-step process)

● Pick a problem● Get as much data as you can● Expand your dataset more● Train several deep nets● Ensemble● Win!

Page 8: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Deep Neural Networks

Page 9: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

What is a neural network?

x^2 Sin(x)Input Output

Page 10: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

What is a deep neural network?

i*w Sigmoid

Input (x) Output

weights (w)

i*w Sigmoid

weights (w)

i*w Sigmoid

weights (w)

i*w Sigmoid

weights (w)

Page 11: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Neural networks vs computation graphs?

● Neural networks are trained via back-propagation● Every node has f(x) and df(x)/dx

i*w Sigmoid

Input (x) Output

weights (w)

i*w Sigmoid

weights (w)

i*w Sigmoid

weights (w)

i*w Sigmoid

weights (w)

forward

i*w Sigmoid gradient

weights (w)

i*w Sigmoid

weights (w)

i*w Sigmoid

weights (w)

i*w Sigmoid

weights (w)

backward

Input (x)

Page 12: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

One weird trick: Convolutions

Page 13: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Fully-connected layers: issues

Page 14: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Locally connected layers

Page 15: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolutional layers

Page 16: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolutional layers

Page 17: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 18: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 19: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 20: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 21: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 22: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 23: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 24: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 25: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 26: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 27: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 28: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 29: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 30: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 31: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 32: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolution layer

Page 33: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Convolutional neural networks

● on the very small scale every piece of image can be processed the same way

Page 34: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

ImageNet classification results 20121M training images, 1K categories, top-5 error

Human performance ~3-5%

Deep-learning models ~15%

Non-deep learning modelsISI, JapanOxford, EnglandINRIA, FranceUniversity of Amsterdam, etc.

~26%

Page 35: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

ImageNet classification results 20151M training images, 1K categories, top-5 error

Human performance ~3-5%

Deep-learning models ~4.5%

Non-deep learning modelsISI, JapanOxford, EnglandINRIA, FranceUniversity of Amsterdam, etc.

~26%

Page 36: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Segmentation and Stereo

LeCun (2014)

Couprie et. al. (2013)

Farabet et. al. (2012)

Page 37: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

ConvNets + Graphical Model (Tompson et. al. 2014)

Page 38: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Generative Adversarial NetsGoodfellow et. al. (2014)

Page 39: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

ConvNets for Video

DeepVideo (Karpathy et. al)

C3D (Tran et. al.)

Page 40: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

ConvNets for NLP

Collobert et. al.

Zhang et. al.

Page 41: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Weird trick #2: Recurrent Nets

Page 42: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Recurrent Networks

i*w Sigmoid

Input (x) Output

weights (w)

forward

past experience

i*w Sigmoid

Input (x) Output

weights (w)

backward

past experience

???

Page 43: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Acyclic computational graphs

Unfolding in time

Page 44: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

RNN-LSTMs

● Machine Translation● Language Modeling● Learning to execute (Python programs)

Basic LSTM unit (figure from deeplearning.net)

Sutskever et. al. (2014)

Page 45: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Examples

Sequence of character on the input and on the output.

Page 46: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Recurrent Nets for Q&A

Weston et. al. 2014Facebook AI Research

Page 47: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Implementation and Engineering

Page 48: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Implementation: FLOP Eaters

● Convolutions are expensive● Sequential Processing● Matrix multiplies are expensive

Page 49: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Implementation: FLOP Eaters

● Convolutions are expensive● Sequential Processing● Matrix multiplies are expensive

GPUs

Page 50: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Training

● Days to weeks● Engineering feat● Multiple machines, multiple GPUs

Page 51: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Frameworks

● Torch● Caffe● Theano, Keras, Lasagne● Common Features

○ syntax to define graph○ every node in graph has derivative○ train via backpropagation

Page 52: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015
Page 53: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Trends● Deeper nets● Smaller convolutions● RNN + LSTM● Multiple GPUs + Multiple Machines● Neural Machines● Other kinds of memory units● Better weight initialization● Meta-problems

Page 54: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Challenges

● Scaling up for big data (videos, social networks etc.)

● Discrete optimization● Memory that works

Page 55: The Deep Learning Revolution · The Deep Learning Revolution rethinking machine learning pipelines Soumith Chintala Facebook AI Research 7th April 2015 EmergingTech 2015

Questions