Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning...

22
University of Hamburg MIN Faculty Department of Informatics Machine Learning in Robotics Machine Learning in Robotics 64-450 Integrated Seminar Intelligent Robotics Oke Martensen University of Hamburg Faculty of Mathematics, Informatics and Natural Sciences Department of Informatics Technical Aspects of Multimodal Systems 9. June 2016 Oke Martensen 1

Transcript of Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning...

Page 1: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning in Robotics

Machine Learningin Robotics

64-450 Integrated Seminar Intelligent Robotics

Oke Martensen

University of HamburgFaculty of Mathematics, Informatics and Natural SciencesDepartment of Informatics

Technical Aspects of Multimodal Systems

9. June 2016

Oke Martensen 1

Page 2: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning in Robotics

Outline

1. Machine LearningBasicsML in Robotics

2. Deep LearningDL in a NutshellDeep Learning in Robotics

3. Examples for DL in RoboticsEnd-to-End Training of Deep Visuomotor PoliciesHand-Eye Coordinated Grasping with Deep Learning

Oke Martensen 2

Page 3: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Motivation Machine Learning in Robotics

Why Machine Learning?

Optimization is concerned with mathematical problems whichare mathematically well-defined thus have verifiable solutions.

Machine Learning is

I concerned with engineering problems (often not well-defined)

I about building mathematical models

ML algorithms can be seen as being composed of:

1. representation

2. evaluation

3. optimization

Oke Martensen 3

Page 4: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning - Basics Machine Learning in Robotics

Machine LearningTypes

Supervised Learning Unsupervised Learn. Reinforcement Learn.

Sutton and Barto (1998)

learn IO mapping clustering trial and error

regression, SVMs, .. nearest neighbour, .. Q-learning, SARSA, ..

Oke Martensen 4

Page 5: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning - Basics Machine Learning in Robotics

Machine LearningImportant Notions

Generalizability: overfitting vs. underfitting

www.medium.com/@nomadic_mind/new-to-machine-learning-avoid-these-three-mistakes-73258b3848a4

Inductive bias: prior assumptions about the task at hand

No-free-lunch theorem: there is no algorithm superior for all tasks

Oke Martensen 5

Page 6: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning - ML in Robotics Machine Learning in Robotics

Robot LearningProblem

Again, in contrast to optimization (e.g. inverse kinematics).

Levine (2015)

I solve subtasks in advance e.g. via computer vision ⇒ states

I explore, learn a policy through experience (RL)

Oke Martensen 6

Page 7: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning - ML in Robotics Machine Learning in Robotics

Robot LearningChallenges

https://www.youtube.com/watch?v=g0TaYhjpOfo

high-dimensional spaces

scarce real-world data

high variability / noise

high-level targets

many distinct tasks

...

Oke Martensen 7

Page 8: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning - ML in Robotics Machine Learning in Robotics

Machine Learning in RoboticsMore Relevant ML Types

Supervised Learning Reinforcement Learn.

Sutton and Barto (1998)

Combinations:Behavior CloningApprenticeship Learn....

Problems with RL: limited data, dimensiona-lity, few parameters, ...

Ideas:

I initialize learning process with data of a successful execution

I interim policy evaluation by user

Oke Martensen 8

Page 9: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Deep Learning - DL in a Nutshell Machine Learning in Robotics

Deep Learning in a NutshellNothing much new actually..

mostly deep neural networks (> 1 hidden layers)

plethora of old and newer methods for tweaking: dropout,

batch normalization, data augmentation, ..

large improvements in various domains: computer vision, speech

recognition, game-playing, ..

Ng (2016)

Oke Martensen 9

Page 10: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Deep Learning - DL in a Nutshell Machine Learning in Robotics

Deep Learning in a NutshellCharacteristics

learned hierarchy of features

performance scales with data

heavy computations

Ng (2016)

Oke Martensen 10

Page 11: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Deep Learning - DL in a Nutshell Machine Learning in Robotics

Deep Learning IllustrationConvolutional Neural Network (CNN/ConvNet)

Bezak et al. (2014)

Lee et al. (2009)

Oke Martensen 11

Page 12: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Deep Learning - Deep Learning in Robotics Machine Learning in Robotics

Robot LearningStandard Robotic Learning vs. Deep Learning Approach

Subtasks solved with domain-specific approaches:

vs.

Largely domain-agnostic Deep Learning pipeline:

Levine (2015)

Oke Martensen 12

Page 13: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Examples for DL in Robotics - End-to-End Training of Deep Visuomotor Policies Machine Learning in Robotics

End-to-End Training of Deep Visuomotor PoliciesLevine et al. (2015)

I map image pixels & joint angles to motor torquesI guided policy search:

I transforms policy search into SLI alternating between trajectory and policy optimization

I full torque control of7-DoF robotic arms

Levine et al. (2016)

Oke Martensen 13

Page 14: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Examples for DL in Robotics - End-to-End Training of Deep Visuomotor Policies Machine Learning in Robotics

End-to-End Training of Deep Visuomotor PoliciesVisuomotor Policy Architecture

Levine et al. (2015)

I ∼92k parameters, 7 layers

Oke Martensen 14

Page 15: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Examples for DL in Robotics - End-to-End Training of Deep Visuomotor Policies Machine Learning in Robotics

End-to-End Training of Deep Visuomotor PoliciesResults

covered vision leads to estimated manipulation and subsequentcorrection attempts → reliance on visual feedback

smaller changes will be adjusted for; bigger ones cause problems

recovery attempts after perturbations

Finn (2015)

Oke Martensen 15

Page 16: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Examples for DL in Robotics - Hand-Eye Coordinated Grasping with Deep Learning Machine Learning in Robotics

Learning Hand-Eye Coordination for Robotic Grasping withDeep Learning and Large-Scale Data CollectionLevine et al. (2016)

Levine et al. (2016)

use grasp success prediction network

with continuous servoing mechanism

for continuous manipulator control

trained on >800k grasp attemptsfrom 14 distinct robots

Oke Martensen 16

Page 17: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Examples for DL in Robotics - Hand-Eye Coordinated Grasping with Deep Learning Machine Learning in Robotics

Learning Hand-Eye Coordination for Robotic GraspingArchitecture of CNN Grasp Predictor

Levine et al. (2016)

Oke Martensen 17

Page 18: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Examples for DL in Robotics - Hand-Eye Coordinated Grasping with Deep Learning Machine Learning in Robotics

Learning Hand-Eye Coordination for Robotic GraspingResults

Pastor (2016c,a,b)

failure reduction from 34% to 18%

corrections after mistakes

recovery after perturbations/changes

Oke Martensen 18

Page 19: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

The End - Conclusion Machine Learning in Robotics

Conclusion

Problems for DL/ML in robotics:I data sparsityI high dimensionality/variabilityI some generalizability but limited (related to the previous points)

Benefits of end-to-end DL approaches:I more natural movementsI learned from scratchI discovery of unconventional/

non-obvious behaviour (e.g.grasping of soft vs. hard objects)

Levine et al. (2016)

Oke Martensen 19

Page 20: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning in Robotics

Thanks for your attention!

Questions?

Oke Martensen 20

Page 21: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning in Robotics

References

Bezak, P., Nikitin, Y. R., and Bozek, P. (2014). Robotic grasping system usingconvolutional neural networks. American Journal of Mechanical Engineering,2(7):216–218.

Finn, C. (2015). End-to-end training of deep visuomotor policies [video file]. Retrievedfrom https://www.youtube.com/watch?v=Q4bMcUk6pcw#t=172.

Lee, H., Grosse, R., Ranganath, R., and Ng, A. Y. (2009). Convolutional deep beliefnetworks for scalable unsupervised learning of hierarchical representations. InProceedings of the 26th Annual International Conference on Machine Learning,pages 609–616. ACM.

Levine, S. (2015). Deep learning for decision making and control [video file]. Talk atCSE 519 Colloquium. Retrieved fromhttps://www.cs.washington.edu/events/colloquia/search/details?id=2686.

Levine, S., Finn, C., Darrell, T., and Abbeel, P. (2015). End-to-end training of deepvisuomotor policies. arXiv preprint arXiv:1504.00702v5.

Oke Martensen 21

Page 22: Machine Learning in Robotics - uni-hamburg.de · Deep Learning - DL in a Nutshell Machine Learning in Robotics Deep Learning in a Nutshell Nothing much new actually.. mostly deep

University of Hamburg

MIN Faculty

Department of Informatics

Machine Learning in Robotics

References (cont.)

Levine, S., Pastor, P., Krizhevsky, A., and Quillen, D. (2016). Learning hand-eyecoordination for robotic grasping with deep learning and large-scale data collection.arXiv preprint arXiv:1603.02199.

Ng, A. (2016). How scale is enabling deep learning [video file]. Retrieved fromhttps://www.youtube.com/watch?v=LcfLo7YP8O4.

Pastor, P. (2016a). Continuous visual feedback improves grasp success rate [videofile]. Retrieved from https://www.youtube.com/watch?v=H4V6NZLNu-c.

Pastor, P. (2016b). Learning hand-eye coordination for robotic grasping [video file].Retrieved from https://www.youtube.com/watch?v=l8zKZLqkfII#t=13.

Pastor, P. (2016c). One-shot grasping often leads to failed grasp attempts [video file].Retrieved from https://www.youtube.com/watch?v=Q9tDHuidzak.

Sutton, R. S. and Barto, A. G. (1998). Reinforcement learning: An introduction,volume 1. MIT press Cambridge.

Oke Martensen 22