INFUSING PHYSICS + STRUCTURE INTO MACHINE LEARNING...PHYSICS-INFUSED LEARNING FOR ROBOTICS AND...

Anima Anandkumar

INFUSING PHYSICS + STRUCTURE INTO MACHINE LEARNING

TRINITY OF AI/ML

DATACOMPUTE

ALGORITHMS

PHYSICS-INFUSED LEARNING

Data Priors

Learning = Computational Reasoning over Data & Priors

+Learning =

How to use Physics as a (new) form of Prior?• Learnability

• Generalization

What are some of the hardest

challenges for AI?

“ Mens sana in corpore sano.”

Juvenal in Satire X.

ExplorersPlanetary, Underwater, and Space Explorers

TransformersSwarms of Robots Transforming Shapes and Functions

GuardiansDynamic Event Monitors and First Responders

TransportersRobotic Flying Ambulances and Delivery Drones

PartnersRobots Helping and Entertaining People

ROBOTICSMOONSHOTS

MIND & BODYNEXT-GENERATION AI

Fine-grained reactive Control

Instinctive:Making and adapting plans

Deliberative:

Sense and react to human

Behavioral:Acting for the greater good

Multi-Agent:

PHYSICS-INFUSED LEARNING FOR ROBOTICS AND CONTROL

Data Priors

Learning = Computational Reasoning over Data & Priors

+Learning =

How to use Physics as a (new) form of Prior?• Learnability

• Generalization

BASELINE: MODEL-BASED CONTROL (NO LEARNING)

𝑠𝑡+1 = 𝐹 𝑠𝑡 , 𝑢𝑡 + 𝜖

New State

Current State

Current Action (aka control input)

Unmodeled Disturbance / Error

Robust Control (fancy contraction mappings)

• Stability guarantees (e.g., Lyapunov)

• Precision/optimality depends on error

(Value Iteration is also contraction mapping)

LEARNING RESIDUAL DYNAMICS FOR DRONE LANDING

𝑠𝑡+1 = 𝑓 𝑠𝑡 , 𝑎𝑡 + ሚ𝑓 𝑠𝑡 , 𝑎𝑡 + 𝜖

New State

Current State

Current Action (aka control input)

Unmodeled Disturbance

𝑓 = nominal dynamicsሚ𝑓 = learned dynamics

Use existing control methods to generate actions

• Provably robust (even using deep learning)

• Requires ሚ𝑓 Lipschitz & bounded error

CONTROL SYSTEM FORMULATION

• Dynamics:

• Control:

• Unknown forces & moments:

Learn the Residual(function of state and control input)

Learn the Residual

DATA COLLECTION (MANUAL EXPLORATION)

• Learn ground effect:

• (s,u): height, velocity, attitude and four control inputs

෨𝐹 𝑠, 𝑢 →

Spectral-Normalized4-Layer Feed-Forward

Spectral Normalization: Ensures ෩𝑭 is Lipshitz[Bartlett et al., NeurIPS 2017][Miyato et al., ICLR 2018]

Ongoing Research:

Safe Exploration

PREDICTION RESULTS

Height (m)

GENERALIZATION PERFORMANCE ON DRONE

Our Model Baseline

Vertical Velocity

Spectral Regularized NN Conventional NN

Neural Lander: Stable Drone Landing Control using Learned Dynamics, ICRA 2019

Guanya

ShiXichen

ShiMichael

O’Connell

Kamyar

Azizzadenesheli

YuSoon-Jo

Yisong

CONTROLLER DESIGN (SIMPLIFIED)

Nonlinear Feedback Linearization:

Cancel out ground effect ෨𝐹(𝑠, 𝑢𝑜𝑙𝑑):

𝑢𝑛𝑜𝑚𝑖𝑛𝑎𝑙 = 𝐾𝑠𝜂

Feedback Linearization (PD control)

𝜂 =𝑝 − 𝑝∗

𝑣 − 𝑣∗Desired Trajectory(tracking error)

𝑢 = 𝑢𝑛𝑜𝑚𝑖𝑛𝑎𝑙 + 𝑢𝑟𝑒𝑠𝑖𝑑𝑢𝑎𝑙Requires Lipschitz & small time delay

STABILITY GUARANTEES

Assumptions:

Desired states along position trajectory bounded

Control updates faster than state dynamics

Learning error bounded (new): Bounded Lipschitz (through spectral normalization of layers

Stability Guarantee:(simplified)

𝜂(t) ≤ 𝜂(0) exp𝜆 − ෨𝐿𝜌

𝐶𝑡 +

𝜆 − ෨𝐿𝜌

⟹ 𝜂(t) →𝜖

𝜆𝑚𝑖𝑛 𝐾 − ෨𝐿𝜌

Exponentially fast

Unmodeled disturbance

Lipschitz of NN

Time delaycontrol gain

CAST @ CALTECHLEARNING TO LAND

TESTING TRAJECTORY TRACKING

Move around a circle super close to the ground

CAST @ CALTECHDRONE WIND TESTING LAB

TAKEAWAYS

Control methods => analytic guarantees

Blend w/ learning => improve precision/flexibility

Preserve side guarantees

Sometimes interpret as functional regularization

(side guarantees)

(possibly relaxed)

(speeds up learning)

Blending Data Driven Learning

with Symbolic Reasoning

AGE-OLD DEBATE IN AI

Symbolic reasoning:

• Humans have impressive ability at symbolic reasoning

• Compositional: can draw complex inferences from simple axioms.

Representation learning:

• Data driven: Do not need to know the base concepts

• Black box and not compositional: cannot easily combine and create more complex systems.

Symbols vs. Representations

Combining Symbolic Expressions & Black-box Function Evaluations in Neural Programs, ICLR 2018

ForoughArabshahi

SameerSingh

SYMBOLIC + NUMERICAL INPUT

Goal: Learn a domain of functions (sin, cos, log…)

Training on numerical input-output does not generalize.

Data Augmentation with Symbolic Expressions

Efficiently encode relationships between functions.

Solution:

Design networks to use both

symbolic + numeric

Leverage the observed structure of the data

Hierarchical expressions

TASKS CONSIDERED

◎Mathematical equation verification○ sin2 𝜃 + cos2 𝜃 = 1 ???

◎Mathematical question answering ○ sin2 𝜃 +

◎Solving differential equations

EXPLOITING HIERARCHICAL REPRESENTATIONS

Symbolic expression Function Evaluation Data Point Number Encoding Data Point

REPRESENTING MATHEMATICAL EQUATIONS

◎Grammar rules

DOMAIN

TREE-LSTM FOR CAPTURING HIERARCHIES

DATASET GENERATION

◎Random local changes

Replace Node Shrink Node Expand Node

DATASET GENERATION

◎Sub-tree matching

Dictionary key-value pairChoose Node Replace with value’s pattern

Majority Class Sympy LSTM : sym TreeLSTM : sym TreeLSTM:sym+num

Generalization 50.24% 81.74% 81.71% 95.18% 97.20%

Extrapolation 44.78% 71.93% 76.40% 93.27% 96.17%

40.00%

50.00%

60.00%

70.00%

80.00%

90.00%

100.00%A

EQUATION VERIFICATION

EQUATION COMPLETION

TAKE-AWAYS

Vastly Improved numerical evaluation: 90% over function-fitting baseline.

Generalization to verifying symbolic equations of higher depth

Combining symbolic + numerical data helps in better generalization for both tasks: symbolic and numerical evaluation.

LSTM: Symbolic TreeLSTM: Symbolic TreeLSTM: symbolic + numeric

76.40 % 93.27 % 96.17 %

TENSORS PLAY A CENTRAL ROLE

DATACOMPUTE

ALGORITHMS

TENSOR : EXTENSION OF MATRIX

TENSORS FOR DATAENCODE MULTI-DIMENSIONALITY

Image: 3 dimensionsWidth * Height * Channels

Video: 4 dimensionsWidth * Height * Channels * Time

TENSORS FOR MODELS

STANDARD CNN USE LINEAR ALGEBRA

Jean Kossaifi, Zack Chase Lipton, Aran Khanna, Tommaso Furlanello, A

Jupyters notebook: https://github.com/JeanKossaifi/tensorly-notebooks

TENSORS FOR MODELS

TENSORIZED NEURAL NETWORKS

SPACE SAVING IN DEEP TENSORIZED NETWORKS

T E N S O R L Y : H I G H - L E V E L A P I F O R T E N S O R A L G E B R A

• Python programming

• User-friendly API

• Multiple backends: flexible + scalable

• Example notebooks

Jean Kossaifi

TENSORLY WITH PYTORCH BACKEND

import tensorly as tl

from tensorly.random import tucker_tensor

tl.set_backend(‘pytorch’)

core, factors = tucker_tensor((5, 5, 5),

rank=(3, 3, 3))

core = Variable(core, requires_grad=True)

factors = [Variable(f, requires_grad=True) for f in factors]

optimiser = torch.optim.Adam([core]+factors, lr=lr)

for i in range(1, n_iter):

optimiser.zero_grad()

rec = tucker_to_tensor(core, factors)

loss = (rec - tensor).pow(2).sum()

for f in factors:

loss = loss + 0.01*f.pow(2).sum()

loss.backward()

optimiser.step()

Set Pytorch backend

Attach gradients

Set optimizer

Tucker Tensor form

AIREVOLUTIONIZING MANUFACTURING AND LOGISTICS

NVIDIA ISAAC —WHERE ROBOTS GO TO LEARN

Cars Pedestrians Path

Lanes Signs Lights

Cars Pedestrians Path

Lanes Signs Lights

1. COLLECT & PROCESS DATA 2. TRAIN MODELS

3. SIMULATE 4. DRIVE

NVIDIA DRIVEFROM TRAINING TO SAFETY

TAKEAWAYS

End-to-end learning from scratch is impossible in most settings

Blend DL w/ prior knowledge => improve data efficiency, generalization, model size

Obtain side guarantees like stability + safety,

Outstanding challenge (application dependent):

what is right blend of prior knowledge vs data?

Thank you

INFUSING PHYSICS + STRUCTURE INTO MACHINE LEARNING...PHYSICS-INFUSED LEARNING FOR ROBOTICS AND...

Documents

Transcript of INFUSING PHYSICS + STRUCTURE INTO MACHINE LEARNING...PHYSICS-INFUSED LEARNING FOR ROBOTICS AND...

Action Priors for Learning Domain Invariancesautonomously from previous tasks. To this end we introduce the notion of action priors, being localised distributions over the action sets

IMPROPER PRIORS FIDUCIAL INFERENCE - Statistics · “Improper priors are just limits of proper priors ... ” We present ingredients in a mathematical theory for statistics which

Abstractdivided into internal priors and external priors. The internal image priors are directly learned on the input test image itself, while the external image priors need to be

Inferring Visuomotor Priors for Sensorimotor Learning...Inferring Visuomotor Priors for Sensorimotor Learning Edward J. A. Turnham*, Daniel A. Braun, Daniel M. Wolpert Computational

Observational-Interventional Priors for Dose-Response Learning · 2017-01-13 · Observational-Interventional Priors for Dose-Response Learning Ricardo Silva Department of Statistical

CNL - Learning Overcomplete Representationspapers.cnl.salk.edu/PDFs/Learning Overcomplete... · 2011. 1. 28. · Learning Overcomplete Representations 341 Figure 1: Different priors

SCALPEL: Segmentation Cascades with Localized Priors and ...€¦ · SCALPEL: Segmentation CAscades with Localized Priors and Efﬁcient Learning David Weiss University of Pennsylvania

Salford Priors, Warwickshire 1 - Cotswold Archaeologyreports.cotswoldarchaeology.co.uk/.../0490-Salford-Priors-Warks-PAR...Salford Priors, Warwickshire ... shallow banana-shaped gullies.

Correlation Priors for Reinforcement Learning...Correlation Priors for Reinforcement Learning Bastian Alt AdrianŠošic´ Heinz Koeppl Department of Electrical Engineering and Information

Turner Priors

Pri3D: Can 3D Priors Help 2D Representation Learning?

Learning Shape Priors for Single View Reconstructionmi.eng.cam.ac.uk/~cipolla/publications/in... · Learning Shape Priors for Single View Reconstruction Yu Chen and Roberto Cipolla

Deep Learning Shape Priors for Object Segmentation

When learning physics mirrors doing physics

Using physics-based priors in a Bayesian algorithm to enhance ...

Learning Data-driven Reﬂectance Priors for Intrinsic Image Decompositiontinghuiz/papers/iccv15... · 2015-10-19 · Learning Data-driven Reﬂectance Priors for Intrinsic Image

Bayesian Learning of Bayesian Networks with Informative Priors - …stoics.org.uk/~nicos/pbs/amai.pdf · 2019. 3. 4. · Bayesian Learning of Bayesian Networks Bayesian Learning of

580.691 Learning Theory Reza Shadmehr Bayesian learning 1: Bayes rule, priors and maximum a posteriori.

LEARNING PRIORS FOR ADVERSARIAL AUTOENCODERS · Learning Priors for Adversarial Autoencoders Hui-Po Wang, Wei-Jan Ko and Wen-Hsiao Peng Department of Computer Science National Chiao

PGM: Tirgul 10 Parameter Learning and Priors