Posted on 18-Dec-2021
Unsupervised Representations towards Counterfactual Predictions
Animesh Garg
Compositional Representations
Vacuuming, Sweeping/Mopping, Cooking, Laundry
Diversity: New Scenes, Tools, …
Complexity: Long-term Settings
Personal Robot Assistant
Atlas Humanoid Robot
Staged Cooking
Dartmouth AI Meeting
UNIMATE: 1st industrial robot ('61)
1956, 1968, 2013, 2018, 2020
Unstructured/Unknown New Environment
Compositional Representations
Generalization
New Task Variations in Novel Environments
Task Imitation
Task Performance in Data Scarce Set-up
Supervision
Input
Compositional Representations
Learning Disentanglement
Learning Keypoints
Learning Causal Graphs
Generative Models: Disentanglement
Objectives of Disentanglement:
• Compositional Representations
• Controllable Sample Generation
Existing datasets in unsupervised disentanglement learning: dSprites, 3DShapes
True Factors of Variation
Latent Code
Disentanglement: Challenges
✗ High-Resolution Output
✗ Non-identifiability in Unsupervised Settings
✗ Metrics focus on learning disentangled representations
Locatello et al. ICML 2019.
Disentanglement: Challenges
✗ High-Resolution Output → StyleGAN-based backbone; new high-resolution synthetic datasets: Falcor3D and Isaac3D
✗ Non-identifiability in Unsupervised Settings → Limited Supervision (~1%)
✗ Metrics focus on learning disentangled representations → New metric to trade off controllability and disentanglement
Disentanglement: StyleGAN
Karras et al. CVPR 2019.
• Uses a style-based generator in place of the traditional generator
• Succeeds at generating high-resolution, realistic images
Disentanglement: Semi-StyleGAN
Li et al. ICML 2020 (ours).
The semi-supervised loss is given by the sum of:
• an unsupervised InfoGAN loss term
• a supervised label reconstruction term
• a smoothness regularization term
Disentanglement in StyleGAN: the Mapping Network in the generator conditions on the factor code, and the encoder predicts its value.
𝒥: set of all possible factor codes
𝒳: set of labeled pairs of real image and factor code
ℳ: mixed set of labeled and generated image-code pairs
E, G: encoder and generator neural networks
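As a minimal sketch, the combination of the three loss terms might look as follows; the weight names and values are hypothetical illustrations, not taken from Li et al.:

```python
# Hypothetical weights for illustration; the actual values used by
# Li et al. (ICML 2020) may differ.
LAMBDA_SUP = 1.0
LAMBDA_SMOOTH = 0.1

def semi_supervised_loss(l_infogan, l_label, l_smooth,
                         lambda_sup=LAMBDA_SUP,
                         lambda_smooth=LAMBDA_SMOOTH):
    """Combine the unsupervised InfoGAN term, the supervised label
    reconstruction term, and the smoothness regularizer into one scalar."""
    return l_infogan + lambda_sup * l_label + lambda_smooth * l_smooth
```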
Disentanglement: Semi-StyleGAN
Li et al. ICML 2020.
Labeled Data 𝒥 ≪ 𝒳 Unpaired Data; ℳ: Artificially Augmented Data
Semi-StyleGAN: CelebA (256x256)
Li et al. ICML 2020.
With very limited supervision, Semi-StyleGAN can achieve good disentanglement on real data
[Figure: from image 𝐼, the factor (<person_id>, long-hair) is edited to the counterfactual (<person_id>, bangs); all fixed-value factors are held constant.]
Semi-StyleGAN: Isaac3D (512x512)
Li et al. ICML 2020.
Each factor in the interpolated images changes smoothly without affecting other factors
[Figure: from image 𝐼, the factor (<object_id>, x-pos) is edited to the counterfactual (<object_id>, x-pos_new); all fixed-value factors are held constant.]
Semi-StyleGAN: Falcor3D (512x512)
Li et al. ICML 2020.
Each factor in the interpolated images changes smoothly without affecting other factors
[Figure: from image 𝐼, the factor (<lighting>, camera_pos) is edited to the counterfactual (<lighting>, camera_new); all fixed-value factors are held constant.]
Semi-StyleGAN: Role of Limited Supervision
Using only 0.25–2.5% of labeled data is on par with fully supervised disentanglement
(L2 and L2-gen: lower is better, MIG and MIG-gen: higher is better)
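The MIG (Mutual Information Gap) numbers above measure, for each ground-truth factor, the normalized gap between the two latent dimensions most informative about it. A self-contained sketch for discretized codes follows; the function names are ours, not from the paper's evaluation code:

```python
import numpy as np

def discrete_mutual_info(x, y):
    """Mutual information (in nats) between two discrete 1-D arrays."""
    mi = 0.0
    for xv in np.unique(x):
        for yv in np.unique(y):
            pxy = np.mean((x == xv) & (y == yv))
            px, py = np.mean(x == xv), np.mean(y == yv)
            if pxy > 0:
                mi += pxy * np.log(pxy / (px * py))
    return mi

def entropy(x):
    _, counts = np.unique(x, return_counts=True)
    p = counts / counts.sum()
    return float(-np.sum(p * np.log(p)))

def mig(factors, latents):
    """factors: (n, K) ground-truth codes; latents: (n, D) discretized latents.
    For each factor, take the gap between its top-two mutual informations
    with the latent dimensions, normalized by the factor's entropy."""
    scores = []
    for k in range(factors.shape[1]):
        mis = sorted((discrete_mutual_info(factors[:, k], latents[:, d])
                      for d in range(latents.shape[1])), reverse=True)
        h = entropy(factors[:, k])
        if h > 0:
            scores.append((mis[0] - mis[1]) / h)
    return float(np.mean(scores))
```

A perfectly disentangled code, where each latent dimension copies exactly one independent factor, scores 1.0.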
Semi-StyleGAN with the default setting
Semi-StyleGAN: Fine-Grained Tuning
A new architecture with the same loss enables semantic fine-grained image editing
Coarse-Grained Fine-Grained
Semi-StyleGAN: CelebA (256x256)
Li et al. ICML 2020.
We randomly choose some deep learning researchers as test images
[Figure: from each test image 𝐼, the factors (eyebrow, smile) are edited to the counterfactual (eyebrow, grin); fixed-value factors are held constant.]
Semi-StyleGAN: Isaac3D (512x512)
Li et al. ICML 2020.
We shift the robot position to the right side, and attach it with an unseen object in test images
[Figure: from each test image 𝐼, the factors (wall_color, obj_color) are edited to counterfactual values; fixed-value factors are held constant.]
Compositional Representations
Learning Disentanglement
Learning Keypoints
Learning Causal Graphs
[Figure: target object and goal position]
Representations for multi-step reasoning in Robotics under physical and semantic constraints
From state s0, choose an action sequence a_t, …, a_{t+H} under learned dynamics s' ~ f(· | s, a).
[Deisenroth et al., RSS'07], [Guo et al., NeurIPS'14], [Watter et al., NeurIPS'15], [Finn et al., ICRA'17], …
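The loop above (pick the sequence a_t, …, a_{t+H} that scores best under the learned dynamics) is often implemented as random-shooting MPC. A toy sketch, with a placeholder deterministic dynamics function standing in for the learned f:

```python
import numpy as np

rng = np.random.default_rng(0)

def f(s, a):
    # Placeholder for the learned dynamics s' ~ f(. | s, a):
    # a toy deterministic integrator for illustration only.
    return s + a

def plan(s0, goal, horizon=5, n_candidates=256):
    """Random-shooting MPC: sample candidate action sequences,
    roll each out through f, and keep the best-scoring one."""
    best_seq, best_cost = None, np.inf
    for _ in range(n_candidates):
        seq = rng.uniform(-1.0, 1.0, size=(horizon, s0.shape[0]))
        s = s0.copy()
        for a in seq:
            s = f(s, a)
        cost = np.linalg.norm(s - goal)  # predicted final distance to goal
        if cost < best_cost:
            best_seq, best_cost = seq, cost
    return best_seq, best_cost

seq, cost = plan(np.zeros(2), np.array([1.0, 1.0]))
```

In practice only the first action of the best sequence is executed, and planning repeats from the new state.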
Model-based learning
data ↑, learning ↑
[Janner et al. ICRA'19]
[Agrawal et al. ICRA’16]
[Ebert et al. CoRL’17]
[Deisenroth et al. RSS’07]
Model-based learning
CAVIN: Hierarchical planning in learned latent spaces
From s0, the CAVIN Planner samples an effect code c and a motion code z.
• Leverage Hierarchical Abstraction in Action Space
• Without Hierarchical Supervision
The effect code c determines subgoals; the motion code z determines actions.
CAVIN: Hierarchical planning in learned latent spaces
1. Choose c ~ p(c); the meta-dynamics model predicts subgoals s_{t+T}, s_{t+2T}, s_{t+3T}.
2. Choose z ~ p(z); the action generator, together with the dynamics model, produces actions a_{t+1}, a_{t+2}, a_{t+3}.
Learning with cascaded variational inference, from task-agnostic interaction:
• Inference networks q_h(c | s, s'') and q_g(z | s, c, a), each outputting Gaussian parameters (μ_c, Σ_c) and (μ_z, Σ_z)
• Meta-dynamics h(s'' | s, c)
• Action generator g(a | s, c, z)
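Putting the pieces together, a toy version of the two-level procedure might look like this. The linear stand-ins below replace the learned meta-dynamics h, action generator g, and latent priors, so this illustrates only the control flow, not the trained models:

```python
import numpy as np

rng = np.random.default_rng(1)

def meta_dynamics(s, c):
    # Stand-in for h(s'' | s, c): predicts a subgoal T steps ahead.
    return s + c

def action_generator(s, c, z):
    # Stand-in for g(a | s, c, z): decodes a short segment of 3 actions.
    return np.stack([0.5 * c + 0.1 * z] * 3)

def cavin_plan(s0, goal, n_c=128, n_z=64):
    # High level: choose the effect code c whose predicted subgoal
    # lands nearest the goal.
    cs = rng.normal(size=(n_c, s0.shape[0]))
    subgoals = np.array([meta_dynamics(s0, c) for c in cs])
    c = cs[np.argmin(np.linalg.norm(subgoals - goal, axis=1))]
    # Low level: choose the motion code z whose decoded actions
    # best reach that subgoal.
    zs = rng.normal(size=(n_z, s0.shape[0]))
    best_z = min(zs, key=lambda z: np.linalg.norm(
        s0 + action_generator(s0, c, z).sum(axis=0) - meta_dynamics(s0, c)))
    return c, best_z, action_generator(s0, c, best_z)
```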
Visual observation from a Kinect2 sensor is preprocessed into state s_t; the CAVIN Planner outputs an action [x, y, Δx, Δy].
Tasks
• Crossing: move the target object to the goal position across grey tiles.
• Insertion: move the target object to the goal position without traversing red tiles.
• Clearing: clear all objects within the area of blue tiles.
Quantitative Evaluation
[Bar chart: Success Rate (%) of MPC, CVAE-MPC, SeCTAR, CAVIN, and CAVIN-Real under Dense and Sparse rewards; the hierarchical latent-space dynamics models lead by roughly 12–15%.]
MPC (Guo et al. '14, Agrawal et al. '16, Finn et al. '17); CVAE-MPC (Ichter et al. '18); SeCTAR (Co-Reyes et al. '18)
Better performance with sparse reward signal
Averaged over 3 Tasks with 1000 test instances each
Videos (5x speed): Move 2 obstacles, Get around, Open path
Compositional Representations
Learning Disentanglement
Learning Keypoints
Learning Causal Graphs
Composition through Keypoints
Interpretable: He et al. 2017, Kreiss et al. 2019, Lin et al. 2020, Spurr et al. 2020
Unsupervised: Tang et al. 2019, Christiansen et al. 2019, Bian et al. 2019, He et al. 2019, Chen et al. 2019
Learning Keypoints From Video
Learning Keypoints From Video: BBC, Human3.6M, CelebA/MAFL
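Unsupervised keypoint detectors of this kind typically convert each predicted spatial heatmap into a (row, col) coordinate with a differentiable soft-argmax. A minimal numpy version (our sketch, not any paper's code):

```python
import numpy as np

def soft_argmax(heatmap):
    """Expected (row, col) coordinate of a 2-D heatmap under its softmax.
    Differentiable, so it can sit inside an end-to-end trained network."""
    h, w = heatmap.shape
    probs = np.exp(heatmap - heatmap.max())  # stable softmax
    probs /= probs.sum()
    rows = np.arange(h)[:, None]
    cols = np.arange(w)[None, :]
    return float((probs * rows).sum()), float((probs * cols).sum())
```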
Unsupervised Keypoints: Batch RL
Large Set of Task Demonstrations
Policy Learning without Interaction
Unsupervised Keypoints: Batch RL with Unsupervised Representation Learning
[Bar chart: accuracy (%) of BC, BCQ, and IRIS]
Unsupervised Keypoints: Video Prediction
Compositional Representations
Learning Disentanglement
Learning Keypoints
Learning Causal Graphs
Learning Causality
Photo from Wu et al., Learning to See Physics via Visual De-animation
• Intuitive Physics
• vs. Reinforcement Learning: generalization, goal specification, sample efficiency
• vs. Analytical Physics Models: underlying dynamics is uncertain or unknown; simulation is too time-consuming; partial observation
• Intuitive Physics
• State Representation? Keypoints vs. 6-DoF pose
Photo from Jakab et al., Unsupervised Learning of Object Landmarks through Conditional Image Generation
Photo from Tremblay et al., Deep Object Pose Estimation for Semantic Robotic Grasping of Household Objects
From left to right: (1) input image, (2) predicted keypoints, (3) overlay, (4) heatmap from the keypoints, (5) reconstructed target image.
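Step (4), the heatmap rendered from the predicted keypoints, is commonly produced by placing an isotropic Gaussian at each keypoint location; a sketch, where σ is an arbitrary illustrative choice:

```python
import numpy as np

def keypoints_to_heatmap(keypoints, height, width, sigma=1.5):
    """Render one Gaussian bump per (row, col) keypoint,
    summed into a single height x width map."""
    rows = np.arange(height)[:, None]
    cols = np.arange(width)[None, :]
    heatmap = np.zeros((height, width))
    for r, c in keypoints:
        heatmap += np.exp(-((rows - r) ** 2 + (cols - c) ** 2)
                          / (2 * sigma ** 2))
    return heatmap
```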
Learning Causality: Extrapolation
Learning Causality: Counterfactual
Unsupervised Representations towards Counterfactual Predictions
Animesh Garg
garg@cs.toronto.edu // pair.toronto.edu // Twitter: @Animesh_Garg