Multi-Output Learning for Camera Relocalization Abner Guzmán-Rivera UIUC Pushmeet Kohli Ben Glocker...

Multi-Output Learningfor Camera Relocalization

Abner Guzmán-Rivera UIUC

Pushmeet Kohli Ben Glocker Jamie Shotton Toby Sharp Andrew Fitzgibbon Shahram Izadi

Microsoft Research

Camera Relocalizationfrom RGB-D images

Know 3D model

RGB-Depth

Observe single frame

Where is the camera?

6D camera pose H(rotation and translation)

Applications Large scale 3D model reconstruction

Applications Vehicle, robot, etc. localization

Applications Augmented Reality

Other Approaches to Localization Sparse key-point matching:

– Detectors: [Rosten et al. PAMI’10], [Holzer et al. ECCV’12]

– Descriptors: [Winder and Brown CVPR’07], [Calonder et al. ECCV’10], [Rublee et al. ICCV’11]

– Matching: [Lepetit and Fua PAMI’06], [Nistér and Stewénius CVPR’06], [Schindler et al. CVPR’07]

– Pose estimation: [Irschara et al. CVPR’09], [Dong et al. ICCV’09], [Yi et al. ECCV’10], [Baatz et al. IJCV’11], [Sattler et al. ICCV’11]

Whole key-frame matching[Klein and Murray ECCV’08], [Gee and Mayol-Cuevas BMVC’12]

Epitomic location recognition[Ni et al. PAMI’09]

Relocalization as Inverse Problem Find the pose H* minimizing the error in a

rendering of the model

3D model of sceneRendering error

View “renderer”Input RGB-D frame

Inverse Problem

DiscriminativePredictor

Inverse Problem

Single Predictor Not Powerful Enough Limited expressivity

The mapping is one-to-many

Input frame

Approx. Inverse Problem Stage 1

Portfolio ofDiscriminative

PredictorsWant complementary or “diverse” predictions

Approx. Inverse Problem Stage 2

How to train such portfolioof complementary predictors?

Discriminative Predictor[Shotton et al. CVPR’13]

Scene Coordinate Regression Forests

[Shotton et al. CVPR’13]

Pixel comparison features(Depth and RGB) (x,y,z) world coordinate

Regression tree:

Regression forest

Scene Coordinate Regression Forests

Inliers for several hypothesesfrom RANSAC

. . .Forest predicts 3Dworld coordinates

Sample pixels frominput RGB-D frame

Learning a portfolio of predictors

to output a set of hypotheses that:Would like to train a set of predictors

1. Are relevant, i.e., approx. local minimizers2. Summarize well the output space

Learning a portfolio: previous work Multiple Choice Learning

[Guzman-Rivera et al. NIPS’12, AISTATS’14]

Set min-loss Oracle penalizes portfolio for the errorin the best prediction in the output

– The portfolio is NOT penalized for being diverse– Set min-loss applies to standard datasets– Iterative training of fixed size portfolio

Standard task-loss

Learning a portfolio of predictors

Portfolio of predictors CVPR’13 SCoRe Forest

We already have the objective to optimize

and propose to approximate (1) by

– The portfolio is NOT penalized for being diverse– Learning procedure is able to tune portfolio to

the reconstruction error to be used at test-time– Next we describe one way to achieve diversity

Multi-Output LossStandard task-loss

Training Algorithm

Loss to Example Weights

Diversity parameter(“variance” of the weights)

Multi-output loss for example j

Intuition: Want next predictor to emphasize accuracy on examples difficult thus far

Rendering Error

L1 Rendering ErrorInput frame 1. Raycast depth frame for some hypothesis

2. Evaluate L1 distance between input depth and raycast depth

Results

7-Scenes Dataset

[Shotton et al. CVPR’13, Glocker et al. ISMAR’13]

Metric Proportion Correct (single prediction)

– Correct if translational error ≤ 5cm ANDrotational error ≤ 5o

Competing Approaches CVPR13: Scene Coordinate Regression Forests

CVPR13 + M-Best– Take M-Best RANSAC hypotheses

Office

Input frame

Multiple predictions:

Ground-truth (white),Prediction (magenta):

Stairs

Input frame

Multiple predictions:

Ground-truth (white),Prediction (magenta):

All Scene Average

1 2 3 4 5 6 7 8 9 100.66

CVPR13 + M-BestMulti-OutputCVPR13

Size of Portfolio

All Scene Average

1 2 3 4 5 6 7 8 9 100.66

CVPR13 + M-BestMulti-OutputCVPR13

Size of Portfolio

Usingaggregation

Summary Camera relocalization as inverse problem

Portfolio of complementarydiscriminative predictors

Method to learn suchportfolio

State-of-the-art camerarelocalization

Multi-Output Learning for Camera Relocalization Abner Guzmán-Rivera UIUC Pushmeet Kohli Ben Glocker...

Documents

Transcript of Multi-Output Learning for Camera Relocalization Abner Guzmán-Rivera UIUC Pushmeet Kohli Ben Glocker...

De-Growth Is Not a Liberal Agenda: Relocalization and the ... · De-Growth Is Not a Liberal Agenda: Relocalization and the limits to low energy cosmopolitanism. STEPHEN QUILLEY Senior

Real-Time 6DOF Pose Relocalization for Event …rpg.ifi.uzh.ch/CVPR19_event_vision_workshop_files/docs/...Real-Time 6DOF Pose Relocalization for Event Cameras with Stacked Spatial

Democracy By Dave Lewit, October 7, 2009 Presented at Mass Coalition for Healthy Communities conference on Relocalization, October 18, 2009.

P 3 & Beyond Solving Energies with Higher Order Cliques Pushmeet Kohli Pawan Kumar Philip H. S. Torr Oxford Brookes University CVPR 2007.

Local Supports Global: Deep Camera Relocalization …openaccess.thecvf.com/content_ICCV_2019/supplemental/Xue...The star represents the start point. The poses of PoseNet and MapNet

KFNet: Learning Temporal Camera Relocalization using ... · KFNet: Learning Temporal Camera Relocalization using Kalman Filtering Lei Zhou 1Zixin Luo Tianwei Shen Jiahui Zhang2 Mingmin

Night-time Indoor Relocalization Using Depth Image with ...cs Li.pdf · Night-time Indoor Relocalization Using Depth Image with Convolutional Neural Networks Ruihao Li1, ... in recognition

Reserve Requirements for Price and Financial Stability ... · PDF fileReserve Requirements for Price and Financial Stability: When Are They Eﬀective?∗ Christian Glocker and Pascal

Measuring Uncertainty in Graph Cut Solutions Pushmeet Kohli Philip H.S. Torr Department of Computing Oxford Brookes University.

PowerPoint Presentation - MAP Estimation Algorithms in ... · MAP Estimation Algorithms in M. Pawan Kumar, University of Oxford Pushmeet Kohli, Microsoft Research Computer Vision

Transcription factors Mix1 and VegT, relocalization of ...Transcription factors Mix1 and VegT, relocalization of vegt mRNA, and conserved endoderm and dorsal specification in frogs

Solving Markov Random Fields using Dynamic Graph Cuts & Second Order Cone Programming Relaxations M. Pawan Kumar, Pushmeet Kohli Philip Torr.

Real-Time RGB-D Camera Relocalization via Randomized Ferns for … · 2017-02-18 · 1 Real-Time RGB-D Camera Relocalization via Randomized Ferns for Keyframe Encoding Ben Glocker,

Crossveinless-2 is required for the relocalization of Chordin protein ...

Focused Inference with Local Primal-Dual Gaps Dhruv Batra (TTIC) Joint work with: Daniel Tarlow (U Toronto), Sebastian Nowozin (MSRC), Pushmeet Kohli (MSRC),

High-mobility group A1 inhibits p53 by cytoplasmic relocalization of ...

POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.

Adaptive Parametrization of Multivariate B-splines for ... · msh@imm.dtu.dk Ben Glocker and Nassir Navab Technische Universität München München, Germany Abstract We present an

Nonlinear Hybrid Dynamical Systems: Modeling, … · Nonlinear Hybrid Dynamical Systems: Modeling, Optimal Control, and Applications Martin Buss1, Markus Glocker 2, Michael Hardt

The Relocalization of Buddhism in Thailand - Dickinson Blogs