Occlusion Aware Unsupervised Learning of Optical Flow

29
Occlusion Aware Unsupervised Learning of Optical Flow Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu Baidu Research, Institute of Deep Learning (IDL)

Transcript of Occlusion Aware Unsupervised Learning of Optical Flow

Page 1: Occlusion Aware Unsupervised Learning of Optical Flow

Occlusion Aware Unsupervised Learning of Optical Flow

Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei XuBaidu Research, Institute of Deep Learning (IDL)

Page 2: Occlusion Aware Unsupervised Learning of Optical Flow

Background – Self-supervised Learning

[3] Eslami et al, Natural scene representation and rendering, Science 2018 [2] Pathak et al, Curiosity-driven exploration by self-supervised prediction, ICML 2017

[1]Jaderberg et al, Reinforcement learning with unsupervised auxiliary tasks, ICLR 2017

[1] [2] [3]

• Learning without human annotation

Page 3: Occlusion Aware Unsupervised Learning of Optical Flow

Cherry On the Cake

Reinforcement Learning

Supervised Learning

Self-supervised Learning

Page 4: Occlusion Aware Unsupervised Learning of Optical Flow

But We Focus on Self-supervised Optical Flow

• Optical flow: [∆"#, ∆%#] encodes 2D motion for every pixel &

Brox and Malik, Large displacement optical flow: descriptor matching in variational motion estimation, PAMI 2011

Page 5: Occlusion Aware Unsupervised Learning of Optical Flow

Why Optical Flow?

• Optical flow is very useful but ground truth are hard to obtain• Optical flow technique can also apply to stereo depth estimation.• Can be evaluated with standard benchmark dataset.

• Depth and flow extend “state” representation from 2D to 4D in RL.• RGB, depth and flow are complimentary to each other.• Stereo Video -> Optical flow + Depth -> Attention / Recognition.

Page 6: Occlusion Aware Unsupervised Learning of Optical Flow

Why Optical Flow?

• A 4 month baby can• Follow an object (Tracking, Flow)• Reach and grasp an object (Depth, Shape)• Pay attention to small objects (Attention)

Page 7: Occlusion Aware Unsupervised Learning of Optical Flow

Previous Work

• Idea: Learning to predict optical flow to minimize photometric loss

[3] Zhe et al, Unsupervised Deep Learning for Optical Flow Estimation, AAAI 2017[2] Jason et al, Backtobasics: Unsupervised learning of optical flow via brightness constancy and motion, ECCV 2016W

[1] Ahmadi et al, Unsupervised convolutional neural networks for motion estimation, ICIP 2016

Page 8: Occlusion Aware Unsupervised Learning of Optical Flow

Warping Using Optical FlowUse image2 and optical flow to re-construct image1 with bilinear interpolation.Can be implemented through Spatial Transformer Networks.

Page 9: Occlusion Aware Unsupervised Learning of Optical Flow

Occlusion Problem in Warping: correct flow but wrong reconstruction

Page 10: Occlusion Aware Unsupervised Learning of Optical Flow

Problem – Occlusion affects Performance

Page 11: Occlusion Aware Unsupervised Learning of Optical Flow

Problem – Occlusion affects Performance

Page 12: Occlusion Aware Unsupervised Learning of Optical Flow

Problem – Occlusion affects Performance

Page 13: Occlusion Aware Unsupervised Learning of Optical Flow

Our Solution – Model Occlusion Explicitly

Page 14: Occlusion Aware Unsupervised Learning of Optical Flow

Occlusion Map

Page 15: Occlusion Aware Unsupervised Learning of Optical Flow

Occlusion Map

Page 16: Occlusion Aware Unsupervised Learning of Optical Flow

Occlusion Map

Page 17: Occlusion Aware Unsupervised Learning of Optical Flow

Occlusion-Aware Photometric Loss

Page 18: Occlusion Aware Unsupervised Learning of Optical Flow

Model Structure

Page 19: Occlusion Aware Unsupervised Learning of Optical Flow

Improving Backpropagation - Enlarged Search• The warped pixel only depends on its four nearest neighbors, so if the

target position is far away from the proposed position, the network will not get meaningful gradient signals during backpropagation. • We search for the best bilinear interpolation in different scales.

Page 20: Occlusion Aware Unsupervised Learning of Optical Flow

Experiments: Benchmark Datasets

• Dataset StatisticsDataset #Unsupervised Train Paris #Validation Paris Train Has GT Flow

Flying Chairs 22232 640MPI Sintel 908 133KITTI 2012 13372 194 ✘

Page 21: Occlusion Aware Unsupervised Learning of Optical Flow

Quantitative Results – The Lower the Better

5.1

10.4

17

5.3

11.3

3.3

8

3.62.7

7 7.5

0

2

4

6

8

10

12

14

16

18

Flying-Chairs MPI Sintel Clean KITTI 2012

Average Endpoint Error in pixels (EPE)

DSTFlow [1]BackToBasic [2]OursSupervise [3]

[1] Zhe et al, Unsupervised Deep Learning for Optical Flow Estimation, AAAI 2017 [2] Jason et al, Backtobasics: Unsupervised learning of optical flow via brightness constancy and motion, ECCV 2016W

[3] Flownet: Learning optical flow with convolutional networks, ICCV 2015

Page 22: Occlusion Aware Unsupervised Learning of Optical Flow

Cherry Pick Examples – KITTI

Page 23: Occlusion Aware Unsupervised Learning of Optical Flow

Cherry Pick Examples – MPI Sintel

Page 24: Occlusion Aware Unsupervised Learning of Optical Flow

Ablation Study

Page 25: Occlusion Aware Unsupervised Learning of Optical Flow

Extension: Occlusion Aware Unsupervised Learning of Stereo Depth (Disparity)• !" and !# are left and right stereo image pairs.• For calibrated cameras, the output disparity is only $ channel with ReLU.

Page 26: Occlusion Aware Unsupervised Learning of Optical Flow
Page 27: Occlusion Aware Unsupervised Learning of Optical Flow

0.102

0.060.068

0

0.02

0.04

0.06

0.08

0.1

0.12

KITTI 2015 Stereo

Abs Rel (The Lower the Better)

Godard [1]

Ours

Supervise* [1]

Results on KITTI Stereo Disparity Estimation

[1] Godard et al, Unsupervised monocular depth estimation with left-right consistency, CVPR 2017

Page 28: Occlusion Aware Unsupervised Learning of Optical Flow

Extension 2: Unsupervised Learning of Scene Flow (Flow + Disparity)

IL2

IL1 OpticalFlow

FlowNet

IL2

IL1

MotionNet

IR1

IL1

Camera Motion

DispNet

Depth Flow

Disparity /Depth

Moving Object Mask

Fused Flow

Page 29: Occlusion Aware Unsupervised Learning of Optical Flow

Cherry-Pick Examples of Moving Objects Mask