MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

32
MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes

Transcript of MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

Page 1: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Lecture 8Segmentation of Dynamical

Scenes

Page 2: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

One-body two-views

Page 3: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

One-body multiple-views

Page 4: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Motivation and problem statement

•A static scene: multiple 2D motion models

•A dynamic scene: multiple 3D motion models

•Given an image sequence, determine Number of motion models (affine, Euclidean, etc.) Motion model: affine (2D) or Euclidean (3D) Segmentation: model to which each pixel belongs

Page 5: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Previous work on 2D motion segmentation

• Local methods (Wang-Adelson ’93) Estimate one model per

pixel using data in a window Cluster models with K-

means Iterate Aperture problem Motion across boundaries

• Global methods (Irani-Peleg ‘92) Dominant motion: fit one

motion model to all pixels Look for misaligned pixels

& fit a new model to them Iterate

• Normalized cuts (Shi-Malik ‘98) Similarity matrix

based on motion profile

Segment pixels using eigenvector

Page 6: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Previous work on 3D motion segmentation

• Factorization techniques Orthographic/discrete: Costeira-Kanade ’98, Gear ‘98 Perspective/continuous: Vidal-Soatto-Sastry ’02 Omnidirectional/continuous: Shakernia-Vidal-Sastry ’03

• Special cases: Points in a line (orth-discrete): Han and Kanade ’00 Points in a conic (perspective): Avidan-Shashua ’01 Points in a line (persp.-continuous): Levin-Shashua ’01

• 2-body case: Wolf-Shashua ‘01

Page 7: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Previous work: probabilistic techniques

• Probabilistic approaches Generative model: data membership + motion model Obtain motion models using Expectation Maximization

E-step: Given motion models, segment image data M-step: Given data segmentation, estimate motion models

• 2D Motion Segmentation Layered representation (Jepson-Black’93, Ayer-Sawhney ’95,

Darrel-Pentland’95, Weiss-Adelson’96, Weiss’97, Torr-Szeliski-Anandan ’99)

• 3D Motion Segmentation EM+Reprojection Error: Feng-Perona’98 EM+Model Selection: Torr ’98

• How to initialize iterative algorithms?

Page 8: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Our approach to motion segmentation

•This work considers full perspective projection multiple objects general motions

•We show that Problem is equivalent to

polynomial factorization There is a unique global

closed form solution if n<5

Exact solution is obtained using linear algebra

Can be used to initialize EM-based algorithms

Image points

Number of motions

Multibody Fund. Matrix

Epipolar lines

Epipoles

Fundamental Matrices

Motion segmentation

Multi epipolar lines

Multi epipole

Page 9: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

One-dimensional segmentation

•Number of models?

Page 10: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

One-dimensional segmentation

• For n groups

• Number of groups

• Groups

Page 11: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

One-dimensional segmentation

• Solution is unique if

• Solution is closed form if and only if

• Solves the eigenvector segmentation problem e.g. normalized cuts

Page 12: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Three-dimensional motion segmentation

Generalized PCA (Vidal, Ma ’02)

• Solve for the roots of a polynomial of degree in one variable

• Solve for a linear system in variables

Page 13: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

The multibody epipolar constraint

• Rotation:• Translation:• Epipolar constraint

• Multiple motions

Multibody epipolar constraint• Satisfied by ALL points regardless of segmentation

• Segmentation is algebraically eliminated!!!

Page 14: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

The multibody fundamental matrix

Lifting EmbeddingEmbedding

Bilinear on embedded data!

• Veronese map (polynomial embedding)

• Multibody fundamental matrix

Page 15: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Estimation of the number of motions

• Theorem: Given image points corresponding to motions, if at least 8 points correspond to each object, then

1 2 43

8 35 99 225Minimum number of

points

Page 16: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Estimation of multibody fundamental matrix

1-body motion n-body motion

Page 17: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Segmentation of fundamental matrices

Given

rank condition for n motions

linear system F

Multibody epipolar transfer

Multibody epipole

Fundamental matrices

Page 18: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Multibody epipolar transfer

Multibody epipolar line

Lifting

Polynomial factorization

Page 19: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Multibody epipole

• The multibody epipole is the solution of the linear system

• Epipoles are obtained using polynomial factorization

Lifting

Number of distinct epipoles

Page 20: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

From images to epipoles

Page 21: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Fundamental matrices

• Columns of are epipolar lines

• Polynomial factorization to compute them up to scale

• Scales can be computed linearly

Page 22: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

The multibody 8-point algorithm

Image point

Multibody epipolar line

Multibody epipolar transfer

Polynomial Factorization

Linear system

Polynomial Factorization

Epipolar lines

Multibody epipole

Epipoles

Embedded image point

Veronese map

Fundamental matrix

Linear system

Page 23: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Optimal 3D motion segmentation

• Zero-mean Gaussian noise• Constrained optimization problem on

• Optimal function for 1 motion

• Optimal function for n motions

• Solved using Riemanian Gradien Descent

Page 24: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Comparison of 1 body and n bodies

Page 25: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Other cases: linearly moving objects

• Multibody epipole• Recovery of epipoles• Fundamental matrices• Feature segmentation 1 2 105

2 5 20 65

1 2 43

8 35 99 225

Minimum number of

points

Page 26: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Other cases: affine flows

Affine motion segmentation:

constant brightness constraint

3D motion segmentation:epipolar constraint

• In linear motions, geometric constraints are linear

• Two-view motion constraints could be bilinear!!!

Page 27: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

3D motion segmentation results

N = 44 + 48 + 81 = 173

Page 28: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Results

Page 29: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Results

Page 30: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

More results

Page 31: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

Conclusions

• There is an analytic solution to 3D motion segmentation based on Multibody epipolar constraint: it does not depend

on the segmentation of the data Polynomial factorization: linear algebra Solution is closed form iff n<5

• A similar technique also applies to Eigenvector segmentation: from similarity

matrices Generalized PCA: mixtures of subspaces 2-D motion segmentation: of affine motions

• Future work Reduce data complexity, sensitivity analysis,

robustness

Page 32: MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

MASKS © 2004 Invitation to 3D vision

References

• R. Vidal, Y. Ma, S. Soatto and S. Sastry. Two-view multibody structure from motion, International Journal of Computer Vision, 2004

• R. Vidal and S. Sastry. Optimal segmentation of dynamic scenes from two perspective views, International Conference on Computer Vision and Pattern Recognition, 2003

• R. Vidal and S. Sastry. Segmentation of dynamic scenes from image intensities, IEEE Workshop on Vision and Motion Computing, 2002.