Part 4: combined segmentation and recognition

Part 4: combined segmentation and recognition

Li Fei-Fei

Aim• Given an image and object category, to segment the object

Segmentation should (ideally) be• shaped like the object e.g. cow-like• obtained efficiently in an unsupervised manner• able to handle self-occlusion

Segmentation

ObjectCategory

Model

Cow Image Segmented Cow

In this section: brief paper reviews

• Jigsaw approach: Borenstein & Ullman, 2001, 2002

• Concurrent recognition and segmentation: Yu and Shi, 2002

• Image parsing: Tu et al. 2003 • Interleaved segmentation: Liebe & Schiele,

2004, 2005• OBJCUT: Kumar et al. 2005• LOCUS: Winn and Jojic, 2005

Jigsaw approach: Borenstein and Ullman, 2001, 2002

Object-Specific Figure-Ground SegregationStella X. Yu and Jianbo Shi, 2002

Object-Specific Figure-Ground Segregation

Some segmentation/detection results

Yu and Shi, 2002

Image parsing: Tu, Zhu and Yuille 2003

Perc

eptu

al a

nd S

enso

ry A

ugm

ente

d Co

mpu

ting

Inte

rlea

ved

Obj

ect

Cate

gori

zati

on a

nd S

egm

enta

tion

Implicit Shape Model - Recognition

BackprojectedHypotheses

Interest PointsMatched Codebook Entries

Probabilistic Voting

Voting Space(continuous)

Backprojection

of Maxima

Segmentation

Refined Hypotheses(uniform sampling)

Liebe and Schiele, 2003, 2005

Perc

eptu

al a

nd S

enso

ry A

ugm

ente

d Co

mpu

ting

Inte

rlea

ved

Obj

ect

Cate

gori

zati

on a

nd S

egm

enta

tion Cows: Results

• Segmentations from interest points

Single-frame recognition - No temporal continuity used!

Liebe and Schiele, 2003, 2005

OBJCUT:shape prior -- Layered Pictorial Structures (LPS)

• Generative model• Composition of parts + spatial layout

Layer 2

Layer 1

Parts in Layer 2 can occlude parts in Layer 1

Spatial Layout(Pairwise Configuration)

Kumar, et al. 2004, 2005

OBJCUT

Probability of labelling in addition has• Unary potential which depend on distance from Θ (shape parameter)

D (pixels)

m (labels)

Θ (shape parameter)

Image Plane

Object CategorySpecific MRFx

y

mx

my

Unary PotentialΦx(mx|Θ)

Kumar, et al. 2004, 2005

In the absence of a clear boundary between object and background

SegmentationImage

OBJCUT: ResultsUsing LPS Model for Cow

LOCUS model

Deformation field D

Position & size T

Class shape π Class edge sprite μo,σo

Edge image e

Image

Object appearance λ1

Background appearance λ0

Mask m

Shared between images

Different for each image

Winn and Jojic, 2005

Summary

• Strength– Explains every pixel of the image– Useful for image editing, layering, etc.

• Issues– Invariance issues

• (especially) scale, view-point variations

Part 4: combined segmentation and recognition

Documents

Transcript of Part 4: combined segmentation and recognition