Part 4: combined segmentation and recognition
description
Transcript of Part 4: combined segmentation and recognition
Part 4: combined segmentation and recognition
Li Fei-Fei
Aim• Given an image and object category, to segment the object
Segmentation should (ideally) be• shaped like the object e.g. cow-like• obtained efficiently in an unsupervised manner• able to handle self-occlusion
Segmentation
ObjectCategory
Model
Cow Image Segmented Cow
In this section: brief paper reviews
• Jigsaw approach: Borenstein & Ullman, 2001, 2002
• Concurrent recognition and segmentation: Yu and Shi, 2002
• Image parsing: Tu et al. 2003 • Interleaved segmentation: Liebe & Schiele,
2004, 2005• OBJCUT: Kumar et al. 2005• LOCUS: Winn and Jojic, 2005
Jigsaw approach: Borenstein and Ullman, 2001, 2002
Object-Specific Figure-Ground SegregationStella X. Yu and Jianbo Shi, 2002
Object-Specific Figure-Ground Segregation
Some segmentation/detection results
Yu and Shi, 2002
Image parsing: Tu, Zhu and Yuille 2003
Image parsing: Tu, Zhu and Yuille 2003
Perc
eptu
al a
nd S
enso
ry A
ugm
ente
d Co
mpu
ting
Inte
rlea
ved
Obj
ect
Cate
gori
zati
on a
nd S
egm
enta
tion
Implicit Shape Model - Recognition
BackprojectedHypotheses
Interest PointsMatched Codebook Entries
Probabilistic Voting
Voting Space(continuous)
Backprojection
of Maxima
Segmentation
Refined Hypotheses(uniform sampling)
Liebe and Schiele, 2003, 2005
Perc
eptu
al a
nd S
enso
ry A
ugm
ente
d Co
mpu
ting
Inte
rlea
ved
Obj
ect
Cate
gori
zati
on a
nd S
egm
enta
tion Cows: Results
• Segmentations from interest points
Single-frame recognition - No temporal continuity used!
Liebe and Schiele, 2003, 2005
OBJCUT:shape prior -- Layered Pictorial Structures (LPS)
• Generative model• Composition of parts + spatial layout
Layer 2
Layer 1
Parts in Layer 2 can occlude parts in Layer 1
Spatial Layout(Pairwise Configuration)
Kumar, et al. 2004, 2005
OBJCUT
Probability of labelling in addition has• Unary potential which depend on distance from Θ (shape parameter)
D (pixels)
m (labels)
Θ (shape parameter)
Image Plane
Object CategorySpecific MRFx
y
mx
my
Unary PotentialΦx(mx|Θ)
Kumar, et al. 2004, 2005
In the absence of a clear boundary between object and background
SegmentationImage
OBJCUT: ResultsUsing LPS Model for Cow
LOCUS model
Deformation field D
Position & size T
Class shape π Class edge sprite μo,σo
Edge image e
Image
Object appearance λ1
Background appearance λ0
Mask m
Shared between images
Different for each image
Winn and Jojic, 2005
Summary
• Strength– Explains every pixel of the image– Useful for image editing, layering, etc.
• Issues– Invariance issues
• (especially) scale, view-point variations