Similarity Metrics for Categorization: From Monolithic to Category Specific

29
Boris Babenko, Steve Branson, Serge Belongie University of California, San Diego ICCV 2009, Kyoto, Japan

description

Similarity Metrics for Categorization: From Monolithic to Category Specific. Boris Babenko, Steve Branson, Serge Belongie University of California, San Diego ICCV 2009, Kyoto, Japan. Similarity Metrics for Recognition. Recognizing multiple categories - PowerPoint PPT Presentation

Transcript of Similarity Metrics for Categorization: From Monolithic to Category Specific

Page 1: Similarity Metrics for Categorization: From Monolithic to Category Specific

Boris Babenko, Steve Branson, Serge BelongieUniversity of California, San Diego

ICCV 2009, Kyoto, Japan

Page 2: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Recognizing multiple categories– Need meaningful similarity metric / feature space

Page 3: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Recognizing multiple categories– Need meaningful similarity metric / feature space

• Idea: use training data to learn metric– Goes by many names:

• metric learning• cue combination/weighting• kernel combination/learning• feature selection

Page 4: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Learn a single global similarity metric

Labeled Dataset

Mon

olith

ic

Query Image Similarity Metric

Cate

gory

4Ca

tego

ry 3

Cate

gory

2Ca

tego

ry 1

[ Jones et al. ‘03,Chopra et al. ‘05,Goldberger et al. ‘05,Shakhnarovich et al. ‘05Torralba et al. ‘08]

Page 5: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Learn similarity metric for each category (1-vs-all)

Labeled Dataset

Mon

olith

icCa

tego

rySp

ecifi

c

Query Image Similarity Metric

Cate

gory

4Ca

tego

ry 3

Cate

gory

2Ca

tego

ry 1

[ Varma et al. ‘07,Frome et al. ‘07,Weinberger et al. ‘08Nilsback et al. ’08]

Page 6: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Monolithic:– Less powerful… there is no “perfect” metric– Can generalize to new categories

• Per category:– More powerful– Do we really need thousands of metrics?– Have to train for new categories

Page 7: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Would like to explore space between two extremes

• Idea: – Group categories together – Learn a few similarity metrics, one for each group

Page 8: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Learn a few good similarity metrics

Labeled Dataset

Mon

olith

icM

uSL

Cate

gory

Spec

ific

Query Image Similarity Metric

Cate

gory

4Ca

tego

ry 3

Cate

gory

2Ca

tego

ry 1

Page 9: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Need some framework to work with…• Boosting has many advantages:

– Feature selection– Easy implementation– Performs well

Page 10: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Training data:

• Generate pairs:– Sample negative pairs

( , ), 0( , ), 1

Images

Category Labels

Page 11: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Train similarity metric/classifier:

Page 12: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Choose to be binary -- i.e.• = L1 distance over binary vectors

– efficient to compute (XOR and sum)

• For convenience:

[Shakhnarovich et al. ’05, Fergus et al. ‘08]

Page 13: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Given some objective function• Boosting = gradient ascent in function space• Gradient = example weights for boosting

chosen weak classifier

other weak classifiers

function space

current strong classifier

[Friedman ’01, Mason et al. ‘00]

Page 14: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Goal: train and recover mapping• At runtime

– To compute similarity of query image touse

Cate

gory

4Cat

egor

y 3

Cate

gory

2Ca

tego

ry 1

Page 15: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Run pre-processing to group categories (i.e. k-means), then train as usual

• Drawbacks:– Hacky / not elegant– Not optimal: pre-processing not informed by class

confusions, etc.

• How can we train & group simultaneously?

Page 16: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Definitions:

Sigmoid Function Parameter

Page 17: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Definitions:

Page 18: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Definitions:

How well works with category

Page 19: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Objective function:

• Each category “assigned” to classifier

Page 20: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Replace max with differentiable approx.

where is a scalar parameter

Page 21: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Each training pair has weights

Page 22: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Intuition:

Approximation of Difficulty of pair(like regular boosting)

Page 23: Similarity Metrics for Categorization: From Monolithic to Category Specific

10 20 30

0.5

1

1.5

2

x 10-4

10 20 30

1

2

3

4

5

6x 10

-5

w1i

w2i

w3i

(boosting iteration) (boosting iteration)

Difficult PairAssigned to

Easy PairAssigned to

Page 24: Similarity Metrics for Categorization: From Monolithic to Category Specific
Page 25: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Created dataset with many heterogeneous categories

5 10 15 200.65

0.7

0.75

0.8

K (number of classifiers)

Acc

ura

cy

MuSL+retrainMuSLk-meansRandMonolithicPer Cat

Merged categories from:• Caltech 101 [Griffin et al.]• Oxford Flowers [Nilsback et al.]• UIUC Textures [Lazebnik et al.]

Page 26: Similarity Metrics for Categorization: From Monolithic to Category Specific

MuS

L

k-m

eans

Page 27: Similarity Metrics for Categorization: From Monolithic to Category Specific

Training more metrics overfits!

Page 28: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Studied categorization performance vs number of learned metrics

• Presented boosting algorithm to simultaneously group categories and train metrics

• Observed overfitting behavior for novel categories

Page 29: Similarity Metrics for Categorization: From Monolithic to Category Specific

• Supported by– NSF CAREER Grant #0448615 – NSF IGERT Grant DGE-0333451– ONR MURI Grant #N00014-08-1-0638– UCSD FWGrid Project (NSF Infrastructure Grant

no. EIA-0303622)