Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories...

Li Fei-Fei, UIUC

Rob Fergus, MIT

Antonio Torralba, MIT

Recognizing and Learning Recognizing and Learning Object CategoriesObject Categories

ICCV 2005 Beijing, Short Course, Oct 15ICCV 2005 Beijing, Short Course, Oct 15

AgendaAgenda

• Introduction

• Bag of words models

• Part-based models

• Discriminative methods

• Segmentation and recognition

• Conclusions

perceptibleperceptible visionvision materialmaterialthingthing

Plato said…• Ordinary objects are classified together if they

`participate' in the same abstract Form, such as the Form of a Human or the Form of Quartz.

• Forms are proper subjects of philosophical investigation, for they have the highest degree of reality.

• Ordinary objects, such as humans, trees, and stones, have a lower degree of reality than the Forms.

• Fictions, shadows, and the like have a still lower degree of reality than ordinary objects and so are not proper subjects of philosophical enquiry.

Bruegel, 1564

How many object categories are there?

Biederman 1987

So what does object recognition involve?

Verification: is that a bus?

Detection: are there cars?

Identification: is that a picture of Mao?

Object categorization

building

wallbanner

street lamp

Scene and context categorization• outdoor

• city

• traffic

• …

Challenges 1: view point variation

Michelangelo 1475-1564

Challenges 2: illumination

slide credit: S. Ullman

Challenges 3: occlusion

Magritte, 1957

Challenges 4: scale

Challenges 5: deformation

Xu, Beihong 1943

Challenges 6: background clutter

Klimt, 1913

History: single object recognition

• Lowe, et al. 1999, 2003

• Mahamud and Herbert, 2000• Ferrari, Tuytelaars, and Van Gool, 2004• Rothganger, Lazebnik, and Ponce, 2004• Moreels and Perona, 2005• …

Challenges 7: intra-class variation

History: early object categorization

• Turk and Pentland, 1991• Belhumeur et al. 1997• Schneiderman et al. 2004• Viola and Jones, 2000

• Amit and Geman, 1999• LeCun et al. 1998• Belongie and Malik, 2002

• Schneiderman et al. 2004• Argawal and Roth, 2002• Poggio et al. 1993

OBJECTS

ANIMALS INANIMATEPLANTS

MAN-MADENATURALVERTEBRATE …..

MAMMALS BIRDS

GROUSEBOARTAPIR CAMERA

Scenes, Objects, and Parts

Features

Objects

E. Sudderth, A. Torralba, W. Freeman, A. Willsky. ICCV 2005.

Object categorization: Object categorization: the statistical viewpointthe statistical viewpoint

)|( imagezebrap

)( ezebra|imagnopvs.

• Bayes rule:

zebranop

zebrap

zebranoimagep

zebraimagep

imagezebranop

imagezebrap

posterior ratio likelihood ratio prior ratio

Object categorization: Object categorization: the statistical viewpointthe statistical viewpoint

zebranop

zebrap

zebranoimagep

zebraimagep

imagezebranop

imagezebrap

posterior ratio likelihood ratio prior ratio

• Discriminative methods model posterior

• Generative methods model likelihood and prior

Discriminative

• Direct modeling of

Non-zebra

Decisionboundary

imagezebranop

imagezebrap

• Model and

Generative)|( zebraimagep ) |( zebranoimagep

Low Middle

High MiddleLow

)|( zebranoimagep)|( zebraimagep

Three main issuesThree main issues

• Representation– How to represent an object category

• Learning– How to form the classifier, given training data

• Recognition– How the classifier is to be used on novel data

Representation

– Generative / discriminative / hybrid

Representation

– Appearance only or location and appearance

Representation

– Invariances• View point• Illumination• Occlusion• Scale• Deformation• Clutter• etc.

Representation

– invariances– Part-based or global

w/sub-window

Representation

– invariances– Parts or global w/sub-

window– Use set of features or

each pixel in image

– Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning

Learning

– Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning)

– Methods of training: generative vs. discriminative

Learning

– What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.)

– Level of supervision• Manual segmentation; bounding box; image

labels; noisy labels

Learning

Contains a motorbike

– Batch/incremental (on category and image level; user-feedback )

Learning

– Training images:• Issue of overfitting• Negative images for discriminative methods

Priors

Learning

– Training images:• Issue of overfitting• Negative images for discriminative methods

– Priors

Learning

– Scale / orientation range to search over – Speed

Recognition

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories...

Documents

Transcript of Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories...

Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Color and color constancy 6.869, MIT Bill Freeman Antonio Torralba Feb. 22, 2011.

Semi-Supervised Learning in Gigantic Image Collections Rob Fergus (NYU) Yair Weiss (Hebrew U.) Antonio Torralba (MIT) TexPoint fonts used in EMF. Read.

Torralba 27

Efficient Image Search and Retrieval using Compact Binary Codes Rob Fergus (NYU) Antonio Torralba (MIT) Yair Weiss (Hebrew U.)

Unsupervised Detection of Regions of Interest Using Iterative Link Analysis Gunhee Kim 1 Antonio Torralba 2 1: SCS, CMU 2: CSAIL, MIT Neural Information.

HOGgles: Visualizing Object Detection Features (to be appeared in ICCV 2013) Carl Vondrick Aditya Khosla Tomasz Malisiewicz and Antonio Torralba,MIT Presented.

Catálogo Torralba primavera verano 2012

Torralba 08

Yusuf Aytar, Carl Vondrick, Antonio Torralba Abstract ... · Yusuf Aytar, Carl Vondrick, Antonio Torralba Massachusetts Institute of Technology fyusuf,vondrick,torralbag@csail.mit.edu

Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Scene Understanding - People | MIT CSAILpeople.csail.mit.edu/torralba/courses/6.870/slides/... · 2008-10-09 · Scene Identification: Basic-Level Oliva, A., & Schyns, P.G. (2000).

Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Miguel Ángel Torralba

Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.

Selim Aksoy Department of Computer Engineering Bilkent ...saksoy/courses/cs484-Spring2010/...CS 484, Spring 2010 ©2010, Selim Aksoy 89 Adapted from Antonio Torralba, MIT Context CS

Author's personal copy - People | MIT CSAILpeople.csail.mit.edu/torralba/publications/reviewPapers/... · 2010. 10. 13. · Author's personal copy The role of context in object recognition

Large Image Databases and Small Codes for Object Recognition Rob Fergus (NYU) Antonio Torralba (MIT) Yair Weiss (Hebrew U.) William T. Freeman (MIT)

Discriminative and generative methods for bags of features Zebra Non-zebra Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

854 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND ...people.csail.mit.edu/torralba/publications/5papers/...Street, 32-D462, Cambridge, MA 02139. E-mail: torralba@csail.mit.edu.. K.P. Murphy