Challenges to image parsing researchers
description
Transcript of Challenges to image parsing researchers
![Page 1: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/1.jpg)
Challenges to image parsing researchers
Lana LazebnikUNC Chapel Hill
sky
sidewalk
building
road
carpersoncar
mountain
![Page 2: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/2.jpg)
The past: “closed universe” datasetsTens of classes, hundreds of images, offline learning
He et al. (2004), Hoiem et al. (2005), Shotton et al. (2006, 2008, 2009), Verbeek and Triggs (2007), Rabinovich et al. (2007), Galleguillos et al. (2008), Gould et al. (2009), etc.
Figure from Shotton et al. (2009)
![Page 3: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/3.jpg)
Evolving images, annotations
http://labelme.csail.mit.edu/
The future: “open universe” datasets
![Page 4: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/4.jpg)
build
ing floor seawater
sand
perso
nsky
scrap
er sign
mirror
pillow
founta
inflo
wersho
pcou
nter to
ppa
per
furnit
urecrane pot
arcad
ebri
dge
windshi
eld brick
clock
drawer fan
dishw
asher
vase
closet
hand
lebo
ttleou
tlet
bag
tail li
ght
lights
witch
02000000400000060000008000000
1000000012000000
Non-uniform class frequencies
The future: “open universe” datasets
![Page 5: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/5.jpg)
Combination of local cues? Multiple segmentations/grouping
hypotheses? Context? Graphical models (MRFs, CRFs, etc.)? Offline learning and inference?
Which “closed universe” techniques can survive in the “open universe” setting?
![Page 6: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/6.jpg)
Learning from all of LabelMe50K images, 232 labels
sky
tree
road
car
sky
sea
sun
building
window
door
road
sky
building
mountain
skybuilding
sidewalk
carroad
car
ceiling
wall
floor
Tighe & Lazebnik, work in progress
![Page 7: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/7.jpg)
0%25%50%75%
100%SiftFlow Barcelona LM + Sun
0%25%50%75%
100%
Learning from all of LabelMe50K images, 232 labels
Per-class classification rates
Tighe & Lazebnik, work in progress
![Page 8: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/8.jpg)
Challenge: Parsing high-res images
![Page 9: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/9.jpg)
Challenge: Dynamic image interpretation
Image parsing algorithms should become autonomous decision-making agents
Visual “detective task”: Where was this photo taken?
![Page 10: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/10.jpg)
Challenge: Dynamic image interpretation
Image parsing algorithms should become autonomous decision-making agents
Input
![Page 11: Challenges to image parsing researchers](https://reader035.fdocuments.in/reader035/viewer/2022062817/5681694b550346895de0e4dc/html5/thumbnails/11.jpg)
Summary Challenges to image parsing
researchers: Learn to parse images from “open
universe” evolving datasets Try parsing gigapixel images! Develop active, sequential image
interpretation strategies