EE290T : 3D Reconstruction and...

EE290T : 3D Reconstruction and

Recognition

Acknowledgement

Introduction

“There was a table set out under

a tree in front of the house,and the March Hare and the

Hatter were having tea at it.”

“The table was a large one, but

the three were all crowded

together at one corner of it …”

From “A M a d Tea-Party”Alice's Adventures in W onderla n

Lew is Ca rroll

Illustra tion by Arthur Ra ckha m

a tree in front of the house,

and the March Hare and the

From “A M a d Tea-Party”Alice's Adventures in W onderla n

Lew is Ca rroll

-semantic

Image/ video

Computer vision

Object 1 Object N

- semantic

-semantic

Image/ video

Object 1 Object N

- semantic

-geometry -geometry

Computer vision

-semantic

Image/ video

Object 1 Object N

- semantic

-geometry -geometry

spatial & temporal relations

Computer vision

-semantic

Image/ video

Object 1 Object N

- semantic

-geometry -geometry

spatial & temporal relations

-Sema ntic

- geometry

Computer vision

Sensing device

• Information extraction

• Interpretation

Computer vision

1. Information extraction: features, 3D structure, motion flows, etc…

2. Interpretation: recognize objects, scenes, actions, events

Computational device

Computer vision and Applications

1990 20102000

EosSystems

Fingerprint biometrics

Augmentation with 3D computer graphics

3D object prototyping

24PhotomodelerEosSystems

1990 20102000

EosSystems Autostich

Face detection

Web applications

28Photometria

Panoramic Photography

3D modeling of landmarks

1990 20102000

• Efficient SLAM/SFM• Large scale image repositories• Deep learning (e.g. ImageNet)

1990 20102000

Kooaba

Kinect

Google Goggles

Image search engines

Google Goggles34

Visual search and landmarks recognition

Augmented reality

• Magic leap• Daqri• Meta

• Etc…

Motion sensing and gesture recognition

Autonomous navigation and safety

Mobileye: Vision systems in high-end BMW, GM, Volvo models

But also, Toyota, Google, Apple, Tesla, Nissan, Ford, etc….

Source: A. Shashua, S. Seitz

Personal robotics

Vision for robotics, space exploration

Factory inspectionSurveillanceAssistive technologies

Security

411990 20102000

Kooaba

Kinect

Google Goggles

EosSystems

421990 20102000

Google Goggles

431990

EosSystems

20102000

Google Goggles

Current state of computer vision

3D Reconstruction• 3D shape recovery• 3D scene reconstruction• Camera localization• Pose estimation

2D Recognition• Object detection• Texture classification• Target tracking• Activity recognition

Levoy et al., 00Hartley & Zisserman, 00Dellaert et al., 00Rusinkiewic et al., 02Nistér, 04Brown & Lowe, 04Schindler et al, 04Lourakis & Argyros, 04Colombo et al. 05

Golparvar-Fard, et al. JAEI 10Pandey et al. IFAC , 2010Pandey et al. ICRA 2011Savarese et al. IJCV 05Savarese et al. IJCV 06Microsoft’s PhotoSynth Snavely et al., 06-08Schindler et al., 08Agarwal et al., 09 45Frahm et al., 10

Lucas & Kanade, 81Chen & Medioni, 92Debevec et al., 96Levoy & Hanrahan, 96Fitzgibbon & Zisserman, 98Triggs et al., 99Pollefeys et al., 99Kutulakos & Seitz, 99

Snavely

et al., 06

Levoy et al., 00Hartley & Zisserman, 00Dellaert et al., 00Rusinkiewic et al., 02Nistér, 04Brown & Lowe, 04Schindler et al, 04Lourakis & Argyros, 04Colombo et al. 05

Golparvar-Fard, et al. JAEI 10Pandey et al. IFAC , 2010Pandey et al. ICRA 2011Savarese et al. IJCV 05Savarese et al. IJCV 06Microsoft’s PhotoSynth Snavely et al., 06-08Schindler et al., 08Agarwal et al., 09Frahm et al., 10

Lucas & Kanade, 81Chen & Medioni, 92Debevec et al., 96Levoy & Hanrahan, 96Fitzgibbon & Zisserman, 98Triggs et al., 99Pollefeys et al., 99Kutulakos & Seitz, 99

Snavely

et al., 06

Turk & Pentland, 91Poggio et al., 93Belhumeur et al., 97LeCun et al. 98 Amit and Geman, 99 Shi& Malik, 00 Viola & Jones, 00Felzenszwalb & Huttenlocher 00Belongie & Malik, 02Ullman et al. 02

Argawal & Roth, 02Ramanan & Forsyth, 03Weber et al., 00Vidal-Naquet & Ullman 02Fergus et al., 03Torralba et al., 03Vogel & Schiele, 03Barnard et al., 03Fei-Fei et al., 04Kumar & Hebert ‘04

He et al. 06Gould et al. 08Maire et al. 08Felzenszwalb et al., 08Kohli et al. 09L.-J. Li et al. 09Ladicky et al. 10,11Gonfaus et al. 10Farhadi et al., 09Lampert et al., 09 47

car person car

Car Car

Street

PersonBike

Building

Turk & Pentland, 91Poggio et al., 93Belhumeur et al., 97LeCun et al. 98 Amit and Geman, 99 Shi& Malik, 00 Viola & Jones, 00Felzenszwalb & Huttenlocher 00Belongie & Malik, 02Ullman et al. 02

Argawal & Roth, 02Ramanan & Forsyth, 03Weber et al., 00Vidal-Naquet & Ullman 02Fergus et al., 03Torralba et al., 03Vogel & Schiele, 03Barnard et al., 03Fei-Fei et al., 04Kumar & Hebert ‘04

He et al. 06Gould et al. 08Maire et al. 08Felzenszwalb et al., 08Kohli et al. 09L.-J. Li et al. 09Ladicky et al. 10,11Gonfaus et al. 10Farhadi et al., 09Lampert et al., 09

Perceiving the World in 3D!

Course overview

1. Geometry

2. Semantics

Geometry:- How to extract 3d information?- Which cues are useful?- What are the mathematical tools?

Camera systemsEstablish a mapping from 3D to 2D

How to calibrate a cameraEstimate camera parameters such as pose or focallength

Single view metrologyEstimate 3D properties of the world from a single image

Multiple view geometryEstimate 3D properties of the world from multiple views

Toma si & Kana de (1993)

Mathematical tools

Epipola r geometry

Photoconsistency

Structure from motion

Courtesy of Oxford Visua l Geometry Group

Structure lighting and volumetric stereo

Scanning Michelangelo’s “The David”• The Digital Michelangelo Project

- http://graphics.stanford.edu/projects/mich/

• 2 BILLION polygons, accuracy to .29mm

Course overview

1. Geometry

2. Semantics

Semantics:- How to recognize objects?- How to classify images or understand a scene?- How to segment out critical semantics- How to estimate 3D properties (pose, size, shape…)

Object recognition and categorization

Classification:Is this an forest?

Classification:Does this image contain a building? [yes/no]

Detection:Does this image contain a car? [where?]

Building

personcar

Detection:Which objects do this image contain? [where?]

Detection:Accurate localization (segmentation)

Building 45 degree

Person, backCar, side view

Detection:Estimating 3D geometrical properties

Challenges: viewpoint variation

slide credit: Fei-Fei, Fergus & Torralba

Challenges: illumination

image credit: J. Koenderink

Challenges: scale

Challenges: deformation

Challenges:

occlusion

Magritte, 1957 slide credit: Fei-Fei, Fergus & Torralba

Challenges: background clutter

Kilmeny Niland. 1995

Challenges: object intra-class variation

A t. .•

. l , 11t'-:.

I • <\J I . ""'

Course overview

1. Geometry

2. Semantics

Joint recovery of geometry and semantics!

Visua l processing in the bra in

where pathway (dorsal stream)

what pathway (ventral stream)

The ventral stream (also known

as the "what pathway") is

involved with object and visual

identification and recognition.

The dorsal stream (or,

"where pathway") is involved

with processing the object's

spatial location relative to

the viewer and with speech

recognition

where pathway (dorsal stream)

what pathway (ventral stream)

Pre-frontal cortex

Visua l processing in the bra in

Joint reconstruction and recognition

…Input images

80 Car Person Tree Sky

Street Building Else

…Input images

81 Car Person Tree Sky

Street Building Else

Joint reconstruction and recognition

a tree in front of the house,and the March Hare and the

From “A M a d Tea-Party”

Alice's Adventures in W onderla ndby

Lew is Ca rroll

Project presentations

Syllabus

Lecture Topic

1 Introduction

2 Camera models

3 Camera calibration

4 Single view metrology

5 Epipolar geometry

6 Multi-view geometry

7 Structure from motion/ SLAM

9 Fitting and Matching

10 Detector and Descriptors

11 Intro to Recognition; Object classification I

12 Object classification II

13 Scene understanding & segmentation

14 Visual Representation Learning by NNs

15 3D Object recognition

16 3D Scene understanding

EE290T : 3D Reconstruction and...

Documents

Transcript of EE290T : 3D Reconstruction and...

3D Model · 3D Model ... 3D Model

BundleFusion: Real-time Globally Consistent 3D ...ee290t/fa18/readings/dai...BundleFusion: Real-time Globally Consistent 3D Reconstruction using On-the-fly Surface Re-integration 1:3

Video Coding Basics - EECS Instructional Support Group ...inst.eecs.berkeley.edu/~ee290t/sp04/lectures/video_coding.pdf · – DCT coding of error at the block level ... Video Coding

Multiple View Geometry in Computer Visioninst.eecs.berkeley.edu/~ee290t/fa09/lectures/course02...Projective 2D Geometry Jan. 14, 16 (no course) Projective 2D Geometry Jan. 21, 23 Projective

Cheek and inferior eyelid reconstruction after skin cancer ...lipteh.com/Study-Notes/Articles/clinics/cheek recon.pdf · Cheek and inferior eyelid reconstruction after skin cancer

INTRODUCTION - Abdulrahman Kaki 3D · FDM 3D Printer, Food 3D Printer, SLA 3D Printer, SLM 3D Printer, DLP 3D Printer, White Light 3D Scanner, Blue Light 3D Scanner, Laser 3D Scanner.

Overlay Network and Data Transmission Over Wireless For EE290T Minghua Chen EECS@UC, Berkeley.

Domain Discovery - ROOTCON 11/Trainings/RECON.pdf · A security testing Slackbot built with a Kubernetes backend on the Google Cloud Platform ★Kubebot: A security testing Slackbot

THUNDER Imager 3D Live Cell, 3D Cell Culture & 3D Assay Imager 3D Liv… · 3D Live Cell, 3D Cell Culture & 3D Assay Decode 3D Biology in Real Time* Cultured Cortical Neurons. Green:

Review of Networking Basics - University of …inst.eecs.berkeley.edu/~ee290t/sp04/lectures/network...Review of Networking Basics Yao Wang Polytechnic University, Brooklyn, NY11201

Error Control And Concealment For Video Communication: A …inst.eecs.berkeley.edu/~ee290t/sp04/lectures/ER_WZ98.pdf · 2004-02-17 · (a) (b) Fig. 1. Two reconstructed frames from

Scalable Internet video using MPEG-4inst.eecs.berkeley.edu/~ee290t/sp04/lectures/scalable_internet_no13.pdftransport layer-video decoder bu!er model with re-transmission. We also evaluate

Multimedia Networking - EECS Instructional Support …inst.eecs.berkeley.edu/~ee290t/sp04/lectures/network_multimedia... · Multimedia Networking Applications ... how does callee

One font vulnerability to rule them all - j00ru//vx tech blogj00ru.vexillium.org/slides/2015/recon.pdf · One font vulnerability to rule them all A story of cross-software ownage,

Fast, Automatic, Photo-Realistic, 3D Modeling of Building ...inst.eecs.berkeley.edu/~ee290t/fa11/lectures/icme_barcelona_2011.pdfIndoor Modeling Goals and objectives: – 3D, fast,

PORTABLE 3D SCANNERS: HANDYSCAN 3D - 3D Printing

Anarkik3D's '3D Consequences Project: 3D modelling & 3D printing

EE290T: Advanced Reconstruction Methods for Magnetic ...ee290t/sp14/lecture03.pdf · Parallel MRI Goal: Reduction of measurement time I Subsampling of k-space I Simultaneous acquisition

How I learned Reverse Engineering with Stormrecon.cx/2008/a/pierre-marc_bureau/storm-recon.pdf · Presentation Objectives ... Grab a copy of KadC • Patch network communication to

EE290T: Advanced Reconstruction Methods for Magnetic ...inst.eecs.berkeley.edu/~ee290t/sp14/lecture01.pdf · Tentative Syllabus I 01: Jan 27 Introduction I 02: Feb 03 Parallel Imaging