Raskar Paris Nov08

64
Raskar, Camera Culture, MIT Media Lab Camera Culture Ramesh Raskar Camera Culture Associate Professor, MIT Media Lab Computational Photography: Epsilon to Coded Imaging

Transcript of Raskar Paris Nov08

Page 1: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

Camera Culture

Ramesh Raskar

Camera CultureAssociate Professor, MIT Media Lab

Computational Photography:

Epsilon to Coded Imaging

Page 2: Raskar Paris Nov08
Page 3: Raskar Paris Nov08

Where are the ‘camera’s?Where are the ‘camera’s?

Page 4: Raskar Paris Nov08

Where are the ‘camera’s?Where are the ‘camera’s?

Page 5: Raskar Paris Nov08

We focus on creating tools to better capture and share visual information

The goal is to create an entirely new class of imaging platforms

that have an understanding of the world that far exceeds human ability

and produce meaningful abstractions that are well within human comprehensibility

Ramesh Raskar

Page 6: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

Cameras of Tomorrow

Page 7: Raskar Paris Nov08

Motion Blurred Photo

Page 8: Raskar Paris Nov08
Page 9: Raskar Paris Nov08

Flutter Shutter CameraFlutter Shutter CameraRaskar, Agrawal, Tumblin [Siggraph2006]

LCD opacity switched in coded sequence

Page 10: Raskar Paris Nov08

Short Exposure

Traditional MURA Coded

Deblurred Result

Captured SinglePhoto

Shutter

Banding Artifacts and some spatial frequencies

are lostDark

and noisy

Page 11: Raskar Paris Nov08

Blurring == Convolution

Traditional Camera: Shutter is OPEN: Box Filter

PSF == Sinc Function

ω

Sharp Photo

Blurred Photo

Fourier Transform

Page 12: Raskar Paris Nov08

Flutter Shutter: Shutter is OPEN and CLOSED

Preserves High Spatial Frequencies

Sharp Photo

Blurred PhotoPSF == Broadband Function

Fourier Transform

Page 13: Raskar Paris Nov08

Traditional

Coded Exposu

re

Image of Static Object

Deblurred Image

Deblurred Image

Page 14: Raskar Paris Nov08
Page 15: Raskar Paris Nov08

Coded Exposure

Temporal 1-D broadband code: Motion Deblurring

Coded Aperture

Spatial 2-D broadband mask: Focus Deblurring

Page 16: Raskar Paris Nov08

Coded Aperture CameraCoded Aperture Camera

The aperture of a 100 mm lens is modified

Rest of the camera is unmodifiedInsert a coded mask with chosen binary pattern

Page 17: Raskar Paris Nov08

In Focus Photo

LED

Page 18: Raskar Paris Nov08

Out of Focus Photo: Open Aperture

Page 19: Raskar Paris Nov08

Out of Focus Photo: Coded Aperture

Page 20: Raskar Paris Nov08

Captured Blurred Photo

Page 21: Raskar Paris Nov08

Refocused on Person

Page 22: Raskar Paris Nov08

Less is MoreLess is MoreBlocking Light == More InformationBlocking Light == More Information

Coding in Time Coding in Time Coding in SpaceCoding in Space

Page 23: Raskar Paris Nov08

Larval Trematode WormLarval Trematode Worm Coded Aperture CameraCoded Aperture Camera

Page 24: Raskar Paris Nov08

Shielding Light …Shielding Light …

Larval Trematode WormLarval Trematode Worm Turbellarian WormTurbellarian Worm

Page 25: Raskar Paris Nov08

Coded Computational Photography• Coded Exposure

– Motion Deblurring [2006]

• Coded Aperture– Focus Deblurring [2007]– Glare reduction [2008]

• Optical Heterodyning– Light Field Capture [2007]

• Coded Illumination– Motion Capture [2007]– Multi-flash: Shape Contours [2004]

• Coded Spectrum– Agile Wavelength Profile [2008]

• Epsilon->Coded->Essence Photographyhttp://raskar.info

Page 26: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

Computational Photography

1. Epsilon Photography– Low-level Vision: Pixels– Multiphotos by bracketing (HDR, panorama)– ‘Ultimate camera’

2. Coded Photography– Mid-Level Cues:

• Regions, Edges, Motion, Direct/global– Single/few snapshot

• Reversible encoding of data– Additional sensors/optics/illum

3. Essence Photography– Not mimic human eye– Beyond single view/illum– ‘New artform’

Page 27: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

Epsilon Photography

• Dynamic range– Exposure braketing [Mann-Picard, Debevec]

• Wider FoV – Stitching a panorama

• Depth of field – Fusion of photos with limited DoF [Agrawala04]

• Noise– Flash/no-flash image pairs [Petschnigg04, Eisemann04]

• Frame rate– Triggering multiple cameras [Wilburn05, Shechtman02]

Page 28: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

Computational Photography

1. Epsilon Photography– Low-level Vision: Pixels– Multiphotos by bracketing (HDR, panorama)– ‘Ultimate camera’

2. Coded Photography– Mid-Level Cues:

• Regions, Edges, Motion, Direct/global– Single/few snapshot

• Reversible encoding of data– Additional sensors/optics/illum

3. Essence Photography– Not mimic human eye– Beyond single view/illum– ‘New artform’

Page 29: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

• 3D– Stereo of multiple cameras

• Higher dimensional LF– Light Field Capture

• lenslet array [Adelson92, Ng05], ‘3D lens’ [Georgiev05], heterodyne masks [Veeraraghavan07]

• Boundaries and Regions– Multi-flash camera with shadows [Raskar08]

– Fg/bg matting [Chuang01,Sun06]

• Deblurring– Engineered PSF– Motion: Flutter shutter[Raskar06], Camera Motion [Levin08]

– Defocus: Coded aperture, Wavefront coding [Cathey95]

• Global vs direct illumination– High frequency illumination [Nayar06]

– Glare decomposition [Talvala07, Raskar08]

• Coded Sensor– Gradient camera [Tumblin05]

Page 30: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

Computational Photography

1. Epsilon Photography– Low-level Vision: Pixels– Multiphotos by bracketing (HDR, panorama)– ‘Ultimate camera’

2. Coded Photography– Mid-Level Cues:

• Regions, Edges, Motion, Direct/global– Single/few snapshot

• Reversible encoding of data– Additional sensors/optics/illum

3. Essence Photography– Not mimic human eye– Beyond single view/illum– ‘New artform’

Page 31: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

• Ramesh Raskar and Jack Tumblin

• Book Publishers: A K Peters

Page 32: Raskar Paris Nov08

Mask?

Sensor

MaskSensorMask

?Sensor

MaskSensor

Mask?

Sensor

4D Light Field from 2D Photo:

Heterodyne Light Field Camera

Full Resolution Digital Refocusing:

Coded Aperture Camera

Page 33: Raskar Paris Nov08

Lenslet-based Light Field cameraLenslet-based Light Field camera

[Adelson and Wang, 1992, Ng et al. 2005 ]

Light Field Inside a CameraLight Field Inside a Camera

Page 34: Raskar Paris Nov08

Stanford Plenoptic Camera Stanford Plenoptic Camera [Ng et al 2005][Ng et al 2005]

4000 × 4000 pixels ÷ 292 × 292 lenses = 14 × 14 pixels per lens

Contax medium format camera Kodak 16-megapixel sensor

Adaptive Optics microlens array 125μ square-sided microlenses

Page 35: Raskar Paris Nov08

Digital RefocusingDigital Refocusing

[Ng et al 2005][Ng et al 2005]

Can we achieve this with a Can we achieve this with a MaskMask alone? alone?

Page 36: Raskar Paris Nov08

Mask based Light Field CameraMask Sensor

[Veeraraghavan, Raskar, Agrawal, Tumblin, Mohan, Siggraph 2007 ]

Page 37: Raskar Paris Nov08

How to Capture 4D Light Field with 2D

Sensor ?

What should be the pattern of the mask ?

Page 38: Raskar Paris Nov08

Optical HeterodyningOptical Heterodyning

Photographic Signal

(Light Field)

Carrier Incident Modulat

ed Signal

ReferenceCarrier

Main LensObject Mask Sensor

RecoveredLight Field

Software Demodulation

Baseband Audio Signal

Receiver: Demodulation

High Freq Carrier 100 MHz

ReferenceCarrier

Incoming Signal

99 MHz

Page 39: Raskar Paris Nov08

1/f0

Mask Tile

Cosine Mask Used

Page 40: Raskar Paris Nov08

Captured 2D Photo

Encoding due to Mask

Page 41: Raskar Paris Nov08

2D FFT

Traditional Camera Photo

Heterodyne Camera Photo

Magnitude of 2D FFT

2D FFT

Magnitude of 2D FFT

Page 42: Raskar Paris Nov08

Computing 4D Light Field2D Sensor Photo, 1800*1800 2D Fourier Transform, 1800*1800

2D FFT

Rearrange 2D tiles into 4D planes200*200*9*94D IFFT

4D Light Field

9*9=81 spectral copies

200*200*9*9

Page 43: Raskar Paris Nov08

Extra sensor bandwidth cannot capture extra dimension of the light field

fx

fθ0

fx0

Sensor Slice

Extra sensor bandwidth

Fourier Light Field Space

Page 44: Raskar Paris Nov08

Solution: Modulation Theorem

Make spectral copies of 2D light field

fx

fθ0

fx0

Modulation Function

Page 45: Raskar Paris Nov08

Modulated Light Field

fx

fθ0

fx0

Modulation Function

Sensor Slice captures entire Light Field

Page 46: Raskar Paris Nov08

Demodulation to recover Light Field

fx

Reshape 1D Fourier Transform into 2D

1D Fourier Transform of Sensor Signal

Page 47: Raskar Paris Nov08

Full resolution 2D image of Focused Scene Parts

Captured 2D Photo

Image of White Lambertian Plane

divide

Page 48: Raskar Paris Nov08

MIT Media Lab

Single shot visual hull: Shield Field

Lanman, Raskar, Agrawal, Taubin, Siggraph Asia 2008

Page 49: Raskar Paris Nov08

MIT Media Lab

Single shot 3D reconstruction: Simultaneous Projections using Masks

Page 50: Raskar Paris Nov08

• Barcode size : 3mm x 3mm• Distance from camera : 5 meter

Long Distance Bar-codesLong Distance Bar-codes

Woo, Mohan, Raskar, [2008]

Page 51: Raskar Paris Nov08

Mitsubishi Electric Research Laboratories Raskar 2006Spatial Augmented Reality

Computational Camera and PhotographyComputational Camera and Photography

Page 52: Raskar Paris Nov08

Towards a 6D DisplayPassive Reflectance Field Display

Martin Fuchs, Ramesh Raskar,Hans-Peter Seidel, Hendrik P. A. Lensch

Siggraph 2008

1 2

11

1 MPI Informatik, Germany 2 MIT

2D 2D 2D

Page 53: Raskar Paris Nov08

View dependent 4D Display

Page 54: Raskar Paris Nov08
Page 55: Raskar Paris Nov08
Page 56: Raskar Paris Nov08

6D = light sensitive 4D display

Page 57: Raskar Paris Nov08

One Pixel of a 6D Display = 4D Display

Page 58: Raskar Paris Nov08

Camera Culture group

• New group at Media Lab– New graduate student RA, post-doc, MEng slot open– Graduate school deadline Dec 15th

– New faculty positions

• Keywords– Computational imaging, Signal processing, Applied optics, Computer graphics and vision,

Hardware electronics, Displays, Art, Online photo collections, Visual social computing

• Acknowledgements– My former group at MERL, Amit Agrawal, Jack Tumblin, Shree Nayar– ETVC: Frank, Evelyne– Apologies for not covering related work

http://CameraCulture.info

Page 59: Raskar Paris Nov08

Fernald, Science [Sept 2006]

Shadow Refractive

Reflective

Tools for

Visual Computin

g

Page 60: Raskar Paris Nov08
Page 61: Raskar Paris Nov08

Blind CameraBlind Camera

Sascha Pohflepp, U of the Art, Berlin, 2006

Page 62: Raskar Paris Nov08

Raskar, Camera Culture, MIT Media Lab

Cameras of Tomorrow

Page 63: Raskar Paris Nov08

Cameras of Tomorrow• Coded Exposure

– Motion Deblurring [2006]

• Coded Aperture– Focus Deblurring [2007]– Glare reduction [2008]

• Optical Heterodyning– Light Field Capture [2007]

• Coded Illumination– Motion Capture [2007]– Multi-flash: Shape Contours [2004]

• Coded Spectrum– Agile Wavelength Profile [2008]

• Epsilon->Coded->Essence Photographyhttp://raskar.info

Page 64: Raskar Paris Nov08

We focus on creating tools to better capture and share visual information

The goal is to create an entirely new class of imaging platforms

that have an understanding of the world that far exceeds human ability

and produce meaningful abstractions that are well within human comprehensibility

Ramesh Raskar http://raskar.info