Seminar on Media Technology Computer Vision Albert Alemany Font.
![Page 1: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/1.jpg)
Seminar on Media Technology
Computer Vision
Albert Alemany Font
![Page 2: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/2.jpg)
Outline
Introduction
• What is computer vision and why this topic
History of computer vision and related disciplines
Applications
• Face/smile detection, OCR, object recognition, medical imaging, ...
Conclusions
References
![Page 3: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/3.jpg)
What is computer vision?
Traffic scene
• Number of vehicles
• Type of vehicles
• Location of closest obstacle
• Assessment of congestion
• Location of the captured scene
• ...
Given one or more images, extract properties of the 3D world
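As one illustration of extracting a 3D property from more than one image, the sketch below recovers depth from a stereo pair. It assumes two identical, rectified cameras mounted side by side, for which depth is focal length times baseline divided by disparity. The function name and all numbers are hypothetical and are not taken from the slides.

```python
def depth_from_disparity(focal_px: float, baseline_m: float, disparity_px: float) -> float:
    """Depth in metres of a point seen by two rectified, parallel cameras.

    focal_px     : focal length expressed in pixels (assumed known from calibration)
    baseline_m   : distance between the two camera centres, in metres
    disparity_px : horizontal shift of the point between the two images, in pixels
    """
    if disparity_px <= 0:
        # Zero disparity corresponds to a point at infinity.
        raise ValueError("disparity must be positive")
    return focal_px * baseline_m / disparity_px


# Hypothetical traffic-scene example: a vehicle shifts 20 px between the views.
depth = depth_from_disparity(focal_px=800.0, baseline_m=0.5, disparity_px=20.0)
print(depth)  # 20.0
```

Nearby obstacles produce a large disparity and distant ones a small disparity, which is why an estimate such as "location of closest obstacle" becomes less precise with distance.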
![Page 4: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/4.jpg)
Related disciplines
![Page 5: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/5.jpg)
History of computer vision
1950s – Two-dimensional imaging for statistical pattern recognition developed
1960s – Roberts begins studying 3D machine vision
1970s – MIT's Artificial Intelligence Lab opens a "Computer Vision" course
1980s – New theories and concepts emerging. Shift toward geometry and increased mathematical rigor
1990s – Face recognition. Statistical analysis in vogue
2000s – Broader recognition. Large annotated datasets available. Video processing starts
![Page 6: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/6.jpg)
Finding people in images
"Yes" instances
![Page 7: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/7.jpg)
Finding people in images
"No" instances
![Page 8: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/8.jpg)
Face detection
The camera detects faces in a scene, then automatically focuses (AF) and optimizes exposure (AE) and, if needed, flash output
Face detection in digital cameras
![Page 9: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/9.jpg)
Smile detection
![Page 10: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/10.jpg)
Optical character recognition (OCR)
Technology that converts scanned documents to text
![Page 11: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/11.jpg)
Vision-based biometrics
http://www.cl.cam.ac.uk/~jgd1000/afghan.html
Photographer: Steve McCurry
How the Afghan girl was identified by her iris pattern:
1984 - Right eye processed image
2002 - Right eye processed image
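Iris matching of this kind is commonly described as comparing two binary "iris codes" by their normalized Hamming distance, that is, the fraction of bits that disagree. The sketch below illustrates only that comparison step. The 8-bit codes are toy values invented for the example (real codes are far longer), and the function and variable names are hypothetical.

```python
def iris_hamming_distance(code_a, code_b, mask_a=None, mask_b=None):
    """Fraction of disagreeing bits, counted only where both masks mark a bit valid.

    Masks flag bits hidden by eyelids, lashes or reflections (1 = usable).
    """
    if len(code_a) != len(code_b):
        raise ValueError("iris codes must have the same length")
    n = len(code_a)
    mask_a = [1] * n if mask_a is None else mask_a
    mask_b = [1] * n if mask_b is None else mask_b

    valid = 0
    disagree = 0
    for a, b, ma, mb in zip(code_a, code_b, mask_a, mask_b):
        if ma and mb:
            valid += 1
            if a != b:
                disagree += 1
    if valid == 0:
        raise ValueError("no bits are valid in both codes")
    return disagree / valid


# Toy codes standing in for the two processed right-eye images.
code_early = [1, 0, 1, 1, 0, 0, 1, 0]
code_late = [1, 0, 1, 0, 0, 0, 1, 0]
print(iris_hamming_distance(code_early, code_late))  # 0.125
```

Two unrelated irises are expected to disagree on roughly half of their bits, so a distance well below 0.5 is evidence that both images show the same eye.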
![Page 12: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/12.jpg)
Object recognition
Google Goggles
Query image
Webpage
Matching image
Lincoln, Microsoft Research
![Page 13: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/13.jpg)
Mimic human behaviour?
![Page 14: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/14.jpg)
Limits of human vision
![Page 15: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/15.jpg)
Limits of human vision
![Page 16: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/16.jpg)
Vision evolution
Google reCAPTCHA
![Page 17: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/17.jpg)
Making the invisible visible
Eulerian Video Magnification for Revealing Subtle Changes in the World, SIGGRAPH 2012
http://people.csail.mit.edu/mrub/vidmag/
Raw version
![Page 18: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/18.jpg)
Making the invisible visible
Eulerian Video Magnification for Revealing Subtle Changes in the World, SIGGRAPH 2012
http://people.csail.mit.edu/mrub/vidmag/
Magnified version
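The core idea behind the magnified version can be sketched for a single pixel: filter its intensity over time to isolate a band of temporal frequencies, multiply that band by a gain, and add it back to the original signal. The published method also decomposes each frame spatially before filtering, which is omitted here. The temporal band-pass used below (a difference of two first-order low-pass filters), the function name, and every parameter value are assumptions chosen for illustration.

```python
import math


def magnify_pixel(signal, alpha=10.0, r_fast=0.4, r_slow=0.05):
    """Amplify subtle temporal variation in one pixel's intensity series.

    alpha  : gain applied to the band-passed variation (assumed value)
    r_fast : smoothing rate of the fast low-pass filter (assumed value)
    r_slow : smoothing rate of the slow low-pass filter (assumed value)
    """
    if not signal:
        return []
    # Both filters start at the first sample so a constant input stays constant.
    fast = slow = signal[0]
    out = []
    for x in signal:
        fast = r_fast * x + (1.0 - r_fast) * fast
        slow = r_slow * x + (1.0 - r_slow) * slow
        band = fast - slow             # temporal band-pass of the signal
        out.append(x + alpha * band)   # add the amplified variation back
    return out


# Hypothetical pixel: brightness 100 with a faint oscillation of amplitude 0.5.
raw = [100.0 + 0.5 * math.sin(2 * math.pi * t / 8) for t in range(64)]
magnified = magnify_pixel(raw)
print(max(raw) - min(raw), max(magnified) - min(magnified))
```

Running the same filter over every pixel of a video, with the pass band tuned to the frequency of interest (for example a pulse rate), is what turns an invisible variation into a visible one.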
![Page 19: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/19.jpg)
Smart cars
www.mobileye.com
![Page 20: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/20.jpg)
Medical imaging
Image guided surgery
3D Imaging
![Page 21: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/21.jpg)
Special effects: shape capture
The Matrix movies, ESC Entertainment
![Page 22: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/22.jpg)
Special effects: shape capture
![Page 23: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/23.jpg)
Special effects: motion capture
Pirates of the Caribbean, Industrial Light and Magic
![Page 24: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/24.jpg)
Video-based interaction: gaming
Sony EyeToy
Microsoft Natal
![Page 25: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/25.jpg)
Image mosaic
• 3D from multiple images
• 3D from one image
• "Big" image from other images/video
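One common way to build a "big" image from several smaller ones is to map each image's pixel coordinates into a shared mosaic frame with a 3x3 homography. The slides do not say which method they have in mind, so the sketch below is only an illustration of that mapping step. The matrix values and names are hypothetical, and estimating the homography from matched features is not shown.

```python
def apply_homography(H, x, y):
    """Map pixel (x, y) through a 3x3 homography H given as nested lists."""
    xh = H[0][0] * x + H[0][1] * y + H[0][2]
    yh = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    if w == 0:
        raise ValueError("point maps to infinity under this homography")
    # Divide by w to convert from homogeneous back to pixel coordinates.
    return xh / w, yh / w


# Hypothetical case: the second photo sits 100 px to the right of the first,
# so its homography into the mosaic frame is a pure translation.
H_second_to_mosaic = [
    [1.0, 0.0, 100.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
print(apply_homography(H_second_to_mosaic, 10.0, 20.0))  # (110.0, 20.0)
```

With a general homography the bottom row is no longer (0, 0, 1), and the division by w accounts for the perspective change between views.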
![Page 26: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/26.jpg)
Image mosaic
![Page 27: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/27.jpg)
Supermarket scanner
![Page 28: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/28.jpg)
Conclusions
![Page 29: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/29.jpg)
References
Richard Szeliski (2010). Computer Vision: Algorithms and Applications. Springer-Verlag.
Gérard Medioni and Sing Bing Kang (2004). Emerging Topics in Computer Vision. Prentice Hall.
Pedram Azad, Tilo Gockel, Rüdiger Dillmann (2008). Computer Vision – Principles and Practice. Elektor International Media BV.
http://people.csail.mit.edu/mrub/vidmag/
http://www.cvpapers.com/
![Page 30: Seminar on Media Technology Computer Vision Albert Alemany Font.](https://reader035.fdocuments.in/reader035/viewer/2022062423/56649e8b5503460f94b908c5/html5/thumbnails/30.jpg)
Thank you for your attention