CS485/685 Computer Vision
description
Transcript of CS485/685 Computer Vision
![Page 1: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/1.jpg)
CS485/685 Computer Vision
Dr. George Bebis
Spring 2012
![Page 2: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/2.jpg)
General Information
• Office: 235 SEM
• Phone: 784-6463
• E-mail: [email protected]
• Office Hours: TR 4:00pm - 5:30pm or by appointment
• Course Web Page: http://www.cse.unr.edu/˜bebis/CS485
![Page 3: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/3.jpg)
Text
• We will be covering material from several different textbooks and research papers.
[Szeliski11] Computer Vision: Algorithms and Applications, by R. Szeliski, Springer-Verlag, 2011 (freely available from http://szeliski.org/Book/)
[Jain95] Machine Vision, by R. Jain et. al, McGraw Hill, 1995.
[Trucco98] Introductory Techniques for 3-D Computer Vision, by E. Trucco, and A. Verri, Prentice Hall, 1998.
[Nawla93] A Guided Tour of Computer Vision, by V. Nawla, Addison-Wesley, 1993.
![Page 4: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/4.jpg)
Course requirements
• Prerequisites—these are essential!– A good working knowledge of C/C++ programming
– Data structures
– Calculus, Linear algebra and Probabilities/Statistics (recommended)
• Course does not assume prior imaging experience
![Page 5: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/5.jpg)
Course Outline (tentative)
• Introduction to CV– Relation to other fields
– Main challenges
– Applications
• Image Formation and Representation– Pinhole camera
– Cameras & lenses
– Human eye
– Digitization
![Page 6: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/6.jpg)
Course Outline (tentative)
• Image Filtering (spatial domain)
– Mask-based (e.g., correlation, convolution)
– Smoothing (e.g., Gaussian), Sharpening (e.g., gradient)
![Page 7: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/7.jpg)
Course Outline (tentative)
• Edge Detection (e.g., Canny, Laplacian of Gaussian)
![Page 8: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/8.jpg)
Course Outline (tentative)
• Interest Point Detection (e.g., Moravec, Harris)
![Page 9: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/9.jpg)
Course Outline (tentative)
• Segmentation– Edge-based (e.g., voting,
optimization, perceptual grouping)
Examples: Hough Transform, Snakes, Tensor Voting
– Pixel-based (e.g., clustering)
Examples: Histogram-based, Graph- Cuts, Mean-Shift)
![Page 10: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/10.jpg)
Course Outline (tentative)
• Feature Extraction – Geometric (e.g., lines, circles,
ellipses etc.)– Blobs
• Description and Matching – SIFT, SURF, HOG, WLD, LBP
Feature extraction
![Page 11: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/11.jpg)
Course Outline (tentative)• Recognition
– Geometry-based (e.g., alignment, geometric hashing)
– Appearance-based (e.g., subspace, bag-of-features)
![Page 12: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/12.jpg)
Course Outline (tentative)
• Recognition (cont’d)– Object recognition (single / category)
– Face recognition
Single instance recognition
Category recognition
Face detection and recognition
![Page 13: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/13.jpg)
Course Outline (tentative)
• Camera Calibration– Camera parameters
– 3D to 2D transformation
• Stereo Vision– 3D reconstruction from pairs of
2D images.
![Page 14: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/14.jpg)
Grading
• Two exams (midterm, final)
• Programming assignments
• Paper presentation (grad students only)
• Homework will be assigned but not graded
• Midterm: ~ 25%
• Final: ~ 25%
• Programming assignments: ~ 50%
• Paper presentation: ~ 10%
![Page 15: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/15.jpg)
Software
• You will not use any software package for most assignments.
• There might 1-2 programming assignments where you would need to use OpenCV.
http://opencv.willowgarage.com/wiki/
[OpenCV08] Learning OpenCV: Computer Vision with the OpenCV Library, by G. Bradski and A. Kaehler, O’Reilly Press, 2008.
![Page 16: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/16.jpg)
Course Policies
• Lecture slides, assignments, and other useful information will be posted on web.
• If you miss a class, you are responsible for all material covered or assigned in class. .
• Discussion of the programming assignments is allowed and encouraged. However, each student should do his/her own work.
Assignments which are too similar will receive a zero.
![Page 17: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/17.jpg)
Course Policies (cont’d)
• No late programming assignments will be accepted unless there is an extreme emergency.
• A missed quiz/exam may be made up only if it was missed due to an extreme emergency.
• No incomplete grades (INC) will be given in this course
![Page 18: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/18.jpg)
Extra Credit
• Class participation is highly encouraged and will be rewarded with extra credit.
• Additional extra credit will be offered to the students who attend the departmental colloquia.
• You will be reminded in class about upcoming talks but you should also check the colloquia page on a regular basis
http://www.cse.unr.edu/get-involved/colloquia/
![Page 19: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/19.jpg)
Academic Dishonesty
• Your continued enrollment in this course implies that you have read the section on Academic Dishonesty found in the UNR Student Handbook and that you subscribe to the principles stated therein.
http://www.unr.edu/stsv/acdispol.html
Remember: I can Google too (and I have the copies of everybody’s assignments from the last four years this class was offered)
![Page 20: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/20.jpg)
Disability Statement
• Any student with a disability needing academic accommodations is requested to speak with me or contact the Disability Resource Center (Thompson Building, Suite 101), as soon as possible to arrange for appropriate accommodations.
![Page 21: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/21.jpg)
Unauthorized class audio recording or video-taping
• Surreptitious or covert video-taping of class or unauthorized audio recording of class is prohibited by law and by Board of Regents policy.
• This class may be videotaped or audio recorded only with the written permission of the instructor.
• In order to accommodate students with disabilities, some students may have been given permission to record class lectures and discussions.
• Therefore, students should understand that their comments during class may be recorded.
![Page 22: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/22.jpg)
Important Dates
• March 15, 2012 – Midterm exam
• March 23, 2012 – Final Day to Drop Classes
• March 17-25, 2012 – Spring Break (no classes)
• May 9, 2012 – Prep Day
• May 10, 2012 - Final exam (8:00am – 10:00am)
![Page 23: CS485/685 Computer Vision](https://reader035.fdocuments.in/reader035/viewer/2022081506/56813ac1550346895da2d067/html5/thumbnails/23.jpg)
Questions?