CS256 Intelligent Systems -Vision Systems Module Overview.
Date posted: 19-Dec-2015
![Page 1: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/1.jpg)
CS256 Intelligent Systems-Vision Systems
Module Overview
![Page 2: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/2.jpg)
Timetable
| Week | Mode | Topic |
|------|------|-------|
| 1 | 2L | Introduction to the module and vision systems |
| 2 | 2L | Case studies and basic concepts |
| 3 | 2L | Java and image fundamentals |
| 4 | LP | Feature Extraction and Image Transforms |
| 5 | LP | Edge Detection and Segmentation |
| 6 | LP | Colour and Texture |
| 7 | LP | Recover 3D information |
| 8 | LP | System Architecture |
| 9 | LP | Knowledge and Reasoning |
| 10 | 2L | Image Classification and Retrieval (including revision) |
![Page 3: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/3.jpg)
Coursework
• Develop a system that is able to identify key features in selected images.
• Write a report to describe the design, implementation and evaluation of the system. Please see details in separate document on coursework assignment.
• Questions will be asked during lab sessions
• Deadline: Monday 18th April, 2005
![Page 4: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/4.jpg)
Assessment
• Examination
  – 60%
  – three questions from four
• Coursework
  – 40%
  – Report based on experiments
![Page 5: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/5.jpg)
Recommended Texts
• Nick Efford (May 2000), Digital Image Processing: A Practical Introduction Using Java, Addison Wesley, ISBN 0201596237
• Tim Morris (2004), Computer Vision and Image Processing, Palgrave Macmillan, ISBN 0333994515
• Patrick H. Winston (1992), Artificial Intelligence (Third Edition), Addison Wesley, ISBN 0201533774
• Rob Callan (2003), Artificial Intelligence, Palgrave Macmillan, ISBN 0333801369
• Paul F. Whelan and Derek Molloy (2001), Machine Vision Algorithms in Java: Techniques and Implementation, Springer, ISBN 1852332182
![Page 6: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/6.jpg)
Objectives of the module
• Understand the fundamentals of machine intelligence
  – Focus on vision systems, but will relate to other domains
• Understand components in vision systems
  – Be familiar with common operations for processing images
  – Be able to implement simple image processing operations
• Evaluate a vision system
• Additionally: encourage students to practise more basic and advanced Java programming
![Page 7: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/7.jpg)
Intelligence and Perception
• First understand how we perceive the world, then teach the machine to interpret the world from the primitive data it receives
• Human perceptual modalities
  – Tactile: touch
  – Gustatory: taste
  – Visual: sight
  – Auditory: hearing
  – Olfactory: smell
![Page 8: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/8.jpg)
Intelligent Systems
• Intelligent robots and intelligent machines
  – Built with artificial intelligence principles
  – Reason about the world and take appropriate actions by manipulating knowledge
  – Sense the world directly
• Vision: computational perception
  – A diverse and interdisciplinary body of knowledge and techniques
  – Aims to understand the principles behind the processes that interpret perceptual signals provided by various sensors
![Page 9: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/9.jpg)
Intelligent Systems
• In vision, the software's job is to process the input from the hardware or sensors
• Humans have natural abilities to speak, see, think, smell, sense and so on. Machines have no such inborn abilities; they have only simple engines that follow logical algorithms.
• Giving a computer comparable abilities, such as speech and vision, is closely related to building a knowledge system, but it also requires combining a simulation of the perception process with knowledge
![Page 10: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/10.jpg)
Intelligent Systems
• Build a complete intelligent system by integrating different levels of processing, each bridging a different gap: sensors, raw data, low-level processing, high-level processing and knowledge
• Reflected in this module structure
![Page 11: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/11.jpg)
Figure 5-10: image B95-00016-01.3.S1.X5.4.jpg (above) and its annotation window generated in the I-Browse system
![Page 12: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/12.jpg)
Applications
• Classical
  – robot
  – medical imaging
  – remote sensing
  – astronomy
• Today
  – DTV
  – image interpretation
  – biometry
  – GIS (Earth/planetary observation, monitoring, exploration)
  – human genome project
  – creative media and art, entertainment
![Page 13: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/13.jpg)
Sample applications - Biometry
• Using personal characteristics to identify a person
  – fingerprints
  – face
  – iris
  – DNA
  – gait
  – etc.
![Page 14: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/14.jpg)
Iris Scan
• Striations on iris are individually unique
• Obvious applications
  – security
  – PIN
![Page 15: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/15.jpg)
Iris scan processing stages (figure):
• Locate the eye in the head image
• Radial resampling of iris
• Numerical description
• Analysis
Figure annotation: fixed number of samples
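The radial resampling stage can be sketched in Java as follows. This is only an illustration under stated assumptions: the eye has already been located, the pupil and iris are treated as concentric circles with a known centre and radii, and nearest-neighbour lookup is used. The class `IrisResampler`, its method and all parameter names are hypothetical and do not come from the slides.

```java
/** Sketch: unwrap the iris ring into a grid with a fixed number of samples. */
final class IrisResampler {

    /**
     * Samples the ring between pupilRadius and irisRadius around (cx, cy).
     * The image is indexed as image[y][x]; the result is indexed as
     * [radial sample][angular sample]. Lookup is nearest-neighbour, and
     * coordinates that fall outside the image are clamped to the border.
     */
    static int[][] resample(int[][] image, double cx, double cy,
                            double pupilRadius, double irisRadius,
                            int radialSamples, int angularSamples) {
        int height = image.length;
        int width = image[0].length;
        int[][] polar = new int[radialSamples][angularSamples];
        for (int i = 0; i < radialSamples; i++) {
            // Radius at the middle of the i-th band of the ring.
            double r = pupilRadius
                    + (irisRadius - pupilRadius) * (i + 0.5) / radialSamples;
            for (int j = 0; j < angularSamples; j++) {
                double theta = 2.0 * Math.PI * j / angularSamples;
                int x = (int) Math.round(cx + r * Math.cos(theta));
                int y = (int) Math.round(cy + r * Math.sin(theta));
                x = Math.max(0, Math.min(width - 1, x));
                y = Math.max(0, Math.min(height - 1, y));
                polar[i][j] = image[y][x];
            }
        }
        return polar;
    }

    public static void main(String[] args) {
        // Synthetic 64x64 image whose intensity is the distance from (32, 32).
        int size = 64;
        int[][] image = new int[size][size];
        for (int y = 0; y < size; y++) {
            for (int x = 0; x < size; x++) {
                image[y][x] = (int) Math.round(Math.hypot(x - 32, y - 32));
            }
        }
        int[][] polar = resample(image, 32, 32, 8, 24, 4, 16);
        System.out.println(polar.length + " x " + polar[0].length + " samples");
    }
}
```

Because the output grid always has the same dimensions regardless of the iris size in the photograph, later stages can compare two irises sample by sample.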
![Page 16: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/16.jpg)
![Page 17: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/17.jpg)
Image Representation
[Figure: pixel grid with axes x and y, extents m and n, and one marked pixel f(x,y)]
An array F: a digital image is an array of m x n pixels; the pixel in the xth column and the yth row has an intensity equal to f(x,y).
Colour pixel: (r(x,y), g(x,y), b(x,y))
![Page 18: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/18.jpg)
Colour image and video sequence
• Colour can be conveyed by combining different colours of light, using three components (red, green and blue): R = r(x,y); G = g(x,y); B = b(x,y), where R, G, B are defined in a similar way to F.
• The vector (r(x,y), g(x,y), b(x,y)) defines the intensity and colour at the point (x,y) in the colour image.
• A video sequence is, in effect, a time-sampled representation of the original moving scene.
• Each frame in the sequence is a standard colour or monochrome image and can be coded as such.
• A monochrome video sequence may be represented digitally as a sequence of 2-D arrays [F1, F2, F3..FN].
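The definitions above map directly onto Java arrays. The following is a minimal sketch, assuming 8-bit samples and row-major storage indexed as `[y][x]`; the class `ImageArrays` and its method names are hypothetical, chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.List;

/** Sketch: monochrome image F, colour vector (r, g, b) and video [F1..FN] as arrays. */
final class ImageArrays {

    /** Monochrome image: f[y][x] holds the intensity f(x, y). */
    static int[][] monochrome(int width, int height) {
        return new int[height][width];
    }

    /** Colour image: planes[0], planes[1], planes[2] are R, G and B, each shaped like F. */
    static int[][][] colour(int width, int height) {
        return new int[3][height][width];
    }

    /** Returns the vector (r(x,y), g(x,y), b(x,y)) at one pixel. */
    static int[] colourAt(int[][][] planes, int x, int y) {
        return new int[] { planes[0][y][x], planes[1][y][x], planes[2][y][x] };
    }

    /** Monochrome video: a time-ordered list of frames F1..FN of identical size. */
    static List<int[][]> video(int width, int height, int frameCount) {
        List<int[][]> frames = new ArrayList<>();
        for (int t = 0; t < frameCount; t++) {
            frames.add(monochrome(width, height));
        }
        return frames;
    }

    public static void main(String[] args) {
        int[][][] image = colour(4, 3);
        image[0][1][2] = 255; // r(2,1)
        image[1][1][2] = 128; // g(2,1)
        int[] rgb = colourAt(image, 2, 1);
        System.out.println(rgb[0] + "," + rgb[1] + "," + rgb[2]);
        System.out.println(video(4, 3, 5).size() + " frames");
    }
}
```

Storing R, G and B as three separate planes mirrors the slide's statement that each is defined like F; packing the three components into a single integer per pixel is an equally valid layout.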
![Page 19: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/19.jpg)
Java example for image representation:
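A minimal sketch of one such representation, assuming the standard `java.awt.image.BufferedImage` class as the container. The class name `BufferedImageDemo`, the helper names and the channel-mean grey conversion are illustrative assumptions, not the module's own example.

```java
import java.awt.image.BufferedImage;

/** Sketch: reading (r, g, b) and an intensity array from a BufferedImage. */
final class BufferedImageDemo {

    /** Unpacks the (r, g, b) vector at (x, y) from the packed 0xAARRGGBB value. */
    static int[] rgbAt(BufferedImage image, int x, int y) {
        int packed = image.getRGB(x, y);
        return new int[] { (packed >> 16) & 0xFF, (packed >> 8) & 0xFF, packed & 0xFF };
    }

    /** Builds the intensity array f[y][x] as the integer mean of the three channels. */
    static int[][] toIntensity(BufferedImage image) {
        int[][] f = new int[image.getHeight()][image.getWidth()];
        for (int y = 0; y < image.getHeight(); y++) {
            for (int x = 0; x < image.getWidth(); x++) {
                int[] rgb = rgbAt(image, x, y);
                f[y][x] = (rgb[0] + rgb[1] + rgb[2]) / 3;
            }
        }
        return f;
    }

    public static void main(String[] args) {
        // 4 columns by 3 rows, all pixels initially black.
        BufferedImage image = new BufferedImage(4, 3, BufferedImage.TYPE_INT_RGB);
        image.setRGB(2, 1, 0xFF8000); // set the pixel at column 2, row 1
        int[] rgb = rgbAt(image, 2, 1);
        System.out.println(rgb[0] + "," + rgb[1] + "," + rgb[2]);
        System.out.println(toIntensity(image)[1][2]);
    }
}
```

Note that `getRGB(x, y)` takes the column first and the row second, matching the f(x,y) convention, whereas the plain array is indexed row first.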
![Page 20: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/20.jpg)
![Page 21: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/21.jpg)
![Page 22: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/22.jpg)
![Page 23: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/23.jpg)
The Difficulty in Vision Computing – Taking the Human Visual System for Granted
• The processing capability of human visual systems is often taken for granted
• The subconscious functions of human vision are subtle and hard to describe exactly, which makes it difficult to develop algorithms that emulate human visual behaviour
• If we were computers…
![Page 24: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/24.jpg)
Difficulties in vision computing - The sensory gap
• The sensory gap is the gap between the object in the world and the information in a (computational) description derived from a recording of that scene.
• disambiguation processing
![Page 25: CS256 Intelligent Systems -Vision Systems Module Overview.](https://reader035.fdocuments.in/reader035/viewer/2022062407/56649d3f5503460f94a19151/html5/thumbnails/25.jpg)
Difficulties in vision computing - The semantic gap
• The semantic gap is the lack of coincidence between the information that one can extract from the visual data and the interpretation that the same data have for a user in a given situation. (Arnold, 2000)
• The higher the level of interpretation, the more domain knowledge, and management of that knowledge, is required.