CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS
-
Upload
martin-blanchard -
Category
Documents
-
view
32 -
download
4
description
Transcript of CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS
![Page 1: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/1.jpg)
CT20A6100MACHINE VISION AND DIGITAL
IMAGE ANALYSIS
MPEG
Pauli Jutila
Cristina Petre
![Page 2: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/2.jpg)
Contents
1. Introduction to MPEG 2. MPEG Family3. Intra Frame Coding Techniques4. Non-Intra Frame Coding Techniques5.Summary
![Page 3: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/3.jpg)
Why do we need video compression?
Example video compression- 1920x1080 pixels 30 frames/sec 8 bits/color => 1.5Gb/sec
- 6MHz channel bandwidth => 19.2Mb/sec
=> compression 83:1
![Page 4: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/4.jpg)
The MPEG Family
![Page 5: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/5.jpg)
MPEG-1
● Basic form using motion compensated block-based transform coding techniques
● Optimized to work for 352x240 at 30frames/sec
or 352x288 at 25frames/sec ,possible to go up
to 4095x4095
● Defined for progressive frames only
● No direct provision for interlaced video applications
![Page 6: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/6.jpg)
MPEG-2
● Designed for digital television broadcasting
● Potentially high quality video (4-9Mb/sec)
● Profiles and levels :
profile - bitstream scalability, colorspace resolution
level - image resolution, maximum bit-rate/profile
● MP@ML : 720x480 at 30 frames/sec
● MP@HL : 1920x1080 at 30 frame/sec
![Page 7: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/7.jpg)
MPEG-4
● Interactive content
● Object-oriented composite files
● Image blocks are arbitrary shaped
● Multiplexes and synchronizes data
● Divided into a number of parts-compatibility issue
![Page 8: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/8.jpg)
Coding Techniques
![Page 9: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/9.jpg)
Intra Frame Coding
Video Filter DCT Quantizer Run-Lenght VLC
Bit-StreamBuffer
Bit-Rate Control
![Page 10: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/10.jpg)
Video Filter
• The human eye is less sensitive to chrominance than luminance.
• RGB YCbCr• Choice of sub-sampling ratios:
• 4:4:4 (Professional, Post-Production)• 4:2:2 (same as above)• 4:1:1 (DV Camcorder) • 4:2:0 (Web, DVD)
![Page 11: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/11.jpg)
Discrete Cosine Transform
• Nearly optimal in energy consentration and decorrelating
• The human eye is less sensitive to changes in higher frequencies
• Calculated in blocks, usually 8x8
![Page 12: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/12.jpg)
DCT Coefficient Quantization
• 12-bit coefficient is divided by a corresponding value from intra quantization matrix
• Each value in the resultant matrix is pre-scaled by multiplying by a single value, known as the quantizer scale code (1-112)
• The goal is to force as many of the DCT coefficients to zero or near zero
![Page 13: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/13.jpg)
Variable Length Coding
• Zigzag-scanning long runs of consecutive zeros
• Alternate pattern in MPEG-2 (for interlaced video)• Huffman coding
![Page 14: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/14.jpg)
Coding Techniques
![Page 15: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/15.jpg)
I,P,B Frames
I-pictures: intra-coded no motion-compensated prediction used as reference picture
P-picture: inter-coded forward predicted (from I or P frame)
B-picture: inter-coded bi-directional prediction high coding efficiency
![Page 16: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/16.jpg)
I,P,B Frames
![Page 17: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/17.jpg)
Motion Estimation
![Page 18: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/18.jpg)
Matching
![Page 19: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/19.jpg)
Prediction Error Compensation
![Page 20: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/20.jpg)
Non-Intra Decoding
![Page 21: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/21.jpg)
Summary
● Video compression is needed!
● MPEG enables a wide range of applications
● Different standards but same principles
● Spatial and temporal processing
![Page 22: CT20A6100 MACHINE VISION AND DIGITAL IMAGE ANALYSIS](https://reader031.fdocuments.in/reader031/viewer/2022020801/568133bd550346895d9abd98/html5/thumbnails/22.jpg)
References
● Barry G. Haskell, Atul Puri, Arun N. Netravali, Digital Video: An Introduction to MPEG-2, Chapman and Hall, 1997.
● Joan L. Mitchell, William B. Pennebaker, Chad E. Fogg, Didier J. LeGall, MPEG Video Compression Standard, Chapman and Hall, 1997.
● John Wiseman, An Introduction to MPEG Video Compression, 1998
● Marius Preda, Les standards MPEG dans l'espace multimedia, 2005
● E. G. Richardson, Video Codec Design, Wiley, 2002● Wikipedia