Download - Study and Comparison of MPEG-2 and H.264 main profiles and available transcoding methods EE 5359 Priyanka Ankolekar 1000 51 4497.

Study and Comparison of MPEG-2 and H.264 main profiles and

available transcoding methods

EE 5359Priyanka Ankolekar

1000 51 4497

Introduction

• Development of international video coding standards like MPEG-2 led to a boost in multimedia applications like digital video recording and teleconferencing.

• On growing demand for better compression performance led to advanced video coding standards like H.264.

• H.264 is superficially similar to MPEG-2

• However, there are significant differences in the details.

• This project aims to compare the MPEG-2 and H.264 main profiles and to discuss related transcoding methods.

MPEG-2

• Second of several standards developed by the moving pictures experts group [16].

• Used as the format of digital TV signals and direct broadcast satellite TV systems.

• MPEG-2 is not optimized for low bit rates like 1Mbps. But it outperforms MPEG-1 at 3Mbps and above.

• It is used for higher data rates of 4Mbps (DVD) and 19Mbps (HDTV).

• MPEG-2 devices are back compatible with MPEG-1.

• MPEG-2/Video is formally known as ISO/IEC 13818-2 and as ITU-T Rec.H.262. [21].

MPEG-2 Profiles

MPEG-2 Profiles[16]

Abbr. NamePicture Coding

TypesChromaFormat

Aspect Ratios Scalable modes

SPSimpleprofile

I, P 4:2:0square pixels, 4:3,

or 16:9none

MP Main profile I, P, B 4:2:0square pixels, 4:3,

or 16:9none

SNRSNRScalableprofile

I, P, B 4:2:0square pixels, 4:3,

or 16:9SNR (signal-to-noiseratio) scalable

SpatialSpatiallyScalableprofile

I, P, B 4:2:0square pixels, 4:3,

or 16:9SNR- or spatialscalable

HP High profile I, P, B4:2:2 or

4:2:0square pixels, 4:3,

or 16:9SNR- or spatialscalable

A profile is a collection of compression tools that together make up the coding system. A different profile means that a different set of compression tools is available. [22]

There are five profiles in MPEG-2, as summarized below.

MPEG-2 Encoder [10]

MPEG-2 Encoder (contd.)

• DCT: 2 dimensional 8x8 – for intra frames 8x8 pels – for inter frames 8x8 residual blocks

• Quantizer: Quantizes DCT coefficients using a default or modified matrix.

• Motion Estimation and Compensation:– In the motion estimation process, motion vectors for predicted and interpolated pictures are

coded differentially between macroblocks. – For the motion compensation process integer and half pel resolution motion vectors are

used to predict from previously decoded pictures.

8 16 19 22 26 27 29 3416 16 22 24 27 29 34 3719 22 26 27 29 34 34 3822 22 26 27 29 34 37 4022 26 27 29 32 35 40 4826 27 29 32 35 40 48 5826 27 29 34 38 46 56 6927 29 35 38 46 56 69 83

MPEG-2 Decoder [7]

H.264

• Developed by the Joint Video Team (JVT).

• Achieves MPEG-2 quality compression at almost half the bit rate [7].

• Significant coding efficiency, simple syntax specifications, and seamless integration of video coding into all current protocols and multiplex architectures.

• Supports various applications such as video broadcasting, video streaming, and video conferencing over fixed and wireless networks and over different transport protocols. [4]

H.264 Profiles – Comparison [11]

Baseline Extended Main High

I and P Slices Yes Yes Yes Yes

B Slices No Yes Yes Yes

SI and SP Slices No Yes No No

Multiple Reference Frames Yes Yes Yes Yes

In-Loop Deblocking Filter Yes Yes Yes Yes

CAVLC Entropy Coding Yes Yes Yes Yes

CABAC Entropy Coding No No Yes Yes

Flexible Macroblock Ordering (FMO)

Yes Yes No No

Arbitrary Slice Ordering (ASO) Yes Yes No No

Redundant Slices (RS) Yes Yes No No

Each H.264 profile specifies a subset of entire bitstream of syntax and limits that shall be supported by all decoders conforming to that profile.

There are three profiles in the first version: Baseline, Main and Extended. There are four High profiles defined in the fidelity range extensions [19].

H.264 Profiles – Coding parts [1]

H.264 Encoder[9]

H.264 Encoder

• 4x4 integer DCT: Smaller blocksize leads to a significant reduction in ringing artifacts.

• Quantization and scan: H.264 standard specifies the math formula for the quantization process.

• Deblocking filter: To reduce the blocking artifacts in the block boundaries and to stop the propagation of accumulated coded noise. The filtered image is used in motion compensated prediction of future frames and helps achieve more compression.

• Intra prediction: The encoder derives a predicted block based on its prediction with previously decoded samples (for I frames).

• Inter prediction: – Performed on the basis of temporal correlation and consists of motion estimation and motion

compensation. – Motion vector resolution is ¼ pel– Supports large number of block sizes – Multiple reference pictures (upto 32 previously coded frames).

H.264 Decoder [7]

Comparison between H.264 and MPEG-2

Algorithm Characteristic MPEG-2 H.264

General Motion compensated predictive, residual transformed, entropy coded

Same basic structure as MPEG

Block size 8x816x16, 8x16, 16x8, 8x8, 4x8, 8x4,

4x4

Macroblock size 16x16 (frame mode)16x8 (field mode)

16x16

Intra Prediction None Multi-direction, Multi-pattern

QuantizationScalar quantization with

step size of constant increment

Scalar quantization with step size of increase at the rate of 12.5%

Entropy coding VLC CAVLC, CABAC

Weighted prediction No Yes

Comparison between H.264 and MPEG-2 (contd.)

Algorithm Characteristic MPEG-2 H.264

Reference picture One picture Multiple pictures

Motion Estimation Blocks 16x16 16x16, 8x8, 8x4, 4x4

Entropy Coding Multiple VLC Tables Arithmetic Coding and adaptive VLC Tables

Frame Distance for Prediction +/- 1 Unlimited forward/backward

Fractional Motion Estimation 1/2 Pixel (MPEG2) 1/4 Pixel

Deblocking Filter None Dynamic edge filters

Scalable coding support [2] Yes, layered picture spatial, SNR, temporal scalability

With some support on temporal and SNR scalability

Bit rates with same quality HD video with resolution (1920 x 1080)

12 -20 Mbps 7 – 8 Mbps

Transmission rate 2 – 15 Mbps 64 kbps – 150 Mbps

Performance comparison between H.264 and MPEG-2 – Simulations

• Test streams used: Foreman (CIF), News (CIF), Carphone (QCIF) [26].

• Codecs used: MPEG-2 [25] and H.264 [24]

• Profiles for which the simulations were run: Main and Simple/Baseline.

• Compression ratio = Original file size/Compressed file size

• Formula to compute PSNR for MPEG-2 (MAXI = 255)

Performance comparison between H.264 and MPEG-2 –Simulation (Foreman)

MPEG-2 Main Profile H.264 Main Profile H.264 Main Profile H.264 Main Profile

Performance comparison between H.264 and MPEG-2 –Simulation results (Foreman)

Parameter MPEG-2 H.264

Input video resolution 352 x 288 (CIF)

352 x 288 (CIF)

fps 30 30

# frames encoded 90 90

GOP I-P-B-B-P-B-B I-B-B-P-B-B-P

PSNR (Y) (dB) 30.42 37.03

PSNR (U) (dB) 39.1 41.08

PSNR (V) (dB) 39.6 43.81

Bit rate (kbits/second) 481.00 481.06

Original file size (.yuv) (MB) 13.3 13.3

Compressed file size (KB) 179 172

Compression ratio 74:1 78:1

Performance comparison between H.264 and MPEG-2 –Simulation (News)

MPEG-2 Main Profile H.264 Main Profile

Performance comparison between H.264 and MPEG-2 –Simulation results (News)


Input video resolution 352 x 288 (CIF) 352 x 288 (CIF)

fps 30 30



PSNR (Y) (dB) 37.02 39.1

PSNR (U) (dB) 37.02 41.0

PSNR (V) (dB) 39.02 42.0

Bit rate (kbits/second) 376.00 376.00


Compressed file size (KB) 141 134

Compression ratio 94:1 99.7:1

Performance comparison between H.264 and MPEG-2 –Simulation (Carphone)

MPEG-2 Main Profile H.264 Main Profile

Performance comparison between H.264 and MPEG-2 –Simulation results (Carphone)


Input video resolution 176 x 144 (QCIF) 176 x 144 (QCIF)

fps 30 30



PSNR (Y) (dB) 30.46 37.6

PSNR (U) (dB) 36.36 40.9

PSNR (V) (dB) 36.5 41.5

Bit rate (kbits/second) 128 127.6


Compressed file size (KB) 47.2 45.6

Compression ratio 69.6:1 72.6:1

Conclusions from simulations

For the same bit rate and video resolution, the PSNR (dB) values are greater for H.264 encoded videos than for the MPEG-2 encoded videos indicating better video quality. This can be verified from the screen shots.

The compression ratio for H.264 encoded video is also better than that for MPEG-2 encoded video inspite of better quality video in H.264.

Transcoding: Introduction

• Transcoding is the coding and recoding of digital content from one compressed format to another to enable transmission over different media and playback over various devices [29].

• High demand for efficient usage of available bandwidth.

• Interoperability between different networks and devices is gaining importance. Transcoding is a step in this direction.

• Many legacy systems like digital TVs use MPEG-2. Therefore the need for a transcoding architecture that employs the lower cost of H.264 video and does not require a significant investment in additional video coding architecture.

Transcoding: The criteria

To achieve optimum results, the following criteria should be satisfied:

• The quality of the transcoded bitstream should be comparable to the one obtained by direct decoding and re-encoding of the output stream.

• The information contained in the input stream should be used as much as possible to avoid multigenerational deterioration.

• The process should be cost efficient, low in complexity and achieve the highest quality possible.

Transcoding: Architectures [7,15]Open Loop Transcoding

• Operate in transform domain.

• Computationally efficient – since they operate directly on DCT coefficients.

• Suffer from drift problem.

Transcoding: Architectures [7,15]Cascaded Pixel Domain Transcoding

• Drift-free architecture

• Concatenation of simple encoder and decoder

• Reduced complexity since the encoder reuses the motion vectors along with other information extracted from the input bit stream.

Transcoding: Architectures [7,15]Simplified DCT Domain Transcoding

• Based on the assumption that DCT, IDCT and motion compensation are all linear operations.

• Less memory required as compared to CPDT.

• Computationally intensive, since motion compensation is performed in the DCT domain.

• Due to linearity assumptions, problem of drift may occur.

Transcoding: Architectures [7,15]Cascaded DCT Domain Transcoding

• Used for spatial and temporal resolution downscaling.

• Greater flexibility as compared to SDDT.

• Greater cost and complexity – since additional DCT motion compensation and frame memory is used.

• Adopted for downscaling operations where the encoder side DCT-MC and memory will not cost much.

References[1] Soon-kak Kwon, A. Tamhankar and K.R. Rao, “Overview of H.264 / MPEG-4 Part 10 (pp.186-

216)”, Special issue on “ Emerging H.264/AVC video coding standard”, J. Visual Communication and Image Representation, vol. 17, pp.183-552, April 2006.

[2] A. Puri, H. Chen and A. Luthra, “Video Coding using the H.264/MPEG-4 AVC compression standard”, Signal Processing: Image Communication, vol.19, pp 793-849, Oct 2004.

[3] Hari Kalva, “Issues in H.264/MPEG-2 Video Transcoding”, Computer Science and Engineering, Florida Atlantic University, Boca Raton, FL.

[4] S. Sharma, “Transcoding of H.264 bitstream to MPEG 2 bitstream”, Master’s Thesis May 2006, EE Department, University of Texas at Arlington.

[5] S. Sharma, K. R. Rao, “Transcoding of H.264 bitstream to MPEG-2 bitstream”, Proceedings of Asia-Pacific Conference on Communications 2007.

[6] “Emerging H.264/AVC Video Coding Standard”, J. Visual Communication and Image Representation, vol.17, pp. 183-552, April 2006.

[7] P.N.Tudor, “Tutorial on MPEG-2 Video Compression”, IEE J Langham Thomson Prize, Electronics and Communication Engineering Journal, December 1995.

[8] “The MPEG-2 International Standard”, ISO/IEC, Reference number ISO/IEC 13818-2, 1996.[9] T. Wiegand et. al., “Overview of the H.264/AVC Video Coding Standard”, IEEE Transactions on

Circuits and Systems for Video Technology, Vol. 13, Issue 7, pp. 560-576, July 2003.

[10] J McVeigh et. al., “A software based real time MPEG-2 video encoder”, IEEE Trans. CSVT,

Vol 10, pp 1178-1184, Oct.2000.[11] Morris, O.J., “MPEG-2: Where did it come from and what is it?”, IEE Colloquium, pp. 1/1-1/5,

24 Jan 1995.

References (contd.)[12] Kunzelmann, P.; Kalva, H., “Reduced Complexity H.264 to MPEG-2 Transcoder”, ICCE 2007,

pp. 1-2, January 2007.[13] Kamaci, N., Altunbasak, Y., “Performance Comparison of the Emerging H.264 Video Coding

Standard with the existing standards”, ICME, Vol.1, pp. 345-8, July 2003.[14] Jun Xin, Chia-Wen Lin, Ming-Ting Sun , “Digital Video Transcoding”,Proceedings of the IEEE, Vol. 93, Issue 1,pp 84-97, January 2005.[15] A. Vetros, C. Christopoulos and H. Sun, “Video transcoding architectures and techniques: an

overview”, IEEE Signal Processing magazine, Vol. 20, Issue 2, pp 18-29,March 2003.[16] “MPEG-2”, Wikipedia, 14 February 2008. Available at <http://en.wikipedia.org/wiki/Mpeg_2>[17] “Introduction to MPEG 2 Video Compression” Available at <http://www.bretl.com/mpeghtml/codecdia1.HTM>[18] “H.264/MPEG-4 AVC”, Wikipedia, 18 February 2008. Available at < http://en.wikipedia.org/wiki/H.264>[19] “H.264 A new Technology for Video Compression” – Available at <

http://www.nuntius.com/technology3.html>[20] R Periera, “Efficient transcoding of MPEG-2 to H.264”, Master’s thesis Dec 2005, EE

Department, University of Texas at Arlington. [21] “H.262 : Information technology - Generic coding of moving pictures and associated audioinformation: Video”, International Telecommunication Union, 2000-02.Available at < http://www.itu.int/rec/T-REC-H.262>[22] “MPEG-2 White paper”, Pinnacle Technical Documentation, Version 0.5, Pinnacle Systems,

Feb. 29, 2000.

http://en.wikipedia.org/wiki/Mpeg_2

References (contd.)[23] M. Ghanbari, “Standard Codecs : Image Compression to Advanced Video Coding,” Hertz, UK:

IEE, 2003.[24] H.264 software (version 13.2) obtained from:<http://iphome.hhi.de/suehring/tml/>[25] MPEG-2 software (version 12) obtained from:<http://www.mpeg.org/MPEG/video/mssg-free-mpeg-software.html>[26] Test streams (Foreman, News, Carphone) obtained from:<http://www-ee.uta.edu/dip/Courses/EE5356/ee_5356.htm> [27] Implementation Studies Group, “Main Results of the AVC Complexity analysis”, MPEG

document N4964, ISO/IEC JTC11/SC29/WG11, July 2002.[28] A. Joch et al., “Performance comparison of video coding standards using Lagarangian coder

control”, Int. Conf. of Image Processing, Vol. 2, pp. II-501 to II-504, Sept. 2002.[29] I. Sylvester, “Transcoding: The future of the video market depends on it”, IDC Executive Brief,

Nov. 2006. Available at <

http://www.ed-china.com/ARTICLES/2006NOV/2/2006NOV10_HA_AVC_HN_12.PDF>[30] R. Hoffner, “MPEG-4 Advanced Video Coding emerges”, Available at < http://www.tvtechnology.com/features/Tech-Corner/F_Hoffner-03.09.05.shtml>[31] S. Wagston and A. Susin, “IP core for an H.264 Decoder SoC”, 2007,Available at< www.us.design-reuse.com/news/?id=15746&print=yes>[32] S. Krishnamachari and K. Yang, “MPEG-2 to H.264 Transcoding: Why and How?”, Dec. 1,

2006, Available at <

http://broadcastengineering.com/infrastructure/broadcasting_mpeg_transcoding_why/index1.html>