Post on 29-Aug-2018
Page 1 of 40
EE5359: MULTIMEDIA PROCESSING
A FINAL REPORT ON THE PROJECT
COMPARISON AND ANALYSIS OF INTRA
PREDICTION EFFICIENCY IN HEVC, H.264, VP9 AND
AVS CHINA PART 2
SPRING 2015
BY
SWETHAA ALLIYALAMANGALAM JAYARAMAN
STUDENT ID: 1001053849
swethaa.jayaraman@mavs.uta.edu
UNDER THE GUIDANCE OF
DR. K.R.RAO
ELECTRICAL ENGINEERING DEPARTMENT
THE UNIVERSITY OF TEXAS AT ARLINGTON
Submission Date: 5/11/2015
Page 2 of 40
TABLE OF CONTENTS
INDEX TITLE PAGE NO.
ACRONYMS 3
1 Abstract 5
2 Introduction 6
3 What is Intra Prediction? 8
4 Overview of Video Coding Standards to be compared 9
a) AVS China PART 2 9
b) VP9 15
c) H.264/AVC 18
d) HEVC 20
5 Performance Comparison Metrics 23
a) MSE and PSNR 23
b) SSIM 23
c) Bjøntegaard-Delta Bit-Rate Measurements 23
d) RD- Plots 23
e) Computational Complexity 23
6 Test Sequences 24
7 Test Platform 26
8 Test Results 30
9 Plots 32
10 Project Progress 36
REFERENCES 37
Page 3 of 40
ACRONYMS
AVC Advanced Video Coding
AVS Audio Video Standard
ADST Asymmetric Discrete Sine Transform
AU Access Unit
BBC British Broadcasting Corporation
BD-BR Bjøntegaard-Delta Bit-Rate
BD-PSNR Bjøntegaard-Delta Peak Signal-to-Noise Ratio
CABAC Context-adaptive binary arithmetic coding
CTU Coding Tree Unit
CU Coding Unit
DBF De-Blocking Filter
DC Direct Current
DCT Discrete Cosine Transform
DFT Discrete Fourier Transform
DST Discrete Sine Transform
EBU European Broadcasting Union
HD High Definition
HDTV High Definition Television
HEVC High Efficiency Video Coding
ISO International Organization for Standardization
ITU-T International Telecommunication Union (Telecommunication Standardization Sector)
JPEG Joint Photographic Experts Group
JVT Joint Video Team
MB Macroblock
MPEG Moving Picture Experts Group
MSE Mean Square Error
NAL Network Adaptation Layer
NGOV Next Generation Open Video
OBMC overlapped block-based motion compensation
PSNR Peak Signal-to-Noise Ratio
PU Prediction Unit
QCIF Quarter Common Intermediate Format
RD Rate Distortion
RDO Rate Distortion Optimization
SAO Sample Adaptive Offset
SDTI Serial Data Transport Interface
SMPTE Society of Motion Picture and Television Engineers
SSIM Structural Similarity Index
TM True Motion
TU Transform Unit
UVLC Universal Variable Length Code
Page 5 of 40
1. ABSTRACT:
In the Present Era of Blooming Technology, demand for higher resolution and better quality videos
has escalated exponentially. This has caused a tremendous requirement for proper storage and
transmission of videos across various channels and networks which has in turn led into the
development of better video compression techniques [1]. Video compression is the process of
lessening the amount of data needed for representation of the videos by removing redundant
data [1]. Video decompression is the inverse process of Video compression. Video compression
and decompression are also called as video Encoding (Coding) and video decoding. The device or
software used for both of them is called encoder and decoder respectively [1] [3]. The need for
video coding led to the evolution of Video Coding Standards [1].
The first video coding standard developed was H.120 in 1984 by ITU-T (International
Telecommunication Union-Telecommunications standardization sector) [3]. Over the years,
numerous video coding standards have been developed and some of them got standardized [3].
This project aims at comparing various video coding standards such as HEVC (High Efficiency Video
Coding), H.264/AVC, VP9 and AVS (Audio Video Standard) China part 2 based upon Intra
prediction Efficiency. The comparison is carried out with the help of performance comparison
metrics such as PSNR [5], SSIM [45], MSE [5], and BD – PSNR [4], BD BR [4], Computational
Complexity and RD-plots have been plotted. Intra prediction is one of the key feature which helps
in determination of compression efficiency of the whole codec [11].
The tests will be carried using The HM Test Model 16.3 [8], JM Software 18.6 [9], The WebM
Project’s Encoder [7] and AVS China Reference Software [34] for HEVC, H.264/AVC, VP9 and AVS
China PART 2 respectively.
Page 6 of 40
2. INTRODUCTION:
From Figures 1 and 2, the generalized working of the Codec could be understood. The recent codec’s are
Block-Based. Herein, the input video frame is initially partitioned into blocks of the same size called
macroblocks and within each one of them coding and decoding process works. Further to perform
prediction, a macroblock is sub-partitioned into smaller blocks.
Figure 1: Generalized Video Encoder with intra prediction and other improved features [13]
Page 7 of 40
Figure 2: Generalized Video Decoder with intra prediction and other improved features [13]
Intra prediction works within a current video frame and is based upon the already available encoded
and decoded data for the block being predicted. Inter-prediction is used for motion compensation: a
similar region on previously coded frames close to the current block is used for prediction. The main
focus of the prediction process is to reduce redundancy of data and henceforth, avoid the storage of
excessive information in the encoded bit stream [14].
Page 8 of 40
3. WHAT IS INTRA PREDICTION?
Intra Frame coding is the process in which the spatial redundancies present within an image or video are
exploited by coding the original blocks through transform, quantization and entropy coding, independent
of the surrounding frames [35].
Intra prediction is carried in the current video frame and makes prediction for the current block based
upon the available encoded and decoded data [14]. When using intra frame coding, intra prediction
attempts to predict the current block from the neighboring pixels in the adjacent blocks in a defined set
of directions. [35]. Intra Prediction plays a key role in the determination of compression efficiency of the
whole codec [11]. It was initially proposed in 1952 and then it saw its application in transform domain
such as H.261 and H.263 [12]. Telenor Satellite Services proposed 3 modes [15] for intra prediction,
including DC mode plus vertical mode and horizontal mode in the spatial domain in 1997 [16].
Through this project, Intra Prediction modes among various video coding standards will be studied.
Table 1 shows the Intra Prediction modes among various video coding standards at a glance.
VIDEO CODING STANDARDS BLOCK SIZE NUMBER OF PREDICTION MODES
HEVC 16x16 CTU, 32x32 CTU, 64x64 CTU
35 (0-34)
H.264/AVC 4x4 Spatial, 16x16 Spatial, I-PCM
9 or 4
VP9 64x64, 32x32, 32x16, 16x16, 8x16, 8x8 and 4x4 (rectangular intra prediction possible)
10
AVS PART 2 8x8 block based Intra Prediction 5 (0-4)
Table 1: Intra Prediction among various video coding standards at a glance
Page 9 of 40
4. OVERVIEW OF VIDEO CODING STANDARDS TO BE COMPARED:
a) AVS (Audio Video Standard) China PART 2:
INTRODUCTION:
The AVS Video Coding standard was developed by the China Audio Video Coding Standard (AVS) working
group. It has been successful in gaining popularity from industries as well as research institutes. AVS-video
is an application driven coding standard. The AVS standards consists of several parts such as system, video,
audio, conformance testing and reference software etc. AVS Part 2 focusses on high-definition digital
video broadcasting and high-density storage media. It is also known as AVS1-P2 in AVS [18]. Figures 3 and
4 represent the encoder and decoder of AVS China Part 2.
Figure 3: AVS China PART 2: Encoder [17]
Page 10 of 40
Figure 4: AVS China PART 2: Decoder [17]
LAYERED DATA STRUCTURE:
Figure 5: AVS: Layered Data Structure [19]
AVS is built on the layered video structure where in the video signals are divided into several frames.
Figure 5 represents the layered structure. Firstly, the input video stream is organized into sequences.
Then, the sequences are divided into frames and are termed as pictures. Then, pictures are divided into
rectangular regions called slices. Furthermore, the slices are further divided into square regions called
macro-blocks and finally, the macro blocks are further divided into each of 8x8 pixels. The sequence,
pictures and slices begin with unique start codes that allows the decoder to identify them in the received
bit stream [19].
Page 11 of 40
CODING TOOLS:
Table 2: AVS China Part 2: Major Coding Tools [19]
INTRA- FRAME PREDICTION MODE:
Herein, the spatial prediction technique is implemented and it is based upon 8x8 block structure. 5
luminous intra prediction technique and 4 chrominance intra prediction technique have been
implemented. Here, the reference pixels are the reconstructed pixels of neighboring block without the
de-blocking filter [19].
Figure 6: AVS China Part 2: Neighboring pixels in intra prediction [20]
Figure 7: AVS China Part 2: Five Luminance intra prediction modes [20]
Page 12 of 40
b) VP9:
INTRODUCTION:
Like DIRAC, it is also an open source and free-license video compression standard developed
by Google [32]. Under development, it was known as NGOV (Next Generation Open Video)
and VP-Next. It is successor to VP8. It also aims at reduced bit rate by 50% compared to its
predecessor with the same video quality [32]. Figures 12 and 13 represent the encoder and
decoder of VP9.
Figure 12: VP9: Encoder [33]
Figure 13: VP9: Decoder [33]
Page 13 of 40
CODING TOOLS:
Prediction Block Sizes
Prediction Modes
Transform and Quantization
Entropy Coding
Post Processing: De-Blocking Filter
Table 4: VP9: Major Coding Tools
PREDICTION BLOCK SIZES:
VP9 introduces superblocks of size 64x64. It also facilitates intra prediction for rectangular
blocks. The rectangular blocks can further be divided into square blocks up to the size of 4x4.
Figure 14: Partitioning of a Super Block in VP9 [33]
INTRA PREDICTION MODES:
VP9 supports a set of 10 prediction modes [32] [33] for block sizes 4x4 as in Figure 15 to
32x32. They are:
DC_PRED (DC prediction)
TM_PRED (True-motion prediction)
H_PRED (Horizontal prediction)
V_PRED (Vertical prediction)
6 oblique directional prediction modes:
D27 (angle 27 degrees)
D45 (angle 45 degrees)
Page 14 of 40
D63 (angle 63 degrees)
D117 (angle 117 degrees)
D135 (angle 135 degrees)
D153 (angle 153 degrees)
Note: Angles are measured in anti-clockwise direction against the horizontal axis.
Figure 15: Angular Intra Prediction Modes for VP9 [14]
Page 15 of 40
c) H.264/AVC:
INTRODUCTION:
THE H.264/AVC is developed by ITU-T Video Coding Experts Group (VCEG) and ISO/JEC MPEG Video Group
named Joint Video Team (JVT) [36]. The high coding efficiency of H.264, gives perceptually equivalent
video quality at much less bitrate compared to traditional video coding standards such as MPEG-2 [37],
provides encouragement to TV and internet.
Main Goals:
Enhance compression performance
Provision of a network-friendly video representation addressing conversational (video telephony)
and non-conversational (storage, broadcast, or streaming) applications [38].
Figure 16: Encoder of H.264/AVC Codec [39]
Figure 17: Decoder of H.264/AVC Codec [39]
Page 16 of 40
INTRA PREDICTION MODES:
Each PU is predicted from neighboring image data in the same picture, using DC prediction (an average
value for the PU), planar prediction (fitting a plane surface to the PU) or directional prediction
(extrapolating from neighboring data) [40].
Figure 18: Intra_4x4 Prediction in H.264/AVC [40]
Intra_4x4 has 9 prediction modes:
Mode 0: Vertical Prediction
Mode 1: Horizontal Prediction
Mode 2: DC Prediction
Mode 3: Diagonal Down-Left Prediction
Mode 4: Diagonal Down-Right Prediction
Mode 5: Vertical Right Prediction
Mode 6: Horizontal Down Prediction
Mode 7: Vertical Left Prediction
Mode 8: Horizontal Up Prediction
Intra_16x16 has 4 Prediction Modes:
Mode 0: Vertical Prediction
Mode 1: Horizontal Prediction
Mode 2: DC Prediction
Mode 3: Plane Prediction
Page 17 of 40
d) HEVC(HIGH EFFICIENCY VIDEO CODING):
INTRODUCTION:
High Efficiency Video Coding (HEVC) is the latest Video Coding format [43]. It challenges the state-of-
the-art H.264/AVC [44] Video Coding standard which is in current use in the industry by being able to
reduce the bit rate by 50% [44] and retaining the same video quality. It came into existence in the
early 2012 although Joint Collaborative Team on Video Coding (JCT-VC) was formed in January 2001
to carry out developments on HEVC, and ever since then a huge range of development has been going
on. On 13 April 2013 [44], HEVC standard also called H.265 was approved by ITU-T. Joint Collaborative
Team on Video Coding (JCTVC), is a group of video coding experts from ITU-T Study Group (VCEG) and
ISO/IEC JTC 1/SC 29/WG 11 (MPEG). Figures 19 and 20 represents the encoder and decoder of HEVC.
Figure 19: Encoder for HEVC [41]
Figure 20: Decoder of HEVC [42]
Page 18 of 40
Prediction block sizes and macro-block concept:
The concept of macroblock in HEVC [14] is represented by the Coding Tree Unit (CTU). CTU size can be
16x16, 32x32 or 64x64, while AVC macroblock size is 16x16. Larger CTU size aims to improve the efficiency
of block partitioning on high resolution video sequence. Larger blocks provoke the introduction of quad-
tree partitioning of a CTU into smaller coding units (CUs). A coding unit is a bottom-level quad-tree syntax
element of CTU splitting. The CU contains a prediction unit (PU) and a transform unit (TU).
Figure 21: Prediction Unit Splitting in HEVC [14]
The CU can contain up to four prediction units. CU splitting on PUs can be 2Nx2N, 2NxN, Nx2N, NxN,
2NxnU, 2NxnD, nLx2N and nRx2N as shown in Figure 21 where 2N is a size of a CU being split. In the intra
prediction mode only 2Nx2N PU splitting is allowed. An NxN PU split is also possible for a bottom level CU
that cannot be further split into sub CUs.
Intra Prediction Mode:
There are a total of 35 intra prediction modes in HEVC: planar (mode 0), DC (mode 1) and 33 angular
modes (modes 2-34 in Figure 19). DC intra prediction is the simplest mode in HEVC. All PU pixels are set
equal to the mean value of all available neighboring pixels. Planar intra prediction is the most
computationally expensive. It is a two- dimensional linear interpolation. Angular intra prediction modes
2-34 are linear interpolations of pixel values in the corresponding directions. Vertical intra prediction
(modes 18- 34) is an up down interpolation of neighboring pixel values. Also, intra prediction can be done
at different block sizes, ranging from 4 X 4 to 64 X 64 (whatever size the PU has) [33].
Figure 22: Prediction Modes in HEVC [14]
Page 20 of 40
5. PERFORMANCE COMPARISON METRICS:
a) Mean Square Error (MSE) AND Peak Signal to Noise Ratio
(PSNR):
MSE and PSNR [5] for an NxM pixel image are defined in equations 1 and 2 where O is the original image
and R is the reconstructed image. M and N are the width and height of an image and ‘L’ is the maximum
pixel value in the NxM pixel image.
b) Structural Similarity Index (SSIM)
The structural similarity (SSIM) [45] index is a method for measuring the similarity between two images.
SSIM emphasizes that the human visual system is highly adapted to extract structural information from
visual scenes. Therefore, structural similarity measurement should provide a good approximation to
perceptual image quality. SSIM is designed to improve on methods like peak signal-to-noise ratio (PSNR)
and mean squared error (MSE), which have proved to be inconsistent with human eye perception. SSIM
considers image degradation as perceived change in structural information. Structural information is the
idea that the pixels have strong inter-dependencies especially when they are spatially close.
Where x and y correspond to two different signals that need to be compared for similarity, i.e. two
different blocks in two separate images.
c) Bjøntegaard-Delta Bit-Rate Measurements:
As rate-distortion (R-D) performance assessment [4], Bjøntegaard-Delta bit-rate (BD-BR) measurement
method is used for calculating average bit-rate differences between R-D curves for the same objective
quality (e.g., for the same PSNRYUV values), where negative BD-BR values indicate actual bit-rate savings.
Page 21 of 40
6. TEST SEQUENCES[31]
1. Name: Claire_qcif.yuv
Resolution: 176x144
Frame Rate: 15fps
2. Name: Bridge-close_cif.yuv
Resolution: 352x288
Frame Rate: 30fps
3. Name: BQMall_832x480_60.yuv
Resolution: 832x480
Frame Rate: 60fps
Page 22 of 40
4. Name: BasketballDrive_1280x720_50.yuv
Resolution: 1280x720
Frame Rate: 50 fps
5. Name: Kimono_1920x1080_24.yuv
Resolution: 1920x1080
Frame Rate: 24 fps
Page 23 of 40
7. TEST SETUP
1. HEVC Implementation: Using HM 16.4 Software[8]:
After downloading the software and installing it, the solution can be built and run using
Microsoft Visual Studio 2010. Herein, the solution is built in the ‘RELEASE’ mode in
Microsoft Visual Studio. This will generate lencod and ldecod and executable files which
can be located in the ‘bin’ directory.
The encoder or decoder can be run by using the command line parameters in the
command prompt.
This sequence is tested for various quantization parameters. The value of quantization
parameter can be changed in the encoder.cfg file.
HM 16.4[8] Configuration Set Up:
Configuration used: Main All Intra Mode Configuration
IntraPeriod : 1 # Period of I-Frame ( -1 = only first)
GOPSize : 1 # GOP Size (number of B slice = GOPSize-1)
QP : 22 # Quantization parameter(0-51) (22, 27, 32 or 37 is used at a time)
Sample command line parameters:
C:\ HEVC\ bin\vc10\Win32\Release>TAppEncoder.exe -c
C:\HEVC\cfg\encoder_intra_main.cfg -wdt 832 -hgt 480 -fr 60 -f 10 -i
C:\HEVC\test_seq\BQ_Mall 832x480_60.yuv
-Description:
-c: config file to be used
-wdt: width of the yuv video
-hgt: height of the yuv video
-fr: frame rate of the sequence
-f: no.of frames to be encoded
-i: the input sequence path
Page 24 of 40
2. H.264 Implementation: JM 18.3 Configuration Set Up[9]:
After downloading the software and installing it, the solution can be built and run using
Microsoft Visual Studio 2010. Herein, the solution is built in the ‘RELEASE’ mode in
Microsoft Visual Studio. This will generate lencod and ldecod and executable files which
can be located in the ‘bin’ directory.
The encoder or decoder can be run by using the command line parameters in the
command prompt.
This sequence is tested for various quantization parameters. The value of quantization
parameter can be changed in the encoder_main.cfg file.
HM 16.4[8] Configuration Set Up:
Profile used: Main Profile
Sample command line parameters:
C:\ h_264\ bin >lencod.exe -f encoder_ main.cfg –p InputFile=
"C:\HEVC\test_seq\bridge-close_cif.yuv" -p FramesToBeEncoded = 10 -p SourceWidth
= 352 -p SourceHeight = 288 -p -p QPISlice = 32 -p FrameRate = 30.0 -p ProfileIDC= 77 –
p LevelIDC =40 –p Intraperiod = 1
Description:
-f: config file to be used
- SourceWidth: width of the yuv video
- SourceHeight: height of the yuv video
-FrameRate: frame rate of the sequence
-FramesToBeEncoded: no.of frames to be encoded
-InputFile: the input sequence path
3. VP9:The WebM Project Software[10]
This software is supported only in linux environment. The solution can be built by using
following commands in Linux:
mkdir libvpx-pub
cd libvpx-pub
git clone http://git.chromium.org/webm/libvpx.git
cd libvpx
git checkout -b master origin/master
cd build
Page 25 of 40
mkdir linuxbuild
cd linuxbuild
../../configure --target=x86_64-linux-gcc --enable-internal-stats --disable-vp8
make -j 12
Set Up Environment: Ubuntu 14.4
Mode Used: All Intra (Achieved by configuring the key frame parameter)
Sample Command Line Parameter:
vpxenc Kimono_1920x1080_24.yuv -o kimono.webm \--codec=vp9 --i420 --
width=1920 --height=1080 --passes=2 -t 0 \--rt --good --cpu-used=0 --end-usage=q \--
auto-alt-ref=1 --fps=24000/1001 --verbose --psnr \--lag-in-frames=25 --kf-max-dist=1 \-
-min-q=32 --max-q=32
If, y4m video sequence is used then the source height, source width and frame rate
need not be mentioned.
Description:
-o: output file
--codec: codec to be used
-i: represents the chroma format
--width: width of the test sequence
--height: height of the test sequence
--passes: No.of passes (1/2)
-t: maximum no.of threads
--end-usage: cbr/vbr/cq/q
--target-bit-rate: Bit-rate desired
--cq-level: Constrained Quality Level(22,27,32,37)
--auto-alt-ref:
--fps: frame rate
--psnr: to display psnr value
--kf-max-dist: for the intra-frame config (here)
4. AVS China:AVS China Reference Software [37]:
After downloading the software and installing it, the solution can be built and run using
Microsoft Visual Studio 2010. Herein, the solution is built in the ‘RELEASE’ mode in
Page 26 of 40
Microsoft Visual Studio. This will generate ldecod and lencod and executable files which
can be located in the ‘bin’ directory.
The encoder or decoder can be run by using the command line parameters in the
command prompt.
This sequence is tested for various quantization parameters. For All-Intra Configuration
the encoder_ai.cfg is used.
Various parameters can be changed by parsing the parameters onto the config file.
Mode Used: All Intra
Sample Command Line Parameter:
lencod.exe -f encoder_ai.cfg -p InputFile = "C:\HEVC\test_seq\bridge-close_cif.yuv" -p
FramesToBeEncoded = 10 -p SourceWidth = 352 -p SourceHeight = 288 -p TraceFile =
"log_bridge.txt" -p OutputFile = "test_bridge.avs" -p QPIFrame = 32 -p FrameRate = 5 -
p ChromaFormat = 1
-f: config file
-p QPIFrame: QP=22,27,32 or 37
-p FrameRate: 5 (30fps)
Page 30 of 40
9. PLOTS
Part 1: R D Plots:
1. For the sequence: Claire_qcif.yuv :
2. For the sequence: bridge-close_cif.yuv :
30
32
34
36
38
40
42
44
46
48
0 100 200 300 400 500 600
PSN
R (
dB
)
Bitrate (kbps)
R-D Plot: claire_qcif.yuv HEVC H.264 VP9 AVS China part 2
30
32
34
36
38
40
42
44
46
48
50
0 1000 2000 3000 4000 5000 6000 7000
PSN
R(d
B)
Bitrate (kbps)
R-D Plot: bridge-close_cif.yuv
HEVC H.264 VP9 AVS China Part 2
Page 31 of 40
3. For the sequence: BQMall_832x480_60.yuv:
4. For the sequence: BasketballDrive_1080x70_50.yuv:
30
32
34
36
38
40
42
44
46
48
50
0 10000 20000 30000 40000
PSN
R (
dB
)
Bitrate (kbps)
R-D Plot: BQMall_832x480_60.yuv
HEVC H.264 VP9 AVS China part 2
30
32
34
36
38
40
42
44
46
48
50
0 5000 10000 15000 20000 25000 30000
PSN
R (
dB
)
Bitrate (kbps)
R-D Plot:BasketballDrive_1080x720_50.yuv
HEVC H.264 VP9 AVS China part 2
Page 32 of 40
5. For the sequence: Kimono_1920x1080_24.yuv:
30
32
34
36
38
40
42
44
46
48
50
0 10000 20000 30000 40000
PSN
R (
dB
)
Bitrate (kbps)
R-D Plot: Kimono_1920x1080_24.yuv HEVC H.264 VP9 AVS China part 2
Page 33 of 40
Part 2: Encoding Time Comparison:
1. For the Sequence: Claire_qcif.yuv:
2. For the Sequence: bridge-close_cif.yuv:
0
1
2
3
4
5
6
7
8
22 27 32 37
Enco
din
g Ti
me
(se
c)
Quantization Parameter
ENCODING TIME COMPARISON: CLAIRE_QCIF.YUV
HEVC H.264 VP9 AVS China Part 2
0
10
20
30
40
50
22 27 32 37
Enco
din
g Ti
me
(se
c)
Quantization Parameter
ENCODING TIME COMPARISON: BRIDGE-CLOSE_CIF.YUV
HEVC H.264 VP9 AVS China Part 2
Page 34 of 40
3. For the Sequence: BQMall_832x480_60.yuv:
4. For the Sequence: BasketballDrill_1280x720_50.yuv:
0
50
100
150
200
250
22 27 32 37
Enco
din
g TI
me
(se
c)
Quantization Parameter
ENCODING TIME COMPARISON: BQMALL_832X480_60.YUV
HEVC H.264 VP9 AVS China Part 2
0
50
100
150
200
250
22 27 32 37
Enco
din
g Ti
me
(se
c)
Quantization Parameter
ENCODING TIME COMPARISON: BASKETBALLDRIVE_1280X720_50.
YUV
HEVC H.264 VP9 AVS China Part 2
Page 35 of 40
5. For the Sequence: Kimono_1920x1080_24.yuv:
0
100
200
300
400
500
600
22 27 32 37
Enco
din
g Ti
me
(se
c)
Quantization Parameter
ENCODING TIME COMPARISON: KIMONO_1920X1080_24.YUV
HEVC H.264 VP9 AVS China Part 2
Page 36 of 40
10. PROGRESS:
Implemented the test sequences for HEVC, H.264,VP9 and AVS China Part 2.
Plotted the R-D Plots.
Compared the Encoding Time among the standards.
Will compute and plot BD-PSNR and BD-BR.
Page 37 of 40
REFERENCES [1] I.E. Richardson, “The H.264 Advanced video Compression Standards”, Wiley, 2010.
[2] HM Software manual: http://hevc.hhi.fraunhofer.de/
[3] K.R. Rao, D.N. Kim and J.J. Hwang, “Video Coding Standards: AVS China, H.264/MPEG-4 Part10, HEVC,
VP6, DIRAC and VC-1”, Springer, 2014
[4] BD-BR and BD-PSNR: G. Bjøntegaard, “Calculation of average PSNR differences between RD-curves”,
ITU-T Q.6/SG16 VCEG 13th Meeting, Document VCEG-M33, Austin, USA, Apr. 2001.
[5] PSNR and MSE:
http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/VELDHUIZEN/node18.html
[6] V.Sze, M.Budagavi and G. Sullivan, “High Efficiency Video Coding”, Springer 2014.
[7] The WebM Project’s VP9 Encoder: http://www.webmproject.org/vp9/
[8] The HM Test Model 16.3: http://hevc.hhi.fraunhofer.de/HM-doc/
[9] JM Software 18.6: http://iphome.hhi.de/suehring/tml/
[10] “Introduction to the Issue on Video Coding: HEVC and Beyond”, IEEE Journal of Selected Topics in
Signal Processing, Vol.7, pp.931-1151, Dec. 2013.
[11] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html: Project
on: “Intra Prediction Efficiency and Performance Comparison of HEVC and VP9”, S. Sukumaran, 2014.
[12] ITU-T Recommendation H.263, “Video coding for low bit-rate communication”, Feb. 1998
[13] J. Ostermann et al, “Video Coding with H.264/AVC: Tools, performance and complexity”, IEEE -Circuits
and Systems Magazine, vol. 4, pp. 7-28, First Quarter 2004.
[14] M.P. Sharabayko et al, "Intra Compression Efficiency in VP9 and HEVC" Applied Mathematical
Sciences, Vol. 7, no. 137, pp. 6803 – 6824, Hikari Ltd, 2013
[15] G. Bjontegaard, “Coding improvement by using 4x4 blocks for motion vectors and transform”, ITU-
T/Study Group 16/Video Coding Experts Group (Question 15), Eibsee, Bavaria, Germany, document Q15-
C-23, Nov. 1997.
[16] Z. Nan et al, “Spatial Prediction Based Intra-Coding [video-coding]” IEEE International Conference on
Multimedia and Expo (ICME), Vol. 1, pp. 97-100, June 2004.
[17] Access the http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project on:
“Video compression standard for high definition video: A comparative study of H.264, Dirac Pro and AVS
part 2”, S. Gangavati, 2012.
[18] W. Gao et al, “AVS Video Coding Standard”, Intelligent Studies in Computational Intelligence Vol.
280, pp. 125-166, 2010.
[19] W. Gao et al, “AVS- the Chinese Next Generation Video Coding Standard”,
http://www.avs.org.cn/reference/AVS%20NAB%20Paper%20Final03.pdf
[20] L. Yu et al, “An Overview of AVS-Video: tools, performance and complexity”, Visual
Communications and Image Processing 2005, Proc. of SPIE, vol. 5960, pp. 596021, July 31, 2006.
[21] AVS China: http://www.avs.org.cn/english/index.asp
[22] M.Wien, “High Efficiency Video Coding: Coding Tools and Specification”, Springer, 2015.
[23] A.Grange, Internet draft from Network Working Group “A VP9 Bitstream Overview” August 2013.
[24] J.Bankoski et al, “Towards a Next Generation Open source Video Codec” SPIE Vol. 8666 Page 2, Dec.
2013.
Page 38 of 40
[25] Access website http://forum.doom9.org/ “How HEVC/H.265 works, technical details & diagrams”
[26] M.Shafique and J.Henkel “Low Power Design of the Next-Generation High Efficiency Video Coding”
IEEE Automation Conference (ASP-DAC), 19th Asia and South Pacific, Pages: 274 – 281, Jan. 2014.
[27] I.E. Richardson, “Coding video: A practical guide to HEVC and beyond,” Wiley, 11 May 2015.
[28] T. Davies, “A modified rate-distortion optimization strategy for hybrid wavelet video coding”, IEEE
International Conference on Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings.
2006, Vol.: 2, pp. II, Publication Date: 14-19 May 2006.
[29] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project
on: “Performance Analysis of Dirac Pro with H.264 Intra frame coding”, P. Kharwandikar, 2010.
[30] K. Onthriar, K. K. Loo and Z. Xue, “Performance comparison of emerging Dirac video codec with
H.264/AVC”, IEEE International Conference on Digital Telecommunications, 2006, ICDT apos; Vol. 06,
pp.22, Issue: 29-31, Aug. 2006.
[31] Video Test Sequences: http://ultravideo.cs.tut.fi/#testsequences &
http://forum.doom9.org/archive/index.php/t-135034.html
32] "VP-Next Overview and Progress Update" (PDF). WebM Project (Google). Retrieved 2012-12-29.
Available on: http://downloads.webmproject.org/ngov2012/pdf/04-ngovproject-update.pdf
[33] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project
on: “Comparative study of Intra Frame Coding efficiency in HEVC and VP9”, S. Kodpadi, 2014.
[34] AVS China software: Part 2 (Password protected):
ftp://124.207.250.92/public/codec/video_codec/AVS2_P2/RD10.1.zip
[35] J. Golston (Distinguished Member Technical Staff, Texas Instruments), “Comparing Media Codecs for
Video Content”, Embedded Systems Conference – San Francisco 2004.
[36] ISO/IEC JTC1/SC29/WG1 1 (MPEG), "Coding of audio-visual objects - Part 10: Advanced Video Coding,"
International Standard 14496-10, ISO/IEC, 2004.
[37] ISO/IEC JTC I/SC29/WGI I (MPEG), "Information technology - Generic coding of moving pictures and
associated audio information: Video," International Standard 13818-2, ISO/IEC, 2000.
[38] T. Wiegand et al, “Overview of the H.264/AVC Video Coding Standard”, IEEE Transactions on Circuits
and Systems for Video Technology, Vol. 13, No. 7, pp. 560-576, July 2003.
[39] X. Zhou, E. Q. Li, and Y.-K. Chen, “Implementation of H.264 decoder on general purpose processors
with media instructions”, SPIE Conference on Image and Video Communications and Processing, vol. 5022,
pp.224-235, May 2003.
[40] A. K. Khan and H. Jamal ,”The Intra-Prediction in H.264”, 2008. : https://getinfo.de/app/The-Intra-
Prediction-in-H-264/id/BLCP%3ACN069872406
[41] D. Marpe et al, “The H.264/MPEG4 advanced video coding standard and its applications”, IEEE
Communications Magazine, Vol. 44, pp. 134-143, Aug. 2006.
[42] C. Fogg, “Suggested figures for the HEVC specification”, ITU-T / ISO-IEC Document: JCTVC J0292r1,
July 2012.
[43] G.J. Sullivan et al, "Standardized Extensions of High Efficiency Video Coding (HEVC)", IEEE Journal of
Selected Topics in Signal Processing, vol.7, no.6, pp. 1001-1016, Dec. 2013
[44] T. Wiegand et al, “Overview of the H.264/AVC Video Coding Standard”, IEEE Transactions on Circuits
and Systems for Video Technology, Vol. 13, No. 7, pp. 560-576, July 2003.
Page 39 of 40
[45] Z. Wang, et al., “Image quality assessment: From error visibility to structural similarity”, IEEE
Transactions Image Processing, vol. 13, pp. 600–612, Apr. 2004
[46] J.Ohm et al., ” Comparison of the Coding Efficiency of Video Coding Standards- Including High
Efficiency Video Coding (HEVC)”, IEEE Transactions on Circuits and Systems for Video Technology on
Circuits and Systems for Video Technology, Vol.22, pp. 1669 – 1684, No.12, December 2012.
[47] Multimedia Processing: http://nptel.iitk.ac.in/courses/Webcourse-
contents/IIT%20Kharagpur/Multimedia%20Processing/New_index1.html
[48] I.E. Richardson, “Coding Video: A Practical guide to HEVC and beyond”, Wiley, 11 May 2015.
[49] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project
on: “Overview and comparison of H.264/MPEG-4 part 10 or AVC video coding standard and audio video
standard (AVS) of China as compression standard for video” , K.V.Dhonsale, Spring 2012.
[50] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project
on: “Study and implementation of Video compression standards (H.264/AVC and Dirac).” , K.V.Dhonsale,
Spring 2012.
[51] Video Coding: HEVC and Beyond- IEEE Journal of Selected Topics In Signal Processing,vol.7, pp. 93-
1151, Dec. 2013
[52] Access the website http://www.uta.edu/faculty/krrao/dip/Courses/EE5359/index_tem.html Project
on: “Audio Video Standard for Mobile (AVS-M)”, S.Devaraju, Spring 2009.
[53] L.Yu et al, “Overview of AVS coding standards”, Signal Processing: Image Communication. Vol. 24, pp.
247 – 262 , April 2009.
[54] L.Fang et al, “Overview of AVS Standard”, Multimedia and Expo, 2004. ICME '04. 2004 IEEE
International Conference, Vol.1, pp. 423- 426, June 2004 .
[55] S.Ma et al, “Overview of IEEE 1857 video coding standard”, access the link: http://ieeexplore.ieee.org/xpl/abstractAuthors.jsp?reload=true&arnumber=6738308 [56] “P1857 - IEEE Draft Standard for Advanced Audio and Video Coding”, pp.1-190, 26 October 2012.
[57] H. Zhang and Z.Ma , “Fast Intra Mode Decision for HEVC”, IEEE Transactions on CSVT, vol.24 , pp.660-668, April 2014.
[58] B. Bross et al, “High Efficiency Video Coding (HEVC) Text Specification Draft 10”, Document JCTVC-L1003, ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCTVC), Mar. 2013 available on http://phenix.itsudparis.eu/jct/doc_end_user/current_document.php?id=7243
[59] N. Ahmed et al, “Discrete CosineTransform”, IEEE Transactions on Computers, Vol.C-23, pp.90-93, January1974.
[60] Special issue on emerging research and standards in next generation video coding, IEEE Transactions on Circuits and Systems for Video Technology (CSVT), vol. 23, pp. 2009-2142, Dec. 2013.
[61] D.Grois et al, “Performance Comparison of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC Encoders”, available on: http://iphome.hhi.de/marpe/download/Performance_HEVC_VP9_X264_PCS_2013_preprint.pdf
[62] B. Zeng and J. Fu, “Directional discrete cosine transforms - A new framework for image coding”, IEEE Trans. on Circuits and Systems for Video Technology, vol. 18, no. 3, pp. 305-313, Mar. 2008.
Page 40 of 40
[63] M.T. Pourazad et al, “HEVC:The new gold standard for video compression”, IEEE consumer electronics magazine ,vol.1, no.7, pp.36-46, July 2012.
[64] Y.-K. Chen et al, “Implementation of H.264 encoder and decoder on personal computers,” J. Visual Commun. Image Representation, vol. 17, no. 2, pp. 509 – 532, Apr. 2006.
[65] K. Chen et al, “Efficient SIMD optimization of HEVC encoder over X86 processors,” APSIPA ASC 2012 Asia-Pacific , pp. 1-4, Dec. 2012.
[66] H. S. Malvar et al, “Low-complexity transform and quantization in H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 598 – 603, Jul. 2003.
[67] S. Vasudevan and K.R. Rao, “Combination method of fast HEVC Encoding”, IEEE ECTI-CON-2014, Korat, Thailand, May 2014.
[68] HEVC tutorial by I.E.G. Richardson: http://www.vcodex.com/h265.html
[69] H.264 tutorial by I.E.G. Richardson: https://www.vcodex.com/h264.html
[70] A. Puri et al, “Video coding using the H.264/MPEG-4 AVC compression standard”, Signal Processing: Image Communication, vol. 19, pp. 793-849, Oct. 2004.
[71] M. Khan et al, “An adaptive complexity reduction scheme with fast prediction unit decision for HEVC Intra encoding”, IEEE ICIP, pp. 1578-1582, Sept. 2013.
[72] S. Jeong et al., High efficiency video coding for entertainment quality. ETRI J vol. 33, pp.145–154, 2011.