Introduction to H.264 Video Codec Standard: Hardware...

Introduction to H.264 Video Codec Standard: Hardware Challenges and Opportunities

Ahmed Abu-Hajar, Ph.D

DIGITAVID, Inc

San Jose, CA

abuhajar@digitavid.net

(408) 506-2776

PO Box: 720998

San Jose, CA 95172

Ahmed Abu-Hajar, Ph.D. Digitavid, Inc

Presentation Outline

� Introduction to Video Processing

� Overview of H.264-AVC Standard

� Overview of Hardware Implementation

� Challenges and Opportunities

Overview of Video Processing

� Objectives

– Getting familiar with digital images and digital

video sequences

– Getting familiar with common acronyms and

terms commonly used among video engineers

Q) What Is Digital Image?

A) Digital image is a 2-D signal

which is composed of very small picture elements called

pixels

Lena Image

2×2 Pixels

Q) What Is Pixel?

A) Pixel is very small rectangular

area which has a uniform intensity value

Q) How Images Display Colors?

A) Each pixel has three primary colors:

– Red

– Green

– Blue

– Human’s vision is less sensitive to Red and Blue

– In Video the colored pixels are down sampled for every 2x2 pixels into:

� Two green samples called Luminance

� One Red sample called Chrominance CR

� One blue sample chrominance CB

Q) What is Video?

A) Video is a sequence of images

– Each image is called frame

– Each frame has equal displaying time

∆t = (1/30)s or ∆t = (1/25)

Frame 1 Frame 2 Frame 3 Frame 4 Frame 5

More on Frames– Each frame is decomposed into two fields

� Even Field: Includes the even rows of the frame� Odd Field: Includes the odd rows of the frame

– Two types of fields

� Progressive Scanned Field: The even and odd fields are captured at the same time

� Interlaced Scanned Field: The even and the odd fields are captured at different times

LenaEven Field Odd Field

Segmenting Each Frame into Blocks

Macroblock (MB)� Each Frame is segmented into big blocks called macroblocks

� Each macroblock contains :

– One 16×16 Y component

– One 8×8 CB component

– One 8×8 CR component

Basics About Compression

– More probable values requires less bits (Entropy Coding)

– Frames (or images) are smooth (low frequency)

� Transform the frame into frequency domain (or DCT-Domain)

– Most of the information energy is concentrated in FEW COEFFICIENTS

– LOW ENERGY coefficients are truncated to ZEROS

– The more we truncate to zero, the lower the quality of the video

– Only few coefficients needed to be coded �� Compression

More on Compression

– The Objective is to create many zeros without lowering the video quality

– Prediction is used to predict the current pixel value.

– Prediction reduces Quantization (truncation) error

– Two types of predictions

� Intra Prediction : Predicts current pixel value from the current frame

� Inter prediction: Predicts current pixel value from previously coded frames

Introduction to H.264-AVC Standard

Scope of the Standard

The main objective is to efficiently compress video sequences

EncodingPre-Processing

Post-Processing

Original

Sequence

Decompressed

Video SequenceDecoding

Compressed

Video Sequence

Coder Stucture� Contains two layers

– Video Coding Layer (VCL)� Compress video content

– Network Abstract Layer (NAL)� Organize data to according to

networks protocols

Video Coding Layer

Data Partitioning

Coded Macroblock

Network Abstraction Layer (NAL)

H.320 MP4FF H3.23/IP MPEG-2 ect.

Slice/Partition

Control

Video Coding Layer (VCL)� Two Coding Modes

– Intra Coding: Does not use previously coded frames� Segment current frame into blocks� For each block: Predict pixels values from previously coded pixel� Transform the predicted block� Quantize the coefficients � Entropy code the quantized coefficients

– Inter Coding: Uses previously coded frames to predict current frame

� Segments the current frame into blocks� For each block in the current frame, search for the best block from

previously coded frames that minimizes the error. ( Motion Estimation)

Block Diagram

Prediction Quantize

Coefficients

Motion

Compensation

De-blocking

Filter &

Inverse

Prediction

Motion

Estimation

Algorithm

Current Block

Entropy

Coding

Transform

Current Block

Inverse

Quantization

Inverse

Transform

Compressed

Intra Coding

Inter Coding

Current Block

Block DiagramCurrent Block

Prediction Quantize

Coefficients

Motion

Compensation

De-blocking

Filter &

Inverse

Prediction

Motion

Estimation

Algorithm

Entropy

Coding

Transform

Inverse

Quantization

Inverse

Transform

Compressed

Intra Coding

Inter Coding

� Frame Format– Monochrome – contains one color– Colored frame ( Y, Cr, Cb)

� 4:4:4 ��same size for (Y, Cr,Cb)� 4:2:2 ��Y is full size, (Cr,Cb) same height half width� 4:2:0 �� Y is full size, (Cr, Cb) half height and half width

� Each frame is segmented into Macroblocks– Y component (16x16) – Cb and Cr components (8x8) blocks

� Each macroblock is segmented into (16x8), (8x16), (8x8), (8x4),(4x8) and (4x4)

16x16 8x16 16x8 8x8 4x8 8x4 4x4

Current Block

Block DiagramCurrent Block

Quantize

Coefficients

Motion

Compensation

De-blocking

Filter &

Inverse

Prediction

Motion

Estimation

Algorithm

Entropy

Coding

Transform

Inverse

Quantization

Inverse

Transform

Compressed

Intra Coding

Inter Coding

Prediction

� Intra Prediction Modes:� Lumina and chromina components are diffenent

– I_PCM : Skip prediction and transform– 16x16 Prediction Mode

– Vertical, Horizontal, DC or Plane– Select the best mode

– 4x4 Prediction Mode– Vertical, Horizontal, DC, Diagonal Down Left, diagonal Down Right, Vertical Right,

Horizontal down, Vertical Left, Horizontal Up

� Coder must select the prediction mode with the least bit rate� Each mode uses addition and shift operation

Select

Prediction

Current Block

Prediction Mode N

Prediction Mode 2

Prediction Mode 1

Predicted Current

Block Diagram

Current Block

Prediction Quantize

Coefficients

Motion

Compensation

De-blocking

Filter &

Inverse

Prediction

Motion

Estimation

Algorithm

Entropy

Coding

Inverse

Quantization

Inverse

Transform

Compressed

Intra Coding

Inter Coding

Transform

Block Transform� Compact the energy of the coefficients into the low frequency

components� Applied to 4x4 blocks� Use integer Transforms ( to reduce Hardware Complexity)

00 01 02 03 00 01 02 03

10 11 12 13 10 11 12 13

20 21 22 23 20 21 22 23

30 31 32 33 30 31 32 33

1 1 1 1 1 2 1 1

2 1 1 2 1 1 1 2

1 1 1 1 1 1 1 2

1 2 2 1 1 2 1 1

c c c c p p p p

− − − − =

− − − −

Pixels

Coefficients

Quantize Coefficients Quantize Coefficients Quantize Coefficients Quantize Coefficients

� One of 52 (QP) Quantizers is selected

� Requires Look-Up Tables

Entropy Coding

� Selects one of Two Coders

– Context-Adaptive Variable Length Coding (CAVLC)(Superior but Complex)

– Exp-Golomb code with regular decoding properties( Simple but Inferior)

Block Diagram

Current Block

Prediction

Motion

Compensation

De-blocking

Filter &

Inverse

Prediction

Motion

Estimation

Algorithm

Transform

Inverse

Quantization

Inverse

Transform

Compressed

Intra Coding

Inter Coding

Quantize

Coefficients

Entropy

Coding

Block Diagram

Current Block

Prediction

De-blocking

Filter &

Inverse

Prediction

Motion

Estimation

Algorithm

Transform

Inverse

Quantization

Inverse

Transform

Compressed

Intra Coding

Inter Coding

Quantize

Coefficients

Entropy

Coding

Motion

Compensation

Block Diagram

Current Block

Prediction

De-blocking

Filter &

Inverse

Prediction

Motion

Estimation

Algorithm

Transform

Inverse

Quantization

Inverse

Transform

Compressed

Intra Coding

Inter Coding

Quantize

Coefficients

Entropy

Coding

Motion

Compensation

Inter CodingInter CodingInter CodingInter Coding

� Predicts the current block from previously coded frames

� The process is called Motion Estimation (ME)

� The block moves due to the moving object within the scene

K+1 FrameK Frame

Inter CodingInter CodingInter CodingInter Coding

� Motion search algorithms– Full search

– Three step search

– Diamond search

– Many more

K+1 FrameK Frame

Inter CodingInter CodingInter CodingInter Coding– Search within the most four previous framesSearch within the most four previous framesSearch within the most four previous framesSearch within the most four previous frames

K+4 FrameK+3 FrameK+2 FrameK+1 FrameK Frame

Q) How is the motion Estimation Vector is selected?Q) How is the motion Estimation Vector is selected?Q) How is the motion Estimation Vector is selected?Q) How is the motion Estimation Vector is selected?

– A) Based on the sum of Absolute Difference between two A) Based on the sum of Absolute Difference between two A) Based on the sum of Absolute Difference between two A) Based on the sum of Absolute Difference between two

blocksblocksblocksblocks

,( , ) 1,( , )

( , ) ( , ) ( , )N M

k r s k u v

SAD u v p x y p x y+

= −∑∑

pk,(r,s) is the pixel value in the kth frame within block (r,s).

pk+1,(u,v) is the pixel value in the (k+1)th frame within block (u,v)

The Block with the least SAD is Selected

Q) Motion Estimation is Very COMPLEX, why?Q) Motion Estimation is Very COMPLEX, why?Q) Motion Estimation is Very COMPLEX, why?Q) Motion Estimation is Very COMPLEX, why?

– Must calculate SAD for each candidate blockMust calculate SAD for each candidate blockMust calculate SAD for each candidate blockMust calculate SAD for each candidate block

– It compares the best candidate blockIt compares the best candidate blockIt compares the best candidate blockIt compares the best candidate block

– H.264 uses variable blockH.264 uses variable blockH.264 uses variable blockH.264 uses variable block----size for motion estimationsize for motion estimationsize for motion estimationsize for motion estimation

– H.264 interpolate values between samples for (1/8) pixel H.264 interpolate values between samples for (1/8) pixel H.264 interpolate values between samples for (1/8) pixel H.264 interpolate values between samples for (1/8) pixel

resoultionresoultionresoultionresoultion

16x16 8x16 16x8 8x8 4x8 8x4 4x4

Hardware Overview

� Top-down Design Methodology

– Specifications � Set by the standard

– Subsystem� Break the system into modules

– Register� Implement subsystem using registers

– Gate� Implement registers using gates

– Transistor

– Layout

– Fabrications

– Testing and Verifications

Specification

“System”

Sub-system

Register

Transistor

Hierarchical of Top-Down Design.

Hardware Overview

� Most common hardware architectures– Single Instruction Multiple Data (SIMD)

� One Control Unit Multiple processing units

� Load data into storage elements

� Efficient for microprocessors design

– Systolic

� Pipelined structure

� Each processing unit has its own control unit

� More flexible

� Less interconnects

� Synchronized data flow through I/O

Hardware Overview

� Types of Systolic Sturctures

Hardware Overview

� Types of Systolic Structures

•Other Structures do exist

Example of Hardware Architecture HDTV IEEE Transaction on Video Technology June 2006

Hardware Overview

� Used four pipeline stages

� Intensive local SRAM units

“Caches” requirements for each stage

� High level of parallelism for

each stage “Tree structure”

Summary

� We briefly presented basics of image processing

� We briefly presented H.264-AVC Structure

� We briefly presented basics of systolic Hardware design

� We briefly presented most recent HDTV H.264-AVC chip

Introduction to H.264 Video Codec Standard: Hardware...

Documents

Transcript of Introduction to H.264 Video Codec Standard: Hardware...

Generating of packet loss in the video sequences encoded with H.264 video codec. Authors: Błażej Szczerba, Damian Ziobro.

TILE-LEVEL PARALLELISM FOR H.264/AVC CODEC USING …

VC0738 Brief Data Sheet - silicondevice.com€¦ · V GA CV BS L CD BT.656 Output GPI O JTA G DDR A udi o Ouput V i deo Pre- & Post-Processi ng M JPEG Codec H.264 Codec CPU A RM 926

china.origin.xilinx.com · 2019. 12. 11. · H.264/H.265 Video Codec Unit v1.2 2 PG252 December 10, 2019 Table of Contents IP Facts Chapter1:Overview Introduction

H.264/H.265 Video Codec Unit v1 - 电子创新网赛灵思社区xilinx.eetrend.com/files/2019-08/wen_zhang_/100044748... · 2019. 8. 16. · H.264/H.265 Video Codec Unit v1.2 6 PG252

Implementation of H.264 using JM and Intel IPP Software · H.264/AVC/MPEG-4 Part 10 H.264 is a block-oriented motion compensation based codec. It is most commonly used formats for

Cisco Webex Codec Plus Administrator Guide Collaboration ......Webex Codec Plus

Andreas Unterweger Salzburg University of Applied Sciences · 2013-02-13 · H.264 video codec developer at the Fraunhofer Institute for Integrated Circuits in Erlangen, Germany in

. Codec - USC Dana and David Dornsife College of … Concents / Video Parameters. Codec - The language in which the video data is written Example: H.264 AKA: Compressor, format, encoder.

AVC/H.264 Advanced Video Coding Codec, Profiles & System · AVC/H.264 Performance *G. Sullivan, P. Topiwala and A. Luthra, "The H.264/AVC Advanced Video Coding Standard: Overview

Implementing Entropy Codec for H.264 Video Compression ...

Great Codec Question - TMCnet · 2013-06-27 · Great Codec Question: Should WebRTC adopt the 264/265 codecs or not and what would it take for the 264/265 codecs to be acceptable?

H.264/H.265 Video Codec Unit v1.2 LogiCORE IP Product Guide · 2021. 1. 15. · H.264/H.265 Video Codec Unit v1.2 Solutions LogiCORE IP Product Guide Vivado Design Suite PG252 (v2020.2)

H.264 Hardware Codec 4Ch Network Digital Video Recorder · PDF fileH.264 Hardware Codec 4Ch Network Digital Video ... of User’s Manual has been updated ... inputon Rear Panel of

8 Channel H.264 Standalone - Atlantis-Land · DVR autonomo (basato su Linux 2.6) a 8 canali con interfaccia LAN Il codec H.264 utilizzato per la registrazione dei video (sino a 25

HD WDR IP Camera. Key Features 1. Real Time & HD/Full HD Video - 1280x720 @30fps from 1.3M Sony Sensor 2. Triple Video Codec Formats - H.264 Baseline.

LG INNOTEK SECURITY SOLUTION...Model LVM310i (iPad) LVM110i (iPhone) LVM110A (Android) Protocol HTTP, RTP / RTSP HTTP, RTP / RTSP HTTP, RTP / RTSP Media Codec H.264 / H.265 H.264

Evolution 05 and 12 - Oncam · Model EVO-05-SS2 EVO-12-SS2 Video Stream 1 Codec H.264 High, Level 5 / MJPEG (streams configurable) H.264 in 9.6 MP, 6 MP, 4 MP, 2 MP Video Stream 1

MPEG-4 AVC/H.264 Video Codecs Comparisoncompression.ru/video/codec_comparison/pdf/msu_mpeg_4_avc_h26… · Codec Preset Name Preset VideoConference “High Quality” -quality 3 VideoConference

D1 Video Codec Solution for NVR.ppt [호환 모드] • One(1)-Channel D1 Video Encoder (NTSC, PAL) or Decoder • H.264 Video Codec • RTP/RTCP, UDP Protocol Support • RTSP, TCP/UDP