Mnk hsa ppt

Heterogeneous System Architecture Overview M.Naveen Kumar 13241D5710 M.Tech_vlsi


Page 1: Mnk hsa ppt

Heterogeneous System Architecture Overview

M.Naveen Kumar

13241D5710

M.Tech_vlsi

Page 30: Mnk hsa ppt

OpenCL platform model

Page 31: Mnk hsa ppt

Execution model

Page 32: Mnk hsa ppt

Memory model

Page 33: Mnk hsa ppt


HSA Foundation
• Founded in June 2012
• Developing a new platform for heterogeneous systems
• www.hsafoundation.com
• Specifications under development in working groups
• Our first specification, the HSA Programmers Reference Manual, is already published and available on our web site
• Additional specifications for System Architecture, Runtime Software and Tools are in process

Page 34: Mnk hsa ppt

HSA Foundation Membership - August 2013


Founders

Promoters

Supporters

Contributors

Academic

Associates

Page 35: Mnk hsa ppt

© Copyright 2012 HSA Foundation. All Rights Reserved.


HSA: An Open Platform
• Open Architecture, membership open to all
– HSA Programmers Reference Manual
– HSA System Architecture
– HSA Runtime
• Delivered via royalty free standards
– Royalty Free IP, Specifications and APIs
• ISA agnostic for both CPU and GPU
• Membership from all areas of computing
– Hardware companies
– Operating Systems
– Tools and Middleware

Page 36: Mnk hsa ppt


Inflections in Processor Design

Single-Core Era
Chart: Single-thread Performance vs. Time, marked "we are here"
Enabled by: Moore's Law, Voltage Scaling
Constrained by: Power, Complexity

Multi-Core Era
Chart: Throughput Performance vs. Time (# of processors), marked "we are here"
Enabled by: Moore's Law, SMP architecture
Constrained by: Power, Parallel SW, Scalability

Heterogeneous Systems Era
Chart: Modern Application Performance vs. Time (Data-parallel exploitation), marked "we are here"
Enabled by: Abundant data parallelism, Power efficient GPUs
Temporarily Constrained by: Programming models, Comm. overhead

Programming models shown beneath the three charts: Assembly, C/C++, Java … | pthreads, OpenMP / TBB … | Shader, CUDA, OpenCL, C++ and Java

Page 37: Mnk hsa ppt

[Figure: three system diagrams labeled CPU, APU, and APU With HSA, drawn from CPU cores, a GPU, and blocks labeled Memory, CPU Memory, and GPU Memory.]

Page 38: Mnk hsa ppt

EXAMPLE WORKLOADS

Page 39: Mnk hsa ppt

HAAR FACE DETECTION

Cornerstone technology for Computer Vision

Page 40: Mnk hsa ppt


Looking for Faces in all the Right Places

Quick HD Calculations
Search square = 21 x 21
Pixels = 1920 x 1080 = 2,073,600
Search squares = 1900 x 1060 = ~2 Million
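
The search-square count follows from sliding a 21 x 21 window over the frame one pixel at a time. A minimal sketch of that arithmetic, assuming a stride of one pixel and windows that must fit fully inside the frame; the function name `count_search_squares` is illustrative and does not come from the slides:

```python
def count_search_squares(width, height, window=21, stride=1):
    """Count window positions that fit fully inside a width x height frame."""
    if width < window or height < window:
        return 0
    across = (width - window) // stride + 1   # horizontal positions
    down = (height - window) // stride + 1    # vertical positions
    return across * down


pixels = 1920 * 1080
squares = count_search_squares(1920, 1080)
print(pixels, squares)  # 2073600 2014000
```

With these assumptions the HD frame gives 1900 x 1060 positions, matching the roughly 2 million search squares quoted on the slide.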

Page 41: Mnk hsa ppt


Looking for different size faces by scaling the video frame

More HD Calculations
70% scaling in H and V
Total Pixels = 4.07 Million
Search squares = 3.8 Million
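
The multi-scale totals can be approximated by repeatedly shrinking the frame to 70% of its width and height and summing over every level that still fits a 21 x 21 window. The sketch below rests on assumptions the slide does not state: dimensions are truncated to integers at each level, the window moves one pixel at a time, and scaling stops once either side drops below 21 pixels. The name `pyramid_totals` is hypothetical.

```python
def pyramid_totals(width, height, scale=0.7, window=21):
    """Sum pixels and search squares over successively downscaled frames."""
    total_pixels = 0
    total_squares = 0
    level = 0
    while True:
        # Assumed rounding rule: truncate scaled dimensions to whole pixels.
        w = int(width * scale ** level)
        h = int(height * scale ** level)
        if w < window or h < window:
            break
        total_pixels += w * h
        total_squares += (w - window + 1) * (h - window + 1)
        level += 1
    return level, total_pixels, total_squares


levels, total_pixels, total_squares = pyramid_totals(1920, 1080)
print(levels, total_pixels, total_squares)
```

Under these assumptions the totals land close to the slide's figures of about 4.07 million pixels and 3.8 million search squares; the exact values depend on the rounding and stopping rules chosen.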

Page 42: Mnk hsa ppt


HAAR Cascade stages

[Figure: Stage N and Stage N+1 each evaluate several features (labeled k, l, m, p, q, r in the diagram). After a stage: "Face still possible?" Yes: continue to the next stage. No: REJECT FRAME.]
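
The stage-by-stage decision in the diagram can be sketched as a loop that stops at the first stage that rules out a face. This is an illustration only: the grouping of features into stages, the thresholds, and the feature scores are all made-up values, and summing feature scores against a threshold is an assumed stage test rather than the deck's exact formula.

```python
def run_cascade(window_features, stages):
    """Evaluate stages in order and stop at the first one that rejects.

    stages: list of (feature_names, threshold) pairs (hypothetical layout).
    window_features: dict of feature name -> score for one search square.
    Returns (is_face, stages_evaluated).
    """
    for index, (names, threshold) in enumerate(stages, start=1):
        stage_score = sum(window_features[name] for name in names)
        if stage_score < threshold:
            return False, index  # early out: face no longer possible
    return True, len(stages)


# Two illustrative stages using the feature labels from the diagram.
stages = [(("k", "l", "m"), 1.5), (("p", "q", "r"), 2.0)]
face_like = {"k": 0.9, "l": 0.8, "m": 0.7, "p": 0.9, "q": 0.8, "r": 0.6}
background = {"k": 0.1, "l": 0.2, "m": 0.1, "p": 0.0, "q": 0.0, "r": 0.0}

print(run_cascade(face_like, stages))   # (True, 2)
print(run_cascade(background, stages))  # (False, 1)
```

Most search squares contain no face, so most of them leave after the first stage or two and only a few pay for the full cascade.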

Page 43: Mnk hsa ppt


22 cascade stages, early out between each

[Figure: STAGE 1 → STAGE 2 → … → STAGE 21 → STAGE 22 → FACE CONFIRMED; a rejection at any stage exits to NO FACE.]

Final HD Calculations
Search squares = 3.8 million
Average features per square = 124
Calculations per feature = 100
Calculations per frame = 47 GCalcs

Calculation Rate
30 frames/sec = 1.4 TCalcs/second
60 frames/sec = 2.8 TCalcs/second
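
The per-frame and per-second figures are a straight product of the numbers on the slide. A short sketch reproducing them; the variable and function names are illustrative:

```python
# Figures taken from the slide.
search_squares = 3.8e6
features_per_square = 124
calcs_per_feature = 100

calcs_per_frame = search_squares * features_per_square * calcs_per_feature


def calc_rate(frames_per_second):
    """Calculations per second at a given frame rate."""
    return calcs_per_frame * frames_per_second


print(f"{calcs_per_frame / 1e9:.1f} GCalcs per frame")  # 47.1 GCalcs per frame
for fps in (30, 60):
    print(f"{fps} frames/sec = {calc_rate(fps) / 1e12:.1f} TCalcs/second")
```

The product is 47.12 GCalcs per frame, which the slide rounds to 47, giving about 1.4 and 2.8 TCalcs/second at 30 and 60 frames per second.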

… and this only gets front-facing faces

Page 44: Mnk hsa ppt


Unbalancing due to Early exits

• When running on the GPU, we run each search rectangle on a separate work item

• Early out algorithms, like HAAR, exhibit divergence between work items
– Some work items exit early
– Their neighbors continue
– SIMD packing suffers as a result

[Figure: four work items, numbered 1 to 4, with each lane marked Live or Dead.]
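
The cost of divergence can be estimated with a simple model: work items are packed into fixed-width SIMD groups, and each group keeps running until its slowest lane has finished, so lanes that exited early sit idle. The sketch below assumes a SIMD width of 4 to match the four lanes in the figure and treats one cascade stage as one unit of work; both are simplifications, and `simd_utilization` is a hypothetical helper name.

```python
def simd_utilization(exit_stages, simd_width=4):
    """Fraction of issued SIMD lane-steps that do useful work.

    exit_stages[i] is the cascade stage at which work item i stops.
    Each group of simd_width work items runs until its slowest lane is done.
    """
    useful = 0
    issued = 0
    for start in range(0, len(exit_stages), simd_width):
        group = exit_stages[start:start + simd_width]
        useful += sum(group)                # stages actually evaluated
        issued += max(group) * simd_width   # lane-steps occupied by the group
    return useful / issued if issued else 1.0


# Three lanes exit early while one neighbor runs all 22 stages.
print(simd_utilization([1, 1, 2, 22]))
# No divergence: every lane does the same amount of work.
print(simd_utilization([22, 22, 22, 22]))  # 1.0
```

In the first case only 26 of the 88 issued lane-steps are useful, which is the packing loss the slide describes.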

Page 46: Mnk hsa ppt


The HSA Future

• Architected heterogeneous processing on the SOC

• Programming of accelerators becomes much easier

• Accelerated software that runs across multiple hardware vendors

• Scalability from smartphones to supercomputers on a common architecture

• GPU acceleration of parallel processing is the initial target, with DSPs and other accelerators coming to the HSA system architecture model

• Heterogeneous software ecosystem evolves at a much faster pace

• Lower power, more capable devices in your hand, on the wall, in the cloud

Page 47: Mnk hsa ppt

THANK YOU