Figures src1

28
0.4 0.8 1.2 1.6 3.156623 64604449 Normalized IPC

Transcript of Figures src1

Page 1: Figures src1

IIX

PVC SSC

SAD BFS

NW

PFF

MUM

SPMV KM

LUD

WP

AES SCP

PVR JPEG

LIB

NQU FFT

BP

BLK

SD2 SD1

RAY LK

C NN

PFN

HOT MM

STO CP

avg0.4

0.8

1.2

1.6

3.15662364604449 Minimum TLP Optimal TLP

Nor

mal

ized

IPC

Page 2: Figures src1

On Chip Network

C

L2DRAM

L2DRAM

L2DRAM

L2DRAM

L1CL1

CL1

CL1

CL1

CL1

Page 3: Figures src1

Application

Kernel Kernel

CTA

Kernel...CTA……CTA

Warp…

Warp

Thre

ad

...

Page 4: Figures src1

IIX

PVC SSC

SAD BFS

NW

PFF

MUM

SPMV KM

LUD

WP

AES SCP

PVR JPEG

LIB NQU

FFT

BP BLK

SD2

SD1 RAY

LKC

NN PFN

HOT

MM STO

CP avg

0%20%40%60%80%

100%Ac

tive

Tim

e Ra

tio (R

ACT

)

Page 5: Figures src1

1 2 3 4-2.22044604925031E-16

0.2

0.4

0.6

0.8

1

1.2

1.4AES

IPC lat_rt util

Page 6: Figures src1

1 2 3 4 5 6 7 8-2.22044604925031E-16

0.2

0.4

0.6

0.8

1

1.2

1.4CP

IPC lat_rt util

Page 7: Figures src1

1 2 3 4 5 6 7 8-2.22044604925031E-16

0.2

0.4

0.6

0.8

1

1.2

1.4JPEG

IPC lat_rt util

Page 8: Figures src1

1 2 3 4 5 6 7 80.7

0.75

0.8

0.85

0.9

0.95

1

1.05MM

IPC lat_rt util

Page 9: Figures src1

1 2 3 4 5 6 7 80

0.20.40.60.8

11.21.4

AES MMJPEG CP

Number of CTAs

Nor

mal

ized

IPC

Page 10: Figures src1

1 2 3 4 5 6 7 80

0.20.40.60.8

11.21.4

AES MMJPEG CP

Number of CTAs

Nor

mal

ized

late

ncy

Page 11: Figures src1

1 2 3 4 5 6 7 80

0.20.40.60.8

11.21.4

AES MMJPEG CP

Number of CTAs

Nor

mal

ized

core

uti-

lizati

on

Page 12: Figures src1

Idle

CTA 1CTA 2

CTA 3CTA 4Co

re 1

CTA 5CTA 6

CTA 7CTA 8Co

re 2

Page 13: Figures src1

CTA 1

CTA 2

CTA 6

CTA 7Core

1

CTA 3

CTA 4 CTA 5

CTA 8

Idle

Core

2

Page 14: Figures src1

H LC_idle

L M HL

M

H

- -- --

C_mem

C_st

all

: Increment n: Decrement n

Page 15: Figures src1

0 4 8 12 160.5

0.6

0.7

0.8

0.9

1

Type1Type2

Number of cores turned off

Rela

tive

perf

orm

ance

Page 16: Figures src1

0 4 8 12 160.5

0.6

0.7

0.8

0.9

1

Type1Type2

Number of cores turned off

Rela

tive

perf

orm

ance

Page 17: Figures src1

IIX

PVC SSC

SAD BFS

NW

PFF

MUM

SPMV KM

LUD

WP

AES SCP

PVR JPEG

LIB

NQU FFT

BP

BLK

SD2 SD1

RAY LK

C NN

PFN

HOT MM

CCP STO

CP avg

0.80.9

11.11.21.31.41.51.6

2.3 2.9 3.01.9

3.22.9 3.0 2.1

DYNCTA Optimal TLPN

orm

aliz

ed IP

C

Page 18: Figures src1

IIX

PVC SSC

SAD BFS

NW

PFF

MUM

SPMV KM

LUD

WP

AES SCP

PVR JPEG

LIB

NQU FFT

BP

BLK

SD2 SD1

RAY LK

C NN

PFN

HOT MM

STO CP

avg0.8

1

1.2

1.4

1.6

2.32.9 3.0

1.93.2

2.9 3.0 2.12.9

TL DYNCTA Optimal TLPN

orm

aliz

ed IP

C

Page 19: Figures src1

IIX

PVC SSC

SAD BFS

NW

PFF

MUM

SPMV KM

LUD

WP

AES SCP

PVR JPEG

LIB

NQU FFT

BP

BLK

SD2 SD1

RAY LK

C NN

PFN

HOT MM

STO CP

avg0

0.4

0.8

1.2

1.6Round Trip Fetch Latency Core Utilization

Nor

mal

ized

Val

ue

Page 20: Figures src1

00.5

11.5

22.5

33.5

44.5

00.10.20.30.40.50.60.70.80.91

Average n Nopt RACT

Num

ber o

f CTA

s

Activ

e tim

e ra

tio (R

ACT)

Page 21: Figures src1

0123456789

00.10.20.30.40.50.60.70.80.91

Average n Nopt RACT

Num

ber o

f CTA

s

Activ

e tim

e ra

tio (R

ACT)

Page 22: Figures src1

0

1

2

3

4

5

6

7

00.10.20.30.40.50.60.70.80.91

Average n Nopt RACT

Num

ber o

f CTA

s

Activ

e tim

e ra

tio (R

ACT)

Page 23: Figures src1

0123456789

00.10.20.30.40.50.60.70.80.91

Average n Nopt RACT

Num

ber o

f CTA

s

Activ

e tim

e ra

tio (R

ACT)

Page 24: Figures src1

IIX

PVC SSC

SAD BFS

NW

PFF

MUM

SPMV KM

LUD

WP

AES SCP

PVR JPEG

LIB

NQU FFT

BP

BLK

SD2 SD1

RAY LK

C NN

PFN

HOT MM

STO CP

avg012345678

N DYNCTA OptimalAv

erag

e nu

mbe

r of

CTAs

Page 25: Figures src1

IIX

PVC SSC

SAD BFS

NW

PFF

MUM

SPMV KM

LUD

WP

AES SCP

PVR JPEG

LIB

NQU FFT

BP

BLK

SD2 SD1

RAY LK

C NN

PFN

HOT MM

STO CP

avg0.4

0.6

0.8

1

1.2

1.4DYNCTA DYNCORE with power gating DYNCORE without power gating

Nor

mal

ized

Pow

er

Page 26: Figures src1

IIX

PVC SSC

SAD BFS

NW

PFF

MUM

SPMV KM

LUD

WP

AES SCP

PVR JPEG

LIB

NQU FFT

BP

BLK

SD2 SD1

RAY LK

C NN

PFN

HOT MM

STO CP

avg0.6

11.41.82.22.6

32.9

3.13.6

3.23.0

2.6

DYNCTA DYNCORE with power gating DYNCORE without power gatingN

orm

aliz

ed E

nerg

y Effi

cien

cy

Page 27: Figures src1

4 8 12 160

0.20.40.60.8

11.21.41.61.8

2IPC power energy

Number of Cores Turned Off

Nor

mal

ized

Val

ue

Page 28: Figures src1

64 1210

0.20.40.60.8

11.21.41.61.8

2IPC power energy

Number of Nodes in the System

Nor

mal

ized

Val

ue