Physical Planning to Embrace Interconnect Dominance in Power...

45
Physical Planning to Embrace Interconnect Dominance in Power and Performance Renshen Wang University of California, San Diego Ph.D. (Computer Science) Final Examination June 3rd, 2010 1

Transcript of Physical Planning to Embrace Interconnect Dominance in Power...

Page 1: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Physical Planning to Embrace Interconnect

Dominance in Power and Performance

Renshen Wang

University of California, San Diego

Ph.D. (Computer Science) Final Examination

June 3rd, 2010

1

Page 2: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Publications

Renshen Wang, Evangeline Young and Chung-Kuan Cheng. “Complexity of 3-D Floorplans by Analysis on Graph Cuboidal Dual Hardness” To appear in ACM Transactions on Design Automation of Electronic Systems

Renshen Wang, Yulei Zhang, Nan-Chi Chou, Evangeline Young, Chung-Kuan Cheng and Ronald Graham “Bus Matrix Synthesis based on Steiner Graphs for Power Effcient System-on-Chip Communications”. Invited submission to IEEE Transactionson Computer-Aided Design of Integrated Circuits and Systems

Renshen Wang, Evangeline F. Y. Young, Ronald Graham, Chung-Kuan Cheng “Physical Synthesis of Bus Matrix for High Bandwidth Low Power On-chip Communications” ACM International Symposium on Physical Design, 2010.

RenshenWang, Nan-Chi Chou, Bill Salefski and Chung-Kuan Cheng. “Low Power Gated Bus Synthesis using Shortest-Path Steiner Graph for System-on-Chip Communications” ACM/IEEE Design Automation Conference, 2009.

Renshen Wang and Chung-Kuan Cheng. “On the Complexity of Graph Cuboidal Dual Problems for 3-D Floorplanning of Integrated Circuit Design” ACM Great Lake Symposium on Very-large-scale Integration, 2009.

2

Page 3: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Publications

Renshen Wang and Chung-Kuan Cheng. “Octilinear Redistributive Layer Routing in Bump Arrays” ACM Great Lake Symposium on Very-large-scale Integration, 2009.

Renshen Wang, Takumi Okamoto and Chung-Kuan Cheng. “Symmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations” IEEE International Conference on Computer Design, 2009.

Renshen Wang, Evangeline F. Y. Young and Chung-Kuan Cheng. “Representing Topological Structures for 3-D Floorplanning” IEEE International Conference on Communications, Circuits and Systems, 2009.

Renshen Wang, Evangeline F. Y. Young, Yi Zhu, Fan Chung Graham, Ronald Graham and Chung-Kuan Cheng. “3-D Floorplanning Using Labeled Tree and Dual Sequences” ACM International Symposium on Physical Design, April 12-16, 2008.

Renshen Wang, Rui Shi and Chung-Kuan Cheng. “Layer Minimization of Escape Routing in Area Array Packaging” IEEE/ACM International Conference on Computer-Aided Design, November 5-9, 2006.

3

Page 4: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Outline of Talk

Introduction: interconnect dominance and how it affects integrated circuit design

Complexity of floorplanning in 3-dimensional spaceGraph-to-floorplan formulationFrom 2-D to 3-D (and 2.5-D), increases in hardness

Low power bus architecturesBus gating technique to reduce power on wiresHigh bandwidth and low power bus matrix, based on shortest-path Steiner graph

Conclusions and future work

4

Page 5: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Chapter 1 Introduction

Computer performance has been growing with Moore’s law

Until hit the “power wall”Moore’s law still providing larger silicon resourcesProcessor cores stop scaling up (multicore chips)

Power is the critical factor limiting performance

Interconnect wires are dominating power consumption, and still (relatively) scaling up [HMH01, The future of wires]

Interconnect is the bottleneck of power and performance of integrated circuits

5

Page 6: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Reducing Interconnect Length

Current chips on single layer siliconMoving to 3-D:

Much smaller footprintMuch shorter interconnect length

Challenges on 3-D integrated circuitsFabrication, heat removal, ...Possible increase on design complexity

Floorplanning: early stage of physical design2-D: placing rectangles in a plane3-D: placing cuboids in a box How much more difficult?

6

Page 7: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Reducing Involved Wires

Devices are (roughly) placed after the floorplanning stageInterconnect distances are fixedConnected together by a bus, bus matrix or network

Communication should take shortest pathsPath length = Manhattan distance

Not the case in current bus architectures

7

Page 8: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Chapter 2

Complexity of Floorplanning in 3-DA problem of “cuboidal dual”, 3 variations by dimensions

Showing the hardness gap between 2-D and 3-D

8

Page 9: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Preliminaries and Formulations

Adjacency graph of a set of cuboids: a vertex each cuboid, and an edge for each pair of contacting cuboids

Reverse -- graph to cuboid set -- is nontrivial

Given a graph G = (V, E), can we find a set of cuboids as V with contact relations as E?

9

Page 10: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Preliminaries and Formulations

“Rectangular dualization” in [KK84]A rectangular dissection with triangular adjacency graph

General 3-D cuboidal dual of graph G = (V, E)n axis-parallel cuboids (n = |V|) without shared spaceCi and Cj are contacting iff. vi and vj are adjacent

2.5-D (layered 3-D) cuboidal dualEach cuboid with height interval [l - 1, l]l indicates the layer

2-D (1-layer 2.5-D) cuboidal dualAll cuboids are in height interval [0, 1]

10

Page 11: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

3-D Cuboidal Dual

NP-hard. Reduction from 3-colorability [Kar72]Construct G from 3-colorability instance G3C = (W, E’)

7-vertex gadget, to have a direction (cuboids don’t have)

Lemma 1. In its cuboid dual, the cuboids of two opposite vertices are on opposite sides of v0.

11

Page 12: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

“No Same Color”

Define d1,4(N) among x, y, z as the direction of gadget Nd1,4(N)’s orthogonal plane has overlapping projections from cuboid v1 and v4

d1,4(N) has 3 choices (3 colors)To enforce two gadgets into different d1,4 directions

12

Page 13: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

13-Vertex Gadget

Lemma 3. Adding 3 pairs of vertices to the 7-vertex gadgetpair 1 connected to {v1, v2, v4} and {v1, v5, v4} (green)pair 2 connected to {v2, v3, v5} and {v2, v6, v5} (blue)pair 3 connected to {v3, v1, v6} and {v3, v4, v6} (red)

The resulting 13-vertex gadget must be like in the figure.

13

Page 14: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

2-Alignment

To enforce two gadgets into the same d1,4 direction

2-alignment on13-vertex gadgets

vA must be abovev2 and v’2

vB must be belowv2 and v’2

14

Page 15: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

2-Alignment and 3-Alignment

To enforce two gadgets into the same d1,4 direction2-alignment (only aligns d1,4), along d2,5 or d3,6

3-alignment (align all the 3 axes), along d1,4

Any pair of gadgets in space can be alignedLarge set of gadgets can be aligned

15

Page 16: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

3-D Cuboidal Dual

Theorem 1. 3-colorability reduces to 3-D cuboidal dual.

Proof: 3-colorability instance G3C = (W = {w1, w2, ..., wn}, E’)5n gadgets (13-vertex) for each vertex wi

A chain of 2-aligned n gadgets si,1, si,2, ..., si,n

si,j [2-aligns] t1,i,j [3-aligns] t2,i,j [2-aligns] t3,i,j [2-aligns] ui,j

For each edge (wi, wj) in E’, enforce ui,j and uj,i into different directions

Constructed graph G has a cuboid dual iff. G3C is colorableAll the 5n gadgets of each wi are in the same directionA cuboidal dual implies a 3-color solution

16

Page 17: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

3-Color to 3-D Cuboidal Dual

A 3-color solution implies a cuboidal dualPlace all the t2,i,j gadgets along a 45 degree line (top view)

17

Page 18: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

3-Color to 3-D Cuboidal Dual

Each edge (wi, wj) in G3C has a unique height

No blockage on the connection between ui,j and uj,i

d1,4(ui,j) doesn’t equal d1,4(uj,i) by the 3-color solution

Cuboidal dual constructed

18

Page 19: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

2-D Cuboidal Dual

2-D “rectangular dual” solved in [KK84]A dissection of a rectangle (no empty space)Can be decided by existence of a “4-completion”

Empty space does not make big difference on the problem

19

Page 20: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

2-D Cuboidal Dual

Theorem 2: A graph G has a 2-D cuboidal dual iff. G can be drawn planar with no 3-vertex cycle containing interior vertex (vertices)

Can be provedby applyingconclusions in[KK84]

Triangulation

Rectangular dual

Adding gaps

20

Page 21: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

2.5-D Cuboidal Dual

Define a “layered graph” G = (V, E, L:V->{1, ..., k})Each vertex is assigned with a layer L(vi)Each edge (vi, vj), |L(vi) - L(vj)| <= 1

Most restricted form of “2.5-D” cuboidal dual

NP-hard with 3 or more layersReduction from Planar 3-SAT [Coo71] [Lic84]Construct G from Gp3SAT = (U, C, E’)

21

Page 22: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

3-Layer 2.5-D Cuboidal Dual

Theorem 3. Planar 3-SAT reduces to 2.5-D cuboidal dual with 3 layers.Proof: By Gp3SAT = (U, C, E’) with n variables and m clauses

Each variable has m vertex pairs in the same directionEach clause has a clause gadget placed “vertical”Appearance of ui in cj -- path Ri,j between gadget ui and cj

22

Page 23: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

3-Layer 2.5-D Cuboidal Dual

A 2.5-D cuboidal dual implies a planar 3-SAT solution

A planar 3-SAT solution implies a 2.5-D cuboidal dualGadgets (variables and clauses) placed as Lemma 4Paths constructed by the “rectilinear path embedding”

23

Page 24: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Chapter 3

Low Power Bus ArchitecturesOptimizing on-chip communication architectures

Showing advantages of optimized bus matrices

24

Page 25: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Bus Architecture Overview

Figure showing the basic AMBA AHB busMasters (start transactions) + slaves (respond passively)One master-slave connection each timePros: Simplicity (decided at system level)Cons: Low power efficiency, low bandwidth

25

Page 26: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Basic Bus Gating

Bus gating to save power (like clock gating [DIBM03])

Sticking with tree structure, adddistributed mux: to share bus linesdistributed de-mux: to mask off redundant signals

Minimum rectilinear Steiner arborescence (MRSA)

26

Page 27: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

One Step Further

Tree structures’ limitationOnly one rootMaster-to-slave paths may not be shortest

Shortest-path Steiner graphSuperposition of multiple Steiner arborescencesAll master-to-slave shortest paths (minimal power)Potential bandwidth advantage

27

Page 28: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Design Flow with Bus Gating

Traditional design flow: netlist -> placement -> routing ...Placement depends on netlist

With bus gatingNetlist (of bus components) depends on placementAdditional stage in the design flow

Placement stageConsider the bus as a big net

Bus gating stageUpdate netlist and placement

28

Page 29: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Shortest Path Steiner Graph

Single source: minimum rectilinear Steiner arborescenceNP-complete [SC00]2-approximation heuristic RSA/G [RSHS92]To merge pairs of subtrees at furthest points from source

29

Page 30: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Shortest Path Steiner Graph

With multiple sources, multiple MRSA constructionsReduce terminals using existing wires

8 nodes need connections to s2 in the figure

Construct new MRSA using existing wires

30

Page 31: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Shortest Path Steiner Graph

Revised incremental RSA/G’ algorithm (on each source)

k-IDeA iterations [CKL98]

Remove up to k nodes from candidate merging pointsThe best set of skipped nodes deletedRepeat until no improvement

31

Page 32: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Bus Matrix

One shortest path in the Steiner graph for each (si, tj) pair

Parallelization in system-on-chip requires high bandwidth

Bus matrix: multiple data transactions at a timeAssign weight (# bus lines) on each edgeWith implicit bandwidth constraint on devices

32

Page 33: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Edge Weights by Max Matching

Assume each (si, tj) pair takes a fixed path

33

Page 34: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Tradeoffs on Power and Wires

Bus matrix using the shortest path Steiner graphMinimal power (shortest paths)Maximal bandwidth (weighted edges)

Reduce edge length by merging close parallel edgesGenerally (although not necessarily) reducing wire length

34

Page 35: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Tradeoffs on Power and Wires

Heuristic of parallel segment merging

Look for segments to maximize --

And merge

35

Page 36: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Control Overhead

Controlling the switches on Steiner nodes

Control wire overhead

Switch power overhead

36

Page 37: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Experimental Results

Minimizing power v.s. minimizing wire length

Applying parallel segment merging

37

Page 38: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Experimental Results

Compared to traditional busSave >90% of power on wires, with comparable wire length (unit bandwidth requirement) [WCSC09]

Bus matrices with maximum bandwidth requirement

38

Page 39: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Chapter 4 Conclusions

Power dominates circuit performance, energy consumptionInterconnect dominates the power

How to push through the “power wall”?

To 3-D integrationA few layers of 2-D induce large complexityTopology-geometry relations not inherited

A challenge for designers and CAD tool developersHeuristic algorithms for 3-D floorplanningDemand of human intelligence, experience, etc

39

Page 40: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Conclusions

Low power bus architecturesMinimize data movement (induced wire length)Maximize wire sharingSimilarities with city traffic planning

40

Page 41: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Q & A

Thank you for your attention!

41

Page 42: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Rectilinear Path Embedding

Lemma 4. Given a planar graph with n vertices, we can place the vertices on a (n - 2)x(n - 2) integer grid, such that if each vertex has a non-zero diameter, then each edge (vi, vj) has a rectilinear path from vi to vj with no more than 2n-4 corners, and without intersections.

Proof. By Fary embedding on a (n - 2)x(n - 2) grid [Sch90]

42

Page 43: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

2.5-D Gadgets

(a) Enforcing two cuboid pairs into different directionsSimilar to the 3-D gadget case

(b) v1 and v3 must be on opposite sides of v0

Similar to the 7-vertex gadget in 3-D

43

Page 44: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

Clause Gadget

Lemma 5. The clause gadget has a 2-layer cuboidal dual iff. at least one 6-vertex gadget has v0-v3 horizontal.

6-vertex gadget acts as a diamond gadget if v0-v3 is vertical

44

Page 45: Physical Planning to Embrace Interconnect Dominance in Power …cseweb.ucsd.edu/~rewang/rewang_defense.pdf · 2010-06-08 · Gated Bus Synthesis using Shortest-Path Steiner Graph

3-Layer 2.5-D Cuboidal Dual

Ri,j starts from pij, ends at (v0,v3) of cj’s 6-vertex gadget gk, with #corners

2(m+n), if ui appears in cj in non-negated formso that (v0,v3) in gk is in the same direction of pij

2(m+n)+1, if ui appears in cj in negated formso that (v0,v3) in gk is in the orthogonal direction of pij

45