Optimization of Electronic Design Automation (EDA ......Nagios Kit OEL Kit OVM Kit UniPlan Kit...

Post on 21-May-2020

16 views 0 download

Transcript of Optimization of Electronic Design Automation (EDA ......Nagios Kit OEL Kit OVM Kit UniPlan Kit...

SGEKit

UniPortalKit

UniCloudKit

GangliaKit

UniSightKit

CactiKit

NagiosKit

OELKit

OVMKit

UniPlanKit

AmazonEC2Kit

RackspaceKit

GoGridKit

Everythingis a Kit

in UniCloudOEL5.XKit

CentOS5.XKit

RHEL5.XKit

BaseKit

Tortuga Framework

DHCP Server

MySQLDB

Tortuga Resource Adapter

Default

CherryPyWS

PythonCLI

OVMEC2

Rackspace

CFM SQLAlchemyApache httpd

ApacheTomcat

Log ServerKit Util

rsync

OS UtilResourceAdapters

For UniCloud

Tortuga Framework

for UniCloud GoGrid

Tortuga Storage Adapter

Nexentia NetApp

Tortuga Network Adapter

OpenVSwitch Cisco VMWareSystemImager

Default

HPCKit

VMWare

ApplicationKit 1

UniCloudKit

CactiKit

NagiosKit

AmazonEC2 Kit

RackspaceKit

GoGridKit

OEL5.XKit

CentOS5.XKit

RHEL5.XKit

BaseKit

Useror

Univa

Win7

WinServer Software Profiles

Define collectionof OS and Kits

to Create wholeOS and Application

Stack

ApplicationKit 2

MiddlewareKits

CloudKits

ApplicationKits

Kit Repo

YumRepo

distribution

MiddlewareSoftware

Application Software

OS KitsCreated

From Standard Media

WinServer

ApplicationKit 1

Win7

ApplicationKit 2

OEL5.X

NagiosKit

ApplicationKit 1

OEL5.X

BaseKit

UniCloudKit

RHEL5.X

BaseKit

AmazonEC2 Kit

distribution

MiddlewareSoftware

Application Software

Real or VirtualHardware

Amazon EC2 GoGrid VMWare Physical HW

+

Rackspace

Software Profile + Kits Hardware Profile

=

Amazon EC2

distribution

MiddlewareSoftware

Application Software

Rackspace

distribution

MiddlewareSoftware

Application Software

ordistribution

MiddlewareSoftware

Application Software

VMWare

or

Kits and Software profiles form the complete software stack that can be run on different hardware profiles.

The Same software stack is used for all HW profilesApplications, and the whole stack of software can be moved from one real or Virtual environment to another.

Optimization of Electronic Design Automation (EDA) Infrastructurewith UniCloud and Virtualization

Bill Bryce Univa, Scott Clark Deopli

s B Cccccc

P3

P2

M1M2

Timet

Hosts/Coresh

h1 h2 h3 h4 h5 h6 h7 h8 h9 h10 h11 h12

1

2

3

4

5

6

7

8

9

10

11

P1

Standard Job Layout in Batch Cluster

In infrastructure today:Pre-emption must be used to move lower priority workload soHigh Priority workload can run.Gaps and waste happens in the host allocation (unless pre-emption is used)Many Applications fail to use all of the cores on Multi-Core Machines

Gap, Pre-emption must be used

Small jobs must be pre-empted

Pre-empted jobs are 'run from the beginning' so computation is lost. Suspending jobs rarely works since high priority jobs need the memory on the machine. Multi-core machines are under utilized.

s B Ccccc c

P3P2

M1

M2

Timet

Hosts/Coresh

h1 h2 h3 h4 h5 h6 h7 h8 h9 h10 h11 h12

1

2

3

4

5

6

7

8

9

10

11

P1

Dynamic Resource Optimizationwith UniCloud

Pre-emption rarely needs to be used. Physical machines are sliced into smaller virtual machines (v).

Small jobs are packed onto collections of virtual machines.Virtual machines can be live migrated or suspended to disk

Large High priority workload runs on cluster.Work is *not* lost due to Pre-emption

Physical hosts 'h' are split into virtual hosts 'v'. These virtual hosts allow a better packing of workload onto the available physical hosts. Applications that did not take advantage of larger memory and cores on modern machines can be split into multiple Virtual Machines and smallerjobs run on those machines. All those cores you bought can now be used!!

Result:Better Utilization of Multi-Core machinesRuntime is not lost for pre-empted jobs!

v v v v v v v v v v v v v v v v v v v v

Hosts split into Virtual Machines

Bare Metal VM Slowdown

1172 1126 -4.09%

Design Compiler

Bare Metal VM Slowdown

5663 5774 1.92%

Bare Metal VM Slowdown

53,263 53,670 0.76%

Bare Metal VM Slowdown

151 153 1.31%

153 151 1%

EDA Cluster Configuration with Virtualization

Application Bare Metal VM (CPU Affinity) Slowdown

Hsim

1 job 28.78 28.21 -2.02%

2 jobs 28.48 28.95 1.62%

4 jobs 29.1 29.11 0.03%

6 jobs 31.78 32.31 1.64%

8 jobs 34.31 34.67 1.04% Synopsys® HSIM: Hierarchical Full-chip Circuit Simulation and Analysis

VM (No Affinity) Slowdown

28.4 -1.34%

28.32 -0.56%

31.03 6.22%

32.19 1.27%

34.99 1.94%

UniCloud 2.1 Architecture

Kits, Software & Hardware Profiles Application Packaging into Kits

Benchmark Results for Common EDA ApplicationsIn Virtual Machines

Simulia Benchmark Amazon EC2

!"!#

$!!"!#

%!!!"!#

%$!!"!#

&!!!"!#

&$!!"!#

'!!!"!#

'$!!"!#

(!!!"!#

($!!"!#

(# )# %*# '&#

!"#$%&

'()*+,

#)-./)

0%,1#2)'3)4'2#.)

5+,%6+7)589)!"#$%&'()*+,#)

+,&#-&"(./0123# +,&#45678#%# 49-6/90#45678# +,&#::%"(./0123#

!"!#

$!!!"!#

%!!!"!#

&!!!"!#

'!!!"!#

(!!!"!#

)!!!"!#

*!!!"!#

'# +# $)# &%#

!"#$%&

'()*+,

#)-./)

0%,1#2)'3)4'2#.)

5+,%6+7)!8)!"#$%&'()*+,#)

,-%#.%"'/01234# ,-%#56789#$# 5:.70:1#56789# ,-%#;;$"'/01234#

Compute Node Type CPU CPU Cores

Memory [GB]

Swap [GB] Interconnect

EC2 - m2.4xlargeIntel Xeon X5550 @ 2.67GHz, 8MB Cache

8 68.4 0 1Gbps Ethernet

EC2 - cc1.4xlargeIntel Xeon X5570 @ 2.93GHz, 8MB Cache

16 22 10 10Gbps Ethernet

Simulia Intel Xeon Woodcrest

Intel Xeon X5160 @ 3.0 GHz, 4MB Cache

4 16 Unknown Infiniband

• UniCloud Provides a framework for packaging software into 'Kits'.

• Kits can be automatically installed and configured onto physical nodes

• virtual nodes and even cloud instances in public cloud infrastructures.

• Kits are 'Meta-RPMs' and contain the software, meta-data describing the software,

• scripts for installing and configuring the software.

• UniCloud automates many typical installation and configuration tasks in a cluster environment including Storage, Networking and Node configuration.

• UniCloud can be used to create Virtual Data Centers on the same physical compute infrastructure.

• Each VDC is isolated from the other using VLANs.

• Users can request their own cluster environment and UniCloud can automatically create that environment.

• EDA environments are concerned with optimizing License usage and overall cluster

throughput.

• Current schedulers cannot optimize license usage without pre-emption and re-running

workload from the beginning.

• UniCloud + Virtualization can live migrate or suspend to disk entire machines running

applications.

• Applications are not killed so very little CPU cycles are wasted.

• Virtualization overhead is in the range of 2-4%.

• Project and Customer specific virtual clusters can be created on demand to ensure compute

resources meet project deadlines and goals.

• Virtualization allows for better use of multi and many core machines by running multiple

applications in small virtual machines.

• UniCloud + Virtualization can 'pack' more applications onto the existing compute resources

ensuring maximum usage of software licenses and cpu hardware.

• Running applications in Virtual Machines provides Mobility for the entire application. Similar

to a checkpoint but far more robust.

A Negative slow down indicates that the application ran slightly faster in a virtual machine than on physical hardware.