Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting...

23
Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting [email protected] www.lc-ns.com

Transcript of Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting...

Page 1: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Alternative Performance Metrics for Server RFPs

Joe TempleLow Country North Shore Consulting

[email protected]

www.lc-ns.com

Page 2: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Local Factors / Constraints

Non-FunctionalRequirements

TechnologyAdoption

StrategicDirection

CostModels

ReferenceArchitectures

Systemz

Systemx

Power WorkloadFit

This is an IBM Chart that bridges from platform selection into Performance Architecture

Page 3: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Fit for Purpose Workload Types Mixed Workload – Type 1

• Scales up• Updates to shared data

and work queues• Complex virtualization• Business Intelligence with

heavy data sharing and ad hoc queries

Parallel Data Structures – Type 3

Small Discrete – Type 4

Application Function Data Structure Usage Pattern SLA Integration Scale

Highly Threaded – Type 2

• Scales well on clusters• XML parsing• Buisness intelligence with

Structured Queries• HPC applications

• Scales well on large SMP• Web application servers• Single instance of an ERP

system• Some partitioned

databases

• Limited scaling needs• HTTP servers• File and print• FTP servers• Small end user apps

Black are design factors Blue are local factorsThis is the IBM preSales Architects ‘ view of workload types

Page 4: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Fitness Parameters in Machine Design

Can customized to machines of interest. Need to know the specific comparisons desired

These parameters were chosen to represent the ability to handle, parallel, serial and bulk data traffic.This is based on Greg Pfister’s work on workload characterization in In Search of CLusters

Page 5: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Definitions

TP - Thread Speed X ThreadsThread Speed ~ Adjusted Clock Rate

ITR - Internal Throughput Rate Peak rate as measured in benchmarksITR <= TP

ETR – External Throughput RateAverage rate as delivered in productionETR ~ ITR X Average Utilization

Page 6: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Throughput, Saturation, Capacity

6

TP Measured ITR Capacity

TP Pure Parallel CPU ITR Other resources and Serialization ETR Load and Response Time

Page 7: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Very, Very Few Clients experience ITR

Most enterprises are interested in ETR ~ Average Utilization X ITR

Most users experience response time

Page 8: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Throughput

8

Throughput: TP (Assume parallel load with no thread interactions) Saturation: Internal Throughput Rate (ITR) ITR TP when highly parallel throughput is not limited by “other” resources (I/O, Memory, Bandwidth, Software, Cache)

Capacity: External Throughput Rate (ETR) Utilization limited to meet response time.

Page 9: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Effect of using single dimension metrics.(Max Machines)

9

The “standard metrics” do not leverage cache.This leads to the pure ITR view of relative capacity on the right.

Common Metrics:ITR TPETR ITR

Power advantagedz is not price competitive

Consolidation:ETR << ITR unless loads are consolidatedConsolidation accumulates working sets Power and z advantagedCache can also mitigate “Saturation”

Page 10: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Typical x86 Consolidation8X work on 4X CPUs 2X

8 to 1 Consolidation (8 CPUs)

0%

10%

20%

30%

40%

50%

60%

70%

80%

Average 39%, Peak 76%Peak to Average = 1.95

64 to 1 Consolidation (36 CPUs)

0%

10%

20%

30%

40%

50%

60%

70%

80%

Average 61%, Peak 78%Peak to Average = 1.28

Enterprise Server Consolidation64X work on 18X CPUs 3.6 X

Single Application Server (2 CPUs)

0%

10%

20%

30%

40%

50%

60%

70%

80%

Average 21%, Peak 79%Peak to Average = 3.76

Dedicated x86 Server1 X work on 1X CPUs 1 X

Consolidation

Page 11: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

The Math Behind Consolidation

Roger’s Equation:Uavg = 1/(1+HR(avg))

WhereHR(avg) = kcN1/2

For Distribution of work: N = s (the number of servers per load)

For Consolidation of work: N =1/ n (the number of loads per server)

k is a design parameter (Service Level)c is the variability of the initial load

Page 12: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Response Time and Variability

12

Acceptable Response Time

Hi Variability

Moderate Variability

Low Variability

“No Variability”

Page 13: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

The math behind the Hockey StickUse your favorite queuing model. If you use M/M/1 or M/M/K models cSQRT(N) will be assumed to be 1.We used an estimator for M/G/1 or G/G/1

T = To(1+ c2N(u/(1-u))Notice that elements of Rogers’ equation appear In both cases N affects the variability impactWe also know that HR(u) = (1-u)/u

T = To(1+ c2N/HR(u))

Page 14: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

We have a model which uses these concepts.

It generates characteristic curves

And profiles machines

Page 15: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Bottom Line on workload fit

•“Best” is user dependent– Some dependence on “workload factors”– Mostly dependent on parallelism, size, usage pattern and

service level of loads – Small, variable loads will lean toward density– Larger, more steady loads will lean toward throughput – Need to decide figure(s) of merit

•Designers should set at least 2 requirements: – Throughput and Thread Capacity– ETR and Density– Density and Response Time– Etc.

Page 16: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Comparing Max Machines

One core per socket of Power7 is dedicated to VIO and Intel pathlength is penalized for I/O

Page 17: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

What is the figure of merit?• ITR – What we benchmark?• ETR – closer to business value ($/Day)?• Average Response Time – User experience?• Response time at Peak – speed at max load?• Stack Density – VMs/Core (Loads per core)?• Average Utilization – Efficiency of use?

None of the machines is “best across the board”Designers should specify at least 2 metrics

Page 18: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Stacked single thread workloadsMax Threads Consolidation

ModelResponse Time Parallelism

Serial 1Threads 1

SLA and Distributed Variabilityk 3.1c 2

Ndist 1

Each workload small and variable.

Z has highest density and highest speedPower has highest throughput (SMT4)

Page 19: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Bigger, more Parallel Loads

Max Threads Consolidation

ModelResponse Time Parallelism

Serial 0.1Threads 16

SLA and Distributed Variabilityk 3.1c 1

Ndist 1

Moderate Variability, Larger workloads

Power still has highest throughputz has less speed advantagez maintains density advantage

Page 20: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Very Large Parallel Loads

Max Threads Consolidation

ModelResponse Time Parallelism

Serial 0.01Threads 64

SLA and Distributed Variabilityk 3.1c .25

Ndist 1

Low Variability, Larger workloads

Power is clear winner except for density

Page 21: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .
Page 22: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

Low Country North Shore Consulting

Visit lc-ns.com or eMail Joe at [email protected]

Page 23: Alternative Performance Metrics for Server RFPs Joe Temple Low Country North Shore Consulting jtemple29588@lc-ns.com .

lc-ns work research and services• Collateral Development and tech writing• Further development of workload fit model• Application of workload fit model to specific comparisons (will not compete with

IBM).• Specification and application of benchmarks to model• Understanding tails of short interval utilization distributions• Validation of sizings• Machine positioning • Workload analysis (usage patterns, response time parallelism and load

consolidation/distribution.)• Skill transfer/ Education / Speaking on the above• Analysis/Development of Intellectual Property• Leadership Mentoring / Coaching