Towards Predictable Multi-Tenant Shared Cloud Storage



David Shue*, Michael Freedman*, and Anees Shaikh✦

*Princeton   ✦ IBM Research


Shared Services in the Cloud

[Figure: tenants Z, Y, T, and F sharing cloud services such as DynamoDB, S3, EBS, and SQS]

Co-located Tenants Contend For Resources

[Figure: tenants Z, Y, T, and F co-located on a shared storage service; two tenants send 2x demand]

Resource contention = unpredictable performance

Towards Predictable Shared Cloud Storage

Per-tenant Resource Allocation and Performance Isolation

[Figure: tenant requests spread across the storage servers (SS) of a shared storage service]

[Figure: tenants Zynga, Yelp, Foursquare, and TP assigned hard rate limits of 80, 120, 160, and 40 kreq/s on the shared storage service]

Hard limits are too restrictive and achieve lower utilization

[Figure: weighted shares instead: w_z = 20%, w_y = 30%, w_t = 10%, w_f = 40%; tenant Z demands 40% but receives a 30% rate, while tenant F demands only 30%]

Goal: per-tenant max-min fair share of system resources
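
To make the goal concrete, here is a minimal Python sketch (not from the talk) of weighted max-min fair sharing. The weights and the Z/F demands loosely follow the figure above; the demands for Y and T are assumed.

```python
def weighted_max_min(capacity, weights, demands):
    """Compute weighted max-min fair allocations.

    Tenants whose demand is below their weighted share keep their demand;
    leftover capacity is redistributed among the rest in proportion to weight.
    """
    alloc = {}
    remaining = dict(demands)          # tenants not yet satisfied
    cap = capacity
    while remaining:
        total_w = sum(weights[t] for t in remaining)
        # Tentative weighted share of the remaining capacity.
        share = {t: cap * weights[t] / total_w for t in remaining}
        # Tenants that need less than their share are frozen at their demand.
        frozen = {t for t in remaining if remaining[t] <= share[t]}
        if not frozen:
            alloc.update(share)        # everyone is bottlenecked: split by weight
            break
        for t in frozen:
            alloc[t] = remaining.pop(t)
            cap -= alloc[t]
    return alloc

# Illustrative numbers loosely following the slide: weights 20/30/10/40 (%),
# Z demands more than its weight, F demands less (Y and T demands assumed).
weights = {"Z": 0.2, "Y": 0.3, "T": 0.1, "F": 0.4}
demands = {"Z": 0.4, "Y": 0.3, "T": 0.1, "F": 0.3}
print(weighted_max_min(1.0, weights, demands))
# Z ends up at 30%, above its 20% weight, because F's unused share is redistributed.
```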

PISCES: Predictable Shared Cloud Storage

• PISCES Goals
- Allocate per-tenant fair share of total system resources
- Isolate tenant request performance
- Minimize overhead and preserve system throughput

• PISCES Mechanisms (operating from minutes down to seconds and microseconds)
- Partition Placement (Migration)
- Local Weight Allocation
- Replica Selection
- Fair Queuing

Place Partitions By Fairness Constraints

[Figure: tenants A-D, each with a global weight (weight_A ... weight_D) and a keyspace divided into partitions of varying popularity, spread across PISCES nodes 1-4]

[Figure: a controller computes a feasible partition placement subject to per-tenant fairness constraints (Rate_A < w_A, Rate_B < w_B, Rate_C < w_C), migrating partitions away from over-loaded nodes]
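
A minimal sketch of how a controller might compute a feasible placement: a first-fit-decreasing bin-packing pass that caps each node's expected demand. This is only an illustrative heuristic under an assumed per-node demand cap, not the PISCES placement algorithm (which the talk leaves as future work); the partition IDs and demand numbers are made up.

```python
def place_partitions(partitions, nodes, node_capacity):
    """Greedy first-fit-decreasing placement of keyspace partitions.

    partitions: list of (partition_id, expected_demand) pairs.
    Returns {node: [partition_id, ...]} or raises if no feasible fit is found.
    """
    load = {n: 0.0 for n in nodes}
    placement = {n: [] for n in nodes}
    # Place the most popular partitions first.
    for pid, demand in sorted(partitions, key=lambda p: -p[1]):
        # Pick the least-loaded node that still has room for this partition.
        target = min((n for n in nodes if load[n] + demand <= node_capacity),
                     key=lambda n: load[n], default=None)
        if target is None:
            raise ValueError(f"no feasible placement for partition {pid}")
        placement[target].append(pid)
        load[target] += demand
    return placement

# Illustrative example: four nodes, each allowed 1.0 units of expected demand,
# and partitions of varying popularity drawn from tenants A-D.
parts = [("a1", 0.6), ("a2", 0.2), ("b1", 0.5), ("b2", 0.3),
         ("c1", 0.4), ("c2", 0.4), ("d1", 0.7), ("d2", 0.3)]
print(place_partitions(parts, ["n1", "n2", "n3", "n4"], node_capacity=1.0))
```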

PISCES Mechanisms

[Timescale diagram: Partition Placement (Migration) runs at the minutes timescale, driven by the controller]

Allocate Local Weights By Tenant Demand

[Figure: each tenant's global weight is split into per-node local weights, e.g. w_A = w_a1 + w_a2 + w_a3 + w_a4; the observed rates R_A > w_A, R_B < w_B, R_C < w_C, R_D > w_D show that demand does not match an even split]

Swap weights to minimize tenant latency (demand)

[Figure: the controller swaps local weight between tenants on each node (A→C, C→A, D→B, C→D, B→C) to match local weights to per-node demand]
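
A minimal sketch of a demand-driven weight swap on a single node, in the spirit of the slide: shift a small amount of local weight from a tenant whose rate falls short of its local weight to one whose rate exceeds it. The step size, tenant names, and rates are assumptions, and this is a simplified stand-in for the weight swapping PISCES performs, not its exact algorithm.

```python
def swap_local_weights(local_weights, rates, step=0.05):
    """One round of local weight swapping on a single node: move `step` weight
    from the tenant with the most unused weight to the tenant with the most
    unmet demand, keeping the node's total weight constant."""
    slack = {t: local_weights[t] - rates[t] for t in local_weights}
    donor = max(slack, key=slack.get)     # rate well below local weight
    taker = min(slack, key=slack.get)     # rate above local weight
    if slack[donor] <= 0 or slack[taker] >= 0:
        return local_weights              # no useful swap available
    delta = min(step, slack[donor])
    updated = dict(local_weights)
    updated[donor] -= delta
    updated[taker] += delta
    return updated

# Illustrative node: A and D exceed their local weights, B and C do not.
w = {"A": 0.25, "B": 0.25, "C": 0.25, "D": 0.25}
r = {"A": 0.32, "B": 0.15, "C": 0.20, "D": 0.30}
print(swap_local_weights(w, r))           # weight flows from B toward A
```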

Achieving System-wide Fairness

[Timescale diagram: Partition Placement (Migration) at the minutes timescale and Local Weight Allocation at the seconds timescale, both driven by the controller]

Select Replicas By Local Weight

[Figure: a request router (RR) initially splits a tenant's GET requests 1/2-1/2 across two replicas; when tenant C's weight is over-loaded on one node and under-utilized on another, replica selection (RS) based on node latency shifts the split to 1/3-2/3]
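
A minimal sketch of latency-driven replica selection in the spirit of the slide: the request router keeps a send fraction per replica and nudges it toward an inverse-latency split, so an over-loaded replica gradually receives fewer GETs. The update rule, gain, replica names, and latencies are illustrative assumptions, not the exact PISCES policy.

```python
import random

def update_fractions(fractions, latencies, gain=0.3):
    """Blend the current per-replica request fractions toward an
    inverse-latency split, so slower (over-loaded) replicas get less traffic."""
    inv = {r: 1.0 / latencies[r] for r in latencies}
    total = sum(inv.values())
    target = {r: inv[r] / total for r in inv}
    return {r: (1 - gain) * fractions[r] + gain * target[r] for r in fractions}

def pick_replica(fractions):
    """Weighted random choice of a replica for the next GET request."""
    return random.choices(list(fractions), weights=list(fractions.values()))[0]

# Illustrative: start at a 1/2-1/2 split; replica r1 is over-loaded (slower).
frac = {"r1": 0.5, "r2": 0.5}
lat = {"r1": 6.0, "r2": 3.0}          # assumed latency measurements, in ms
for _ in range(5):
    frac = update_fractions(frac, lat)
print(frac)                           # drifts toward the 1/3 vs 2/3 split
print(pick_replica(frac))
```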

Achieving System-wide Fairness

[Timescale diagram: Partition Placement (Migration) at minutes and Local Weight Allocation at seconds, both driven by the controller; Replica Selection at microseconds, run by the request routers (RR)]

Fair Queuing: Queue Tenants By Dominant Resource

[Figure: two tenants with equal weights, one bandwidth-limited and one request-limited, contend at a shared out-bytes bottleneck; out-bytes-only fair sharing is compared with dominant resource fair sharing across request rate, in-bytes, and out-bytes]
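
A minimal sketch of dominant-resource fair scheduling in the spirit of the slide: charge each request against the tenant's request, in-byte, and out-byte usage, and always serve the backlogged tenant with the smallest weighted dominant share. This is a simplified DRF-style scheduler with assumed capacities and request sizes, not the PISCES fair-queuing implementation.

```python
from collections import deque

# Assumed per-window resource capacities, used only to normalize shares.
CAPACITY = {"req": 1000.0, "in_bytes": 1e6, "out_bytes": 1e6}

def dominant_share(usage, weight):
    """Weighted dominant share: largest normalized resource usage, divided by weight."""
    return max(usage[r] / CAPACITY[r] for r in CAPACITY) / weight

def drf_schedule(queues, weights, budget=600):
    """Serve up to `budget` requests, always picking the backlogged tenant
    with the smallest weighted dominant share."""
    usage = {t: {r: 0.0 for r in CAPACITY} for t in queues}
    served = []
    for _ in range(budget):
        backlogged = [t for t in queues if queues[t]]
        if not backlogged:
            break
        tenant = min(backlogged, key=lambda t: dominant_share(usage[t], weights[t]))
        req = queues[tenant].popleft()
        usage[tenant]["req"] += 1
        usage[tenant]["in_bytes"] += req["in"]
        usage[tenant]["out_bytes"] += req["out"]
        served.append(tenant)
    return served

# Illustrative workloads: tenant A is bandwidth-limited (large GET replies),
# tenant B is request-limited (small replies); equal weights.
queues = {"A": deque({"in": 50, "out": 8000} for _ in range(500)),
          "B": deque({"in": 50, "out": 100} for _ in range(500))}
served = drf_schedule(queues, weights={"A": 1.0, "B": 1.0})
print(served.count("A"), served.count("B"))
# B is served far more often, since A's dominant resource (out-bytes)
# is charged against A's share.
```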

Achieving System-wide Fairness

[Timescale diagram: Partition Placement (Migration) at minutes and Local Weight Allocation at seconds (controller); Replica Selection at microseconds (request routers); Fair Queuing at microseconds (storage nodes)]

Evaluation

• Does PISCES achieve system-wide fairness?
• Does PISCES provide performance isolation?
• Can PISCES adapt to shifting tenant distributions?

[Testbed: 8 YCSB client machines and 8 PISCES storage nodes connected by a Gigabit Ethernet top-of-rack switch]

PISCES Achieves System-wide Fairness

[Figure: per-tenant GET request rate (kreq/s) over time; ideal fair share is 110 kreq/s with 1 kB requests]

Membase (no queue): 0.68 MMR, 3.77-5.30 ms
FQ only: 0.51 MMR, 3.31-5.77 ms
Replica Selection only: 0.79 MMR, 3.80-4.69 ms
FQ + Replica Selection: 0.97 MMR, 4.05-4.23 ms
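
For reference, MMR is read here as the min-max ratio of per-tenant throughput: the lowest tenant rate divided by the highest, so 1.0 is a perfectly even split. A minimal sketch with made-up rates around the 110 kreq/s ideal share:

```python
def min_max_ratio(rates):
    """Min-max ratio of per-tenant throughput: 1.0 means a perfectly even split."""
    return min(rates) / max(rates)

# Made-up per-tenant GET rates (kreq/s) around the 110 kreq/s ideal fair share.
print(round(min_max_ratio([78, 102, 115, 134]), 2))   # uneven split, low MMR
print(round(min_max_ratio([107, 109, 111, 112]), 2))  # near-ideal split, MMR close to 1
```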

PISCES Provides Strong Performance Isolation

2x demand vs. 1x demand tenants (equal weights)

[Figure: per-tenant GET request rate (kreq/s) over time; each configuration lists its MMR and the latency ranges of the two tenant classes]

Membase (no queue): 0.42 MMR, 3.94-4.99 ms / 3.27-4.14 ms
FQ only: 0.50 MMR, 4.29-6.15 ms / 2.41-3.72 ms
Replica Selection only: 0.50 MMR, 4.16-4.27 ms / 3.57-4.04 ms
FQ + Replica Selection: 0.97 MMR, 5.40-5.45 ms / 2.78-2.83 ms

Equal Global Weight, Differing Resource Workloads

PISCES Achieves Dominant Resource Fairness

[Figure: per-tenant bandwidth (Mb/s), GET request rate (kreq/s), and latency (ms) over time; the tenants reach 76% of effective bandwidth and 76% of effective throughput at their respective bottleneck resources]

Differentiated Global Weights (4-3-2-1)

PISCES Achieves System-wide Weighted Fairness

[Figure: per-tenant GET request rate (kreq/s), latency (ms), and fraction of requests over time]

Equal Global Weights, Staggered Local Weights

PISCES Achieves Local Weighted Fairness

[Figure: per-tenant GET request rate (kreq/s), latency (ms), and fraction of requests over time]

Differentiated (Staggered) Local Weights

PISCES Achieves Local Weighted Fairness

[Figure: per-tenant GET request rates (kreq/s) over time on Servers 1-4, for local weights w_t = 4, 3, 2, and 1]

PISCES Adapts To Shifting Tenant Distributions

[Figure: tenants 1-4 shift demand across servers 1-4 while their weights alternate between 1x and 2x; plots show per-server tenant weights, global tenant throughput, and tenant latency (1 s average)]


Conclusion

•PISCES achieves:

- System-wide per-tenant fair sharing

- Strong performance isolation

- Low operational overhead (< 3%)

•PISCES combines:

- Partition Placement: finds a feasible fair allocation (TBD)

- Weight Allocation: adapts to (shifting) per-tenant demand

- Replica Selection: distributes load according to local weights

- Fair Queuing: enforces per-tenant fairness and isolation


Future Work

•Implement partition placement (for T >> N)

•Generalize the fairness mechanisms to different services and resources (CPU, Memory, Disk)

•Scale evaluation to a larger test-bed (simulation)

Thanks!