Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

12
in The Planning South Bay SRE Meetup - 20160809 The Data

Transcript of Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

Page 1: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

in The PlanningSouth Bay SRE Meetup - 20160809

The Data

Page 2: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

How many lines in our monthly AWS bill ?

QU

IZ

A. 7 MillionB. 70 MillionC. 700 MillionD. 7 Billion

Page 3: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

Tracking ~2,400 applicationsRunning on ~80 different instance typesDeployed in ~30 zonesAcross ~60 accounts

Page 4: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

How to inform and scale our capacity planning operations ?

-- GOALS --

provide context and transparencyfocus on actionable insights

automate most repetitive tasks

Capacity Planning Analytics

RohanSharma

TorioRisianto

Sebastiende Larquier

Page 5: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

provide context and transparency

Build intuitive and interactive dashboards

Page 6: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

e.g.: Devices Impact on Capacity

Page 7: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

How many apps in the Netflix environment ?

A. 24B. 240C. 2,400D. 24,000

QU

IZ Information at Scale

Page 8: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

focus on actionable insights

e.g., Keeping an Eye on Efficiency

For each metric, check:

➔ significant deviation➔ change in trend

Link each card to a detailed dashboard.

Page 9: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

Cloud Capacity Analytics at Scale

Requirements: ❏ monitor and classify our current footprint❏ forecast future capacity needs based on current usage and expected

events (hardware migrations, holidays, …)❏ optimize on-demand and reserved capacity for reliability and cost

Picsou [piksu]1. Scrooge McDuck in french.2. Netflix’s comprehensive cloud

capacity analytics tool.

Page 10: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

Cloud Capacity Planning at Scale

Page 11: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

Cloud Capacity Planning at Scale

Requirements ❏ monitor and classify our current footprint❏ forecast future capacity needs based on current usage and expected

events (hardware migrations, holidays, …)❏ optimize on-demand and reserved capacity for reliability and cost

Next 6-12 Months:

capacity alerts and forecastrecommendations for optimal reserved capacity

enforce recommendations

Page 12: Cloud Capacity Planning Tooling - South Bay SRE Meetup Aug-09-2016

Thank You