Resource Sharing Beyond Boundaries - Apache Myriad

43
Resource Sharing Beyond Boundaries Mohit Soni Santosh Marella

Transcript of Resource Sharing Beyond Boundaries - Apache Myriad

Page 1: Resource Sharing Beyond Boundaries - Apache Myriad

Resource Sharing Beyond Boundaries   Mohit Soni Santosh Marella

Page 2: Resource Sharing Beyond Boundaries - Apache Myriad
Page 3: Resource Sharing Beyond Boundaries - Apache Myriad
Page 4: Resource Sharing Beyond Boundaries - Apache Myriad

Adam Bordelon Anoop Dawar Ben HindmanBrandon Gulla Danese Cooper Darin JohnsonJim Klucar Kannan Rajah Ken SipeLuciano Resende Meghdoot Bhattacharya Paul ReedRenan DelValle Ruth Harris Shingo OmuraSwapnil Daingade Ted Dunning Will OchandarenaYuliya Feldman Zhongyue Luo

Page 5: Resource Sharing Beyond Boundaries - Apache Myriad

AgendaWhat's up with Datacenters these days?Apache Mesos vs. Apache Hadoop/YARN?Why would you want/need both?Resource Sharing with Apache Myriad

Page 6: Resource Sharing Beyond Boundaries - Apache Myriad

What's running on your datacenter?Tier 1 servicesTier 2 servicesHigh Priority BatchBest Effort, backfill

Page 7: Resource Sharing Beyond Boundaries - Apache Myriad

RequirementsProgramming models based on resources, not machinesCustom resource typesCustom scheduling algorithms:Fast vs. careful/slowLightweight executors, fast task launch timeMulti-tenancy, utilization, strong isolation

Page 8: Resource Sharing Beyond Boundaries - Apache Myriad

Hadoop and MoreSupport Hadoop/BigData ecosystemSupport arbitrary (legacy) processes/containersConnect Big Data to non-Hadoop apps, share data, resources

Page 9: Resource Sharing Beyond Boundaries - Apache Myriad

Mesos from 10,000 feetOpen Source Apache project Cluster Resource Manager Scalable to 10,000s of nodes Fault-tolerant, no SPOF Multi-tenancy, Resource Isolation Improved resource utilization

Page 10: Resource Sharing Beyond Boundaries - Apache Myriad

Mesos is more thanYet Another Resource Negotiator

Long-running services; real-time jobs Native Docker; cgroups for years;

Isolate cpu/mem/disk/net/other Distributed systems SDK;

~200 loc for a new app Core written in C++ for performance,

Apps in any language

Page 11: Resource Sharing Beyond Boundaries - Apache Myriad
Page 12: Resource Sharing Beyond Boundaries - Apache Myriad
Page 13: Resource Sharing Beyond Boundaries - Apache Myriad
Page 14: Resource Sharing Beyond Boundaries - Apache Myriad

Why two resource managers?

Page 15: Resource Sharing Beyond Boundaries - Apache Myriad
Page 16: Resource Sharing Beyond Boundaries - Apache Myriad

Static Partitioning sucksHadoop teams fine with isolated clusters, but Ops team unhappy; slow to provisionResource silos, no elasticityWant to run Hadoop on the same infrastructure, without interrupting Tier-1 servicesWant multi-tenancy, resource sharing/isolation

Page 17: Resource Sharing Beyond Boundaries - Apache Myriad
Page 18: Resource Sharing Beyond Boundaries - Apache Myriad

Introducing Myriad

Page 19: Resource Sharing Beyond Boundaries - Apache Myriad

Myriad OverviewMesos Framework for Apache YARNMesos manages DC, YARN manages HadoopCoarse and fine grained resource sharing

Page 20: Resource Sharing Beyond Boundaries - Apache Myriad

Resource Sharing

Page 21: Resource Sharing Beyond Boundaries - Apache Myriad
Page 22: Resource Sharing Beyond Boundaries - Apache Myriad
Page 23: Resource Sharing Beyond Boundaries - Apache Myriad
Page 24: Resource Sharing Beyond Boundaries - Apache Myriad
Page 25: Resource Sharing Beyond Boundaries - Apache Myriad
Page 26: Resource Sharing Beyond Boundaries - Apache Myriad
Page 27: Resource Sharing Beyond Boundaries - Apache Myriad
Page 28: Resource Sharing Beyond Boundaries - Apache Myriad
Page 29: Resource Sharing Beyond Boundaries - Apache Myriad
Page 30: Resource Sharing Beyond Boundaries - Apache Myriad
Page 31: Resource Sharing Beyond Boundaries - Apache Myriad
Page 32: Resource Sharing Beyond Boundaries - Apache Myriad
Page 33: Resource Sharing Beyond Boundaries - Apache Myriad
Page 34: Resource Sharing Beyond Boundaries - Apache Myriad
Page 35: Resource Sharing Beyond Boundaries - Apache Myriad
Page 36: Resource Sharing Beyond Boundaries - Apache Myriad
Page 37: Resource Sharing Beyond Boundaries - Apache Myriad
Page 38: Resource Sharing Beyond Boundaries - Apache Myriad
Page 39: Resource Sharing Beyond Boundaries - Apache Myriad

Demo

Page 40: Resource Sharing Beyond Boundaries - Apache Myriad

Myriad improves Mesos Tighter integration with Hadoop frameworks

like HBase, Hive, PigBorrow resources from Hadoop when traffic spikes for tier-1 servicesBackfill unused resource capacity with best-effort Hadoop jobs

No Mesos code changes necessary

Page 41: Resource Sharing Beyond Boundaries - Apache Myriad

Myriad improves Hadoop Elastic scaling Fault-tolerant: Maintain NM capacity Share resources with other workloads,

improve resource utilization High SLA hadoop jobs unaffected No YARN/Hadoop code changes

Page 42: Resource Sharing Beyond Boundaries - Apache Myriad

Other FeaturesRM failover/discovery using Marathon/Mesos-DNSDistribution of hadoop binariesWeb InterfaceMyriad scheduler HA, task reconciliation(in progress)Ability to launch Job History Server(in progress)Your favorite feature here!