Efficient Management of Ephemeral Data in …...Serverless Spark – Add serverless properties to...

Patrick Stuedi IBM Research

Efficient Management of Ephemeral Data in Serverless Computing

● Serverless frameworks are increasingly being used for interactive analytics

Exploit massive parallelism with large number of serverless tasks

Serverless Analytics

● Serverless frameworks are increasingly being used for interactive analytics– Exploit massive parallelism with large number of serverless

Serverless Analytics

● Serverless analytics involve multiple stages of execution

● Serverless tasks need an efficient way to communicated intermediate data between different stages

Challenge: Data Sharing

ephemeral data

● Ephemeral data is exchanged directly between the tasks

In traditional analytics..

mapper0

mapper1

mapper2

mapper3

reducer0

reducer1

● Direct communication between serverless tasks is difficult

– Tasks are short lived and stateless

In serverless analytics..

mapper0

mapper1

mapper2

mapper3

reducer0

reducer1?

mapper0

mapper1

mapper2

reducer0

reducer1

mapper0

mapper1

mapper2

reducer0

reducer1

● High performance for a wide range of object sizes

Exploit massive parallelism with large number of serverless tasks

Requirements for Ephemeral Storage1

● High performance for a wide range of object sizes● Fine grain, pay what you use resource billing

Example of performance-cost

tradeoff for a serverless

video analytics application

● High performance for a wide range of object sizes● Fine grain, pay what you use resource billing● Fault-tolerance

Existing cloud storage systems do not meet the elasticity, performance and cost demands of serverless analytics jobs

● Serverless Spark – Add serverless properties to Spark (elasticity, on-

demand scaling, etc)● Pocket: elastic ephemeral storage for the cloud

– Improve applications running on serverless frameworks in the cloud (AWS λ, IBM Cloud Functions, etc)

Serverless Analytics: 2 Projects

Project 1:

Spark Serverless Motivation

AWS Lambda Spark Serverless Spark Cloud On-Premise On-Premise0

Increasing flexibilityIncreasing performance

Example: Sorting 100GB

AWS λ Databricks Databricks Spark Spark Serverless Standard On-premise On-premise++

Spark/On-Premise++: Running Apache Spark on a High-Performance Cluster using RDMA and NVMe Flash, Spark Summit’17

put your #assignedhashtag here by setting the footer in view-header/footer● Scheduler: when to best add/remove resources?

● Container startup: may have to dynamically spin up containers

● Storage overheads: – Input data needs to be fetched from remote storage (e.g., S3)

– Intermediate needs to be temporarily stored on remote storage (S3, Redis)

Why is it so hard?

I/O Overhead: Sorting 100GB

AWS Lambda Spark Serverless Spark Cloud Spark Cluster Spark HPC0

500 Shuffle I/OComputeInput/Output

Spark Cluster Spark HPC0

Shuffle overheads are significantly higher when intermediate data is stored remotely

38%3%59% AWS λ Databricks Databricks Spark Spark

Serverless Standard On-premise On-premise++ Spark Spark On-premise On-premise++

Instead of improving performance of serverless frameworks, can we…● ...add serverless properties to Spark?

– Elasticity, on-demand scaling– Sharing of a Spark resources (compute, memory) among users

● Use Case:– Enable sharing of Spark deployment among many users in a company,

research lab, etc.● Challenge:

– Maintain original Spark performance

Spark Serverless: Idea

Spark Serverless: What’s missing?

SparkContext

Executor

Worker

In-memorycache(MemStore)

Filesystem

Executor

Worker

Executor

Worker

Spark Serverless: What’s missing?

SparkContext

Executor

Worker

In-memorycache(MemStore)

Filesystem

Executor

Worker

Executor

Worker

loosingIntermediatedata/state!Scale-down

Disaggregation of Ephemeral Data

SparkContext

Executor

Worker

Executor

Worker

Executor

Worker

Network Fabric

DisaggregatedStorage

DRAM tier

Flash tier

Disaggregation of Ephemeral Data

SparkContext

Executor

Worker

Executor

Worker

Executor

Worker

Network Fabric

DisaggregatedStorage

DRAM tier

Flash tier

Scale-downevent Data/state

availablewhen

executordisappears

Spark-Serverless Architecture

Driver HCS/Scheduler

ExecutorDRAMtier

Intermediate data

Intermediatedata

send schedule

register

Flashtier

send job DAG

register

assignapplication

register

launch task

crail.apache.org

Architecture Overview

ExecutorDRAMtier

send schedule

register

Flashtier

send job DAG

register

assignapplication

register

launch task

crail.apache.org

Intermediatedata

Intermediate data

ExecutorDRAMtier

send schedule

register

Flashtier

send job DAG

register

assignapplication

register

launch task

crail.apache.org

Intermediatedata

Intermediate data

ExecutorDRAMtier

send schedule

register

Flashtier

send job DAG

register

assignapplication

register

launch task

Executors getdynamically

re-assigned toapps/drivers

crail.apache.org

Intermediatedata

Intermediate data

Video: Putting things together

Application 1:GridSearch

Application 2:SQL TPC-DS

Application 3:SQL TPC-DS

HCLScheduler

Resource view:fraction of resourceseach app consumes

Executor view:Which app an executorcurrently runs

Scheduling SnapshotE

Grid SearchSQL

put your #assignedhashtag here by setting the footer in view-header/footer● Compute cluster size: 8 nodes: IBM Power8 Minsky

● Storage cluster size: 8 nodes, IBM Power8 Minsky

● Cluster hardware:– DRAM: 512 GB

– Storage: 4x 1.2 TB NVMe SSD

– Network: 10Gb/s Ethernert, 100Gb/s RoCE

– GPU: NVIDIA P100, NVLink

● Workload– SQL: TCP-DS

Let’s look at performance...

Spark-SQL: TPC-DS (Query #87)(long running query)

Spark Cluster Simple Serverless HCS/CRAIL/TCP HCS/CRAIL/RDMA0

VanillaCluster

HCS/CrailTCP,10Gb/s

HCS/CrailTCP,100Gb/s

HCS/CrailRDMA,100Gb/s

Efficiently disaggregating ephemeral data enables Spark cluster to grow and shrink without a performance cost

Runtime [seconds]

Short-running queries benefit from the shared (already-up) Spark deployment

Spark-SQL: TPC-DS (Query #3)(short running query)

Warm serverless

Crail Serverless

Spark Cluster

0 5 10 15 20 25 30 35

queryruntime

jobruntime

● Context: Serverless analytics in the cloud – AWS λ, IBM Cloud Functions, Azure Functions

● Current practice for storing ephemeral data:– S3:

● High latencies for small data sets– Redis, AWS ElasticCache:

● Inconvenient for storing large objects● No dynamic scaling● Costly (DRAM)

● Can we use Apache Crail?– Not as is, no dynamic scaling

Project 2:

Serverless Analytics in the Cloud

Pocket

Resource usage ($/hr)

Pocket dynamicallyrightsizes storageresources (nodes,

media) in an attemptto find a spot with agood performance

price ratio

● An elastic distributed data store for ephemeral data sharing in serverless analytics

How Pocket works

Pocket: Resource Utilization● Comparing Pocket to S3 and Redis

Autoscaling a Pocket Cluster

put your #assignedhashtag here by setting the footer in view-header/footer● Serverless frameworks are increasingly being used for interactive analytics

● Efficiently managing ephemeral data is important for serverless analytics

2 Projects:

● Spark-serverless– Add support to Spark for fine-grained on-demand scaling

– Permit growing/shrinking of Spark executors by disaggregating shuffle data using Apache Crail

● Pocket– Elastic distributed data store for ephemeral data sharing in serverless analytics

– Can be used together with frameworks like AWS λ, IBM Cloud Functions, etc.

Conclusion

put your #assignedhashtag here by setting the footer in view-header/footer● Pocket: Ephemeral Storage for Serverless Analytics, OSDI’18

● Navigating Storage for Serverless Computing, Usenix ATC’18

● Crail: A High-Performance I/O Architecture for Distributed Data Processing, IEEE Data Bulletin 2017

● Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash, Spark Summit’17

● Serverless Machine Learning using Crail, Spark Summit’18

● Apache Crail, http://crail.apache.org

References

Thanks to

Ana Klimovic, Yawen Wang, Michael Kaufmann, Adrian Schuepbach, Jonas Pfefferle, Animesh Trivedi, Bernard Metzler

Slides (Intro & Pocket) from Pocket presentation (OSDI’18, Ana Klimovic)

Efficient Management of Ephemeral Data in …...Serverless Spark – Add serverless properties to...

Documents

Transcript of Efficient Management of Ephemeral Data in …...Serverless Spark – Add serverless properties to...

Ephemeral 2018-19

An Ephemeral Practice

Ephemeral Project Project Tasks Ephemeral Unix/Linux C++ implementation MPP implementation Other ?

Ephemeral Love II

The Ephemeral Art of Capturing Ephemeral Art...The Ephemeral Art of Capturing Ephemeral Art Web Archiving: Issues and Challenges NYTSL Spring Meeting, May 2, 2018 Deborah Kempe Frick

Ephemeral Project Motivation Active-Message Languages Ephemeral Unix/Linux C++ features Ephemeral coding MPP implementation Other ?

IoT and Serverless - AWS - Serverless Summit - Madhusudan Shekar

Application Repository AWS Serverless · AWS Serverless Application Repository Developer Guide What Is the AWS Serverless Application Repository? The AWS Serverless Application Repository

Well Actually Serverless, - DrupalCon Well Actually...The Serverless Framework with Node.js & AWS - Udemy Course Serverless Status - Newsletter Serverless Chronicle - Newsletter The

Ephemeral Wireless Networks

Serverless & PHPthe future serverless? is PHP's future serverless? try! learn! share! bref.sh null serverless-php.news @matthieunapoli. Title: Bref - February 2020 Created Date:

Material Authenticity of the Ephemeral - Deutsches … · 1 Material Authenticity of the Ephemeral Ephemeral objects constitute the core of this international workshop, dedicated

NAVIGATING ‘EPHEMERAL WORKS’

Understanding Ephemeral Storage for Serverless Analytics · pilation, and video processing. Using AWS Lambda as our serverless platform, we analyze application per-formance using

Serverless Machine Learning on AWS - Serverless Meetup Milano

Particle: Ephemeral Endpoints for Serverless Networkingvoelker/pubs/particle-socc...CCS Concepts • Computer systems organization →Cloud comput-ing. Keywords serverless, networking,

Nascent Notions On Serverless Computing€¦ · 2 Serverless Computing 3 Server Based Model vs Serverless Model 4 Why to use Serverless? 5 Leading Serverless providers 6 The Serverless

Ephemeral vistas

hojadesala medialab ephemeral

Serverless Architecture Patterns - Manoj Ganapathi - Serverless Summit