Efficient Lightweight Fast Stream Processing at Scale*...Efficient Lightweight Fast Stream...

ELF Efficient Lightweight Fast Stream Processing at Scale* Liting Hu Karsten Schwan Hrishikesh Amur

Xin Chen Georgia Institute of Technology

*Funded in part with support from the Intel Cloud ISTC

Outline

Motivation

ELF Design

Evaluation

Micro-promotions

Product-bundling

Sales-prediction ... … …

Coupons Recommendations

Predictions

Applications

Filter @item

Sort #clicks

Filter @buys

Query Angry

Birds5?

⋈Yes/No?

Updates every half hour

Updates per min

Updates every x milliseconds

Batch model

Stream model

User query

Concurrent, diverse jobs with varied delays

Performance •  High throughput •  Low latency

Flexible •  Diverse jobs •  Cross-job coordination •  Runtime job change

Scalable •  Hundreds of concurrent jobs with shared inputs

Batch job

Stream job

Query job

System Needs

Web server

Log collection systems e.g., Flume, Kafka

Storage

N jobs

Existing Solutions

Outline

Motivation

ELF Design

Evaluation

ELF runs directly on the web server tier

Master

hash(‘app2’) =d462ba

hash(‘app1’) =d502fb hash(`app3’)

=2080c3

Join Join

Master

Each organized into a peer-to-peer overlay

Many Masters/Many Workers

ELF Software Architecture

CBT-based agent `pre-reduces’ local key-value pairs

SRT globally `shuffles’ and `reduces’ key-value pairs

ELF APIs for programmers to implement diverse jobs

ELF Design

Sample ELF Query

in-memory data structure

<(a,1)(b,1)(a,1)(b,1)(b,1)>

<(a,2)(b,3)>

(1) ̀ insert ‘to fill

(2) ̀ flush’ to fetch the pre-reduced result

(3)‘empty’ to empty the states

Compressed Buffer Tree (CBT)

Stateful, asynchronous, and synchronous execution

ELF Design

(a,2) (b,3)

(b,2) (c,2)

(c,1) (d,7)

(a,7) (b,3)

(d,7) (c,3) (b,2)

(a,7) (d,7) (b,5) (c,3)

hash(`micro-promotion’) =d462ba

(a,<b>) (b,<a>)

(a,<c>)

(c,<f>) (d,<e>)

(c,1) (d,7)

(a,<b,c>) (d,<e,g,h>)

(a,<b,c>) (b,<a>) (c,<f>) (d,<e,g,h>)

hash(`product-bundling’) =2080c3

(b,<a>) (c,<f>)

Per-job dataflow

webserver

Synchronizing CBT to be emptied Last iteration’s result New job function or parameter

Per-job dataflow

Limited to O(Log(N)) hops

hash(`Sale-predication’) =43bc3c

Cross-job Coordination

Per-job dataflow, and cross-job coordination

ELF Design

Per-job dataflow, and cross-job coordination

ELF Design

Using ELF

Outline

Motivation

ELF design

Evaluation

Latency Replay Twitter’s stream 1280 agents 50 events/second each

60 × 12-core 2.66 GHz AMD Opteron 48 GB RAM per server Gigabit Ethernet

ü  ELF outperforms for large windows due to the efficient CBT flush

New Functionality 10s millisecond for multicast messages 100s millisecond to update functions

Scalability 60 × 12-core 2.66 GHz AMD Opteron 48 GB RAM per server Gigabit Ethernet 1000 agents

ü  ELF scales well up to thousands of concurrent jobs

Conclusions and Future Work ELF introduces a `many masters/many workers’ framework: • To achieve good performance: low latency and

high throughput •  scalability: for diverse, concurrent jobs, and •  new functionality: job flexibility feedback and

coordination. Future work: understand the limitations of Elf; Explore more complex web-tier processing; look at task isolation.

Thank you!

Liting Hu foxting@gatech.edu

Efficient Lightweight Fast Stream Processing at Scale*...Efficient Lightweight Fast Stream...

Documents

Transcript of Efficient Lightweight Fast Stream Processing at Scale*...Efficient Lightweight Fast Stream...

Plastic packaging is lightweight, cost-efficient and …...Plastic Packaging Flexes Its Resource E˜ciency Plastic packaging is lightweight, cost-efficient and does a great job protecting

Development of lightweight and cost-efficient … de Engenharia da Universidade do Porto Development of lightweight and cost-efficient exterior body panels for electric vehicles André

LIGHTWEIGHT HYDROGEN STORAGE SOLUTIONS … › uploads › 2020 › 03 › Brochure...STORAGE SOLUTIONS FOR FUEL CELL ELECTRIC VEHICLES LIGHTWEIGHT SAFE EFFICIENT Hydrogen is a clean

Efficient Sub -Stream Encoding and Transmission for P2P Video … · 2007. 6. 15. · Efficient Sub-stream Encoding and Transmission for P2P Video on Demand 3 Motivation • Video-on-demand

Analysis of Lightweight Stream Ciphers - simonfischer · PDF fileAnalysis of Lightweight Stream Ciphers PhD public defense Simon Fischer EPFL and FHNW April 18th 2008 Simon Fischer

Efficient Sub-Stream Encoding and Transmission for P2P Video on Demand

DPBSV- An Efficient and Secure Scheme for Big Sensing Data Stream · 2015-09-24 · DPBSV- An Efficient and Secure Scheme for Big Sensing Data Stream Deepak Puthal*, Surya Nepal†,

Efficient Window Aggregation with General Stream Slicingasterios.katsifodimos.com/assets/publications/general... · 2019-08-07 · Efficient Window Aggregation with General Stream

Cost-efficient enactment of stream processing topologiesCost-efﬁcient enactment of stream processing topologies ChristophHochreiner1,MichaelVo¨gler2,Stefan Schulte1 andSchahram

Analysis of Lightweight Stream Ciphers - · PDF fileAnalysis of Lightweight Stream Ciphers PhD public defense Simon Fischer EPFL and FHNW April 18th 2008 Simon Fischer Analysis of

Oracle Complex Event Processing: Lightweight Modular ... · PDF fileOracle Complex Event Processing: Lightweight Modular Application Event Stream Processing in the Real World An Oracle

Compact, lightweight and power efficient voltage tunable ...1418/... · 1 Compact, Lightweight and Power Efficient Voltage Tunable Multiferroic RF/Microwave Components A Dissertation

WG-8: A Lightweight Stream Cipher for Resource ...cacr.uwaterloo.ca/techreports/2012/cacr2012-28.pdfWG-8: A Lightweight Stream Cipher for Resource-Constrained Smart Devices 3 { F2

An Efficient and Fault-Tolerant Model for Stream ... · An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters Matei Zaharia, Tathagata Das, ... • Let users

Lightweight Lattice-based Homomorphic Privacy- …kan.yang/security_bbcr/paper/ASMAAWCSP14.pdfapplications, HAN includes certain ... previously proposed an efficient and lightweight

Towards Efficient Stream Processing in the Wide Area

Space-efficient Tracking of Persistent Items in a Massive Data Stream

Exploiting media stream similarity for energy-efficient - ELIS

Counting Stream Registers: An Efficient and Effective Snoop Filter Architecture

Lightweight Fuel Efficient Engine Package