
Using GraphLab to build big data analytics pipelines and manage them in production. A presentation from the 3rd annual GraphLab Conference.


GraphLab in Production: Data Pipelines

Rajat Arya, Software Engineer, July 21, 2014

Reusable components

Runs on Hadoop: CDH5 now; Pivotal, Spark coming…

Runs in the cloud: EC2 now; Azure, Google coming…

Data pipelines & Predictive services

Clean → Learn → Deploy

GraphLab Data Pipeline

Beyond batch & stream processing

Predictive applications require real-time service

Deployed directly from data pipeline

GraphLab Predictive Service

Monitor from GraphLab Canvas


Sample Data Pipeline

A Simple Recommender System

Train Model → Recommend → Persist

•  Source: raw data from CSV
•  Tasks: Train Model, Produce Recommendations, Persist
•  Destination: write to database

Sample Prototype


MESSY, NOT MODULAR
FILE PATHS, NOT PORTABLE
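The prototype code itself is not legible in the transcript; below is a representative sketch of the kind of script these callouts are describing, with made-up paths and column names, so the refactoring steps on the next slide have something concrete to point at.

import graphlab as gl

# everything inlined: hard-coded local path, implicit column names, output
# written to another fixed path; fine for exploration, hard to rerun anywhere else
data = gl.SFrame.read_csv('/Users/rajat/Desktop/ratings_final_v3.csv')
model = gl.recommender.create(data)
recs = model.recommend(data['user_id'].unique())
recs.save('/tmp/recs.csv', format='csv')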

Typical Challenges to Production

•  Refactor code to remove magic numbers, file paths, support dynamic config
•  Rewrite entire prototype in 'production' language
•  Build / integrate workflow support tools
•  Build / integrate monitoring & management tools


GraphLab Create provides a better way …

Sample Data Pipeline

[Pipeline diagram: TRAIN → RECOMMEND → PERSIST; TRAIN reads the csv parameter and produces users and model]

def train_model(task):
    csv = task.params['csv']
    data = gl.SFrame.read_csv(csv)
    model = gl.recommender.create(data)
    task.outputs['model'] = model
    task.outputs['users'] = data

§  Code can be Python functions or file(s)
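The slide shows the body of the TRAIN step but not how a plain Python function becomes a named task. A minimal sketch of that registration step, assuming a graphlab.deploy Task object with set_code / set_params / set_outputs style methods; the exact names and the CSV path here are assumptions, not taken from the deck.

import graphlab as gl

# hypothetical registration of train_model as a named, parameterized task
train = gl.deploy.Task('train')
train.set_code(train_model)                               # the function shown above
train.set_params({'csv': 's3://my-bucket/ratings.csv'})   # placeholder path
train.set_outputs(['model', 'users'])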

Sample Data Pipeline

[Pipeline diagram: TRAIN → RECOMMEND → PERSIST; RECOMMEND consumes model and users and produces recs]

§  Code can be Python functions or file(s)

def gen_recs(task):
    model = task.inputs['model']
    users = task.inputs['users']
    recs = model.recommend(users)
    task.outputs['recs'] = recs

§  Dependencies managed logically by name

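To make "dependencies managed logically by name" concrete: gen_recs never references the TRAIN task directly; it only asks for inputs called model and users, and the framework wires those to whichever upstream task declared outputs with the same names. Another sketch against the same assumed Task API as above.

# hypothetical wiring of the RECOMMEND step; input names match TRAIN's output names
recommend = gl.deploy.Task('recommend')
recommend.set_code(gen_recs)
recommend.set_inputs(['model', 'users'])
recommend.set_outputs(['recs'])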

Sample Data Pipeline

[Pipeline diagram: TRAIN → RECOMMEND → PERSIST; PERSIST consumes recs and writes to the database]

§  Code can be Python functions or file(s)

§  Dependencies managed logically by name

def persist_db(task):
    recs = task.inputs['recs']
    conn = task.params['conn']
    import mysqlconnector
    save_to_db(conn, recs.save(format…)


§  Set required python packages so Task is portable

§  Automatic installation and configuration prior to execution
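A sketch of the required-packages idea for the PERSIST step, again using the assumed Task API; the set_required_packages method name, the package spec, and the connection string are assumptions.

persist = gl.deploy.Task('persist')
persist.set_code(persist_db)
persist.set_inputs(['recs'])
persist.set_params({'conn': 'mysql://prod-db/recs'})        # placeholder connection string
# declared pip packages are installed and configured on the execution host
# before the task runs, so the task stays portable
persist.set_required_packages(['mysql-connector-python'])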

Sample Data Pipeline

[Pipeline diagram: the TRAIN task swapped out for an INTERN TRAIN task; RECOMMEND and PERSIST unchanged]

§  Tasks are modular and reusable, enabling incremental development and rapid iterations
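Because tasks are modular, trying out an alternative training step is just a change to the task list passed to gl.deploy.job.create (shown on the "Executing Data Pipelines" slides below); RECOMMEND and PERSIST are untouched. Here intern_train stands in for a hypothetical replacement TRAIN task.

# same RECOMMEND and PERSIST tasks, different training step
job = gl.deploy.job.create(
    [intern_train, recommend, persist],
    environment='cdh5-prod')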


Executing Data Pipelines

job = gl.deploy.job.create(
    [train, recommend, persist],
    environment='cdh5-prod')

•  One way to create Jobs (with task bindings)
•  One way to monitor Jobs
•  Run on Hadoop, EC2, or locally without changing code, e.g. by passing environment='ec2-prod' instead (environments are sketched below)
•  Recall previous Jobs and Tasks, maintain workbench
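The environment names 'cdh5-prod' and 'ec2-prod' refer to execution environments defined ahead of time. A sketch of how such environments might be declared, assuming graphlab.deploy.environment helpers named Local, Hadoop, and EC2; the class names and constructor arguments are assumptions, and real definitions would also carry cluster and credential configuration.

import graphlab as gl

# define once, then refer to them by name when creating jobs
local = gl.deploy.environment.Local('local-dev')
hadoop = gl.deploy.environment.Hadoop('cdh5-prod')
ec2 = gl.deploy.environment.EC2('ec2-prod')

# the same task list runs anywhere by swapping the environment name
job = gl.deploy.job.create([train, recommend, persist], environment='ec2-prod')
# job progress and results can then be monitored from GraphLab Canvas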

GraphLab Data Pipeline Demo

GraphLab Data Pipeline Recap

Define it once
Run & monitor it anywhere

All in GraphLab Create

Thank you.

rajat@graphlab.com @rajatarya