Spark on Google Cloud


Page 1: Spark on Google Cloud - Amazon Web Services · Labels + Add label Max restarts per hour ... Create a cluster Name cluster-I Region global Cluster mode Standard (1 master, N workers)

Spark on Google Cloud

Friends don't let friends build data centers

Page 2:

Data Scientist @Avenue Code

Evandro Caldeira

Data scientist at Avenue Code. Holds a degree in Computer Engineering and is crazy about coffee.

E-mail: [email protected]

Page 3:

Brazil: Belo Horizonte, São Paulo, Porto Alegre

USA

Canada

Page 4:

TOC

Overview

On premises vs. cloud

How to migrate

Demo

Page 5:

Why Spark?

Page 8:

On premises

1 Equipment

2 Management

3 Usage peaks

4 $$$

Page 9:

Migration to GCP: decommissioning of a data center in 2018

Spotify

Page 10:

First steps toward GCP

Page 13:

What to do

1 Move the data

2 Experiment
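Moving the data usually means copying it into a Cloud Storage bucket so that Spark jobs can read it from `gs://` paths. The following is a minimal sketch, assuming the `gsutil` tool from the Google SDK is installed. The bucket name, prefix, and local path are hypothetical, and the function only builds the command so it can be reviewed before anything is run.

```python
import shlex
from typing import List


def copy_to_bucket_cmd(local_dir: str, bucket: str, prefix: str = "") -> List[str]:
    """Build a `gsutil` command that copies a local directory to Cloud Storage.

    `bucket` and `prefix` are hypothetical placeholders, not names from the talk.
    """
    destination = f"gs://{bucket}/{prefix}".rstrip("/")
    # -m copies files in parallel, -r recurses into the directory.
    return ["gsutil", "-m", "cp", "-r", local_dir, destination]


# Print the command so it can be reviewed before running it in a shell.
print(shlex.join(copy_to_bucket_cmd("./data", "my-example-bucket", "raw")))
```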

Page 14:

What to do

3 Use ephemeral clusters

4 Preemptible workers

5 Delete the cluster when finished
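Steps 3 to 5 can be combined into one lifecycle: create a cluster for a single workload, add preemptible workers, and delete it when the work is done. The sketch below assumes the clusters are Dataproc clusters managed through the `gcloud` CLI, which the slides do not state explicitly. The cluster name, region, and worker counts are hypothetical, and the `run` parameter is an illustrative hook so the commands can be inspected without being executed.

```python
import subprocess
from typing import Callable, Iterable, List

Command = List[str]


def create_cluster_cmd(name: str, region: str, workers: int, preemptible: int) -> Command:
    """Build the cluster-creation command with preemptible secondary workers."""
    return [
        "gcloud", "dataproc", "clusters", "create", name,
        f"--region={region}",
        f"--num-workers={workers}",
        f"--num-secondary-workers={preemptible}",
        "--secondary-worker-type=preemptible",
    ]


def delete_cluster_cmd(name: str, region: str) -> Command:
    """Build the cluster-deletion command; --quiet skips the confirmation prompt."""
    return ["gcloud", "dataproc", "clusters", "delete", name, f"--region={region}", "--quiet"]


def run_on_ephemeral_cluster(
    name: str,
    region: str,
    job_cmds: Iterable[Command],
    run: Callable[[Command], object] = subprocess.check_call,
    workers: int = 2,
    preemptible: int = 2,
) -> None:
    """Create a cluster, run the jobs, and attempt deletion afterwards."""
    run(create_cluster_cmd(name, region, workers, preemptible))
    try:
        for cmd in job_cmds:
            run(cmd)
    finally:
        # Deletion is attempted even when a job fails.
        run(delete_cluster_cmd(name, region))


# Dry run: collect the commands instead of executing them.
planned: List[Command] = []
run_on_ephemeral_cluster("demo-cluster", "us-central1", [], run=planned.append)
for cmd in planned:
    print(" ".join(cmd))
```

Wrapping the jobs in `try`/`finally` is one way to make sure the deletion step is not skipped when a job raises an error, which is what keeps an ephemeral cluster from continuing to accrue cost.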

Page 15:

Hands on

Page 17:

Free credits!

Page 18:

Installation

1 Google SDK

2 Spark standalone

Page 19:

Cluster creation

Page 22:

Job execution
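The slide itself shows no command, so the following is only a sketch of how a PySpark job could be submitted, assuming a Dataproc cluster and the `gcloud` CLI. The script path, cluster name, region, and job argument are all hypothetical.

```python
import shlex
from typing import List, Sequence


def submit_pyspark_cmd(
    script: str,
    cluster: str,
    region: str,
    job_args: Sequence[str] = (),
) -> List[str]:
    """Build a `gcloud dataproc jobs submit pyspark` command."""
    cmd = [
        "gcloud", "dataproc", "jobs", "submit", "pyspark", script,
        f"--cluster={cluster}",
        f"--region={region}",
    ]
    if job_args:
        # Everything after "--" is passed to the job script itself.
        cmd += ["--", *job_args]
    return cmd


# Print the command for review; all names below are placeholders.
print(shlex.join(submit_pyspark_cmd(
    "gs://my-example-bucket/jobs/wordcount.py",
    "demo-cluster",
    "us-central1",
    ["gs://my-example-bucket/raw"],
)))
```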

Page 25:

Source: https://github.com/evandroc/tdc-spark

Page 26:

Thank you.