Hadoop migration and upgradation

8
HADOOP MIGRATION OVERVIEW [email protected] http://helpmetocode.blogspot.co m (c) Infinity 1

description

Migrate hadoop from old version to newer version

Transcript of Hadoop migration and upgradation

Page 1: Hadoop migration and upgradation

(c) Infinity 1

HADOOP MIGRATION OVERVIEW

[email protected]

http://helpmetocode.blogspot.com

Page 2: Hadoop migration and upgradation

(c) Infinity 2

APPROACH

Preparation

Phase

•Access Grants

•Assessment of Current Infrastructure

•Inventory of Current Setup

•Meta-Data Backup

Methodology

•New Cluster Setup with Hadoop 2.x . OR Inline Migration to Hadoop 2.x from Hadoop 1.x

Staging

Environment

•Replica of Current Setup in Staging Environment

•Upgrade in Staging Environment

•Validation of Migration in Staging Environment

Productio

n Rollo

ut

•Upgrade of Production Environment

•Production Environment validation.

Page 3: Hadoop migration and upgradation

(c) Infinity 3

UPGRADE TO HADOOP 2.X WITH NEW CLUSTER SETUP

Upgrade a cluster by configuring a new cluster with same capacity and newer Hadoop version and then migrate the files/data from old cluster to new one.

Page 4: Hadoop migration and upgradation

(c) Infinity 4

UPGRADE TO HADOOP 2.X WITH NEW CLUSTER SETUP

CopyToLocal /CopyFromLocal The process flows, the files are copied to local drive using Hadoop

command CopyToLocal and the files are then pushed to the new cluster using CopyFromLocal, and the older cluster can be decommissioned.

Using Hadoop CP command This is a kind of cluster to cluster copy, using Hadoop ‘cp’ command the

files are transferred from one HDFS to other HDFS. As the version are different we need a mechanism called copy from HFTP where the command is executed from the target cluster by defining source as old cluster with HFTP protocol and target as HDFS protocol.

Using Hadoop DISTCP command This is a kind of cluster to cluster copy, using Hadoop ‘distcp’ command

the files are transferred from one HDFS to other HDFS. As the version are different we need a mechanism called copy from HFTP where the command is executed from the target cluster by defining source as old cluster with HFTP protocol and target as HDFS protocol.

Page 5: Hadoop migration and upgradation

(c) Infinity 5

INLINE MIGRATION TO HADOOP 2.X FROM HADOOP 1.X

Upgrading existing cluster from V1 to V2 inline by Installing/Configuring new cluster and updating metadata.

HDFS Upgraded Metadata HDFS

Hadoop V1 Hadoop V2

Page 6: Hadoop migration and upgradation

(c) Infinity 6

METHODOLOGY FOR INLINE MIGRATION FROM HADOOP 1.X TO HADOOP 2.X

Shutdown of Cluster/Nodes

Software Stack Stage Upgrade Phase

Validation of Upgrade Activity

Production Rollout

Page 7: Hadoop migration and upgradation

(c) Infinity 7

FOLLOW THE FOLLOWING LINK FOR STEP BY STEP UPGRADE PROCESS

http://wiki.apache.org/hadoop/Hadoop_Upgrade

Page 8: Hadoop migration and upgradation

(c) Infinity 8

Drop me mail @ [email protected]

Visit my blog http://helpmetocode.blogspot.com