SD Big Data Monthly Meetup #4 - Session 2 - WANDisco

Non-Stop Hadoop Enterprise Ready Hadoop Presentation for Big Data Meetup October 8, 2014

2 WWW.WANDISCO.COMREALIZING THE POSSIBILITIES OF BIG DATA

WANdisco Background

•  WANdisco: Wide Area Network Distributed Computing –  Enterprise ready, high availability software solutions that enable globally distributed

organizations to meet today’s data challenges of secure storage, scalability and availability •  Leader in tools for software engineers – Subversion

–  Apache Software Foundation sponsor •  Highly successful IPO, London Stock Exchange, June 2012 (LSE:WAND) •  US patented active-active replication technology granted, November 2012 •  Global locations

–  San Ramon (CA) –  Chengdu (China) –  Tokyo (Japan) –  Boston (MA) –  Sheffield (UK) –  Belfast (UK)

Customers

Non-Stop Hadoop

Non-Intrusive Plugin

Provides Continuous Availability In the LAN / Across the WAN

Active/Active

3 Key Problems For Multi Cluster Hadoop LAN / WAN

Enterprise Ready Hadoop Characteristics of Mission Critical Applications

•  Require 100% Uptime of Hadoop –  SLA’s, Regulatory Compliance

•  Require HDFS to be Deployed Globally –  Share Data Between Data Centers –  Data is Consistent and Not Eventual

•  Ease Administrative Burden –  Reduce Operational Complexity –  Simplify Disaster Recovery –  Lower RTO/RPO

•  Allow Maximum Utilization of Resource –  Within the Data Center –  Across Data Centers

Single Standby •  Inefficient utilization of resource

–  Journal Nodes –  ZooKeeper Nodes –  Standby Node

•  Performance Bottleneck •  Still tied to the beeper •  Limited to LAN scope

Active / Active •  All resources utilized

–  Only NameNode configuration –  Scale as the cluster grows –  All NameNodes active

•  Load balancing •  Set resiliency (# of active NN) •  Global Consistency

Breaking Away from Active/Passive What’s in a NameNode

Standby Datacenter •  Idle Resource

–  Single Data Center Ingest –  Disaster Recovery Only

•  One way synchronization –  DistCp

•  Error Prone –  Clusters can diverge over time

•  Difficult to scale > 2 Data Centers –  Complexity of sharing data

increases

Active / Active •  DR Resource Available

–  Ingest at all Data Centers –  Run Jobs in both Data Centers

•  Replication is Multi-Directional –  active/active

•  Absolute Consistency –  Single HDFS spans locations

•  ‘N’ Data Center support –  Global HDFS allows appropriate

data to be shared

Breaking Away from Active/Passive What’s in a Data Center

One Cluster Approach

•  Example Applications

–  HBASE –  RT Query –  Map Reduce

•  Poor Resource Management

–  Data Locality Issues –  Network Use –  Complex

Multiple Clusters

Creating Multiple Clusters

•  Example Applications

–  HBASE –  RT Query –  Map Reduce

•  Need to share data between clusters

–  DistCp / Stale Data –  Inefficient use of

storage and or network

–  Some clusters may not be available

Multiple Clusters

Cluster Zones Zoning for Optimal Efficiency

1 100%

Consistency

Multi Datacenter Hadoop Disaster Recovery

WAN REPLICATION

Absolute Consistency Maximum Resource Use

Lower Recovery Time/Point

Replicate Only What You Want BeCer UFlizaFon of Power/Cooling

Lower TCO LAN Speed Performance

Technical Overview Hadoop Powered by WANdisco

Periodic Synchronization DistCp

Parallel Data Ingest Load Balancer, Streaming

Multi Data Center Hadoop Today What's wrong with the status quo

Periodic Synchronization DistCp

Multi Data Center Hadoop Today Hacks currently in use

•  Runs as Map reduce •  DR Data Center is read only •  Over time, Hadoop clusters

become inconsistent •  Manual and labor intensive

process to reconcile differences •  Inefficient use of the network

Parallel Data Ingest Load Balancer, Flume

Multi Data Center Hadoop Today Hacks currently in use

•  Hiccups in either of the Hadoop cluster causes the two file systems to diverge

•  Potential to run out of buffer when WAN is down

•  Requires constant attention and sys-admin hours to keep running

•  Data created on the cluster is not replicated

•  Use of streaming technologies (like flume) for data redirection are only for streaming

DConE Distributed Coordination Engine

•  WANdisco’s patented WAN capable paxos implementation –  Mathematically proven –  Provides distributed co-ordination of File system metadata

•  Active/Active (All locations) •  Create, Modify, Delete •  Shared nothing (No Leader)

•  No restrictions on distance between datacenters –  US Patent granted for time independent implementation of Paxos

•  Not based on SAN block device synchronization such as EMC SRDF –  SAN block replication has distance limits resulting from the inability of file systems

such as NTFS and ext4 to tolerate long RTTs to block storage –  Possible distribution of corrupted blocks

Paxos is a family of protocols for solving consensus in a network of unreliable processors.

Consensus is the process of agreeing on one result among a group of participants.

This problem becomes difficult when the participants or their communication medium may experience failures.

•  Majority Quorum –  A fixed number of participants –  The Majority must agree for change

•  Failure –  Failed nodes are unavailable –  Normal operation continue on nodes

with quorum

•  Recovery / Self Healing –  Nodes that rejoin stay in safe mode

until they are caught up

•  Disaster Recovery –  A complete loss can be brought back

from another replica

How DConE Works WANdisco Active/Active Replication

TX id: 168 TX id: 169 TX id: 170 TX id: 171 TX id: 172 TX id: 173

Proposal 170

Agree 170

Proposal 171 Agree 172 Agree 173

Agree 171 Proposal 172 Proposal 173

C Agree 170 Agree 171 Agree 172

Agree 173

Architecture of a Non-Stop Hadoop

Use Cases

•  Eliminate The Performance Bottleneck of a Single Active NameNode •  Multi Data-Center Ingest

–  Information doesn't need to be sent to one DC and then copied back to the other using DistCP –  Parallel ingest methods don’t require redirected data streams –  Ingest data at, or close to the source –  Global Analysis (Logs, Click Streams, etc…)

•  Cluster Zones –  Efficient use of resource based on application profile –  HBASE, IMPALA, Storm, Map Reduce, SPARK, etc… –  Heterogeneous Clusters Supported

•  Maximize Data Center Resource Utilization –  All datacenters can be used to run different jobs concurrently

•  Disaster Recovery –  Data is as current as possible (no periodic synchs) –  Virtually zero downtime to recover from regional data center failure –  Regulatory compliance

•  Optimized hardware profiles for job specific tasks –  Batch –  Real-time –  NoSQL (HBASE)

•  Set replication factors per sub-cluster

•  Use at LAN or WAN scope

•  Resilient to NameNode failures

Use Case: Heterogeneous Hardware

•  Maximize Resource Utilization –  No idle standby

•  Isolate Dev and Test Clusters –  Share data not resource

•  Carve off hardware for a specific group

–  Prevents a bad map/reduce job from bringing down the cluster

•  Guarantee Consistency and availability of data

–  Data is instantly available

Use Case: Sub-Clusters

Non-Stop Hadoop Demonstration

Question and Answer Feel free to submit your questions

Thank you

SD Big Data Monthly Meetup #4 - Session 2 - WANDisco

Technology

Transcript of SD Big Data Monthly Meetup #4 - Session 2 - WANDisco

Shades SD 22, SD 40, SD 50, SD 100, SD 150 · Shades SD 22, SD 40, SD 50, SD 100, SD 150 . KD183-9ca Shades SD 22, SD 40, SD 50, SD 100, SD 150 Page1/3 . Material: hot-pressed, glass

Hybrid Data Lake on the AWS Cloud - Welcome to WANdisco · Page 1 of 22 Hybrid Data Lake on the AWS Cloud with WANdisco Fusion, Amazon S3, and Amazon Athena Quick Start Reference

SEOworkshop-Meetup-Mar272011 SEO Meetup Full Presentat… · SEO Workshop Mar 27, 2011 Internet Marketing Meetup @ Branding Personality Please Tweet: I am attending the Meetup “Using

WANdisco Sv Ac Admin Guide-07-22

Shades SD 22, SD 40, SD 50, SD 100, SD 150 · 2016-12-14 · Shades SD 22, SD 40, SD 50, SD 100, SD 150 . KD183-9ca Shades SD 22, SD 40, SD 50, SD 100, SD 150 Page1/3 . Material:

02.28.13 WANdisco ApacheCon 2013

The World Leader in - Welcome to WANdisco | WANdisco World Leader in Active Data Replication TO STAY UP TO DATE WITH ALL THE LATEST NEWS VISIT OVERVIEW 02 Financial and operational

WANdisco Fusion MAA for Oracle BDA and BDCS Fus… · WANdisco Fusion MAA for Oracle 3 ... Oracle Maximum Availability Architecture guidelines aim to explain the best practices for

Meetup 5min Lightning Talk for Meetup 2/17/2016

Internet of Things Cebu meetup : 1st meetup

Dublin Spark Meetup - Meetup 1 - Intro to Spark

1100 TH B3 WANdisco REV - Jefferies · 2013-05-14 · WANdisco: Wide Area Network Distributed Computing Patented technology addressing a large, global market Leader in tools for software

Fort Worth IBD Meetup Investor’s Business Daily Meetup …armchairinvestor.com/wp-content/uploads/2016/12/IBD-Meetup-Fort... · Fort Worth IBD Meetup Investor’s Business Daily

IBD Meetup Pikes Peak Meetup Group Colorado Springs, CO

WANDISCO FUSION GERRIT MULTISITE · 2019. 11. 1. · WANdisco Gerrit Multisite (GerritMS), using our game-changing patented technology, is a hardened service for replicating Git repositories

· 2020. 10. 28. · sd sd press thx sljte a premium chjb suite b g sd sd sd sd sd sd sd sd 9 sd sd sd sd 5 sd sd sd sdsd 5 (sd sd g g g q sd sd sd 5 sd sd sd sd sd sd sd sd sd sd

IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & OpenWhisk

MAFIA MEETUP KIT - zyngablog.typepad.com · MAFIA MEETUP KIT Mafia Meetup Checklist (Short Version) Mafia Meetup Checklist (Long/Detailed Version with Images) Meetup Tips from the

Saigon Wordpress Meetup - Themes Wordpress Meetup

Learn to Read Thai - Find Meetup groups near you - Meetup