Hadoop Networking at Datasift

Post on 13-May-2015

2.436 views 1 download

Tags:

Transcript of Hadoop Networking at Datasift

Hadoop Networking at DataSift

"How I learned to stop worrying and love Arista Switches"

About Me

Gareth LlewellynSenior Operations Engineer at DataSift

@NetworkStringabout.me/GarethLlewellynblog.NetworksAreMadeOfString.co.uk

Overview

● What is DataSift● The DataSift platform in numbers● Our initial network design● Issues with the initial design● Considered designs● Implementation● Questions

What is DataSift

● Real time and historical curation and filtering of many sources e.g. Facebook, Twitter, YouTube etc

● Augmentation of data e.g. demographic, link resolution

● Real time streams via Web Sockets, HTTP POST / PUT, SFTP etc

● Historical queries against data from as long ago as 2010

Platform in Numbers; Servers

● ~7k 2.13 - 2.8 Ghz Cores

● ~8Tb RAM

● ~2Pb Storage

● ~380 amps peak draw

● Heterogeneous mix of chassis; Intel SR2600URLXR, Dell R710s & DL380 Gen 7 / Gen8's

Platform in Numbers; Traffic

● Writes○ ~300Mb/s inbound streams

● Replication○ Peaks of 24Gb/s

● Map Reduce○ Peaks of 70Gb/s

● Exports

Initial Network Design

Buffers & Discards

Moving through the Cisco portfolio

● 2960○ 2.7 mpps○ 32 Gbs

● 3560○ 13.1 mpps○ 32 Gbs

● 3750○ 38.7 mpps○ 32 Gbs

● 4948○ 72 mpps○ 96 Gbs

Redesigning the Network● Uplink over subscription

○ Servers per cab○ Gbit uplinks per server○ Extensibility / Redundancy of uplinks

● Redundancy of TOR / Core / Distribution○ Power○ Chassis ○ Management Controllers

● Performance○ Buffers○ Head of line blocking

● Extensibility / Scalability○ Number of Hosts / Cabs supported○ Backplane

Redundancy: Mesh

Pros:● Inter cab transit is truly cab to cab● OSPF reduces admin overhead● Cisco IOS

Cons:● OSPF licence adds cost and

increases complexity● Uplink over subscription● More ports for routing than

hosts

Uplink Oversubscription: Chassis

Pros:● 720,000,000 pps● 80Gbs of inter blade transit● Cisco IOS● Dual Supervisors / PSUCons:● Still suffers Head of Line blocking● Only 2 PSUs● Overpopulated line cards increase

failure impact● Chassis failure (unlikely) = disaster● And....

Cables!

Leaf and SpineArista 7050's & 7048's

● 2x 52x port 10Gbit 7050 core switches● 12x 48x 1Gb / 4x 10Gb port TOR switches ● /27 public subnet per rack● ECMP routes to all racks● Dual PSU with disparate PDU / Dist Board / UPS /

Generator

The CoreArista 7050

● Each 7050 is a separate layer 2 network● SVI's on VLAN for Internet routing● Static routes● 1.2 Tbs throughput / 960 mpps forwarding

Top of RackArista 7048

● VLAN number = Cab Number● SVI consumes 1 IP from /27● Static ECMP routes to all other cabs● Minimum of 2x 10Gbit uplinks● 176Gbs throughput● 132 mpps forwarding

Questions

Yes, we're hiring ;)