Hw09 Welcome To Hadoop World

Post on 20-Aug-2015

24.084 views 2 download

Transcript of Hw09 Welcome To Hadoop World

Welcome to Hadoop World: NYC 2009Hadoop is Everywhere

Christophe BiscigliaFounder christophe@cloudera.com

Presents:

Hadoop World Details and Event UpdatesToo Late to Print

▪ WiFi Details▪ SSID: HadoopWorld▪ Password: hadoop09

▪ Twitter: #hadoopworld

▪ Break Out Sessions▪ Applications (This Room)▪ Dev / Admin: Terrace Ballroom (Across Lobby)▪ Extensions: Vanderbilt Suite (One Floor Up)

▪ UI BOF▪ Lead: Philip Zeyliger, Cloudera▪ Vanderbilt Suite, Afternoon Break

▪ HBase BOF▪ Lead: Michael Stack, Microsoft▪ Terrace Ballroom, Afternoon Break

Hadoop World SponsorsThanks!

Why Hadoop World?Time to Upgrade Your Data Management Strategy

▪ Hadoop isn’t just for Web Companies anymore▪ Terabytes are common place▪ Enables consumption of all enterprise data▪ Wide adoption across verticals

▪ Hadoop is driven by the Community▪ Most registrants are new to Hadoop▪ Sharing experience is critical - and incredibly valuable▪ Users and Developers exchanging needs and ideas

Growing Up with HadoopYou’ve come a long way baby...

▪ Early Days▪ 2004: Google Publishes MapReduce/GFS▪ 2005: Hadoop Prototype▪ Doug Cutting and Mike Cafarella

▪ 2006: Hadoop Running on 20 nodes▪ Internet Archive and UW

Growing Up with HadoopYou’ve come a long way baby...

Doug CuttingPhoto Credit: New York Times

▪ Formative Years▪ 2006: Yahoo! Begins Major Investment▪ 2007: Yahoo! Runs Hadoop on 2000 nodes▪ 2008: Yahoo! uses Hadoop to claim Terasort

Benchmark

Growing Up with HadoopYou’ve come a long way baby...

Growing Up with HadoopYou’ve come a long way baby...

▪ 5 Major Releases for Hadoop in last year▪ More Reliable▪ More Scalable▪ More Manageable

▪ New Sub-Projects Embrace New Users▪ Hive: SQL Data Warehouse for Hadoop▪ Pig: Data Analysis Language

Growing Up with HadoopYou’ve come a long way baby...

▪ Sqoop: Database import for Hadoop▪ Developer by Aaron Kimball, Cloudera▪ Works over JDBC▪ Extensible for better pefromance

Growing Up with HadoopYou’ve come a long way baby...

▪ RDBMS Vendors Embrace Hadoop▪ MapReduce is great for Analytics▪ Hadoop is the MapReduce Standard▪ integrates directly with Hadoop

Growing Up with HadoopYou’ve come a long way baby...

Growing Up with HadoopYou’ve come a long way baby...

▪ Adoption Spanning Globe▪ HUGs outside the US▪ Over 10x Companies “PoweredBy”▪ Not Just for Web Companies Anymore

Cloudera’s Distribution for HadoopDelivering Hadoop to a Larger Community

Cloudera’s Distribution for HadoopDelivering Hadoop to a Larger Community

Hadoop Community

Cloudera’s Distribution for HadoopDelivering Hadoop to a Larger Community

Latest Stable Hadoop Release

Stable Upcoming Features (by customer request) Distribution for Hadoop

Hadoop Community

Source Code Powering Y!

Improvements for EC2 and S3

New Features from Cloudera

Cloudera’s Distribution for HadoopDelivering Hadoop to a Larger Community

Latest Stable Hadoop Release

Stable Upcoming Features (by customer request) Distribution for Hadoop

Hadoop Community

Source Code Powering Y!

Improvements for EC2 and S3

New Features from Cloudera

Cloudera EnhancementsBug Fixes

Contributed to Apache

Cloudera’s Distribution for HadoopDelivering Hadoop to a Larger Community

Latest Stable Hadoop Release

Stable Upcoming Features (by customer request) Distribution for Hadoop

Hadoop Community

Cloudera’s Distribution for HadoopDelivering Hadoop to a Larger Community

Distribution for Hadoop

Cross-Platform Packaging,Integration and Testing

Hive, Pig, Sqoop, ...

Support

Cloudera’s Distribution for HadoopDelivering Hadoop to a Larger Community

Distribution for Hadoop

Cross-Platform Packaging,Integration and Testing

Hive, Pig, Sqoop, ...

Support

Packages

Private Cloud

Public Cloud

Images

Cloudera’s Distribution for HadoopDelivering Hadoop to a Larger Community

Distribution for Hadoop

Cross-Platform Packaging,Integration and Testing

Hive, Pig, Sqoop, ...

Support

Packages

Private Cloud

Comparing Growth Rates since March 2009Standard Packaging Drives Adoption

March 2009 May 2009 July 09 Aug 09 Sept 09

95%97%93%133%95%96%100%

1,835%

1,392%

1,026%

762%

384%

238%

100%

Cloudera DownloadsApache Downloads

▪ Consistent Downloads from Apache

▪ Cloudera Packages Drive New Usage

▪ Enables New Hadoop Applications

Normalized by unique users accessing hadoop.apache.org/core/releases.html and Cloudera Package Repositories in March 2009

Cloudera’s Business to DateSupport, Training and Professional Services

▪ Dozens of Support Customers▪ Using Hadoop for real enterprise workloads

▪ Training and Certification▪ 100’s of engineers trained▪ Sysadmin and Manager programs launched at Hadoop World

▪ Professional Services