DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

Post on 26-Jan-2015

116 views 3 download

Tags:

description

 

Transcript of DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL

ApacheCassandra Jonathan Ellis, CTO DataStax

Dec, 7 2011

Real NoSQL Applications in the Enterprise Today.

Today’s Database Challenge

Navigating the NoSQL waters

 Distributed  Horizontally scalable  Eventually consistent  Non-relational

  Column store   Document stores   Key-value   Graph   … and more

Cassandra: the best for “big data”

 Elegant architecture  Operational flexibility   Industry-leading performance

 You should be using Cassandra for applications requiring   high-performance, realtime queries   scalability past one machine   bulletproof reliability

Bigtable, 2006 Dynamo, 2007

OSS, 2008

Incubator, 2009 TLP, 2010 1.0, October 2011

Cassandra Highlights  Multi-master, multi-DC  Linearly scalable  Larger-than-memory datasets  High performance  Full durability   Integrated caching  Tuneable consistency

A single four-core machine; one million inserts + one million updates

Performance

The Cassandra Difference

Scalable Performance

Oracle Exadata ✖ ✔ ✔

MySQL ✖ ✔ ✔

Sharding ✔ ✔ ✖

MongoDB ✔

Operational Ease

Cost Effective

Cassandra ✔ ✔ ✔

HBase ✔ ✖ ✔

And when it comes to Performance, we’re unmatched.

*

*

✖ ✔

“With Cassandra, we get better business agility, and we don’t have to plan capacity in advance, we don’t need to ask permission of other people to build things for us, and we don’t worry about running out of space or power.”

 Adrian Cockcroft, Cloud Architect

Netflix’s problems  Could not build datacenters fast enough  Made decision to go to cloud (AWS)  Cassandra on AWS is a key infrastructure

component of its globally distributed streaming product.

 Applications include Netflix’s subscriber system, AB testing, and viewing history service (including positions at which members stopped watching a streaming program).

Netflix on Cassandra TODO  Fast  Cheap  Scalable  Flexible  No SPOF

“Without Cassandra, our engineers would’ve had to create something that could scale to our needs, that would’ve prevented us from focusing on building product and solving problems for Backupify’s users, which are far more important tasks.”

 Matt Conway, VP Engineering

Backupify’s problem  Cloud-based utility that enables

businesses and consumers to backup, search and restore the content of popular online applications such as Google Apps, Gmail, Facebook, Twitter, and Blogger

 Needs:   Horizontal scaling   Ability to handle high write loads   Elasticity with no manual sharding

Backupify on Cassandra  Ease of scale enabled engineers to focus

on building great applications  DataStax OpsCenter made it easy to

monitor the health and perf of their cluster   Reliable, redundant and scalable low-

balance data storage helped eliminate down-time

 Ability to offer both backup and storage, but also analysis of data eventually

“You can seamlessly add new nodes and expand your total capacity without deteriorating the performance of the data store. Cassandra has allowed us to scale very effectively.”

 Harry Robertson, Tech Lead

Ooyala’s problem  Ooyala provides a suite of technologies

and services that support content owners in managing, analyzing and monetizing the digital video they publish online

 Needs:   Elasticity, to respond to spikes in data scale   Ability to respond to increasingly

sophisticated analytic needs of customers

Ooyala on Cassandra  Classic “Big Data” problem did not require

re-architecting  Application agility was enabled –

developers spend time building cool apps, not figuring out how to scale

 Enabled more powerful and granular analytics to their customers

“Cassandra has allowed us to build bigger features faster and more reliably, while using less money and without needing to expand our staff.”

 Kyle Ambroff, Sr. Engineer

Formspring’ problem  Users of Formspring engage with and learn

more about each other by asking and responding to questions. With close to 4B responses in the system and 30M unique users, they needed:   To support explosive growth   To seamlessly syndicate user content   To avoid sharding   Application flexiblity

Formspring on Cassandra  No sharding needed – just add nodes to

scale  Performance – the popular users with

many followers saw no speed reduction.  No more memcached!  Flexibility of a schema-optional

architecture is very developer friendly

Why DataStax? DataStax delivers database products and services based on Apache Cassandra from experts who are at the forefront of today's data revolution.

Database Software & Tools Support & Services

  DataStax Enterprise   DataStax Community   DataStax OpsCenter   Drivers & Connectors

  Production Support   Consultative Help   Professional Training   Online Documentation

DataStax Overview   Founded in April 2010   Commercial leader in Apache Cassandra™, the popular

open-source “big data” database   Headquartered in San Francisco Bay area   100+ customers   35+ employees (split between San Fran and Austin)   Home to Apache Cassandra Chair & most committers   Secured $11M in Series B funding in Sep 2011

DataStax Value  The simplest way to get started with Apache

Cassandra: DataStax Community Edition  A smart, integrated platform that provides

Analytics and Real-Time capabilities in the same database, without any resource contention: DataStax Enterprise

 The backing of the Cassandra Experts

100+ customers

DataStax Enterprise 1.  DataStax Enterprise

Database Server

2.  OpsCenter Enterprise Management solution

3.  Expert production support & consultative services

Enterprise Database Server

  Leverages resources on-premise or in the cloud

  Guarantees uptime with a master-less distributed architecture

  Allows for fast application changes via flexible schemas

  Handles structured, semi-structured, and unstructured data

  Provides advanced security   Eliminates the need for separate analytics

system

1

2 3

4

6 5

Real-Time

Analytics

Repl

icat

ion

Enterprise-class database built to handle today’s big-data needs in a cost-effective, easy, and reliable way.

OpsCenter Enterprise

  Visual, browser-based user interface

  Administration tasks carried out in point-and-click fashion

  Allows for visual rebalance of data across a cluster when new nodes are added

  Proactive alerts that warn of impending issues

  Built-in external notification abilities

OpsCenter Enterprise supplies management, monitoring, and control over DataStax Enterprise

Expert Production Support DataStax Enterprise includes production support and consultative services from the Cassandra experts.

 Support service level agreements that range from business hours to 24x7x365

 Consultative support for assistance on architecture, design, and tuning

 Certified quarterly service packs

 Hot-fix support

DataStax Enterprise Compared

Scalable Performance

Oracle Exadata ✖ ✔ ✔

MySQL ✖ ✔ ✔

Sharding ✔ ✔ ✖

MongoDB ✔

Operational Ease

Cost Effective

DataStax Enterprise ✔ ✔ ✔

HBase ✔ ✖ ✔

✖ ✔

Real-Time + Analytics

Oracle NoSQL DB ✔ ✔ ? ✖

DataStax – Your One-Stop Shop  DataStax Enterprise and Community Editions  Professional Training, Expert Consulting  Documentation and Dev Center

  http://www.datastax.com/docs   http://www.datastax.com/dev

 Whitepapers, Case Studies, FAQ’s and more   http://www.datastax.com/resources/whitepapers   http://www.datastax.com/resources/casestudies

Thank you!