What is the past future tense of data?

54
1 ©MapR Technologies - Confidential The Shape of Data to Come it isn’t what we thought it was

description

This is a vision pitch about where big data is going and why.

Transcript of What is the past future tense of data?

Page 1: What is the past future tense of data?

1©MapR Technologies - Confidential

The Shape of Data to Comeit isn’t what we thought it was

Page 2: What is the past future tense of data?

2©MapR Technologies - Confidential

Do you remember

the future?

Page 3: What is the past future tense of data?

3©MapR Technologies - Confidential

Page 4: What is the past future tense of data?

4©MapR Technologies - Confidential

Some things

turned out as

expected

Page 5: What is the past future tense of data?

5©MapR Technologies - Confidential

Guys wearing Fedoras

Page 6: What is the past future tense of data?

6©MapR Technologies - Confidential

What about “Big Data”?

Page 7: What is the past future tense of data?

7©MapR Technologies - Confidential

Harvard University will have 200 x 106 volumes by 2040

Fremont Rider, 1944

Page 8: What is the past future tense of data?

8©MapR Technologies - Confidential

To cope … only short papers should be published. … not more than 2500 characters counting “space,” punctuation marks, etc.

Gray and Ruston in IEEE Transactions on Electronic Computers, 1964

Page 9: What is the past future tense of data?

9©MapR Technologies - Confidential

Remember the guy in

the Fedora?

Page 10: What is the past future tense of data?

10©MapR Technologies - Confidential

He’s tweeting

about this right now

Page 11: What is the past future tense of data?

11©MapR Technologies - Confidential

So what is the big data monorail and what is the

cool hat?

Page 12: What is the past future tense of data?

12©MapR Technologies - Confidential

Data curationRigid SchemasEngineered Structure

Page 13: What is the past future tense of data?

13©MapR Technologies - Confidential

Data curationRigid SchemasEngineered Structure

MONORAIL

Page 14: What is the past future tense of data?

14©MapR Technologies - Confidential

Data as-you-find-it

Flexible schemasLate binding

Page 15: What is the past future tense of data?

15©MapR Technologies - Confidential

Data as-you-find-it

Flexible schemasLate bindingCoo

l Hats

Page 16: What is the past future tense of data?

16©MapR Technologies - Confidential

Page 17: What is the past future tense of data?

17©MapR Technologies - Confidential

MONORAIL

Page 18: What is the past future tense of data?

18©MapR Technologies - Confidential

Page 19: What is the past future tense of data?

19©MapR Technologies - Confidential

Cool H

ats

Page 20: What is the past future tense of data?

20©MapR Technologies - Confidential

Why is it different?How does it work?

Page 21: What is the past future tense of data?

21©MapR Technologies - Confidential

More data is being produced more quickly

Data sizes are bigger than even a very large computer can hold

Cost to create and store continues to decrease

The Conventional Answer

BUSTED!

Page 22: What is the past future tense of data?

22©MapR Technologies - Confidential

Analytics Scaling Laws

Analytics scaling is all about the 80-20 rule – Big gains for little initial effort– Rapidly diminishing returns

The key to net value is how costs scale– Old school – exponential scaling– Big data – linear scaling, low constant

Cost/performance has changed radically– IF you can use many commodity boxes

Page 23: What is the past future tense of data?

23©MapR Technologies - Confidential

Which bytes first?

Page 24: What is the past future tense of data?

24©MapR Technologies - Confidential

Page 25: What is the past future tense of data?

25©MapR Technologies - Confidential

Page 26: What is the past future tense of data?

26©MapR Technologies - Confidential

Net value optimum has a sharp peak well before maximum effort

Page 27: What is the past future tense of data?

27©MapR Technologies - Confidential

But scaling laws are changing both slope and shape

Page 28: What is the past future tense of data?

28©MapR Technologies - Confidential

More than just a little

Page 29: What is the past future tense of data?

29©MapR Technologies - Confidential

They are changing a LOT!

Page 30: What is the past future tense of data?

30©MapR Technologies - Confidential

Page 31: What is the past future tense of data?

31©MapR Technologies - Confidential

Page 32: What is the past future tense of data?

32©MapR Technologies - Confidential

Page 33: What is the past future tense of data?

33©MapR Technologies - Confidential

Page 34: What is the past future tense of data?

34©MapR Technologies - Confidential

Initially, linear cost scaling actually makes things worse

A tipping point is reached and things change radically …

Page 35: What is the past future tense of data?

35©MapR Technologies - Confidential

Evolution of Data Storage

FunctionalityCompatibility

Scalability

Linux

POSIX

Over decades of progress,Unix-based systems have set the standard for compatibility and functionality

Page 36: What is the past future tense of data?

36©MapR Technologies - Confidential

Evolution of Data Storage

FunctionalityCompatibility

Scalability

Linux

POSIX

HadoopHadoop achieves much higher scalability by trading away essentially all of this compatibility

Page 37: What is the past future tense of data?

37©MapR Technologies - Confidential

Evolution of Data Storage

FunctionalityCompatibility

Scalability

Linux

POSIX

Hadoop

MapR enhances Apache Hadoop by restoring the compatibility while increasing scalability and performance

Page 38: What is the past future tense of data?

38©MapR Technologies - Confidential

Introducing MapR

MapR offers thetechnology leading

distribution for Hadoop

Page 39: What is the past future tense of data?

39©MapR Technologies - Confidential

The Industry-Leaders Choose MapR in the Cloud

Google chose MapR to provide Hadoop on Google

Compute Engine

Amazon EMR is the largest Hadoop provider in revenue

and # of clusters

Page 40: What is the past future tense of data?

40©MapR Technologies - Confidential

MapR Supports Broad Set of Use Cases

Log analysis HBase

Customer targeting Social media analysis

Customer Revenue Analytics

ETL Offload

Advertising exchange analysis and optimization

Clickstream Analysis Quality profiling/field

failure analysis

Customer Sentiment

Network Analytics

Monitors and measures behavior of online shoppers

Fraud Detection Channel analytics

Customer Behavior Analysis Brand Monitoring

Customer targeting Viewer Behavioral analytics

Recommendation Engine Family tree connections

Intrusion detection & prevention Forensic analysis

Global threat analytics

Virus analysis

Patient care monitoring

Leading Retailer Recommendation Engine Fraud detection and Prevention

Leading Bank

Page 41: What is the past future tense of data?

41©MapR Technologies - Confidential

MapR

MapRThe guys with the

cool hats

Page 42: What is the past future tense of data?

42©MapR Technologies - Confidential

MapR’s Innovations

Page 43: What is the past future tense of data?

43©MapR Technologies - Confidential

Seamless integration with existing applications

100% POSIX compliant

Industry standard APIs - NFS, ODBC, LDAP, REST

More 3rd party solutions

Proprietary connectors unnecessary

Language neutral

Page 44: What is the past future tense of data?

44©MapR Technologies - Confidential

MapR’s Innovations

Page 45: What is the past future tense of data?

45©MapR Technologies - Confidential

MapR: Lights Out Data Center Ready

Reliable Compute Dependable Storage

Automated stateful failover Automated re-replication Self-healing from HW

and SW failures Load balancing Rolling upgrades No lost jobs or data 99999’s of uptime

End-to-end checksums Strong consistency Business continuity with

snapshots and mirrors Recover to a point in time

with snapshots Mirror across sites for

disaster recovery

Page 46: What is the past future tense of data?

46©MapR Technologies - Confidential

MapR’s Innovations

Page 47: What is the past future tense of data?

47©MapR Technologies - Confidential

Why MapR Is Faster

• Eliminates storage contentionLockless Storage Service™

• Provides throughput at device speed Direct Block Device IO

• Exploits MapR-FS architecture to deliver performance using Hadoop Direct Shuffle

Hadoop Direct Shuffle

• Reduces network overhead using automatic compression

Client Side Compression

• Eliminates sporadic Java garbage collection overhead (system written in C)C vs Java

Page 48: What is the past future tense of data?

48©MapR Technologies - Confidential

Security

MapR is pushing the envelope on Hadoop security

Integrates with Linux security (PAM)– Works with any user directory: Active Directory, LDAP, NIS, …

Strong wire-level authentication and encryption– Kerberos and non-Kerberos options

Fine-grained access control– Full POSIX permissions on files and directories– ACLs on tables, column families, columns, cells– ACLs on MapReduce jobs and queues– Administration ACLs on cluster and volumes

Page 49: What is the past future tense of data?

49©MapR Technologies - Confidential

Bullet-proof NoSQL with Zero Administration

ReliabilityPerformance Easy Administration

Benefit Features

High Performance Over 1 Million ops/sec with 10 Node Cluster

Continuous Low Latency No I/O Storms, No Compactions

24x7 Applications Instant Recovery, Online Schema Modification, Snapshots, Mirroring

Zero Administration No Processes to Manage, Automated Splits, Self-tuning

High Scalability 1 Trillion Tables

Low TCO Files and Tables on One Platform

Page 50: What is the past future tense of data?

50©MapR Technologies - Confidential

MapR M7 vs. CDH – Mixed Load (50-50)

Page 51: What is the past future tense of data?

51©MapR Technologies - Confidential

MapR M7 vs. CDH – Mixed Load (50-50)

Page 52: What is the past future tense of data?

52©MapR Technologies - Confidential

MapR

MapRThe guys with the

cool solutions

Page 53: What is the past future tense of data?

53©MapR Technologies - Confidential

MapR

MapRThe future of

the future

Page 54: What is the past future tense of data?

54©MapR Technologies - Confidential

Thank You