Revolution R Enterprise - 100% R and More Webinar Presentation

Post on 10-May-2015

5.102 views 0 download

Tags:

description

R users already know why the R language is the lingua franca of statisticians today: because it's the most powerful statistical language in the world. Revolution Analytics builds on the power of open source R, and adds performance, productivity and integration features to create Revolution R Enterprise. In this presentation, author and blogger David Smith will introduce the additional capabilities of Revolution R Enterprise.

Transcript of Revolution R Enterprise - 100% R and More Webinar Presentation

Revolution Confidential

Revolution R: 100% R and More

Presented by:David SmithVP Marketing, Revolution

Analytics

Revolution Confidential

Poll Question

Which stats package do you use most?

Revolution Confidential

3

October 19, 2011: Welcome!

Thanks for coming. Slides and replay available (soon) at:

http://bit.ly/p6ulsu

David SmithVP Marketing, Revolution AnalyticsEditor, Revolutions blog

http://blog.revolutionanalytics.comTwitter: @revodavid

Revolution Confidential

4

In today’s webcast:

About Revolution Analytics and R

What Revolution R adds to R

Resources for getting more from R

Q&A

Introducing Revolution R

Revolution Confidential

5

What is R?

Data analysis software A programming language

Development platform designed by and for statisticians

An environment Huge library of algorithms for data access, data

manipulation, analysis and graphics

An open-source software project Free, open, and active

A community Thousands of contributors, 2 million users Resources and help in every domain

Download the White Paper

R is Hotbit.ly/r-is-hot

Revolution Confidential

Source: http://r4stats.com/popularity 6

R is exploding in popularity and functionality

Stata 10%

S-Plus 0%

SPSS -27%

SAS -11%

R 46%

Scholarly ActivityGoogle Scholar hits (’05-’09 CAGR)

0

500

1000

1500

2000

2500

20102008200620042002

Package GrowthNumber of R packages listed on CRAN

“A key benefit of R is that it provides near-instant availability of new and

experimental methods created by its user base — without waiting for the

development/release cycle of commercial software. SAS recognizes the value of R

to our customer base…”

Product Marketing Manager SAS Institute, Inc.

“I’ve been astonished by the rate at which R has been adopted. Four years ago,

everyone in my economics department [at the University of Chicago] was using

Stata; now, as far as I can tell, R is the standard tool, and students learn it first.”

Deputy Editor for New Products at Forbes

Revolution Confidential

7

“R is the most powerful & flexible statistical programming language in the world”1

Capabilities Sophisticated

statistical analyses Predictive analytics Data visualization

Applications Real-time trading Finance Risk assessment Forecasting Bio-technology Drug development Social networks .. and more

1. Norman Nie, multiple interviews

15

20

25

30

MSFT [2009-01-02/2010-03-31]

Last 29.29

Volume (millions):63,760,000

50

100

150

200

250

Moving Average Convergence Divergence (12,26,9):MACD: 0.702Signal: 0.712

-6

-4

-2

0

2

4

6

Jan 02 2009 Apr 01 2009 Jul 01 2009 Oct 01 2009 Jan 04 2010 Mar 31 2010

8

R User CommunityFrom: The R Ecosystem

bit.ly/R-ecosystem

Revolution Confidential

Poll Question

If you're not using R today, what would you most like to use R for?

Revolution Confidential

10

Revolution R Enterprise is

Revolution Confidential

11

R Productivity Environment (Windows)Script with type ahead and code

snippetsSolutions window

for organizing code and data

Packages installed and

loaded

Objects loaded in the

R Environment

Object details

Sophisticated debugging with

breakpoints , variable values etc.

http://www.revolutionanalytics.com/demos/revolution-productivity-environment/demo.htm

Revolution Confidential

12

Interactive Debugging

One-click to set a breakpoint in an R script Step in/out/over, inspect variables Eliminate the edit -> browser -> repair cycle

Revolution Confidential

13

Performance: Multi-threaded Math

Open

Source R

Revolution R Enterprise

Computation (4-core laptop) Open Source R Revolution R Speedup

Linear Algebra1

Matrix Multiply 327 sec 13.4 sec 23x

Cholesky Factorization 31.3 sec 1.8 sec 17x

Linear Discriminant Analysis 216 sec 74.6 sec 2x

General R Benchmarks2

R Benchmarks (Matrix Functions) 22 sec 3.5 sec 5x

R Benchmarks (Program Control) 5.6 sec 5.4 sec Not appreciable

1. http://www.revolutionanalytics.com/why-revolution-r/benchmarks.php2. http://r.research.att.com/benchmarks/

Revolution Confidential

14

Three Paradigms for Big Data

Standard R engine is constrained by capacity and performance

Revolution R Enterprise offers three methods for big data with R: Off-line: high-performance file-based analytics Off-line, parallel & distributed analytics On-line, in-database analytics

Hadoop Netezza

Revolution Confidential

15

Revolution R Enterprise with RevoScaleRBig Data Statistics in R

www.revolutionanalytics.com/bigdata

Every US airline departure and arrival, 1987-2008

File: AirlineData87to08.xdfRows: 123.5 millionVariables: 29Size on disk: 13.2Gb

arrDelayLm2 <- rxLinMod(ArrDelay ~ DayOfWeek:F(CRSDepTime),cube=TRUE)

Revolution Confidential

17

Compute Node

(RevoScaleR)

Compute Node

(RevoScaleR) Master Node

(RevoScaleR)

DataPartition

DataPartition

Compute Node

(RevoScaleR)

Compute Node

(RevoScaleR)

DataPartition

DataPartition

• Portions of the data source are made available to each compute node

• RevoScaleR on the master node assigns a task to each compute node

• Each compute node independently processes its data, and returns its intermediate results back to the master node

• master node aggregates all of the intermediate results from each compute node and produces the final result

RevoScaleR – Distributed Computing

*Available for Microsoft HPC Server, November 2011Video demo: http://bit.ly/riUBgs

Revolution Confidential

18

Revolution Analytics with Netezza Appliance

More info: http://bit.ly/R-Netezza

Revolution Confidential

19

RevoConnectR for Hadoop

Revolution R Client

R

Map or Reduce

Job Tracker

Task Node

HDFS

HBASE

Thrift

rhdfs - R and HDFS

rhbase - R and HBASE

rmr - R and MapReduce

Write Map-Reduce analytics using only R code with these R packages:

rmr

rhdfs rhbase

More information at:bit.ly/r-hadoop

Revolution Confidential

20

Enterprise Readiness: Revolution R Enterprise Server

Multi-User Support Production Applications

Integrate R analytics into Web based applications Data Analysis and Visualization Reporting Dashboards Interactive applications

Revolution R Enterprise Server with RevoDeployR

Revolution Confidential

21

Deployment with Revolution R Enterprise

RevoDeployR Web Services

Client libraries (JavaScript, Java, .NET)

Desktop Applications (e.g. Excel)

Business Intelligence

(e.g. Jaspersoft)

Interactive Web Applications

HTTP/HTTPS – JSON/XML

Session Management

AuthenticationData/Script

ManagementAdministration

R

R Programmer

ApplicationDeveloper

End User

Revolution Confidential

22

Coming soon: Revolution R GUI Accessible

Powerful

Extensible

Revolution Confidential

23

The Advanced Analytics Stack

Deployment / Consumption

Advanced Analytics

ETL

Data / Infrastructure

“Open Analytics Stack” White Paper: bit.ly/lC43Kw

Revolution Confidential

24

On-Call Technical Support Consulting

Migration | Analytics | Applications | Validation Training

R | Revolution R | Statistical Topics Systems Integration

BI | ERP | Databases | Cloud

Revolution Confidential

Wrapping Up

Revolution ConfidentialWhy R?

26

Every data analysis technique at your fingertips Create beautiful and unique data visualizations Get better results faster Draw on the talents of data scientists worldwide R is hot, and growing fast

Revolution Confidential

27

Revolution R Enterprise

High-performance R for multiprocessor systemsModern Integrated Development EnvironmentStatistical Analysis of Terabyte-Class Data Sets In-database R analytics with Hadoop and NetezzaDeploy R Applications via Web ServicesTelephone and email technical supportTraining and consulting services100% compatible with R packagesEasy-to-Use GUI1

Production-Grade Statistical Analysis for the Workplace

1 Coming Soon

Revolution Confidential

29

Revolution R Enterprise: Free to Academia

Personal use Research Teaching Package development

Free Academic Downloadwww.revolutionanalytics.com/downloads/free-academic.php

Discounted Technical Support Subscriptions Available

Revolution Confidential

30

Thank You!

Download slides, replay (from Oct 20) http://bit.ly/railcj

Learn more about Revolution R revolutionanalytics.com/products

Contact Revolution Analytics http://bit.ly/hey-revo

Special Offer: Revolution R Enterprise Workstation for $499

Including R Productivity Environment (IDE) with visual debugger, multi-processor capabilities, Big Data analysis with RevoScaleR, and Technical Support

Available until November 15 at http://bit.ly/revo-499

Revolution Confidential

Poll Question

What interests you most about Revolution R Enterprise?

Revolution Confidential

32

The leading commercial provider of software and support for the popular open source R statistics language.

www.revolutionanalytics.com+1 (650) 646 9545

Twitter: @RevolutionR