Hadoop Reporting and Analysis - Jaspersoft
hortonworks -
Hadoop Reporting & Analysis: What Architecture is Best for Me?
©2013 Jaspersoft Corporation.
Presenters
Jim Walker, Director Product Marketing, Hortonworks
Twenty years of experience building products and bringing them to market. His expertise includes data loss prevention, master data management, and now big data.
Ben Connors, Worldwide Head of Alliances, Jaspersoft
Prior to Jaspersoft, Ben was at HP, Oracle, Viador, and other BI companies. He has over 20 years of experience in databases and business intelligence.
Matt Dahlman, Technical Director of Alliances, Jaspersoft
Prior to Jaspersoft, Matt was with Oracle, Netonomy, and Sybase. He brings over 15 years of database and business intelligence experience to his role.
Agenda
• Hadoop in the Modern Data Architecture
• Hadoop Usage Patterns
• Jaspersoft: Company, BI Suite
• Jaspersoft/Hortonworks Integration Demo
• The Future of Interactive Hadoop
• Q&A
©2013 Jaspersoft Corporation. Proprietary and Confidential
© Hortonworks Inc. 2013
A Brief History of Apache Hadoop
[Timeline axis: 2004, 2006, 2008, 2010, 2012, 2013]
• Focus on INNOVATION, 2005: Yahoo! creates team under E14 to work on Hadoop
• Focus on OPERATIONS, 2008: Yahoo team extends focus to operations to support multiple projects & growing clusters
• STABILITY, 2011: Hortonworks created to focus on “Enterprise Hadoop”. Starts with 24 key Hadoop engineers from Yahoo
• Milestones marked on the timeline: Apache Project Established; Yahoo! begins to operate at scale; Enterprise Hadoop; Hortonworks Data Platform
Existing Data Architecture
[Diagram: layers of the existing data architecture]
• APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
• DATA SYSTEMS: Traditional Repos (RDBMS, EDW, MPP)
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); OLTP, POS Systems
• OPERATIONAL TOOLS: Manage & Monitor
• DEV & DATA TOOLS: Build & Test
An Emerging Data Architecture
[Diagram: layers of the emerging data architecture]
• APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
• DATA SYSTEMS: Traditional Repos (RDBMS, EDW, MPP); Hortonworks Data Platform
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media); OLTP, POS Systems; Mobile Data
• OPERATIONAL TOOLS: Manage & Monitor
• DEV & DATA TOOLS: Build & Test
Interoperating With Your Tools
[Diagram: Hortonworks Data Platform alongside existing tools]
• APPLICATIONS: apps
• DATA SYSTEMS: Traditional Repos; Hortonworks Data Platform
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media); OLTP, POS Systems; Mobile Data
• OPERATIONAL TOOLS: Manage & Monitor
• DEV & DATA TOOLS: Build & Test
HDP: Enterprise Hadoop Distribution
Hortonworks Data Platform (HDP): Enterprise Hadoop
• The ONLY 100% open source and complete distribution
• Enterprise grade, proven and tested at scale
• Ecosystem endorsed to ensure interoperability
[Diagram: HORTONWORKS DATA PLATFORM (HDP)]
• HADOOP CORE: Distributed Storage & Processing
• PLATFORM SERVICES: Enterprise Readiness: HA, DR, Snapshots, Security, …
• DATA SERVICES: Store, Process and Access Data
• OPERATIONAL SERVICES: Manage & Operate at Scale
• Components shown: HDFS, YARN (in 2.0), WEBHDFS, MAP REDUCE, HCATALOG, HIVE, PIG, HBASE, SQOOP, FLUME, OOZIE, AMBARI
• Deployment targets shown: OS, Cloud, VM, Appliance
Operational Data Refinery
Collect data and apply a known algorithm to it in a trusted operational process
1. Capture: Capture all data
2. Process: Parse, cleanse, apply structure & transform
3. Exchange: Push to existing data warehouse for use with existing analytic tools
Usage patterns: Refine, Explore, Enrich
[Diagram: data flows from sources through the platform to applications]
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media)
• DATA SYSTEMS: Traditional Repos (RDBMS, EDW, MPP); Hortonworks Data Platform
• APPLICATIONS: Business Analytics, Custom Applications, Enterprise Applications
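The Capture, Process, Exchange steps of the refinery pattern can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the log format, the sample lines in `RAW_LOGS`, and the function names `capture`, `process`, and `exchange` are hypothetical and do not come from the presentation, and a real refinery would run these steps on Hadoop rather than in a single process.

```python
import csv
import io
import re

# Hypothetical raw input: web log lines in a common-log-like format.
RAW_LOGS = [
    '10.0.0.1 - - [12/Mar/2013:10:01:22 +0000] "GET /products/42 HTTP/1.1" 200 512',
    'garbage line that cannot be parsed',
    '10.0.0.2 - - [12/Mar/2013:10:01:25 +0000] "GET /cart HTTP/1.1" 500 0',
]

# Assumed log layout; adjust the pattern to the real source format.
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) \S+" (?P<status>\d{3}) (?P<size>\d+)'
)

FIELDS = ["ip", "ts", "method", "path", "status", "size", "is_error"]


def capture(lines):
    """Step 1, Capture: keep every raw line, good or bad."""
    return list(lines)


def process(raw_lines):
    """Step 2, Process: parse, cleanse, apply structure, and transform."""
    rows = []
    for line in raw_lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # cleanse: drop lines that do not parse
        row = match.groupdict()          # apply structure: named fields
        row["status"] = int(row["status"])
        row["size"] = int(row["size"])
        row["is_error"] = row["status"] >= 500  # transform: derived column
        rows.append(row)
    return rows


def exchange(rows):
    """Step 3, Exchange: emit CSV text that a warehouse bulk loader could ingest."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


structured = process(capture(RAW_LOGS))
warehouse_csv = exchange(structured)
```

The raw lines are kept untouched in the capture step, so a different parsing rule can be applied later without collecting the data again.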
Application Enrichment
Collect data, analyze and present salient results for online apps
1. Capture: Capture all data
2. Process: Parse, cleanse, apply structure & transform
3. Exchange: Incorporate data directly into applications
Usage patterns: Refine, Explore, Enrich
[Diagram: data flows from sources through the platform to applications]
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media)
• DATA SYSTEMS: Traditional Repos (RDBMS, EDW, MPP); Hortonworks Data Platform; NOSQL
• APPLICATIONS: Custom Applications, Enterprise Applications
Big Data Exploration & Visualization
Collect data and perform iterative investigation for value
1. Capture: Capture all data
2. Process: Parse, cleanse, apply structure & transform
3. Exchange: Explore and visualize with analytics tools supporting Hadoop
Usage patterns: Refine, Explore, Enrich
[Diagram: data flows from sources through the platform to applications]
• DATA SOURCES: Traditional Sources (RDBMS, OLTP, OLAP); New Sources (web logs, email, sensor data, social media)
• DATA SYSTEMS: Traditional Repos (RDBMS, EDW, MPP); Hortonworks Data Platform
• APPLICATIONS: Business Analytics
The Intelligence Inside
Competing on Time and Information
“The New Factors of Production: Time and Information” (Brian Gentile, Jaspersoft)
But business users don’t have access to timely, actionable data.
Why? Most don’t spend their day inside a BI tool … nor do they want to!
We Need “Intelligence Inside”
We want information to FIND US, not the other way round
“We need Intelligence Inside the applications and business processes we use every day.”
• Pipeline dashboard inside SaaS CRM app
• Performance report inside partner portal
• Salary data visualizations inside HR intranet
• Portfolio analytics inside client website
• Tickets crosstab inside custom helpdesk app
• Interactive charts inside native mobile app
Jaspersoft: The Intelligence Inside
Self-Service BI + Embeddable + Affordable
“We empower millions of people every day to make decisions faster by delivering timely, actionable data to them inside their apps and business process through an embeddable, cost-effective reporting and analytics platform.”
Intelligence Inside
Example Customers
Commercial Apps
Customer Portals
Cloud Apps
Internal Apps
Big Data Analytics
The Intelligence Inside Business
The Intelligence Inside the New IT Stack
• Inaugural BI service: On VMware Cloud Foundry, On Red Hat OpenShift
• Jaspersoft Certified Amazon Redshift and RDS
• To connect directly (no ETL) to non-SQL like MongoDB and HBase
“Our mission is to become the de facto reporting and analytic service in the New IT Stack, enabling BI Builders to build the Intelligence Inside internal and commercial apps on the leading Cloud platforms, powered by the new Big Data stores.”
Broad Recognition, Strong Partnerships
50%+ ACV Growth Every Year
Magic Quadrants
World’s Most Widely Deployed BI
• Commercial Open Source BI Suite
• Nearly 200 people in US, EMEA, APAC
• 16,000,000 downloads
• 325,000 community members
• 130,000 embedded applications
• 15,000 paying customers
• 1,800 subscription customers
Jaspersoft: High Growth and Momentum
Product Overview
Design Any Report . . .
… Dashboard
… or Analytic View
… using Any Data Type
[Figure: data source logos grouped as Relational, Big Data, and Files; surviving labels include POJO files, Redshift, and BigQuery]
… bringing Intelligence to Any App
… with a World-Class BI Platform
[Diagram: Jaspersoft BI platform stack, top to bottom]
• HTML5 Browser, Native Mobile Apps
• Reporting, Dashboards, Visualization, OLAP Analysis
• Columnar-Based In-Memory Engine
• Business Metadata Layer
• Data Connectivity to Any Data: Data Integration, Data Virtualization, Direct
• Data sources: Hadoop, RDBMS, Other Data
• 100% Web Standards: CSS, .JS, .JSP, Java
• Extensive APIs: HTTP, SOAP, REST
Three Approaches to Big Data Analysis
Approach 1: Data Exploration
• Use case: For data analysts and data scientists who want to discover real-time patterns as they emerge from their Big Data content
• Latency: Low
• Big Data: HBase, NoSQL, Analytic DBMS
• Connectivity: Native
• Architecture: BI Platform, In-Memory Engine, Native (delivers Multi-Dimensional Analysis)
Approach 2: Operational Reporting
• Use case: For executives and operational managers who want summarized, pre-built daily reports on Big Data content
• Latency: Medium
• Big Data: Hive, NoSQL, Analytic DBMS
• Connectivity: Native, SQL
• Architecture: BI Platform, Native SQL (delivers Reports & Dashboards)
Approach 3: Analytics
• Use case: For data analysts and operational managers who want to analyze historical trends based upon pre-defined questions in their Big Data content
• Latency: High
• Big Data: Hadoop, NoSQL, Analytic DBMS
• Connectivity: ETL
• Architecture: BI Platform, OLAP Engine, Data Mart, ETL (delivers Multi-Dimensional Analysis)
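One way to make the comparison of the three approaches concrete is to encode it as data and filter it by requirement. The sketch below is illustrative only: the `Approach` class and the helpers `approaches_within` and `approaches_for_store` are hypothetical names, not part of any Jaspersoft or Hortonworks API, and the values are copied from the slide's Latency, Big Data, and Connectivity rows.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Approach:
    name: str
    latency: str             # "Low", "Medium", or "High", as on the slide
    big_data_stores: tuple   # the slide's "Big Data" row
    connectivity: tuple      # the slide's "Connectivity" row


APPROACHES = (
    Approach("Data Exploration", "Low",
             ("HBase", "NoSQL", "Analytic DBMS"), ("Native",)),
    Approach("Operational Reporting", "Medium",
             ("Hive", "NoSQL", "Analytic DBMS"), ("Native", "SQL")),
    Approach("Analytics", "High",
             ("Hadoop", "NoSQL", "Analytic DBMS"), ("ETL",)),
)

# Ordering used to compare latency labels.
LATENCY_RANK = {"Low": 0, "Medium": 1, "High": 2}


def approaches_within(max_latency):
    """Return names of approaches whose latency is at or below max_latency."""
    limit = LATENCY_RANK[max_latency]
    return [a.name for a in APPROACHES if LATENCY_RANK[a.latency] <= limit]


def approaches_for_store(store):
    """Return names of approaches the slide lists for a given data store."""
    return [a.name for a in APPROACHES if store in a.big_data_stores]
```

For example, `approaches_within("Medium")` narrows the choice to Data Exploration and Operational Reporting, and `approaches_for_store("Hive")` points to Operational Reporting.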
Jaspersoft’s Hadoop Difference
Advanced Hadoop integration
• Only BI provider that can support 3 approaches to Hadoop analytics: Live Exploration, Batch Analysis, Batch Reporting
• Direct, native connectors to Hive and HBase
Broad partnerships
Deep knowledge and ecosystem
Jaspersoft 5 Demo
“We've taken the desktop power of data visualization tools, built it to scale on the HTML5 web, and made it embeddable within any app, device or portal”
Hortonworks Snapshot
• We distribute the only 100% Open Source Enterprise Hadoop Distribution: Hortonworks Data Platform
• We engineer, test & certify HDP for enterprise usage
• We employ the core architects, builders and operators of Apache Hadoop
• We drive innovation within Apache Software Foundation projects
• We are uniquely positioned to deliver the highest quality of Hadoop support
• We enable the ecosystem to work better with Hadoop
Develop Distribute Support
We develop, distribute and support the ONLY 100% open source Enterprise Hadoop distribution
Endorsed by Strategic Partners
Headquarters: Palo Alto, CA
Employees: 180+ and growing
Investors: Benchmark, Index, Yahoo
Hortonworks Approach
Identify and introduce enterprise requirements into the public domain
Work with the community to advance and incubate open source projects
Apply Enterprise Rigor to provide the most stable and reliable distribution
Community Driven Enterprise Apache Hadoop