Decision Ready Data: Power Your Analytics with Great Data
Murthy Mathiprakasam
2
3
Repeatably deliver trusted and timely data for great analytics and great social impact
Your Mission
Great Data Powers Great Analytics and Great Impact
CIOs See Competitive Advantage in Analytics
BUT More than half of analytics projects
fail
Gartner
Analytics is the top priority for CIOs
again in 2015
Gartner
CIO Investment Priority
New Data Sources Enable New Insights
6
In the era of Big Interaction Data, unprecedented insights are available from analyzing new data sources
SENSORS METERS LOGS
BADGES WEARABLES MOBILE
Cloud
New Data Platforms
Big DataTraditional /Real Time
New Visualization Tools
New Platforms Enable New Analytical Capabilities
But Data Is Fragmented As Data Sources Proliferate
8
Data Warehouse
Transactional Applications
CRM ERP HR FIN
Big Data
UnstructuredSemi-Structured
Real-timeEvents
MainframeSystems
Cloud, Social, Partner Data
EnterpriseApplications
It’s Difficult to Trust All Of The Available Data
9
John Smith11710 Plaza DriveReston, VA ______(___)-___-____
IncompleteDATASETS THAT
ARE NOT ACCURATE
Jonathan Smith
John Smith
John H Smith
InconsistentDATASETS THAT
ARE NOT STANDARDIZED
InsecureDATASETS THAT
ARE NOT MASKED
vs
Data Has Not Kept Up With The Pace of Business
10
Up to 80%ANALYST TIME SPENT ON
DATA PREPARATION
Untimely DeliveryOF DATA INHIBITS
AGILE, REAL-TIME DECISIONS
Analytics Is Costly & Complex To Deliver Today
11
Can’t Re-Use EXISTING SKILLS
WHEN PLATFORMS CHANGE
Can’t Re-Use EXISTING PROCESSESTO DRIVE SCALABILITY
AND REPEATABILITY
As A Result, Decisions Suffer and People Suffer Impact
12
Can’t make comprehensive decisions based on all of the available data
Can’t make accurate decisions based onhigh quality and secure data
Can’t make timely decisions based on fresh and up-to-date data
Can’t operationalize data delivery tofuel decisions repeatably and scalably
What Is Needed: Faster, Better, Less Costly Analytics
Insights ThatAre Timely
Data ThatCan BeTrusted
Simple,Standardized,
ScalableDelivery
14
Repeatably deliver trusted and timely data for great analytics and great social impact
Your Mission
Imagine If You Could Put More Data To Use
15
Word, ExcelPDFStarOfficeEmail, LDAP
OracleDB2SQL ServerSybase
InformixTeradataNetezzaODBC/JDBC
Flat filesHTTP/HTMLRPGANSI
ASTFIXSWIFTMVR
SAP NetWeaverSAP NetWeaver BI SASSiebel
JD Edwards Lotus NotesOracle E-BusinessPeopleSoft
EDI–X12EDI-FactRosettaNetHL7/HIPAA
XMLLegalXMLIFXcXML
SalesforceRightNowNetSuiteOracle OnDemand
FacebookTwitterLinkedInDatasift
ebXMLHL7 v3.0ACORD
100+ PRE-BUILT PARSERS
200+ PRE-BUILT CONNECTORS
Out of the BoxBUSINESS RULES AND
DATA STANDARDIZATION
Sample of Compatible Data Types and Sources
Imagine If You Could Stream Data At High Speeds
REFINE
INGEST
Profile, Parse, Cleanse, Match
Stream
STORE
ACT
Complex Event Processing
NoSQL Databases
LAN/WAN SCALESTREAMING COLLECTION
CENTRALIZED MANAGEMENTOF DISTRIBUTED COLLECTION
REAL-TIME INGESTION OF REAL-TIME DATA FOR
REAL-TIME RESPONSE
SOURCE
Transactional Data
InteractionData
Sensor/DeviceData
Documents/Files
IndustryFormats
Imagine If You Could Easily Adopt New Platforms
17
400%FASTER MIGRATION TO
NEW PLATFORMS
500% FASTER PERFORMANCE ON
DATA PREPARATION
0%REWRITING OF DATA PREPARATION JOBS
18
Imagine If You Could Easily Adopt Hadoop
REFINE WAREHOUSE
Profile, Parse, Cleanse, Match
Offload infrequently used data for
active archivingOffload ETL/ELT for efficient processing
Reuse existing skills and processes
Imagine If You Could Develop And Staff More Quickly
19
HadoopDevelopers
InformaticaDevelopers
100,000+TRAINED DEVELOPERS
WORLDWIDE
500% MORE PRODUCTIVE THAN HAND-CODING
0%RISK OF REWRITING
OUTDATED CODE
20
Imagine If You Could Understand Your Data
“Contact [email protected] more information about #AAPL and #GOOG”
Person: William HarrisonCompany: Apple, IncCompany: Google
EXTRACT ENTITIESWITH NATURAL
LANGUAGE PROCESSING
ENRICH DATASETSWITH ADDRESS VALIDATION
AND GEOCODING
MATCH AND STANDARDIZEFOR DATA QUALITY
AND DATA MASTERING
21
Imagine If You Could Protect Your Data
PHI: Protected Health InformationPII: Personally Identifiable InformationScalable to look for/discover ANY Domain type
ANALYZE STRUCTUREOF DATA WITH BUILT-IN
DATA PROFILING
ISOLATE BAD DATA QUICKLYWITH PROFILING STATISTICS
UNDERSTAND MEANINGAND CONTEXT OF DATA
IDENTITY SENSITIVE DATAWITH DATA DOMAIN REPORTS
Imagine If You Could Provide Virtual Access…
CANONICAL DATA MODELPRODUCT …CUSTOMER ORDER
SOURCE
VIRTUALIZE
ANALYZE ACCESS & MERGEDATASETS USING VIRTUAL TABLES
PREPARE IN REAL-TIMEWITH COMMON METADATA
REPOSITORY
Transactional Data
InteractionData
Sensor/DeviceData
Documents/Files
IndustryFormats
BusinessIntelligence
Agile Visualization
…Or Broker Physical Provisioning Automatically
SOURCE
Transactional Data
InteractionData
Sensor/DeviceData
Documents/Files
IndustryFormats
BROKER
ANALYZE
BusinessIntelligence
Agile Visualization
LOOSER COUPLINGBETWEEN SOURCE SYSTEMS AND DESTINATION SYSTEMS
FASTER PROVISIONINGOF DATA TO DISTRIBUTED
CONSUMERSIntegrationHub
Informatica Delivers Great Data For Any InitiativeAccess Any Data / Any
Volume
Faster Data Onboarding
• Integrate• Load• Transform• Cleanse• Master
Offload Data and
Processing Batch
Deliver More Trusted Data
Data Warehouse
• Prepare• Analyze• Profile• Cleanse
Offload to High Performance
Storage
Realtime
Storage
X
25
#1) Define The Mission Of Your Journey
26
IdentifyTheData
SelectThe
Consumers
Establish TheGoal
#2) Deploy Leverage At Every Step
27
LeverageExisting
Centers OfExcellence
Leverage LightweightStandards &Processes
LeverageTechnology
for Scale andRepeatability
#3) Deliver With Partners For Maximum Success
28
…
Strong Partner Ecosystem
• FREE 2 Hour Workshop• DW Optimization
Assessment• Readiness Assessment
Proven Methodology
Data Integration Data Quality
Cloud Data Integration Data Archiving
Market Leading Platform
29
GreatTransportation
Aspiration: Florida Turnpike sought to improve emergency preparedness and improve the prepaid toll program
Challenge: Data collection took over one month leading to faulty analytics
Outcome: “Timely and accurate traffic, revenue, and participation reports help management make good choices that will eventually result in saving money.”
— Bob Hartmann, IT Director, Florida Turnpike Enterprise
30
GreatEnvironment
Aspiration: US Geological Service sought to improve the quality of water in the United States
Challenge: Collect distributed data and build a centralized water quality dataset
Outcome: “We chose Informatica as our data integration solution because of its maturity, wide range of features, ease of use and industrial strength, integrated architecture.”
— Harry House, Data Warehouse Practice Leader, USGS
31
GreatEducation
Aspiration: Rochester Institute of Technology sought to understand how it could improve student enrollment, student housing, and student retention
Challenge: Data was in disparate systems
Outcome: “We're becoming myth busters. Informatica provides timely, accurate information we need to spot trends, improve the quality of our academic learning, and reduce attrition.”
— Kim Sowers, Director of Application Development, Rochester Institute of Technology
32
GreatHealthcare
Aspiration: Utah Dept of Health sought to process healthcare claims faster and improve public health
Challenge: Manual effort to track and link claims data over time
Outcome: “We see the Informatica as absolutely essential to everything that we want to do, not only to meet our mandate for the All Payer Database,”
— Dr. Keely Cofrin Allen, Director, Office of Health Care Statistics, State of Utah Department of Health
33
Repeatably deliver trusted and timely data for great analytics and great social impact
Your Mission
Across nearly any data, any data platform, any data visualization
Informatica Delivers Great Data for Great Social Impact
Cloud Big DataTraditional /Modern
Data Sources
Data Visualization
Data Platforms
Thank You
Top Related