Phua Chiu Kiang, Microsoft MVP (SQL Server)
Microsoft Data Warehousing Vision
Make SQL Server the gold standard for data warehousing, offering customers:
• Massive Scalability at Low Cost
• Improved Business Agility and Alignment
• Hardware Choice
Microsoft Confidential: Preliminary Information Subject to Change
Approximate data volume managed by data warehouse

Data volume          Today    In 3 Years
Less than 500 GB     21%      5%
500 GB – 1 TB        20%      12%
1 – 3 TB             21%      18%
3 – 10 TB            19%      25%
More than 10 TB      17%      34%
Don't Know           2%       6%
Source: TDWI Report – Next Generation DW
Data Warehouse Industry Trends
[Figure: DW technologies plotted by plan to use (y-axis, 0% to 100%, narrow to broad commitment) against anticipated growth in the next 3 years (x-axis, -50% to 100%, decreasing to increasing usage).]
Technologies plotted: DBMS Built for Transactions; SMP; Centralized EDW; Analytics within EDW; Analytics Outside EDW; Blades in Racks; DBMS Built for DW; Server Virtualization; DW Bundles; Security; DW Appliance; Mixed Workloads; Data Federation; Columnar DBMS; Streaming Data; SOA; Low-Power Hardware; In-Memory DBMS; SaaS; Open Source OS; Open Source Reporting; Open Source Data Integration; Software Appliance; Public Cloud; Open Source DBMS; Advanced Analytics; Data Quality; HA for DW; Web Services; MPP; 64-bit; MDM; Real-time DW
Source: TDWI
Areas of strategic investment for Microsoft
Building a traditional DW
• Time consuming
• Expensive
• Performance varies
• Scalability issues
• Potential bottlenecks in standard DW architecture
The DW appliance model
• Tuned h/w + s/w
• Views entire stack holistically
• Known performance & scalability
• Encapsulates best practices
• Leverages Sequential I/O
Benefits
• Lower TCO
• Faster deployment
• Better performance
• Minimised DBA time
©2009 Microsoft Corporation
Software:
• SQL Server 2008 Enterprise
• Windows Server 2008
Hardware:
• Tight specifications for servers, storage and networking
• ‘Per core’ building block
Configuration guidelines:
• Physical table structures
• Indexes
• Compression
• SQL Server settings
• Windows Server settings
• Loading
Reduces DBA effort; fewer indexes, much higher level of sequential I/O
Dell, HP, Bull, EMC and IBM – more in future
Commodity Hardware and value pricing; Lower storage costs.
New reference architectures scale up to 48TB (assuming 2.5x compression)
Validated by Microsoft; better choice of hardware; application of Best Practice
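The 48TB figure above is quoted "assuming 2.5x compression", so the physical storage involved is smaller than the headline number. The following is a minimal sketch of that arithmetic. It assumes the 48TB refers to uncompressed user data and that the ratio applies uniformly across all tables; the function names are illustrative and do not come from any Microsoft tool.

```python
def on_disk_tb(user_data_tb: float, compression_ratio: float) -> float:
    """Physical storage (TB) needed for user data after compression."""
    if compression_ratio <= 0:
        raise ValueError("compression_ratio must be positive")
    return user_data_tb / compression_ratio


def user_data_capacity_tb(disk_tb: float, compression_ratio: float) -> float:
    """Uncompressed user data (TB) that fits in a given physical capacity."""
    if compression_ratio <= 0:
        raise ValueError("compression_ratio must be positive")
    return disk_tb * compression_ratio


if __name__ == "__main__":
    # Assumption: 48 TB is the uncompressed user-data figure from the slide.
    print(f"{on_disk_tb(48, 2.5):.1f} TB on disk for 48 TB of user data at 2.5x")
```

Real compression ratios depend on the data, so a sizing exercise would normally measure the ratio on a sample of the actual tables rather than take 2.5x as given.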
Comparison                    Teradata                                           SQL Server Fast Track DW                           Result
Loading Subject Area 1        5:10:21 total time                                 0:51:31 total time                                 6x faster
Loading Subject Area 2        4:36:08 total time                                 1:50:01 total time                                 2.5x faster
Query times Subject Area 1    3:03 avg query time (using 9 benchmark queries)    0:15 avg query time (using 9 benchmark queries)    12x faster
Query times Subject Area 2    56:44 avg query time (using 4 benchmark queries)   8:09 avg query time (using 4 benchmark queries)    7x faster
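The "x faster" figures in this comparison can be checked directly from the quoted times. The sketch below does so under two assumptions: that the first time in each row is the Teradata result and the second the Fast Track result, and that "1:50.01" is a typo for 1:50:01. The helper names are illustrative.

```python
def to_seconds(clock: str) -> int:
    """Convert 'h:mm:ss' or 'm:ss' into a number of seconds."""
    seconds = 0
    for part in clock.split(":"):
        seconds = seconds * 60 + int(part)
    return seconds


def speedup(baseline: str, candidate: str) -> float:
    """How many times faster the candidate time is than the baseline."""
    return to_seconds(baseline) / to_seconds(candidate)


# (baseline, candidate) pairs as read from the slide; the column
# assignment is an assumption, see the note above.
RESULTS = {
    "Loading, Subject Area 1": ("5:10:21", "0:51:31"),
    "Loading, Subject Area 2": ("4:36:08", "1:50:01"),
    "Query times, Subject Area 1": ("3:03", "0:15"),
    "Query times, Subject Area 2": ("56:44", "8:09"),
}

if __name__ == "__main__":
    for name, (baseline, candidate) in RESULTS.items():
        print(f"{name}: {speedup(baseline, candidate):.1f}x faster")
```

The computed ratios are roughly 6.0, 2.5, 12.2 and 7.0, which agrees with the rounded figures on the slide.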
2008 | 2009 | 2010 | Beyond
Enterprise ETL Services
Star Join Query Optimizations
DW Reference Architectures
Predictable performance at low cost
Faster time to solution
Fast Track Data Warehouse
Fast Track Data Warehouse 2.0
• New Reference Architectures from IBM
• Updated Configurations from HP, Dell and Bull
• EMC as a Service Partner for Fast Track
• Microsoft to create new Test Harness for validation of new Fast Track configurations
• NEC to validate new Reference Architectures
Fast Track vNext
• Future Partners to create new Validated Reference Architectures with Test Harness
• New Test Harness for Partners
Parallel Data Warehouse Appliance - Hardware Architecture
[Diagram labels: Control Nodes (Active / Passive); Management Servers; Landing Zone; Backup Node; Database Servers; Spare Database Server; Storage Nodes; Dual Infiniband; Dual Fiber Channel; Corporate Network; Private Network; "SQL" repeated on the server icons. Callout: Parallel Data Warehouse compute node (Database Server, Storage Node).]
Parallel Data Warehouse demo at BI conference 2008
• Query
  - Cache flushed
  - Inner joins
• Sample Results
  - 625K rows returned in 11 seconds from 1 trillion row table
  - Final product will be even faster
• Report
  - Retailer: day-part analysis
  - Sales, Time, Date, Prod type
Existing Environment
• Hardware: 16 CPU HP 8620 Itanium; Hitachi Storage 27TB Raw; SATA 21 LUNS
• Software: Windows 2003 SP2; SQL Server 2008; SSIS/SSRS
• Data Warehouse: 18 Terabytes; Star Schema; 80 Fact Tables; 500+ Dimensions

Current Challenges         Madison Highlights
Data Load Speeds           Improved by 300%
Analytic Capacity          30TB/160 Cores
Analytic Speed             Query Speeds 70X Improvement
Mixed Workload             Concurrency, Mixed Workload
Total Cost of Ownership    TCO Lowered by 50%
Parallel Data Warehouse
Project “Madison”
2008 | 2009 | 2010 | Beyond
2008
• Microsoft Announces Intention to Acquire DATAllegro (July)
• Acquisition Closes (Sept)
• 150TB demo of DATAllegro on SQL Server run at BI Conference (Oct)
2009 and 2010
• MTP Program Launched
• Circa 10 Customers Provided with early Madison Benchmark
• Madison Named as SQL Server 2008 R2 Parallel Data Warehouse
• List Price at $57.5K per proc
• MTP 2 Program to Launch (fully functional, fully performant)
• TAP Program (on client site)
• RTM in H1 2010
• Compatibility with DATAllegro v3
• MS BI integration
Beyond: PDW vNext
• Focus on continually lowering the costs of high end DW, while increasing performance
• Additional Hardware Partners
• Closer functional alignment with SQL Server
• Better integration with SQL and tools and technologies
Hub and Spoke – Flexible Business Alignment
• EDW provides “single version of truth” but makes it difficult to support mixed workloads and multiple user groups, each requiring SLAs
• Departmental data marts enable mixed workloads, but make it difficult to consolidate information across the enterprise
• A Hub and Spoke solution gives you the flexibility to add or change diverse workloads and user groups, while maintaining data consistency across the enterprise
Parallel database copy technology enables rapid data movement and consistency between hub and spokes
Create SQL Server 2008, Fast Track Data Warehouse, and SQL Server Analysis Services spokes
Support user groups with very different SLAs:
• Performance
• Capacity
• Loading
• Concurrency
[Diagram labels: GEO, AREAS, METRICS, Analytic, MDM]
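The hub-and-spoke idea described above can be illustrated with a toy model: a hub holds the consistent copy of each table and pushes copies to the spokes that subscribe to it, in parallel. This is only a conceptual sketch in plain Python. The `Hub` and `Spoke` classes, their methods and the sample table are hypothetical and are not the Parallel Data Warehouse copy technology or any SQL Server API.

```python
from concurrent.futures import ThreadPoolExecutor
from copy import deepcopy


class Spoke:
    """A departmental mart that receives table copies from the hub."""

    def __init__(self, name):
        self.name = name
        self.tables = {}

    def receive(self, table_name, rows):
        self.tables[table_name] = rows


class Hub:
    """Holds the authoritative tables and pushes them to spokes."""

    def __init__(self):
        self.tables = {}
        self.subscriptions = {}  # table name -> list of subscribed spokes

    def subscribe(self, spoke, table_name):
        self.subscriptions.setdefault(table_name, []).append(spoke)

    def load(self, table_name, rows):
        self.tables[table_name] = rows

    def publish(self, table_name):
        """Copy one table to every subscribed spoke, one worker per spoke."""
        rows = self.tables[table_name]
        spokes = self.subscriptions.get(table_name, [])
        with ThreadPoolExecutor(max_workers=max(1, len(spokes))) as pool:
            futures = [
                pool.submit(spoke.receive, table_name, deepcopy(rows))
                for spoke in spokes
            ]
            for future in futures:
                future.result()  # re-raise any copy failure
        return [spoke.name for spoke in spokes]


if __name__ == "__main__":
    hub = Hub()
    sales_mart, finance_cube = Spoke("sales_mart"), Spoke("finance_cube")
    hub.subscribe(sales_mart, "dim_geo")
    hub.subscribe(finance_cube, "dim_geo")
    hub.load("dim_geo", [{"geo_id": 1, "name": "APAC"}])
    print(hub.publish("dim_geo"))
```

Each spoke gets its own copy, so a spoke can serve its own workload and SLA without touching the hub, while the hub remains the single place where shared tables are loaded.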
Fast Track Data Warehouse offers customers
• Faster time to solution
• High scale: up to 48TB
• Low TCO with better price performance; industry standard hardware
• Better performance out of the box and predictable performance
• Reduced risk through balanced hardware & best practices
• Integration with Madison Hub & Spoke Architecture
• Twelve reference architectures from HP, Dell, Bull, EMC and IBM
• System Integrators with industry solution templates: Avanade, HP, Hitachi, Cognizant and EMC
SQL Server Fast Track Data Warehouse has 2 components
• Fast Track Data Warehouse
• Parallel Data Warehouse
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.
The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions,
it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation.
MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.