Transcript of Fast Track, Microsoft SQL Server 2008 Parallel Data Warehouse and Traditional Data Warehouse Design...
- Slide 1
- Fast Track, Microsoft SQL Server 2008 Parallel Data Warehouse
and Traditional Data Warehouse Design BI Best Practices and Tuning
for Scaling SQL Server 2008
- Slide 2
- Slide 3
- Data Warehouse
- Slide 4
- Fast Track
- Slide 5
- PDW
- Slide 6
- Traditional MD design SSAS PDW SSAS
- Slide 7
- Characteristic comparison: Typical BI (DWs & DMs) vs. OLTP (Operational Database)
  Data activity: BI sees large reads (disjoint sequential scans), large writes (new data appends), and large-scale hashing; OLTP sees indexed reads and writes, small transactions, and constant small index reads, writes, and updates.
  Database sweet-spot size: BI runs 100s of gigabytes to terabytes (needs medium to large storage farms); OLTP runs gigabytes (requires smaller to medium-sized storage farms).
  Time period: BI is historical (contributes to large data volumes); OLTP is current.
  Queries: BI queries are largely unpredictable; OLTP queries are predictable.
  I/O throughput requirement: BI needs up to 20 GB/sec sustained throughput; for OLTP, IOPS is more important than sustained throughput.
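The "large sequential scans at sustained throughput" characteristic can be made concrete with a back-of-the-envelope scan-time estimate. The table size below is a made-up example; the throughput rates are illustrative, with 20 GB/sec taken as the slide's upper bound.

```python
# Back-of-the-envelope scan time for a sequential-read DW workload.
# Table size is hypothetical; throughput rates are illustrative.
def scan_time_seconds(table_size_gb: float, throughput_gb_per_sec: float) -> float:
    """Time to sequentially scan a table at a sustained throughput."""
    return table_size_gb / throughput_gb_per_sec

fact_table_gb = 2000  # hypothetical 2 TB fact table
for rate in (1, 5, 20):  # GB/sec sustained scan rates
    print(f"{rate:>2} GB/sec -> {scan_time_seconds(fact_table_gb, rate):7.1f} sec")
```

At the slide's 20 GB/sec upper bound, a 2 TB fact table scans in under two minutes; at 1 GB/sec the same scan takes over half an hour, which is why sustained throughput dominates DW sizing.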
- Slide 8
- Microsoft/HP Fast Track reference configurations OR SQL Server Parallel Data Warehouse (PDW) vs. SQL Server/HP traditional DW design reference configurations: different logical and physical DB design philosophies. Mmm, what will my logical & physical DB design look like? Lower hardware costs.
- Slide 9
- It is not uncommon to have hundreds of disk drives (often RAID 5) to support the I/O throughput requirements in a traditional DW environment.
- Slide 10
- How do Fast Track and PDW get their speed? An X-ray view at the physical disk level. First, let's look at a traditional DW.
- Slide 11
- Data is stored wherever it happens to land, rather than as sequential data: fact table initial load, fact table 2nd-day load, fact table 3rd-day load, fact table 5th-day load, fact table 6th-day load.
- Slide 12
- Column / Index / Column / Index / Column; pre-calculated data; pre-calculated data; duplicate data.
- Slide 13
- Disk throughput is slower with indexes, aggregates, and summary tables. Index-lite is faster because there is less disk-head movement, and eliminating indexes and storing data sequentially provides the fastest disk throughput rates. Traditional DW design: with indexes & summary tables. Fast Track & PDW: index-lite, delivering the fastest sequential scan rates.
- Slide 14
- Example: average disk seek time is typically about 4 ms; a full stroke is about 7.5 ms. At 15K RPM the platter makes 250 revolutions/sec, so a full revolution takes 4 ms and the average rotational latency is about 2 ms. Fast Track & PDW are designed to stream large blocks of data sequentially, which is even faster than average latency because the disk heads are directly over the streaming data.
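The slide's latency arithmetic follows directly from the spindle speed, as a quick sketch shows:

```python
# Rotational latency from spindle speed, matching the slide's 15K RPM example.
def rotation_ms(rpm: float) -> float:
    """Milliseconds for one full platter revolution."""
    return 60_000.0 / rpm

def avg_rotational_latency_ms(rpm: float) -> float:
    """On average, the target sector is half a revolution away."""
    return rotation_ms(rpm) / 2

print(rotation_ms(15_000))                # 4.0 ms per full revolution
print(avg_rotational_latency_ms(15_000))  # 2.0 ms average latency
```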
- Slide 15
- Seek time is typically 2-4x longer than average rotational latency. By eliminating seek time you can have approximately 2-4x fewer disk drives while maintaining a given throughput level. Fast Track & PDW are designed to stream large blocks of data sequentially! Why do PDW and Fast Track want data to be stored sequentially?
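The 2-4x drive-count claim can be sanity-checked with a rough model in which every random I/O pays seek plus rotational latency before the transfer, while sequential streaming pays transfer time only. All the numbers below (block size, streaming rate, target throughput) are illustrative assumptions, not vendor specifications.

```python
# Rough drive-count model: each random I/O pays seek + rotational latency
# plus transfer time; sequential streaming pays transfer time only.
# All numbers are illustrative assumptions, not vendor specifications.
def random_io_mb_per_sec(io_size_kb, seek_ms, latency_ms, stream_mb_s):
    """Effective per-drive throughput when every read needs a seek."""
    transfer_ms = io_size_kb / 1024 / stream_mb_s * 1000
    ios_per_sec = 1000 / (seek_ms + latency_ms + transfer_ms)
    return ios_per_sec * io_size_kb / 1024

stream = 100.0   # assumed sequential streaming rate per drive, MB/sec
rnd = random_io_mb_per_sec(512, 4.0, 2.0, stream)  # 512 KB reads, 4 ms seek, 2 ms latency
target = 2000.0  # assumed required aggregate throughput, MB/sec

print(f"drives needed, random access: {target / rnd:.0f}")     # 44
print(f"drives needed, sequential:    {target / stream:.0f}")  # 20
```

With these assumptions the seek-burdened drive delivers about 45 MB/sec versus 100 MB/sec streaming, roughly a 2.2x penalty, which sits inside the slide's 2-4x range.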
- Slide 16
- Slide 17
- Fast Track and PDW get their speed from FAST scan rates! In addition, HP and SQL Server PDW use Massively Parallel Processing (MPP) to expand Fast Track concepts into a BI appliance.
- Slide 18
- Traditional DB design Fast Track or PDW
- Slide 19
- Basic: 6-12 TB, DL38x w/ MSA2000
  Mainstream: 12-24 TB, DL585 G6 w/ MSA2000
  Mainstream: 16-32 TB, DL580 G5 w/ MSA2000 G2
  Premium: 24-48 TB, DL785 G6 w/ MSA2000 G2
- Slide 20
- HP SQL Server 2008 Parallel Data Warehouse (PDW) Control Rack
Data Rack
- Slide 21
- Free Your IT Pressures... Get More Value. Without HP Factory Express / With HP Factory Express: faster time to solution, free up valuable IT resources, maximize your IT investment.
- Slide 22
- ProLiant Servers
- Slide 23
- Slide 24
- Miscellaneous Techniques to Improve SQL Server BI
Performance
- Slide 25
- Slide 26
- Slide 27
- Slide 28
- Slide 29
- Slide 30
- SQL Server Analysis Services 2008
- Slide 31
- SQL Server Analysis Services 2008: Techniques to Improve Performance. SSAS has two major components. The Formula Engine does most of the analysis work and tries to keep cells in memory; fast clock speeds are best for it. The Storage Engine gets the data from disk when cells are not in memory. The goal is to minimize Storage Engine use and keep data in memory for the Formula Engine; faster storage (SSD) or more disk drives give the Storage Engine quicker responses. Manage the partitions in your AS database by the query performance required: large cubes (> 100 GB) may not fit in memory, so design the partitions to get into memory as quickly as possible. Best practice: less than 4 million cells per partition.
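The slide's 4-million-cell guideline translates into a simple partition-count calculation; the cube size below is a made-up example.

```python
import math

# Partition-count sketch for the slide's "< 4 million cells per partition"
# best practice. The total cell count is a hypothetical example.
MAX_CELLS_PER_PARTITION = 4_000_000

def partitions_needed(total_cells: int) -> int:
    """Minimum partitions that keep each under the cell-count guideline."""
    return math.ceil(total_cells / MAX_CELLS_PER_PARTITION)

print(partitions_needed(100_000_000))  # 25 partitions for a 100M-cell cube
```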
- Slide 32
- Tune memory
- Slide 33
- Slide 34
- Slide 35
- Slide 36
- Buffers are allocated via execution trees; each of these numbered steps represents a new execution tree. Spawning multiple copies of the package, each with a horizontal partition of the data, creates more process space and more execution trees.
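The horizontal-partitioning idea behind running parallel package copies can be sketched generically. This is not the SSIS API; it is a Python analogy in which disjoint row slices are processed by separate workers, the way each package copy gets its own execution trees and buffer space.

```python
from concurrent.futures import ThreadPoolExecutor

# Analogy for SSIS horizontal partitioning: split rows into n disjoint
# slices and process each in its own worker, as multiple package copies
# would each get their own execution trees and buffers.
def horizontal_partitions(rows, n):
    """Round-robin rows into n disjoint partitions."""
    parts = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        parts[i % n].append(row)
    return parts

def process(partition):
    # Stand-in for a package's data-flow work on its slice.
    return sum(partition)

rows = list(range(1, 101))
with ThreadPoolExecutor(max_workers=4) as ex:
    results = list(ex.map(process, horizontal_partitions(rows, 4)))
print(sum(results))  # 5050: same total as a single pass, done in parallel slices
```

The key property is that the slices are disjoint and cover all rows, so the parallel result matches a single serial pass.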
- Slide 37
- Slide 38
- www.microsoft.com/teched www.microsoft.com/learning
http://microsoft.com/technet http://microsoft.com/msdn
- Slide 39
- Slide 40
- Sign up for TechEd 2011 and save $500, starting June 8 through June 31st: http://northamerica.msteched.com/registration. You can also register at the North America 2011 kiosk located at registration. Join us in Atlanta next year!
- Slide 41
- Slide 42