1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business...
-
Upload
daniel-hunter -
Category
Documents
-
view
235 -
download
3
Transcript of 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business...
![Page 1: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/1.jpg)
1Reviewing Data Warehouse Basics
![Page 2: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/2.jpg)
Lessons
1. Reviewing Data Warehouse Basics
2. Defining the Business and Logical Models
3. Creating the Dimensional Model
4. Creating the Physical Model
5. Storage Considerations for the Physical Model
6. Strategies for Extracting, Transforming, and Loading
7. Summary Management
8. Analytical Capabilities
![Page 3: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/3.jpg)
Definition of a Data Warehouse
“A data warehouse is a subject-oriented, integrated, nonvolatile, time-variant collection of data in support of management’s decisions.”
- Bill Inmon
“A system that extracts cleans, conforms, and delivers source data into a dimensional data store and then supports and implements querying and analysis for the purpose of decision making.”
- Ralph Kimball
![Page 4: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/4.jpg)
Basic Elements of the Data Warehouse
• Source: Source database or other source form
• Data staging area: Intermediate area
• Target: Presentation server for the new data warehouse or data mart
Source Target Data staging
area
![Page 5: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/5.jpg)
Diagram of a Data Warehouse System
![Page 6: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/6.jpg)
Basic Form of the Data Warehouse
Star schema (Dimensional model)
Customer Location
Sales
Supplier Product
![Page 7: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/7.jpg)
Data Warehouse and OLTP Database Design Differences
Unlike an OLTP database design, a warehouse database design must:
• Focus on queries
• Allow incremental development
• Be a nonvolatile structure
• Provide historical data
![Page 8: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/8.jpg)
Data Warehouse Features
A data warehouse:
• Is a repository for information
• Improves access to integrated data
• Ensures integrity and quality
• Provides an historical perspective
• Records results
• Is used by a broad spectrum of end users for a variety of purposes
• Reduces the reporting and analysis impact on operational systems
• Requires a major systems integration effort
![Page 9: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/9.jpg)
Exploring Data Warehouse Characteristics
• Subject-oriented
• Integrated
• Nonvolatile
• Time-variant
![Page 10: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/10.jpg)
Subject-Oriented
Data is categorized and stored by business subjectrather than by application.
OLTP applications
Customerfinancial
information
Data warehouse subject
Equityplans
Shares
Insurance
Loans
Savings
![Page 11: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/11.jpg)
Integrated
Data on a given subject is integrated.
Savings
Currentaccount
Loans
Customer
![Page 12: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/12.jpg)
Nonvolatile
Warehouse
ReadInsert
UpdateDelete
Load
Operational
Read
![Page 13: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/13.jpg)
Time-Variant
Data warehouse
JanuaryTime Data
01/01 January
02/01 February
03/01 March
![Page 14: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/14.jpg)
Load from Many Sources
Nonrelational
systems
Relational databases
External data
External formats
Archive data
Internal data
![Page 15: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/15.jpg)
Decision Support System (DSS)
Profile of DSS Queries
Storage Analytic
DSS
ODS DW OLAP DM
![Page 16: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/16.jpg)
DDS
Data Warehousing ProcessExtraction
RDBMSETL
Federated Data Warehouse
Transformation/Load
Tran
sform
ation
s
Publish
Data marts
DDS
DDS
SubscribeP
ortal
Access layer(s)
Metadata Repository
Flat files
Operational
External
Server logfiles
NDS
ETLStag
ing
area(s)
![Page 17: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/17.jpg)
Comparing Warehouses and Data Marts
Datawarehouse
Datamart
Versus
Property Data Warehouse Data MartScope Enterprise DepartmentSubjects Multiple Single, LOBData source Many FewImplementation time Months to years Months
![Page 18: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/18.jpg)
Flow of Data
StoreFeed
Operationaldata
Externaldata
Access
Relationaltools
Applications
OLAPtoolsMetadata
Summarydata
Raw data
![Page 19: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/19.jpg)
Dependent Data Mart Model
Data mart
Data mart
Systems
Legacy
Operational
Internal
External
Enterprise
ODS Data mart
![Page 20: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/20.jpg)
Independent Data Mart Model
Enterprise
ODSSystems
Legacy
Operational
Internal
ExternalData mart
Data mart
Data mart
Data mart
Data mart
Data mart
![Page 21: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/21.jpg)
Data Warehousing Today
• Business Intelligence
– To help business users understand their business better
– To help them make better operational, tactical, and strategic business decisions
– To help them improve business performance
![Page 22: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/22.jpg)
Data Warehousing Today
• Customer Relationship Management
– Consists of applications that support CRM activities
– Single customer view
– Campaign segmentation
– Customer analysis
– Personalization
– Customer loyalty scheme
![Page 23: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/23.jpg)
Data Warehousing Today
• Data Mining– Known as Knowledge Discovery
– Trying to find meaningful and useful information from a large amount of data
– Interactive or automated process to find patterns describing the data and to predict the future behavior of the data based on these patterns
• Usage– Analyzing the shopping data
– Finding out the pattern between crime and location
– Customer scoring in CRM in terms of loyalty
– Credit Scoring in the credit card industry
![Page 24: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/24.jpg)
Data Warehousing Today
• Master Data Management (MDM)
– Consolidates the master data and processes the data through predefined data quality rules.
– Any changes on master data in OLTP are sent to MDM
– Publishes data to other systems
• Customer Data Integration
– Is a MDM for customer data
– The process of retrieving, cleaning, storing, maintaining and distributing customer data
![Page 25: 1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.](https://reader036.fdocuments.us/reader036/viewer/2022062304/56649eec5503460f94bfe57b/html5/thumbnails/25.jpg)
Future Trends in Data Warehousing
• Unstructured Data
– Documents, images, audio, video, e-mails
• Search
– Search engine
• Service-Oriented Architecture (SOA)
• Real-Time Data Warehouse