Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

10
1 Joachim Hammer Data Warehousing Multidimensional Analysis Joachim Hammer

Transcript of Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

Page 1: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

1Joachim Hammer

Data Warehousing

Multidimensional Analysis

Joachim Hammer

Page 2: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

2Joachim Hammer

Data Warehousing Architecture

Information Sources Data Warehouse Server

OLAP Servers Clients

OperationalDB’s

SemistructuredSources

extracttransformloadrefreshetc.

Data Marts

DataWarehouse

MOLAP

ROLAP

serve

Analysis

Query/Reporting

Data Mining

Page 3: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

3Joachim Hammer

Multidimensional Modeling

• Support ad-hoc querying for business analyst• Think in terms of spreadsheets

– View sales data by geography, time, or product

• Away from traditional ER models– “One fact in one location”

– Entities are inter-related through a series of joins

Page 4: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

4Joachim Hammer

Multidimensional Data Model

• Database is a set of facts in multidimensional space

• Measures– Numerical data being tracked

– Stored in central fact table

– E.g., sales, inventory, expenditures

• Dimensions– Business parameters

– E.g., time, geography, account data

Page 5: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

5Joachim Hammer

Example

• “Sales by product line manager over the past six months”

Prod Code Time Code Acct Code Sales Qty

Account Info

Product Info

Time Info

. . .

Numerical MeasuresKey columns joining fact table

to dimension tables

Page 6: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

6Joachim Hammer

Multidimensional Data Model Cont’d• Dimensions have attributes• Organized into hierarchies

– E.g., Time dimension: days weeks quarters

– E.g., Product dimension: product product line brand

• Operators to navigate the hierarchies– “Roll-up”

– Drill-down is the opposite of roll-up

– Slice (defines a subcube)

– Various visualization ops (e.g., pivot)

• Physical architecture of dimensional model is described by “star” schema

Page 7: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

7Joachim Hammer

Star Schema

• Single fact table and a single table for each dimension

• Dimension tables are denormalized– E.g., dimension attributes may be stored multiple times

ProductCode ProductName ProductColor BrandCode BrandMgr

ProductCode ProductName ProductColor BrandCode

BrandCode BrandMgr

Normalized Representation

DenormalizedExample: 100 separate products5 brandslots of redundancies

Page 8: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

8Joachim Hammer

Advantages of Star Schema

• Reduces the number of physical joins• Simplify the view of the data model• Allows rel. easy maintenance

Page 9: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

9Joachim Hammer

Example

Geogr

aphy

Pro

duct

Time

M T W Th F S S

Juice

Milk

Coke

Cream

Soap

Bread

NYSF

LA

10

34

56

32

12

56

56 units of bread sold in LA on M

Dimensions:Time, Product, Geography

Attributes:Product (upc, price, …)Geography ……

Hierarchies:Product Brand …Day Week QuarterCity Region Country

roll-up to week

roll-up to brand

roll-up to region

Page 10: Joachim Hammer 1 Data Warehousing Multidimensional Analysis Joachim Hammer.

10Joachim Hammer

Example Cont’d

Sales

TimeTime CodeQuarter CodeQuarter NameWeek CodeDay CodeDay name

Account

Account CodeKey Account CodeAccount NameAccount TypeAccount Market

GeographyGeography CodeRegion CodeRegion MgrCity CodeCity Name

Product

Product CodeProduct NameBrand MgrBrand CodeProd. Line CodeProd. Line NameProd. Name...

Geography CodeTime CodeAccount CodeDollar AmountUnits