Enterprise Data Management
Mehmet ÇAVUŞ, Sybase, Technical Consultant
16.03.2012
SAP ANALYTICS & INNOVATION FORUM '12
© 2012 SAP AG. All rights reserved. 2
The Modeling Environment - Domains
PowerDesigner Architecture
PowerDesigner Models - Conceptual Data Model (CDM) / Logical Data Model (LDM)
Fulfills the following roles:
– Represents the overall logical structure of the data, independent of any
software or data storage structure (system and business domain),
– Gives a formal representation of the data structure needed to run an
enterprise or a business process/service,
– Uses ER, Merise and IDEF1X notations,
– Uses only one conceptual data diagram,
– Represents the organization of data in a graphical format,
– Imports and exports data items and entities to and from a Business Process Model (BPM)
PowerDesigner Models - Conceptual Data Model (CDM) / Logical Data Model (LDM)
[Figure: sample conceptual data model with the entities Customer (Address, CustomerID, Name, Phone), Employee (Employee Identifier, Employee Name, Employee Description), Sales (Salary, Commission), Shipper (Salary), Stock_Clerk (Hourly Rate), Items (ItemsID, Description) and Orders (OrderID, Description). Primary identifiers are marked <pi>, alternate identifiers <ai> and mandatory attributes <M>.]
•Standard E/R modeling
•Business rules, domains, data items, entities, relationships, associations, identifiers, inheritance
•Supports one-to-many and many-to-many relationships
•Automatic physical data model generation
PowerDesigner Models - Physical Data Model (PDM)
Represents the physical implementation of the data structure for a selected
(R)DBMS, including data storage structure and management choices or
parameters (particularities of the DBMS),
Gives a formal representation of the physical data structure to be installed
and actively used within the IT system,
Uses standardized relational or multidimensional notation,
The PDM fulfills the following roles:
– Represents the organization of physical data in a graphical format,
– Verifies the validity of the data design for the selected (R)DBMS,
– Holds all (R)DBMS-related choices and particularities,
– Is used for repository generation, synchronization and administration,
– Facilitates O/R mapping (used with a framework) and R/R mapping from the DWH
Multi-Dimension Data Warehouse Design
Star Schema
Sales: Sales ID integer <pk>, Product ID integer <fk1>, Location ID integer <fk2>, ...
Product: Product ID integer <pk>, Name varchar(30)
Location: Location ID integer <pk>, Country varchar(30), City varchar(30)
Time: Time ID integer <pk>, Year integer, Month integer, Day integer

Snowflake Schema
Sales: Sales ID integer <pk>, Product ID integer <fk1>, Location ID integer <fk2>, ...
Product: Product ID integer <pk>, Category ID integer <fk>, Name varchar(30)
Product Category: Category ID integer <pk>, Category Name varchar(30)
Location: Location ID integer <pk>, City ID integer <fk>
City: City ID integer <pk>, Country ID integer <fk>, City Name varchar(30), Zip varchar(30)
Country: Country ID integer <pk>, Country Name varchar(30)
Time: Time ID integer <pk>, Month Number integer <fk>, Year integer, Month integer, Day integer
Month: Month Number integer <pk>, Month Name varchar(10)

Multi-dimension Model
Sales: Sales ID, Sales number
Month_Time: Month Number, Month Name, Time ID, Year, Month, Day (default hierarchy Hierarchy_1)
Country_City_Location: Country ID, Country Name, City ID, City Name, Zip, Location ID (default hierarchy Hierarchy_1)
Product Category_Product: Category ID, Category Name, Product ID, Name (default hierarchy Hierarchy_1)
Associations: Sales - Product Category_Product, Sales - Country_City_Location, Sales - Month_Time
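As a rough illustration of the star schema on this slide, the sketch below builds part of it in an in-memory SQLite database from Python and runs one aggregate query across the fact table and a dimension. The table and column names follow the slide. The `amount` measure, the sample rows and the `sales_by_city` helper are assumptions added for the example, and SQLite merely stands in for whichever DBMS the PDM targets.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Dimension tables and the Sales fact table, named after the slide.
conn.executescript("""
CREATE TABLE product (
    product_id  INTEGER PRIMARY KEY,
    name        VARCHAR(30)
);
CREATE TABLE location (
    location_id INTEGER PRIMARY KEY,
    country     VARCHAR(30),
    city        VARCHAR(30)
);
CREATE TABLE sales (
    sales_id    INTEGER PRIMARY KEY,
    product_id  INTEGER REFERENCES product(product_id),
    location_id INTEGER REFERENCES location(location_id),
    amount      INTEGER  -- hypothetical measure, not shown on the slide
);
""")

# Made-up sample rows.
conn.executemany("INSERT INTO product VALUES (?, ?)",
                 [(1, "Phone"), (2, "Modem")])
conn.executemany("INSERT INTO location VALUES (?, ?, ?)",
                 [(1, "Turkey", "Istanbul"), (2, "Turkey", "Ankara")])
conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)",
                 [(1, 1, 1, 100), (2, 1, 2, 50), (3, 2, 1, 30)])


def sales_by_city(connection):
    """Total the hypothetical amount measure per city (one join in a star schema)."""
    return connection.execute("""
        SELECT l.city, SUM(s.amount)
        FROM sales AS s
        JOIN location AS l ON l.location_id = s.location_id
        GROUP BY l.city
        ORDER BY l.city
    """).fetchall()


totals = sales_by_city(conn)
```

In the snowflake variant the same query would need an extra join from Location to City, which is the usual trade-off between the two layouts.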
PowerDesigner Models - Physical Data Model (PDM)
Oracle 8i, 8i2, 9i, 9i2, 10g, 10gR2 and 11g support
•Bitmap join indexes
•Oracle Database Packages
•Global Temporary Tables
•“Undo” Tablespaces and Storage Partitions
•Abstract Data Types
•Oracle Materialized Views
•Oracle Dimension support
...
IBM DB/2 5.x, 6, 7, 8 and 9 for OS/390 and Common Server support
•Detailed Tablespace definitions
•Column LOB options
•Tables with Computed Columns
•Generated and Encrypted Value declarations
•DB/2 index Bind Packages
•DB/2 Materialized Query Tables
...
Microsoft SQL Server 7.x, 2000, 2005 and 2008 support
•Indexed Views with encryption options
•Multiple file definitions for filegroups
•Row Global Unique Identifier
•Constraint and Collation names
•Identity properties
•Replication settings
•Microsoft Analysis Services including cubes
...
Adabas, AS400, Informix, MySQL, Netezza, PostgreSQL, Teradata ...
PowerDesigner Models - Data Movement Model (DMM)
Gives a high-level physical view of information (data) movement and flow processes,
whether through replication or any other data flow,
– Replication is a process in which transactions performed on one (source)
database are propagated asynchronously to one or more target databases in
a serialized manner,
Gives a physical view of information sources, destinations, applied
transformations, replication processes and source/destination data
structure mappings,
Uses simple graphical notation,
The DMM fulfills the following roles:
– Presents data flows, transformations, data sources and destinations,
– Describes the replication process for Replication Server or MobiLink,
– Documents DWH ETL/delivery flows and the mappings used,
– Allows direct access to every data model used within the flow
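The replication definition above (source transactions propagated asynchronously, in serialized order, to one or more targets) can be sketched in a few lines of Python. This is a toy model only: plain dictionaries stand in for databases, and the `Replicator` class and its methods are hypothetical names for this example, not part of Replication Server, MobiLink or PowerDesigner.

```python
import queue
import threading


class Replicator:
    """Toy model: commits return immediately, targets are updated later in commit order."""

    def __init__(self, targets):
        self.targets = targets
        self.log = queue.Queue()  # FIFO queue keeps transactions serialized
        worker = threading.Thread(target=self._drain, daemon=True)
        worker.start()

    def commit(self, source, key, value):
        """Apply a change to the source and enqueue it for the targets."""
        source[key] = value
        self.log.put((key, value))  # does not wait for the targets

    def _drain(self):
        """Background loop: replay logged changes on every target, oldest first."""
        while True:
            key, value = self.log.get()
            for target in self.targets:
                target[key] = value
            self.log.task_done()

    def wait(self):
        """Block until every logged change has reached all targets."""
        self.log.join()


source, replica_a, replica_b = {}, {}, {}
replicator = Replicator([replica_a, replica_b])
replicator.commit(source, "balance", 100)
replicator.commit(source, "balance", 80)
replicator.wait()
```

A single worker draining a FIFO queue is what keeps the order serialized here; a DMM documents the same idea at the design level by recording sources, targets and the mappings between them.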
PowerDesigner Models - Data Movement Model (DMM)
Impact Analysis in PowerDesigner
PowerDesigner Reporting – Report Generator
Powerful report generator
Multi-model
Customizable
Template-based
Easy-to-use, drag-and-drop interface
Generates RTF or HTML
Repository Architecture
We need a good metadata repository
•Team development
– Security to protect elements, core, library, etc.
– Single source for all elements (find and reuse)
•Model management
– Versioning
– Branching
– Compare/Merge
– Reporting
•Enterprise Glossary & Library
•Impact Analysis in Repository
Repository Portal
Data Warehousing Architecture
Migrating an OLTP Database to a Data Warehouse
To establish a data warehouse from an OLTP database:
I. Reverse engineer the OLTP database with statistics
II. Generate/create a DW Physical Data Model (PDM)
– Modify the DW model, adding indexes based on statistics, denormalization, etc.
– Define the transformation between the source and DW models using either mapping or a DMM
III. Generate the data warehouse
IV. Move data from the OLTP database to the DW by
– Generating export/import scripts to run on the OLTP database and the DW
– Creating ETL templates
1- Reverse Engineering the OLTP Database with Statistics
PowerDesigner allows users to reverse engineer statistics data (number of
rows, number of distinct values for a column, …)
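To make the kind of statistics mentioned above concrete, here is a minimal sketch that gathers the row count and the number of distinct values per column for one table. It uses Python's `sqlite3` module against an in-memory database. The `collect_statistics` helper, the `orders` table and its rows are assumptions for this example, and this is not how PowerDesigner itself retrieves statistics during reverse engineering.

```python
import sqlite3


def quote_identifier(name):
    """Quote a table or column name, since identifiers cannot be bound as parameters."""
    return '"' + name.replace('"', '""') + '"'


def collect_statistics(connection, table):
    """Return (row_count, {column_name: distinct_value_count}) for one table."""
    quoted_table = quote_identifier(table)
    row_count = connection.execute(
        f"SELECT COUNT(*) FROM {quoted_table}").fetchone()[0]
    # PRAGMA table_info returns one row per column; index 1 is the column name.
    columns = [row[1] for row in
               connection.execute(f"PRAGMA table_info({quoted_table})")]
    distinct = {}
    for column in columns:
        quoted_column = quote_identifier(column)
        distinct[column] = connection.execute(
            f"SELECT COUNT(DISTINCT {quoted_column}) FROM {quoted_table}"
        ).fetchone()[0]
    return row_count, distinct


# Made-up OLTP table used only to exercise the helper.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER, customer_id INTEGER, status TEXT)")
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)",
                 [(1, 10, "open"), (2, 10, "closed"), (3, 11, "open")])

row_count, distinct_counts = collect_statistics(conn, "orders")
```

Figures like these are what make the later step of choosing indexes "based on statistics" possible: a column with few distinct values relative to the row count is a different indexing candidate than a near-unique one.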
2- Generating Physical Data Model (PDM) for DW
PowerDesigner allows users to generate new models from existing ones using its
link-and-sync methodology.
Define the transformation by DMM or mapping
3- Generating the Data Warehouse Database
Generate the data warehouse database using the Database > Generate Database
command.
4- Example of an Export & Load Script
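The script shown on this slide is not reproduced in the text. As a stand-in, the following is a minimal sketch of the export-then-load pattern, written in Python with `sqlite3` and `csv`. All table names, columns, rows and the `export_rows` / `load_rows` helpers are assumptions for illustration; the scripts PowerDesigner generates are DBMS-specific and will look different.

```python
import csv
import io
import sqlite3

# Hypothetical normalized OLTP source.
oltp = sqlite3.connect(":memory:")
oltp.executescript("""
CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE orders (order_id INTEGER PRIMARY KEY,
                     customer_id INTEGER, amount INTEGER);
INSERT INTO customer VALUES (10, 'Ada'), (11, 'Bora');
INSERT INTO orders VALUES (1, 10, 100), (2, 10, 50), (3, 11, 30);
""")

# Hypothetical denormalized DW target.
dw = sqlite3.connect(":memory:")
dw.execute("CREATE TABLE sales_fact "
           "(order_id INTEGER, customer_name TEXT, amount INTEGER)")


def export_rows(source, out):
    """Export step: run on the OLTP side, writing joined rows as CSV."""
    writer = csv.writer(out)
    query = """
        SELECT o.order_id, c.name, o.amount
        FROM orders AS o
        JOIN customer AS c ON c.customer_id = o.customer_id
        ORDER BY o.order_id
    """
    for row in source.execute(query):
        writer.writerow(row)


def load_rows(target, src):
    """Load step: run on the DW side, converting CSV text back to typed values."""
    rows = [(int(order_id), name, int(amount))
            for order_id, name, amount in csv.reader(src)]
    target.executemany("INSERT INTO sales_fact VALUES (?, ?, ?)", rows)
    target.commit()


buffer = io.StringIO()  # stands in for the export file
export_rows(oltp, buffer)
buffer.seek(0)
load_rows(dw, buffer)

loaded = dw.execute("SELECT * FROM sales_fact ORDER BY order_id").fetchall()
```

The join in the export query is where the source-to-DW transformation lives in this sketch; in the workflow described here, that mapping would come from the model rather than being written by hand.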
PowerDesigner Modeling Information Architecture
Our Customer’s Perspective…
Turkcell Success Story
Turkcell, the leading mobile phone operator in Turkey,
previously kept its conceptual, logical and physical data models,
ETL process models and
source-to-target mapping information in
Microsoft® Word, Visio and Excel. Using Sybase PowerDesigner,
Turkcell is able to manage all modeling requirements
within its BIS-Reengineering project.
Business Advantage
Turkcell is able to perform impact analysis on changes made
in any department, and to provide information on changes
that affect the underlying metadata.
Key Benefits
•Reduces business process modeling time by 40%
•Saves 30% in labor efforts
•Provides compatibility with other software
•Increases specifications for projects in a single modeling tool
•Synchronizes with Microsoft® Word for business process modeling (BPM)
“It was an amazing success of
Sybase PowerDesigner.”
Yuksel Guler,
Turkcell Communication Service A.Ş.,
Service and Product Development
Conclusion
Using PowerDesigner and a model-driven approach can greatly accelerate
development, increase productivity and reduce errors in data warehouse
design and implementation.