INFA latest training

Informatica PowerMart Training

Transcript of INFA latest training

7/28/2019 INFA latest training

http://slidepdf.com/reader/full/infa-latest-training 1/133

Informatica PowerMart Training


Course Structure

What is ETL? What is Informatica?

Informatica Products

Informatica PowerMart training

• Session I- Overview of PowerMart

• Session II- Working with PowerMart Repository Manager

• Session III- Working with PowerMart Designer

• Session IV- Working with PowerMart Server Manager

- Performance tuning

- Case Study - Test your Informatica skills

- Quiz


What is ETL?

• ETL (Extraction, Transformation and Loading) is a process by which data is integrated and transformed from the operational systems into the data warehouse environment


Extraction

Oracle

Sybase

Text files

Target

80 tables

50 tables


Transformation

Source

Empid   Last Name   First Name
10001   Jones       Indiana
10002   Holmes      Sherlock

Name = Concat(First Name, Last Name)

Staging Area

Name
Indiana Jones
Sherlock Holmes
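The Concat step on this slide can be pictured with a short Python sketch. It only illustrates the idea and is not PowerMart expression syntax; the function name `concat_name` and the dictionary layout are assumptions, and a space separator is assumed because the slide's output shows one.

```python
# Source rows taken from the slide's example table.
source_rows = [
    {"empid": 10001, "last_name": "Jones", "first_name": "Indiana"},
    {"empid": 10002, "last_name": "Holmes", "first_name": "Sherlock"},
]


def concat_name(row):
    """Build the staged row: keep the id, add name = first + ' ' + last."""
    return {
        "empid": row["empid"],
        "name": f'{row["first_name"]} {row["last_name"]}',
    }


# Apply the transformation to every source row on its way to staging.
staging_rows = [concat_name(r) for r in source_rows]
```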


Loading

Staging Area

Source

Data Warehouse

Direct Load

Cleaning, Transformation & Integration of Raw Data


What is Informatica?

A market-leading provider of e-business infrastructure and analytic software which enables customers to automate the integration, analysis and real-time delivery of critical corporate information via web, wireless and voice

Informatica applications include

• eCRM application

• eBusiness Operations application

• eProcurement

More than 1,370 customers, including 60 percent of the Fortune 100 companies, are using Informatica’s analytic solutions

More than 900 companies are using Informatica products


What is Informatica

• Founded in 1993

• HQ : Redwood City, CA


Informatica Products

Informatica provides the following suite of products for data integration

• PowerCenter- enterprise data integration hub

• PowerMart- application deployment platform

• PowerCenter.e- PowerCenter extension for e-business data

• PowerConnect- high performance data extraction

• PowerPlug- data model import utilities

• PowerBridge- Metadata bridge to Hyperion Essbase

• Analytic Business Components- developer productivity tools

• Mobile Access- delivery of corporate data and analytics via wireless devices and voice recognition


PowerCenter 5 and PowerMart 5

• eBusiness Integration 

• XML Sourcing/Targeting: XML data can be source analyzed, read in, and written out. Source analysis can be performed via direct parsing of the XML data file, via DTDs, or via XML schemas.

• Object-Level Import/Export: Objects from XML files can be imported/exported using the client tools. Appropriate validation is performed on imported objects.

• Web metadata reports: More than 20 new, HTML-formatted metadata reports have been added to provide documentation to mappings in areas such as dependencies and attribute listings. These reports are extensible through modification of XSL templates.


PowerCenter 5 and PowerMart 5

• Extensibility 

• IBM MQ Series Compatibility: IBM MQ Series data can be source analyzed, read in, and written out via the MQ Series PowerConnect. Requires Informatica PowerCenter.

• Siebel Systems Compatibility: Siebel data can be source analyzed, read in, and written out via the Siebel PowerConnect. Requires Informatica PowerCenter.

• SAP R/3 Compatibility: SAP R/3 data can be source analyzed, read in, and written out via the SAP R/3 PowerConnect. Requires Informatica PowerCenter.

• PeopleSoft Compatibility: PeopleSoft data can be source analyzed, read in, and written out via the PeopleSoft PowerConnect. Requires Informatica PowerCenter.

• Router (splitter): When designing mappings, a new transformation object can be used to provide a CASE statement for routing rows to different pipelines
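The CASE-style routing described for the Router can be sketched in Python as follows. This is a minimal illustration, not the product's behaviour: the function `route_rows`, the group names and the sample orders are hypothetical, and first-match semantics are assumed here only because the slide compares the Router to a CASE statement.

```python
def route_rows(rows, groups):
    """Send each row to the first group whose condition it meets.

    `groups` is an ordered list of (name, predicate) pairs, like the WHEN
    branches of a CASE statement; rows matching nothing go to "DEFAULT".
    """
    pipelines = {name: [] for name, _ in groups}
    pipelines["DEFAULT"] = []
    for row in rows:
        for name, predicate in groups:
            if predicate(row):
                pipelines[name].append(row)
                break
        else:
            # No condition matched: fall through to the default pipeline.
            pipelines["DEFAULT"].append(row)
    return pipelines


# Hypothetical example: split orders by region.
orders = [
    {"id": 1, "region": "EAST"},
    {"id": 2, "region": "WEST"},
    {"id": 3, "region": "NORTH"},
]
routed = route_rows(orders, [
    ("EAST", lambda r: r["region"] == "EAST"),
    ("WEST", lambda r: r["region"] == "WEST"),
])
```

Each key of `routed` stands for one downstream pipeline; a single pass over the rows replaces several separate filters.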


PowerCenter 5 and PowerMart 5

• Performance 

• Parallel Transformation Pipelining (Informatica PowerCenter only): The DTM is multithreaded. Users can parallelize their transformation pipeline by specifying the number of data partitions. Special considerations were made for aggregator and join transformations.

• Dynamic Lookup Cache: Users can insert rows into the lookup cache during an active session.

• Shared Lookup Cache: The persistent lookup cache file can be shared across mappings.

• Copy Session: Users can copy a session from one folder to another, from one repository to another.

• Mapping Variables: Parameters can be specified within various portions of a mapping.

• Debugger: With the Debugger, users can perform debugging operations such as running mappings from the Designer tool, setting breakpoints, examining port values, stepping through execution, and modifying port values.

• Full Screen View: Users have the option of editing mappings in full screen mode.


PowerCenter 5 and PowerMart 5

• Repository Manager Privileges: Administrators can see a display of all privileges granted to a specific user in the Repository Manager.

• Server Manager Read Only Sessions: A user with Session Operator privileges will be able to view session properties via the Session property dialog. However, this dialog will only be made available in read-only mode when a user has Session Operator privileges.

• Workspace Search: A user can search for a port name within a selected object, a port name across all objects in the workspace, or a name of an object on the workspace.

• Show Invalid Mappings: Invalid mappings will be distinguished with a different icon in the Navigator outline.

• Copy As (Mappings and Mapplets): A user can save a mapping or mapplet under a different name.

• Compare Folder: Repository Manager provides a "compare" option in the Folder menu. The user can compare the contents of two folders and get a listing of the discrepancies.


PowerCenter 5 and PowerMart 5

• Resizable SQL Editor and Expression Editor: A user can resize the dialog box for the SQL Editor (for SQL Override) and also the Expression Editor.

• Global Session Invalidation: A user can choose a folder or an individual session and select "Validate Session" from the menus. In the case of validation at the folder level, the user can choose any number of the invalid sessions to validate all at once.

• Shortcut Keys: A user can use keyboard commands to perform certain common functions within the Designer client tool.

• Folder open on double-click: A user can double-click on a folder icon in the navigator outline tree and expand the outline tree to show the contents of the folder.

• Enhanced overall look: Users will notice an improved user interface appearance in the form of gradation shading on the title bars, and 3D beveled borders on transformation objects and links.


PowerMart Training - Course Contents

Session I

• Introduction to PowerMart and its components

• PowerMart Architecture

• System requirements

• Overview of PowerMart Repository Manager

• Creating Repository

• Creating folders

• Creating Users, Groups and assigning privileges

Session II

• Viewing/removing locks

• Generating metadata reports

• Import/export registry

• Overview of PowerMart Designer

• Create/import source in Source analyzer

• Create/import target in Warehouse designer

• Understanding Transformation Objects

• Suggested naming conventions


PowerMart Training - Course Contents

Session III

• Create transformation objects, mappings in Mapping designer

• Mapping wizards

• Create mapplets in Mapplet designer

• Overview of Server Manager

• What is a session/batch?

• Configure Server Manager

• Adding database connections

Session IV

• Create sessions

• Create batches

• Run a session/batch

• Performance tuning techniques

• Case study

• Quiz


Session I

Objective

• Familiarize with Informatica PowerMart and its components

• Hands on with Repository Manager

What will you learn?

• Creating Repository

• Creating Folders

• Creating users

• Creating user groups

• Assigning privileges


Overview of Informatica PowerMart

A complete suite of components for building a datamart/data warehouse

Provides ability to source large volumes of fast-changing data from multiple platforms

Handles complex transformations

Supports high speed loads

Metadata repository co-ordinates and drives the extraction, transformation, loading and management


Components of PowerMart

PowerMart Designer

• Multi-faceted tool for visually defining mappings and transformations

PowerMart Repository

• An open metadata store for definitions about mappings, transformations and other data mart details

PowerMart Repository Manager

• A facility for managing user activities and metadata storage in the repository

PowerMart Server

• A pipelined, multi-threaded server engine that is able to overlap data extraction, transformation and loading

PowerMart Server Manager

• An administrative interface for configuring and scheduling extract/transform/load sessions


Informatica PowerMart Architecture


System Requirements


PowerMart Repository Manager


Overview of Repository Manager 

Used for administering the repository

Allows you to perform the following tasks

• Perform Repository functions

- Create, back up, copy, restore, upgrade, and delete repositories

• Perform Folder functions

- Create, edit and delete folders

- Copy a folder within a repository or to another repository

• Implement Repository security

- Create, edit and delete repository users and user groups

- Assign and revoke repository privileges and folder permissions

- View locks and unlock objects, versions and folders


Overview of Repository Manager 

• Add and remove repository reports

• Import and export repository connection information in the registry

• Analyze source/target, mapping dependencies

• View the properties of repository objects


PowerMart Repository Manager Workspace

[Figure: Repository Manager workspace, with callouts: Menu Bar, Std. Tool Bar, Navigator, Main, Dependency, Status Bar]


Creating the Repository

Create an ODBC data source to connect to the database where you want the repository tables to be created

Open the Repository Manager module

Click on ‘Create Repository’

Enter the repository name, database username and password, select the ODBC data source created previously, enter the native connect string, and click OK

The list of tables created for the repository will be visible in the Output window


Creating the Folder 

Click on Folders>Create

Enter folder name, description, owner, group and permissions


Creating Groups

Connect to a repository

To create Groups choose Security>Manage Groups

Click Add

Enter the name of the new group and a description

Click OK


Creating Users

To create Users choose Security>Manage Users

Click Add

Enter a username

Enter the password twice to confirm it


Creating Users

Click Group Memberships

To add the user to a group, select the group in the Not Member list, and click Add

To remove the user from a group, select the group in the Member list, and click Remove


 Assign Privileges

Choose Security> Manage Privileges

Select a privilege from the Privilege list

• Use Designer

• Browse Repository

• Create Sessions and batches

• Session Operator

• Administer Repository

• Administer Server

• Super User


 Assign Privileges

Click Add

• Repository groups without the selected privilege appear

Click Show Users to view the repository users

Select the user or the group

Click Add

Click OK to save changes


Session II

Objective

• Familiarize with Repository Manager

• Familiarize with Designer

What will you learn?

• Viewing/Removing locks

• Generating metadata reports

• Import/export registry

• Creating/Importing source using Source Analyzer

• Creating/Importing target using Warehouse Designer

• Types of transformation objects

• Types of ports

• Naming conventions for different objects


View/Remove Locks

Locks prevent users from duplicating or overwriting work

Choose Edit > Show Locks to view all the locked objects

The dialog box shows

• Username locking the object

• Folder containing the object

• Version containing the object

• Object type (folder, session, reusable transformation etc.)

• Object name

• Lock type (Read, Write, Execute, Save, Fetch)

• Lock time, hostname, application (Server Manager, Designer, Server etc.)


Generate Reports

Types of Default Reports

• Mapping report

• Source/target dependencies report

• Target table report

• Executed session report

To add a report choose Report>Add Report


Generate Reports

Click on Add to add a report to the installed report list

Select from the default reports available or custom reports created using Crystal Reports and click OK

To view or print a report, select it from the Reports menu

Select Print Preview to view the report


Generate Reports

Provide username, password and ODBC data source name

Click Print Preview and select the wildcard character if reports for all the folders, tables and versions have to be viewed


Import/Export Registry

Purpose- To simplify the process of setting up client systems

The registry contains the following connection information

• Repository name

• Database user name and password

• Repository user name and password

• ODBC data source name (DSN)

Don’t forget to create the DSN before importing the registry, as the registry does not include the ODBC data source itself


Import/Export Registry

To export the registry, choose Tools>Export Registry

In the dialog box enter the name for the file and Save

To import the registry, choose Tools>Import Registry

Select the file and Open it

A dialog box confirms the merging of data source information


PowerMart Designer 


Overview of PowerMart Designer 

Designer provides five tools with which to create mappings

• Source Analyzer

- to import or create source definitions for flat file, ERP, and relational sources

• Warehouse Designer

- to import or create target definitions

• Transformation Developer

- to create reusable transformations

• Mapplet Designer

- to create mapplets

• Mapping Designer

- to create mappings

Designer allows you to work in multiple folders and repositories at a time


Designer Work Space

[Figure: Designer workspace, with callouts: Menu Bar, Navigator, Std. Tool Bar, Workspace, Status Bar, Output, and the five tools: Source Analyzer, Warehouse Designer, Transformation Developer, Mapplet Designer, Mapping Designer]


Source Analyzer 

Reads, analyzes and "reverse engineers" schema information of operational databases and flat files

Stores metadata information in the repository

How to import a source?

• Choose Sources > Import from database/flat file/COBOL file

• To import a table, enter the ODBC data source, username and password


Source Analyzer 

• Click on Connect button and the list of tables appears

• Select the tables which you will be using as source tables

• Click OK to add the selected tables into the Source Analyzer workspace

How to create a new Source?

• Choose Sources > Create

• Give a name to the new source, select the database type

• Click the Create and then the Done button to make the blank source appear in the workspace


Source Analyzer 

• To enter column names, data types and field lengths, double-click on the newly created source structure

• Click on the Columns tab

• Click on the ‘Add a new column’ button to add new columns in the source and specify the details

• Click OK to accept changes


Warehouse Designer 

Provides the following features

• Create a new target

• Import the target structure

• Create cubes and dimensions for multi-dimensional metadata analysis

How to create a new target?

• Choose Targets > Create

• Enter the name for the target table and the database type

• Double-click on the empty target structure and enter the column names, data types and field lengths, the same way as when creating sources

• Select the newly created table and choose Target > Generate/Execute SQL from the menu bar


Warehouse Designer 

• Connect to the warehouse by giving the ODBC data source, user id and password

• Select the appropriate Generation options and click on the Generate SQL file button

• To view/edit the SQL, click on Edit SQL file

• To generate the tables in the database, click on Execute SQL file

• Look at the Output window to check for errors in generation of the target table in the database


Warehouse Designer 

How to import target table definition?

• Choose Target > Import from database

• Connect to the database by selecting the ODBC data source and entering the user name and password

• Select tables from the list of tables available in the database which will be used as targets

• Click OK to get the tables in the workspace


Mapping Designer 

Visual aid to creating and editing source-to-target mappings.

Dataflow diagramming.

• Method of creating dataflow links through combinations of PowerMart 4.6's transformation objects.

• Sources, targets and transformation objects can be dragged and dropped into a workspace to construct the transformation pipeline.

Transformation Objects

To create a transformation

• Click on Transformation > Create


Transformation Objects

• Select the transformation object and give a name to it

• Click on Create and then Done

Aggregator

• Performs an aggregate calculation (Count, Average etc.)
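As a rough picture of what an aggregate calculation does, the Python sketch below groups rows and computes a count and an average per group. The function `aggregate`, the column names and the sample rows are all hypothetical; the Aggregator itself is configured in the Designer, not coded this way.

```python
from collections import defaultdict


def aggregate(rows, group_by, value):
    """Return {group: (count, average)} for `value`, grouped by `group_by`."""
    buckets = defaultdict(list)
    for row in rows:
        buckets[row[group_by]].append(row[value])
    return {key: (len(vals), sum(vals) / len(vals)) for key, vals in buckets.items()}


# Hypothetical sales rows; the column names are made up for the example.
sales = [
    {"store": "A", "amount": 10.0},
    {"store": "A", "amount": 30.0},
    {"store": "B", "amount": 5.0},
]
summary = aggregate(sales, "store", "amount")
```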


Transformation Objects

Expression

• Performs custom calculations of a simple or complex nature, using data from one or more input ports


Transformation Objects

Filter

• Performs a test on all records before allowing them to be sent to the next object


Transformation Objects


Lookup

• Looks up values

Transformation Objects


Sequence Generator

• Generates unique ID values in the same fashion as a sequence in a relational database
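The idea of a sequence can be sketched in a few lines of Python. This is only an analogy under stated assumptions: `sequence_generator`, the start value 1000 and the sample rows are hypothetical, and the real transformation has its own configuration properties.

```python
import itertools


def sequence_generator(start=1, increment=1):
    """Yield start, start + increment, ... like a database sequence."""
    return itertools.count(start, increment)


# Hypothetical use: attach a surrogate key to each incoming row.
next_key = sequence_generator(start=1000)
rows = [{"empid": 10}, {"empid": 11}]
keyed = [{"pk": next(next_key), **row} for row in rows]
```

Because the generator keeps its position between calls, later rows continue from the last value handed out instead of restarting.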

Transformation Objects


Source Qualifier

• Represents data temporarily stored on the PowerMart server

Transformation Objects


Stored Procedure

• Calls a stored procedure and captures return values

Transformation Objects


Update Strategy

• Defines how the PowerMart server should handle updates to existing records in targets

Transformation Objects


Rank

• Performs comparisons and groupings

Suggested Naming Conventions


Transformations

• Aggregator AGG_TransformationName

• Expression EXP_TransformationName

• Filter FIL_TransformationName

• Joiner JNR_TransformationName

• Lookup LKP_TransformationName

• Normalizer NRM_TransformationName

• Rank RNK_TransformationName

• Sequence Generator SEQ_TransformationName

• Stored Procedure SP_TransformationName

• Source Qualifier SQ_TransformationName

• Update Strategy UPD_TransformationName
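If these prefixes are to be applied consistently, a small helper can build and check names. The sketch below is illustrative only: the prefixes are copied from the list above, while the helper functions `transformation_name` and `follows_convention` are hypothetical and not part of any Informatica tool.

```python
# Prefixes copied from the naming list above; the helpers are illustrative.
PREFIXES = {
    "Aggregator": "AGG",
    "Expression": "EXP",
    "Filter": "FIL",
    "Joiner": "JNR",
    "Lookup": "LKP",
    "Normalizer": "NRM",
    "Rank": "RNK",
    "Sequence Generator": "SEQ",
    "Stored Procedure": "SP",
    "Source Qualifier": "SQ",
    "Update Strategy": "UPD",
}


def transformation_name(kind, name):
    """Build a conventional name such as LKP_Customer."""
    return f"{PREFIXES[kind]}_{name}"


def follows_convention(kind, full_name):
    """True if full_name starts with the prefix for `kind` plus a non-empty name."""
    prefix = PREFIXES[kind] + "_"
    return full_name.startswith(prefix) and len(full_name) > len(prefix)
```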

Suggested Naming Conventions


Targets

• T_TARGETNAME

Mappings

• m_MappingName

Mapplets

• mplt_MappletName

Sessions

• s_MappingName

Sequential Batches

• bs_SequentialBatchName

Concurrent Batches

• bc_ConcurrentBatchName


Session III


Objective

• Hands on with Designer

• Familiarize with Server Manager

What will you learn?

• Creating Mapping

• Types of Mapping wizards

• Mapplet designer

• Creating Mapplets

• Configuring Server Manager

• Adding database connection

Mapping Creation


How to create a Mapping?

• Open the Mapping designer workspace

• Choose Mappings > Create

• Give a proper name to the mapping

• Open the Sources from the navigator which you would have created/imported using Source Analyzer

• Drag and drop the source table from the navigator into the workspace

Mapping Creation


• Choose Transformation > Create to create a transformation object

• Select the type of transformation object you want to create and give a proper name to it

Mapping Creation


• Drag and drop the required fields (ports) from the Source Qualifier to the transformation object

Mapping Creation


• Add new ports in the transformation object and define them as variable ports to do complex transformations

Mapping Creation


• Open the Targets from the navigator which you would have created/imported using Warehouse Designer

• Select the target table, drag and drop it into the Designer workspace

Mapping Creation


• Drag and drop the output ports from the last transformation object to the corresponding ports in the target

• Choose Repository > Save to store the mapping

• Check the output window for any errors

Mapping Wizards


Wizards help to create mappings quickly and easily

Wizards are designed to create mappings for loading and maintaining star schemas

Mappings created using wizards can be edited and customized

Getting Started Wizard

• Simple Pass Through

- mappings to load static fact and dimension tables

- Use this when you want to drop all existing data from the table before loading new data

• Slowly Growing Target

- mappings to load slowly growing dimension or fact tables

- Use this to load new data when existing data does not require updates
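The slowly growing target pattern (load only new rows, never update existing ones) can be sketched in Python as below. This shows the idea only, not the mapping the wizard generates; `load_slowly_growing`, the key column and the sample rows are hypothetical.

```python
def load_slowly_growing(target, source, key):
    """Append source rows whose key is not yet in target; never update."""
    existing = {row[key] for row in target}
    for row in source:
        if row[key] not in existing:
            target.append(row)
            existing.add(row[key])
    return target


# Hypothetical data: empid 10 already exists, empid 11 is new.
target = [{"empid": 10, "name": "Shane"}]
source = [{"empid": 10, "name": "Shane X"}, {"empid": 11, "name": "Maria"}]
load_slowly_growing(target, source, "empid")
```

The existing row for empid 10 keeps its old name, which is exactly why this pattern suits data that does not require updates.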


Mapping Wizards


Slowly Changing Dimensions Wizard

• Type II Dimension Mapping

Version Data Mapping

- insert new and changed dimensions with version number and incremented primary key

- full history and progress of changes

Source

Empid   Name    Email
10      Shane   [email protected]

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_VERSION_NUMBER
1000            10      Shane   [email protected]   0

Type II Dimension/Versioning


Source

Empid   Name    Email
10      Shane   Shane@abc.co.in

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_VERSION_NUMBER
1000            10      Shane   [email protected]   0
1001            10      Shane   Shane@abc.co.in     1

Type II Dimension/Versioning


Source

Empid   Name    Email
10      Shane   Shane@abc.com

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_VERSION_NUMBER
1000            10      Shane   [email protected]   0
1001            10      Shane   Shane@abc.co.in     1
1003            10      Shane   Shane@abc.com       2
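The versioning behaviour shown in these tables can be sketched in Python. This is an illustration under stated assumptions, not the mapping the wizard generates: `apply_version_scd` is a hypothetical helper, the caller supplies the next primary key, and the first email is a placeholder (`shane@example.com`) because the original address is not legible in the slides.

```python
def apply_version_scd(target, source_row, key, tracked, next_pk):
    """Insert a new version row when any tracked column has changed."""
    versions = [r for r in target if r[key] == source_row[key]]
    if versions:
        latest = max(versions, key=lambda r: r["PM_VERSION_NUMBER"])
        if all(latest[c] == source_row[c] for c in tracked):
            return target  # nothing changed, so nothing is inserted
        version = latest["PM_VERSION_NUMBER"] + 1
    else:
        version = 0  # first time this key is seen
    target.append({"PM_PRIMARYKEY": next_pk, **source_row, "PM_VERSION_NUMBER": version})
    return target


dim = []
tracked = ["Name", "Email"]
apply_version_scd(dim, {"Empid": 10, "Name": "Shane", "Email": "shane@example.com"}, "Empid", tracked, 1000)
apply_version_scd(dim, {"Empid": 10, "Name": "Shane", "Email": "Shane@abc.co.in"}, "Empid", tracked, 1001)
# Re-running the same source row adds no new version.
apply_version_scd(dim, {"Empid": 10, "Name": "Shane", "Email": "Shane@abc.co.in"}, "Empid", tracked, 1002)
apply_version_scd(dim, {"Empid": 10, "Name": "Shane", "Email": "Shane@abc.com"}, "Empid", tracked, 1002)
```

Old rows are never modified, so the full history stays queryable by version number.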

Mapping Wizards


Slowly Changing Dimensions Wizard

• Type II Dimension Mapping

Flag Current Mapping

- insert new and changed dimensions with flags and incremented primary key

- full history and flagging only current dimensions

Source

Empid   Name    Email
10      Shane   [email protected]

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_CURRENT_FLAG
1000            10      Shane   [email protected]   1

Type II Dimension/Flag Current


Source

Empid   Name    Email
10      Shane   Shane@abc.co.in

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_CURRENT_FLAG
1000            10      Shane   [email protected]   N
1001            10      Shane   Shane@abc.co.in     Y

Type II Dimension/Flag Current


Source

Empid   Name    Email
10      Shane   Shane@abc.com

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_CURRENT_FLAG
1000            10      Shane   [email protected]   N
1001            10      Shane   Shane@abc.co.in     N
1003            10      Shane   Shane@abc.com       Y
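A minimal Python sketch of the flag-current behaviour follows. It is an illustration only: `apply_flag_current` is a hypothetical helper, the Y/N flag values follow the later tables, the caller supplies the next primary key, and `shane@example.com` is a placeholder for the address that is not legible in the slides.

```python
def apply_flag_current(target, source_row, key, tracked, next_pk):
    """Insert a changed row as current and flag the previous current row 'N'."""
    current = next(
        (r for r in target if r[key] == source_row[key] and r["PM_CURRENT_FLAG"] == "Y"),
        None,
    )
    if current is not None:
        if all(current[c] == source_row[c] for c in tracked):
            return target  # unchanged: keep the existing current row
        current["PM_CURRENT_FLAG"] = "N"  # retire the old current row
    target.append({"PM_PRIMARYKEY": next_pk, **source_row, "PM_CURRENT_FLAG": "Y"})
    return target


flagged = []
cols = ["Name", "Email"]
apply_flag_current(flagged, {"Empid": 10, "Name": "Shane", "Email": "shane@example.com"}, "Empid", cols, 1000)
apply_flag_current(flagged, {"Empid": 10, "Name": "Shane", "Email": "Shane@abc.co.in"}, "Empid", cols, 1001)
apply_flag_current(flagged, {"Empid": 10, "Name": "Shane", "Email": "Shane@abc.com"}, "Empid", cols, 1002)
```

At any time exactly one row per Empid carries the current flag, so the latest state can be read with a simple filter.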

Mapping Wizards


Slowly Changing Dimensions Wizard

• Type II Dimension Mapping

• Effective Date Range Mapping

- insert new and changed dimensions with date range to define current dimension data

- full history and tracking changes with an exact effective date range

Source

Empid   Name    Email
10      Shane   [email protected]

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_BEGIN_DATE   PM_END_DATE
1000            10      Shane   [email protected]   01/01/00

Type II Dimension/Effective Date


Source

Empid   Name    Email
10      Shane   Shane@abc.co.in

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_BEGIN_DATE   PM_END_DATE
1000            10      Shane   [email protected]   01/01/00        03/01/00
1001            10      Shane   Shane@abc.co.in     03/01/00

Type II Dimension/Effective Date


Source

Empid   Name    Email
10      Shane   Shane@abc.com

Target

PM_PRIMARYKEY   Empid   Name    Email               PM_BEGIN_DATE   PM_END_DATE
1000            10      Shane   [email protected]   01/01/00        03/01/00
1001            10      Shane   [email protected]   03/01/00        05/02/00
1003            10      Shane   Shane@abc.com       05/02/00
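The effective-date behaviour in these tables can be sketched as follows. It is a hedged illustration: `apply_effective_date` is a hypothetical helper, dates are kept as the literal strings from the slides rather than parsed, an open range is represented by `None`, and `shane@example.com` stands in for the address that is not legible in the slides.

```python
def apply_effective_date(target, source_row, key, tracked, next_pk, load_date):
    """Close the open row's date range and insert the changed row as a new open range."""
    current = next(
        (r for r in target if r[key] == source_row[key] and r["PM_END_DATE"] is None),
        None,
    )
    if current is not None:
        if all(current[c] == source_row[c] for c in tracked):
            return target  # unchanged: the open range stays open
        current["PM_END_DATE"] = load_date  # close the previous range
    target.append(
        {"PM_PRIMARYKEY": next_pk, **source_row, "PM_BEGIN_DATE": load_date, "PM_END_DATE": None}
    )
    return target


history = []
watch = ["Name", "Email"]
apply_effective_date(history, {"Empid": 10, "Name": "Shane", "Email": "shane@example.com"}, "Empid", watch, 1000, "01/01/00")
apply_effective_date(history, {"Empid": 10, "Name": "Shane", "Email": "Shane@abc.co.in"}, "Empid", watch, 1001, "03/01/00")
apply_effective_date(history, {"Empid": 10, "Name": "Shane", "Email": "Shane@abc.com"}, "Empid", watch, 1002, "05/02/00")
```

The row with an empty end date is the current one, and each closed range ends on the date the next one begins.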

Mapping Wizards


Slowly Changing Dimensions Wizard

• Type III Dimension Mapping

- insert new and update values in existing dimensions

- keep current and previous values (partial history)

Source

Empid   Name    Email
10      Shane   [email protected]

Target

PM_PRIMARYKEY   Empid   Name    Email           PM_Prev_ColumnName   PM_EFFECT_DATE
1               10      Shane   Shane@xyz.com                        01/01/00


Type III Dimension


Source

Empid  Name   Email
10     Shane  Shane@abc.com

Target

PM_PRIMARYKEY  Empid  Name   Email          PM_Prev_ColumnName  PM_EFFECT_DATE
1              10     Shane  Shane@abc.com  Shane@abc.co.in     01/03/00
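A Type III dimension keeps one row per key and moves the old value into a "previous" column. The sketch below illustrates that with a plain dictionary keyed by Empid. The function name `apply_type3`, the column name `PM_Prev_Email` (standing in for PM_Prev_ColumnName) and the choice to track only Email are assumptions for illustration, not the wizard's generated logic.

```python
def apply_type3(dimension, source_row, load_date, next_key):
    """Apply one source row to a Type III dimension (current + previous value).

    dimension maps Empid to a single row dict. Returns the next unused key.
    """
    current = dimension.get(source_row["Empid"])
    if current is None:
        # New dimension member: insert it with no previous value yet.
        dimension[source_row["Empid"]] = {
            "PM_PRIMARYKEY": next_key,
            "Empid": source_row["Empid"],
            "Name": source_row["Name"],
            "Email": source_row["Email"],
            "PM_Prev_Email": None,
            "PM_EFFECT_DATE": load_date,
        }
        return next_key + 1
    if current["Email"] != source_row["Email"]:
        # Changed member: shift the current value into the previous column.
        current["PM_Prev_Email"] = current["Email"]
        current["Email"] = source_row["Email"]
        current["PM_EFFECT_DATE"] = load_date
    return next_key
```

Because there is only one "previous" column, a second change overwrites the first one, which is why the slides describe this as partial history.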

Mapplet Designer 

Rules for Objects in Mapplets

• Do not use the following in a mapplet

- Joiner

- COBOL source definition

- Normalizer

- Target definitions

How to create a Mapplet?

• Open the Mapplet designer workspace

• Choose Mapplets > Create

• Give a name to the Mapplet

• Create an input transformation to define mapplet input ports if the mapplet contains no sources

Creating a Mapplet

• Double click on the object to define the input ports

• Create the transformation objects to be used in the mapplet

• One input transformation can be connected to only one transformation, so to pass the same values to two separate data flows, connect the input transformation to another transformation and then split the data flow

• Use an Output transformation to create output ports, creating one Output transformation for each mapplet output group

• Connect all the input ports, ports in transformation objects and the output ports to complete the data flow

• Choose Repository > Save to store the mapplet

• Check the output window for validation status

• If the mapplet is not valid, correct the problem and re-save the mapplet

PowerMart Server Manager 

Overview of Server Manager 

Tasks performed in Server Manager

• Configure the Informatica Server

• Configure Database, External loader and FTP connections

• Manage sessions and batches

- Create, edit, validate, and delete sessions and batches

- Copy and move sessions and batches within a folder

- Start and stop sessions and batches, and abort sessions

- View session logs, session details and session performance details

• Monitor, add, edit, and delete Informatica Server information in the repository

• Stop the Informatica Server

• Customize Server Manager

Server Manager 

Sessions

• Set of instructions that tells the Informatica Server how and when to move data from sources to targets

Batch

• Group of sessions

• Types of batch

- Sequential: runs the sessions one after the other

- Concurrent: runs all the sessions at the same time
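The difference between the two batch types can be illustrated in Python by treating each session as a callable. This is a conceptual sketch only: `run_sequential` and `run_concurrent` are hypothetical names, and nothing here reflects how the Informatica Server actually schedules sessions.

```python
from concurrent.futures import ThreadPoolExecutor


def run_sequential(sessions):
    """Run each session only after the previous one has finished."""
    return [session() for session in sessions]


def run_concurrent(sessions):
    """Start all sessions at the same time and wait for all of them."""
    with ThreadPoolExecutor(max_workers=max(1, len(sessions))) as pool:
        futures = [pool.submit(session) for session in sessions]
        # Results are collected in submission order, not completion order.
        return [future.result() for future in futures]
```

A sequential batch suits sessions that depend on each other's output; a concurrent batch suits independent sessions when the server has spare capacity.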

Server Manager Workspace

Workspace areas shown in the screenshot: Menu Bar, Std. Tool Bar, Navigator, Status Bar, Output, Monitor, Configure

Configuring Server Manager 

How to Register the Server?

• Make sure the Informatica Server is running

• In Server Manager connect to the repository

• Choose Server configuration > Register Server

• In the dialog box that appears, select the network protocol (TCP/IP or IPX/SPX) and enter the following settings

- Server Name

- Host Name or IP address

- Port Number

- Timeout

• To enter server variables, click the Advanced button

• Enter a valid root directory for $PMRootDir

• Enter the other server variables, as desired

• Click OK to save your changes 

Server Manager 

How to add database connections?

• Connect to a repository

• Choose Server Configuration > Database Connections

• Click Add

• Enter the following information

• Click OK to add this connection to the Data Sources list

• Click Close to save all the changes

Session IV

Objective

• Hands on with Server Manager

• Understand how to tune performance in Informatica

• Test your knowledge gathered through this training

What will you learn?

• Creating sessions

• Creating batch

• Running a session/batch

• Performance tuning features

• Implementing case study

Creating Session

How to create a Session?

• Click on the folder in the navigator which contains the mapping

• Select Operations > Add Session from menu bar

• Select the mapping for which the session has to be created

• Click OK

• On the time page, enter a schedule for the session

• Click Next

Creating Batches

How to create a batch?

• Select the folder in the navigator for which sessions have been created

• Choose Operations > Add batch

• Enter the Batch name

• Specify whether the batch is a concurrent one

• Enter the schedule for the batch

• Click OK to add the batch to the folder

• To add sessions to the batch, drag and drop the required sessions onto the batch icon

Run a Session/Batch

• Select the session/batch to be run from the folder in the navigator

• Make sure the Monitor option of the server is checked

• Click on the Start a session/batch icon

• Check the output window to make sure the server acknowledged the request to run the session/batch

• Click on the Poll sessions icon to check the status of the run in the monitor window

• Double click on the session in the monitor window

• Click on the Refresh continuously button to monitor whether the session is running, completed successfully or failed

• Read the number of rows loaded, the number of rows failed or the first error message from the dialog box

• Click on the Open Log File button to dig into why a session failed

Performance Tuning

Source Qualifier              Target
Buffer Input  Buffer Output   Buffer Input  Buffer Output   Evaluation
efficiency    efficiency      efficiency    efficiency
High          Low             High          Low             Source database slow, eliminate read bottleneck
Low           Low/High        Low           High            Target database slow, eliminate write bottleneck
Low           High            High          Low             DTM slow, optimize session or mapping

Efficiency Percentage

High = 80-100%

Low = 0-20%
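The evaluation table can be read as a simple lookup. The Python sketch below encodes exactly the three rows and the High/Low bands given on the slide; the function names `level` and `evaluate` and the "No match in the table" fallback are assumptions added for illustration.

```python
def level(pct):
    """Map an efficiency percentage to the slide's High/Low bands."""
    if 80 <= pct <= 100:
        return "High"
    if 0 <= pct <= 20:
        return "Low"
    return None  # between the bands: the table gives no guidance


def evaluate(sq_in, sq_out, tgt_in, tgt_out):
    """Classify the bottleneck from the four buffer efficiency counters."""
    pattern = tuple(level(p) for p in (sq_in, sq_out, tgt_in, tgt_out))
    if pattern == ("High", "Low", "High", "Low"):
        return "Source database slow, eliminate read bottleneck"
    if pattern in (("Low", "Low", "Low", "High"),
                   ("Low", "High", "Low", "High")):
        return "Target database slow, eliminate write bottleneck"
    if pattern == ("Low", "High", "High", "Low"):
        return "DTM slow, optimize session or mapping"
    return "No match in the table"
```

For example, `evaluate(90, 10, 95, 5)` falls in the first row and points at the source database.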

The counters help identify

• Read/Write/DTM bottlenecks

• Caching problems

• Transformation errors

• Shared memory allocation problems

To avoid read bottleneck

• Reduce the number of records processed by filtering the source

• Index the source

• Replace the default query with an optimized query in the SQL override of the Source Qualifier

To avoid write bottleneck

• Utilize SQL loader facility of the database

• Drop indexes before load and rebuild after the load

• Increase the database block size

To avoid DTM bottleneck

• Reduce the amount of detail in the session logs

To avoid DTM bottleneck

• Optimize mapping

• Optimize session

For cache problems

• Increase index and data cache

For shared memory allocation problems

• Increase the shared memory size

To optimize sessions

• Increase shared memory size

• Increase buffer block size for very large row sizes

• Consider the commit interval. More commits = slower performance

• Disable decimal arithmetic if possible
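The commit-interval point can be illustrated with Python's built-in `sqlite3` module. This sketch only counts how many commits a given interval causes for the same load; the table name `target`, the helper names and the in-memory database are assumptions, and it does not model the Informatica writer itself.

```python
import sqlite3


def make_target():
    """Create an in-memory database with a hypothetical target table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE target (id INTEGER PRIMARY KEY, name TEXT)")
    conn.commit()
    return conn


def load_rows(conn, rows, commit_interval):
    """Insert rows, committing every commit_interval rows.

    Returns the number of commits issued.
    """
    commits = 0
    pending = 0
    for row in rows:
        conn.execute("INSERT INTO target (id, name) VALUES (?, ?)", row)
        pending += 1
        if pending == commit_interval:
            conn.commit()
            commits += 1
            pending = 0
    if pending:  # commit the final partial group
        conn.commit()
        commits += 1
    return commits
```

Loading the same rows with a larger interval issues fewer commits, which is the trade-off the bullet describes; the cost of a larger interval is more uncommitted work at risk if the load fails.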

Data Transformation process

 Advanced server Features

• Data Transformation process

Session shared memory

Reader  DTM Writer 

64k…..

• Session memory is divided into buffers

• Buffers are used by the reader to store data from the source

• By the DTM, to store the transformed data

• By the writer, to hold the transformed data that will be written to the target
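The reader, DTM and writer sharing buffers can be pictured as a three-stage pipeline connected by bounded queues. The Python sketch below is a conceptual model only: `run_pipeline`, the block and buffer sizes and the use of threads are assumptions, not a description of the server's internals.

```python
import queue
import threading


def run_pipeline(source_rows, transform, block_rows=2, buffer_blocks=4):
    """Move rows reader -> DTM -> writer through bounded buffers."""
    read_buf = queue.Queue(maxsize=buffer_blocks)   # filled by the reader
    write_buf = queue.Queue(maxsize=buffer_blocks)  # filled by the DTM
    loaded = []                                     # stands in for the target
    done = object()                                 # end-of-data marker

    def reader():
        for i in range(0, len(source_rows), block_rows):
            read_buf.put(source_rows[i:i + block_rows])
        read_buf.put(done)

    def dtm():
        while (block := read_buf.get()) is not done:
            write_buf.put([transform(row) for row in block])
        write_buf.put(done)

    def writer():
        while (block := write_buf.get()) is not done:
            loaded.extend(block)

    threads = [threading.Thread(target=stage) for stage in (reader, dtm, writer)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return loaded
```

Because the queues are bounded, a slow stage makes its input buffer fill up and its output buffer run empty, which is the intuition behind the buffer efficiency counters used for tuning.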

• E-Mail configuration

• Send email message after a session run

variables

%s session name

%l total rows loaded

%r total rows rejected

%e session error status

%b session start time

%c session elapsed time

%g attaches the session log to the message
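To show how such variables expand, here is a small Python sketch that substitutes the codes listed above into a message template. The function `expand_email_variables`, the `stats` keys and the sample values are hypothetical; the server performs this substitution itself, so this only mimics the effect.

```python
import re

# Variable letters from the list above, mapped to hypothetical stat names.
EMAIL_VARIABLES = {
    "s": "session_name",
    "l": "rows_loaded",
    "r": "rows_rejected",
    "e": "error_status",
    "b": "start_time",
    "c": "elapsed_time",
}


def expand_email_variables(template, stats):
    """Return (message_text, attach_log) for a template using %-variables."""
    attach_log = "%g" in template          # %g asks for the session log
    body = template.replace("%g", "")
    body = re.sub(
        r"%([slrebc])",
        lambda m: str(stats[EMAIL_VARIABLES[m.group(1)]]),
        body,
    )
    return body.strip(), attach_log
```

A template such as `"Session %s loaded %l rows, rejected %r %g"` would yield the filled-in text plus a flag saying the log should be attached.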

Case Study

Case I

Product data is captured on two platforms: one in a relational table and the other in a flat file. Combine data from these two sources and load it into the Product dimension table

Case II

In a data warehouse, we create surrogate keys for defining the primary keys in dimension tables.

Create surrogate keys for the product table created in the previous case.

Populate the fact table with the surrogate keys created in the product dimension table as the foreign keys
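As a warm-up for Case II, the idea can be sketched outside Informatica in Python: generate a sequence-based surrogate key per product, then look that key up when building fact rows. The function names and the columns `product_key`, `product_id` and `quantity` are assumptions for illustration; in a mapping this would typically be done with a sequence generator and a lookup.

```python
def assign_surrogate_keys(products, start=1):
    """Return (dimension_rows, key_by_product_id) with generated keys."""
    dimension = []
    key_by_id = {}
    for key, product in enumerate(products, start=start):
        dimension.append({"product_key": key, **product})
        key_by_id[product["product_id"]] = key
    return dimension, key_by_id


def build_fact(order_lines, key_by_id):
    """Replace the natural product_id with the dimension's surrogate key."""
    return [
        {"product_key": key_by_id[line["product_id"]],
         "quantity": line["quantity"]}
        for line in order_lines
    ]
```

The fact rows then carry only `product_key`, so the natural key can change in the source without touching the fact table.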

Case III

The Employee table has data for employees situated in all the countries. Whenever new employee data gets added or the existing data gets modified in the source table, the new/modified data needs to be loaded into the Employees dimension.

Create a mapping which checks for new/changed data in the source and loads only those records into the target
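The comparison Case III asks for can be prototyped in Python before building the mapping. This sketch compares source rows with target rows by key and returns only the new and changed ones; `detect_changes` and the `employee_id` key name are assumptions, and a real mapping would usually do this with a lookup on the target plus a filter or update strategy.

```python
def detect_changes(source_rows, target_rows, key="employee_id"):
    """Split source rows into (new, changed) relative to the target."""
    existing = {row[key]: row for row in target_rows}
    new, changed = [], []
    for row in source_rows:
        old = existing.get(row[key])
        if old is None:
            new.append(row)       # key not in target yet: insert
        elif old != row:
            changed.append(row)   # key exists but values differ: update
        # identical rows are skipped entirely
    return new, changed
```

Only the rows in `new` and `changed` need to be sent to the target; unchanged rows are dropped early, which also reduces the load volume.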

Case IV

Create the following tables in the source database

1. Customers

Customer_id, Customer_name, Address, City, State, Country

2. Employees

Employee_id, First_name, last_name, Designation, Address

3. Products

Product_id, Product_name, item, unitprice

4. Orders

order_id, customer_id, employee_id, order_date, required_date

5. Order details

order_id, product_id, unitprice, quantity, discount

On the target side,

1. Create the dimension tables for customers, products and employees, adding an extra surrogate key column to each.

• 2. Design a mapping for loading the data into the above-mentioned tables by using the Slowly Changing Dimensions wizard. Try to make use of Type 1, Type 2 and Type 3

• 3. Create order_fact in the target db for loading orders. Design a mapping for loading the data to handle insert and update strategies. Populate customer_key and employee_key instead of customer_id and employee_id by using lookup transformations.

• 4. Create an order_details_fact table with the following measures

Order_id, no_of_products, Tot_qty, Tot_price

• Try to make use of the aggregator transformation.
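The aggregation in step 4 can be checked against a small Python reference before building the mapping. The definitions used here are assumptions, since the exercise does not spell them out: `no_of_products` counts detail lines per order, `Tot_qty` sums quantity, `Tot_price` sums unitprice × quantity × (1 − discount), and discount is treated as a fraction.

```python
from collections import defaultdict


def aggregate_order_details(details):
    """Group order detail rows by order_id and compute the fact measures."""
    totals = defaultdict(
        lambda: {"no_of_products": 0, "Tot_qty": 0, "Tot_price": 0.0}
    )
    for line in details:
        measures = totals[line["order_id"]]
        measures["no_of_products"] += 1
        measures["Tot_qty"] += line["quantity"]
        measures["Tot_price"] += (
            line["unitprice"] * line["quantity"] * (1 - line["discount"])
        )
    # One output row per order, ordered by order_id.
    return [{"Order_id": oid, **m} for oid, m in sorted(totals.items())]
```

In the mapping, the equivalent is an aggregator grouped by order_id with a count and two sums as output ports.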

Quiz

1. Where do you generate reports on Metadata?

a. Designer

b. Repository manager

c. Server manager

d. Server

2. What is the use of cubes and dimensions in the Designer?

a. Metadata analysis

b. OLAP tool analysis

c. Report generation

d. Creating sessions 

3. When only one object of an entire mapping is reusable, it is called?

a. Reusable transformation

b. Mapplet

c. Repeat transformation

d. Duplicate transformation

4. When a group of objects in a mapping are reusable, it is called?

a. Reusable transformation

b. Mapplet

c. Repeat transformation

d. Duplicate transformation

5. Where do you copy a mapping from one folder to another folder?

a. Designer

b. Repository manager

c. Server manager

d. Server

6. Where do you copy a session from one folder to another folder?

a. Designer

b. Repository manager

c. Server manager

d. Server

7. How many repositories can you create in one database?

a.  1

b.  2

c.  3

d.  Any number

8. Where do you clear the locks on the objects in Informatica?

a.  Designer

b.  Repository Manager

c.  Server Manager

d.  Not possible

9. Where do you create folders?

a.  Designer

b.  Repository Manager

c.  Server Manager

d.  None of the above

10. A mapplet must include the following

a.  At least one input transformation, at least one output transformation

b.  Exactly one input transformation, at least one output transformation

c.  No input transformation, at least one output transformation

d.  At least one input transformation, no output transformation

13. How much is the default size for the index cache?

a.  100 MB

b.  10 MB

c.  1 MB

d.  100 KB

14. How much is the default size for the data cache?

a.  100 MB

b.  200 MB

c.  2 MB

d.  100 KB
