INFA latest training
Transcript of INFA latest training
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 1/133
Informatica PowerMart Training
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 2/133
Course Structure
What is ETL? What is Informatica?
Informatica Products
Informatica PowerMart training
• Session I- Overview of PowerMart
• Session II- Working with PowerMart Repository Manager
• Session III- Working with PowerMart Designer
• Session IV- Working with PowerMart Server Manager
- Performance tuning
- Case Study - Test your Informatica skills
- Quiz
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 3/133
What is ETL?
• ETL(Extraction, Transformation and Loading) is aprocess by which data is integrated and transformed
from the operational systems into the datawarehouseenvironment
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 4/133
Extraction
Oracle
Sybase
Text files
Target
80 tables
50 tables
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 5/133
Transformation
Empid
Last
Name
First
Name
10001 Jones Indiana
10002 Holmes Sherlock
Name =
Concat(First Name,
Last Name)
Indiana Jones
Sherlock Homes
Staging Area
Source
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 6/133
Loading
StagingArea
Source
Data
WarehouseDirect Load
Cleaning,
Transformation& Integration of
Raw data
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 7/133
What is Informatica?
A market leading provider of e-business infrastructureand analytic software which enables customers to
automate the integration, analysis and real timedelivery of critical corporate information viaweb,wireless and voice
Informatica applications include
• eCRM application• eBusiness Operations application
• eProcurement
More than 1,370 customers, including 60 percent of the
Fortune 100 companies are using Informatica’s analyticsolutions
More than 900 companies are using Informaticaproducts
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 8/133
What is Informatica
• Founded in 1993
• HQ : Redwood City, CA
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 9/133
Informatica Products
Informatica provides the following suite of products fordata integration
• PowerCenter- enterprise data integration hub
• PowerMart- application deployment platform
• PowerCenter.e- PowerCenter extension for e-business data
• PowerConnect- high performance data extraction
• PowerPlug- data model import utilities
• PowerBridge- Metadata bridge to Hyperion Essbase
• Analytic Business Components- developer productivity
tools
• Mobile Access- delivery of corporate data and analytics viawireless devices and voice recognition
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 10/133
PowerCenter 5 and PowerMart 5
• eBusiness Integration
• XML Sourcing/Targeting — XML data can be sourceanalyzed, read in, and written out. Source analysis canbe performed via direct parsing of the XML data file, viaDTD's, or via XML schemas.
• Object-Level Import/Export — Objects from XML files
can be imported/exported using the client tools.Appropriate validation is performed on importedobjects.
• Web metadata reports — More than 20 new, HTML-formatted metadata reports have been added to provide
documentation to mappings in areas such asdependencies and attribute listings. These reports areextensible through modification of XSL templates.
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 11/133
PowerCenter 5 and PowerMart 5
• Extensibility
• IBM MQ Series Compatibility — IBM MQ Series data can besource analyzed, read in, and written out via the MQ SeriesPowerConnect. Requires Informatica PowerCenter.
• Siebel Systems Compatibility — Siebel data can be sourceanalyzed, read in, and written out via the SiebelPowerConnect. Requires Informatica PowerCenter.
• SAP R/3 Compatibility — SAP R/3 data can be sourceanalyzed, read in, and written out via the SAP/R/3PowerConnect. Requires Informatica PowerCenter.
• PeopleSoft Compatibility — PeopleSoft data can be sourceanalyzed, read in, and written out via the PeopleSoft
PowerConnect. Requires Informatica PowerCenter.• Router (splitter) — When designing mappings, a new
transformation object can be used to provide a CASEstatement for routing rows to different pipelines
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 12/133
PowerCenter 5 and PowerMart 5
• Performance
• Parallel Transformation Pipelining (Informatica PowerCenteronly) — The DTM is multithreaded. Users can parallelize theirtransformation pipeline by specifying the number of datapartitions. Special considerations were made for aggregatorand join transformations.
• Dynamic Lookup Cache — Users can insert rows into thelookup cache during an active session.
• Shared Lookup Cache — The persistent lookup cache file canbe shared across mappings.
• Copy Session — Users can copy a session from one folder toanother, from one repository to another.
• Mapping Variables — Parameters can be specified within
various portions of a mapping.• Debugger — With the Debugger, users can perform debuggingoperations such as run mappings from Designer tool, setbreak points, examine port values, step through execution,and modify port values.
• Full Screen View — Users have the option of editing mappings
in full screen mode.
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 13/133
PowerCenter 5 and PowerMart 5
• Repository Manager Privileges — Administrators can see adisplay of all privileges granted to a specific user in the
repository manager.• Server Manager Read Only Sessions — A user with Session
Operator privileges will be able to view session properties viathe Session property dialog. However, this dialog will only bemade available in read-only mode when a user has sessionoperator privileges.
• Workspace Search — A user can search for a port name withina selected object, a port name across all objects in theworkspace, or a name of an object on the workspace.
• Show Invalid Mappings — Invalid mappings will bedistinguished with a different icon in the Navigator outline.
• Copy As (Mappings and Maplets) — A user can save amapping or maplet under a different name.
• Compare Folder — Repository Manager provides a "compare"option in the Folder menu. The user can compare the contentsof two folders and get a listing of the discrepancies.
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 14/133
PowerCenter 5 and PowerMart 5
• Resizable SQL Editor and Expression Editor — A usercan resize the dialog box for the SQL Editor (for SQL
Override) and also the Expression Editor.• Global Session Invalidation — A user can choose a
folder or an individual session and select "ValidateSession" from the menus. In the case of validation atthe folder level, the user can choose any number of theinvalid sessions to validate all at once.
• Shortcut Keys — A user can use keyboard commands toperform certain common functions within the Designerclient tool.
• Folder open on double-click — A user can double-clickon a folder icon in the navigator outline tree and expand
the outline tree to show the contents of the folder.• Enhanced overall look — Users will notice an improved
user interface appearance in the form of gradationshading on the title bars, and 3D beveled borders ontransformation objects and links.
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 15/133
PowerMart Training - Course Contents
Session I• Introduction to PowerMart and its components
• PowerMart Architecture• System requirements
• Overview of PowerMart Repository Manager
• Creating Repository
• Creating folders
• Creating Users, Groups and assigning privileges
Session II• Viewing/removing locks
• Generating metadata reports
• Import/export registry• Overview of PowerMart Designer
• Create/import source in Source analyzer
• Create/import target in Warehouse designer
• Understanding Transformation Objects
• Suggested naming conventions
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 16/133
PowerMart Training - Course Contents
Session III• Create transformation objects, mappings in Mapping
designer• Mapping wizards
• Create mapplets in Mapplet designer
• Overview of Server Manager
• What is a session/batch?
• Configure Server Manager
• Adding database connections
Session IV• Create sessions
• Create batches
• Run a session/batch
• Performance tuning techniques
• Case study
• Quiz
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 17/133
Session I
Objective
• Familiarize with Informatica Powermart and itscomponents
• Hands on with Repository Manager
What will you learn?
• Creating Repository
• Creating Folders
• Creating users
• Creating user groups
• Assigning privileges
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 18/133
Overview of Informatica PowerMart
A complete suite of components for building a
datamart/datawarehouse
Provides ability to source large volumes of fast changingdata from multiple platforms
Handles complex transformations
Supports high speed loads
Metadata repository co-ordinates and drives theextraction, transformation, loading and management
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 19/133
Components of PowerMart
PowerMart Designer
• Multi-faceted tool for visually defining mappings and
transformations
PowerMart Repository
• An open metadata store for definitions about mappings,transformations and other data mart details
PowerMart Repository Manager• A facility for managing user activities and metadata
storage in the repository
PowerMart Server
• A pipelined, multi-threaded server engine that is able to
overlap data extraction, transformation and loading
PowerMart Server Manager
• An administrative interface for configuring andscheduling extract/transform/load sessions
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 20/133
Informatica PowerMart Architecture
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 21/133
System Requirements
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 22/133
PowerMart RepositoryManager
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 23/133
Overview of Repository Manager
Used for administering the repository
Allows to do the following tasks
• Perform Repository functions
- Create,backup,copy,restore,upgrade, and delete repositories
• Perform Folder functions
- Create, edit and delete folders
- Copy a folder within a repository or to other repository
• Implement Repository security
- Create, edit and delete repository users and user groups
- Assign and revoke repository privileges and folder
permissions
- View locks and un-lock objects, versions and folders
O f
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 24/133
Overview of Repository Manager
• Add and remove repository reports
• Import and export repository connection information in the
registry
• Analyze source/target, mapping dependencies
• View the properties of repository objects
P M t R it M W k
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 25/133
PowerMart Repository Manager Workspace
Menu
Bar Std. Tool
Bar
Navigator
Main
Dependency
Status Bar
C ti th R it
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 26/133
Creating the Repository
Create an ODBC data source to connect to the databasewhere you want the repository tables to be created
Open Repository Manager module Click on ‘Create Repository’
Enter the Repository name,database username andpassword and Select the ODBC data source createdpreviously and finally the native connect string and click OK.
The list of tables created for the repository will be visible inthe Output window
C ti th F ld
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 27/133
Creating the Folder
Click on Folders>Create
Enter folder name, description, owner, group andpermissions
C ti G
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 28/133
Creating Groups
Connect to a repository
To create Groups choose Security>Manage Groups
Click Add
Enter the name of the new group and a description
Click OK
C ti U
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 29/133
Creating Users
To create Users choose Security>Manage Users
Click Add
Enter a username
Enter the password
twice to confirm it
C ti U
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 30/133
Creating Users
Click Group Memberships
To Add the user to a group, select the group in the NotMember list, and click Add
To Remove the user from a group, select the group inthe Member List, and click Remove
A i P i il
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 31/133
Assign Privileges
Choose Security> Manage Privileges
Select a privilege from the Privilege list• Use Designer
• Browse Repository
• Create Sessions and batches
• Session Operator
• Administer Repository
• Administer Server
• Super User
Assign Pri ileges
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 32/133
Assign Privileges
Click Add
• Repository groups without the selected privilege appear
Click Show Users to view the repository users
Select the user or the group
Click Add
Click OK to save changes
Session II
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 33/133
Session II
Objective
• Familiarize with Repository Manager
• Familiarize with Designer
What will you learn?
• Viewing/Removing locks
• Generating metadata reports• Import/export registry
• Creating/Importing source using Source Analyzer
• Creating/Importing target using Warehouse Designer
• Types of transformation objects• Types of ports
• Naming conventions for different objects
View/Remove Locks
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 34/133
View/Remove Locks
Locks prevent the users from duplicating or overridingwork
Choose Edit > Show Locks to view all the locked objects
The dialog box shows• Username locking the object
• Folder containing the object• Version containing the object
• Object type(folder, session, reusable transformation etc.)
• Object name
• Lock type(Read,Write,Execute,Save,Fetch)
• Lock time, hostname,Application(Server Manager, Designer,
Server etc.)
Generate Reports
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 35/133
Generate Reports
Types of Default Reports
• Mapping report
• source/target dependencies report
• Target table report
• executed session report
To add a report choose Report>Add Report
Generate Reports
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 36/133
Generate Reports
Click on Add to add a report to the installed report list
Select from the default reports available or custom
reports created using Crystal reports and click OK To view or print a report select it from the Reports
menu
Select Print Preview to view the report
Generate Reports
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 37/133
Generate Reports
Provide username, password and ODBC data sourcename
Click Print Preview and select the wildcard character if reports for all the folders, tables and versions have to
be viewed
Import/Export Registry
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 38/133
Import/Export Registry
Purpose- To simplify the process of setting up clientsystems
The registry contains the following connectioninformation
• Repository name
• Database user name and password
• Repository user name and password
• ODBC data source name(DSN)
Don’t forget to create the DSN before importing theregistry as it does not include the ODBC data source
itself
Import/Export Registry
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 39/133
Import/Export Registry
To export the registry, choose Tools>Export Registry
In the dialog box enter the name for the file and Save
To import the registry, choose Tools>Import Registry Select the file and Open it
A dialog box confirms the merging of data sourceinformation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 40/133
PowerMart Designer
Overview of PowerMart Designer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 41/133
Overview of PowerMart Designer
Designer provides five tools with which to createmappings
• Source Analyzer- to import or create source definitions for flat file, ERP, and
relational sources
• Warehouse Designer
- to import or create target definitions
• Transformation developer
- to create reusable transformations
• Mapplet designer
- to create mapplets
• Mapping designer- to create mappings
Designer allows to work in multiple folders andrepositories at a time
Designer Work Space
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 42/133
Designer Work Space
Menu
Bar
Navigator
Std. Tool
Bar
Workspace
Status Bar
Output
Source
Analyzer
Warehouse
Designer
Transformation
developer
Mapplet
Designer
Mapping
Designer
Source Analyzer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 43/133
Source Analyzer
Reads, analyzes and "reverse engineers" schemainformation of operational databases and flat files
Stores metadata information in the repository
How to import a source?
• choose Sources > Import from database/flat file/cobol file
• To import a table enter ODBC data source, username,password
Source Analyzer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 44/133
Source Analyzer
• Click on Connect button and the list of tables appears
• Select the tables which you will be using as source tables
• Click OK to add the selected tables into the Sourceanalyzer work space
How to create a new Source?
• Choose Sources > Create
• Give a name to the new source, select the database type
• Click Create and then Done buttons to make the blanksource appear in workspace
Source Analyzer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 45/133
Source Analyzer
• To enter column names, data types and field lengthsdouble click on the newly created source structure
• Click on the Columns tab
• Click on the ‘Add a new column’ button to add newcolumns in the source and specify the details
• Click OK to accept changes
Warehouse Designer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 46/133
Warehouse Designer
Provides the following features
• Create a new target
• Import the target structure
• Create cubes and dimensions for multi-dimensionalmetadata analysis
How to create a new target?
• Choose Targets > Create
• Enter the name for the target table and the database type
• Double click on the empty target structure and enter thecolumn names, data type and field length same as done
for creating sources• Select the newly created table and choose from menu bar
Target >Generate/Execute SQL
Warehouse Designer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 47/133
Warehouse Designer
• Connect to the warehouse by giving ODBC data source,user idand password
• Select the appropriate Generation options and click onGenerate SQL file button
• To view/ edit the SQL click on Edit SQL file• To generate the tables in the database click on Execute SQL
file
• Look at the Output window to check for errors in generation of the target table in database
Warehouse Designer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 48/133
Warehouse Designer
How to import target table definition?
• Choose Target > Import from database
• Connect to the database by selecting the ODBC datasource and the enter the user name and password
• Select tables from the list of tables available in thedatabase which will be used as targets
• Click OK to get the tables in the workspace
Mapping Designer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 49/133
Mapping Designer
Visual aid to creating and editing source-to-targetmappings.
Dataflow diagramming.
• Method of creating dataflow links through combinations of PowerMart 4.6's transformation objects.
• Sources, targets and transformation objects can be
dragged and dropped into a workspace to construct thetransformation pipeline.
Transformation Objects
To create a transformation
• Click on Transformation > Create
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 50/133
Transformation Objects
• Select the transformation object and give a name to it
• Click on Create and then Done
Aggregator
• Performs an aggregate calculation(Count, Average etc.)
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 51/133
Transformation Objects
Expression
• Perform custom calculations of a simple or complex nature,
using data from one or more input ports
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 52/133
Transformation Objects
Filter
• Performs a test on all records before allowing them to be sent
to the next object
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 53/133
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 54/133
j
Lookup
• Looks up values
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 55/133
j
Sequence Generator
• Generates unique ID values in the same fashion as asequence in a relational database
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 56/133
j
Source Qualifier
• Represents data temporarily stored on the PowerMart server
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 57/133
j
Stored Procedure
• Calls a stored procedure and captures return values
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 58/133
j
Update Strategy
• Defines how the PowerMart server should handle updates toexisting records in targets
Transformation Objects
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 59/133
j
Rank
• Performs comparisons and groupings
Suggested Naming Conventions
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 60/133
gg g
Transformations
• Aggregator AGG_TransformationName
• Expression EXP_TransformationName
• Filter FIL_TransformationName
• Joiner JNR_TransformationName
• Lookup LKP_TransformationName• Normalizer NRM_TransformationName
• Rank RNK_TransformationName
• Sequence Generator SEQ_TransformationName
• Stored Procedure SP_TransformationName
• Source Qualifier SQ_TransformationName
• Update Strategy UPD_TransformationName
Suggested Naming Conventions
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 61/133
gg g
Targets
• T_TARGETNAME
Mappings
• m_MappingName
Mapplets
• mplt_MappletName
Sessions
• s_MappingName
Sequential Batches
• bs_SequentialBatchName
Concurrent Batches
• bc_SequentialBatchName
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 62/133
Session III
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 63/133
Objective
• Hands on with Designer
• Familiarize with Server Manager
What will you learn?
• Creating Mapping
• Types of Mapping wizards
• Mapplet designer
• Creating Mapplets
• Configuring Server Manager
• Adding database connection
Mapping Creation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 64/133
How to create a Mapping?
• Open the Mapping designer workspace
• Choose Mappings > Create
• Give a proper name to the mapping
• Open the Sources from the navigator which you wouldhave created/imported using source analyzer
• Drag and drop the source table from navigator into theworkspace
Mapping Creation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 65/133
Mapping Creation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 66/133
• Choose Transformation > Create to create atransformation object
• Select the type of transformation object you want tocreate and give a proper name to it
Mapping Creation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 67/133
• Drag and drop the required fields(ports) from SourceQualifier to the transformation object
Mapping Creation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 68/133
• Add new ports in the transformation object and definethem as variable ports to do complex transformation
Mapping Creation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 69/133
• Open the Targets from the navigator which you wouldhave created/imported using Warehouse Designer
• Select the target table, drag and drop it into the designerworkspace
Mapping Creation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 70/133
• Drag and drop the output ports from the lasttransformation object to the corresponding ports in thetarget
• Choose Repository > Save to store the mapping
• Check the output window for any errors
Mapping Wizards
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 71/133
Wizards help to create mappings quickly and easily
Wizards designed to create mappings for loading and
maintaining star schemas
Mappings created using wizards can be edited andcustomized
Getting Started Wizard
• Simple Pass Through
• mappings to load static fact and dimensions
• Use this when you want to drop all existing data from thetable before loading new data
• Slowly Growing Target• mappings to load slowly growing dimension or fact tables
• Use this to load new data when existing data does not requireupdates
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 72/133
Mapping Wizards
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 73/133
Slowly Changing Dimensions Wizard
• Type II Dimension Mapping
Version Data Mapping
- insert new and changed dimensions with version number andincremented primary key
- full history and progress of changes
Emp id Name Email
10 Shane [email protected]
PM_PRIMARYKEY
Empid
Name Email PM_VERSION_NUMBER
1000 10 Shane Shan
0
Source
Target
Type II Dimension/Versioning
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 74/133
Empid
Name Email
10 Shane Shane@
abc.co.in
Source
Target
PM_PRIMARYKEY
Empid
Name Email PM_VERSION_NUMBER
1000 10 Shane [email protected]
0
1001 10 Shane Shane@
abc.co.in
1
Type II Dimension/Versioning
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 75/133
Empid
Name Email
10 Shane Shane@
abc.com
Source
Target
PM_PRIMARYKEY
Empid
Name Email PM_VERSION_NUMBER
1000 10 Shane [email protected]
0
1001 10 Shane Shane@
abc.co.in
1
1003 10 Shane Shane@
abc.com
2
Mapping Wizards
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 76/133
Slowly Changing Dimensions Wizard
• Type II Dimension Mapping
Flag Current Mapping
- insert new and changed dimensions with flags and incrementedprimary key
- full history and flagging only current dimensions
Emp id Name Email
10 Shane [email protected]
PM_PRIMARYKEY
Empid
Name Email PM_CURRENT
_FLAG
1000 10 Shane Shan
1
Source
Target
Type II Dimension/Flag Current
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 77/133
Empid
Name Email
10 Shane Shane@
abc.co.in
Source
Target
PM_PRIMARYKEY
Empid
Name Email PM_CURRENT_FLAG
1000 10 Shane [email protected]
N
1001 10 Shane Shane@
abc.co.in
Y
Type II Dimension/Flag Current
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 78/133
Empid
Name Email
10 Shane Shane@
abc.com
Source
Target
PM_PRIMARYKEY
Empid
Name Email PM_CURRENT_FLAG
1000 10 Shane [email protected]
N
1001 10 Shane Shane@
abc.co.in
N
1003 10 Shane Shane@
abc.com
Y
Mapping Wizards
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 79/133
Slowly Changing Dimensions Wizard
• Type II Dimension Mapping
• Effective Date Range Mapping
- insert new and changed dimensions with date range to definecurrent dimension data
- full history and tracking changes with an exact effective daterange
Empid
Name Email
10 Shane [email protected]
PM_PRIMARYKEY
Empid
Name Email PM_BEGIN
_DATE
PM_END_DATE
1000 10 Shane [email protected]
01/01/00
Source Target
Type II Dimension/Effective Date
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 80/133
Empid
Name Email
10 Shane Shane@
abc.co.in
Source
Target
PM_PRIMARYKEY
Emp id Name Email PM_BEGIN _DATE
PM_END _DATE
1000 10 Shane [email protected]
01/01/00 03/01/00
1001 10 Shane Shane@
abc.co.in
03/01/00
Type II Dimension/Effective Date
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 81/133
Empid
Name Email
10 Shane Shane@
abc.com
Source
Target
PM_PRIMARYKEY
Empid Name Email PM_BEGIN _DATE PM_END_DATE
1000 10 Shane [email protected]
01/01/00 03/01/00
1001 10 Shane [email protected]
03/01/00 05/02/00
1003 10 Shane Shane@
abc.com
05/02/00
Mapping Wizards
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 82/133
Slowly Changing Dimensions Wizard
• Type III Dimension Mapping
- insert new and update values in existing dimensions
- keep current and previous values (partial history)
Empid
Name Email
10 Shane [email protected]
PM_PRIMARYKEY
Empid Name Email PM_Prev_ColumnName
PM_EFFECT _DATE
1 10 Shane Shane@xyz.
com
01/01 /00
Source
Target
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 83/133
Type III Dimension
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 84/133
Empid
Name Email
10 Shane Shane@
abc.com
Source
Target
PM_PRIMARYKEY
Empid
Name Email PM_Prev_ColumnName
PM_EFFECT_DATE
1 10 Shane Shane@
abc.com
Shane@
abc.co.in
01/03/00
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 85/133
Mapplet Designer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 86/133
Rules for Objects in Mapplets
• Do not use the following in a mapplet
Joiner
Cobol Source definition
Normalizer
Target definitions
Mapplet Designer
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 87/133
How to create a Mapplet?
• Open the Mapplet designer workspace
• Choose Mapplets > Create
• Give a name to the Mapplet
• Create an input transformation to define mapplet inputports if the mapplet contains no sources
Creating a Mapplet
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 88/133
• Double click on the object to define the input ports
Creating a Mapplet
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 89/133
• Create the transformation objects to be used in themapplet
• One input transformation can be connected to only onetransformation, so to pass same values to two separatedata flows,connect the input transformation to anothertransformation and then split the data flow
• Use Output transformation to create output ports, creatingone Output transformation for each mapplet output group
• Connect all the input ports, ports in transformation objectsand the output ports to complete the data flow
Creating a Mapplet
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 90/133
• Choose Repository > Save to store the mapplet
• Check the output window for validation status
• If mapplet is not valid,correct the problem and re-save themapplet
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 91/133
PowerMart Server Manager
Overview of Server Manager
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 92/133
Tasks performed in Server Manager
• Configure the Informatica Server
• Configure Database, External loader and FTP connections
• Manage sessions and batches
Create,edit,validate,and delete sessions and batches
Copy and move sessions and batches within a folder
Start and Stop sessions and batches, and abort sessions
View session logs,session details and session performancedetails
• Monitor, add, edit, and delete Informatica serverinformation in the repository
• Stop the Informatica Server
• Customize Server Manager
Server Manager
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 93/133
Sessions
• set of instructions that tell Informatica Server how
and when to move data from sources to targets
Batch
• Group of sessions
• Types of batch• Sequential
– Runs the sessions one after the other
• Concurrent
– Runs all the sessions at the same time
Server Manager Workspace
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 94/133
Menu
Bar
Std. Tool
Bar
Navigator
Status Bar
Output
Monitor
Configure
Configuring Server Manager
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 95/133
How to Register the Server?
• Make sure the Informatica Server is running
• In Server Manager connect to the repository
• Choose Server configuration > Register Server
• In the dialog box that appears select the network protocol(TCP/IP or IPX/SPX) and enter the following settings
• Server Name
• Host Name or IP address
• Port Number
• Timeout
Configuring Server Manager
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 96/133
• To enter server variables,click the Advanced button
• Enter a valid root directory for $PMRootDir
• Enter the other server variables, as desired
• Click OK to save your changes
Server Manager
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 97/133
How to add database connections?
• Connect to a repository
• Choose Server Configuration > Database Connections
• Click Add
• Enter the following information
• Click OK to add this connection to the Data Sources list
• Click Close to save all the changes
Session IV
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 98/133
Objective
• Hands on with Server Manager
• Understand how to tune performance in Informatica
• Test your knowledge gathered through this training
What will you learn?
• Creating sessions
• Creating batch
• Running a session/batch
• Performance tuning features
• Implementing case study
Creating Session
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 99/133
How to create a Session?
• Click on the folder in the navigator which contains themapping
Creating Session
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 100/133
• Select Operations > Add Session from menu bar
• Select the mapping for which session has to becreated
• Click OK
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 101/133
Creating Session
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 102/133
• On the time page, enter a schedule for the session
• Click Next
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 104/133
Creating Batches
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 105/133
How to create a batch?
• Select the folder in the navigator for which sessions have
been created• Choose Operations > Add batch
• Enter the Batch name
Creating Batches
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 106/133
• Specify whether a concurrent one
• Enter the schedule for the batch
• Click OK to add the batch to the folder
• To add sessions to the batch, drag and drop therequired sessions onto the batch icon
Run a Session/Batch
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 107/133
• Select the session/batch to be run from the folder in thenavigator
• Make sure the Monitor option of the server is checked
• Click on the Start a session/batch icon
• Check the output window to make sure the serveracknowledged the request to run the session/batch
• Click on the Poll sessions icon to check the status of therun in the monitor window
• Double click on the session in the monitor window
• Click on the Refresh continuously button to monitor
whether the session is running, got completedsuccessfully or got failed
Run a Session/Batch
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 108/133
• Read the number of rows loaded, failed or the first errormessage from the dialog box
• Click on the Open Log File button to dig into why a
session failed
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 109/133
Performance Tuning
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 110/133
Source
Qualifier
Target Evaluation
Buffer Inputefficiency
BufferOutputefficiency
Buffer Inputefficiency
Buffer Outputefficiency
High Low High Low Source
database slow,
eliminate read
bottleneck
Low Low/High Low High Target
database slow,
eliminate write
bottleneck
Low High High Low DTM slow,
optimize
session or
mapping
Efficiency Percentage
High= 80-100%
Low= 0-20%
Performance Tuning
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 111/133
The counters help identify
• Read/Write/DTM bottlenecks
• Caching problems
• Transformation errors
• Shared memory allocation problems
To avoid read bottleneck
• Reduce the number of records processed by filtering thesource
• Index the source
• Replace default query with an optimized query in SQL
override of Source Qualifier
Performance Tuning
d b l k
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 112/133
To avoid write bottleneck
• Utilize SQL loader facility of the database
• Drop indexes before load and rebuild after the load• Increase the database block size
To avoid DTM bottleneck
• Reduce the amount of detail in the session logs
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 113/133
Performance Tuning
T id DTM b ttl k
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 114/133
To avoid DTM bottleneck
• Optimize mapping
• Optimize session For cache problems
• Increase index and data cache
Click
Performance Tuning
F h d ll ti bl
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 115/133
For shared memory allocation problems
• Increase the shared memory size
Performance Tuning
T ti i i
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 116/133
To optimize sessions
• Increase shared memory size
• Increase buffer block size for very large row sizes
• Consider the commit interval. More commits=slowerperformance
• Disable decimal arithmetic if possible
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 117/133
Data Transformation processData Transformation process
Advanced server Features
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 118/133
• Data Transformation process• Data Transformation process
Session shared memory
Reader DTM Writer
64k…..
Advanced server Features
• Session memory is divided into buffers
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 119/133
• Session memory is divided into buffers
• Buffer are used by the reader to store data from the
source• By the DTM, store the transformed data
• By the writer, transformed data will be written to thetarget
Advanced Server Features
• E Mail configuration
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 120/133
• E-Mail configuration
• Send email message after a session run
variables
%s session name
%l total rows loaded
%r total rows rejected
%e session error status
%b session start time
%c session elapsed time
%g attaches the session log to the message
Case Study
Case I
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 121/133
Case I
Product data is captured in two platforms. One in
relational table and other in flat file. Combine data fromthese two sources and put them into the Productdimensional table
Case II
In a data warehouse, we create surrogate keys fordefining the primary keys in dimension tables.
Create surrogate keys for the product table created inprevious case.
Populate the fact table with the surrogate keys createdin product dimension table as the foreign keys
Case Study
Case III
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 122/133
Case III
The Employee table has data for employees situated in all
the countries.Whenever new employee data gets added or theexisting data gets modified into the source table, thenew/modified data needs to be loaded into theEmployees dimension.
Create a mapping which checks for new/changed datain the source and loads only those records into thetarget
Case Study
Case IV
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 123/133
Case IV
Create the following tables in the source database
1. Customers
Customer_id, Customer_name, Address, City, State
Country
2. Employees
Employee_id, First_name, last_name, Designation
Address
3. Products
Product_id, Product_name, item, unitprice
Case Study
4 Orders
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 124/133
4. Orders
order_id, customer_id, employee_id, order_date
required_date
5. Order details
order_id, product_id, unitprice, quantitydiscount
In the target side,
1. create the dimension table for Customer,products and
employees by adding an extra surrogate key columnsrespectively.
Case Study
• 2 Design a mapping for loading the data on the above
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 125/133
• 2. Design a mapping for loading the data on the abovementioned tables by using slowly changing dimensionwizard. Try to make use of Type1,Type2 and Type3
• 3. Create order_fact in the target db for loading orders.Design a mapping for loading the data to handle insertand update strategies. Populate customer_key andemployee_key instead of populatingcustomer_id,employee_id by using lookuptransformations.
• 4. Create a order_details_fact table with the followingmeasures
Order_id, no_of_products, Tot_qty, Tot_price• Try to make use of aggregator transformation.
Quiz
1 Where do you generate reports on Metadata?
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 126/133
1. Where do you generate reports on Metadata?
a. Designer
b. Repository manager
c. Server manager
d. Server
2. What is the use of cube and dimensions in thedesigner?
a. Metadata analysis
b. OLAP tool analysis
c. Report generation
d. Creating sessions
Quiz
3 When only one object of an entire mapping is reusable
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 127/133
3. When only one object of an entire mapping is reusableit is called?
a. Reusable transformationb. Mapplet
c. Repeat transformation
d. Duplicate transformation
4. When a group of objects in a mapping are reusable it iscalled?
a. Reusable transformation
b. Mappletc. Repeat transformation
d. Duplicate transformation
Quiz
5 Where do you copy a mapping from one folder to
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 128/133
5. Where do you copy a mapping from one folder toanother folder?
a. Designerb. Repository manager
c. Server manager
d. Server
6. Where do you copy a session from one folder to anotherfolder?
a. Designer
b. Repository managerc. Server manager
d. Server
Quiz
7 How many repositories can you create in one
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 129/133
7. How many repositories can you create in one
database?
a. 1
b. 2
c. 3
d. Any number
8. Where do you clear the locks on the objects in
Informatica?
a. Designer
b. Repository Manager
c. Server Manager
d. Not possible
Quiz
9 Where do you create folders?
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 130/133
9. Where do you create folders?
a. Designer
b. Repository Manager
c. Server Manager
d. None of the above
10. A mapplet must include the followinga. At least one input transformation, at least one output
transformation
b. Exactly one input transformation, at least one outputtransformation
c. No input transformation, at least one outputtransformation
d. At least one input transformation, no outputtransformation
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 131/133
Quiz
13. How much is the default size for the index cache?
7/28/2019 INFA latest training
http://slidepdf.com/reader/full/infa-latest-training 132/133
13. How much is the default size for the index cache?
a. 100 MB
b. 10 MB
c. 1 MB
d. 100 KB
14. How much is the default size for the data cache?
a. 100 MB
b. 200 MB
c. 2 MB
d. 100 KB