SQL Server 2005

SQL Server Integration Services (SSIS). Presented by Tarek Ghazali: IT Technical Specialist, Microsoft SQL Server (MVP), Microsoft Certified Technology Specialist (MCTS), Web Development (MCP), LebDev Vice President. © 2007 Tarek Ghazali. All rights reserved.

SQL Server 2005 -SSIS Tarek Ghazali

Transcript of SQL Server 2005

  • SQL Server Integration Services (SSIS). Presented by Tarek Ghazali: IT Technical Specialist, Microsoft SQL Server (MVP), Microsoft Certified Technology Specialist (MCTS), Web Development (MCP), LebDev Vice President. © 2007 Tarek Ghazali. All rights reserved.

  • What Is SSIS?

    Integration Services is a platform for building high performance data integration and workflow solutions, including extraction, transformation, and loading (ETL) operations for data warehousing.

    Transfer and transform data: SSIS contains a data-flow engine to transfer and transform data to and from varied data sources.

    Tools and wizards: SSIS contains graphical tools and wizards for creating an extraction, transformation, and loading system.

    New version of DTS (Data Transformation Services in SQL Server 2000)

  • Typical use of Integration Services

    Merging Data from Heterogeneous Data Stores

    Populating Data Warehouses and Data Marts

    Cleaning and Standardizing Data

    Building Business Intelligence into a Data Transformation Process

    Automating Administrative Functions and Data Loading

  • Data integration without SSIS

  • Data Integration with SSIS

  • SSIS Architecture

    Integration Services has a completely new architecture that separates data movement and transformation from package control flow and management.

    Packages are the units of work that you execute

    Tasks do the work in packages

    Containers are objects that provide structure to packages

  • SSIS Architecture

    - The run-time engine implements the control flow and package management infrastructure.

    - The data flow engine is a specialized, high performance engine that is exclusively dedicated to extracting, transforming, and loading data.

  • SSIS Tools

    Wizards:
    - SQL Server Import and Export Wizard
    - Package Migration Wizard

    SSIS Designer: built into the BI Development Studio and is the main surface for package development.

    Command-prompt utilities:
    - Dtexec (run an existing package at the command prompt)
    - Dtutil (manage existing packages at the command prompt)
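
    As a rough illustration of the Dtexec utility listed above, the sketch below builds a command line that runs a package stored as a file. It relies only on the /F (File) option of dtexec; the package path and the helper names build_dtexec_command and run_package are hypothetical and used only for illustration.

```python
import shutil
import subprocess


def build_dtexec_command(package_path):
    """Return the argument list for running a package file with dtexec."""
    # /F (short for /File) tells dtexec to load the package from the file system.
    return ["dtexec", "/F", package_path]


def run_package(package_path):
    """Run the package if dtexec is installed; return its exit code, else None."""
    if shutil.which("dtexec") is None:
        return None  # dtexec is not on PATH, e.g. on a machine without SSIS
    return subprocess.run(build_dtexec_command(package_path)).returncode


# Hypothetical package path, used only for illustration.
command = build_dtexec_command(r"C:\Packages\LoadSales.dtsx")
print(" ".join(command))
```

    Keeping the command construction separate from the call that launches it makes the command easy to inspect or log before anything is executed.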

  • SSIS Development Environment

    SSIS Projects:
    - Data Sources
    - Data Source Views

    SSIS Packages:
    - Control flow
      - Tasks
      - Loops, sequences and events
      - Variables and scoping
      - Precedence constraints
    - Connection Managers
    - Data flow
      - Source and destination adapters
      - Transformations
      - Multiple sources with joins and unions
      - Multiple destinations with splits and multicast
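
    The data flow ideas on this slide (sources, transformations, unions, splits and multicast) can be sketched outside SSIS in plain Python. This is only a conceptual model, not the SSIS object model: every function name, column name and sample row below is an assumption made for illustration.

```python
def source(rows):
    """Source adapter: yield rows from an in-memory list (stand-in for a table or file)."""
    for row in rows:
        yield dict(row)


def derive_total(rows):
    """Transformation: add a computed column to every row."""
    for row in rows:
        row["total"] = row["qty"] * row["price"]
        yield row


def union_all(*inputs):
    """Combine several inputs into one stream, one input after another."""
    for rows in inputs:
        yield from rows


def conditional_split(rows, predicate):
    """Route each row to one of two outputs depending on a condition."""
    matched, unmatched = [], []
    for row in rows:
        (matched if predicate(row) else unmatched).append(row)
    return matched, unmatched


def multicast(rows, copies):
    """Send an independent copy of every row to each of several outputs."""
    rows = list(rows)
    return [[dict(row) for row in rows] for _ in range(copies)]


# Hypothetical sample data from two stores.
store_a = [{"sku": "A1", "qty": 2, "price": 5.0}]
store_b = [{"sku": "B7", "qty": 10, "price": 1.5},
           {"sku": "B9", "qty": 1, "price": 3.0}]

merged = derive_total(union_all(source(store_a), source(store_b)))
large, small = conditional_split(merged, lambda row: row["total"] >= 10)
warehouse_copy, audit_copy = multicast(large, 2)
```

    Here two sources are merged, a total is derived per row, the rows are split on that total, and the large orders are then copied to two destinations.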

  • Using SQL Server Integration Services

    - What Are Data Sources and Data Source Views?
    - What Is a Connection Manager?
    - What Is Package Control Flow?
    - What Is a Data Flow Task?
    - How to Deploy Packages

  • Data Sources and Data Source Views

    Data sources: A data source is a connection reference that you create outside a package. It is a real-time reference to a data store connection, which includes all tables and views in the data store.

    Data source views: Provide a subset of data from a data source. You can extend a data source view by adding calculated columns that are populated by custom expressions, adding new relationships between tables, replacing tables in the data source view with queries, and adding related tables.

  • Connection Manager

    A connection manager is a logical representation of a connection.

    Different types of connection managers enable packages to connect to a variety of data sources and servers

  • Package Control Flow

    Tasks: Tasks do the work in packages. The most important task is the data flow task.

    Precedence constraints: Precedence constraints join tasks together.

    Containers: Containers allow you to group together, or loop through, the tasks.
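
    To make the relationship between tasks, precedence constraints and containers more concrete, here is a minimal Python model of a control flow. It is a sketch under stated assumptions, not how the SSIS run-time engine is implemented: tasks are plain functions, the constraint kinds "success", "failure" and "completion" are modeled as strings, tasks are assumed to be listed in a valid execution order, and all task and file names are hypothetical.

```python
def for_each(items, task):
    """Loop container: run the task once per item; succeed only if every run succeeds."""
    def run():
        outcomes = [task(item) for item in items]  # run every item, no early exit
        return all(outcomes)
    return run


def run_package(tasks, constraints):
    """Run tasks in their listed order, honoring precedence constraints.

    tasks: dict of name -> callable returning True (success) or False (failure).
    constraints: list of (predecessor, successor, required_outcome) tuples.
    A task runs only if every incoming constraint is satisfied.
    """
    results = {}
    for name, task in tasks.items():
        incoming = [(pred, req) for pred, succ, req in constraints if succ == name]
        runnable = all(
            pred in results and (req == "completion" or results[pred] == req)
            for pred, req in incoming
        )
        if runnable:
            results[name] = "success" if task() else "failure"
    return results


loaded = []


def load_file(name):
    """Hypothetical task body: pretend that loading feb.csv fails."""
    loaded.append(name)
    return name != "feb.csv"


tasks = {
    "extract": lambda: True,
    "load": for_each(["jan.csv", "feb.csv"], load_file),
    "notify_failure": lambda: True,
    "archive": lambda: True,
}
constraints = [
    ("extract", "load", "success"),
    ("load", "notify_failure", "failure"),
    ("load", "archive", "success"),
]
results = run_package(tasks, constraints)
```

    In this run the loop container reports a failure because one file fails to load, so the failure branch executes and the archive task is skipped.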

  • Deploy Package

    Define package configurations: Allow you to update the values of properties at run time.

    Build the project: Configure the deployment utility, then build the project.

    Deploy the build: Copy the build folder to the new system, then run the manifest.
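
    The idea behind package configurations, where property values are replaced at run time without editing the package itself, can be sketched as follows. The dotted property paths, the dictionary layout and the server names are assumptions chosen for illustration; they are not the SSIS configuration format.

```python
import copy


def apply_configuration(package, configuration):
    """Return a copy of the package properties with configured values applied."""
    configured = copy.deepcopy(package)  # leave the design-time values untouched
    for path, value in configuration.items():
        *parents, leaf = path.split(".")
        target = configured
        for key in parents:
            target = target[key]
        if leaf not in target:
            raise KeyError(f"unknown property: {path}")
        target[leaf] = value
    return configured


# Hypothetical design-time property values stored with the package.
package = {
    "connections": {"warehouse": {"server": "devsql01", "database": "Sales"}},
    "variables": {"batch_size": 1000},
}

# Hypothetical configuration supplied on the target system.
production = {
    "connections.warehouse.server": "prodsql01",
    "variables.batch_size": 5000,
}

configured = apply_configuration(package, production)
```

    The same package can then move from development to production unchanged, with only the configuration differing between the two systems.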

  • SSIS Execution Methods

    - Visual Studio / BIDS
      - Debugging environment
    - Command Line (dtexec)
    - Execute Package Utility (dtexecui)
    - SQL Agent Sub System
    - Object Model

  • SSIS Designer

    Available in Business Intelligence Development Studio as part of an Integration Services project.

    Perform the following tasks:
    - Constructing the control flow in a package.
    - Constructing the data flows in a package.
    - Adding event handlers to the package and package objects.
    - Viewing the package content.
    - At run time, viewing the execution progress of the package.

  • Package Debugging

    Control flow:
    - Breakpoints
    - Progress reporting
    - Debug window

    Data flow:
    - Data viewer
    - Row counts
    - Progress reporting

  • Security

    - Packages signed with certificates
    - Data encryption before saving the package
    - Packages protected using passwords
    - Packages controlled with database-level roles, by default:
      - Administrator
      - User
      - Operator

  • Compatibility

    Both DTS and SSIS packages can be run.

    Use the Execute DTS 2000 Package task to run DTS packages from Integration Services packages.

    Use the Package Migration Wizard to migrate packages in SQL Server 2000 DTS format.

  • Migration limitations

    Doesn't migrate:
    - ActiveX Script code that accesses the DTS object model via the Parent property of the Global Variables
    - Analysis Services tasks
    - Complex data transformation tasks
    - Custom tasks
    - Dynamic properties
    - Parallel data pumps
    - ActiveX Script attached to package steps
    - Transaction settings

  • Break

  • Demo

  • Resources & Questions

    Contact me: [email protected]

    Resources:
    - msdn.microsoft.com/sqlserver/
    - www.microsoft.com/sql/community

    Download presentations:
    - www.lebdev.net
    - www.devconnect.net

    Site dedicated to Integration Services:
    - www.SQLIS.com

    Data is typically stored in many different data storage systems, and extracting data from all sources and merging the data into a single, consistent dataset is challenging.

    The data in data warehouses and data marts is usually updated frequently, and the data loads are typically very large. Integration Services includes a task that bulk loads data directly from a flat file into SQL Server tables and views, and a destination component that bulk loads data into a SQL Server database as the last step in a data transformation process.

    Whether data is loaded into an online transaction processing (OLTP) or online analytic processing (OLAP) database, an Excel spreadsheet, or a file, it needs to be cleaned and standardized before it is loaded.

    The data may need to be summarized, converted, and distributed based on data values. The process may even need to reject data, based on an assessment of column values.
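
    The cleaning, standardizing and rejecting steps described in these notes can be illustrated with a small Python sketch. The column names, the standardization rules and the sample rows are all assumptions chosen for illustration; a real package would express the same logic with data flow transformations.

```python
def standardize(row):
    """Trim whitespace and normalize casing so equivalent values compare equal."""
    return {
        "customer": " ".join(row["customer"].split()).title(),
        "country": row["country"].strip().upper(),
        "amount": row["amount"].strip(),
    }


def clean(rows):
    """Standardize every row, then reject rows whose amount is not a number."""
    accepted, rejected = [], []
    for raw in rows:
        row = standardize(raw)
        try:
            row["amount"] = float(row["amount"])
        except ValueError:
            rejected.append(row)  # keep rejected rows so they can be reviewed
            continue
        accepted.append(row)
    return accepted, rejected


# Hypothetical raw input, as it might arrive from a flat file.
raw_rows = [
    {"customer": "  alice   SMITH ", "country": " lb ", "amount": "120.50"},
    {"customer": "bob jones", "country": "US", "amount": "n/a"},
]

accepted, rejected = clean(raw_rows)
```

    Rejected rows are collected rather than dropped, which mirrors the common practice of redirecting bad rows to a separate output for later inspection.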

    Administrators frequently want to automate administrative functions such as backing up and restoring databases, copying SQL Server databases and the objects they contain, copying SQL Server objects, and loading data. Integration Services packages can perform these functions.

    Data integration without SSIS:
    - Integration and warehousing require separate, staged operations.
    - Preparation of data requires different, often incompatible, tools.
    - Reporting and escalation is a slow process, delaying smart responses.
    - Heavy data volumes make this scenario increasingly unworkable.

    Data integration with SSIS:
    - Integration and warehousing are a seamless, manageable operation.
    - Source, prepare and load data in a single, auditable process.

    Integration Services supports both native and managed code.

    The Integration Services service monitors running Integration Services packages and manages the storage of packages.

    The SQL Server Import and Export Wizard provides the simplest method of constructing basic packages and copying data between data sources.

    The Package Installation Wizard guides you through the process of deploying packages and updating package configurations.

    The Package Configuration Wizard guides you through the steps to create the configurations that update the values of properties of packages and package objects at run time.

    The Package Migration Wizard guides you through the steps to migrate SQL Server 2000 DTS packages to SQL Server 2005.