Generic ETL Questionnaire

  • 7/31/2019 Generic ETL Questionnaire

    Questionnaire for ETL Requirements Gathering and Analysis

    Review Reference No.:

    Review Date:

    Review Reference Documents:

    Sl. No.   Questionnaire   Response

    General Questions

    1. What is the primary business requirement of this system?

    2. Who are the business groups/users of the system?

    3. Is any strategy in place to handle incremental data and slowly changing dimensions (SCD)?

    4. What is the projected growth of the data warehouse (DWH)?

    5. Please specify the 'out of scope' requirements.

    6. Please provide client contacts for any clarifications.

    Questions on Existing System

    7. Please explain the current process/methodology followed in the existing system.

    8. Please explain the current ETL architecture, with the breakup of development servers, QA servers, and production servers.

    9. Is the system fully automated, or is any kind of manual intervention required (e.g., during extraction or data load)? How about the new system?

    10. Is any documentation available for the existing system? Please provide access to it.

    11. Any project prototyping done? If yes, give details

    12. Are there any problems with mappings and their resolution, or any architectural challenges?

    13. Are there any known data quality issues?

    14. Are there any issues/bottlenecks related to the ETL process?

    15. Please indicate the number of existing Informatica mappings.

    16. Please indicate the complexity distribution of the current ETL mappings.

    17. Please indicate whether the current ETL jobs pull data from the source applications or the data is pushed into Informatica.

    18. What is the approximate volume of data in the database?

    19. What is the batch load window being used today?

    20. What is the database system used?

    Questions on Estimation

    21. Do aggregate tables need to be created? If yes, how many and for which subject areas?

    22. How many ETL routines/mappings are required? Please classify them by complexity as simple, medium, or complex, and provide the definition of each category.

    23. Please provide the source system details: name, platform, and description.

    24. Are any data sharing agreements with source data owners needed?

    25. Staging area: What is the design of the staging area? How much data is retained in the staging area?

    26. Are there any aggregations, calculations, denormalizations, or business rules to be applied in the ETL transformations? If yes, please provide details.

    27. What is the type of extraction: full or incremental? If incremental, how do you identify which data has changed? Are you using any specific tool for this?

    28. What is the volume of incremental data?

    29. What loading mechanism is to be used: bulk load or update-insert?

    30. What is the ETL schedule (daily/weekly/monthly), and what is the scheduling process?

    31. Are there any performance constraints, such as a time window for data extraction, transformation, or loading?

    32. What should be the strategy for ETL monitoring processes (error handling, exception handling, level of logging, notification process)?

    33. What is the security architecture of the application?

    34. Is the security at the application level, report level, or data level?