Data Warehousing
description
Transcript of Data Warehousing
![Page 1: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/1.jpg)
Data Warehousing Lecture-1
1
![Page 2: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/2.jpg)
Introduction and Background
2
![Page 3: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/3.jpg)
Reference Books– W. H. Inmon, Building the Data Warehouse
(Second Edition), John Wiley & Sons Inc., NY.
– A. Abdullah, “Data Warehousing for beginners: Concepts & Issues” (First Edition).
– Paulraj Ponniah, Data Warehousing Fundamentals, John Wiley & Sons Inc., NY.
3
![Page 4: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/4.jpg)
Additional Material
– Research Papers
– Magazine Articles
4
![Page 5: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/5.jpg)
Summary of courseTopics (Total Lectures = 45)
1. Introduction & Background
2. De-normalization
3. On Line Analytical Processing (OLAP)
4. Dimensional modeling
5. Extract – Transform – Load (ETL)
6. Data Quality Management (DQM)
7. Need for speed (Parallelism, Join and Indexing techniques)
8. Data Mining
9. DWH Implementation steps
10. Complete implementation case study
11. Lab and tool usage
12. Others DWH-Ahsan Abdullah 5
![Page 6: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/6.jpg)
Summary of course
Topics
1. Introduction & Background
2. De-normalization
3. On Line Analytical Processing (OLAP)
4. Dimensional modeling
6
![Page 7: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/7.jpg)
Summary of course
Topics
5. Extract – Transform – Load (ETL)
6. Data Quality Management (DQM)
7. Need for speed (Parallelism, Join and Indexing techniques)
8. Data Mining
9. DWH Implementation steps
7
![Page 8: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/8.jpg)
Summary of course
Topics
10. Complete implementation case study
11. Lab and tool usage
12. Others
8
![Page 9: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/9.jpg)
Semester ProjectDevelop an application for an organization of your choice.
A case study and coding based approach to be followed.
Use 4GL or a high level programming language.
You MUST collect the necessary data and should have a first draft of the project description approved by the instructor BEFORE initiating on detailed work.
9
![Page 10: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/10.jpg)
Semester Project (Cont…)The project report to include, but is not limited to, the following as documentation:
• Narrative description of business and tables of appropriate data.
• Descriptions of decisions to be supported by information produced by system.
• Summary narrative of results produced. • Structure charts, dataflow diagrams and/or other
diagrams to document the structure of the system. • Listings of computer models/programs utilized. • Reports displaying results. • Recommended decision from results. • User instructions.
10
![Page 11: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/11.jpg)
Approach of the course• Develop an understanding of underlying RDBMS
concepts.
• Apply these concepts to VLDB DSS environments and understand where and why they break down?
• Expose the differences between RDBMS and Data Warehouse in the context of VLDB.
• Provide the basics of DSS tools such as OLAP, Data Mining and demonstrate their application.
• Demonstrate the application of DSS concepts and limitations of the OLTP concepts through lab exercises.
11
![Page 12: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/12.jpg)
Why this course?• The world is changing (actually changed), either
change or be left behind.
• Missing the opportunities or going in the wrong direction has prevented us from growing.
• What is the right direction?• Harnessing the data, in a knowledge driven
economy.
12
![Page 13: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/13.jpg)
13
The needThe need
Knowledge is power, Intelligence is absolute power!
“Drowning in data and starving for information”
![Page 14: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/14.jpg)
14
The needThe need
DATA
INFORMATION
KNOWLEDGE
POWER
INTELLIGENCE
$$
![Page 15: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/15.jpg)
15
Historical overviewHistorical overview
1960Master Files & Reports
1965Lots of Master files!
1970Direct Access Memory & DBMS
1975Online high performance transaction processing
![Page 16: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/16.jpg)
16
Historical overviewHistorical overview
1980 PCs and 4GL Technology (MIS/DSS)
1985 & 1990 Extract programs, extract processing,
The legacy system’s web
![Page 17: Data Warehousing](https://reader030.fdocuments.us/reader030/viewer/2022033021/56815839550346895dc59564/html5/thumbnails/17.jpg)
17
Historical overview: Crisis of Historical overview: Crisis of CredibilityCredibility
What is the financial health of our company?What is the financial health of our company?
-10%
+10%
??