1012DM01 Data Mining
Transcript of 1012DM01 Data Mining
-
7/29/2019 1012DM01 Data Mining
1/24
Data Mining
1
1012DM01
MI4
Thu 9, 10 (16:10-18:00) B216
Introduction to Data Mining
Min-Yuh Day
Assistant ProfessorDept. of Information Management,Tamkang University
http://mail. tku.edu.tw/myday/2013-02-21
http://mail.tku.edu.tw/myday/http://mail.tku.edu.tw/myday/cindex.htmhttp://www.im.tku.edu.tw/en_index.htmlhttp://english.tku.edu.tw/index.asphttp://www.tku.edu.tw/http://www.im.tku.edu.tw/http://mail.tku.edu.tw/myday/http://mail.tku.edu.tw/myday/http://mail.im.tku.edu.tw/~myday/http://www.im.tku.edu.tw/http://www.tku.edu.tw/http://english.tku.edu.tw/index.asphttp://www.im.tku.edu.tw/en_index.htmlhttp://mail.tku.edu.tw/myday/cindex.htmhttp://mail.tku.edu.tw/myday/ -
7/29/2019 1012DM01 Data Mining
2/24
1012(2013.02 - 2013.06)
(Data Mining)
(Min-Yuh Day)P (TLMXB4P) 2 (2 Credits, Elective) 9,10 (Thu 16:10-18:00) B216
2
-
7/29/2019 1012DM01 Data Mining
3/24
Data Mining at the
Intersection of Many Disciplines
Statistics
Management Science &Information Systems
Artificia
lIntelligen
ce
Databases
PatternRecognition
MachineLearning
MathematicalModeling
DATA
MINING
Source: Turban et al. (2011), Decision Support and Business Intelligence Systems 3
-
7/29/2019 1012DM01 Data Mining
4/24
Knowledge Discovery (KDD) Process
Data Cleaning
Data Integration
Databases
Data Warehouse
Task-relevant Data
Selection
Data Mining
Pattern Evaluation
4Source: Han & Kamber (2006)
Data mining:
core of knowledge discovery process
-
7/29/2019 1012DM01 Data Mining
5/24
Data Warehouse
Data Mining and Business Intelligence
Increasing potentialto supportbusiness decisions End User
BusinessAnalyst
DataAnalyst
DBA
Decision
Making
Data PresentationVisualization Techniques
Data Mining
Information Discovery
Data ExplorationStatistical Summary, Querying, and Reporting
Data Preprocessing/Integration, Data Warehouses
Data Sources
Paper, Files, Web documents, Scientific experiments, Database Systems
5Source: Han & Kamber (2006)
-
7/29/2019 1012DM01 Data Mining
6/24
Business PressuresResponses
Support Model
Source: Turban et al. (2011), Decision Support and Business Intelligence Systems 6
-
7/29/2019 1012DM01 Data Mining
7/24
(Data Mining)
SAS
7
-
7/29/2019 1012DM01 Data Mining
8/24
Course Introduction This course introduces the fundamental concepts and
applications technology ofdata mining. Topics include
Introduction to Data Mining
Association Analysis Classification and Prediction
Cluster Analysis
Data Mining Using SAS Enterprise Miner
Case Study and Implementation of Data Mining
8
-
7/29/2019 1012DM01 Data Mining
9/24
(Objective)
Understand and apply the fundamentalconcepts and technology ofdata mining
9
-
7/29/2019 1012DM01 Data Mining
10/24
(Subject/Topics)1 102/02/21 (Introduction to Data Mining)2 102/02/28 ()
(Peace Memorial Day) (No Classes)
3 102/03/07 (Association Analysis)4 102/03/14 (Classification and Prediction)5 102/03/21 (Cluster Analysis)6 102/03/28 SAS
(Data Mining Using SAS Enterprise Miner)7 102/04/04 ()
(Children's Day, Tomb Sweeping Day)(No Classes)
8 102/04/11 (SAS EM )
Banking Segmentation (Cluster Analysis K-Means using SAS EM)
(Syllabus)
10
-
7/29/2019 1012DM01 Data Mining
11/24
(Subject/Topics)9 102/04/18 (Midterm Presentation)10 102/04/25 11 102/05/02 (SAS EM)
Web Site Usage Associations ( Association Analysis using SAS EM)
12 102/05/09 (SAS EM )Enrollment Management Case Study
(Decision Tree, Model Evaluation using SAS EM)
13 102/05/16 (SAS EM)Credit Risk Case Study
(Regression Analysis, Artificial Neural Network using SAS EM)
14 102/05/23 (Term Project Presentation)15 102/05/30
(Syllabus)
11
-
7/29/2019 1012DM01 Data Mining
12/24
12
-
7/29/2019 1012DM01 Data Mining
13/24
(Slides)
Applied Analytics Using SAS Enterprise Mining,
Jim Georges, Jeff Thompson and Chip Wells, 2010, SAS
Decision Support and Business Intelligence Systems,
Ninth Edition, Efraim Turban, Ramesh Sharda, Dursun Delen,2011, Pearson
Efraim Turban 2011
13
-
7/29/2019 1012DM01 Data Mining
14/24
2Team Term Project
30 () 30 () 40
14
-
7/29/2019 1012DM01 Data Mining
15/24
Team Term Project
Term Project Topics Data mining
Web mining
Business Intelligence 3-5
2013.03.07 ()
15
-
7/29/2019 1012DM01 Data Mining
16/24
A Taxonomy for Data Mining TasksData Mining
Prediction
Classification
Regression
Clustering
Association
Link analysis
Sequence analysis
Learning Method Popular Algorithms
Supervised
Supervised
Supervised
Unsupervised
Unsupervised
Unsupervised
Unsupervised
Decision trees, ANN/MLP, SVM, Roughsets, Genetic Algorithms
Linear/Nonlinear Regression, Regressiontrees, ANN/MLP, SVM
Expectation Maximization, AprioryAlgorithm, Graph-based Matching
Apriory Algorithm, FP-Growth technique
K-means, ANN/SOM
Outlier analysis Unsupervised K-means, Expectation Maximization (EM)
Apriory, OneR, ZeroR, Eclat
Classification and Regression Trees,ANN, SVM, Genetic Algorithms
Source: Turban et al. (2011), Decision Support and Business Intelligence Systems 16
-
7/29/2019 1012DM01 Data Mining
17/24
The Evolution of BI Capabilities
Source: Turban et al. (2011), Decision Support and Business Intelligence Systems 17
-
7/29/2019 1012DM01 Data Mining
18/24
A High-Level Architecture of BI
Source: Turban et al. (2011), Decision Support and Business Intelligence Systems 18
-
7/29/2019 1012DM01 Data Mining
19/24
Mining the Social Web:
Analyzing Data from Facebook, Twitter,
LinkedIn, and Other Social Media Sites
19Source:http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345
http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345http://www.amazon.com/Mining-Social-Web-Analyzing-Facebook/dp/1449388345 -
7/29/2019 1012DM01 Data Mining
20/24
Web Mining Success Stories
Amazon.com, Ask.com, Scholastic.com, Website Optimization Ecosystem
WebAnalyticsVoice of
CustomerCustomer Experience
Management
Customer Interaction
on the Web
Analysis of Interactions Knowledge about the Holistic
View of the Customer
Source: Turban et al. (2011), Decision Support and Business Intelligence Systems 20
-
7/29/2019 1012DM01 Data Mining
21/24
Top 10 CIO Technology Priorities in 2013
Top 10 Technology Priorities RankingAnalytics and business intelligence 1
Mobile technologies 2
Cloud computing (SaaS, IaaS, PaaS) 3
Collaboration technologies (workflow) 4
Legacy modernization 5
IT management 6
CRM 7Virtualization 8
Security 9
ERP Applications 10
21Source: Gartner Executive Programs (January 2013)
http://www.gartner.com/newsroom/id/2304615
http://www.gartner.com/newsroom/id/2304615http://www.gartner.com/newsroom/id/2304615 -
7/29/2019 1012DM01 Data Mining
22/24
Top 10 CIO Business Priorities in 2013
Top 10 Business Priorities Ranking
Increasing enterprise growth 1
Delivering operational results 2
Reducing enterprise costs 3
Attracting and retaining new customers 4
Improving IT applications and infrastructure 5
Creating new products and services (innovation) 6
Improving efficiency 7
Attracting and retaining the workforce 8
Implementing analytics and big data 9
Expanding into new markets and geographies 10
22Source: Gartner Executive Programs (January 2013)
http://www.gartner.com/newsroom/id/2304615
http://www.gartner.com/newsroom/id/2304615http://www.gartner.com/newsroom/id/2304615 -
7/29/2019 1012DM01 Data Mining
23/24
Summary This course introduces the fundamental concepts and
applications technology ofdata mining.
Topics include
Introduction to Data Mining
Association Analysis
Classification and Prediction
Cluster Analysis
Data Mining Using SAS Enterprise Miner
Case Study and Implementation of Data Mining
23
-
7/29/2019 1012DM01 Data Mining
24/24
Contact Information
(Min-YuhDay, Ph.D.)02-26215656 #234702-26209737i716 ()
25137151Email [email protected]://mail.tku.edu.tw/myday/
24
http://www.tku.edu.tw/http://www.im.tku.edu.tw/http://mail.tku.edu.tw/myday/http://mail.tku.edu.tw/myday/http://www.im.tku.edu.tw/http://www.tku.edu.tw/