Data Mining

35
LOGO LOGO www.themegallery.com DATA MINING Dayanand Academy of Management Studies

description

 

Transcript of Data Mining

Page 1: Data Mining

LOGO

LOGO www.themegallery.com

DATA MININGDayanand Academy of Management Studies

Page 2: Data Mining

www.themegallery.com

Contents

Data Mining Introduction1

Data Mining Procedures2

Data Mining Techniques3

Data Mining Application4

Page 3: Data Mining

LOGO

LOGO

Data Mining

www.themegallery.com

Introduction

Page 4: Data Mining

www.themegallery.com

Intoduction

What is Data Mining?

Data mining is the process of extracting meaningful piece of information from Data warehouses , which can be useful for maximizing profit , fraud detection , marketing perspective and scientific research.

Page 5: Data Mining

Data Warehouses: According to Stanford University,

"A Data Warehouse is a repository of integrated information, available for queries and analysis. Data and information are extracted from heterogeneous sources as they are generated .This makes it much easier and more efficient to run queries over data that originally came from different sources."

www.themegallery.com

Page 6: Data Mining

www.themegallery.com

Data Minining Steps

Fourth Step

Third Step

Second Step

First Step

Knowledge Deployment

Model Building

Data Gathering

Problem Definition

Page 7: Data Mining

Data Mining Procedures

Problem Definition:- Data mining project focuses on understanding the

objectives and requirements of a particular project of business. The Project must be specified from a business point of view. After that it can be formulated as a data mining problem and develop a preliminary.

Data Gathering & Preparation:- This task involves data collection and exploration. It can

be done by Removing unnecessary information , Detecting Data Duplicity and supplying some new information.

www.themegallery.com

Page 8: Data Mining

Data Mining Procedures

Model Building and Evaluation:- In this phase, various Modeling Techniques can be applied

to build the data model which is likely to be sufficient with the requirement and then An Evaluation can be done to compare the current model with the originally stated project goal.

Knowledge Deployment:- Knowledge deployment is the use of data mining within a

target environment. In the deployment phase, insight and actionable information can be derived from data.

www.themegallery.com

Page 9: Data Mining

History of Data Mining Techniques

1950•Neural Networks•Clustering

1960’s•Decision Trees

1980’s•Support Vector Machine

www.themegallery.com

1999 •Cross Industry Standard Platform Data Mining Package (Crisp DM 1.0)

2004 •Java Data Mining Package (JDM 1.0)

Page 10: Data Mining

LOGO

LOGO

Data Mining

www.themegallery.com

Neural Networks

Page 11: Data Mining

Neural Networks:-

Neural networks are non-linear statistical data modeling tools. They can be used to model complex relationships between inputs and outputs or to find patterns in data. Using neural networks as a tool, data warehousing firms are extracting information from datasets in the process known as data mining.

Neural network is a techniques derived from artificial intelligence research that uses generalized regression and provide methods to carry it out.

It is self adapted and it uses learning method.

www.themegallery.com

Page 12: Data Mining

Processing of Neural Networks

Input data is presented to the network and propagated through the network until it reaches the output layer. The predicted output is subtracted from the actual output and an error value for the networks is calculated through supervised learning.

Once back propagation has finished, the forward process starts again, and this cycle is continued until the error between predicted and actual outputs is minimized.

www.themegallery.com

Page 13: Data Mining

LOGO

LOGO

Data Mining

www.themegallery.com

Clustering

Page 14: Data Mining

Clustering

Clustering is used to segment the data. Clustering models segment records into groups that are similar to each other which is totally distinct from other groups.

Typical Applications of Clustering are Online Document Classification and to cluster web log data to discover groups of similar access patterns. Pattern Recognition, Spatial Data Analysis and Image processing are other applications in Scientific areas.

www.themegallery.com

Page 15: Data Mining

Clustering

www.themegallery.com

Page 16: Data Mining

LOGO

LOGO

Data Mining

www.themegallery.com

Decision Trees

Page 17: Data Mining

Decision Trees

The Decision Tree algorithm is based on conditional probabilities. Decision trees generate rules. A rule is a conditional statement that can easily be understood by humans and easily used within a database to identify a set of records.

The Decision Tree algorithm produces accurate and interpretable models with relatively little user intervention. The algorithm can be used for both binary and multi-class classification problems.

www.themegallery.com

Page 18: Data Mining

Decision Trees

www.themegallery.com

Node 1 sows about married persons and 0 describes single persons. Node 1 has 712 records (cases). Of these, 382 have a target of 0 (not likely to increase spending), and 330 have a target of 1 (likely to increase spending).

Page 19: Data Mining

LOGO

LOGO

Data Mining

www.themegallery.com

Support Vector Machines

Page 20: Data Mining

Support Vector Machine

An optimal Defined Surface.Linear and non linear Input Space.Linear or High Dimension Feature Space which

is specially defined Kernel function.SVM involves the fitting of a hyper plane such

that the largest margin is formed between 2 classes of vectors while minimizing the effects of classification errors so that we can classified in to groups.

www.themegallery.com

Page 21: Data Mining

Support Vector Machine

www.themegallery.com

Page 22: Data Mining

Support Vector Used For

Classification

Regression

Unsupervised Learning and supervised learning.

www.themegallery.com

Page 23: Data Mining

LOGO

LOGO

Data Mining

www.themegallery.com

JAVA Data Mining

Page 24: Data Mining

JAVA Data Mining

www.themegallery.com

Page 25: Data Mining

Facilities by JDM 1.0 Package

www.themegallery.com

Page 26: Data Mining

Parallel Processing

www.themegallery.com

Page 27: Data Mining

Distributed Computing

www.themegallery.com

Page 28: Data Mining

LOGO

LOGO

Data Mining

www.themegallery.com

Cross Industry Standard Platform

Data Mining

Page 29: Data Mining

Crisp DM 1.0

Business Understanding

Data Understanding

Data Preparation

Data Modeling

Evaluation

Deployment

www.themegallery.com

Data

Page 30: Data Mining

LOGO

LOGO

Data Mining

Data Mining Applications

www.themegallery.com

Page 31: Data Mining

www.themegallery.com

Data Mining Applications

Data Mining

BusinessOnline Searching

Science

Marketing

Spatial Data Mining

Security

Page 32: Data Mining

Data Mining Applications

BUSSINESS PRECPECTIVE:-

Data mining helps business to extract information from resources such as print media, television, internet, investment. Data mining tools predicts future trend and behavior allowing business to make proactive knowledge driven decision for increasing revenue, profit of the company.

SCIENCTIFIC PRECPECTIVE:- Practical perspective describe how techniques from

data mining can be used to address and resolve the modern problem in science and engineering domains.

www.themegallery.com

Page 33: Data Mining

Data Mining Applications

SECURITY PRECPECTIVE:-

To prevent or detect for fraud such as showing wrong geographical domain and to identify stolen credit card by transaction history. Data Mining can help to make online transactions more secure and reliable by analyzing previous transaction records.

SPATIAL DATA MINING:-

Geo-marketing companies doing customer segmentation based on spatial location through data mining by mining the purchase and subscription history .

www.themegallery.com

Page 34: Data Mining

WEBSITE PROMOTION:-

Web owner can attract most number of visitors by mining their data and then modifying their layout on the basis of extracted information.

www.themegallery.com

Page 35: Data Mining

LOGO

LOGO www.themegallery.com

Add your company slogan