SUPPORT VECTOR MACHINES PRESENTED BY MUTHAPPA. Introduction Support Vector Machines(SVMs) are...

SUPPORT VECTOR MACHINES

PRESENTED BY MUTHAPPA

IntroductionSupport Vector Machines(SVMs) are supervised

learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis.

A SVM model is a representation of the examples as points in space , mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible. New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall on.

Why SVM?Easy to use

Often has good generalization performance

Same algorithm solves a variety of problems with little tuning.

Applications of SVMHand written characters can be recognized

using SVM.

Used in medical science to classify proteins with up to 90% of the compounds classified correctly.

Classification of images.

Data Classification using SVM

This paper walks us through the overview of SVM, kernel and model selections of SVM and rough sets.

SVM is used on different data (Diabetes data, Heart data, Satellite data and Shuttle data) which have two or multi class.

Comparative results using different kernel functions are shown for all data samples.

Maximum margin hyper plane and margins for an SVM trained with samples from two classes.

Kernel Selection of SVMThere are many kernel functions in SVM and

selecting a good kernel function is a research issue.

Popular kernel functions are:

Linear kernel : K(xi, xj) = xiTxj

Polynomial kernel : K(xi, xj) = (yxiTxj + r)d , y>0

RBF kernel : K(xi, xj) = exp(-y||xi – xj||2), y>0

Sigmoid kernel : K(xi, xj) = tanh(yxiTxj + r)

Why RBF kernel?RBF is the main kernel function because of the

following reasons

The RBF kernel nonlinearly maps samples into a higher dimensional space unlike to linear kernel.

The RBF kernel has less hyper parameters than the polynomial kernel.

The RBF kernel has less numerical difficulties.

Model selection of SVMModel selection is also an important issue in

SVM. Its success depends on the tuning of several parameters which affect the generalization error.

If we use the linear SVM, we only need to tune the cost parameter C.

As many problems are non-linearly separable, we need to select the cost parameter C and kernel parameters y, d.

Rough SetIt is a mathematical tool to deal with un-

integrality and uncertain knowledge.

It can effectively analyze and deal with all kinds of fuzzy, conflicting and incomplete information, and finds out the connotative knowledge from it, and reveals its underlying rules.

It finds the lower and upper approximation of the original set (given a pair of sets)

Its applications are in the field of data mining and artificial intelligence.

Results

Imbalance data classification algorithm

This paper combines the merits of FCM cluster algorithm and SVM algorithm to create a new algorithm – FCM-SVM algorithm.

Effectiveness of FCM-SVM algorithm was verified by repeated experiences on dataset from UCI database, the result shows that the algorithm improved the classification performance for imbalance problem compared to existing SVM algorithms.

Imbalance datasetIf the amount of positive class samples differs

greatly from the negative class in a dataset, then the feature of majority class will be much more and significant, but the feature of minority class will be very blur.

Classifiers based on this kind of highly imbalance dataset will easily misclassify a new unknown minority sample to the majority class.

To avoid this, the imbalanced dataset should be transferred into balanced dataset.

Addressing imbalance dataset

Addressing imbalance dataset classification problem can be divided into two main directions:

Sampling approaches – include methods that over-sample the minority class to match the size of the majority class and methods that under-sample the majority class to match the size of the minority class.

Algorithmic-based – designed to improve a classifier’s performance based on their inherent characteristics.

SVM algorithms for imbalance datasets

Non-clustering normal SVM – Adopted balance dataset processing method from SVM algorithm to process imbalance dataset. It directly adopted SVM train function to establish model on training dataset of the above processed dataset. Only one classifier was developed.

Smote-Oversampling classification – Transferred imbalance dataset into balance dataset at first, and then use traditional SVM. The algorithm multiplies the minority class samples until there is not much difference between the minority and majority class samples.

SVM algorithms for imbalance datasets

Under sampling classification – It is similar to oversampling classification. It transfers imbalance dataset into balance dataset first and then use traditional SVM method. It randomly selects majority class samples to match minority class samples.

Random classification – It uses cross-validation function to obtain training and testing datasets first and then adopt SVM train function on training dataset to build a classifier model.

FCM-SVM processTraining and Testing dataset

Extract the number of majority and minority class from training dataset.

Calculate the amount radio N=majority/minority, distribute majority class into N classes by FCM.

Adopt SVM train function to develop N – classifiers.

Predicting on testing dataset by the function of SVM predict and N – classifiers.

FCM-SVM process(contd.)Obtain final predicting results by one-to-veto

Evaluate algorithm performance.

Algorithm Evaluation measures

Predicted Positive Predicted Negative

Actual PositiveActual Negative

TP (True Positive)FP (False Positive)

FN (False Negative)TN (True Negative)

Precision = TP/(TP+FP)

Recall = TP/(TP+FN)

F-measure = 2*Precision*Recall/(Precision + Recall)

Results of shuttle dataset

Referenceshttp://www.cs.columbia.edu/~kathy/cs4701/docu

ments/jason_svm_tutorial.pdf

http://en.wikipedia.org/wiki/Support_vector_machine

Imbalance Data Classification based on SVM and Clustering Function, Kai-Biao Lin, Wei Weng, Robert K. Lai, Ping Lu, 2014.

Data Classification Using Support Vector Machines, Durgesh K. Srivastava , Leekha Bhambu,2005-2009.

Thank You

SUPPORT VECTOR MACHINES PRESENTED BY MUTHAPPA. Introduction Support Vector Machines(SVMs) are...

Documents

Transcript of SUPPORT VECTOR MACHINES PRESENTED BY MUTHAPPA. Introduction Support Vector Machines(SVMs) are...

An Idiot's guide to Support vector machines (SVMs)

Bias-Variance Analysis of Support Vector Machines for the ......BIAS-VARIANCE ANALYSIS OF SVMS As brieﬂy outlined, these decompositions suffer of signiﬁcant shortcomings: in particular

1 CSC 4510, Spring 2012. © Paula Matuszek 2012. CSC 4510 Support Vector Machines (SVMs)

Support Vector Machines - UCSBtyang/class/290N14/... · 6 Support Vector Machine (SVM) Support vectors Maximize margin SVMs maximize the margin around the separating hyperplane. A.k.a.

Geoff Gordon - Carnegie Mellon School of Computer …ggordon/SVMs/new-svms-and...Geoff Gordon ggordon@cs.cmu.edu June 15, 2004 Support vector machines The SVM is a machine learning

The E ects of Outliers on Support Vector Machines · outliers have on weighted-SVMs and standard SVMs on arti cially generated data, using linear, radial basis function, and polynomial

Idiot.s guide to Support vector machines · 2009-08-24 · 1 An Idiot’s guide to Support vector machines (SVMs) R. Berwick, Village Idiot SVMs: A New Generation of Learning Algorithms

Part V Support Vector Machines · CS229 Lecture notes Andrew Ng Part V Support Vector Machines This set of notes presents the Support Vector Machine (SVM) learning al-gorithm. SVMs

CS 8751 ML & KDDSupport Vector Machines1 Support Vector Machines (SVMs) Learning mechanism based on linear programming Chooses a separating plane based.

Comparing stochastic approaches to spoken language ... · vector machines (SVMs) as well as techniques recently applied to natural language processing such as conditional random ﬁelds

Support Vector Machines (SVMs) Chapter 5 (Duda et al.)

Exploring a Hybrid of Support Vector Machines (SVMs) and a Heuristic Based System in Classifying Web Pages Santa Clara, California, USA Ahmad Rahman, Yuliya.

A Novel Approach for Intrusion Detection System Using ... · Support Vector Machines (SVMs) and Multivariate Adaptive Regression Splices (MARS ... A. Intrusion Detection Framework

An Idiot’s guide to Support vector machines (SVMs) · machines (SVMs) R. Berwick, Village Idiot SVMs: A New Generation of Learning Algorithms •Pre 1980: –Almost all learning

10/18/2015 1 Support Vector MachinesM.W. Mak Support Vector Machines 1. Introduction to SVMs 2. Linear SVMs 3. Non-linear SVMs References: 1. S.Y. Kung,

Support Vector Machines (SVMs) Part 3: Kernels

Pattern Recognition Techniques to Infer Driver Intentions · machine learning frameworks, Support Vector Machines (SVMs) and Hidden Markov models (HMMs). Each framework has been employed

Support Vector Machines (SVMs) Chapter 5 (Duda et al.) CS479/679 Pattern Recognition Dr. George Bebis.

Lecture 10: Non-linear support vector machines. …Lecture 10: Non-linear support vector machines. Kernels. Gaussian Processes SVMs for non-linearly separable data The kernel trick

Building Support Vector Machines with Reduced Classiﬁer … · 2021. 3. 1. · BUILDING SVMS WITH REDUCED COMPLEXITY with those works in related kernel ﬁelds. The method outlined