Machine Learning and Real-World Applications

Ajay Ramaseshan

MachinePulse.

Machine Learning & Real-world Applications

Why Machine Learning?

Supervised Vs Unsupervised learning.

Machine learning methods.

Real world applications.

Agenda

Volume of data collected growing day by day.

Data production will be 44 times greater in 2020 than in 2009.

Every day, 2.5 quintillion bytes of data are created, with 90

percent of the world's data created in the past two years.

Very little data will ever be looked at by a human.

Data is cheap and abundant (data warehouses, data marts);

knowledge is expensive and scarce.

Knowledge Discovery is NEEDED to make sense and use of

Machine Learning is a technique in which computers learn

from data to obtain insight and help in knowledge discovery.

Supervised learning – class labels/ target variable known

Overview of Machine Learning Methods

Unsupervised learning – no class labels provided, need to

detect clusters of similar items in the data.

Overview (Contd)

Parametric Models:

Assumes prob. distribution for data, and

learn parameters from data

E.g. Naïve Bayes classifier, linear regression etc.

Non-parametric Models:

No fixed number of parameters.

E.g. K-NN, histograms etc.

Parametric Vs Nonparametric

Generative model – learns model for generating data, given

some hidden parameters.

Learns the joint probability distribution p(x,y).

e.g. HMM, GMM, Naïve Bayes etc.

Discriminative model – learns dependence of unobserved

variable y on observed variable x.

Tries to model the separation between classes.

Learns the conditional probability distribution p(y|x).

e.g. Logistic Regression, SVM, Neural networks etc.

Generative Vs Discriminative

Classification – Supervised learning.

Commonly used Methods for Classification –

Naïve Bayes

Decision tree

K nearest neighbors

Neural Networks

Support Vector Machines.

Classification

Regression – Predicting an output variable given input

variables.

Algorithms used –

Ordinary least squares

Partial least squares

Logistic Regression

Stepwise Regression

Support Vector Regression

Neural Networks

Regression

Clustering:

Group data into clusters using similarity measures.

Algorithms:

K-means clustering

Density based EM algorithm

Hierarchical clustering.

Spectral Clustering

Clustering

Naïve Bayes classifier

Assumes conditional independence among features.

P(xi,xj,xk|C) = P(xi|C)P(xj|C)P(xk|C)

Naïve Bayes

K-nearest neighbors:

Classifies the data point with the class of the k

nearest neighbors.

Value of k decided using cross validation

K-Nearest Neighbors

Leaves indicate classes.

Non-terminal nodes – decisions on attribute values

Algorithms used for decision tree learning

Decision trees

Artificial Neural Networks

Modeled after the

human brain

Consists of an input layer,

many hidden layers, and an

output layer.

Multi-Layer Perceptrons,

Radial Basis Functions,

Kohonen Networks etc.

Neural Networks

Validation methods

Cross validation techniques

K-fold cross validation

Leave one out Cross Validation

ROC curve (for binary classifier)

Confusion Matrix

Evaluation of Machine Learning Methods

Some real life applications of machine learning:

Recommender systems – suggesting similar people on

Facebook/LinkedIn, similar movies/ books etc. on Amazon,

Business applications – Customer segmentation, Customer

retention, Targeted Marketing etc.

Medical applications – Disease diagnosis,

Banking – Credit card issue, fraud detection etc.

Language translation, text to speech or vice versa.

Real life applications

Wisconsin Breast Cancer dataset:

Instances : 569

Features : 32

Class variable : malignant, or benign.

Steps to classify using k-NN

Load data into R

Data normalization

Split into training and test datasets

Train model

Evaluate model performance

Breast Cancer Dataset and k-NN

Thank you!

Machine Learning and Real-World Applications

Technology

Transcript of Machine Learning and Real-World Applications

Machine learning– driven analytics: Key to digital ... · machine learning in a host of applications from algorithmic trading and customer service chatbots through to real-time

TensorFlow: A system for large-scale machine learning...TensorFlow achieves for several real-world applications. 1 Introduction In recent years, machine learning has driven advances

Bayesian Nonparametrics in Real-World Applications: Statistical …€¦ · Bayesian Nonparametrics in Real-World Applications: Statistical Machine Translation and Language Modelling

Real World Applications

Real-Time Processing of 3D-TOF Data in Machine Vision Applications

Artificial Intelligence and Machine Learning: Current ...€¦ · Artificial Intelligence and Machine Learning: Current Applications in Real Estate by Jennifer Conway B.A. Architecture,

Machine Learning Real Life Applications By Examples - Mario Cartia

Machine learning applications in biotechnology researchfiehnlab.ucdavis.edu/downloads/staff/kind/machine-learning-2012... · Machine learning applications in biotechnology research

APPLICATIONS 1: MACHINE TRANSLATION II: … · 1 APPLICATIONS 1: MACHINE TRANSLATION II: STATISTICAL MACHINE TRANSLATION Theme The second of two lectures on Machine Translation. Developments

Machine Intelligence Applications: Our Philosophy · Machine Intelligence Applications: Our Philosophy 2 WHITE PAPER Machine Intelligent Applications: Our Philosophy We live in a

(BDT302) Real-World Smart Applications With Amazon Machine Learning

Real-World Predictive Applications with Amazon Machine ...aws-de-media.s3.amazonaws.com/images/AWS Summit Berlin 2015... · Services we will use: Amazon Machine Learning, Amazon Kinesis,

Real-World Smart Applications with Amazon Machine Learning - AWS Machine Learning Web Day

Applications of Machine to Machine Communication in Remote ...

Applications of Machine Learning

Parallelizing Big Data Machine Learning Applications with ...dsc.soic.indiana.edu/publications/Parallelizing Big Data Machine... · Parallelizing Big Data Machine Learning Applications

DRONES, REAL-TIME MAPPING, CLOUD & MACHINE-LEARNING · DRONES, REAL-TIME MAPPING, CLOUD & MACHINE-LEARNING julian.norton@pix4d.com

REAL TIME CONTROL OF DOUBLY FED INDUCTION GENERATORPerformance induction machine controllers since real time simulations are required by hardware in the loop applications.

Machine Learning Real Life Applications By Examples

Real-time data from remote machines - Vodafone · Real-time data from remote machines Global Machine-to-Machine (M2M) communications m2m.vodafone.com . Global machine to machine (M2M)