Introduction to Supervised Machine Learning Concepts

I n t r o d u c t i o n t o S u p e r v i s e dM a c h i n e L e a r n i n g C o n c e p t s

P R E S E N T E D B Y B . B a r l a C a m b a z o g l u F e b r u a r y 2 1 , 2 0 1 4⎪

Guest Lecturer’s Background

Lecture Outline

Basic concepts in supervised machine learning Use case: Sentiment-focused web crawling

Basic Concepts

What is Machine Learning?

Wikipedia: “Machine learning is a branch of artificial intelligence, concerning the construction and study of systems that can learn from data.”

Arthur Samuel: “Field of study that gives computers the ability to learn without being explicitly programmed.”

Tom M. Mitchell: “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.”

Unsupervised versus Supervised Machine Learning

Unsupervised learning› Assumes unlabeled data (the desired output is not known)› Objective is to discover the structure in the data

Supervised learning› Trained on labeled data (the desired output is known)› Objective is to generate an output for previously unseen input data

Supervised Machine Learning Applications

Common› Spam filtering› Recommendation and ranking› Fraud detection› Stock price prediction

Not so common› Recognize the user of a mobile device based on how he holds and moves the phone› Predict whether someone is a psychopath based on his twitter usage› Identify whales in the ocean based on audio recordings› Predict in advance whether a product launch will be successful or not

Terminology

Instance Label Feature Training set Test set Learning model Accuracy

Toy problem: To predict the income level of a person based on his/her facial attributes.

Instances

Categorical Labels

Numeric Labels

$12K$11K$9K$8K$7K$5K$1K $2K $3K $4K

Features

Blonde No White No Male 5cm

Bald No White Yes Male 0cm

White No Black Yes Male 3cm

Dark Yes White No Female 12cm

Training Set

Blonde No White No Male 5cm

Bald No White Yes Male 0cm

White No Black Yes Male 3cm

Test Set

Dark No White No Female 14cm

Dark No White Yes Male 6cm

Dark No Black No Male 6cm

Training and Testing

Training

Testing

Prediction

Test instanceSet of training instances

Accuracy

Actual labels

Predicted labels

Accuracy = # of correct predictions / total number of predictions = 2 / 4 = 50%

Precision and Recall

In certain cases, there are two class labels and predicting a particular class correctly is more important than predicting the other.

A good example is top-k ranking in web search.

Performance measures:› Recall› Precision

Some Practical Issues

Problem: Missing feature values

Solution:› Training: Use the most frequently observed (or

average) feature value in the instance’s class.› Testing: Use the most frequently observed (or

average) feature value in the entire training set.

Problem: Class imbalance

Solution› Oversampling: Duplicate the training

instances in the small class› Undersampling: User fewer instances

from the bigger class

Majority Classifier

Training: Find the class with the largest number of instances.

Testing: For every test instance, predict that class as the label, independent of the features of the test instance.

PredictionTesting

Size 13 8 4

k-Nearest Neighbor Classifier

Training: None! (known as a lazy classifier).

Testing: Find the k instances that are most similar to the test instance and use majority voting to decide on the label.

Decision Tree Classifier

Training: Build a tree where leaves represent labels and branches represent features that lead to those labels.

Testing: Traverse the tree using the feature values of the test instance.

Black White

Black Not blackYesNo

Naïve Bayes Classifier

Training: For every feature value v and class c pair, we compute and store in a lookup table the conditional probability P(v | c).

Testing: For each class c, we compute:

P( | ) = 0.40

P( | ) = 0.65

P( | ) = 0.78

Other Commonly Used Classifiers

Support vector machines Boosted decision trees Neural networks

Use Case:Sent iment-Focused Web Crawl ing

G. Vural, B. B. Cambazoglu, and P. Senkul, “Sentiment-focused web crawling”, CIKM’12, pp. 2020-2024.

Problem

Early discovery of the opinionated content in the Web is important.

Use cases› Measuring brand loyalty or product adoption› Politics› Finance

We would like to design a sentiment-focused web crawler that aims to maximize the amount of sentimental/opinionated content fetched from the Web within a given amount of time.

Web Crawling

Subspaces› Downloaded pages› Discovered pages› Undiscovered pages

Sentiment-Focused Web Crawling

Challenge: to predict the sentimentality of an “unseen” web page, i.e., without having access to the page content.

Features

Assumption: Sentimental pages are more likely to be linked by other sentimental pages.

Idea: Build a learning model using features extracted from› Textual content of referring pages› Anchor text on the hyperlinks› URL of the target page

Labels

Our data (ClueWeb09-B) lacks ground-truth sentiment scores. We created a ground-truth using the SentiStrength tool.

› Assigns a sentiment score (between 0 and 8) to each web page as its label. A small scale user-study is conducted with three judges to

verify the suitability of this ground-truth.› 500 random pages sampled from the collection.› pages are labeled as sentimental or not sentimental.

Observations› 22% of the pages are labeled as sentimental.› High agreement between judges: the overlap is above 85%.

Learner and Performance Metric

As the learner, we use the LibSVM software in the regression mode.

We rebuild the prediction model at regular intervals throughout the crawling process.

As the main performance metric, we compute the total sentimentality score accumulated after fetching a certain number of pages.

Evaluated Crawlers

Proposed crawlers› based on the average

sentiment score of referring page content

› based on machine learning

Oracle crawlers› highest sentiment score› highest spam score› highest PageRank

Baseline crawlers › random› indegree-based› breadth first

Performance

Introduction to Supervised Machine Learning Concepts

Documents

Transcript of Introduction to Supervised Machine Learning Concepts

Supervised Machine Learning: Learning SVMs and Deep Learninghelper.ipam.ucla.edu › publications › mpstut › mpstut_13971.pdf · 2016-09-13 · Supervised Machine Learning: Learning

SUPERVISED MACHINE/DEEP LEARNING TECHNIQUES A CASE …

Pulsar Search Using Supervised Machine Learning

Weakly Supervised Machine Reading

Supervised Machine Learning for Eliciting Individual Reservation … · 2020-07-10 · Supervised Machine Learning for Eliciting Individual Reservation Values JohnA.Clithero1,JaeJoonLee

Dynamic machine learning for supervised and unsupervised ...

Supervised Machine Learning Techniques to the Prediction ...

Introduction to Supervised ML Concepts and Algorithms

Introduction to Supervised Machine Learning Concepts PRESENTED BY B. Barla Cambazoglu February 21, 2014.

SUPERVISED AND UNSUPERVISED MACHINE LEARNING TECHNIQUES FOR TEXT ...ozgur/papers/ms-thesis-arzucan.pdf · SUPERVISED AND UNSUPERVISED MACHINE LEARNING TECHNIQUES FOR TEXT DOCUMENT

Types of Machine Learning Algorithms - cdn.intechweb.orgcdn.intechweb.org/pdfs/10694.pdf · Types of Machine Learning Algorithms 21 1.1 Supervised Learning Approach Supervised learning1

Machine Learning Basics: Supervised Learning Algorithmssrihari/CSE676/5.7 MLBasics-Supervised.pdfDeep Learning Srihari 1 Machine Learning Basics: Supervised Learning Algorithms Sargur

Supervised Learning in Physical Networks: From Machine ...

supervised learning decision treesmathcs.pugetsound.edu/~alchambers/classes/cs151... · What is supervised learning?! Decision trees Machine Learning ! The goal of machine learning

Supervised Intelligent Committee Machine Method for Hydraulic … · 2020. 6. 14. · Supervised Intelligent Committee Machine Method for Hydraulic Conductivity Estimation Gokmen

Semantic Computing - Lecture 5: Supervised Machine Learning · 2019. 3. 16. · Lecture 5: Supervised Machine Learning Dagmar Gromann International Center For Computational Logic

Data Mining and Machine Learning: concepts, techniques ...site.uottawa.ca/~stan/csi5387/set1.pdf · 11. Data mining concepts and techniques 12. Semi-supervised learning: co-training

Machine Learning: Introduction to Supervised Learning€¦ · Machine Learning: Introduction to Supervised Learning. Announcements Please fill out course evaluations! Can pick up

Supervised Machine Learning - unimi.itmarray.economia.unimi.it › 2005 › material › L7.pdf · 2009-10-28 · Supervised Machine Learning Lecture 7 Computational and Statistical

TIME SERIES & MACHINE LEARNING - Horizon 2020 Ireland · Supervised vs Unsupervised learning Machine Learning Techniques Supervised Learning Unsupervised Learning Random Forest, Support