Posted on 27-Jun-2018
Unsupervised Learning: K-Means, Gaussian Mixture Models
Robot Image Credit: Viktoriya Sukhanova © 123RF.com
These slides were assembled by Eric Eaton, with grateful acknowledgement of the many others
who made their course materials freely available online. Feel free to reuse or adapt these slides for
your own academic purposes, provided that you include proper attribution. Please send comments
and corrections to Eric.
Unsupervised Learning
• Supervised learning used labeled data pairs (x, y) to learn a function f : X → Y
  – But, what if we don't have labels? Most data doesn't!
• No labels = unsupervised learning
• Only some points are labeled = semi-supervised learning
  – Labels may be expensive to obtain, so we only get a few
Unsupervised Learning: Major Categories (and their uses)
• Clustering
• Density estimation
• Dimensionality reduction (feature selection)
K-Means Clustering
Some material adapted from slides by Andrew Moore, CMU.
Visit http://www.autonlab.org/tutorials/ for Andrew's repository of Data Mining tutorials.
K-Means Clustering
K-Means(k, X)
• Randomly choose k cluster center locations (centroids)
• Loop until convergence:
  – Assign each point to the cluster of the closest centroid
  – Re-estimate the cluster centroids based on the data assigned to each cluster
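The K-Means(k, X) pseudocode above fits in a few lines of NumPy. This is a minimal sketch for illustration, not code from the slides; the optional `init` argument and the keep-old-centroid fallback for empty clusters are my own additions.

```python
import numpy as np

def kmeans(X, k, init=None, n_iters=100, seed=0):
    """Minimal Lloyd's-algorithm sketch of K-Means(k, X)."""
    rng = np.random.default_rng(seed)
    # Randomly choose k cluster center locations (centroids) from the data
    centroids = X[rng.choice(len(X), size=k, replace=False)] if init is None else init
    for _ in range(n_iters):
        # Assign each point to the cluster of the closest centroid
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Re-estimate the centroids from the data assigned to each cluster
        # (an empty cluster simply keeps its old centroid in this sketch)
        new_centroids = np.array([X[labels == j].mean(axis=0) if (labels == j).any()
                                  else centroids[j] for j in range(k)])
        if np.allclose(new_centroids, centroids):   # converged
            break
        centroids = new_centroids
    return centroids, labels
```

On two well-separated blobs this recovers one centroid per blob, but as discussed later in these slides, the result depends strongly on the initial centroids.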
K-Means Animation
Example generated by Andrew Moore using Dan Pelleg's super-duper fast K-means system:
Dan Pelleg and Andrew Moore. Accelerating Exact k-means Algorithms with Geometric Reasoning. Proc. Conference on Knowledge Discovery in Databases, 1999.
Problems with K-Means
• Very sensitive to the initial points
  – Do many runs of K-Means, each with different initial centroids
  – Seed the centroids using a better method than randomly choosing them
    • e.g., farthest-first sampling
• Must manually choose k
  – Learn the optimal k for the clustering
    • Note that this requires a performance measure
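Farthest-first sampling, mentioned above, can be sketched as follows: pick one point at random, then repeatedly add the point whose distance to its nearest already-chosen centroid is largest. A hypothetical helper for illustration, not code from the slides:

```python
import numpy as np

def farthest_first(X, k, seed=0):
    """Seed k centroids by farthest-first sampling."""
    rng = np.random.default_rng(seed)
    centroids = [X[rng.integers(len(X))]]        # start from one random point
    for _ in range(k - 1):
        # distance from each point to its nearest already-chosen centroid
        d = np.min([np.linalg.norm(X - c, axis=1) for c in centroids], axis=0)
        centroids.append(X[d.argmax()])          # add the farthest point
    return np.array(centroids)
```

Because each new seed is maximally far from the existing ones, well-separated clusters each tend to receive a seed; the method is, however, sensitive to outliers.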
Choosing the key parameter k
1. The Elbow Method
   a. SSE = Σᵢ₌₁ᴷ Σ_{x∈Cᵢ} dist(x, cᵢ)²
2. x-means clustering
   a. Repeatedly attempt subdivision of existing clusters
3. Others:
   a. Information Criterion Approach
   b. An Information Theoretic Approach
   c. The Silhouette Method
   d. Cross-validation
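The elbow method can be sketched by computing SSE for a range of k: SSE always decreases as k grows, but the drop flattens sharply past the "true" number of clusters. The restart loop reflects the earlier advice to run k-means many times with different initial centroids; the data and parameter values here are purely illustrative.

```python
import numpy as np

def kmeans_sse(X, k, n_iters=30, seed=0):
    """One k-means run; returns SSE = sum_i sum_{x in C_i} dist(x, c_i)^2."""
    rng = np.random.default_rng(seed)
    C = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        labels = np.linalg.norm(X[:, None] - C[None], axis=2).argmin(axis=1)
        C = np.array([X[labels == j].mean(axis=0) if (labels == j).any() else C[j]
                      for j in range(k)])
    return float(((X - C[labels]) ** 2).sum())

# Three well-separated blobs; the elbow should appear at k = 3.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(m, 0.3, size=(40, 2)) for m in (0.0, 4.0, 8.0)])
# Take the best of 20 restarts per k, since k-means is init-sensitive
sse_by_k = {k: min(kmeans_sse(X, k, seed=s) for s in range(20))
            for k in range(1, 7)}
```

On data like this, the SSE drop from k = 2 to k = 3 should dwarf the drop from 3 to 4; that bend in the curve is the "elbow".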
Problems with K-Means
• How do you tell it which clustering you want (e.g., which of several valid clusterings with k = 2)?
  – Constrained clustering techniques (semi-supervised):
    • Same-cluster constraint (must-link)
    • Different-cluster constraint (cannot-link)
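One way to honor such constraints, in the spirit of COP-KMeans (Wagstaff et al.), is to modify the assignment step: each point takes the nearest centroid that violates no must-link or cannot-link constraint with points assigned so far. This is a simplified, order-dependent sketch, not the published algorithm; `cop_kmeans` and its fallback behavior are my own illustration.

```python
import numpy as np

def violates(n, j, labels, must_link, cannot_link):
    """Would putting point n in cluster j break a constraint with an assigned point?"""
    for a, b in must_link:
        if n in (a, b):
            other = b if n == a else a
            if labels[other] not in (-1, j):   # linked point is elsewhere
                return True
    for a, b in cannot_link:
        if n in (a, b) and labels[b if n == a else a] == j:
            return True
    return False

def cop_kmeans(X, k, must_link=(), cannot_link=(), n_iters=50, seed=0):
    rng = np.random.default_rng(seed)
    C = X[rng.choice(len(X), size=k, replace=False)]
    labels = np.full(len(X), -1)
    for _ in range(n_iters):
        labels[:] = -1
        for n in range(len(X)):
            # try centroids nearest-first; take the first legal one
            for j in np.argsort(np.linalg.norm(C - X[n], axis=1)):
                if not violates(n, j, labels, must_link, cannot_link):
                    labels[n] = j
                    break
            if labels[n] == -1:        # no legal cluster: COP-KMeans would abort;
                labels[n] = 0          # this sketch just falls back arbitrarily
        C = np.array([X[labels == j].mean(axis=0) if (labels == j).any() else C[j]
                      for j in range(k)])
    return C, labels
```

A cannot-link pair ends up in different clusters even when the two points are near neighbors, and a must-link pair stays together even across a large gap.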
Clustering with Gaussian Mixtures (Copyright © 2001, 2004, Andrew W. Moore)

The GMM assumption
• There are k components. The i'th component is called ωi.
• Component ωi has an associated mean vector µi.
• Each component generates data from a Gaussian with mean µi and covariance matrix σ²I.
Assume that each datapoint is generated according to the following recipe:
• Pick a component at random. Choose component i with probability P(ωi).
• Datapoint ~ N(µi, σ²I)
[Figure: datapoints generated around the three component means µ1, µ2, µ3]
The General GMM assumption
[Figure: three Gaussian components with means µ1, µ2, µ3 and distinct covariances]
• There are k components. The i’th component is called ωi
• Component ωi has an associated mean vector µi
• Each component generates data from a Gaussian with mean µi and covariance matrix Σi
Assume that each datapoint is generated according to the following recipe:
• Pick a component at random. Choose component i with probability P(ωi).
• Datapoint ~ N(µi , Σi )
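The generative recipe above is easy to simulate. A sketch with made-up priors, means, and covariances (all values purely illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
priors = np.array([0.5, 0.3, 0.2])                      # P(ω1), P(ω2), P(ω3)
means = np.array([[0.0, 0.0], [5.0, 0.0], [0.0, 5.0]])  # µ1, µ2, µ3
covs = np.stack([0.5 * np.eye(2),                       # Σ1, Σ2, Σ3: each component
                 0.8 * np.eye(2),                       # may have its own covariance
                 np.array([[1.0, 0.3], [0.3, 1.0]])])

def sample_gmm(n):
    # Pick a component at random: component i with probability P(ωi)
    z = rng.choice(len(priors), size=n, p=priors)
    # Datapoint ~ N(µi, Σi)
    X = np.array([rng.multivariate_normal(means[i], covs[i]) for i in z])
    return X, z

X, z = sample_gmm(1000)
```

Fitting a GMM is the inverse problem: recover the priors, means, and covariances from X alone, which is what EM does.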
Expectation-Maximization for GMMs
Iterate until convergence. On the t'th iteration let our estimates be λt = { μ1(t), μ2(t), …, μc(t) }
E-step: Compute "expected" classes of all datapoints for each class (the class-conditional term: just evaluate a Gaussian at xk):
  P(ωi | xk, λt) = p(xk | ωi, λt) P(ωi) / Σj p(xk | ωj, λt) P(ωj)
M-step: Estimate μ given our data's class membership distributions:
  μi(t+1) = Σk P(ωi | xk, λt) xk / Σk P(ωi | xk, λt)
E.M. for General GMMs
Iterate. On the t'th iteration let our estimates be
λt = { μ1(t) … μc(t), Σ1(t) … Σc(t), p1(t) … pc(t) }
where pi(t) is shorthand for the estimate of P(ωi) on the t'th iteration and R = #records.
E-step: Compute "expected" clusters of all datapoints (the class-conditional term: just evaluate a Gaussian at xk):
  P(ωi | xk, λt) = pi(t) N(xk; μi(t), Σi(t)) / Σj pj(t) N(xk; μj(t), Σj(t))
M-step: Estimate μ, Σ given our data's class membership distributions:
  μi(t+1) = Σk P(ωi | xk, λt) xk / Σk P(ωi | xk, λt)
  Σi(t+1) = Σk P(ωi | xk, λt) (xk − μi(t+1))(xk − μi(t+1))ᵀ / Σk P(ωi | xk, λt)
  pi(t+1) = (1/R) Σk P(ωi | xk, λt)
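The E and M steps fit in a short NumPy sketch. This is an illustration of the updates, not Moore's code; the farthest-first seeding and the tiny covariance regularizer are my own additions for stability.

```python
import numpy as np

def gaussian_pdf(X, mu, Sigma):
    """'Just evaluate a Gaussian at xk': N(x; mu, Sigma) for each row of X."""
    d = X.shape[1]
    diff = X - mu
    quad = np.einsum('nd,de,ne->n', diff, np.linalg.inv(Sigma), diff)
    norm = np.sqrt((2 * np.pi) ** d * np.linalg.det(Sigma))
    return np.exp(-0.5 * quad) / norm

def em_gmm(X, k, n_iters=50):
    R, d = X.shape                               # R = #records
    mu = [X[0]]                                  # farthest-first seeding
    for _ in range(k - 1):
        dist = np.min([np.linalg.norm(X - m, axis=1) for m in mu], axis=0)
        mu.append(X[dist.argmax()])
    mu = np.array(mu)
    Sigma = np.array([np.eye(d) for _ in range(k)])
    p = np.full(k, 1.0 / k)                      # p_i = estimate of P(ωi)
    for _ in range(n_iters):
        # E-step: P(ωi | xk, λt) ∝ p_i(t) N(xk; µi(t), Σi(t))
        lik = np.stack([p[i] * gaussian_pdf(X, mu[i], Sigma[i])
                        for i in range(k)], axis=1)
        resp = lik / lik.sum(axis=1, keepdims=True)
        # M-step: re-estimate µ, Σ, p from the membership distributions
        Nk = resp.sum(axis=0)
        mu = (resp.T @ X) / Nk[:, None]
        for i in range(k):
            diff = X - mu[i]
            Sigma[i] = ((resp[:, i, None] * diff).T @ diff) / Nk[i] \
                       + 1e-6 * np.eye(d)        # tiny regularizer
        p = Nk / R
    return mu, Sigma, p
```

On two well-separated blobs this recovers the component means and roughly equal priors.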
Gaussian Mixture Example: Start
(Advance apologies: in black and white this example will be incomprehensible.)
[Figures: the fitted mixture after the 1st, 2nd, 3rd, 4th, 5th, 6th, and 20th iterations]
[Figure: Some Bio Assay data]
[Figure: GMM clustering of the assay data]
[Figure: Resulting Density Estimator]