Machine Learning - Piazza

MachineLearning

Slides: James Hays, Isabelle Guyon, Erik Sudderth, Mark Johnson, Derek Hoiem Photo: CMU Machine Learning

Department protests G20

Clustering:grouptogethersimilarpointsandrepresentthemwithasingletoken

KeyChallenges:1)Whatmakestwopoints/images/patchessimilar?2)HowdowecomputeanoverallgroupingfrompairwisesimilariCes?

Slide: Derek Hoiem

Howdowecluster?

•  K-means–  IteraCvelyre-assignpointstothenearestclustercenter

•  AgglomeraCveclustering–  StartwitheachpointasitsownclusteranditeraCvelymergetheclosestclusters

•  Mean-shiHclustering–  EsCmatemodesofpdf

•  Spectralclustering–  Splitthenodesinagraphbasedonassignedlinkswithsimilarityweights

ClusteringforSummarizaConGoal:clustertominimizevarianceindatagivenclusters– PreserveinformaCon

( )∑∑ −=N

ijiN ij

** argmin, xcδcδc

Whether xj is assigned to ci

Cluster center Data

Slide: Derek Hoiem

K-meansalgorithm

Illustration: http://en.wikipedia.org/wiki/K-means_clustering

1. Randomly select K centers

2. Assign each point to nearest center

3. Compute new center (mean) for each cluster

K-meansalgorithm

Illustration: http://en.wikipedia.org/wiki/K-means_clustering

1. Randomly select K centers

2. Assign each point to nearest center

3. Compute new center (mean) for each cluster

Back to 2

BuildingVisualDicConaries1.  Samplepatchesfrom

adatabase–  E.g.,128dimensional

SIFTvectors

2.  Clusterthepatches–  Clustercentersare

thedicConary

3.  Assignacodeword(number)toeachnewpatch,accordingtothenearestcluster

Examplesoflearnedcodewords

Sivic et al. ICCV 2005 http://www.robots.ox.ac.uk/~vgg/publications/papers/sivic05b.pdf

Most likely codewords for 4 learned “topics” EM with multinomial (problem 3) to get topics

AgglomeraCveclustering

AgglomeraCveclusteringHowtodefineclustersimilarity?-  Averagedistancebetweenpoints,

maximumdistance,minimumdistance-  Distancebetweenmeansormedoids

Howmanyclusters?-  Clusteringcreatesadendrogram(atree)-  Thresholdbasedonmaxnumberofclusters

orbasedondistancebetweenmergesdi

AgglomeraCveclusteringdemohZp://home.dei.polimi.it/maZeucc/Clustering/tutorial_html/AppletH.html

Conclusions:AgglomeraCveClusteringGood•  Simpletoimplement,widespreadapplicaCon•  ClustershaveadapCveshapes•  ProvidesahierarchyofclustersBad•  Mayhaveimbalancedclusters•  SCllhavetochoosenumberofclustersorthreshold

•  Needtousean“ultrametric”togetameaningfulhierarchy

•  VersaCletechniqueforclustering-basedsegmentaCon

D. Comaniciu and P. Meer, Mean Shift: A Robust Approach toward Feature Space Analysis, PAMI 2002.

MeanshiHsegmentaCon

MeanshiHalgorithm•  Trytofindmodesofthisnon-parametric

density

KerneldensityesCmaCon

Kernel density estimation function

Gaussian kernel

Region of interest

Center of mass

Mean Shift vector

Slide by Y. Ukrainitz & B. Sarel

MeanshiH

Region of interest

Center of mass

Mean Shift vector

MeanshiH

Region of interest

Center of mass

Mean Shift vector

MeanshiH

Region of interest

Center of mass

Mean Shift vector

MeanshiH

Region of interest

Center of mass

Mean Shift vector

MeanshiH

Region of interest

Center of mass

Mean Shift vector

MeanshiH

Region of interest

Center of mass

MeanshiH

Simple Mean Shift procedure: •  Compute mean shift vector

• Translate the Kernel window by m(x)

⎡ ⎤⎛ ⎞⎢ ⎥⎜ ⎟

⎜ ⎟⎢ ⎥⎝ ⎠= −⎢ ⎥⎛ ⎞⎢ ⎥⎜ ⎟⎢ ⎥⎜ ⎟⎝ ⎠⎣ ⎦

x - xx

m x xx - x

g( ) ( )kʹ= −x x

CompuCngtheMeanShiH

•  AZracConbasin:theregionforwhichalltrajectoriesleadtothesamemode

•  Cluster:alldatapointsintheaZracConbasinofamode

AZracConbasin

MeanshiHclustering•  ThemeanshiHalgorithmseeksmodesofthe

givensetofpoints1.  Choosekernelandbandwidth2.  Foreachpoint:

a)  Centerawindowonthatpointb)  Computethemeanofthedatainthesearchwindowc)  CenterthesearchwindowatthenewmeanlocaCond)  Repeat(b,c)unClconvergence

3.  Assignpointsthatleadtonearbymodestothesamecluster

•  Computefeaturesforeachpixel(color,gradients,texture,etc)•  SetkernelsizeforfeaturesKfandposiConKs•  IniCalizewindowsatindividualpixellocaCons•  PerformmeanshiHforeachwindowunClconvergence•  MergewindowsthatarewithinwidthofKfandKs

SegmentaConbyMeanShiH

http://www.caip.rutgers.edu/~comanici/MSPAMI/msPamiResults.html

MeanshiHsegmentaConresults

http://www.caip.rutgers.edu/~comanici/MSPAMI/msPamiResults.html

Mean-shiH:otherissues•  Speedups

–  BinnedesCmaCon–  Fastsearchofneighbors– UpdateeachwindowineachiteraCon(fasterconvergence)

•  Othertricks– UsekNNtodeterminewindowsizesadapCvely

•  LotsoftheoreCcalsupportD.ComaniciuandP.Meer,MeanShiH:ARobustApproachtowardFeatureSpaceAnalysis,PAMI2002.

MeanshiHprosandcons

•  Pros–  Goodgeneral-pracCcesegmentaCon–  Flexibleinnumberandshapeofregions–  Robusttooutliers

•  Cons–  Havetochoosekernelsizeinadvance–  Notsuitableforhigh-dimensionalfeatures

•  Whentouseit–  Oversegmentatoin– MulCplesegmentaCons–  Tracking,clustering,filteringapplicaCons

Spectralclustering Grouppointsbasedonlinksinagraph

Cutsinagraph

Normalized Cut •  a cut penalizes large segments •  fix by normalizing for size of segments

•  volume(A) = sum of costs of all edges that touch A

Source: Seitz

NormalizedcutsforsegmentaCon

VisualPageRank•  Determiningimportancebyrandomwalk

– What’stheprobabilitythatyouwillrandomlywalktoagivennode?

•  Createadjacencymatrixbasedonvisualsimilarity•  EdgeweightsdetermineprobabilityoftransiCon

Jing Baluja 2008

Whichalgorithmtouse?•  QuanCzaCon/SummarizaCon:K-means

– Aimstopreservevarianceoforiginaldata– Caneasilyassignnewpointtoacluster

Quantization for computing histograms

Summary of 20,000 photos of Rome using “greedy k-means”

http://grail.cs.washington.edu/projects/canonview/

Whichalgorithmtouse?•  ImagesegmentaCon:agglomeraCveclustering

– Moreflexiblewithdistancemeasures(e.g.,canbebasedonboundarypredicCon)

– AdaptsbeZertospecificdata– Hierarchycanbeuseful

http://www.cs.berkeley.edu/~arbelaez/UCM.html

Thingstoremember•  K-meansusefulforsummarizaCon,buildingdicConariesofpatches,generalclustering

•  AgglomeraCveclusteringusefulforsegmentaCon,generalclustering

•  Spectralclusteringusefulfordeterminingrelevance,summarizaCon,segmentaCon

ClusteringKeyalgorithm•  K-means

Machine Learning - Piazza

Documents

Transcript of Machine Learning - Piazza

Computer Security: A Machine Learning Approachmedia.techtarget.com/searchSecurity/downloads/REVISED_RH9_Sabn… · Royal Holloway series Machine learning approaches • MACHINE LEARNING

Machine learning for neuroimaging with scikit-learn · Machine learning for neuroimaging with scikit-learn. ... a Python machine learning library, ... Abraham et al. Machine learning

Hawaii Machine Learning Meetup · Introduction –Why Machine Learning? Machine Learning Quotes ^A breakthrough in machine learning would be worth ten Microsofts. ―Bill Gates For

Machine Learning - 0. Overview€¦ · Machine Learning 1. What is Machine Learning? One area of research, many names (and aspects) machine learning historically, stresses learning

Introduction To Azure Machine Learning...Expected Learning Outcomes Azure Machine Learning @tetranoodle Supervised Machine Learning Machine Learning Algorithms in Azure ML Workflow

machine Learning Learning

Machine Learning Lecture 5 Bayesian Learning G53MLE | Machine Learning | Dr Guoping Qiu1.

Machine Learning for Data Science (CS4786) Lecture 1 · 2017-08-22 · Machine Learning for Data Science (CS4786) Lecture 1 Tu-Th 11:40AM to 12:55 PM ... Join Piazza: https: ... While

Introducing Machine Learning · 4 ntroducing Machine Learning How Machine Learning Works Machine learning uses two types of techniques: supervised learning, which trains a model on

Machine Learning using MapReduce - Boise State Universitycs.boisestate.edu/.../mapreduce-machine-learning-handout.pdf · 2016-10-27 · What is Machine Learning I Machine learning

An Introduction to Machine Learning - GitHub Pages · Introduction Machine learning The machine learning process What’s machine learning History Supervised learning Non-supervised

Methods MACHINE LEARNING FROM MACHINE LEARNING TO …

ARTIFICIAL INTELLIGENCE & MACHINE LEARNING · Intelligence & Machine Learning technologies and applications including Machine Learning, Deep Learning, Computer Vision, Natural Language

Machine Learning: Machine Learning:

Machine Learning on Spark - UC Berkeley AMP Campampcamp.berkeley.edu/.../Machine-Learning-on-Spark... · Machine Learning on Spark Shivaram Venkataraman ... Machine learning algorithms

Machine Learning FabSpace March 2018 - irit.fr Learning FabSpace March 2018... · Machine Learning 5 Machine Learning • Supervised : Train / test • Predictiveanalysis Machine

Machine Learning for NLP - Ethics and Machine Learning

MACHINE REASONING: A PERSPECTIVE AND …...“From machine learning to machine reasoning”. Machine Learning Volume 94 Issue 2, 2004, Pages 133-149 Machine Learning Volume …

Machine Learning with MATLAB · 2 Agenda Machine Learning Overview Machine Learning with MATLAB –Unsupervised Learning Clustering –Supervised Learning Classification Regression

Section 1Section 1 Machine Learning basic concepts · Machine Learning TutorialMachine Learning Tutorial ... Section 1Section 1 Machine Learning basic concepts ... Learning Tutorial