
Unsupervised Learning with Scikit Learn

by Prof. Seungchul Lee
Industrial AI Lab, POSTECH
http://isystems.unist.ac.kr/

Table of Contents

1. K-means Clustering
   1.1. (Iterative) Algorithm
   1.2. Choosing the Number of Clusters
   1.3. K-means: Limitations
2. Gaussian Mixture Model
3. Correlation Analysis
4. Principal Component Analysis (PCA)

1. K-means Clustering

Unsupervised Learning

- Data clustering is an unsupervised learning problem
- Given:
  - $m$ unlabeled examples $\{x^{(1)}, x^{(2)}, \cdots, x^{(m)}\}$
  - the number of partitions $k$
- Goal: group the examples into $k$ partitions
- The only information clustering uses is the similarity between examples
- Clustering groups examples based on their mutual similarities
- A good clustering is one that achieves:
  - high within-cluster similarity
  - low inter-cluster similarity

1.1. (Iterative) Algorithm

$\{x^{(1)}, x^{(2)}, \cdots, x^{(m)}\} \Rightarrow$ Clustering

Randomly initialize $k$ cluster centroids $\mu_1, \mu_2, \cdots, \mu_k \in \mathbb{R}^n$

Repeat {
    for $i = 1$ to $m$:
        $c_i :=$ index (from 1 to $k$) of the cluster centroid closest to $x^{(i)}$
    for $j = 1$ to $k$:
        $\mu_j :=$ average (mean) of the points assigned to cluster $j$
}
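The two alternating steps above are short enough to implement directly. Below is a minimal NumPy sketch on synthetic data; the function name and the toy blobs are illustrative and not part of the notebook's data files, and empty-cluster handling is omitted for brevity.

import numpy as np

def kmeans_sketch(X, k, n_iter=20, seed=0):
    rng = np.random.default_rng(seed)
    mu = X[rng.choice(len(X), size=k, replace=False)]   # random initial centroids
    for _ in range(n_iter):
        # assignment step: index of the closest centroid for every point
        dist = np.linalg.norm(X[:, None, :] - mu[None, :, :], axis=2)
        c = dist.argmin(axis=1)
        # update step: each centroid moves to the mean of its assigned points
        mu = np.array([X[c == j].mean(axis=0) for j in range(k)])
    return c, mu

# toy data: three well-separated Gaussian blobs
rng = np.random.default_rng(1)
X_toy = np.vstack([rng.normal(center, 0.5, size=(50, 2))
                   for center in ([0, 0], [4, 4], [0, 5])])
labels, centroids = kmeans_sketch(X_toy, k=3)
print(centroids)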

In [1]:
import numpy as np
import matplotlib.pyplot as plt
from six.moves import cPickle
%matplotlib inline

X = cPickle.load(open('./data_files/kmeans_example.pkl', 'rb'))

plt.figure(figsize=(10,6))
plt.plot(X[:,0], X[:,1], 'b.')
plt.axis('equal')
plt.show()

In [2]:
from sklearn.cluster import KMeans

kmeans = KMeans(n_clusters=3, random_state=0)
kmeans.fit(X)

Out[2]:
KMeans(algorithm='auto', copy_x=True, init='k-means++', max_iter=300,
       n_clusters=3, n_init=10, n_jobs=1, precompute_distances='auto',
       random_state=0, tol=0.0001, verbose=0)

In [3]:
print(kmeans.labels_)

[2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 0 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 0 2 2 2 2 2 2 2 2 2 2 0 2 2 2 2 2 2 2 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1]

In [4]:
plt.figure(figsize=(10,6))
plt.plot(X[kmeans.labels_ == 0, 0], X[kmeans.labels_ == 0, 1], 'g.')
plt.plot(X[kmeans.labels_ == 1, 0], X[kmeans.labels_ == 1, 1], 'k.')
plt.plot(X[kmeans.labels_ == 2, 0], X[kmeans.labels_ == 2, 1], 'r.')
plt.axis('equal')
plt.show()

1.2. Choosing the Number of Clusters

- Idea: stop adding clusters when another cluster does not give a much better model of the data
- One way to select $k$ for the K-means algorithm is to try different values of $k$, plot the K-means objective versus $k$, and look at the "elbow point" in the plot

In [5]:
cost = []
for i in range(1, 11):
    kmeans = KMeans(n_clusters=i, random_state=0).fit(X)
    cost.append(abs(kmeans.score(X)))

plt.figure(figsize=(10,6))
plt.stem(range(1, 11), cost)
plt.xlim([0.5, 10.5])
plt.show()

1.3. K-means: Limitations

Because K-means assigns every point to its nearest centroid, it tends to find convex, similarly sized clusters; the data loaded below is an example where that assumption breaks down.

In [6]:
X = cPickle.load(open('./data_files/kmeans_lim.pkl', 'rb'))

plt.figure(figsize=(10,6))
plt.plot(X[:,0], X[:,1], 'k.')
plt.title('Unlabeled', fontsize=15)
plt.axis('equal')
plt.show()

In [7]:
kmeans = KMeans(n_clusters=2, random_state=0).fit(X)

plt.figure(figsize=(10,6))
plt.plot(X[kmeans.labels_ == 0, 0], X[kmeans.labels_ == 0, 1], 'r.')
plt.plot(X[kmeans.labels_ == 1, 0], X[kmeans.labels_ == 1, 1], 'b.')
plt.plot(kmeans.cluster_centers_[0][0], kmeans.cluster_centers_[0][1], 'rs', markersize=10)
plt.plot(kmeans.cluster_centers_[1][0], kmeans.cluster_centers_[1][1], 'bs', markersize=10)
plt.axis('equal')
plt.show()

2. Gaussian Mixture Model

Let's consider a generative model for the data.

Suppose:
- there are $k$ clusters
- we have a (Gaussian) probability density for each cluster

Generate a point as follows:
1) Choose a random cluster $z \in \{1, 2, \cdots, k\}$
2) Choose a point from the (Gaussian) distribution for cluster $z$
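As a minimal sketch of this two-step generative process, with made-up mixture weights, means, and covariances for $k = 3$ clusters (illustrative values only, not the notebook's data):

import numpy as np

rng = np.random.default_rng(0)

# hypothetical mixture parameters for k = 3 clusters
weights = np.array([0.5, 0.3, 0.2])                       # P(z = j)
means   = np.array([[0., 0.], [4., 4.], [0., 5.]])        # cluster means
covs    = [np.eye(2) * s for s in (0.5, 1.0, 0.3)]        # cluster covariances

def sample_gmm(m):
    # 1) choose a random cluster z for every point
    z = rng.choice(len(weights), size=m, p=weights)
    # 2) draw each point from the Gaussian of its chosen cluster
    X_gen = np.array([rng.multivariate_normal(means[j], covs[j]) for j in z])
    return X_gen, z

X_gen, z = sample_gmm(500)
print(X_gen.shape, np.bincount(z))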

In [8]:
from sklearn import mixture
import itertools
from scipy import linalg
import matplotlib as mpl

X = cPickle.load(open('./data_files/kmeans_example.pkl', 'rb'))

gmm = mixture.GaussianMixture(n_components=3, covariance_type='full')
gmm.fit(X)

Out[8]:
GaussianMixture(covariance_type='full', init_params='kmeans', max_iter=100,
        means_init=None, n_components=3, n_init=1, precisions_init=None,
        random_state=None, reg_covar=1e-06, tol=0.001, verbose=0,
        verbose_interval=10, warm_start=False, weights_init=None)
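Unlike K-means, the fitted mixture gives soft assignments. As a quick illustrative check with the gmm and X from the cell above, predict_proba returns the posterior probability of each of the three components for every point:

# Soft assignments: posterior probability of each Gaussian component,
# shown for the first five points (each row sums to 1)
print(np.round(gmm.predict_proba(X)[:5], 3))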


In [9]:
# Plot (do not have to understand the code)
plt.figure(figsize=(10,6))
splot = plt.subplot(1, 1, 1)
Y_ = gmm.predict(X)

color_iter = itertools.cycle(['navy', 'black', 'red', 'darkorange'])

for i, (mean, cov, color) in enumerate(zip(gmm.means_, gmm.covariances_, color_iter)):
    v, w = linalg.eigh(cov)
    if not np.any(Y_ == i):
        continue
    plt.scatter(X[Y_ == i, 0], X[Y_ == i, 1], 8.0, color=color)

    # Plot an ellipse to show the Gaussian component
    angle = np.arctan2(w[0][1], w[0][0])
    angle = 180. * angle / np.pi  # convert to degrees
    v = 2. * np.sqrt(2.) * np.sqrt(v)
    ell = mpl.patches.Ellipse(mean, v[0], v[1], 180. + angle, color=color)
    ell.set_clip_box(splot.bbox)
    ell.set_alpha(.5)
    splot.add_artist(ell)

plt.title('GMM')
plt.subplots_adjust(hspace=.35, bottom=.02)
plt.axis('equal')
plt.show()

3. Correlation Analysis

- Statistical relationship between two sets of data
- http://rpsychologist.com/d3/correlation/

In [10]:
import seaborn as sns
import pandas

sns.set(style="ticks", color_codes=True)
iris = cPickle.load(open('./data_files/iris.pkl', 'rb'))

g = sns.pairplot(iris)
# sns.plt.show()
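Beyond the visual pairplot above, the pairwise relationships can be summarized numerically with a correlation matrix. A minimal sketch, assuming iris loaded above is a pandas DataFrame:

# Pearson correlation matrix of the numeric iris columns
numeric_cols = iris.select_dtypes(include='number')
print(numeric_cols.corr())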

4. Principal Component Analysis (PCA)

Motivation: Can we describe high-dimensional data in a "simpler" way?
- Dimension reduction without losing too much information
- Find a low-dimensional, yet useful representation of the data

How?
1. Maximize variance (most separable)
2. Minimize the sum-of-squares (of the projection errors)

Data projection onto a unit vector $u$

PCA Demo

Optimal data representation: find the most informative point of view

In [11]:
%%html
<center><iframe width="560" height="315" src="https://www.youtube.com/embed/Pkit-64g0eU" frameborder="0" allowfullscreen></iframe></center>

3 cut

In [12]:
%%html
<center><iframe width="560" height="315" src="https://www.youtube.com/embed/x4lvjVjUUqg" frameborder="0" allowfullscreen></iframe></center>

4 cut

In [13]:
%%html
<center><iframe width="560" height="315" src="https://www.youtube.com/embed/2t62WkNIqxY" frameborder="0" allowfullscreen></iframe></center>

5 cut

Multivariate Time Series

System order from:
- laws of physics, or
- data

Dimension Reduction Method
1. Choose the top $k$ (orthonormal) eigenvectors, $U = [u_1, u_2, \cdots, u_k]$
2. Project $x^{(i)}$ onto $\text{span}\{u_1, u_2, \cdots, u_k\}$:

$$z^{(i)} = \begin{bmatrix} u_1^T x^{(i)} \\ u_2^T x^{(i)} \\ \vdots \\ u_k^T x^{(i)} \end{bmatrix} \quad \text{or} \quad z = U^T x$$

Pictorial summary of PCA:

$$x^{(i)} \rightarrow \text{projection onto unit vector } u, \qquad u^T x^{(i)} = \text{distance from the origin along } u$$

In [14]:
X = cPickle.load(open('./data_files/pca_example.pkl', 'rb'))

plt.figure(figsize=(10,6))
plt.plot(X[:, 0], X[:, 1], 'b.')
plt.axis('equal')
plt.show()
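The projection $z = U^T x$ described above can be sketched directly with NumPy before handing the work to scikit-learn: eigendecompose the sample covariance of the centered data, keep the top eigenvector(s) as $U$, and project. This sketch assumes X is the 2-D array loaded in the cell above.

# PCA by eigendecomposition of the sample covariance
Xc = X - X.mean(axis=0)                  # center the data
S = np.cov(Xc, rowvar=False)             # 2 x 2 sample covariance
eigvals, eigvecs = np.linalg.eigh(S)     # eigh returns eigenvalues in ascending order
order = np.argsort(eigvals)[::-1]        # sort by variance, largest first
U = eigvecs[:, order[:1]]                # keep the top k = 1 eigenvector
Z = Xc @ U                               # projected coordinates, z = U^T x
print(eigvals[order] / eigvals.sum())    # fraction of variance along each principal axis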

k U = [ , , ⋯ , ]u1 u2 uk

xi { , , ⋯ , }u1 u2 uk

=  or  z = xz(i)

⎢⎢⎢⎢⎢

uT1 x(i)

uT2 x(i)

uTk x(i)

⎥⎥⎥⎥⎥U

T

→x(i) u ⟹ =uT x(i) u

In [15]:
from sklearn.decomposition import PCA

pca = PCA(n_components=2)
pca.fit(X)

plt.figure()
plt.stem(range(1, 3), pca.explained_variance_ratio_)
plt.xlim([0.5, 2.5])
plt.ylim([0, 1])
plt.title('Explained variance ratio')
plt.show()

In [16]:
# Normalize the first principal axis and compute its slope
principal_axis = pca.components_[0, :]
u1 = principal_axis / np.linalg.norm(principal_axis)
h = u1[1] / u1[0]

xp = np.linspace(-3, 3, 200)
yp = h * xp

plt.figure(figsize=(10,6))
plt.plot(xp, yp, 'r.')
plt.plot(X[:, 0], X[:, 1], 'b.')
plt.axis('equal')
plt.show()

In [17]:
pca = PCA(n_components=1)
pca.fit(X)
reduced_coordinate = pca.fit_transform(X)

plt.hist(reduced_coordinate, 15)
plt.show()
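To see how much information the one-dimensional representation retains, the reduced coordinates can be mapped back to the original 2-D space with scikit-learn's inverse_transform. A short sketch using the pca object and reduced_coordinate from the cell above:

# Reconstruct 2-D points from their 1-D principal-axis coordinates
X_hat = pca.inverse_transform(reduced_coordinate)

plt.figure(figsize=(10,6))
plt.plot(X[:, 0], X[:, 1], 'b.', label='original')
plt.plot(X_hat[:, 0], X_hat[:, 1], 'r.', label='reconstructed from 1 component')
plt.legend()
plt.axis('equal')
plt.show()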

In [18]:
%%javascript
$.getScript('https://kmahelona.github.io/ipython_notebook_goodies/ipython_notebook_toc.js')