Clustering UNSUPERVISED LEARNING INTRODUCTION Machine Learning.

ClusteringUNSUPERVISED LEARNING INTRODUCTION

Machine Learning

Andrew Ng

Supervised learning

Training set:

Andrew Ng

Unsupervised learning

Training set:

Andrew Ng

Applications of clustering

Organize computing clusters

Social network analysis

Image credit: NASA/JPL-Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Astronomical data analysis

Market segmentation

ClusteringVARIANT

Clustering Category• Based on the Clustering Algorithms, clustering are categorized into Four

Major Category:– Partitional (Centroid Based)

Try to cluster data into k number of cluster.Example: K-Means, K-Means++, Fuzzy C-Means.

– Hierarchical• Agglomerative

Start with all data as an individual cluster • Divisive

Start with the entire data as a single cluster.

– Distribution BasedThe clustering model most closely related to statistics is based on distribution

models.Example: EM-clusteringUnpopular because tend to overfitting

– Density Based• In density-based clustering, clusters are defined as areas of higher density than the

remainder of the data set.

Based on the data• Clustering are categorized into:

– Numerical data clustering– Categorical data clustering

Clustering

K-MEANS ALGORITHM

Andrew Ng

Input:- (number of clusters)- Training set

(drop convention)

K-means algorithm

Andrew Ng

Randomly initialize cluster centroids

K-means algorithm

Repeat {for = 1 to

:= index (from 1 to ) of cluster centroid closest to

for = 1 to := average (mean) of points assigned to cluster

Andrew Ng

K-means for non-separated clusters

T-shirt sizing

Height

Clustering

OPTIMIZATION OBJECTIVE

Machine Learning

Andrew Ng

K-means optimization objective

= index of cluster (1,2,…, ) to which example is currently assigned

= cluster centroid ( )= cluster centroid of cluster to which example has been

assignedOptimization objective:

Andrew Ng

K-means algorithm

Repeat {for = 1 to

Clustering

RANDOM INITIALIZATION

Machine Learning

Andrew Ng

K-means algorithm

Repeat {for = 1 to

Andrew Ng

Random initialization

Should have

Randomly pick training examples.

Set equal to these examples.

Andrew Ng

Local optima

Andrew Ng

For i = 1 to 100 {

Randomly initialize K-means.Run K-means. Get .Compute cost function (distortion)

Pick clustering that gave lowest cost

Random initialization

ClusteringCHOOSING THE NUMBER OF CLUSTERS

Machine Learning

Andrew Ng

What is the right value of K?

Andrew Ng

Choosing the value of K

Elbow method:

1 2 3 4 5 6 7 8

(no. of clusters)1 2 3 4 5 6 7 8

(no. of clusters)

Andrew Ng

Choosing the value of KSometimes, you’re running K-means to get clusters to use for some later/downstream purpose. Evaluate K-means based on a metric for how well it performs for that later purpose.

E.g. T-shirt sizing

Height

T-shirt sizing

HeightW

Clustering UNSUPERVISED LEARNING INTRODUCTION Machine Learning.

Documents

Transcript of Clustering UNSUPERVISED LEARNING INTRODUCTION Machine Learning.

Unsupervised learning Clustering using the k-means algorithm Avi Libster.

28 Machine Learning Unsupervised Hierarchical Clustering

Unsupervised Learning: Clustering 1 Lecture 16: Clustering Web Search and Mining.

Machine Learning Problems Unsupervised Learning – Clustering – Density estimation – Dimensionality Reduction Supervised Learning – Classification – Regression.

Improving Unsupervised Image Clustering With Robust Learning

Neural & Evolutionary Computing - Lecture 3 1 Neural Networks with Unsupervised Learning Particularities of unsupervised learning Data clustering with.

Unsupervised Learning Clustering Algorithms - IT - websiteafred/tutorials/B_Clustering_Algorithms.pdf · 2 3 Unsupervised Learning Clustering Algorithms Unsupervised Learning -- Ana

Clustering Supervised vs. Unsupervised Learning Examples of clustering in Web IR Characteristics of clustering Clustering algorithms Cluster Labeling 1.

Unsupervised Learning K-means clustering The EM Algorithm Competetive Learning & SOM Networks

Machine Learning II: Unsupervised Learning€¦ · 2 Our agenda today Unsupervised Learning and Applications • Clustering – Application: Image Segmentation • Density Estimation

Day 9: Unsupervised learning, dimensionality reductionsuriya/website-intromlss2018/...semi-supervised learning 2 Clustering 3 Clustering languages 4 Clustering species (phylogeny)

Clustering - cs.bilkent.edu.trcs.bilkent.edu.tr/~gunduz/teaching/cs550/documents/CS550... · Clustering / Unsupervised Learning In SUPERVISED learning ... Heterogeneous vs homogeneous:

Unsupervised Learning: Clustering Rong Jin Outline Unsupervised learning K means for clustering Expectation Maximization algorithm for clustering.

Deep Clustering for Unsupervised Learning of Visual Featuresopenaccess.thecvf.com/...Caron_Deep_Clustering_for_ECCV_2018_paper.pdf · Deep Clustering for Unsupervised Learning of

CS 478 - Clustering1 Unsupervised Learning and Clustering In unsupervised learning you are given a data set with no output classifications Clustering is.

Clustering and machine learning for gene expression dataxray.bmc.uu.se/kurs/BioinfX3/2009/l13_clustering.pdf · Data analysis methods • Unsupervised learning (clustering, class

Unsupervised Learning (a.k.a Clustering)pelillo/Didattica/Artificial... · Artificial Intelligence a.y. 2017/18 Unsupervised Learning ... Clustering problems abound in many areas

Unsupervised Learning and Clustering

Chapter 9 UNSUPERVISED LEARNING: Clustering Part 1 Cios / Pedrycz / Swiniarski / Kurgan.

Lecture 4 Unsupervised Learning Clustering & Dimensionality Reduction 273A Intro Machine Learning.