Welcome to Scribd!

0% found this document useful (0 votes)

18 views

K-Means: Step-By-Step Example

Uploaded by

This document summarizes the k-means clustering algorithm steps using a sample dataset of 7 individuals scored on 2 variables. It shows how the algorithm initially assigns individuals to 2 clusters based on the furthest data points, then iteratively recalculates cluster means and reassigns individuals to minimize distances until cluster memberships stabilize. For the sample data, the algorithm converges after one individual is reassigned between clusters based on closest mean distances, resulting in final clusters of individuals 1,2 and 3,4,5,6,7.

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

K-Means: Step-By-Step Example

Uploaded by

amarpoonam

0% found this document useful (0 votes)

18 views2 pages

Original Description:

Simple example to demonstrate how K-means algorithm works

Original Title

k-means

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Download as docx, pdf, or txt

0% found this document useful (0 votes)

18 views2 pages

K-Means: Step-By-Step Example

Uploaded by

amarpoonam

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Download as docx, pdf, or txt

Jump to Page

You are on page 1of 2

Search inside document

k-Means: Step-By-Step Example

As a simple illustration of a k-means algorithm, consider the following data set consisting of the scores of two variables on each of
seven individuals:

Subject
1
2
3
4
5
6
7

A
1.0
1.5
3.0
5.0
3.5
4.5
3.5

B
1.0
2.0
4.0
7.0
5.0
5.0
4.5

This data set is to be grouped into two clusters. As a first step in finding a sensible initial partition, let the A & B values of the two
individuals furthest apart (using the Euclidean distance measure), define the initial cluster means, giving:

Individual
Group 1
Group 2

1
4

Mean Vector
(centroid)
(1.0, 1.0)
(5.0, 7.0)

The remaining individuals are now examined in sequence and allocated to the cluster to which they are closest, in terms of Euclidean
distance to the cluster mean. The mean vector is recalculated each time a new member is added. This leads to the following series of
steps:

Cluster 1
Step

Individual

1
2
3
4
5
6

1
1, 2
1, 2, 3
1, 2, 3
1, 2, 3
1, 2, 3

Mean
Vector
(centroid)
(1.0, 1.0)
(1.2, 1.5)
(1.8, 2.3)
(1.8, 2.3)
(1.8, 2.3)
(1.8, 2.3)

Cluster 2
Individual
4
4
4
4, 5
4, 5, 6
4, 5, 6, 7

Mean
Vector
(centroid)
(5.0, 7.0)
(5.0, 7.0)
(5.0, 7.0)
(4.2, 6.0)
(4.3, 5.7)
(4.1, 5.4)

Now the initial partition has changed, and the two clusters at this stage having the following characteristics:

Individual
Cluster 1
Cluster 2

1, 2, 3
4, 5, 6, 7

Mean Vector
(centroid)
(1.8, 2.3)
(4.1, 5.4)

But we cannot yet be sure that each individual has been assigned to the right cluster. So, we compare each individuals distance to its
own cluster mean and to
that of the opposite cluster. And we find:

Individual
1
2
3
4

Distance to Distance to
mean
mean
(centroid) of (centroid) of
Cluster 1
Cluster 2
1.5
5.4
0.4
4.3
2.1
1.8
5.7
1.8

5
6
7

3.2
3.8
2.8

0.7
0.6
1.1

Only individual 3 is nearer to the mean of the opposite cluster (Cluster 2) than its own (Cluster 1). In other words, each individual's
distance to its own cluster mean should be smaller that the distance to the other cluster's mean (which is not the case with individual 3).
Thus, individual 3 is relocated to Cluster 2 resulting in the new partition:

Mean Vector
(centroid)
1, 2
(1.3, 1.5)
3, 4, 5, 6, 7 (3.9, 5.1)
Individual

Cluster 1
Cluster 2

The iterative relocation would now continue from this new partition until no more relocations occur. However, in this example each
individual is now nearer its own cluster mean than that of the other cluster and the iteration stops, choosing the latest partitioning as the
final cluster solution.
Also, it is possible that the k-means algorithm won't find a final solution. In this case it would be a good idea to consider stopping the
algorithm after a pre-chosen maximum of iterations.

Module - 4 K Means Clustering
Document20 pages
Module - 4 K Means Clustering
k
No ratings yet
K-Means Clustering
Document6 pages
K-Means Clustering
hifzan786
No ratings yet
K Mean Clustering
Document36 pages
K Mean Clustering
Navjot Wadhwa
No ratings yet
K Mean Clustering
Document45 pages
K Mean Clustering
hello125643
No ratings yet
K Mean Clustering 1
Document26 pages
K Mean Clustering 1
Nada Ahmed
No ratings yet
K Mean Clustering
Document48 pages
K Mean Clustering
Rexline S J
No ratings yet
Introduction To Unsupervised Learning:: Clustering
Document21 pages
Introduction To Unsupervised Learning:: Clustering
mohini sen
No ratings yet
Lecture 013
Document20 pages
Lecture 013
Hammad Khokhar
No ratings yet
K Mean Clustering
Document27 pages
K Mean Clustering
ashishamitav123
No ratings yet
Session 18-Cluster Analysis
Document20 pages
Session 18-Cluster Analysis
Pratyusha Voruganti
No ratings yet
KMean Merged
Document13 pages
KMean Merged
Abhyudya Singh
No ratings yet
16 K Mean Clustring 1 18052023 095249am 08042024 093324am
Document20 pages
16 K Mean Clustring 1 18052023 095249am 08042024 093324am
Muneeba Hussain
No ratings yet
Lecture Notes - Clustering
Document13 pages
Lecture Notes - Clustering
gunjan Bhardwaj
No ratings yet
Lecture+Notes+ +clustering
Document13 pages
Lecture+Notes+ +clustering
Pankaj Pandey
No ratings yet
A Paper With 12pt Global Font Size
Document13 pages
A Paper With 12pt Global Font Size
vishnugorantla0308
No ratings yet
Cluster Analysis (Continued)
Document3 pages
Cluster Analysis (Continued)
Anonymous Zcd9Uuzjz9
No ratings yet
6 Clustering
Document15 pages
6 Clustering
Monis Khan
No ratings yet
Bis Distance
Document8 pages
Bis Distance
yoga_laddo
No ratings yet
K Mean Clustering 1
Document12 pages
K Mean Clustering 1
HykaVirtasari
100% (1)
Clustering Dendogram
Document13 pages
Clustering Dendogram
Oliver Queen
No ratings yet
ML Unit-2
Document31 pages
ML Unit-2
2021pcecscharul037
No ratings yet
Data Mining and Clustering - Benjamin Lam
Document49 pages
Data Mining and Clustering - Benjamin Lam
Arijit Das
No ratings yet
Cluster Analysis
Document30 pages
Cluster Analysis
Jashid Hameed
No ratings yet
Business Research: Cluster Analysis
Document10 pages
Business Research: Cluster Analysis
popat vishal
No ratings yet
Data Clustering..
Document10 pages
Data Clustering..
ArjunSahoo
No ratings yet
Cluster Analysis Techniques
Document33 pages
Cluster Analysis Techniques
बिक्रम नेपाली
No ratings yet
A Famous Example of Cluster Analysis
Document5 pages
A Famous Example of Cluster Analysis
Vinit Shah
No ratings yet
K Mean Clustering1
Document23 pages
K Mean Clustering1
Khyati Chhabra
No ratings yet
"These Are Just Rough Notes For References" What Is K-Means Clustering
Document9 pages
"These Are Just Rough Notes For References" What Is K-Means Clustering
Nikhil Jojen
No ratings yet
Quality of Clustering: Clustering (K-Means Algorithm)
Document4 pages
Quality of Clustering: Clustering (K-Means Algorithm)
Sk Arif Ahmed
No ratings yet
Agnes
Document25 pages
Agnes
Dyah Septi Andryani
No ratings yet
Clustering
Document17 pages
Clustering
Aatri Pal
No ratings yet
K-Means Clustering - Numerical Example
Document6 pages
K-Means Clustering - Numerical Example
Γιαννης Σκλαβος
100% (1)
First Paper Before
Document19 pages
First Paper Before
زيد الدين
No ratings yet
Cluster Analysis: Motivation: Why Cluster Analysis Dissimilarity Matrices Introduction To Clustering Algorithms
Document34 pages
Cluster Analysis: Motivation: Why Cluster Analysis Dissimilarity Matrices Introduction To Clustering Algorithms
bs_sharath
No ratings yet
Lp2-Etl Model Assignment No. 2: R (2) C (4) V (2) T (2) Total (10) Dated Sign
Document7 pages
Lp2-Etl Model Assignment No. 2: R (2) C (4) V (2) T (2) Total (10) Dated Sign
Ishwari Pawar
No ratings yet
Spss 8
Document4 pages
Spss 8
Jacob Tan
No ratings yet
K-Means Clustering Clustering Algorithms Implementation and Comparison
Document4 pages
K-Means Clustering Clustering Algorithms Implementation and Comparison
FrankySaputra
No ratings yet
Somnath Clustered Ranksum
Document8 pages
Somnath Clustered Ranksum
Adnan Shoaib
No ratings yet
Lecture - 9 Unsupervised Learning (K-Means, Association Analysis and Frequuent Items)
Document73 pages
Lecture - 9 Unsupervised Learning (K-Means, Association Analysis and Frequuent Items)
ABDURAHMAN ABDELLA
No ratings yet
Cluster Analysis BRM Session 14
Document25 pages
Cluster Analysis BRM Session 14
akhil107043
No ratings yet
K Means Algo
Document7 pages
K Means Algo
Prakash Chorage
No ratings yet
UnSupervisedLearning
Document22 pages
UnSupervisedLearning
bhattnirmal15
No ratings yet
An Introduction To Multivariate Analysis
Document28 pages
An Introduction To Multivariate Analysis
Ramadhana Dio Gradianta
No ratings yet
Jaipur National University: Project Design With Seminar
Document26 pages
Jaipur National University: Project Design With Seminar
Faizan Shaikh
100% (1)
Distance-Based Techniques
Document7 pages
Distance-Based Techniques
George Wang
No ratings yet
Clustering
Document23 pages
Clustering
Aditya Mohite
No ratings yet
Clustering Analysis: What Is Cluster Analysis?
Document5 pages
Clustering Analysis: What Is Cluster Analysis?
shyama
No ratings yet
Lesson 5 Con't Central Tendency For Grouped Data
Document24 pages
Lesson 5 Con't Central Tendency For Grouped Data
Junvy Abordo
No ratings yet
Hierarchical Clustering: Required Data
Document6 pages
Hierarchical Clustering: Required Data
Hritik Agrawal
No ratings yet
5 Pca
Document14 pages
5 Pca
SAMRIDDHI JAISWAL
No ratings yet
K-Mean Clustering Final
Document21 pages
K-Mean Clustering Final
211119
No ratings yet
Machine Learning
Document29 pages
Machine Learning
bikram2128
No ratings yet
K Mean
Document12 pages
K Mean
Shivram Dwivedi
No ratings yet
jU7fSi-f1500 Chapter 15
Document23 pages
jU7fSi-f1500 Chapter 15
san.ras0715
No ratings yet
Gradient Expectations: Structure, Origins, and Synthesis of Predictive Neural Networks
From Everand
Gradient Expectations: Structure, Origins, and Synthesis of Predictive Neural Networks
Keith L. Downing
No ratings yet
Full Free Motion of Celestial Bodies Around a Central Mass - Why Do They Mostly Orbit in the Equatorial Plane?
From Everand
Full Free Motion of Celestial Bodies Around a Central Mass - Why Do They Mostly Orbit in the Equatorial Plane?
Raul Fattore
No ratings yet
Mechanics I Essentials
From Everand
Mechanics I Essentials
The Editors of REA
No ratings yet
Competitive Learning: Fundamentals and Applications for Reinforcement Learning through Competition
From Everand
Competitive Learning: Fundamentals and Applications for Reinforcement Learning through Competition
Fouad Sabry
No ratings yet
Student's Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data, second edition
From Everand
Student's Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data, second edition
Jeffrey M. Wooldridge
No ratings yet
SAP Modularization MCQ Questions With Answer
Document7 pages
SAP Modularization MCQ Questions With Answer
amarpoonam
No ratings yet
Sap Script
Document15 pages
Sap Script
amarpoonam
No ratings yet
ABAP Exercises
Document45 pages
ABAP Exercises
amarpoonam
No ratings yet
Exception Handaling PDF
Document59 pages
Exception Handaling PDF
amarpoonam
No ratings yet
Let's Web Dynpro. Part I: Web Dynpro: It Is Neither A Tool With Only Drag and Drop (Those Who Have
Document80 pages
Let's Web Dynpro. Part I: Web Dynpro: It Is Neither A Tool With Only Drag and Drop (Those Who Have
amarpoonam
No ratings yet
List of A Few SAP Supplied Function Modules
Document2 pages
List of A Few SAP Supplied Function Modules
amarpoonam
No ratings yet
Handling Imbalanced Dataset
Document23 pages
Handling Imbalanced Dataset
amarpoonam
No ratings yet
MCQ RM
Document16 pages
MCQ RM
amarpoonam
89% (9)
CH 2
Document32 pages
CH 2
amarpoonam
No ratings yet
DAA Introduction PDF
Document13 pages
DAA Introduction PDF
amarpoonam
No ratings yet
A Survey On Audio Cryptography
Document18 pages
A Survey On Audio Cryptography
amarpoonam
No ratings yet
CH 9
Document24 pages
CH 9
amarpoonam
No ratings yet