Introduction to K-Means


Introduction to K-Means Clustering

We have finally arrived at the meat of this article!

Recall the first property of clusters – it states that the points within a cluster should be
similar to each other. So, our aim here is to minimize the distance between the
points within a cluster.

There is an algorithm that tries to minimize the distance of the points in
a cluster from their centroid – the k-means clustering technique.
K-means is a centroid-based algorithm, or a distance-based algorithm, where we
calculate the distances to assign a point to a cluster. In K-Means, each cluster is
associated with a centroid.

The main objective of the K-Means algorithm is to minimize the sum of
distances between the points and their respective cluster centroids.
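In symbols, this objective is often written as the within-cluster sum of squares (a standard formulation using squared Euclidean distance, which the article does not pin down explicitly):

```latex
J = \sum_{j=1}^{k} \; \sum_{x_i \in C_j} \lVert x_i - \mu_j \rVert^2
```

Here C_j is the set of points assigned to cluster j and μ_j is that cluster’s centroid.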
Let’s now take an example to understand how K-Means actually works:

We have these 8 points and we want to apply k-means to create clusters for these
points. Here’s how we can do it.

Step 1: Choose the number of clusters k


The first step in k-means is to pick the number of clusters, k.

Step 2: Select k random points from the data as centroids


Next, we randomly select a centroid for each cluster. Let’s say we want to have 2
clusters, so k is equal to 2 here. We then randomly select the centroids.
Here, the red and green circles represent the centroids of these clusters.
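This selection step can be sketched in NumPy. The eight coordinates below are made up for illustration – the article does not give the actual points:

```python
import numpy as np

# Hypothetical 2-D data standing in for the article's 8 points
points = np.array([
    [1.0, 1.0], [1.5, 2.0], [2.0, 2.5], [1.0, 2.0],   # one loose group
    [5.0, 7.0], [5.5, 6.5], [6.0, 7.5], [5.5, 7.0],   # another loose group
])

k = 2
rng = np.random.default_rng(seed=42)  # seed fixed only for reproducibility

# Step 2: pick k distinct data points to serve as the initial centroids
centroids = points[rng.choice(len(points), size=k, replace=False)]
```

Picking actual data points as the initial centroids is one common convention; other initializations (e.g. k-means++) exist but are not what the article describes here.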

Step 3: Assign all the points to the closest cluster centroid


Once we have initialized the centroids, we assign each point to the closest cluster
centroid:

Here you can see that the points that are closer to the red point are assigned to the
red cluster, whereas the points that are closer to the green point are assigned to the
green cluster.
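The assignment step can be sketched like this (same illustrative points as before; the two centroid coordinates are assumptions standing in for the red and green circles):

```python
import numpy as np

points = np.array([
    [1.0, 1.0], [1.5, 2.0], [2.0, 2.5], [1.0, 2.0],
    [5.0, 7.0], [5.5, 6.5], [6.0, 7.5], [5.5, 7.0],
])
# Two assumed centroids, standing in for the red and green circles
centroids = np.array([[1.5, 1.5], [5.5, 7.0]])

# Euclidean distance from every point to every centroid: shape (n_points, k)
distances = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)

# Step 3: each point joins the cluster of its nearest centroid
labels = distances.argmin(axis=1)
# → labels is [0, 0, 0, 0, 1, 1, 1, 1] for this data
```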

Step 4: Recompute the centroids of newly formed clusters


Now, once we have assigned all of the points to either cluster, the next step is to
compute the centroids of newly formed clusters:
Here, the red and green crosses are the new centroids.
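Recomputing a centroid simply means taking the mean of the points currently assigned to that cluster. A minimal sketch, continuing the illustrative data above:

```python
import numpy as np

points = np.array([
    [1.0, 1.0], [1.5, 2.0], [2.0, 2.5], [1.0, 2.0],
    [5.0, 7.0], [5.5, 6.5], [6.0, 7.5], [5.5, 7.0],
])
labels = np.array([0, 0, 0, 0, 1, 1, 1, 1])  # assignments from the previous step
k = 2

# Step 4: new centroid of each cluster = mean of the points assigned to it
new_centroids = np.array([points[labels == j].mean(axis=0) for j in range(k)])
# → [[1.375, 1.875], [5.5, 7.0]]
```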

Step 5: Repeat steps 3 and 4


We then repeat steps 3 and 4:

Assigning all the points to a cluster based on their distance from the centroids, and
then recomputing the centroids, together make up a single iteration. But wait – when
should we stop this process? It can’t run till eternity, right?

Stopping Criteria for K-Means Clustering


There are essentially three stopping criteria that can be adopted to stop the K-means
algorithm:

1. Centroids of newly formed clusters do not change
2. Points remain in the same cluster
3. The maximum number of iterations is reached

We can stop the algorithm if the centroids of newly formed clusters are not changing.
Even after multiple iterations, if we are getting the same centroids for all the clusters, we
can say that the algorithm is not learning any new pattern and it is a sign to stop the
training.

Another clear sign that we should stop the training process is when the points remain in
the same cluster even after training the algorithm for multiple iterations.

Finally, we can stop the training if the maximum number of iterations is reached.
Suppose we have set the number of iterations to 100. The process will repeat for 100
iterations before stopping.
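Putting the steps and the stopping criteria together, the whole loop might look like the sketch below (NumPy; the empty-cluster fallback and the `allclose` tolerance are implementation choices of this sketch, not something the article specifies):

```python
import numpy as np

def kmeans(points, k, max_iters=100, seed=0):
    rng = np.random.default_rng(seed)
    # Step 2: pick k random data points as the initial centroids
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    labels = np.zeros(len(points), dtype=int)
    for _ in range(max_iters):                      # criterion 3: iteration cap
        # Step 3: assign each point to its nearest centroid
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        # Step 4: recompute each centroid as the mean of its cluster
        new_centroids = np.array([
            points[new_labels == j].mean(axis=0) if np.any(new_labels == j)
            else centroids[j]                       # keep old centroid if a cluster empties
            for j in range(k)
        ])
        # Criteria 1 and 2: centroids stop moving and assignments stop changing
        if np.allclose(new_centroids, centroids) and np.array_equal(new_labels, labels):
            break
        centroids, labels = new_centroids, new_labels
    return centroids, labels
```

On two well-separated groups of points this converges in a handful of iterations; production implementations such as scikit-learn’s `KMeans` follow the same loop but add a smarter initialization (k-means++) and multiple restarts.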
