Send
Close Add comments:
(status displays here)
Got it! This site "robinsnyder.com" uses cookies. You consent to this by clicking on "Got it!" or by continuing to use this website. Note: This appears on each machine/browser from which this site is accessed.
K-Means clustering
1. K-Means clustering
2. K-Means clustering
An important aspect of data science is clustering data. Clustering means partitioning data into groups. A common algorithm for clustering is the k-means algorithm. In this algorithm one must specify the number of cluster groups k.
Details of the algorithm are omitted. (to be added)
3. Example data set
Here is an example data set.
For the remaining groupings, the convex hull is displayed as an outline. The convex hull is the set of points that, when connected, contain all points in the data set.
4. 1 cluster
Here is the k-means grouping of the data set into
1 cluster.
5. 2 clusters
Here is the k-means grouping of the data set into
2 clusters.
6. 3 clusters
Here is the k-means grouping of the data set into
3 clusters.
7. 4 clusters
Here is the k-means grouping of the data set into
4 clusters.
8. 5 clusters
Here is the k-means grouping of the data set into
5 clusters.
9. 6 clusters
Here is the k-means grouping of the data set into
6 clusters.
10. 7 clusters
Here is the k-means grouping of the data set into
7 clusters.
11. Animation
Here is an animation of the clusters.
12. Nonparametric clustering
Whenever the number of desired clusters is known beforehand, the process is called parametric - since the parameter for number of clusters is known.
Whenever the number of desired clusters is not known beforehand, the process is called nonparametric - since the parameter for the number of clusters is not known (beforehand).
13. End of page