K-Means++初始化的示例#

显示的输出的示例 sklearn.cluster.kmeans_plusplus 用于生成用于集群的初始种子的功能。

K-Means++用作默认初始化 K-means .

K-Means++ Initialization
# Authors: The scikit-learn developers
# SPDX-License-Identifier: BSD-3-Clause

import matplotlib.pyplot as plt

from sklearn.cluster import kmeans_plusplus
from sklearn.datasets import make_blobs

# Generate sample data
n_samples = 4000
n_components = 4

X, y_true = make_blobs(
    n_samples=n_samples, centers=n_components, cluster_std=0.60, random_state=0
)
X = X[:, ::-1]

# Calculate seeds from k-means++
centers_init, indices = kmeans_plusplus(X, n_clusters=4, random_state=0)

# Plot init seeds along side sample data
plt.figure(1)
colors = ["#4EACC5", "#FF9C34", "#4E9A06", "m"]

for k, col in enumerate(colors):
    cluster_data = y_true == k
    plt.scatter(X[cluster_data, 0], X[cluster_data, 1], c=col, marker=".", s=10)

plt.scatter(centers_init[:, 0], centers_init[:, 1], c="b", s=50)
plt.title("K-Means++ Initialization")
plt.xticks([])
plt.yticks([])
plt.show()

Total running time of the script: (0分0.044秒)

相关实例

二分K均值和常规K均值性能比较

Bisecting K-Means and Regular K-Means Performance Comparison

GMM检查方法

GMM Initialization Methods

手写数字数据上的K-Means集群演示

A demo of K-Means clustering on the handwritten digits data

K-Means和MiniBatchKMeans集群算法的比较

Comparison of the K-Means and MiniBatchKMeans clustering algorithms

Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io> _