【Python】scikit-learn机器学习(八)——K-means聚类

来源:互联网 发布:包装结构设计软件 编辑:程序博客网 时间:2024/06/05 08:26

数据描述


KMeans函数介绍


代码实现

import numpy as npfrom sklearn.cluster import KMeans  def loadData(filePath):    fr = open(filePath,'r+')    lines = fr.readlines()    retData = []    retCityName = []    for line in lines:        items = line.strip().split(",")        retCityName.append(items[0])        retData.append([float(items[i]) for i in range(1,len(items))])    return retData,retCityName      if __name__ == '__main__':    data,cityName = loadData('city.txt')    km = KMeans(n_clusters=4)    label = km.fit_predict(data)    expenses = np.sum(km.cluster_centers_,axis=1)    #print(expenses)    CityCluster = [[],[],[],[]]    for i in range(len(cityName)):        CityCluster[label[i]].append(cityName[i])    for i in range(len(CityCluster)):        print("Expenses:%.2f" % expenses[i])        print(CityCluster[i])

结果输出



原创粉丝点击