改进的基于广度优先搜索的COP-Kmeans算法

朱煜; 钱景辉; 季正波

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
同行评议
相关论文
评论

改进的基于广度优先搜索的COP-Kmeans算法

首发时间：2015-07-09

朱煜 ¹
朱煜(1991-),男（汉族），江苏省泰州市人，硕士研究生，主要研究领域为数据挖掘
钱景辉 ¹
导师：钱景辉（1978-），男，江苏张家港人，副教授，研究方向为计算机网络。
季正波 ¹
College of Electronics and Information Engineering, Nanjing Tech University, Nanjing 211816

1、南京工业大学电子与信息工程学院,南京 211816

摘要：将广度优先搜索BFS应用于COP-Kmeans算法会对相同的约束对产生不同的搜索序列，导致算法的准确率降低。针对这种情况，提出了一种改进的基于BFS的COP-Kmeans算法。算法首先对训练集进行多次聚类，取得聚类结果，然后对聚类结果进行计算，得到各个聚类结果的标准化互信息，根据标准化互信息计算任意两个数据对象的相关性，最终得到各个数据对象的稳定性，将数据对象稳定性作为数据对象的分配次序的参考依据从而提高算法的准确率，最后重新进行聚类，得到最终的聚类结果。实验结果表明，采用改进后的算法比原先算法在准确率上有所提高。

关键词：广度优先搜索算法结合限制的k均值算法标准化互信息数据对象稳定性准确性

For information in English, please click here

An improved COP-Kmeans algorithm based on BFS

Zhu Yu ¹
朱煜(1991-),男（汉族），江苏省泰州市人，硕士研究生，主要研究领域为数据挖掘
Qian JingHui ²
导师：钱景辉（1978-），男，江苏张家港人，副教授，研究方向为计算机网络。
Ji ZhengBo ²
College of Electronics and Information Engineering, Nanjing Tech University, Nanjing 211816

1、Electronic information engineering institute Nanjing University of Technology,Nanjing 211816
2、Electronic information engineering institute NanJing University of Technology,Nanjing 211816

Abstract： The breadth first search applied to the COP-Kmeans algorithm will produce different search sequences for the same constraints，which will reduce the accuracy of the algorithm . In view of this, we propose an improved COP-Kmeans algorithm based on the breadth first search (BFS). Firstly, we trained the data set to get the clustering results, and then the calculation results are calculated to get the normalized mutual information of each calculation result, according to the normalized mutual information we can calculate the correlation between any two instances, take the instance stability as a reference to distribute the instance will improve the accuracy of the algorithm. Finally re-clustering to get the final clustering result. . Experimental results show that the improved algorithm can obtain an increase in the accuracy compared to the original algorithm.

Keywords： Breadth first search Constrained K-means Clustering Normalized mutual information The instance stability Accuracy

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

朱煜，钱景辉，季正波. 改进的基于广度优先搜索的COP-Kmeans算法[EB/OL]. 北京：中国科技论文在线 [2015-07-09]. https://www.paper.edu.cn/releasepaper/content/201507-93.

No.4647089107235214****

同行评议

共计0人参与

全部评论

0/1000

论文编号	201507-93
论文题目	改进的基于广度优先搜索的COP-Kmeans算法
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.