加入三角不等式的闭合的改进算法
首发时间:2019-03-06
摘要:本文介绍的是加入三角不等式的闭合的K-means改进算法,它显著提高了传统k-means算法的计算效率。改进方法主要考虑到传统的K-means重分配过程中绝大数的计算过程是非必要的,因为算法需要找到最近的聚类中心点,却不关心数据点与其他聚类中心的距离。这种改进思路是基于Kd-tree with BBF算法的精神拓展的,本文提到的与它有这类似之处。先通过三角不等式确定每个聚类中心不需要计算的点的范围,再通过使用多个随机空间分区树将数据组成邻居点来有效的识别那些易于"移动"的点。实验结果表明,该方法在计算大型视觉词汇数据集时比传统的算法有更高的计算效率,大约减少了了41%的计算时间。 本文还评估了一些参数的影响,包括总的数据量,聚类数量,阈值大小。它们被证明不同的参数选取可以有效的减少时间消耗,但代价是聚类的效果会降低。
关键词: 图像处理 K-means 算法;三角不等式原理 随机树
For information in English, please click here
Closed K-means improved algorithm with triangular inequality
Abstract:This paper introduces a closed K-means improved algorithm that adds triangular inequalities, which significantly improves the computational efficiency of the traditional k-means algorithm. The improvement method mainly considers that the calculation process of the vast majority in the traditional K-means redistribution process is unnecessary, because the algorithm needs to find the nearest cluster center point, but does not care about the distance between the data points and other cluster centers. This improvement is based on the spirit of the Kd-tree with BBF algorithm, and the similarities mentioned in this article.First, the triangle inequality is used to determine the range of points that each cluster center does not need to calculate, and then the data is formed into neighbor points by using multiple random space partition trees to effectively identify those points that are easy to "move". The experimental results show that this method has higher computational efficiency than the traditional algorithm in computing large visual vocabulary datasets, and reduces the computation time by about 41%. This paper also evaluates the impact of some parameters, including the total amount of data, the number of clusters, and the threshold size. They have been shown that different parameter selection can effectively reduce time consumption, but at the cost of clustering.
Keywords: Image Processing K-means algorithm Principle of triangular inequality Random tree
基金:
引用
No.****
动态公开评议
共计0人参与
勘误表
加入三角不等式的闭合的改进算法
评论
全部评论0/1000