基于密度的划分式聚类过程参数选择算法
CSTR:
作者:
作者单位:

中国人民解放军军械工程学院信息工程系,石家庄050003.

作者简介:

吴杨

通讯作者:

中图分类号:

TP309

基金项目:

国家自然科学基金项目(61173191);军内科研项目(YJJXM12033).


Clustering parameters selection algorithm based on density for divisional clustering process
Author:
Affiliation:

Department of Information Engineering,Ordnance Engineering College of PLA,Shijiazhuang 050003,China.

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为确定??-means 等聚类算法的初始聚类中心, 首先由样本总量及其取值区间长度确定对应维上的样本密度统计区间数, 并将满足筛选条件的密度峰值所在区间内的样本均值作为候选初始聚类中心; 然后, 根据密度峰值区间在各维上的映射关系建立候选初始聚类中心关系树, 进一步采用最大最小距离算法获得初始聚类中心; 最后为确定最佳聚类数, 基于类内样本密度及类密度建立聚类有效性评估函数. 针对人工数据集及UCI 数据集的实验结果表明了所提出算法的有效性.

    Abstract:

    In order to select the initial clustering centers for the divisional clustering algorithm such as the ??-means algorithm, the sample density calculating regions number of each dimension is confirmed according to the samples number and their values, firstly. Then, the average value of the samples of the region with peak value satisfying the filtering conditions is taken as the candidate for the initial clustering center, and a relationship tree of the candidates is established on the mapping relations of the regions. Furthermore, the initial clustering centers are selected by using the maximal-minimal distance algorithm. To confirm the best number of the clusters, a clustering quality evaluation function is established according to the sample density and cluster density. Experiment results of the manual and UCI data sets show the effectiveness of the proposed algorithms.

    参考文献
    相似文献
    引证文献
引用本文

吴杨 王韬 李进东.基于密度的划分式聚类过程参数选择算法[J].控制与决策,2016,31(1):21-29

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2014-10-19
  • 最后修改日期:2015-01-26
  • 录用日期:
  • 在线发布日期: 2016-01-20
  • 出版日期:
文章二维码