混合近邻和多簇合并的密度峰值聚类算法

doi:10.13195/j.kzyjc.2024.1232

首页 > 过刊浏览>2025年第40卷第7期 >2194-2202. DOI:10.13195/j.kzyjc.2024.1232

混合近邻和多簇合并的密度峰值聚类算法
DOI:
                        10.13195/j.kzyjc.2024.1232
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:TP301.6
基金项目:国家自然科学基金项目(62066030).

Density peak clustering algorithm with mixed nearest neighbors and multi-cluster merging

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

密度峰值聚类算法简单、高效, 可识别任意维度和形状类簇, 已在各领域得到广泛应用. 然而, 密度峰值聚类算法也存在一些问题, 如: 对截断距离参数敏感、难以发现低密度区域的类簇中心以及容易产生“多米诺效应”. 为此, 提出混合近邻和多簇合并的密度峰值聚类算法. 首先, 综合考虑样本的全局分布与局部结构, 引入自然近邻与$ k $近邻重新定义局部密度, 消除对截断距离参数的敏感, 并提高低密度区域样本的局部密度以增加类簇中心的识别度; 其次, 将样本划分为多个微簇, 并利用簇间关联度进行合并, 减少距离类簇中心较远的样本的分配错误, 从而有效缓解分配错误连带效应. 使用人工数据与真实数据进行测试, 结果表明, 所提出算法的综合性能优于对比算法.

Abstract:

The density peak clustering algorithm is simple and efficient, capable of identifying clusters of arbitrary dimensions and shapes, and has been widely applied in various fields. However, this algorithm also has some issues, such as sensitivity to the truncation distance parameter, difficulty in finding the cluster centers of low-density regions, and a tendency to produce the ‘domino effect’. To address these issues, this paper proposes a density peak clustering algorithm with mixed nearest neighbors and multi-cluster merging. First, by comprehensively considering the global distribution and local structure of the samples, natural nearest neighbors and $ k $-nearest neighbors are introduced to redefine local density, eliminating sensitivity to the truncation distance parameter and augmenting the local density of samples in low-density regions to improve the identification of cluster centers. Then, the samples are divided into multiple micro-clusters, and inter-cluster association is utilized for merging to reduce the misallocation of samples that are far from the cluster centers, thereby alleviating the ripple effect of allocation errors. Finally, tests conducted on both synthetic and real datasets demonstrate that the proposed algorithm outperforms its comparative counterparts in overall performance.

参考文献

相似文献

引证文献

引用本文

吕莉,赵妞,肖人彬,等.混合近邻和多簇合并的密度峰值聚类算法[J].控制与决策,2025,40(7):2194-2202

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-10-21
最后修改日期:
录用日期:
在线发布日期: 2025-06-05
出版日期: 2025-07-20

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码