基于网格自组织粒球模型的不平衡回归方法
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP39

基金项目:

重庆市自然科学基金创新发展联合基金项目(CSTB2023NSCQ-LZX0006);国家自然科学基金重点项目(62233018);成都市重点研发计划项目(2023-YF11-00059-HZ).


An imbalanced regression method based on grid self-organized granular ball model
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    现有的不平衡数据回归算法需要多次计算样本间的距离, 当样本数量较大时, 处理效率较低. 粒球模型可将样本集合迭代划分为多个粒球, 以降低样本规模. 但是, 当前的粒球模型依赖于样本类别标签, 不适合回归任务. 鉴于此, 首先, 利用粒球内样本区域的网格划分, 定义粒球的填充度, 提出一种网格自组织粒球模型(GSOGB), 能够同时处理回归任务和分类任务; 然后, 在此基础上, 给出粒球内样本在邻域内的过采样策略, 提出基于网格自组织粒球模型的不平衡回归方法(GSOGB-SMOTER). 实验结果表明: 所提出GSOGB在12个分类数据集上优于现有粒球模型; 所提出GSOGB-SMOTER在10个不平衡回归数据集连续目标值域的5个等长分区的MSE指标上略优于文献中的7种算法, 且具有鲁棒性和更高的运行效率, 能够快速处理较大规模数据的不平衡回归.

    Abstract:

    Existing imbalanced data regression algorithms require multiple calculations of distances between samples, which leads to low processing efficiency when the number of samples is large. The granular ball model can iteratively divide a sample set into multiple granular balls, reducing the sample scale. However, current granular ball models rely on sample category labels and are not suitable for regression tasks. This paper utilizes the grid division of the granular ball sample area, defines the filling degree of the granular ball, and proposes a grid self-organized granular ball model (GSOGB) that can handle both regression and classification tasks simultaneously. On this basis, an oversampling strategy for samples within the granular ball in the neighborhood is given, and an imbalanced regression method based on the grid self-organized granular ball model (GSOGB-SMOTER) is proposed. Experimental results show that the proposed GSOGB outperforms existing granular ball models on 12 classification datasets, and the proposed GSOGB-SMOTER slightly outperforms seven algorithms in the literature on the MSE metric of five equal-length target value domains of 10 imbalanced regression datasets, and has robustness and higher operational efficiency, capable of quickly processing large-scale imbalanced data for regression.

    参考文献
    相似文献
    引证文献
引用本文

胡峰,周雨龙,苏祖强,等.基于网格自组织粒球模型的不平衡回归方法[J].控制与决策,2025,40(8):2513-2524

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-11-01
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-07-11
  • 出版日期: 2025-08-20
文章二维码