基于数据分布特性的代价敏感宽度学习系统
CSTR:
作者:
作者单位:

1. 湖南师范大学 信息科学与工程学院,长沙 410081;2. 中南大学 自动化学院,长沙 410083

作者简介:

通讯作者:

E-mail: ljp@hunnu.edu.cn.

中图分类号:

TP273

基金项目:

国家自然科学基金项目(61971188);湖南省自然科学基金项目(2018JJ3349);湖南省教育厅优秀青年项目(19B364);湖南省知识产权战略推进专项项目(2019F012K);湖南省研究生科研创新项目(CX20190415).


Data distribution-based cost-sensitive broad learning system
Author:
Affiliation:

1. College of Information Science and Engineering,Hunan Normal University,Changsha 410081,China;2. College of Automation,Central South University,Changsha 410083,China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    宽度学习系统(broad learning system,BLS)作为深度神经网络的替代框架,具有快速自适应模型结构选择和在线增量学习能力,被认为是知识发现和数据工程领域中一种极具前途的技术.传统的BLS主要应用于数据分 布均衡且误分类代价相同的模式分类任务,但大多数实际应用的数据是非均衡分布的,如网络入侵监测、医疗诊断、信用卡欺诈检测等.基于此,提出一种基于数据分布特性的代价敏感BLS(data distribution-based cost-sensitive-BLS,DDbCs-BLS),解决数据分布不均、误分代价不同的模式分类任务.DDbCs-BLS在充分考虑数据统计分布特性的基础上寻找代价敏感型BLS分类器的最佳分类边界,保证少数类样本信息不被丢失,从而提高BLS在各类数据集上的模式分类性能.在多种公共数据集(包括均衡和不均衡数据集)上进行大量的验证性和对比性实验,结果表明DDbCs-BLS能有效确定分类边界线的最佳位置,无论是在均衡数据集还是在不均衡数据集上均能获得更好的分类性能.

    Abstract:

    Broad learning system(BLS) provides a flexible modeling framework, which is a potential substitute of deep neural network models. Due to its fast adaptive ability of automatic model structure selection and online incremental learning strategies, BLS is referred to as a promising technology in the field of knowledge discovery and data engineering. However, traditional BLS model are mainly aimed at pattern classification tasks with approximately even-distributed data and equal misclassification cost. In real applications, most of pattern recognition tasks are unevenly-distributed, such as credit card fraud detection, network intrusion detection, medical diagnosis, etc. In this paper, a data distribution-based cost-sensitive-BLS (DDbCs-BLS) is proposed for solving the problem of pattern classification tasks with imbalance data and varying misclassification costs on different classes. The DDbCs-BLS can achieve the best classification boundary by adopting the cost sensitive BLS learners, and ensure the lossless of the information of sparse classes, so as to ensure the classification performance of the BLS classifier in various data sets. The DDbCs-BLS is validated on multiple public data sets (including balanced and imbalanced data sets). Extensive validation and comparative results show that the DDbCs-BLS can effectively determine the best location of the classification boundary line, consequently, it can achieve better classification performance on both balanced and imbalanced data sets.

    参考文献
    相似文献
    引证文献
引用本文

徐鹏飞,王敏,刘金平,等.基于数据分布特性的代价敏感宽度学习系统[J].控制与决策,2021,36(7):1686-1692

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2021-06-16
  • 出版日期: 2021-07-20
文章二维码