一种深度信念网络泛化误差边界量化方法及应用
DOI:
CSTR:
作者:
作者单位:

北京工业大学

作者简介:

通讯作者:

中图分类号:

TP183

基金项目:

国家自然科学基金项目(面上项目,重点项目,重大项目),中国博士后科学基金,国家杰出青年科学基金


A method for quantifying generalization error boundary of deep belief network and its applications
Author:
Affiliation:

Fund Project:

China Postdoctoral Science Foundation,The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan),The National Science Fund for Distinguished Young Scholars

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    深度信念网络(Deep Belief Network, DBN)是一种深度学习模型, 解决了传统神经网络因梯度消失导致的深层训练难题, 目前已被广泛应用. 然而, 与其他机器学习模型一样, DBN泛化能力同样难以调控, 这对应用范围及效果提出了严峻挑战. 本文提出了一种深度信念网络泛化误差边界量化方法(DBN with Quantitative Generalization, DBN-QG), 旨在解决DBN泛化误差上界不可控的问题, 提升泛化能力. 首先, 构建一个DBN模型, 分析训练误差与测试误差在数据序列上的关联特性, 给出泛化误差的数学描述. 其次, 通过分析模型的Rademacher复杂度、稀疏度等特性, 提出泛化误差边界上限可量化约束性定理, 给出证明. 最后, 根据泛化误差上界约束, 给出DBN模型在训练阶段的最优参数配置, 并将最优参数配置的DBN-QG用于预测胶州湾近海岸水环境CO2浓度. 实验结果显示, 所提出的DBN-QG模型在预测精度和泛化性能方面均优于其他预测模型. 同时, DBN泛化误差的边界量化方法不仅提高了其在CO2预测方面的可解释性, 同时为机器学习模型的量化分析提供了较为一般化的理论基础.

    Abstract:

    Deep belief network (DBN) is an effective deep learning model, which has addressed the deep training problem caused by vanishing gradients in traditional neural networks and been widely applied in multiple fields. However, like the other machine learning models, DBN also faces the problem of difficulty in regulating generalization ability, which poses a severe challenge to the application scopes and effects. This paper proposes a method for quantifying generalization error boundary of DBN (DBN-QG), aiming to solve the problem that the upper bound of generalization error is uncontrollable and improve the generalization ability. First, a DBN model is constructed, among which the correlation characteristics of training error and testing error are analyzed on the data sequence, and then providing a mathematical description of the generalization error. Second, by analyzing the Rademacher complexity and sparsity degree as well as the other characteristics, a quantifiable constraint theorem for the upper limit of the generalization error boundary is proposed and further proved. Finally, based on the upper bound constraint of the generalization error, the optimal parameter configurations are given in the training process, and the resulting DBN-QG is used to predict the CO2 state of the coastal water environment in Jiaozhou Bay. The experimental results show that the proposed DBN-QG is superior to other models in terms of prediction accuracy and generalization performance. Furthermore, the boundary quantification method of DBN generalization error not only enhances its interpretability in CO2 prediction, but also provides a relatively generalized theoretical basis for the quantitative analysis of machine learning models.

    参考文献
    相似文献
    引证文献
引用本文
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-10-13
  • 最后修改日期:2026-04-16
  • 录用日期:2026-04-17
  • 在线发布日期:
  • 出版日期:
文章二维码