面向医学影像分类任务的深度学习模型鲁棒性与泛化性提升策略
CSTR:
作者:
作者单位:

中山大学

作者简介:

通讯作者:

中图分类号:

TP391.41;R318

基金项目:

国家自然科学基金项目(面上项目,重点项目,重大项目)


Strategies for Enhancing the Robustness and Generalization of Deep Learning Models in Medical Image Classification Tasks
Author:
Affiliation:

Fund Project:

The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    医学影像作为精准诊断的关键客观依据,正经历由深度学习辅助诊断系统带来的范式转变。然而,真实临床部署中跨设备、跨地域的数据分布偏移,以及模型面对微小对抗扰动时的极度脆弱性,严重阻碍了其安全落地应用。为此,本文提出一种兼容对抗攻击与分布变换的深度学习模型鲁棒性与泛化性提升框架。首先,利用引导滤波器处理肺部CT图像,提取肺腔先验信息生成增强数据,突出关键结构特征。其次,引入样本与权重双层扰动,结合预测反馈机制,在保证预测准确性的前提下施加最强对抗扰动。同时,针对增强数据可能削弱诊断特征的问题,设计基于注意力正则化的策略,通过代理模型生成权重噪声并注入原始模型,隐式融合肺腔轮廓先验,提升模型对关键病灶的关注度。最后,提出基于平方损失与KL散度的混合优化策略,改进传统极小-极大损失函数,缓解模型对数据分布的归纳偏差。在SARS-COV-2和COVID19-C数据集上的实验表明,该方法显著提升了模型的综合性能。特别是在Visformer模型上,其自然准确率较现有最优方法提升约3.4%,域外泛化准确率平均提升约25%。

    Abstract:

    As a crucial objective basis for precise diagnosis, medical imaging is experiencing a paradigm shift driven by deep learning-based computer-aided diagnosis systems. However, in real-world clinical deployment, cross-device and cross-regional data distribution shifts, combined with the extreme vulnerability of models to imperceptible adversarial perturbations, severely hinder their safe practical application. To address this, we propose a robustness and generalization enhancement framework compatible with both adversarial attacks and distribution shifts. First, a guided filter is applied to lung CT images to generate augmented data, highlighting key lung cavity structures. Second, dual-level perturbations on samples and weights are introduced, utilizing an early-stopping feedback mechanism to apply the strongest effective adversarial perturbations. Meanwhile, to compensate for the potential weakening of diagnostic features in augmented data, weight noise generated by a proxy model is injected into the original model, implicitly integrating lung contour priors. Finally, a mixed optimization strategy combining square loss and KL divergence is adopted to improve the traditional loss function and alleviate inductive bias. Experiments on datasets like SARS-COV-2 demonstrate that, on the Visformer model, our method improves natural accuracy by approximately 3.4\% and increases average out-of-distribution generalization accuracy by about 25\%, substantially enhancing the reliability of clinical auxiliary decision-making.

    参考文献
    相似文献
    引证文献
引用本文
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-01-21
  • 最后修改日期:2026-04-17
  • 录用日期:2026-04-18
  • 在线发布日期: 2026-05-06
  • 出版日期:
文章二维码