多模态数据驱动的两阶段区间决策方法
DOI:
CSTR:
作者:
作者单位:

1.合肥工业大学;2.中国科学技术大学

作者简介:

通讯作者:

中图分类号:

C934

基金项目:

国家自然科学基金(72571090, 72171066),安徽省研究生教学改革研究项目(2023jyjxggyjY028).


The two-stage interval decision-making method driven by multimodal data
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    在图像和文本等多模态数据可用条件下,利用单一深度网络进行融合决策,存在可解释性和可靠性挑战.针对挑战,提出一种图像和文本数据驱动的两阶段区间决策方法,包括准则预测和准则融合两个阶段.在准则预测阶段,依据各准则上的图像特征提取需求确定深度网络集合,设计基于有放回采样的深度网络性能统计比较方法确定网络排序,进而构建网络接续组合的选择方法确定可靠的最佳组合,产生基于图像的区间数预测值.在准则融合阶段,通过文本训练集学习准则权重,进而构建基于文本验证集的优化模型学习自适应权重函数,最后利用准则权重和自适应权重函数融合各准则上的图像生成预测值,产生可解释的总体预测值.以安徽合肥某三甲医院超声部的乳腺彩超图像和文本数据为基础,将提出方法用于乳腺病灶辅助诊断,验证了方法的有效性.

    Abstract:

    Under the condition that the multimodal data of image and text are available, the use of one deep network to make fusion decisions is a usual way. However, it faces the challenges of interpretability and reliability. To address these two challenges, this article proposes a two-stage interval decision-making method driven by image and text data, including the criterion prediction and criterion aggregation stages. In the criterion prediction stage, a set of deep networks is determined according to the requirements of extracting image characteristics, and then a statistical comparison method of deep network performance is designed based on sampling with replacement to determine the rankings of the deep networks in the set. With the resulting rankings, a method of selecting the sequential combinations of deep networks is constructed to identify the reliable optimal combination, which can be used to derive interval-valued predictions from images. In the criterion aggregation stage, criterion weights are learned from the training dataset of text, and then an optimization is constructed based on the verifying dataset of text to learn adaptive weight function. With the resulting criterion weights and weight function, the interval-valued predictions derived from images on each criterion are finally combined to generate the explainable overall predictions. Based on the image and text data collected from the ultrasonic department of a tertiary hospital in Hefei, Anhui, the proposed method is used to help diagnose breast lesions, which validates its effectiveness.

    参考文献
    相似文献
    引证文献
引用本文
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-12-12
  • 最后修改日期:2026-03-05
  • 录用日期:2026-03-05
  • 在线发布日期:
  • 出版日期:
文章二维码