Infrared and visible image fusion based on multi-scale and attention mechanism
Affiliation:

1. School of Mechanical Engineering, Shenyang Jianzhu University, Shenyang 110168, China; 2. Key Laboratory of Optical-Electronics Information Processing, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110169, China

Corresponding author:

E-mail: hczhao@sia.cn

CLC number:

TP391

Fund project:

Equipment Pre-Research Key Fund Project (41401040105).

    Abstract:

    Existing infrared and visible image fusion algorithms usually extract image features at a single scale, so the fused image cannot fully retain the feature information of the source images. To address this problem, an auto-encoder network based on multi-scale features and attention mechanisms is proposed for infrared and visible image fusion. First, the encoder network is built from dense connections and multi-scale attention modules, and a self-attention mechanism is introduced to strengthen the dependencies between pixels, so that the salient targets of infrared images and the detailed textures of visible images are fully extracted. Then, in the feature fusion stage, a joint channel and spatial attention fusion network further fuses the typical features of the images. Next, a hybrid loss function based on pixels, structural similarity and color is designed to guide network training and to further constrain the similarity between the fused image and the source images. Finally, the subjective and objective evaluation results of comparative experiments verify that the proposed algorithm achieves better image fusion performance than other representative fusion algorithms.
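
    The abstract names the three terms of the hybrid loss (pixels, structural similarity and color) but gives neither their formulas nor their weights. The NumPy sketch below is only one plausible reading, not the authors' implementation. Everything in it is an assumption made for illustration: the function names `global_ssim` and `hybrid_loss`, the equal default weights, the use of whole-image statistics for SSIM instead of a sliding window, and the cosine-based color term measured against the visible image.

```python
import numpy as np


def global_ssim(x, y, data_range=1.0):
    """SSIM from whole-image statistics (assumption: no sliding window)."""
    c1 = (0.01 * data_range) ** 2  # standard SSIM stabilising constants
    c2 = (0.03 * data_range) ** 2
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()
    return ((2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)) / (
        (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    )


def hybrid_loss(fused, infrared, visible, w_pixel=1.0, w_ssim=1.0, w_color=1.0):
    """Hypothetical hybrid loss; weights and term definitions are assumptions.

    fused, visible: float arrays of shape (H, W, 3) with values in [0, 1].
    infrared:       float array of shape (H, W) with values in [0, 1].
    """
    fused_gray = fused.mean(axis=2)
    visible_gray = visible.mean(axis=2)

    # Pixel term: mean squared error against each source image.
    pixel = np.mean((fused_gray - infrared) ** 2) + np.mean(
        (fused_gray - visible_gray) ** 2
    )

    # Structural term: (1 - SSIM) against each source image.
    ssim = (1.0 - global_ssim(fused_gray, infrared)) + (
        1.0 - global_ssim(fused_gray, visible_gray)
    )

    # Color term: mean (1 - cosine similarity) between the RGB vectors
    # of the fused image and the visible image at each pixel.
    eps = 1e-8
    dot = (fused * visible).sum(axis=2)
    norms = np.linalg.norm(fused, axis=2) * np.linalg.norm(visible, axis=2)
    color = np.mean(1.0 - dot / (norms + eps))

    return w_pixel * pixel + w_ssim * ssim + w_color * color
```

    Under these assumptions the loss is close to zero when the fused image matches both sources and grows as pixel intensities, structure or color drift away from them. In a training setup the same terms would be written in an automatic-differentiation framework so that gradients can flow back to the network.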

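Cite this article

MIN Li, TIAN Linlin, ZHAO Huaici, et al. Infrared and visible image fusion based on multi-scale and attention mechanism[J]. Control and Decision, 2024, 39(1): 227-235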

History
  • Published online: 2023-12-14
  • Publication date: 2024-01-20