基于多尺度融合和高分辨特征增强的无人机航拍目标检测
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391.4

基金项目:

国家自然科学基金项目(61573305);河北省自然科学基金项目(F2022203038, F2019203511);河北省级重点实验室绩效补助经费项目(22567612H).


UAV aerial target detection based on multi-scale fusion and high-resolution feature enhancement
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    无人机飞行高度的动态变化使得航拍图像中往往包含大量小目标, 同时目标尺度变化显著, 这些问题给目标检测任务带来了挑战. 针对上述问题, 提出一种基于多尺度融合和高分辨特征增强的无人机航拍目标检测方法. 首先, 在骨干网络中引入多尺度结构重参数化特征提取模块, 利用普通卷积块和结构重参数化的大核卷积块对多个分支进行不同尺度的卷积运算, 有效提取不同感受野下的特征信息; 然后, 在颈部网络中引入基于特征金字塔网络的多维特征自适应融合模块, 以优化其自下而上的特征聚合过程, 实现对浅层特征中的精细细节和深层特征中的上下文信息的自适应选择, 从而更有效地应对目标尺度显著变化; 最后, 在颈部网络中引入多尺度特征融合小目标增强模块, 以捕捉无人机航拍图像中小目标物体在不同尺度上的变化. 通过在VisDrone2019和TinyPerson两个公开数据集上进行大量的实验, 表明了所提出方法的有效性和优越性.

    Abstract:

    The dynamic change of UAV flight height makes aerial images often contain a large number of small targets, and the target scale changes significantly. These problems bring challenges to the target detection task. In view of the above problems, this paper proposes a UAV aerial target detection method based on multi-scale fusion and high-resolution feature enhancement. Firstly, a multi-scale structure re-parameterized feature extraction module is introduced into the backbone network. The convolution operation of different scales is performed on multiple branches by using ordinary convolution blocks and structure re-parameterized large-core convolution blocks, and the feature information under different receptive fields is effectively extracted. Then, a multi-dimensional feature adaptive fusion module based on the feature pyramid network is introduced into the neck network to optimize its bottom-up feature aggregation process, so as to realize the adaptive selection of fine details in shallow features and context information in deep features, so as to deal with the significant change of target scale more effectively. Finally, a multi-scale feature fusion small target enhancement module is introduced in the neck network to capture the changes of small target objects in UAV aerial images at different scales. Extensive experiments on two public datasets, VisDrone2019 and TinyPerson, demonstrate the effectiveness and superiority of the proposed method.

    参考文献
    相似文献
    引证文献
引用本文

陈志旺,肖迪创,吕昌昊,等.基于多尺度融合和高分辨特征增强的无人机航拍目标检测[J].控制与决策,2025,40(7):2290-2299

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-12-02
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-06-05
  • 出版日期: 2025-07-20
文章二维码