DI-YOLO: 一种面向无人机航拍图像的高效小目标检测框架
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391.41

基金项目:

“十四五”国家重点研发计划重点专项项目(2024YFE0199500, 2022YFF1101100).


DI-YOLO: An efficient small object detection framework for UAV aerial imagery
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对航拍图像中目标尺寸微小、纹理特征模糊以及分布密集带来的检测难题, 提出一种基于改进YOLO架构的DI-YOLO检测模型. 当前主流检测方法在微小目标结构信息保留以及多尺度特征提取和融合方面存在明显不足. 鉴于此, 构建内容感知特征增强模块(CARAFE), 通过动态特征选择机制实现跨层级特征的自适应融合; 同时, 设计并行异构特征调制模块(PHFM), 有效协调全局上下文建模与局部细节特征的关联性; 并引入形状感知交并比损失函数(Shape-IoU)和微小目标检测头, 进一步提升边界框回归精度和微小目标检测能力. 在VisDrone2019和DOTAv1.5基准数据集上的对比实验结果表明, 所提出模型较基准模型YOLOv10取得显著提升: 在VisDrone2019数据集上, mAP@0.5和mAP@0.5 : 0.95指标分别提升了12.7%和13.7%; 在DOTAv1.5数据集上, 对应提升了12.1%和10.2%, 且在计算效率方面保持优势. 消融实验进一步验证了各模块的有效性, 为航拍场景下的高精度目标检测提供了新的解决方案.

    Abstract:

    Addressing the detection challenges posed by small target sizes, blurred texture features, and dense distributions in aerial imagery, this research proposes a detection model based on an improved YOLO architecture, named drone imagery YOLO (DI-YOLO). Current mainstream detection methods show significant deficiencies in preserving structural information of small targets and multi-scale feature extraction and fusion. Therefore, we innovatively construct a content-aware reassembly of features (CARAFE) module, achieving adaptive fusion of cross-level features through a dynamic feature selection mechanism; simultaneously design a parallel heterogeneous feature modulator (PHFM) that effectively coordinates the relationship between global context modeling and local detail features; and introduce a shape-aware intersection over union (Shape-IoU) loss function and a tiny object detection he ad to further enhance bounding box regression accuracy and small target detection capabilities. Through comparative experiments on the VisDrone2019 and DOTAv1.5 benchmark datasets, the proposed model achieves significant improvements over the baseline YOLOv10 model: on the VisDrone2019 dataset, mAP@0.5 and mAP@0.5 : 0.95 metrics improve by 12.7% and 13.7%, respectively, while on the DOTAv1.5 dataset, corresponding improvements of 12.1% and 10.2% are achieved, with maintained advantages in computational efficiency. Ablation experiments further verify the effectiveness of each module, providing a new solution for high-precision object detection in aerial scenes.

    参考文献
    相似文献
    引证文献
引用本文

丁浩晗,贺万程,万俊,等. DI-YOLO: 一种面向无人机航拍图像的高效小目标检测框架[J].控制与决策,2025,40(10):3106-3116

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-04-22
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-09-09
  • 出版日期: 2025-10-20
文章二维码