基于跨模态交互融合与全局特征校准的无人机电线检测
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391.4

基金项目:

辽宁省自然科学基金项目(2025-BS-0254, 2025-BS-0248);国家重点研发计划项目(2017YFC0821005);辽宁省科技厅联合开放基金机器人学国家重点实验室开放基金项目(2020-KF-12-11);辽宁省研究生教育教学改革研究项目(LNYJG2024310);辽宁省教育科学“十四五”规划课题项目(JG25DB472).


UAV transmission line detection based on cross-modal interaction fusion and global feature modulation
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    无人机对电线的精确感知是电力系统运维与无人机自主避障的核心基础, 但复杂环境下的光照变化、背景干扰及电线自身特性, 给精准识别带来极大挑战. 现有单一模态检测方法依赖可见光或红外数据, 因复杂背景适应性差, 在恶劣条件下表现不佳, 存在明显局限性; 多模态检测虽通过融合可见光与红外数据提升了鲁棒性, 但在复杂环境适应性与任务特异性挖掘上仍有不足. 为此, 提出可见光与红外数据跨模态交互融合与全局特征校准检测方案: 通过跨模态交互引导融合模块(CIGF)实现双模态特征深度交互与优势互补, 通过全局特征重要性校准器(GFSM)精准校准枢纽特征并增强关键信息, 通过多感受野增强解码器(MRED)高效重建电线目标精细空间结构并实现像素级定位; 三大核心模块协同, 形成从特征提取、交互融合到全局校准再到精细解码的完整技术链路. 在无人机电线检测权威数据集VITLD上的实验显示, 该算法可满足检测精度与实时性的双重需求, 尤其在夜间低光照、雾天模糊、雪天遮挡等复杂极端环境中仍保持高精度, 突破传统方法应用瓶颈. 该方案可为解决无人机电线检测问题提供有效思路, 具有重要理论意义与实际应用价值.

    Abstract:

    Accurate perception of transmission lines by unmanned aerial vehicles (UAVs) is a core foundation for power system operation and maintenance as well as UAV autonomous obstacle avoidance. However, in complex environments, factors such as light changes, background interference, and the inherent characteristics of transmission lines themselves pose significant challenges to accurate identification. Existing single-modal detection methods rely on visual or infrared data. Restricted by their poor adaptability to complex backgrounds, these methods perform poorly under harsh conditions and have obvious limitations. Although multi-modal detection, which fuses visual and infrared data, has improved robustness compared with single-modal methods, existing multi-modal approaches still have shortcomings in adapting to complex environments and mining task-specific characteristics. To address these issues, a transmission line detection scheme based on cross-modal interaction fusion and global feature calibration for visual and infrared data is proposed. Specifically, thecross-modal interaction guided fusion (CIGF) module enables deep interaction and complementary advantages of dual-modal features; the global feature significance modulator (GFSM) accurately calibrates pivotal features and enhances key information; and the multi-receptive enhanced decoder (MRED) efficiently reconstructs the fine spatial structure of transmission line targets and achieves pixel-level positioning. These three core modules work together to form a complete technical chain covering feature extraction, interaction fusion, global calibration, and fine decoding. Experiments on VITLD, the authoritative dataset for UAV transmission line detection, show that the proposed algorithm meets the dual requirements of detection accuracy and real-time performance. Notably, it maintains high accuracy even in complex and extreme environments such as low light at night, foggy blur, and snowy occlusion, breaking through the application bottleneck of traditional methods. This scheme provides an effective approach to solving the problem of UAV transmission line detection and holds important theoretical significance and practical application value.

    参考文献
    相似文献
    引证文献
引用本文

田洪坤,姜囡,任涛.基于跨模态交互融合与全局特征校准的无人机电线检测[J].控制与决策,2026,41(3):639-650

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-07-21
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-03-04
  • 出版日期: 2026-03-10
文章二维码