基于卷积混合注意力机制的多目标跟踪算法
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391.4

基金项目:

国家自然科学基金项目(62102272);辽宁省博士科研启动基金计划项目(2023-BS-130).


Multi-target tracking algorithm based on convolutional hybrid attention mechanism
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    基于检测的多目标跟踪方法在复杂场景问题上达到了较好的效果, 但已有研究大多关注于时空特征关联而忽视了提高检测性能所能带来的全局跟踪收益. 据此, 提出一种卷积混合注意力机制, 该模块结合动态稀疏通道注意力和空间位置注意力: 在处理通道注意力时, 整合空间上下文信息, 动态调整通道权重; 在处理空间注意力时, 结合不同通道特征评估空间区域的重要性, 旨在优化注意力分配并提升检测精度. 进一步地, 提出一种两阶段多目标跟踪方法 —— CHAMTrack, 通过在运动目标检测阶段使用该注意力机制, 增强算法在复杂场景中对关键信息的捕捉能力, 提升不同尺度目标的跟踪效果, 降低跟踪过程中漏检和ID切换的发生率. 在MOT17和MOT20数据集上的实验结果表明, CHAMTrack在MOTA指标上分别提升$2.1\, \% $和$1.3\, \% $, 在IDSw.指标上分别提升$ 28 \,\%$和$ 20.5\, \%$, 显著提升了多目标跟踪算法在复杂场景中的效果和鲁棒性.

    Abstract:

    Currently, detection-based multi-target tracking methods have achieved better results in complex scene problems, but most of the existing research focuses on spatio-temporal feature correlation and neglects the global tracking benefit that can be brought by improving detection performance. Accordingly, this paper proposes a convolutional hybrid attention mechanism, which combines dynamic sparse channel attention and spatial location attention: when dealing with channel attention, it integrates spatial context information to dynamically adjust the channel weights; when dealing with spatial attention, it combines different channel features to evaluate the importance of spatial regions, aiming at optimising the allocation of attention and improving the detection accuracy. Further, this paper proposes a two-phase multi-target tracking method, CHAMTrack, to enhance the algorithm's ability to capture key information in complex scenes, improve the tracking effect of targets at different scales, and reduce the occurrence rate of omission and ID switching in the tracking process by using the attention mechanism in the detection phase of moving targets. The incidence of missed detection and ID switching during the tracking process is reduced. The experimental results on MOT17 and MOT20 datasets show that the CHAMTrack improves the MOTA metrics by $2.1\,\% $ and $ 1.3\,\%$, and the IDSw. metrics by $28\,\% $ and $20.5\,\% $, which significantly improves the effectiveness and robustness of the multi-target tracking algorithms in complex scenes.

    参考文献
    相似文献
    引证文献
引用本文

郭崇,刘晟,张文波,等.基于卷积混合注意力机制的多目标跟踪算法[J].控制与决策,2025,40(4):1127-1135

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-05-13
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-03-21
  • 出版日期: 2025-04-20
文章二维码