基于混合注意力机制的多信息行人过街意图预测
CSTR:
作者:
作者单位:

沈阳工业大学 信息科学与工程学院,沈阳 110870

作者简介:

通讯作者:

E-mail: sanghaif@163.com.

中图分类号:

TP391

基金项目:

国家自然科学基金项目(62173078);辽宁省自然科学基金项目(2022-MS-268).


Multi information pedestrian crossing intention prediction based on mixed attention mechanism
Author:
Affiliation:

School of Information Science and Engineering,Shenyang University of Technology,Shenyang 110870,China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    提前预测道路两旁的行人是否存在过街意图或一段时间后是否会出现过街行为是自动驾驶汽车面临的重要挑战之一,如何有效融合不同模态的多元信息是准确预测行人过街意图的重要问题.基于此,提出一种基于混合注意力机制的多信息融合预测模型,使用一种基于交叉注意力机制的图像特征融合网络来提取原始图像与语义图像之间的互补信息,并使模型更加关注与行人过街行为有关的图像部分.同时,提出一种融合注意力机制的分级GRU模块,用以捕捉不同模态的非视觉信息对行人过街意图的影响.在PIE和JAAD数据集上对所提模型进行对比实验,已验证其具有领先于同类研究的性能;针对所提出模块进行广泛的消融实验,表明其有效性.

    Abstract:

    Predicting in advance whether pedestrians on both sides of the road have the intention to cross the street or whether crossing behavior will occur after a period of time is one of the important challenges facing self-driving cars. How to effectively fuse the multi-information from these different modalities is an important issue in accurately predicting pedestrian crossing intentions. Therefore, this paper proposes a multi-information fusion prediction model based on a hybrid attention mechanism. The model uses an image feature fusion network based on a cross-attention mechanism to extract complementary information between the original image and the semantic image and to make the model more attentive to the parts of the image that are relevant to the behavior of the pedestrian crossing the street. We also propose a hierarchical gated recurrent unit(GRU) module incorporating an attentional mechanism to capture the effects of different modalities of non-visual information on pedestrian crossing intentions. Finally, the proposed model is compared on the PIE and JAAD datasets and achieves leading performance, and extensive ablation experiments are conducted on the proposed module to prove its effectiveness.

    参考文献
    相似文献
    引证文献
引用本文

桑海峰,刘玉龙,刘泉恺.基于混合注意力机制的多信息行人过街意图预测[J].控制与决策,2024,39(12):3946-3954

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2024-11-20
  • 出版日期: 2024-12-20
文章二维码