基于ResNet34_D改进YOLOv3模型的行人检测算法
CSTR:
作者:
作者单位:

1. 河海大学 能源与电气学院,南京 211100;2. 南京理工大学 自动化学院, 南京 210094

作者简介:

通讯作者:

E-mail: qhmin0316@163.com.

中图分类号:

TP391

基金项目:

中央高校基本科研业务费专项基金项目(2018B15514).


Pedestrian detection based on developed YOLOv3 with ResNet34_D
Author:
Affiliation:

1. College of Energy and Electrical Engineering,Hohai University,Nanjing 211100,China;2. College of Automation,Nanjing University of Science and Technology,Nanjing 210094,China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对自动驾驶场景下行人检测任务中对中、小尺寸目标和被遮挡目标的检测需求,以及现有深度学习模型的不足,提出基于ResNet34_D的改进YOLOv3模型:通过改进残差网络的卷积块结构提出ResNet34_D,并作为YOLOv3的主干网络以降低模型尺寸和训练难度;在ResNet34_D的3个尺度卷积特征图之后,增加SPP层和DropBlock模块以提高模型的泛化能力;基于K-means聚类算法确定自适应的多尺度锚框尺寸,提高对大、中、小3种尺寸行人目标的检测能力;引入DIoU损失函数,提高对被遮挡目标的识别能力.所提出模型的消融实验验证了各个改进部分在提高模型检测准确率上的有效性.实验结果表明,所提出的基于ResNet34_D的改进YOLOv3模型具有较好的准确率和实时性,在BDD100K-Person数据集上的AP50达到69.8%,检测速度达到130FPS.由所提出方法与现有目标检测方法的对比实验可知,所提出方法对小目标和遮挡目标的误检率更低,速度更快,具有一定的实际应用价值.

    Abstract:

    Pedestrian detection is one of the main tasks of autonomous driving. The existed deep neural network is lack of the ability to detect small-size or medium-size objects and occluded objects, which is the requirement of pedestrian detection since pedestrians in the images acquired by vehicle-equipped cameras are always small or medium or occluded. In this paper, an improved YOLOv3 model based on ResNet34_D is proposed for pedestrian detection. And the contributions of the improved model are as follows. Firstly, the developed residual network ResNet34_D by modifying the structure of convolutional block is proposed, and it is selected as the backbone of YOLOv3 to reduce the size of the model so as to decrease the training difficulty. Secondly, the SPP layer and the DropBlock module are introduced after the feature maps of three stages of ResNet34_D, which can improve the detection accuracy of pedestrian objects with different sizes. Thirdly, to further increase the detection accuracy, the multi-scale anchors are determined using the K-means. Finally, the DIoU loss function is used to improve the ability of detecting the occluded objects. Ablation experiments for the proposed model demonstrate the effectiveness of each developed technologies in improving detection accuracy. And more experimental results show that the AP50 of the proposed model on BDD100K-Person dataset reaches 69.8%, and the detection speed can achieve 130FPS. Comparison experiments between the proposed method and the other existed methods demonstrate that, using the proposed method, the false detection rate for small targets and occlusion targets is lower, and the speed is faster, therefore, the proposed improved YOLOv3 model based on Resnet34_D is valuable in practical applications.

    参考文献
    相似文献
    引证文献
引用本文

钱惠敏,陈纬,马宜龙,等.基于ResNet34_D改进YOLOv3模型的行人检测算法[J].控制与决策,2022,37(7):1713-1720

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2022-05-25
  • 出版日期: 2022-07-20
文章二维码