基于Petri网与多智能体深度强化学习的AGV路径规划
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TH165+.1;TP23;TP18

基金项目:

国家自然科学基金项目(62073107);浙江省自然科学基金项目(LZ21F030002).


AGV path planning based on Petri net and multi-agent deep reinforcement learning
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    在无人仓库系统中, 解决自动导引车(AGV)间的碰撞、死锁以及路径规划问题至关重要. 鉴于此, 提出一种用Petri网对仓库环境中AGV系统进行建模的方法, 以有效解决AGV运输货物时产生冲突的问题. 在此基础上, 提出一种多智能体深度强化学习AGV路径规划框架, 视AGV路径规划问题为部分可观测马尔可夫决策过程, 将深度确定性策略梯度算法扩展至多智能体系统, 通过设计AGV的观测空间、状态空间、动作空间以及奖励函数来实现Petri网中AGV无冲突路径规划. 在设置奖励函数时加入Petri网触发条件后的反馈, 以极大程度地减少AGV运输货物时拥塞的产生, 增加仓库在规定时间内的送货总量. 此外, 所提出框架将路径分支点设置为智能体, 以有效地应对多个任务起点随机产生以及环境中AGV数量时刻变化的情况, 提升神经网络泛化能力. 仿真实验在AnyLogic软件平台中进行, 通过对比不同AGV规模下的货物运输情况以及奖励函数中有无Petri网条件正负反馈的对照实验, 验证所提出路径规划方法的可行性和有效性.

    Abstract:

    In unmanned warehouse systems, it is important to solve the collision, deadlock and path planning problems between automated guided vehicles(AGVs). This paper presents a method of a modeling AGV system in warehouse environment with Petri net, which effectively solves the problem of conflict when the AGV transports goods. On this basis, a multi-agent deep reinforcement learning AGV path planning framework is proposed. The AGV path planning problem is regarded as a partially observable Markov decision process, and the deep deterministic policy gradient algorithm is extended to multi-agent systems. Observation space, state space, action space and reward function of the AGV are designed to realize AGV conflict-free path planning in Petri net. Due to the addition of feedback after Petri net trigger condition when setting the reward function, the congestion generated is greatly reduced when the AGV transports goods, and the total amount of delivery in the warehouse is increased within the specified time. In addition, because the proposed framework sets the path branch points as agents, it can effectively cope with the random generation of multiple task starts and the change of the number of AGVs in the environment, and improve the generalization ability of neural networks. Simulation experiments are carried out on the AnyLogic software platform. The feasibility and effectiveness of the path planning method are verified by comparing the cargo transportation situation under different AGV scales and the control experiments with or without Petri net condition positive and negative feedback in the reward function.

    参考文献
    相似文献
    引证文献
引用本文

于绍琪,田玉平.基于Petri网与多智能体深度强化学习的AGV路径规划[J].控制与决策,2025,40(5):1438-1446

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2023-12-29
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-04-15
  • 出版日期: 2025-05-20
文章二维码