A*-PPO融合的建材搬运机器人路径规划
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TU6;TP242.6

基金项目:

低品位煤矸石胶结充填体的梯级强化机制与损伤演变调控项目(52574467);江苏省卓越博士后项目(2023ZB094);矿震下煤矿采空区岩层动力失稳与建筑安全控制研究项目(20240343).


A*-PPO fusion for building material handling robot path planning
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    路径规划是智能建造中建材搬运机器人高效、安全作业的核心挑战, 尤其是在动态复杂工况下, 传统方法常面临路径震荡、避障失败以及全局-局部决策失配等问题. 鉴于此, 提出A*-PPO协同优化框架. 具体做法如下: 1)改进A*算法采用八邻域扩展和切比雪夫距离构建栅格化全局拓扑; 2)设计六维观测向量驱动的动态奖励函数, 集成路径跟踪奖励、碰撞惩罚以及步长约束; 3)建立特征级参数共享机制, 通过动态窗口法(DWA)将A*路径特征嵌入近端策略优化(PPO)网络, 以实现全局代价估计与局部避障决策同步优化. 仿真验证: 在4类典型环境中的仿真表明, 所提出方法相较于RRT*-APF与传统A*算法, 在动态障碍场景下路径成功率提升了42.7% (传统方法均失败), 规划时间减少了55.8%, U型凹面障碍耦合动态干扰时成功避障了98次. 技术突破: 通过渐近式航点验证和双层优化架构, 能够解决拓扑保持与实时避障的兼容性难题以及建筑机器人路径震荡和避障延迟问题.

    Abstract:

    Path planning is the core challenge for efficient and safe operation of building material handling robots in intelligent construction, especially in dynamic and complex working conditions, traditional methods often face problems such as path oscillations, obstacle avoidance failures and global-local decision mismatch. An A* and proximal policy optimization (PPO) collaborative optimisation framework is proposed to address this problem. The specific approaches are as follows: 1) An A* algorithm is improved by using eight-neighbourhood extension with Chebyshev distance to construct a gridded global topology; 2) A six-dimensional observation vector-driven dynamic reward function is designed, which integrates path tracking rewards, collision penalties, and step-size constraints; 3) A feature-level parameter-sharing mechanism is established, and A* path features are embed into the proximal policy optimization network using a dynamic window approach (DWA) to achieve global cost estimation and local obstacle avoidance decision synchronous optimisation. Simulation in four types of typical site environments shows that, compared with the RRT*-APF and traditional A* algorithms, the proposed method improves the success rate of the path in dynamic obstacle scenarios by 42.7% (the traditional algorithm fails), reduces the planning time by 55.8%, and successfully avoids obstacles 98 times when U-shaped concave obstacles are coupled with dynamic interference. Through progressive waypoint verification and two-layer optimisation architecture, the compatibility problem between topology maintenance and real-time obstacle avoidance is solved, and the path oscillation and obstacle avoidance delay problem of the construction robot is solved.

    参考文献
    相似文献
    引证文献
引用本文

尹航,郭岱羲,路沙沙,等. A*-PPO融合的建材搬运机器人路径规划[J].控制与决策,2026,41(4):965-976

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-06-08
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-03-24
  • 出版日期: 2026-04-10
文章二维码