基于增强型 PPO-PID 的机械臂跟踪控制

基于增强型 PPO-PID 的机械臂跟踪控制
DOI:
                        
CSTR:
                        
作者:
                        
作者单位:1.燕山大学电气工程学院;2.石家庄铁道大学机械工程学院
作者简介:
通讯作者:
中图分类号:TP241.3
基金项目:国家自然科学基金项目(62303398)；省级重点实验室绩效补助经费项目(22567612H)；河北省自然科学基金 (E2025210100)，河北省高等学校科学研究项目 (QN2025159)；骨干人才项目(留学回国平台）（C2025006）。

Enhanced PPO-PID for Tracking Control of Robotic Manipulators

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

针对多自由度机械臂的轨迹跟踪问题, 本文提出一种基于增强型近端策略优化的 PID(Enhanced PPO-PID) 控制算法. 首先, 该方法构建了基于共享特征提取层的统一网络架构, 在提升策略与价值函数协同优化能力的同时, 显著减少模型参数量, 提高了收敛速度与学习效率；其次, 提出一种基于性能回报动态调节训练迭代次数的机制, 从而实现训练初期快速收敛与后期策略稳定之间的平衡；再次, 引入价值函数裁剪, 通过限制单次更新幅度, 有效平滑学习曲线, 增强了高方差环境下的训练鲁棒性；最后, 在 PandaReach-v3 机械臂仿真平台上, 通过对比实验验证了所提方法在轨迹跟踪性能和鲁棒性方面的优越性.

Abstract:

This paper proposes a PID control algorithm based on enhanced proximal policy optimization (Enhanced PPO-PID) to address the trajectory tracking problem of multi-degree-of-freedom robotic arm system. First, the proposed method constructs a unified network architecture based on a shared feature extraction layer, which significantly reduces the number of model parameters while enhancing the synergistic optimization capability of the policy and value functions. The new architecture can improve the convergence speed and learning efficiency. Second, we propose a mechanism to dynamically adjust the number of training iterations based on reward, which results in achieving a balance between rapid convergence in the early stages of training and policy stability in the later stages. Third, we introduce a value function clipping to effectively smooth the learning curve by limiting the amplitude of a single update,thereby enhancing the robustness of training in high-variance environments. Finally, comparative experiments are conducted on the PandaReach-v3 robotic arm system to verify the superiority of the proposed method in trajectory tracking performance and robustness

参考文献

相似文献

引证文献

引用本文

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-12-26
最后修改日期:2026-03-16
录用日期:2026-03-17
在线发布日期:
出版日期:

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码