具有误差约束的机械臂系统自适应强化学习控制

doi:10.13195/j.kzyjc.2025.1010

首页 > 过刊浏览>2026年第41卷第5期 >1331-1337. DOI:10.13195/j.kzyjc.2025.1010

具有误差约束的机械臂系统自适应强化学习控制
DOI:
                        10.13195/j.kzyjc.2025.1010
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:TP13
基金项目:国家自然科学基金项目(62103243)；山东省自然科学基金项目(ZR2025MS994).

Adaptive reinforcement learning control for robotic manipulator systems with error constraints

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

为了提高机械臂控制过程中的安全性, 针对机械臂系统的误差约束问题提出一种基于强化学习的自适应控制方法. 将机械臂的动力学系统转化为关于跟踪误差的动态方程, 然后利用一类误差转换函数, 将受约束的误差系统转换为新的不受约束系统, 并基于此系统设计最优控制器. 为了解决最优控制问题, 利用强化学习的方法求解系统的HJB方程, 其中评价网络用于逼近系统最优值函数, 执行网络用于逼近最优控制器的输出, 并利用一类正定函数来大幅简化评价-执行网络的自适应率. 基于李雅普诺夫稳定性理论, 证明系统所有误差信号半全局一致最终有界. 最后通过一个2自由度机械臂的仿真案例验证所提出方法的有效性.

Abstract:

To enhance safety in the control process of robotic manipulators, an adaptive reinforcement learning-based control method is proposed for addressing error constraints in manipulator systems. For this purpose, the dynamic system of the manipulator is transformed into a dynamic equation concerning tracking errors. Then, an error transformation function is employed to convert the constrained error system into a new unconstrained system, based on which an optimal controller is designed. To solve the optimal control problem, a reinforcement learning approach is used to approximate the solution of the Hamilton-Jacobi-Bellman (HJB) equation, where a critic network approximates the optimal value function and an actor network approximates the output of the optimal controller. A positive definite function is introduced to significantly simplify the adaptation laws of the critic and actor networks. Based on Lyapunov stability theory, all error signals of the system are proven to be semi-globally uniformly ultimately bounded. Finally, the effectiveness of the proposed method is verified through a simulation example of a two-degree-of-freedom robotic manipulator.

参考文献

相似文献

引证文献

引用本文

苏航,张滋林.具有误差约束的机械臂系统自适应强化学习控制[J].控制与决策,2026,41(5):1331-1337

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-09-24
最后修改日期:
录用日期:
在线发布日期: 2026-04-17
出版日期: 2026-05-10

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码