基于强化学习的多智能体系统容错编队最优控制

doi:10.13195/j.kzyjc.2025.0218

首页 > 过刊浏览>2025年第40卷第12期 >3565-3575. DOI:10.13195/j.kzyjc.2025.0218

基于强化学习的多智能体系统容错编队最优控制
DOI:
                        10.13195/j.kzyjc.2025.0218
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:TP273
基金项目:国家自然科学基金项目(62276119)；江苏省基础研究计划项目(BK20241764)；江苏省研究生实践创新计划项目(SJCX25_1523)；江苏师范大学研究生科研与实践创新计划项目(2024XKT2630).

Fault-tolerant formation optimized control for multi-agent systems via reinforcement learning

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

研究执行器故障下多智能体系统的最优编队控制问题, 提出一种基于强化学习的预设时间最优容错编队控制方法. 首先, 借助预设时间调谐函数, 结合评论家-演员算法, 可显著降低算法复杂度; 然后, 在此基础上, 设计一种高效的容错控制机制, 以确保系统在执行器发生故障时仍然能够稳定地实现预期编队控制目标; 接着, 基于强化学习的自适应算法能力, 进一步增强算法对于复杂动态环境的适应性和鲁棒性; 最后, 通过仿真结果验证了所提出方法在故障场景下的有效性和优越性.

Abstract:

This paper investigates the optimal formation control problem for multi-agent systems under actuator faults and proposes a predefined-time optimized fault-tolerant formation control method based on reinforcement learning. The method employs a predefined-time tuning function and combines with an actor-critic algorithm and significantly reduces algorithmic complexity. Based on this, an efficient fault-tolerant control mechanism is designed to ensure the system achieve the desired formation control objectives, even in the presence of actuator faults. Furthermore, the reinforcement learning-based adaptability further enhances the algorithm's robustness and adaptability in complex dynamic environments. Finally, simulation results validate the effectiveness and superiority of the proposed method across fault scenarios.

参考文献

相似文献

引证文献

引用本文

申思凯,江南,刘小洋.基于强化学习的多智能体系统容错编队最优控制[J].控制与决策,2025,40(12):3565-3575

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-03-03
最后修改日期:
录用日期:
在线发布日期: 2025-11-10
出版日期: 2025-12-10

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码