参数未知的离散系统Q-学习优化状态估计与控制

doi:10.13195/j.kzyjc.2019.0180

首页 > 过刊浏览>2020年第35卷第12期 >2889-2897. DOI:10.13195/j.kzyjc.2019.0180

参数未知的离散系统Q-学习优化状态估计与控制
DOI:
                        10.13195/j.kzyjc.2019.0180
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:(1. 沈阳化工大学信息工程学院，沈阳110142;2. 东北大学流程工业综合自动化国家重点实验室，沈阳110004)
作者简介:
通讯作者:E-mail: lijinna_721@126.com.
中图分类号:TP13
基金项目:国家自然科学基金项目(61673280)；辽宁省高等学校创新人才项目(LR2017006).

Q-learning optimal state estimation and control for discrete systems with unknown parameters

Author:

Affiliation:

(1. College of Information Engineering,Shenyang University of Chemical Technology,Shenyang110142,China;2. State Key Lab of Synthetical Automation for Process Industries,Northeastern University,Shenyang110004,China)

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

控制系统的应用中存在状态不能直接测量或测量成本高的实际问题,给模型参数未知的系统完全利用状态数据学习最优控制器带来挑战性难题.为解决这一问题,首先构建具有状态观测器且系统矩阵中存在未知参数的离散线性增广系统,定义性能优化指标;然后基于分离定理、动态规划以及Q-学习方法,给出一种具有未知模型参数的非策略Q-学习算法,并设计近似最优观测器,得到完全利用可测量的系统输出和控制输入数据的非策略Q-学习算法,实现基于观测器状态反馈的系统优化控制策略,该算法的优点在于不要求系统模型参数全部已知,不要求系统状态直接可测,利用可测量数据实现指定性能指标的优化;最后,通过仿真实验验证所提出方法的有效性.

Abstract:

In the application of control systems, there is a practical problem that the state cannot be directly measured or the measurement cost is high. In order to solve this problem, a linear discrete-time augmented system with unknown parameters and a state observer is first constructed and the prescribed performance index is defined. Then, based on the separation theorem, the dynamic programming theory and the Q-learning method, a novel off-policy Q-learning algorithm is developed to approximate the optimal observer and the optimal controller for systems with unknown parameters and unmeasured states, such that the control performance is minimized using only measured data. The advantage of this algorithm is that it does not require all the system model parameters to be known and the system state to be directly measurable. Finally, the simulation experiment verifies the effectiveness of the proposed method.

参考文献

相似文献

引证文献

引用本文

李金娜,马士凯.参数未知的离散系统Q-学习优化状态估计与控制[J].控制与决策,2020,35(12):2889-2897

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2020-12-02
出版日期: 2020-12-20

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码