基于评价网络近似误差的自适应动态规划优化控制

doi:10.13195/j.kzyjc.2014.0102

首页 > 过刊浏览>2015年第30卷第3期 >495-499. DOI:10.13195/j.kzyjc.2014.0102

基于评价网络近似误差的自适应动态规划优化控制
DOI:
                        10.13195/j.kzyjc.2014.0102
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:广西大学电气工程学院，南宁530004.
作者简介:丁强
通讯作者:
中图分类号:TP18
基金项目:
国家自然科学基金重点项目(61034002)；国家自然科学基金项目(61364007).

Adaptive dynamic programming optimal control based on approximation error of critic network

Author:

Affiliation:

School of Electrical Engineering，Guangxi University，Nanning 530004，China．

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

为了求解有限时域最优控制问题, 自适应动态规划(ADP) 算法要求受控系统能一步控制到零. 针对不能一步控制到零的非线性系统, 提出一种改进的ADP 算法, 其初始代价函数由任意的有限时间容许序列构造. 推导了算法的迭代过程并证明了算法的收敛性. 当考虑评价网络的近似误差并满足假设条件时, 迭代代价函数将收敛到最优代价函数的有界邻域. 仿真例子验证了所提出方法的有效性.

Abstract:

In order to solve finite horizon optimal control problems, the adaptive dynamic programming(ADP) algorithm demands the system can reach zero in one step of control. For the nonlinear systems which cannot be controlled to zero in one step, an improved ADP algorithm is presented, and the initial cost is constructed by arbitrary finite horizon admissible sequence. After giving the iterative process, the convergence analysis of the improved algorithm is conducted. If the approximation error of the critic network is considered and several assumptions are satisfied, the iterative cost function will converge to a finite neighborhood of the optimal cost function. A simulation example is provided to verify the effectiveness of the presented approach.

参考文献

相似文献

引证文献

引用本文

林小峰丁强.基于评价网络近似误差的自适应动态规划优化控制[J].控制与决策,2015,30(3):495-499

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2014-01-17
最后修改日期:2014-06-27
录用日期:
在线发布日期: 2015-03-20
出版日期:

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

分享

文章指标

历史

文章二维码