基于深度强化学习的机器人运动控制研究进展

doi:10.13195/j.kzyjc.2020.1382

首页 > 过刊浏览>2022年第37卷第2期 >278-292. DOI:10.13195/j.kzyjc.2020.1382

基于深度强化学习的机器人运动控制研究进展
DOI:
                        10.13195/j.kzyjc.2020.1382
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:1. 贵州大学 机械工程学院,贵阳 550025;2. 贵州大学 省部共建公共大数据国家重点实验室(筹), 贵阳 550025;3. 贵州大学 现代制造技术教育部重点实验室,贵阳 550025
作者简介:
通讯作者:E-mail: lishaobo@gzu.edu.cn.
中图分类号:TP242
基金项目:国家重点研发计划项目(2018AAA0101803)；国家自然科学基金项目(51475097,91746116)；工信部资助项目(工信部联装[2016]213号)；贵州省科技计划项目(黔科合人才[2015]4011)；贵州省重点实验室建设项目(黔科合平台人才[2016]5103))；贵州大学培育项目(贵大培育[2019]22号).

Research progress of robot motion control based on deep reinforcement learning

Author:

Affiliation:

1. School of Mechanical Engineering,Guizhou University,Guiyang 550025,China;2. State Key Laboratory of Public Big Data,Guizhou University,Guiyang 550025,China;3. Key Laboratory of Advanced Manufacturing Technology of Ministry of Education,Guizhou University,Guiyang 550025,China

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

复杂未知环境下智能感知与自动控制是目前机器人在控制领域的研究热点之一,而新一代人工智能为其实现智能自动化赋予了可能.近年来,在高维连续状态-动作空间中,尝试运用深度强化学习进行机器人运动控制的新兴方法受到了相关研究人员的关注.首先,回顾了深度强化学习的兴起与发展,将用于机器人运动控制的深度强化学习算法分为基于值函数和策略梯度2类,并对各自典型算法及其特点进行了详细介绍;其次,针对仿真至现实之前的学习过程,简要介绍5种常用于深度强化学习的机器人运动控制仿真平台;然后,根据研究类型的不同,综述了目前基于深度强化学习的机器人运动控制方法在自主导航、物体抓取、步态控制、人机协作以及群体协同等5个方面的研究进展;最后,对其未来所面临的挑战以及发展趋势进行了总结与展望.

Abstract:

Intelligent perception and automatic control in a complex unknown environment is one of the current research hotspots of robots in the field of control, and a new generation of artificial intelligence makes it possible to realize intelligent automation. In recent years, the new method of robot control using deep reinforcement learning in high-dimensional continuous state-action space has attracted the attention of relevant researchers. Firstly, the rise and development of deep reinforcement learning are first reviewed. The deep reinforcement learning algorithms for robot motion control are classified into two categories: value-based functions and policy gradients, and their typical algorithms and their related features are detailly described. Then, for the learning process before simulation to reality, five kinds of simulation platforms for robot motion control are briefly introduced, which are often used for deep reinforcement learning. Moreover, according to different types of research, the research progress of the deep reinforcement learning approach of robot motion control is expounded in five aspects, including autonomous navigation, object grasping, gait control, human-robot collaborative and multi-robot cooperation. Finally, the future challenges and development trends are summarized and anticipated.

参考文献

相似文献

引证文献

引用本文

董豪,杨静,李少波,等.基于深度强化学习的机器人运动控制研究进展[J].控制与决策,2022,37(2):278-292

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2022-01-07
出版日期: 2022-02-20

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

分享

文章指标

历史

文章二维码