循环神经网络研究综述
CSTR:
作者:
作者单位:

中国石油大学(北京) 信息科学与工程学院,北京 102249

作者简介:

通讯作者:

E-mail: liujw@cup.edu.cn.

中图分类号:

TP273

基金项目:

中国石油大学(北京)科学基金项目(2462020YXZZ023).


Overview of recurrent neural networks
Author:
Affiliation:

College of Information Science and Engineering,University of Petroleum China(Beijing), Beijing 102249,China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    循环神经网络是神经网络序列模型的主要实现形式,近几年得到迅速发展,其是机器翻译、机器问题回答、序列视频分析的标准处理手段,也是对于手写体自动合成、语音处理和图像生成等问题的主流建模手段.鉴于此,循环神经网络的各分支按照网络结构进行详细分类,大致分为3大类:一是衍生循环神经网络,这类网络是基于基本RNNs模型的结构衍生变体,即对RNNs的内部结构进行修改;二是组合循环神经网络,这类网络将其他一些经典的网络模型或结构与第一类衍生循环神经网络进行组合,得到更好的模型效果,是一种非常有效的手段;三是混合循环神经网络,这类网络模型既有不同网络模型的组合,又在RNNs内部结构上进行修改,是同属于前两类网络分类的结构.为了更加深入地理解循环神经网络,进一步介绍与循环神经网络经常混为一谈的递归神经网络结构以及递归神经网络与循环神经网络的区别和联系.在详略描述上述模型的应用背景、网络结构以及模型变种后,对各个模型的特点进行总结和比较,并对循环神经网络模型进行展望和总结.

    Abstract:

    Recurrent neural networks (RNNs) are the main implementation paradigms for deep neural network sequence model, and have been developed rapidly and widespreadly in last two decades. Now, RNNs are cornerstone and foundation underpinning for machine translation, machine question answering and sequence video analysis, and RNNs are also the mainstream modeling approaches for handwriting automatic synthesis, speech processing and image generation. In this paper, the branches of recurrent neural networks are classified in detail according to the network structure, which can be roughly divided into three categories: the first one is all sorts of variants of recurrent neural networks, which are structural variants based on the basic RNNs architecture, that is, modifying the internal structure of RNNs. The second kind is combined RNNs, which combine some classical other network models or structures with the first kind of RNNs to get better modeling effect. It is a very effective means. The third one is hybrid RNNs, which not only combine different network models, but also modify the internal structure of RNNs. In order to understand the RNNs more deeply, this paper also introduces the structure of recursive neural networks which are often confused with RNNs, and the difference and connection between recursive neural networks and RNNs. After a detailed description of the application background, network structure and model variants of the above models based on RNNs, the characteristics of each model are summarized and compared. Finally, the prospect and summary of the RNNs are given.

    参考文献
    相似文献
    引证文献
引用本文

刘建伟,宋志妍.循环神经网络研究综述[J].控制与决策,2022,37(11):2753-2768

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2022-09-30
  • 出版日期: 2022-11-20
文章二维码