Multi-manned Collaborative Mixed-model Assembly Line Balancing Optimization Based on Deep Reinforcement Learning
Author:
Affiliation:
1.School of Automation Science and Engineering, South China University of Technology;2.School of Software Engineering, South China University of Technology
Fund Project:
National Key R&D Program of China
摘要
|
图/表
|
访问统计
|
参考文献
|
相似文献
|
引证文献
|
资源附件
|
文章评论
摘要:
针对大型设备混流装配过程中的多人协同、多工种等特点,本文提出了基于双深度Q网络(Double Deep Q Network, DDQN)的多人协同混流装配线平衡优化算法.首先以工作站和工人数量、工人以及工作站间的负载为优化目标,建立了多人协同混流装配线平衡问题的多目标优化数学模型;其次,根据装配过程中生产对象的特征设计状态空间,并根据启发式规则设计动作空间,结合优化目标设计奖励函数,从而将数学模型转化为马尔科夫决策模型.在此基础上,对传统DDQN 算法进行改进,采用自适应探索概率完成动作决策,并设计了基于工人利用率的解码方法;最后,在混流装配线标准测试用例以及多人协同混流装配线测试用例上,将DDQN算法与改进离散水波优化算法和模拟退火算法进行对比,验证了算法的寻优精度以及模型的有效性.并在车身混流装配实际案例中采用DDQN 算法进行平衡优化,验证了算法的有效性和实用性.
Abstract:
Considering the characteristics of assembly process such as multiple workers collaborating, the demand for workers with different skills, and mixed-model assembly, this paper proposes a Double Deep Q Network (DDQN) based algorithm to address Multi-manned Cooperation Mixed-model Assembly Line Balancing Problem. Firstly, a mathematical model for Multi-manned Cooperation Mixed-model Assembly Line Balancing Problem is established with the objectives of optimising the number of workstations and workers, the workload between workers and workstations. Secondly, the state space is designed based on the features of production objects. Meanwhile, the action space is designed using heuristic rules. Besides, the reward function is constructed based on the objectives of the model. As a result, the mathematical model is converted into a Markov decision process model. On this basis, an improved DDQN algorithm with an adaptive exploration probability for action decision-making and a decoding method based on worker utilization rate is developed. Finally, the improved DDQN algorithm is compared with the improved Discrete Water Wave Optimization algorithm and the Simulated Annealing algorithm on standard mixed-model assembly line test cases and multi-manned collaborative mixed-model assembly line test cases to verify the accuracy of the algorithm and the effectiveness of the model. The effectiveness and practicality of the algorithm are also verified by applying it to balance optimization in a practical car body mixed-flow assembly process.