基于状态特征融合和多智能体协同的半导体封装车间生产与维护集成调度研究
DOI:
CSTR:
作者:
作者单位:

1.三峡大学;2.澳门科技大学

作者简介:

通讯作者:

中图分类号:

TP273

基金项目:

国家自然科学基金项目(52505562, 52075292, 62573442, 62173356);湖北省自然科学基金项目(2025AFB165);三峡大学科学基金项目(2024RCKJ032);水电机械设备设计与维护湖北省重点实验室开放基金项目(2025KJX03);三峡大学教学改革研究类项目(J2025053)。


Research on integrated scheduling of production and maintenance in semiconductor packaging workshop based on state feature fusion and multi-agent cooperation
Author:
Affiliation:

Fund Project:

National Natural Science Foundation of China (Nos. 52505562, 52075292, 62573442 and 62173356), the Hubei Provincial Natural Science Foundation of China (No. 2025AFB165), the Science Foundation of China Three Gorges University (No. 2024RCKJ032), the Open Foundation of Hubei Key Laboratory of Hydroelectric Machinery Design & Maintenance (No. 2025KJX03), and the Teaching Reform Research Project of China Three Gorges University (No. J2025053).

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对半导体封装车间存在平均流经时间长、在制品库存高和设备换模频繁等问题, 在考虑有限缓冲区和设备维护约束下开展了基于状态特征融合和多智能体协同的半导体封装车间生产与维护集成调度研究. 首先, 构建了最小化最大完工时间、总拖期时间和总换模时间的多目标数学优化模型; 然后, 针对半导体封装车间不同站点的生产特点及其缓冲区容量限制, 利用双延迟深度确定性策略梯度算法(Twin delayed deep deterministic policy gradient algorithm, TD3)设计了三种异构生产智能体; 最后, 针对多智能体TD3 (Multi-agent TD3, MATD3)中存在特征数量多、神经网络学习困难等问题, 提出利用卷积神经网络(Convolutional neural network, CNN)对不同智能体的状态特征进行压缩和融合, 构建了一种基于CNN状态特征融合的MATD3算法(记为:CNN-MATD3). 通过大量的实验结果表明: (1) 状态特征融合在CNN-MATD3算法中发挥着重要作用, 且对IGD和HV指标的平均贡献度分别为216.93%和26.52%; (2) CNN-MATD3算法显著优于2种经典元启发式算法、5种单智能体算法和2种多智能体算法, 且每类算法在IGD指标上的相对百分比偏差分别高达16652.49%、7009.01%和16788.91%、在HV指标上的相对百分比偏差分别不低于-95.46%、-56.81%和-86.65%; (3) 通过敏感性分析发现, 缓冲区容量大小和产品紧急程度对集成调度结果具有显著性影响.

    Abstract:

    To address the issues of long average flow time, high work-in-process (WIP) inventory and frequent equipment changeovers in semiconductor packaging workshop, an integrated scheduling of production and maintenance in semiconductor packaging workshop based on state feature fusion and multi-agent cooperation is studied under the consideration of limited buffer capacity and equipment maintenance constraint. Firstly, a multi-objective mathematical optimization model is established to minimize the makespan, total tardiness and total setup time. Secondly, according to the production characteristics and buffer capacity limits of different stations in the semiconductor packaging workshop, three heterogeneous production agents are designed by employing a twin delayed deep deterministic policy gradient (TD3) algorithm. Finally, to overcome the problems such as a large number of features and the difficulty of neural network learning in multi-agent TD3 (MATD3), a convolutional neural network (CNN) is employed to compress and fuse the state features of different agents, and an improved MATD3 algorithm based on CNN state feature fusion is constructed, named as CNN-MATD3. A large number of experimental results show that: (1) state feature fusion plays a significant role in CNN-MATD3 algorithm, and the average contributions to inverted generational distance (IGD) and hypervolume (HV) indicators are 216.93% and 26.52%, respectively; (2) CNN-MATD3 algorithm is significantly superior to two classical meta-heuristic algorithms, five single-agent algorithms and two multi-agent algorithms, the relative percentage deviations (RPDs) of each type of algorithm in IGD indicator are separately up to 16652.49%、7009.01% and 16788.91%, and their RPDs in HV indicator are separately no less than -95.46%、-56.81% and -86.65%; and (3) buffer capacity and product urgency have significant influences on the integrated scheduling results that are found by sensitivity analysis.

    参考文献
    相似文献
    引证文献
引用本文
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-02-22
  • 最后修改日期:2026-06-02
  • 录用日期:2026-06-03
  • 在线发布日期:
  • 出版日期:
文章二维码