Content area

Abstract

Batch processing machines are often the bottleneck in semiconductor manufacturing and their scheduling plays a key role in production management. Pioneer researches on multi-objective batch machines scheduling mainly focus on evolutionary algorithms, failing to meet the online scheduling demand. To deal with the challenges confronted by incompatible job families, dynamic job arrivals, capacitated machines and multiple objectives, we propose a clustering-aided multi-agent deep reinforcement learning approach (CA-MADRL) for the scheduling problem. Specifically, to achieve diverse nondominated solutions, an offline multi-objective scheduling algorithm named Multi-Subpopulation fast elitist Non-Dominated Sorting Genetic Algorithm (MS-NSGA-II) is firstly developed to obtain the Pareto Fronts, and a clustering algorithm based on cosine distance is employed to analyze the distribution of Pareto frontier solution, which would be used to guide reward functions design in multi-agent deep reinforcement learning. To realize multi-objective optimization, several reinforcement learning base models are trained for different optimization directions, each of which composed of batch forming agent and batch scheduling agent. To alleviate time complexity of model training, a parameter sharing strategy is introduced between different reinforcement learning base model. By validating the proposed approach with 16 instances designed based on actual production data from a semiconductor manufacturing company, it has been demonstrated that the approach not only meets the high-frequency scheduling requirements of manufacturing systems for parallel batch processing machines but also effectively reduces the total job tardiness and machine energy consumption.

Details

1009240
Business indexing term
Title
A clustering-aided multi-agent deep reinforcement learning for multi-objective parallel batch processing machines scheduling in semiconductor manufacturing
Author
Zhang, Peng 1   VIAFID ORCID Logo  ; Jin, Mengyu 1 ; Wang, Ming 2 ; Zhang, Jie 1 ; He, Junjie 1 ; Zheng, Peng 3 

 Shanghai Engineering Research Center of industrial Big Data and Intelligent System, Institute of Artificial Intelligence, Donghua University, Shanghai, China 
 College of Mechanical Engineering, Donghua University, Shanghai, China 
 College of Log istics Engineering, Shanghai Maritime University, Shanghai, China 
Publication title
Volume
58
Issue
5
Pages
614-631
Publication year
2025
Publication date
May 2025
Publisher
Sage Publications Ltd.
Place of publication
London
Country of publication
United Kingdom
ISSN
00202940
e-ISSN
20518730
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Milestone dates
2024-05-24 (Received); 2024-07-22 (Accepted)
ProQuest document ID
3201726493
Document URL
https://www.proquest.com/scholarly-journals/clustering-aided-multi-agent-deep-reinforcement/docview/3201726493/se-2?accountid=208611
Copyright
© The Author(s) 2024. This work is licensed under the Creative Commons Attribution License https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-05-23
Database
ProQuest One Academic