Content area

Abstract

The job shop scheduling problem (JSSP) is a classical NP-hard combinatorial optimization challenge that plays a crucial role in manufacturing systems. Deep reinforcement learning has shown great potential in solving this problem. However, it still has challenges in reward function design and state feature representation, which makes it suffer from slow policy convergence and low learning efficiency in complex production environments. Therefore, a human feedback-based large language model-assisted deep reinforcement learning (HFLLMDRL) framework is proposed to solve this problem, in which few-shot prompt engineering by human feedback is utilized to assist in designing instructive reward functions and guiding policy convergence. Additionally, a self-adaptation symbolic visualization Kolmogorov–Arnold Network (KAN) is integrated as the policy network in DRL to enhance state feature representation, thereby improving learning efficiency. Experimental results demonstrate that the proposed framework significantly boosts both learning performance and policy convergence, presenting a novel approach to the JSSP.

Details

1009240
Title
Large Language Model-Assisted Deep Reinforcement Learning from Human Feedback for Job Shop Scheduling
Author
Zeng Yuhang 1   VIAFID ORCID Logo  ; Lou, Ping 1   VIAFID ORCID Logo  ; Hu, Jianmin 2   VIAFID ORCID Logo  ; Fan Chuannian 1   VIAFID ORCID Logo  ; Liu, Quan 1 ; Hu, Jiwei 1   VIAFID ORCID Logo 

 School of Information Engineering, Wuhan University of Technology, Wuhan 430070, China; [email protected] (Y.Z.); [email protected] (P.L.); [email protected] (C.F.); [email protected] (Q.L.); [email protected] (J.H.) 
 School of Information Engineering, Hubei University of Economics, Wuhan 430205, China, Hubei Key Laboratory of Digital Finance Innovation, Hubei University of Economics, Wuhan 430205, China 
Publication title
Machines; Basel
Volume
13
Issue
5
First page
361
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
e-ISSN
20751702
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-04-27
Milestone dates
2025-03-29 (Received); 2025-04-25 (Accepted)
Publication history
 
 
   First posting date
27 Apr 2025
ProQuest document ID
3212071253
Document URL
https://www.proquest.com/scholarly-journals/large-language-model-assisted-deep-reinforcement/docview/3212071253/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-05-27
Database
ProQuest One Academic