An End-to-End Reinforcement Learning Approach for Job-Shop Scheduling Problems Based on Constraint Programming

AI-generated keywords: Job-Shop Scheduling Problem Constraint Programming Reinforcement Learning Priority Dispatching Rule Neural Network Architecture

AI-generated Key Points

  • End-to-end approach for solving the Job-Shop Scheduling Problem (JSSP) using Constraint Programming (CP) and Reinforcement Learning (RL)
  • CP solvers struggle to scale to larger problems
  • Proposed neural network architecture and training algorithm that only require a generic CP encoding and small instances
  • RL agent learns a Priority Dispatching Rule (PDR) capable of generalizing well to large instances
  • Evaluated on seven JSSP datasets, finds higher-quality solutions for very large instances compared to static PDRs and CP solvers within the same time limit
  • Key contributions: introducing an RL environment based on a generic CP model, developing an efficient neural network architecture, proposing a novel training algorithm utilizing the CP nature by using a CP solver to generate training data
  • Provides a promising solution for scalability issues in solving scheduling problems like JSSP by combining CP and RL
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Pierre Tassel, Martin Gebser, Konstantin Schekotihin

To be published at ICAPS 2023
License: CC BY 4.0

Abstract: Constraint Programming (CP) is a declarative programming paradigm that allows for modeling and solving combinatorial optimization problems, such as the Job-Shop Scheduling Problem (JSSP). While CP solvers manage to find optimal or near-optimal solutions for small instances, they do not scale well to large ones, i.e., they require long computation times or yield low-quality solutions. Therefore, real-world scheduling applications often resort to fast, handcrafted, priority-based dispatching heuristics to find a good initial solution and then refine it using optimization methods. This paper proposes a novel end-to-end approach to solving scheduling problems by means of CP and Reinforcement Learning (RL). In contrast to previous RL methods, tailored for a given problem by including procedural simulation algorithms, complex feature engineering, or handcrafted reward functions, our neural-network architecture and training algorithm merely require a generic CP encoding of some scheduling problem along with a set of small instances. Our approach leverages existing CP solvers to train an agent learning a Priority Dispatching Rule (PDR) that generalizes well to large instances, even from separate datasets. We evaluate our method on seven JSSP datasets from the literature, showing its ability to find higher-quality solutions for very large instances than obtained by static PDRs and by a CP solver within the same time limit.

Submitted to arXiv on 09 Jun. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2306.05747v1

This paper introduces an end-to-end approach for solving the Job-Shop Scheduling Problem (JSSP) using a combination of Constraint Programming (CP) and Reinforcement Learning (RL). CP solvers are effective for finding optimal or near-optimal solutions for small instances but struggle to scale to larger problems. To address this limitation, the authors propose a novel neural network architecture and training algorithm that only require a generic CP encoding of the scheduling problem and a set of small instances. The proposed approach leverages existing CP solvers to train an RL agent that learns a Priority Dispatching Rule (PDR) capable of generalizing well to large instances from separate datasets. This method is evaluated on seven JSSP datasets from the literature and demonstrates its ability to find higher-quality solutions for very large instances compared to static PDRs and CP solvers within the same time limit. The key contributions of this paper include introducing an RL environment based on a generic CP model of JSSP that enables fast propagation of constraints even for large instances; developing an efficient neural network architecture capable of extracting features directly from raw variables; and proposing a novel training algorithm that utilizes the CP nature by using a CP solver to generate training data. This end-to-end approach combining CP and RL provides a promising solution for addressing scalability issues in solving scheduling problems like JSSP. It leverages existing CP solvers and trains an RL agent with a generic encoding and small instance datasets, achieving higher-quality solutions for large instances compared to traditional approaches within similar time limits.
Created on 06 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.