1. A reinforcement learning-based multi-agent framework applied for solving routing and scheduling problems.
- Author
-
Lopes Silva, Maria Amélia, de Souza, Sérgio Ricardo, Freitas Souza, Marcone Jamilson, and Bazzan, Ana Lúcia C.
- Subjects
- *
REINFORCEMENT learning , *VEHICLE routing problem , *COMBINATORIAL optimization , *SETUP time , *SEARCH engines , *LEARNING ability - Abstract
• Multi-agent framework for optimization using metaheuristics. • Agents modify their actions using concepts of Reinforcement Learning. • Learning ability of the agents directly influences the quality of solutions. • Framework validated using Vehicle Routing Problem with Time-Windows (VRPTW) and Unrelated Parallel Machine Scheduling Problem with Sequence-Dependent Setup Times (UPMSP-ST). This article presents a multi-agent framework for optimization using metaheuristics, called AMAM. In this proposal, each agent acts independently in the search space of a combinatorial optimization problem. Agents share information and collaborate with each other through the environment. The goal is to enable the agent to modify their actions based on experiences gained in interacting with the other agents and the environment using the concepts of Reinforcement Learning. For better introduction and validation of the AMAM framework, this article uses the instantiation of the Vehicle Routing Problem with Time Windows (VRPTW) and the Unrelated Parallel Machine Scheduling Problem with Sequence-Dependent Setup Times (UPMSP-ST), i.e., two classic combinatorial optimization problems. The main objective of the experiments is to evaluate the performance of the proposed adaptive agents. The experiments confirm that the ability to learn attributed to the agent directly influences the quality of solutions, both from the individual point of view and from the point of view of teamwork. In this way, the framework presented here is a step forward in relation to the other frameworks of the literature regarding to the adaptation to the particular aspects of the problems. Additionally, the cooperation between agents and their ability to influence the quality of the solutions of the agents involved in the search of the solution is confirmed. The results also strengthen the issue of the scalability of the framework, since, with the addition of new agents, there is an improvement of the solutions obtained. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF