This thesis wants to solve a Vehicle Routing Problem through the use of a non-traditional method, Reinforcement Learning. This is complemented by the resolution of the same problem through heuristic techniques and a deep analysis of the two implementations. Firstly, the problem is solved with open-source tools provided by Google, mathematical and optimization functions. Subsequently, the same problem is solved through the development of an environment and the utilization of specific Reinforcement Learning algorithms. These generate the paths of the vehicles from the warehouse to the several customers by training an agent, which decides the actions to be taken. Lastly, an economic analysis of the two proposals is carried out concentrating especially on the new method. The research shows that the traditional method optimizes the vehicles’ routes but can work well only with small sets of non-real world data, as it faces several limitations. On the other hand, Reinforcement Learning models are more complex and can work with big sets of real world data. It must be said that this study needs further refinement to provide optimal solutions, as the ones offered are not the best ones. As a matter of fact, when trying to generalize unseen data, the model is not efficient enough. However, Reinforcement Learning remains a promising way of optimizing internal business processes, which requires additional resources and study to successfully complete its tasks.

Reinforcement Learning for a Routing Optimization Problem. Solving a VRP with a FedEx data set.

Ricci, Angelica
2024/2025

Abstract

This thesis wants to solve a Vehicle Routing Problem through the use of a non-traditional method, Reinforcement Learning. This is complemented by the resolution of the same problem through heuristic techniques and a deep analysis of the two implementations. Firstly, the problem is solved with open-source tools provided by Google, mathematical and optimization functions. Subsequently, the same problem is solved through the development of an environment and the utilization of specific Reinforcement Learning algorithms. These generate the paths of the vehicles from the warehouse to the several customers by training an agent, which decides the actions to be taken. Lastly, an economic analysis of the two proposals is carried out concentrating especially on the new method. The research shows that the traditional method optimizes the vehicles’ routes but can work well only with small sets of non-real world data, as it faces several limitations. On the other hand, Reinforcement Learning models are more complex and can work with big sets of real world data. It must be said that this study needs further refinement to provide optimal solutions, as the ones offered are not the best ones. As a matter of fact, when trying to generalize unseen data, the model is not efficient enough. However, Reinforcement Learning remains a promising way of optimizing internal business processes, which requires additional resources and study to successfully complete its tasks.
2024-10-21
File in questo prodotto:
File Dimensione Formato  
878539-1289952.pdf

accesso aperto

Tipologia: Altro materiale allegato
Dimensione 3.92 MB
Formato Adobe PDF
3.92 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14247/24048