Back to Search Start Over

Graph Meta-Reinforcement Learning for Transferable Autonomous Mobility-on-Demand

Authors :
Gammelli, Daniele
Yang, Kaidi
Harrison, James
Pereira Duarte Rodrigues, Filipe M
Pereira, Francisco Camara
Pavone, Marco
Gammelli, Daniele
Yang, Kaidi
Harrison, James
Pereira Duarte Rodrigues, Filipe M
Pereira, Francisco Camara
Pavone, Marco
Source :
Gammelli , D , Yang , K , Harrison , J , Pereira Duarte Rodrigues , F M , Pereira , F C & Pavone , M 2022 , Graph Meta-Reinforcement Learning for Transferable Autonomous Mobility-on-Demand . in Proceedings of the ACM SIGKDD '22 Conference on Knowledge Discovery and Data Mining . pp. 20-37 , 28th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , Washington , District of Columbia , United States , 14/08/2022 .
Publication Year :
2022

Abstract

Autonomous Mobility-on-Demand (AMoD) systems represent an attractive alternative to existing transportation paradigms, currently challenged by urbanization and increasing travel needs. By centrally controlling a fleet of self-driving vehicles, these systems provide mobility service to customers and are currently starting to be deployed in a number of cities around the world. Current learning-based approaches for controlling AMoD systems are limited to the single-city scenario, whereby the service operator is allowed to take an unlimited amount of operational decisions within the same transportation system. However, real-world system operators can hardly afford to fully re-train AMoD controllers for every city they operate in, as this could result in a high number of poor-quality decisions during training, making the single-city strategy a potentially impractical solution. To address these limitations, we propose to formalize the multi-city AMoD problem through the lens of meta-reinforcement learning (meta-RL) and devise an actor-critic algorithm based on recurrent graph neural networks. In our approach, AMoD controllers are explicitly trained such that a small amount of experience within a new city will produce good system performance. Empirically, we show how control policies learned through meta-RL are able to achieve near-optimal performance on unseen cities by learning rapidly adaptable policies, thus making them more robust not only to novel environments, but also to distribution shifts common in real-world operations, such as special events, unexpected congestion, and dynamic pricing schemes.

Details

Database :
OAIster
Journal :
Gammelli , D , Yang , K , Harrison , J , Pereira Duarte Rodrigues , F M , Pereira , F C & Pavone , M 2022 , Graph Meta-Reinforcement Learning for Transferable Autonomous Mobility-on-Demand . in Proceedings of the ACM SIGKDD '22 Conference on Knowledge Discovery and Data Mining . pp. 20-37 , 28th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , Washington , District of Columbia , United States , 14/08/2022 .
Notes :
application/pdf, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1372615921
Document Type :
Electronic Resource