1. Talk2Car: Predicting Physical Trajectories for Natural Language Commands
- Author
-
Thierry Deruyttere, Dusan Grujicic, Matthew B. Blaschko, and Marie-Francine Moens
- Subjects
Artificial intelligence ,autonomous driving ,deep-learning ,machine learning ,multimodal fusion ,trajectory prediction ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
In recent years, there has been an increased interest in giving verbal commands to self-driving cars. Even though multiple companies have showcased progress towards fully autonomous vehicles, surveys have indicated that people are wary of relinquishing total control of the vehicle to the AI. Thus, a system allowing passengers to control the vehicle’s actions would be preferable. Natural language, the most widespread form of communication among humans, presents itself as the most natural control interface, and survey results confirm that the ability to give verbal commands to self-driving vehicles would make the passengers more at ease. In this work, we propose a novel system that predicts which object is referred to by the issued command and the path the car should follow through the immediate surroundings to execute the command. We experiment with different approaches and features to predict the object of interest and show that our simple but effective approach achieves state-of-the-art performance. For predicting the trajectory, we propose a model that relies on a mixture density approach for modeling the distributions of key waypoints of the trajectory in the top-down scene layout. Additionally, we investigate the influence of the two tasks on each other and show that improvements in the prediction of the object of interest lead to improvements in the trajectory prediction task. Finally, we provide the research community with an extension to the Talk2Car dataset, with new trajectory annotations for given commands.
- Published
- 2022
- Full Text
- View/download PDF