Full Text

Turn on search term navigation

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Deep reinforcement learning (DRL) has been demonstrated to be effective for several complex decision-making applications, such as autonomous driving and robotics. However, DRL is notoriously limited by its high sample complexity and its lack of stability. Prior knowledge, e.g., as expert demonstrations, is often available but challenging to leverage to mitigate these issues. In this paper, we propose General Reinforced Imitation (GRI), a novel method which combines benefits from exploration and expert data and is straightforward to implement over any off-policy RL algorithm. We make one simplifying hypothesis: expert demonstrations can be seen as perfect data whose underlying policy gets a constant high reward. Based on this assumption, GRI introduces the notion of offline demonstration agent. This agent sends expert data which are processed both concurrently and indistinguishably with the experiences coming from the online RL exploration agent. We show that our approach enables major improvements on camera-based autonomous driving in urban environments. We further validate the GRI method on Mujoco continuous control tasks with different off-policy RL algorithms. Our method ranked first on the CARLA Leaderboard and outperforms World on Rails, the previous state-of-the-art method, by 17%.

Details

Title
GRI: General Reinforced Imitation and Its Application to Vision-Based Autonomous Driving
Author
Chekroun, Raphael 1 ; Marin Toromanoff 2 ; Hornauer, Sascha 3 ; Moutarde, Fabien 3   VIAFID ORCID Logo 

 Center for Robotics, Mines Paris, PSL University, 75006 Paris, France; Valeo Driving Assistant Research, 75017 Paris, France 
 Valeo Driving Assistant Research, 75017 Paris, France 
 Center for Robotics, Mines Paris, PSL University, 75006 Paris, France 
First page
127
Publication year
2023
Publication date
2023
Publisher
MDPI AG
e-ISSN
22186581
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2882603614
Copyright
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.