Optimizing roundabout management via deep reinforcement learning with safety and comfort constraints

Nadar, Ali; Härri, Jérôme
ITSC 2025, IEEE Intelligent Transportation Systems Conference, 18-21 November 2025, Gold Coast, Australia

This paper presents a deep reinforcement learning (DRL) framework to optimize autonomous vehicle maneuver during roundabout approaches, with a focus on safety, efficiency, and passenger comfort. The proposed method incorporates a logistic regression-based Roundabout Exit Probability (REP) model to estimate the likelihood that inbound vehicles will exit the roundabout, as well as a regression-based Time-To-Collision (TTC) predictor to model the ego vehicle's controlled maneuver while maintaining comfort constraints. These predictive models are integrated into a Proximal Policy Optimization (PPO) framework, enhanced with a curriculum learning strategy to gradually shape the agent's behavior toward balanced, human-like decisionmaking. The reward function is designed to penalize unsafe or abrupt actions and encourage smooth, efficient maneuvering. Experimental results in the CARLA simulator demonstrate the effectiveness of the proposed strategy in achieving robust, comfort-aware navigation in roundabout scenarios.


Type:
Conférence
City:
Gold Coast
Date:
2025-11-18
Department:
Systèmes de Communication
Eurecom Ref:
8322
Copyright:
© 2025 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK : https://www.eurecom.fr/publication/8322