RL Dresden

Projects

Water

Navigating maritime traffic poses multifaceted challenges, including the intricate interplay of environmental factors like wind, waves, and currents, alongside the dynamic presence of other vessels with varying behaviors. These complexities demand precise control strategies that can adapt to real-world conditions, considering factors such as shallow water impacts and spatial restrictions imposed by waterway geometry. Within our research, we aim to address these challenges through modularized frameworks and advanced reinforcement learning techniques, facilitating realistic path planning and robust control of autonomous vessels in diverse maritime environments.

Ground

Addressing urban congestion and outdated infrastructure while improving traffic flow and reducing pollution is crucial. Implementing advanced traffic management systems equipped with AI algorithms can optimize flow and reduce congestion through dynamic route planning. We do our best to develop those AI algorithms. Among other tasks, to overcome the challenge of transferring trained autonomous vehicle agents from simulation to reality, we focus on separating agents into perception and control modules, enabling smoother integration into real-world environments. Similarly, by incorporating real-world driving datasets into training reinforcement learning agents, we aim to enhance their behavior and generalization capabilities, ultimately improving autonomous driving systems' performance on actual roads.

Air

Urban air mobility presents unique challenges in managing eVTOL vehicles around vertiports, including congestion, safety, and unpredictable factors like weather and passenger demand. Among others, in our project, we address these complexities by proposing self-organized arrival systems using deep reinforcement learning. Treating each aircraft as an individual agent, we develop a shared policy for decentralized actions based on local information.

Methods

The continual development of new algorithms and models is imperative to tackle practical challenges with reinforcement learning. Even established methods like Q-Learning encounter limitations, particularly regarding the maximization operator in target computation. Inherent in the Bellman optimality equation, this operator often inflates Q-values for state-action pairs, impacting subsequent updates. As part of our broader theoretical endeavors in reinforcement learning, we are currently developing a novel suite of estimators tailored to address the max-mu problem, seamlessly integrating them within the framework of deep neural network-based function approximations.

Software

RL Dresden Algorithm Suite [GitHub]

This suite implements several model-free off-policy deep reinforcement learning algorithms for discrete and continuous action spaces in PyTorch.

Mixed Traffic Web Simulators [mtreiber.de]

Fully operational, 2D, Javascript-based web simulator implementing the "Mixed Traffic flow Model" (MTM). This simulation is intended to demonstrate fully two-dimensional but directed traffic flow and visulalize 2D flow models.

Sim2Real Transfer package with Duckiebot [GitHub]

This package includes the training and evaluation code under ROS platform for Sim2Real Transfer with Duckiebot for multiple autonomous driving behaviors.

Publications

2-level reinforcement learning for ships on inland waterways: Path planning and following

Waltz, M., Paulig, N., Okhrin, O. (2025). Expert Systems with Applications, Volume 274, 126933.

Leader–follower identification with vehicle-following calibration for non-lane-based traffic

Kulkarni, M. M., Chaudhari, A. A., Et al. (2025). Transportation Research Part C: Emerging Technologies 171, 104940.

Two-step dynamic obstacle avoidance

Waltz, M., Okhrin, O., Hart, F. (2024). Knowledge-Based Systems, Volume 302, 112402.

Addressing maximization bias in reinforcement learning with two-sample testing

Waltz, M., Okhrin, O. (2024). Artificial Intelligence 336, 104204

Self-organized free-flight arrival for urban air mobility

Waltz, M., Okhrin, O., Schultz, M. (2024). Transportation Research Part C: Emerging Technologies, Volume 167, 104806

Towards robust car-following based on deep reinforcement learning

Hart, F., Okhrin, O., Treiber, M. (2024). Transportation Research Part C: Emerging Technologies, Volume 159, 104486

Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk

Hart, F., & Okhrin, O. (2024). Neurocomputing, 568, 127097.

Robust Path Following on Rivers Using Bootstrapped Reinforcement Learning

Paulig, N., & Okhrin, O. (2024). Ocean Engineering, Volume 298, 117207

Towards Autonomous Driving with Small-Scale Cars: A Survey of Recent Development

Li, D., Auerbach, P., & Okhrin, O. (2024). arXiv preprint arXiv:2404.06229.

An open-source framework for data-driven trajectory extraction from AIS data -- the α-method

Paulig, N., & Okhrin, O. (2024). under revision in Transactions on Intelligent Transportation Systems (T-ITS), preprint arXiv:2407.04402.

Vision-based DRL Autonomous Driving Agent with Sim2Real Transfer

Li, D., & Okhrin, O. (2023). In 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC) (pp. 866-873).

A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer in Autonomous Driving

Li, D., & Okhrin, O. (2024). Communications Engineering volume 3, Article number: 147.

Spatial-temporal recurrent reinforcement learning for autonomous ships

Waltz, M., & Okhrin, O. (2023). Neural Networks, 165, 634-653.

Vessel-following model for inland waterways based on deep reinforcement learning

Hart, F., Okhrin, O., & Treiber, M. (2023). Ocean Engineering, 281, 114679

Modified DDPG car-following model with a real-world human driving experience with CARLA simulator

Li, D., & Okhrin, O. (2023). Transportation Research Part C: Emerging Technologies, Volume 147, 103987.

Student Works

Linienlose Tabelle

Year	Name	Degree	Thesis Title
2025	Bharath Ashok Kumar	Master (D. Li)	Autonomous Navigation with Deep Reinforcement Learning in CARLA Simulator
2024	Yi Feng	Master (D. Li)	Autonomous robot car with Deep Reinforcement Learning
2024	Guangli Chen	Master (D. Li)	Autonomous robot car with Deep Reinforcement Learning
2024	Yuhong Zhangwill	Master (A.Chaudhari)	Driving regime identification using hidden Markov models
2024	Rajasekar Sankar	Master (D. Li)	Transformer enhanced affordance learning for autonomous driving
2024	Sebastian Jost	Master (A.Chaudhari)	Solving general Twisty Puzzles with Reinforcement Learning and automatic Algorithm Generation
2024	Vladyslav Nechai	Master (M.Waltz)	Communication approaches in Multi-Agent Reinforcement Learning
2024	Jonas Keller	Master (M.Waltz)	Explainability in Deep Reinforcement Learning
2023	Peilin Wang	Master (D.Li)	Autonomous Navigation with Deep Reinforcement Learning in Carla
2023	Yuhua Zhu	Master (D.Li)	Autonomous Driving with Deep Reinforcement Learning
2021	Paul Ziebarth	Diplom (F.Hart)	Entwicklung eines Reinforcement Learning Agenten zur Realisierung eines Schifffolgemodells

Team

Prof. Dr. Ostap Okhrin

Ostap Okhrin is the professor for Statistics and Econometrics at the Department of Transportation at the TU Dresden. His expertise lies in mathematical statistics and data science with applications in transportation and economics.

Dr. Martin Treiber

Martin Treiber is a senior expert in traffic flow models including human and automated driving, bicycle, and pedestrian traffic. He also works in traffic data analysis and simulation (traffic-simulation.de, mtreiber.de/mixedTraffic).

Niklas Paulig

Niklas Paulig is a RL Group research associate, with his main field of research being the modeling of autonomous inland vessel traffic based on reinforcement learning methods, and HPC implementations of currently in use algorithms.

Dianzhao Li

Dianzhao Li is a research assistant at RL-Dresden, focusing on the area of trajectory planning for autonomously driving vehicles with reinforcement learning algorithms. He now mixes the human driving datasets with RL in simulated environments to achieve better performance for the vehicles.

Paul Auerbach

Paul Auerbach is a research associate at the Barkhausen Institut and collaborates with RL-Dresden on the simulation and solving of traffic scenarios with the help of reinforcement learning. He aims to transfer the learned RL models to real world model cars.

Gong Chen

Gong Chen is a research associate within RL-Dresden, concentrating on applying reinforcement learning to simulate shipping traffic under shallow water conditions.

Saad Sagheer

Saad Sagheer is a Civil Engineer who is currently working as a research associate and collabrates with BAW (Bundesanstalt für Wasserbau). His main field of research is to conduct micro-traffic simulation of inland vessels in Rhine especially Middle Rhine with different water level conditions related to climate change.

Ankit Anil Chaudhari

Ankit Chaudhari is currently working on "Enhancing Traffic-Flow Understanding by Two-Dimensional Microscopic Models". His research interests are traffic flow modelling, traffic simulation, mixed traffic flow, machine learning and reinforcement learning.

RL Dresden

Reinforcement Learning Research Group

Motivation

Optimal control for traffic dynamics

RLCDD 2022

News

December 10, 2024

September 10-13, 2024

September 2-6, 2024

May 20-24, 2024

March 13-15, 2024

December 5-6, 2023

September 24-28, 2023

September 11-14, 2023

August 29, 2023

June 8, 2023

June 7, 2023

May 8, 2023

April 27, 2023

April 25, 2023