Max Weltevrede
PhD Researcher, TU Delft
About me
I’m a PhD researcher in the Sequential Decision Making group at the Delft University of Technology, supervised by Matthijs Spaan and Wendelin Böhmer. I do research in reinforcement learning with a focus on developing RL agents that can generalise to new scenarios. Currently, I am investigating the role of exploration in improving generalisation performance, as well as the zero-shot generalisation capabilities of offline RL agents.
Generally, I am interested in many things. At the moment this includes generalisation, adaptation, continual learning, causality, physics, the scientific method, software engineering, playing guitar, singing, painting and collecting fossils.
News
| Sep 18, 2025 | Our paper “How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning” got accepted at NeurIPS 2025! |
|---|---|
| Jul 10, 2025 | Presented our work “Exploration Implies Data Augmentation” in Cathy Wu’s lab at MIT |