prof_pic.png

Max Weltevrede

PhD Researcher, TU Delft

About me

I’m a PhD researcher in the Sequential Decision Making group at the Delft University of Technology supervised by Matthijs Spaan and Wendelin Böhmer. I do research in reinforcement learning with a focus on developing RL agents that can generalise to new scenarios. Currently, I investigating the role of exploration for improving generalisation performance, as well as the zero-shot generalisation capabilities of offline RL agents.

Generally, I am interested in many things. At the moment this includes generalisation, adaptation, continual learning, causality, physics, the scientific method, software engineering, playing guitar, singing, painting and collecting fossils.

News

Sep 18, 2025 Our paper “How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning” got accepted at NeurIPS 2025!
Jul 10, 2025 Presented our work Exploration Implies Data Augmentation in Cathy Wu’s lab at MIT

Publications

    • How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
      Max Weltevrede, Moritz A. Zanger, Matthijs T. J. Spaan, and Wendelin Böhmer
      Conference on Neural Information Processing Systems (NeurIPS 2025), May 2025
    • Universal Value-Function Uncertainties
      Moritz A. Zanger, Max Weltevrede, Yaniv Oren, Pascal R. Van der Vaart, Caroline Horsch, Wendelin Böhmer, and Matthijs T. J. Spaan
      Preprint. Under Review, May 2025
    • Exploration Implies Data Augmentation: Reachability and Generalisation in Contextual MDPs
      Max Weltevrede, Caroline Horsch, Matthijs T. J. Spaan, and Wendelin Böhmer
      Preprint. Under Review, Feb 2025
    • Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
      Max Weltevrede, Felix Kaubek, Matthijs T. J. Spaan, and Wendelin Böhmer
      Seventeenth European Workshop on Reinforcement Learning (EWRL), Sep 2024
    • The Role of Diverse Replay for Generalisation in Reinforcement Learning
      Max Weltevrede, Matthijs T. J. Spaan, and Wendelin Böhmer
      Sixteenth European Workshop on Reinforcement Learning (EWRL), Aug 2023