prof_pic.png

Max Weltevrede

PhD Researcher, TU Delft

About me

I’m a PhD researcher in the Sequential Decision Making group at the Delft University of Technology supervised by Matthijs Spaan and Wendelin Böhmer. I do research in reinforcement learning with a focus on developing RL agents that can generalise to new scenarios. Currently, I investigating the role of exploration for improving generalisation performance, as well as the zero-shot generalisation capabilities of offline RL agents.

Generally, I am interested in many things. At the moment this includes generalisation, adaptation, continual learning, causality, physics, the scientific method, software engineering, playing guitar, singing, painting and collecting fossils.


Publications

    • Training on more Reachable Tasks for Generalisation in Reinforcement Learning
      Max Weltevrede, Caroline Horsch, Matthijs T. J. Spaan, and Wendelin Böhmer
      Preprint. Under Review, Oct 2024
    • Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
      Max Weltevrede, Felix Kaubek, Matthijs T. J. Spaan, and Wendelin Böhmer
      Seventeenth European Workshop on Reinforcement Learning, Sep 2024
    • The Role of Diverse Replay for Generalisation in Reinforcement Learning
      Max Weltevrede, Matthijs T. J. Spaan, and Wendelin Böhmer
      Sixteenth European Workshop on Reinforcement Learning, Aug 2023