teaching

Currently supervising five bachelor's students on research projects related to exploration in (Bayesian) deep reinforcement learning and contextual bandits.

Designed and delivered a lecture on Bayesian model-free reinforcement learning as part of the MSc-level course “Sequential Decision Making” (DSAIT4110) at TU Delft.