I am a PhD candidate in Reinforcement Learning at UPC, Barcelona. My research focuses on improving the efficiency and stability of (multi-agent) RL. My background is in both computer science and philosophy.
Posts
10 Nov 2025
Parallelised Q-Network for Continuous Action Spaces
Extension of PQN to continuous action spaces.
4 Feb 2025
Simplifying Deep Temporal Difference Learning
A modern implementation of Deep Q-Network without target networks or replay buffers.