Posts
10 May 2026
Scaling Multi-Agent RL for Underwater Acoustic Tracking
GPU-accelerated simulation and Transformer-based MARL for cooperative underwater tracking.
10 Nov 2025
Parallelised Q-Network for Continuous Action Spaces
Extension of PQN to continuous action spaces.
4 Feb 2025
Simplifying Deep Temporal Difference Learning
A modern implementation of Deep Q-Network without target networks or replay buffers.