Posts

10 May 2026

Scaling Multi-Agent RL for Underwater Acoustic Tracking

GPU-accelerated simulation and Transformer-based MARL for cooperative underwater tracking.

10 Nov 2025

Parallelised Q-Network for Continuous Action Spaces

Extension of PQN to continuous action spaces.

4 Feb 2025

Simplifying Deep Temporal Difference Learning

A modern implementation of Deep Q-Network without target networks and replay buffers.