Projects
MARL Cluster Training

Let's dive into MARL training with BenchMARL and how to scale up your experiments to a cluster. We'll cover environment and model customisation, as well as how to run your experiments on a cluster even without internet access.
Training GPT-2 on Stockfish Games

I trained a GPT-2 model on Stockfish self-played games in the most naive way, with no search, and it can play decently. The model is trained to output the next move given the FEN string of the board (single state). While I present some gotchas and caveats, the results are quite acceptable for the amount of work and computing invested. I also present a basic attention visualiser parsing the attention of the text tokens into the board.