Yoann Poupart

Personal website, blog & portfolio.

# Welcome!

I am a French PhD student in AI with a focus on AI Safety. I’ve had diverse experience in research, education and industry. I’ll outline this in a forthcoming story, “A Convoluted Study Path”, giving insight into my career aspiration. More on me in the About section.

You’ll find here a collection of stories, projects and articles.

  • Story: personal thought, claim, discussion or experience
  • Project: concrete application (mostly around AI)
  • Article: paper explanation or discussion (definitely about AI)

You can use tags to find posts on a specific topic!

# Pinned stories

My PhD

My PhD

2024-09-23 | 11 min read | A short-story about why I decided to do a PhD on the subject of explainable multi-agent reinforcement learning. I detail how I weighted this decision and how I created my proposal. I also try to depict what I plan to do for making the best of my PhD.

Creating a Startup

Creating a Startup

2023-12-20 | 10 min read | Creating a startup is not always about having the best or a totally new idea. Entrepreneurship is just another way to deliver value and achieve your goals, and it's within everyone's reach. We chose this path because it is flexible, stimulating, profoundly empowering, but also fun.

My Approach to AI Safety

2023-11-02 | 12 min read | Shortcoming issues should not be undermined because, if not tackled immediately, they will make Alignment a lot more complex. I am convinced that interpretability will be the best tool for monitoring and control, and for that, I will pursue this agenda through research and entrepreneurship.

# Pinned projects

Training GPT-2 on Stockfish Games

Training GPT-2 on Stockfish Games

2024-02-29 | 12 min read | I trained a GPT-2 model on Stockfish self-played games in the most naive way, with no search, and it can play decently. The model is trained to output the next move given the FEN string of the board (single state). While I present some gotchas and caveats, the results are quite acceptable for the amount of work and computing invested. I also present a basic attention visualiser parsing the attention of the text tokens into the board.

# Pinned articles

FHE for Open Model Audits

FHE for Open Model Audits

2024-10-05 | 14 min read | Thanks to recent developments, FHE can now be applied easily and scalably to deep neural networks. I think, like many, that these advancements are a real opportunity to improve AI safety. I thus outline possible applications of FHE in model evaluation and interpretability, the most mature tools in safety as of today in my opinion.

Layer-Wise Relevance Propagation

Layer-Wise Relevance Propagation

2024-01-16 | 16 min read | Layer-Wise Relevance Propagation (LRP) is a propagation method that produces relevances for a given input with regard to a target output. Technically the computation happens using a single back-progation pass similarly to deconvolution. I propose to illustrate this method on an Alpha-Zero network trained to play Othello.