Tagged #AIS

# Stories

23 September 2024 | 11 min read | tags: Experience Research AIS XAI MARL

My PhD

A short story about why I decided to pursue a PhD in explainable multi-agent reinforcement learning. I detail how I weighed this decision and how I put together my proposal. I also describe what I plan to do to make the most of my PhD.

2 November 2023 | 12 min read | tags: AIS Agenda Startup

My Approach to AI Safety

Short-term issues should not be underestimated because, if not tackled immediately, they will make alignment a lot more complex. I am convinced that interpretability will be the best tool for monitoring and control, so I will pursue this agenda through both research and entrepreneurship.

# Articles

5 October 2024 | 14 min read | tags: AIS XAI FHE Eval

FHE for Open Model Audits

Thanks to recent developments, fully homomorphic encryption (FHE) can now be applied easily and scalably to deep neural networks. Like many, I think these advances are a real opportunity to improve AI safety. I therefore outline possible applications of FHE to model evaluation and interpretability, which are in my opinion the most mature safety tools available today.
