publications

2026

  1. arXiv
    manifold2.png
    Bilinear autoencoders find interpretable manifolds
    May 2026
  2. arXiv
    refinement.png
    From Mechanistic to Compositional Interpretability
    Ward Gauderis*, Thomas Dooms*, Steven T. Holmer, Kola Ayonrinde, and 1 more author
    May 2026

2025

  1. MI @ NeurIPS
    manifold.png
    Finding Manifolds With Bilinear Autoencoders
    In Mechanistic Interpretability Workshop: At the Thirty-Ninth Annual Conference on Neural Information Processing Systems, Oct 2025

2024

  1. Compositionality Unlocks Deep Interpretable Models
    In Connecting Low-Rank Representations in AI: At the 39th Annual AAAI Conference on Artificial Intelligence, Nov 2024
  2. Bilinear MLPs Enable Weight-Based Mechanistic Interpretability
    Michael T. Pearce, Thomas Dooms, Alice Rigg, Jose Oramas, and 1 more author
    In The Thirteenth International Conference on Learning Representations, Oct 2024