publications

2025

  1. Finding Manifolds With Bilinear Autoencoders
    In Mechanistic Interpretability Workshop: At the Thirty-Ninth Annual Conference on Neural Information Processing Systems, Oct 2025

2024

  1. Compositionality Unlocks Deep Interpretable Models
    In Connecting Low-Rank Representations in AI: At the 39th Annual AAAI Conference on Artificial Intelligence, Nov 2024
  2. Bilinear MLPs Enable Weight-Based Mechanistic Interpretability
    Michael T. Pearce, Thomas Dooms, Alice Rigg, Jose Oramas, and 1 more author
    In The Thirteenth International Conference on Learning Representations, Oct 2024