α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Atticus Geiger
Atticus Geiger
6
papers
626
total citations
papers (6)
Causal Abstractions of Neural Networks
NEURIPS 2021
arXiv
315
citations
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
ICML 2025
arXiv
118
citations
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca
NEURIPS 2023
arXiv
112
citations
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior
NEURIPS 2022
arXiv
59
citations
MIB: A Mechanistic Interpretability Benchmark
ICML 2025
arXiv
14
citations
How Do Transformers Learn Variable Binding in Symbolic Programs?
ICML 2025
arXiv
8
citations