Christopher Potts
14 papers, 3,047 total citations

Papers (14)
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models | ICLR 2025 | arXiv | 2,226 citations
Causal Abstractions of Neural Networks | NeurIPS 2021 | arXiv | 315 citations
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders | ICML 2025 | arXiv | 118 citations
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca | NeurIPS 2023 | arXiv | 112 citations
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval | NeurIPS 2021 | arXiv | 68 citations
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking | NeurIPS 2021 | arXiv | 64 citations
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior | NeurIPS 2022 | arXiv | 59 citations
Decrypting Cryptic Crosswords: Semantically Complex Wordplay Puzzles as a Target for NLP | NeurIPS 2021 | arXiv | 22 citations
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models | ICLR 2025 | arXiv | 16 citations
Base Models Beat Aligned Models at Randomness and Creativity | COLM 2025 | arXiv | 16 citations
Bayesian scaling laws for in-context learning | COLM 2025 | arXiv | 13 citations
GIO: Gradient Information Optimization for Training Dataset Selection | ICLR 2024 | arXiv | 11 citations
ContextRef: Evaluating Referenceless Metrics for Image Description Generation | ICLR 2024 | arXiv | 5 citations
Blackbox Model Provenance via Palimpsestic Membership Inference | NeurIPS 2025 | arXiv | 2 citations