Teun van der Weij
2 papers, 74 total citations
Papers (2)
AI Sandbagging: Language Models can Strategically Underperform on Evaluations
ICLR 2025, arXiv, 67 citations
Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models
NeurIPS 2025, arXiv, 7 citations