Oliver Zhang
3 papers · 2,595 total citations
Papers (3)
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
ICLR 2025 · arXiv · 2,226 citations
The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning
ICML 2024 · arXiv · 333 citations
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
NeurIPS 2025 · arXiv · 36 citations