Long Phan
4 papers, 1,154 total citations
papers (4)
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
ICML 2024 | arXiv | 802 citations

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
NeurIPS 2022 | arXiv | 203 citations

Tamper-Resistant Safeguards for Open-Weight LLMs
ICLR 2025 | arXiv | 113 citations

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
NeurIPS 2025 | arXiv | 36 citations