α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
John Hughes
John Hughes
5
papers
363
total citations
papers (5)
Debating with More Persuasive LLMs Leads to More Truthful Answers
ICML 2024
arXiv
212
citations
Hierarchical Quantized Autoencoders
NEURIPS 2020
arXiv
78
citations
Looking Inward: Language Models Can Learn About Themselves by Introspection
ICLR 2025
arXiv
44
citations
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
ICLR 2025
arXiv
24
citations
Why Do Some Language Models Fake Alignment While Others Don't?
NEURIPS 2025
arXiv
5
citations