Stuart Russell
6 papers, 221 total citations
Papers (6)
Image Hijacks: Adversarial Images can Control Generative Models at Runtime
ICML 2024, arXiv, 142 citations
AI Alignment with Changing and Influenceable Reward Functions
ICML 2024, arXiv, 43 citations
Monitoring Latent World States in Language Models with Propositional Probes
ICLR 2025, arXiv, 22 citations
Diffusion On Syntax Trees For Program Synthesis
ICLR 2025, arXiv, 10 citations
AssistanceZero: Scalably Solving Assistance Games
ICML 2025, arXiv, 4 citations
Position: Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback
ICML 2024, 0 citations