Papers by Sumeet Motwani
2 papers found
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites
Div Garg, Diego Caples, Andis Draguns et al.
NeurIPS 2025, arXiv:2504.11543
20 citations
STARC: A General Framework For Quantifying Differences Between Reward Functions
Joar Skalse, Lucy Farnik, Sumeet Motwani et al.
ICLR 2024, arXiv:2309.15257
12 citations