Papers by Yinfang Chen
2 papers found
Conference
ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks
Saurabh Jha, Rohan Arora, Yuji Watanabe et al.
ICML 2025 (oral), arXiv:2502.05352
18 citations
STRATUS: A Multi-agent System for Autonomous Reliability Engineering of Modern Clouds
Yinfang Chen, Jiaqi Pan, Jackson Clark et al.
NeurIPS 2025, arXiv:2506.02009
4 citations