Poster by Yuting Ning Papers
2 papers found
Conference
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Boyu Gou, Zanming Huang, Yuting Ning et al.
NEURIPS 2025arXiv:2506.21506
21
citations
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Ziru Chen, Shijie Chen, Yuting Ning et al.
ICLR 2025arXiv:2410.05080
61
citations