Papers by Qineng Wang
2 papers found
Conference
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Rui Yang, Hanyang(Jeremy) Chen, Junyu Zhang et al.
ICML 2025, oral, arXiv:2502.09560
98 citations
VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
Kangrui Wang, Pingyue Zhang, Zihan Wang et al.
NeurIPS 2025, arXiv:2510.16907
12 citations