by yikai zhang Papers
4 papers found
Conference
ARIA: Training Language Agents with Intention-driven Reward Aggregation
Ruihan Yang, yikai zhang, Aili Chen et al.
NEURIPS 2025spotlightarXiv:2506.00539
1
citations
ARM: Adaptive Reasoning Model
Tinghui Zhu, Jian Xie, yikai zhang et al.
NEURIPS 2025spotlight
Multi-agent KTO: Enhancing Strategic Interactions of Large Language Model in Language Game
Rong Ye, Yongxin Zhang, yikai zhang et al.
NEURIPS 2025
The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement
Ruihan Yang, Fanghua Ye, Jian Li et al.
NEURIPS 2025arXiv:2503.16024
11
citations