"large language model agents" Papers

15 papers found

CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation

Jie Liu, Pan Zhou, Yingjun Du et al.

ICLR 2025arXiv:2411.04679
8
citations

Conformal Information Pursuit for Interactively Guiding Large Language Models

Kwan Ho Ryan Chan, Yuyan Ge, Edgar Dobriban et al.

NEURIPS 2025arXiv:2507.03279
3
citations

Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems

Guibin Zhang, Yanwei Yue, Zhixun Li et al.

ICLR 2025oralarXiv:2410.02506
64
citations

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Kexun Zhang, Weiran Yao, Zuxin Liu et al.

ICLR 2025arXiv:2408.07060
40
citations

Interactive Speculative Planning: Enhance Agent Efficiency through Co-design of System and User Interface

Wenyue Hua, Mengting Wan, JAGANNATH VADREVU et al.

ICLR 2025arXiv:2410.00079
8
citations

Multi-LLM-Agents Debate - Performance, Efficiency, and Scaling Challenges

Hangfan Zhang, Zhiyao Cui, Qiaosheng Zhang et al.

ICLR 2025

Rational Decision-Making Agent with Learning Internal Utility Judgment

Yining Ye, Xin Cong, Shizuo Tian et al.

ICLR 2025
2
citations

ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents

Zhenyu Zhang, Tianyi Chen, Weiran Xu et al.

NEURIPS 2025arXiv:2510.23822
3
citations

Simulating Human-like Daily Activities with Desire-driven Autonomy

Yiding Wang, Yuxuan Chen, Fangwei Zhong et al.

ICLR 2025oralarXiv:2412.06435
17
citations

Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models

Christopher Chiu, Silviu Pitis, Mihaela van der Schaar

NEURIPS 2025oralarXiv:2510.10278

T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning

NEURIPS 2025arXiv:2505.16986
3
citations

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Yuzhe YANG, Yifei Zhang, Minghao Wu et al.

NEURIPS 2025oralarXiv:2502.01506
21
citations

DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model

Lirui Zhao, Yue Yang, Kaipeng Zhang et al.

CVPR 2024arXiv:2404.01342
7
citations

STEER: Assessing the Economic Rationality of Large Language Models

Narun Raman, Taylor Lundy, Samuel Joseph Amouyal et al.

ICML 2024arXiv:2402.09552
22
citations

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Alexandre Drouin, Maxime Gasse, Massimo Caccia et al.

ICML 2024arXiv:2403.07718
141
citations