Mixture-of-Agents Enhances Large Language Model Capabilities

294citations

arXiv:2406.04692 Project

294

citations

#23

in ICLR 2025

of 3827 papers

Top Authors

Data Points

Top Authors

Junlin Wang Jue Wang Ben Athiwaratkun Ce Zhang James Y Zou

Abstract

Recent advances in large language models (LLMs) demonstrate substantial capabilities in natural language understanding and generation tasks. With the growing number of LLMs, how to harness the collective expertise of multiple LLMs is an exciting open direction. Toward this goal, we propose a new approach that leverages the collective strengths of multiple LLMs through a Mixture-of-Agents (MoA) methodology. In our approach, we construct a layered MoA architecture wherein each layer comprises multiple LLM agents. Each agent takes all the outputs from agents in the previous layer as auxiliary information in generating its response. MoA models achieves state-of-art performance on AlpacaEval 2.0, Arena-Hard, MT-Bench, and FLASK, surpassing GPT-4 Omni. For example, our MoA using only open-source LLMs achieves a score of 65.1% on AlpacaEval 2.0 compared to 57.5% by GPT-4 Omni.

Citation History

Jan 25, 2026

274

Feb 13, 2026

294+20

Feb 13, 2026

294

Feb 13, 2026

294