"llm-as-a-judge paradigm" Papers
3 papers found
Distributional LLM-as-a-Judge
Luyu Chen, Zeyu Zhang, Haoran Tan et al.
NEURIPS 2025
Limits to scalable evaluation at the frontier: LLM as judge won’t beat twice the data
Florian Eddie Dorner, Vivian Nastl, Moritz Hardt
ICLR 2025
24 citations
Validating LLM-as-a-Judge Systems under Rating Indeterminacy
Luke Guerdan, Solon Barocas, Kenneth Holstein et al.
NEURIPS 2025
arXiv:2503.05965
7 citations