"instruction-following benchmarks" Papers
2 papers found
Conference
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates
Xiaosen Zheng, Tianyu Pang, Chao Du et al.
ICLR 2025arXiv:2410.07137
25
citations
Online Preference Alignment for Language Models via Count-based Exploration
Chenjia Bai, Yang Zhang, Shuang Qiu et al.
ICLR 2025arXiv:2501.12735
20
citations