Poster "automated question generation" Papers
2 papers found
Conference
ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities
Ezra Karger, Houtan Bastani, Chen Yueh-Han et al.
ICLR 2025arXiv:2409.19839
34
citations
STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models
Narun Raman, Taylor Lundy, Thiago Amin et al.
NEURIPS 2025arXiv:2502.13119
3
citations