"code generation benchmarks" Papers
5 papers found
Conference
ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments
Hojae Han, seung-won hwang, Rajhans Samdani et al.
ICLR 2025, arXiv:2502.19852
13 citations
EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code
Yuhao Qing, Boyu Zhu, Mingzhe Du et al.
NeurIPS 2025, arXiv:2505.13004
15 citations
To Code or Not To Code? Exploring Impact of Code in Pre-training
Viraat Aryabumi, Yixuan Su, Raymond Ma et al.
ICLR 2025, arXiv:2408.10914
44 citations
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Shusheng Xu, Wei Fu, Jiaxuan Gao et al.
ICML 2024, arXiv:2404.10719
253 citations
Self-Infilling Code Generation
Lin Zheng, Jianbo Yuan, Zhi Zhang et al.
ICML 2024, arXiv:2311.17972
5 citations