Poster by Longxu Dou Papers
3 papers found
Conference
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
Xiangyan Liu, Jinjie Ni, Zijian Wu et al.
NEURIPS 2025arXiv:2504.13055
57
citations
RegMix: Data Mixture as Regression for Language Model Pre-training
Qian Liu, Xiaosen Zheng, Niklas Muennighoff et al.
ICLR 2025arXiv:2407.01492
105
citations
Unnatural Languages Are Not Bugs but Features for LLMs
Keyu Duan, Yiran Zhao, Zhili Feng et al.
ICML 2025arXiv:2503.01926
3
citations