by Xia Song Papers
2 papers found
Conference
POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Batuhan K. Karaman, ishmam zabir, Alon Benhaim et al.
ICML 2025arXiv:2410.12999
3
citations
Scaling Optimal LR Across Token Horizons
Johan Bjorck, Alon Benhaim, Vishrav Chaudhary et al.
ICLR 2025arXiv:2409.19913
22
citations