"superalignment" Papers
3 papers found
Conference
Limitations of refinement methods for weak to strong generalization
Seamus Somerstep, Yaacov Ritov, Mikhail Yurochkin et al.
COLM 2025paper
1
citations
Robust SuperAlignment: Weak-to-Strong Robustness Generalization for Vision-Language Models
Junhao Dong, Cong Zhang, Xinghua Qu et al.
NEURIPS 2025spotlight
Weak-to-Strong Generalization Through the Data-Centric Lens
Changho Shin, John Cooper, Frederic Sala
ICLR 2025arXiv:2412.03881
14
citations