Poster "representation editing" Papers
2 papers found
Conference
Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
Yisong Xiao, Aishan Liu, Siyuan Liang et al.
NEURIPS 2025arXiv:2510.01243
2
citations
Re-Imagining Multimodal Instruction Tuning: A Representation View
Yiyang Liu, James Liang, Ruixiang Tang et al.
ICLR 2025arXiv:2503.00723
13
citations