"question answering tasks" Papers
3 papers found
Conference
Language Models Learn to Mislead Humans via RLHF
Jiaxin Wen, Ruiqi Zhong, Akbir Khan et al.
ICLR 2025arXiv:2409.12822
78
citations
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Yutao Zhu, Zhaoheng Huang, Zhicheng Dou et al.
AAAI 2025paperarXiv:2405.19670
9
citations
Thinker: Learning to Think Fast and Slow
Stephen Chung, Wenyu Du, Jie Fu
NEURIPS 2025arXiv:2505.21097
8
citations