Papers by Zihan Dong
2 papers found
Conference
AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play
Ran Xu, Yuchen Zhuang, Zihan Dong et al.
NeurIPS 2025 (spotlight), arXiv:2509.24193
5 citations
Mitigating Heterogeneous Token Overfitting in LLM Knowledge Editing
Tianci Liu, Ruirui Li, Zihan Dong et al.
ICML 2025, arXiv:2502.00602
6 citations