by Naiqiang Tan Papers
3 papers found
Conference
Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization
Haotian Luo, Haiying He, Yibo Wang et al.
NEURIPS 2025arXiv:2504.21659
18
citations
Bag of Tricks for Inference-time Computation of LLM Reasoning
Fan LIU, Wen-Shuo Chao, Naiqiang Tan et al.
NEURIPS 2025arXiv:2502.07191
12
citations
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
Yibo Wang, Tiansheng Huang, Li Shen et al.
NEURIPS 2025arXiv:2501.18100
14
citations