α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yan Song
Yan Song
7
papers
79
total citations
papers (7)
ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning
NEURIPS 2025
arXiv
40
citations
Efficient Reinforcement Learning with Large Language Model Priors
ICLR 2025
arXiv
21
citations
ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning
NEURIPS 2025
arXiv
15
citations
Reinforcement Learning from Imperfect Corrective Actions and Proxy Rewards
ICLR 2025
arXiv
3
citations
Agreement aware and dissimilarity oriented GLOM
ICCV 2025
0
citations
Bootstrapping Large Language Models for Radiology Report Generation
AAAI 2024
0
citations
Learning Semantic Relationship Among Instances for Image-Text Matching
CVPR 2023
0
citations