"gating mechanism" Papers
7 papers found
Balancing Act: Diversity and Consistency in Large Language Model Ensembles
Ahmed Abdulaal, Chen Jin, Nina Montaña-Brown et al.
ICLR 2025
Hybrid Latent Reasoning via Reinforcement Learning
Zhenrui Yue, Bowen Jin, Huimin Zeng et al.
NEURIPS 2025
arXiv:2505.18454
8 citations
On the Expressive Power of Mixture-of-Experts for Structured Complex Tasks
Mingze Wang, Weinan E
NEURIPS 2025 (spotlight)
arXiv:2505.24205
1 citation
SeerAttention: Self-distilled Attention Gating for Efficient Long-context Prefilling
Yizhao Gao, Zhichen Zeng, DaYou Du et al.
NEURIPS 2025
Toward Understanding In-context vs. In-weight Learning
Bryan Chan, Xinyi Chen, Andras Gyorgy et al.
ICLR 2025
arXiv:2410.23042
15 citations
Enhancing Cross-Modal Fine-Tuning with Gradually Intermediate Modality Generation
Lincan Cai, Shuang Li, Wenxuan Ma et al.
ICML 2024
arXiv:2406.09003
4 citations
Pi-DUAL: Using privileged information to distinguish clean from noisy labels
Ke Wang, Guillermo Ortiz-Jimenez, Rodolphe Jenatton et al.
ICML 2024
arXiv:2310.06600
5 citations