"model efficiency optimization" Papers
2 papers found
Conference
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
Chenze Shao, Fandong Meng, Jie Zhou
ICLR 2025, arXiv:2407.12665
5 citations
SLMRec: Distilling Large Language Models into Small for Sequential Recommendation
Wujiang Xu, Qitian Wu, Zujie Liang et al.
ICLR 2025 (oral), arXiv:2405.17890
18 citations