Paper "length extrapolation" Papers
3 papers found
Conference
DocMamba: Efficient Document Pre-training with State Space Model
Pengfei Hu, Zhenrong Zhang, Jiefeng Ma et al.
AAAI 2025paperarXiv:2409.11887
2
citations
Hansel: Output Length Controlling Framework for Large Language Models
Seoha Song, Junhyun Lee, Hyeonmok Ko
AAAI 2025paperarXiv:2412.14033
1
citations
Exploring Transformer Extrapolation
Zhen Qin, Yiran Zhong, Hui Deng
AAAI 2024paperarXiv:2307.10156
12
citations