"long sequence processing" Papers
4 papers found
Conference
Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction
Jeffrey Willette, Heejun Lee, Sung Ju Hwang
NEURIPS 2025arXiv:2505.11254
3
citations
Efficient Time Series Processing for Transformers and State-Space Models through Token Merging
Leon Götz, Marcel Kollovieh, Stephan Günnemann et al.
ICML 2025arXiv:2405.17951
5
citations
Tensor Product Attention Is All You Need
Yifan Zhang, Yifeng Liu, Huizhuo Yuan et al.
NEURIPS 2025spotlightarXiv:2501.06425
34
citations
Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences
Zicheng Liu, Siyuan Li, Li Wang et al.
ICML 2024arXiv:2406.08128
10
citations