Poster "long sequence modeling" Papers
9 papers found
Conference
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification
Jingwei Zhang, Anh Tien Nguyen, Xi Han et al.
CVPR 2025arXiv:2412.00678
23
citations
Training Free Exponential Context Extension via Cascading KV Cache
Jeff Willette, Heejun Lee, Youngwan Lee et al.
ICLR 2025arXiv:2406.17808
3
citations
Why RoPE Struggles to Maintain Long-Term Decay in Long Sequences?
Wei Shen, Chao Yin, Yuliang Liu et al.
ICLR 2025
ZETA: Leveraging $Z$-order Curves for Efficient Top-$k$ Attention
Qiuhao Zeng, Jierui Huang, Peng Lu et al.
ICLR 2025arXiv:2501.14577
5
citations
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Yiran Ding, Li Lyna Zhang, Chengruidong Zhang et al.
ICML 2024arXiv:2402.13753
278
citations
Motion Mamba: Efficient and Long Sequence Motion Generation
Zeyu Zhang, Akide Liu, Ian Reid et al.
ECCV 2024arXiv:2403.07487
114
citations
State-Free Inference of State-Space Models: The *Transfer Function* Approach
Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro et al.
ICML 2024
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
Zhen Qin, Weigao Sun, Dong Li et al.
ICML 2024arXiv:2405.17381
24
citations
xT: Nested Tokenization for Larger Context in Large Images
Ritwik Gupta, Shufan Li, Tyler Zhu et al.
ICML 2024arXiv:2403.01915
8
citations