"sliding window attention" Papers
4 papers found
Enhancing Image Restoration Transformer via Adaptive Translation Equivariance
JiaKui Hu, Zhengjian Yao, Lujia Jin et al.
ICCV 2025 | arXiv:2506.18520
3 citations
Robust Tracking via Mamba-based Context-aware Token Learning
Jinxia Xie, Bineng Zhong, Qihua Liang et al.
AAAI 2025 | paper | arXiv:2412.13611
26 citations
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Liliang Ren, Yang Liu, Yadong Lu et al.
ICLR 2025 | arXiv:2406.07522
122 citations
Simple linear attention language models balance the recall-throughput tradeoff
Simran Arora, Sabri Eyuboglu, Michael Zhang et al.
ICML 2024 | spotlight | arXiv:2402.18668
140 citations