Paper "long-range dependencies" Papers
7 papers found
Conference
Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution
Karam Park, Jae Woong Soh, Nam Ik Cho
AAAI 2025paperarXiv:2501.15774
10
citations
Large Language Model Meets Graph Neural Network in Knowledge Distillation
Shengxiang Hu, Guobing Zou, Song Yang et al.
AAAI 2025paperarXiv:2402.05894
14
citations
Pamba: Enhancing Global Interaction in Point Clouds via State Space Model
Zhuoyuan Li, Yubo Ai, Jiahao Lu et al.
AAAI 2025paperarXiv:2406.17442
6
citations
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
Meng Lou, Yunxiang Fu, Yizhou Yu
AAAI 2025paperarXiv:2409.09649
28
citations
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning
Wenjin Hou, Dingjie Fu, Kun Li et al.
AAAI 2025paperarXiv:2408.14868
2
citations
Cached Transformers: Improving Transformers with Differentiable Memory Cached
Zhaoyang Zhang, Wenqi Shao, Yixiao Ge et al.
AAAI 2024paperarXiv:2312.12742
5
citations
S2WAT: Image Style Transfer via Hierarchical Vision Transformer Using Strips Window Attention
Chiyu Zhang, Xiaogang Xu, Lei Wang et al.
AAAI 2024paperarXiv:2210.12381
52
citations