"hardware-aware implementation" Papers
2 papers found
Conference
Theory, Analysis, and Best Practices for Sigmoid Self-Attention
Jason Ramapuram, Federico Danieli, Eeshan Gunesh Dhekane et al.
ICLR 2025arXiv:2409.04431
39
citations
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao, Xinggang Wang, Lianghui Zhu et al.
AAAI 2025paperarXiv:2405.18425
10
citations