"attention mechanism efficiency" Papers
2 papers found
Conference
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
Philipp Becker, Abhinav Mehrotra, Ruchika Chavhan et al.
ICCV 2025arXiv:2503.16726
5
citations
Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation
Thomas Merth, Qichen Fu, Mohammad Rastegari et al.
ICML 2024arXiv:2404.06910
13
citations