"bidirectional cross-attention" Papers
2 papers found
Conference
EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering
Junjue Wang, Zhuo Zheng, Zihang Chen et al.
AAAI 2024paperarXiv:2312.12222
53
citations
LookupViT: Compressing visual information to a limited number of tokens
Rajat Koner, Gagan Jain, Sujoy Paul et al.
ECCV 2024arXiv:2407.12753
16
citations