Highlight "vision transformer" Papers
2 papers found
Conference
Question Aware Vision Transformer for Multimodal Reasoning
Roy Ganz, Yair Kittenplon, Aviad Aberdam et al.
CVPR 2024highlightarXiv:2402.05472
37
citations
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions
Chunlong Xia, Xinliang Wang, Feng Lv et al.
CVPR 2024highlightarXiv:2403.07392
133
citations