Amanpreet Singh
12 papers
4,230 total citations
papers (12)
FLAVA: A Foundational Language and Vision Alignment Model
CVPR 2022
arXiv
883 citations
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
NeurIPS 2020
arXiv
792 citations
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality
CVPR 2022
arXiv
535 citations
TextCaps: a Dataset for Image Captioning with Reading Comprehension
ECCV 2020
arXiv
515 citations
UniT: Multimodal Multitask Learning With a Unified Transformer
ICCV 2021
arXiv
346 citations
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents
NeurIPS 2023
arXiv
323 citations
Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA
CVPR 2020
arXiv
226 citations
Generative Representational Instruction Tuning
ICLR 2025
arXiv
222 citations
TextOCR: Towards Large-Scale End-to-End Reasoning for Arbitrary-Shaped Scene Text
CVPR 2021
arXiv
222 citations
Human-Adversarial Visual Question Answering
NeurIPS 2021
arXiv
69 citations
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
ECCV 2020
arXiv
62 citations
Unsupervised Vision-and-Language Pre-Training via Retrieval-Based Multi-Granular Alignment
CVPR 2022
arXiv
35 citations