Makarand Tapaswi
2
Affiliations
IIIT Hyderabad
Wadhwani AI
11
papers
725
total citations
papers (11)
Think Global, Act Local: Dual-Scale Graph Transformer for Vision-and-Language Navigation
CVPR 2022
arXiv
213
citations
Airbert: In-Domain Pretraining for Vision-and-Language Navigation
ICCV 2021
arXiv
170
citations
Language Conditioned Spatial Relation Reasoning for 3D Object Grounding
NeurIPS 2022
arXiv
133
citations
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
ECCV 2022
arXiv
59
citations
Learning Interactions and Relationships Between Movie Characters
CVPR 2020
arXiv
58
citations
Test of Time: Instilling Video-Language Models With a Sense of Time
CVPR 2023
arXiv
49
citations
Grounded Video Situation Recognition
NeurIPS 2022
arXiv
16
citations
How You Feelin'? Learning Emotions and Mental States in Movie Scenes
CVPR 2023
arXiv
11
citations
MICap: A Unified Model for Identity-Aware Movie Descriptions
CVPR 2024
arXiv
7
citations
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment
CVPR 2025
arXiv
6
citations
Previously on ... From Recaps to Story Summarization
CVPR 2024
3
citations