α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Sayan Nag
Sayan Nag
8
papers
232
total citations
papers (8)
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
ICCV 2023
arXiv
138
citations
Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model
CVPR 2024
arXiv
51
citations
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models
CVPR 2024
arXiv
18
citations
AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs
ICCV 2025
arXiv
10
citations
AURELIA: Test-time Reasoning Distillation in Audio-Visual LLMs
ICCV 2025
arXiv
6
citations
MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
NEURIPS 2025
arXiv
5
citations
Localizing Knowledge in Diffusion Transformers
NEURIPS 2025
arXiv
2
citations
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception
ICCV 2025
arXiv
2
citations