Oral "visual question answering" Papers
3 papers found
Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360° Firefighting Video
Aditi Tiwari, Farzaneh Masoud, Dac Nguyen et al.
NeurIPS 2025 (oral)
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
Xin Su, Man Luo, Kris Pan et al.
ICML 2025 (oral), arXiv:2406.19593, 6 citations
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data
Jeremy Irvin, Emily Liu, Joyce Chen et al.
ICLR 2025 (oral), arXiv:2410.06234, 45 citations