"fine-grained visual understanding" Papers
3 papers found
Conference
DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding
Geng Li, Jinglin Xu, Yunzhen Zhao et al.
CVPR 2025highlightarXiv:2504.14920
29
citations
LLaFEA: Frame-Event Complementary Fusion for Fine-Grained Spatiotemporal Understanding in LMMs
Hanyu Zhou, Gim Hee Lee
ICCV 2025arXiv:2503.06934
3
citations
Reverse Region-to-Entity Annotation for Pixel-Level Visual Entity Linking
Zhengfei Xu, Sijia Zhao, Yanchao Hao et al.
AAAI 2025paperarXiv:2412.13614