"visual recognition tasks" Papers
2 papers found
Conference
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs
jiarui zhang, Mahyar Khayatkhoei, Prateek Chhikara et al.
ICLR 2025arXiv:2502.17422
88
citations
Weighted Multi-Prompt Learning with Description-free Large Language Model Distillation
Sua Lee, Kyubum Shin, Jung Ho Park
ICLR 2025arXiv:2507.07147
1
citations