by jun chen Papers
2 papers found
Conference
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time
Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta et al.
ECCV 2024arXiv:2407.01851
23
citations
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
Deyao Zhu, jun chen, Xiaoqian Shen et al.
ICLR 2024arXiv:2304.10592
2806
citations