"vision-language modeling" Papers
3 papers found
Conference
CLIP-PCQA: Exploring Subjective-Aligned Vision-Language Modeling for Point Cloud Quality Assessment
Yating Liu, Yujie Zhang, Ziyu Shan et al.
AAAI 2025, arXiv:2501.10071
5 citations
FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation
Fan Yang, Yousong Zhu, Xin Li et al.
NeurIPS 2025, arXiv:2506.16806
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
Joya Chen, Yiqi Lin, Ziyun Zeng et al.
CVPR 2025, arXiv:2504.16030
4 citations