Poster "plug-and-play integration" Papers
2 papers found
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
Yang Liu, Ming Ma, Xiaomin Yu et al.
NeurIPS 2025 arXiv:2505.12448
21 citations
Your Text Encoder Can Be An Object-Level Watermarking Controller
Naresh Kumar Devulapally, Mingzhen Huang, Vishal Asnani et al.
ICCV 2025 arXiv:2503.11945