"multi-modal prompts" Papers
3 papers found
Conference
Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation
Xiaoqi Li, Lingyun Xu, Mingxu Zhang et al.
CVPR 2025arXiv:2505.02166
5
citations
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie Yang, Bingliang Li, Ailing Zeng et al.
CVPR 2024arXiv:2406.07221
35
citations
X-Pose: Detecting Any Keypoints
Jie Yang, AILING ZENG, Ruimao Zhang et al.
ECCV 2024arXiv:2310.08530
14
citations