Poster "zero-shot inference" Papers
12 papers found
Conference
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark
Hanlei Zhang, zhuohang li, Hua Xu et al.
NEURIPS 2025arXiv:2504.16427
2
citations
CE-FAM: Concept-Based Explanation via Fusion of Activation Maps
Michihiro Kuroki, Toshihiko Yamasaki
ICCV 2025arXiv:2509.23849
CircuitFusion: Multimodal Circuit Representation Learning for Agile Chip Design
Wenji Fang, Shang Liu, Jing Wang et al.
ICLR 2025arXiv:2505.02168
14
citations
Eliminating Position Bias of Language Models: A Mechanistic Approach
Ziqi Wang, Hanlin Zhang, Xiner Li et al.
ICLR 2025arXiv:2407.01100
50
citations
Large (Vision) Language Models are Unsupervised In-Context Learners
Artyom Gadetsky, Andrei Atanov, Yulun Jiang et al.
ICLR 2025arXiv:2504.02349
3
citations
MP-Mat: A 3D-and-Instance-Aware Human Matting and Editing Framework with Multiplane Representation
Siyi Jiao, Wenzheng Zeng, Yerong Li et al.
ICLR 2025arXiv:2504.14606
1
citations
Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
Lei Wang, Senmao Li, Fei Yang et al.
CVPR 2025arXiv:2505.03097
2
citations
Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints
Utkarsh Utkarsh, Pengfei Cai, Alan Edelman et al.
NEURIPS 2025arXiv:2506.04171
17
citations
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
Byeongjun Park, Hyojun Go, Hyelin Nam et al.
ICCV 2025arXiv:2503.12024
5
citations
Tree of Preferences for Diversified Recommendation
Hanyang Yuan, Ning Tang, Tongya Zheng et al.
NEURIPS 2025arXiv:2601.02386
Improving Medical Multi-modal Contrastive Learning with Expert Annotations
Yogesh Kumar, Pekka Marttinen
ECCV 2024arXiv:2403.10153
23
citations
Probing the 3D Awareness of Visual Foundation Models
Mohamed El Banani, Amit Raj, Kevis-kokitsi Maninis et al.
CVPR 2024arXiv:2404.08636
138
citations