Zhenfei Yin
9 papers · 527 total citations
papers (9)
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark
NeurIPS 2023 · arXiv · 207 citations

Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models
ECCV 2024 · arXiv · 95 citations

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
CVPR 2024 · arXiv · 77 citations

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models
CVPR 2025 · arXiv · 68 citations

Benchmarking Omni-Vision Representation through the Lens of Visual Realms
ECCV 2022 · arXiv · 37 citations

VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
ICCV 2025 · arXiv · 17 citations

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints
ICCV 2025 · arXiv · 13 citations

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation
ECCV 2022 · arXiv · 7 citations

B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens
ICCV 2025 · arXiv · 6 citations