α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Mengdan Zhang
Mengdan Zhang
Google Scholar
OpenReview
6
h-index
12
papers
2,421
total citations
papers (12)
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
NEURIPS 2025
arXiv
1,277
citations
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
CVPR 2025
arXiv
917
citations
Aligning and Prompting Everything All at Once for Universal Visual Perception
CVPR 2024
arXiv
69
citations
Multi-modal Queried Object Detection in the Wild
NEURIPS 2023
arXiv
49
citations
ARM: Any-Time Super-Resolution Method
ECCV 2022
arXiv
35
citations
Efficient Decoder-Free Object Detection with Transformers
ECCV 2022
arXiv
20
citations
Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal
ECCV 2024
arXiv
18
citations
VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model
NEURIPS 2025
17
citations
Dive Deeper Into Box for Object Detection
ECCV 2020
arXiv
12
citations
Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
ICLR 2025
arXiv
5
citations
Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs
NEURIPS 2025
arXiv
2
citations
Learning To Know Where To See: A Visibility-Aware Approach for Occluded Person Re-Identification
ICCV 2021
0
citations