α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Haiyang Xu
Haiyang Xu
OpenReview
13
papers
1,189
total citations
papers (13)
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
CVPR 2024
arXiv
614
citations
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
ICLR 2025
arXiv
243
citations
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
CVPR 2024
arXiv
121
citations
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training
ICCV 2023
arXiv
92
citations
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
CVPR 2022
arXiv
42
citations
Bayesian Diffusion Models for 3D Shape Reconstruction
CVPR 2024
arXiv
23
citations
Science-T2I: Addressing Scientific Illusions in Image Synthesis
CVPR 2025
arXiv
11
citations
BUS: Efficient and Effective Vision-Language Pre-Training with Bottom-Up Patch Summarization.
ICCV 2023
arXiv
9
citations
DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion
ICCV 2025
arXiv
8
citations
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
CVPR 2025
arXiv
7
citations
Learning Trajectory-Word Alignments for Video-Language Tasks
ICCV 2023
arXiv
7
citations
TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training
AAAI 2024
arXiv
6
citations
YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
ICCV 2025
arXiv
6
citations