α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Xiaoyi Bao
Xiaoyi Bao
8
papers
247
total citations
papers (8)
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
AAAI 2025
arXiv
146
citations
Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation
AAAI 2024
arXiv
33
citations
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Videos Generation
NEURIPS 2025
arXiv
26
citations
CoReS: Orchestrating the Dance of Reasoning and Segmentation
ECCV 2024
arXiv
18
citations
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
NEURIPS 2025
arXiv
14
citations
Aligned Better, Listen Better for Audio-Visual Large Language Models
ICLR 2025
arXiv
9
citations
DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding
ICCV 2025
arXiv
1
citations
CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training
CVPR 2024
0
citations