ResearchAlpha Leak

Xiaoyi Bao

Data sourced from Semantic Scholar

Built: Feb 15, 2026, 4:51 AM AMS

8 papers · 247 total citations

Papers (8)

DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation

Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Videos Generation

NeurIPS 2025 · arXiv

CoReS: Orchestrating the Dance of Reasoning and Segmentation

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface

NeurIPS 2025 · arXiv

Aligned Better, Listen Better for Audio-Visual Large Language Models

DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding

CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training