α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yiyuan Zhang
Yiyuan Zhang
10
papers
491
total citations
papers (10)
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image Recognition
CVPR 2024
arXiv
243
citations
OneLLM: One Framework to Align All Modalities with Language
CVPR 2024
arXiv
201
citations
Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors
CVPR 2024
arXiv
28
citations
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
CVPR 2024
arXiv
12
citations
FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions
ICCV 2025
arXiv
3
citations
Breaking the Encoder Barrier for Seamless Video-Language Understanding
ICCV 2025
arXiv
3
citations
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
ICCV 2025
arXiv
1
citations
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification
ECCV 2022
0
citations
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities
ICCV 2025
0
citations
Learning Beyond Still Frames: Scaling Vision-Language Models with Video
ICCV 2025
0
citations