Xiangyu Yue
27
papers
2,093
total citations
papers (27)
PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation
CVPR 2020
arXiv
552
citations
Unsupervised Point Cloud Pre-Training via Occlusion Completion
ICCV 2021
arXiv
305
citations
Video-R1: Reinforcing Video Reasoning in MLLMs
NeurIPS 2025
arXiv
257
citations
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
CVPR 2024
arXiv
243
citations
OneLLM: One Framework to Align All Modalities with Language
CVPR 2024
arXiv
201
citations
Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation
CVPR 2021
arXiv
186
citations
Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models
ICCV 2023
arXiv
115
citations
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models
ECCV 2022
arXiv
62
citations
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
CVPR 2025
arXiv
46
citations
Space Engage: Collaborative Space Supervision for Contrastive-Based Semi-Supervised Semantic Segmentation
ICCV 2023
arXiv
16
citations
RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation
ECCV 2022
arXiv
16
citations
Beating Backdoor Attack at Its Own Game
ICCV 2023
arXiv
15
citations
Unleashing Vecset Diffusion Model for Fast Shape Generation
ICCV 2025
arXiv
14
citations
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
CVPR 2024
arXiv
12
citations
Chimera: Improving Generalist Model with Domain-Specific Experts
ICCV 2025
arXiv
9
citations
RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models
CVPR 2025
arXiv
9
citations
SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance
CVPR 2025
arXiv
8
citations
SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
ICCV 2025
arXiv
7
citations
From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision
ICCV 2025
arXiv
6
citations
Training Matting Models Without Alpha Labels
AAAI 2025
arXiv
4
citations
FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions
ICCV 2025
arXiv
3
citations
Breaking the Encoder Barrier for Seamless Video-Language Understanding
ICCV 2025
arXiv
3
citations
CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation
ICCV 2025
arXiv
2
citations
HypDAE: Hyperbolic Diffusion Autoencoders for Hierarchical Few-shot Image Generation
ICCV 2025
arXiv
1
citation
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
CVPR 2025
arXiv
1
citation
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities
ICCV 2025
0
citations
Learning Beyond Still Frames: Scaling Vision-Language Models with Video
ICCV 2025
0
citations