α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Ran Xu
Ran Xu
21
papers
1,366
total citations
papers (21)
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding
CVPR 2023
arXiv
307
citations
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
NEURIPS 2023
arXiv
202
citations
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024
arXiv
198
citations
HIVE: Harnessing Human Feedback for Instructional Visual Editing
CVPR 2024
arXiv
168
citations
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
ICLR 2024
arXiv
108
citations
Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
ECCV 2022
arXiv
105
citations
Use All the Labels: A Hierarchical Multi-Label Contrastive Learning Framework
CVPR 2022
arXiv
103
citations
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos
CVPR 2021
arXiv
49
citations
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
CVPR 2024
arXiv
27
citations
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
ICCV 2023
arXiv
27
citations
Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation
ICCV 2023
arXiv
23
citations
Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations
CVPR 2023
arXiv
19
citations
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
ECCV 2024
arXiv
14
citations
Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting
NEURIPS 2023
arXiv
6
citations
Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting
ICCV 2025
arXiv
4
citations
Burn after Reading: Online Adaptation for Cross-Domain Streaming Data
ECCV 2022
arXiv
4
citations
Trust but Verify: Programmatic VLM Evaluation in the Wild
ICCV 2025
arXiv
2
citations
SmartAdapt: Multi-Branch Object Detection Framework for Videos on Mobiles
CVPR 2022
0
citations
Text2Data: Low-Resource Data Generation with Textual Control
AAAI 2025
arXiv
0
citations
Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue
ICCV 2025
0
citations
Position: TrustLLM: Trustworthiness in Large Language Models
ICML 2024
0
citations