α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yujie Zhong
Yujie Zhong
24
papers
2,162
total citations
papers (24)
TOOD: Task-Aligned One-Stage Object Detection
ICCV 2021
arXiv
1,096
citations
PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images
ECCV 2022
arXiv
210
citations
TriDet: Temporal Action Detection With Relative Boundary Modeling
CVPR 2023
arXiv
194
citations
Exploring Classification Equilibrium in Long-Tailed Object Detection
ICCV 2021
arXiv
113
citations
DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
CVPR 2022
arXiv
104
citations
ReAct: Temporal Action Detection with Relational Queries
ECCV 2022
arXiv
88
citations
Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
CVPR 2024
arXiv
67
citations
Adaptive Sparse Pairwise Loss for Object Re-Identification
CVPR 2023
arXiv
61
citations
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network
ICCV 2023
arXiv
46
citations
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
CVPR 2024
arXiv
34
citations
Cross-Architecture Self-Supervised Video Representation Learning
CVPR 2022
arXiv
30
citations
AeDet: Azimuth-Invariant Multi-View 3D Object Detection
CVPR 2023
arXiv
27
citations
Representation Sharing for Fast Object Detector Search and Beyond
ECCV 2020
arXiv
17
citations
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
ECCV 2024
arXiv
15
citations
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
ICCV 2025
arXiv
13
citations
RoboTron-Drive: All-in-One Large Multimodal Model for Autonomous Driving
ICCV 2025
arXiv
12
citations
Mr. DETR: Instructive Multi-Route Training for Detection Transformers
CVPR 2025
12
citations
CO-MOT: Boosting End-to-end Transformer-based Multi-Object Tracking via Coopetition Label Assignment and Shadow Sets
ICLR 2025
7
citations
DisTime: Distribution-based Time Representation for Video Large Language Models
ICCV 2025
arXiv
5
citations
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver
CVPR 2025
4
citations
RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case
ICCV 2025
arXiv
3
citations
Advancing Visual Large Language Model for Multi-granular Versatile Perception
ICCV 2025
arXiv
2
citations
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
CVPR 2025
arXiv
2
citations
Layer-wise Vision Injection with Disentangled Attention for Efficient LVLMs
ICCV 2025
0
citations