Yi Jiang
27 papers, 5,611 total citations
papers (27)
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
ECCV 2022
arXiv
2,026
citations
Sparse R-CNN: End-to-End Object Detection With Learnable Proposals
CVPR 2021
arXiv
1,368
citations
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion
CVPR 2022
arXiv
346
citations
Universal Instance Perception As Object Discovery and Retrieval
CVPR 2023
arXiv
237
citations
Language As Queries for Referring Video Object Segmentation
CVPR 2022
arXiv
223
citations
Infinity∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
CVPR 2025
arXiv
201
citations
Towards Grand Unification of Object Tracking
ECCV 2022
arXiv
171
citations
In Defense of Online Models for Video Instance Segmentation
ECCV 2022
arXiv
138
citations
SeqFormer: Sequential Transformer for Video Instance Segmentation
ECCV 2022
arXiv
135
citations
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
CVPR 2025
arXiv
128
citations
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
ECCV 2024
arXiv
107
citations
Learning to Segment the Tail
CVPR 2020
arXiv
89
citations
General Object Foundation Model for Images and Videos at Scale
CVPR 2024
arXiv
82
citations
UniTok: A Unified Tokenizer for Visual Generation and Understanding
NeurIPS 2025
arXiv
79
citations
CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
NeurIPS 2023
arXiv
73
citations
Goku: Flow Based Video Generative Foundation Models
CVPR 2025
arXiv
54
citations
Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation
ECCV 2022
arXiv
40
citations
Generative Region-Language Pretraining for Open-Ended Object Detection
CVPR 2024
arXiv
27
citations
Segment Every Reference Object in Spatial and Temporal Spaces
ICCV 2023
arXiv
27
citations
Rethinking Resolution in the Context of Efficient Video Recognition
NeurIPS 2022
arXiv
16
citations
EGC: Image Generation and Classification via a Diffusion Energy-Based Model
ICCV 2023
arXiv
13
citations
InstMove: Instance Motion for Object-Centric Video Segmentation
CVPR 2023
arXiv
8
citations
Enhancing Adversarial Transferability with Adversarial Weight Tuning
AAAI 2025
arXiv
8
citations
Exploring Transformers for Open-world Instance Segmentation
ICCV 2023
arXiv
7
citations
InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation
NeurIPS 2025
5
citations
SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World
ICCV 2025
arXiv
3
citations
A Unified Environmental Network for Pedestrian Trajectory Prediction
AAAI 2024
0
citations