α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Sheng Jin
Sheng Jin
24
papers
1,392
total citations
papers (24)
Whole-Body Human Pose Estimation in the Wild
ECCV 2020
arXiv
308
citations
Not All Tokens Are Equal: Human-Centric Visual Analysis via Token Clustering Transformer
CVPR 2022
arXiv
167
citations
Aligning Bag of Regions for Open-Vocabulary Object Detection
CVPR 2023
arXiv
156
citations
Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation
ECCV 2020
arXiv
139
citations
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
ICLR 2024
arXiv
110
citations
When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks
CVPR 2021
arXiv
67
citations
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
ICCV 2021
arXiv
64
citations
ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search
CVPR 2021
arXiv
59
citations
3D Interacting Hand Pose Estimation by Hand De-Occlusion and Removal
ECCV 2022
arXiv
54
citations
Pose for Everything: Towards Category-Agnostic Pose Estimation
ECCV 2022
arXiv
53
citations
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
ICCV 2025
arXiv
37
citations
PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation
ECCV 2022
arXiv
29
citations
Domain Generalization via Balancing Training Difficulty and Model Capability
ICCV 2023
arXiv
26
citations
CLIM: Contrastive Language-Image Mosaic for Region Representation
AAAI 2024
arXiv
25
citations
F-LMM: Grounding Frozen Large Multimodal Models
CVPR 2025
arXiv
22
citations
Uncertainty-aware Unsupervised Multi-Object Tracking
ICCV 2023
arXiv
18
citations
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
AAAI 2025
arXiv
15
citations
Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer
AAAI 2025
arXiv
15
citations
Weakly Supervised Monocular 3D Detection with a Single-View Image
CVPR 2024
arXiv
12
citations
Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions
NEURIPS 2023
arXiv
7
citations
UniFS: Universal Few-shot Instance Perception with Point Representations
ECCV 2024
arXiv
4
citations
NADER: Neural Architecture Design via Multi-Agent Collaboration
CVPR 2025
arXiv
3
citations
Unsupervised Continual Domain Shift Learning with Multi-Prototype Modeling
CVPR 2025
2
citations
When Counterpoint Meets Chinese Folk Melodies
NEURIPS 2020
0
citations