α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Zongxin Yang
Zongxin Yang
22
papers
1,613
total citations
papers (22)
Associating Objects with Transformers for Video Object Segmentation
NEURIPS 2021
arXiv
353
citations
Collaborative Video Object Segmentation by Foreground-Background Integration
ECCV 2020
arXiv
280
citations
Gated Channel Transformation for Visual Recognition
CVPR 2020
arXiv
267
citations
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
NEURIPS 2022
arXiv
200
citations
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
ICCV 2023
arXiv
95
citations
SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction
CVPR 2024
arXiv
77
citations
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
ICML 2024
arXiv
64
citations
Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation
CVPR 2023
arXiv
61
citations
Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction
NEURIPS 2023
arXiv
52
citations
DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency
CVPR 2021
arXiv
47
citations
TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering
ICCV 2023
arXiv
28
citations
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
CVPR 2025
arXiv
20
citations
JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery
ICCV 2023
arXiv
20
citations
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation
ICCV 2023
arXiv
18
citations
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
ICCV 2025
arXiv
18
citations
Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation
ECCV 2022
arXiv
12
citations
3DIS: Depth-Driven Decoupled Image Synthesis for Universal Multi-Instance Generation
ICLR 2025
1
citations
Few-Shot Incremental Learning via Foreground Aggregation and Knowledge Transfer for Audio-Visual Semantic Segmentation
AAAI 2025
0
citations
ProD: Prompting-To-Disentangle Domain Knowledge for Cross-Domain Few-Shot Image Classification
CVPR 2023
0
citations
FedSeg: Class-Heterogeneous Federated Learning for Semantic Segmentation
CVPR 2023
0
citations
H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-Domain Weakly Supervised Object Detection
CVPR 2022
0
citations
SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons
CVPR 2025
0
citations