α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Jun Zhang
Jun Zhang
1
Affiliations
Affiliations
Zhejiang University
29
papers
1,044
total citations
papers (29)
Generalized Relation Modeling for Transformer Tracking
CVPR 2023
arXiv
203
citations
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
NEURIPS 2022
arXiv
143
citations
Generalized Predictive Model for Autonomous Driving
CVPR 2024
arXiv
128
citations
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Decoupled Video Diffusion
ICCV 2025
89
citations
GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering
ECCV 2020
arXiv
78
citations
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
AAAI 2025
arXiv
64
citations
Training-Free Long-Context Scaling of Large Language Models
ICML 2024
arXiv
60
citations
Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians
ECCV 2020
arXiv
59
citations
DReS-FL: Dropout-Resilient Secure Federated Learning for Non-IID Clients via Secret Data Sharing
NEURIPS 2022
arXiv
53
citations
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval
NEURIPS 2022
arXiv
38
citations
Boosting Neural Representations for Videos with a Conditional Decoder
CVPR 2024
arXiv
29
citations
MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes
ICCV 2025
arXiv
24
citations
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval With Partial Query
ICCV 2021
arXiv
17
citations
Multi-dataset Training of Transformers for Robust Action Recognition
NEURIPS 2022
arXiv
14
citations
Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning
ICML 2024
arXiv
10
citations
Task-Aware Encoder Control for Deep Video Compression
CVPR 2024
arXiv
8
citations
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
ICCV 2025
arXiv
8
citations
FinMMR: Make Financial Numerical Reasoning More Multimodal, Comprehensive, and Challenging
ICCV 2025
arXiv
6
citations
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
AAAI 2025
arXiv
5
citations
FloE: On-the-Fly MoE Inference on Memory-constrained GPU
ICML 2025
arXiv
5
citations
Semi-Supervised Clustering Framework for Fine-grained Scene Graph Generation
AAAI 2025
2
citations
Learn How to Query from Unlabeled Data Streams in Federated Learning
AAAI 2025
arXiv
1
citations
Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification
CVPR 2021
0
citations
Node-Aligned Graph Convolutional Network for Whole-Slide Image Representation and Classification
CVPR 2022
0
citations
Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning With Deep Graph Convolution
CVPR 2020
0
citations
SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification
NEURIPS 2022
0
citations
On the Convergence of an Adaptive Momentum Method for Adversarial Attacks
AAAI 2024
0
citations
TransLoc4D: Transformer-based 4D Radar Place Recognition
CVPR 2024
0
citations
Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition
ICCV 2021
0
citations