Jingdong Chen
23
papers
472
total citations
papers (23)
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery
CVPR 2024
arXiv
244
citations
Animate-X: Universal Character Image Animation with Enhanced Motion Representation
ICLR 2025
arXiv
64
citations
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
CVPR 2022
arXiv
33
citations
Hierarchical Memory Learning for Fine-Grained Scene Graph Generation
ECCV 2022
arXiv
26
citations
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models
ECCV 2024
arXiv
23
citations
Mimir: Improving Video Diffusion Models for Precise Text Understanding
CVPR 2025
arXiv
16
citations
When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
ICCV 2025
arXiv
14
citations
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
CVPR 2025
arXiv
14
citations
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
CVPR 2024
arXiv
12
citations
VideoMAR: Autoregressive Video Generation with Continuous Tokens
NeurIPS 2025
8
citations
SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing
ICCV 2025
arXiv
7
citations
EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching
ECCV 2024
4
citations
Reversing Flow for Image Restoration
CVPR 2025
arXiv
2
citations
Towards Better Vision-Inspired Vision-Language Models
CVPR 2024
2
citations
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation
AAAI 2025
2
citations
VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations
NeurIPS 2025
arXiv
1
citation
Uncertainty-guided Learning for Improving Image Manipulation Detection
ICCV 2023
0
citations
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
CVPR 2025
0
citations
LPSNet: A Lightweight Solution for Fast Panoptic Segmentation
CVPR 2021
0
citations
Variational Connectionist Temporal Classification
ECCV 2020
0
citations
Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer
CVPR 2022
0
citations
CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance
ICCV 2025
arXiv
0
citations
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation
CVPR 2023
0
citations