ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Jingdong Chen

Jingdong Chen

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 14, 2026, 9:46 PM AMS

23

papers

472

total citations

papers (23)

SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization

Hierarchical Memory Learning for Fine-Grained Scene Graph Generation

StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models

Mimir: Improving Video Diffusion Models for Precise Text Understanding

When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

VideoMAR: Autoregressive Video Generation with Continuous Tokens

SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing

EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching

Reversing Flow for Image Restoration

Towards Better Vision-Inspired Vision-Language Models

HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation

VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations

NEURIPS 2025arXiv

Uncertainty-guided Learning for Improving Image Manipulation Detection

SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling

LPSNet: A Lightweight Solution for Fast Panoptic Segmentation

Variational Connectionist Temporal Classification

Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer

CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance

Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation