Jing Wang
29 papers, 713 total citations
papers (29)
From Two to One: A New Scene Text Recognizer With Visual Language Modeling Network (ICCV 2021, arXiv), 177 citations
Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation (CVPR 2021, arXiv), 165 citations
Learning To Filter: Siamese Relation Network for Robust Tracking (CVPR 2021, arXiv), 102 citations
AlphaVC: High-Performance and Efficient Learned Video Compression (ECCV 2022, arXiv), 57 citations
Scene Text Retrieval via Joint Text Detection and Similarity Learning (CVPR 2021, arXiv), 43 citations
WISA: World simulator assistant for physics-aware text-to-video generation (NEURIPS 2025, arXiv), 35 citations
Adaptive FSS: A Novel Few-Shot Segmentation Framework via Prototype Enhancement (AAAI 2024, arXiv), 21 citations
Content-Oriented Learned Image Compression (ECCV 2022, arXiv), 20 citations
SURER: Structure-Adaptive Unified Graph Neural Network for Multi-View Clustering (AAAI 2024), 16 citations
Online Video Understanding: OVBench and VideoChat-Online (CVPR 2025, arXiv), 12 citations
Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation (ECCV 2022, arXiv), 12 citations
WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models (CVPR 2025, arXiv), 10 citations
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models (CVPR 2025, arXiv), 9 citations
SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering (AAAI 2025, arXiv), 8 citations
What Has Been Overlooked in Contrastive Source-Free Domain Adaptation: Leveraging Source-Informed Latent Augmentation within Neighborhood Context (ICLR 2025, arXiv), 7 citations
Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation (ICCV 2025, arXiv), 6 citations
StreamForest: Efficient Online Video Understanding with Persistent Event Memory (NEURIPS 2025, arXiv), 6 citations
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution (CVPR 2025, arXiv), 4 citations
AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering (CVPR 2025), 1 citation
Exploring Active Learning in Meta-Learning: Enhancing Context Set Labeling (ECCV 2024, arXiv), 1 citation
CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework (AAAI 2025, arXiv), 1 citation
Detecting Tampered Scene Text in the Wild (ECCV 2022), 0 citations
Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship (CVPR 2021), 0 citations
Prior-aware Dynamic Temporal Modeling Framework for Sequential 3D Hand Pose Estimation (ICCV 2025), 0 citations
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation (CVPR 2025, arXiv), 0 citations
MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding (ICCV 2025, arXiv), 0 citations
Handling Heterogeneous Curvatures in Bandit LQR Control (ICML 2024), 0 citations
SAMPLE: Semantic Alignment through Temporal-Adaptive Multimodal Prompt Learning for Event-Based Open-Vocabulary Action Recognition (ICCV 2025), 0 citations
Learning with Adaptive Resource Allocation (ICML 2024), 0 citations