Xi Wang
26 papers, 340 total citations
Papers (26)
Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning
CVPR 2022, arXiv, 70 citations

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
CVPR 2025, arXiv, 42 citations

Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation
NeurIPS 2022, arXiv, 41 citations

WANDR: Intention-guided Human Motion Generation
CVPR 2024, arXiv, 29 citations

PALM: Predicting Actions through Language Models
ECCV 2024, arXiv, 23 citations

Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation
CVPR 2024, arXiv, 22 citations

GazeNeRF: 3D-Aware Gaze Redirection With Neural Radiance Fields
CVPR 2023, arXiv, 20 citations

What Do You See in Vehicle? Comprehensive Vision Solution for In-Vehicle Gaze Estimation
CVPR 2024, arXiv, 18 citations

JAWS: Just a Wild Shot for Cinematic Transfer in Neural Radiance Fields
CVPR 2023, arXiv, 14 citations

AKiRa: Augmentation Kit on Rays for Optical Video Generation
CVPR 2025, arXiv, 12 citations

Real Appearance Modeling for More General Deepfake Detection
ECCV 2024, 12 citations

StateSpaceDiffuser: Bringing Long Context to Diffusion World Models
NeurIPS 2025, arXiv, 8 citations

SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering
AAAI 2025, arXiv, 8 citations

Exploration-Driven Generative Interactive Environments
CVPR 2025, arXiv, 5 citations

LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model
CVPR 2025, arXiv, 4 citations

Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description
ICCV 2025, arXiv, 4 citations

Leveraging Driver Field-of-View for Multimodal Ego-Trajectory Prediction
ICLR 2025, arXiv, 3 citations

Scale-invariant attention
NeurIPS 2025, arXiv, 2 citations

Self-Supervised 3D Hand Pose Estimation From Monocular RGB via Contrastive Learning
ICCV 2021, arXiv, 2 citations

Understanding Museum Exhibits using Vision-Language Reasoning
ICCV 2025, arXiv, 1 citation

Learning Dual Priors for JPEG Compression Artifacts Removal
ICCV 2021, 0 citations

Long-Tail Class Incremental Learning via Independent Sub-prototype Construction
CVPR 2024, 0 citations

JPEG Artifacts Removal via Contrastive Representation Learning
ECCV 2022, 0 citations

No-regret Online Learning over Riemannian Manifolds
NeurIPS 2021, 0 citations

DCTMamba: Advancing JPEG Image Restoration Through Long-Sequence Modeling and Adaptive Frequency Strategy
AAAI 2025, 0 citations

Distributed Online Convex Optimization with Compressed Communication
NeurIPS 2022, 0 citations