α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Shou
Shou
15
papers
851
total citations
papers (15)
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
ICLR 2025
arXiv
484
citations
Drag Anything: Motion Control for Anything using Entity Representation
ECCV 2024
121
citations
Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification
ECCV 2024
arXiv
81
citations
WMAdapter: Adding WaterMark Control to Latent Diffusion Models
ICML 2025
arXiv
38
citations
Image Watermarks are Removable using Controllable Regeneration from Clean Noise
ICLR 2025
arXiv
30
citations
SparseFormer: Sparse Visual Recognition via Limited Latent Tokens
ICLR 2024
arXiv
17
citations
Parrot Captions Teach CLIP to Spot Text
ECCV 2024
arXiv
14
citations
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
NEURIPS 2025
arXiv
13
citations
GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator
ECCV 2024
arXiv
13
citations
Learning Video Context as Interleaved Multimodal Sequences
ECCV 2024
arXiv
12
citations
DOTA: Distributional Test-time Adaptation of Vision-Language Models
NEURIPS 2025
arXiv
11
citations
Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach
ICLR 2025
arXiv
6
citations
macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
NEURIPS 2025
arXiv
5
citations
Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
NEURIPS 2025
arXiv
5
citations
PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer
NEURIPS 2025
arXiv
1
citations