Rui Shao
12 papers, 428 total citations
Papers (12)
Detecting and Grounding Multi-Modal Media Manipulation (CVPR 2023), 112 citations
LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge (CVPR 2024), 84 citations
Detecting and Recovering Sequential DeepFake Manipulation (ECCV 2022), 59 citations
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios (ECCV 2024), 50 citations
Open-set Adversarial Defense (ECCV 2020), 37 citations
LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant (CVPR 2025), 33 citations
Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy (CVPR 2025), 22 citations
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation (CVPR 2025), 13 citations
FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers (ICCV 2025), 11 citations
RoboMP$^2$: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models (ICML 2024), 4 citations
Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation (ICCV 2025), 3 citations
Less is More: Empowering GUI Agent with Context-Aware Simplification (ICCV 2025), 0 citations