ResearchAlpha Leak

Rui Shao

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 14, 2026, 11:22 PM AMS

12 papers | 428 total citations

Papers (12)

Detecting and Grounding Multi-Modal Media Manipulation

LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

Detecting and Recovering Sequential DeepFake Manipulation

CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios

Open-set Adversarial Defense

LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant

Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy

Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation

FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers

RoboMP$^2$: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models

Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation

Less is More: Empowering GUI Agent with Context-Aware Simplification