α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Wenqi Shao
Wenqi Shao
26
papers
1,053
total citations
papers (26)
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models
ICLR 2024
arXiv
341
citations
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM
CVPR 2024
arXiv
144
citations
GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices
ICCV 2025
arXiv
113
citations
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
ICML 2025
arXiv
76
citations
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers
ICCV 2023
arXiv
75
citations
Beyond One-to-One: Rethinking the Referring Image Segmentation
ICCV 2023
arXiv
72
citations
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation
ICLR 2024
arXiv
50
citations
Not All Models Are Equal: Predicting Model Transferability in a Self-Challenging Fisher Space
ECCV 2022
arXiv
38
citations
Real-Time Controllable Denoising for Image and Video
CVPR 2023
arXiv
24
citations
Foundation Model is Efficient Multimodal Multitask Model Selector
NEURIPS 2023
arXiv
22
citations
OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation
CVPR 2025
arXiv
20
citations
DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation
CVPR 2025
arXiv
12
citations
Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
CVPR 2025
arXiv
11
citations
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
ICCV 2025
arXiv
10
citations
Distilling Monocular Foundation Model for Fine-grained Depth Completion
CVPR 2025
arXiv
9
citations
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation
ICLR 2025
9
citations
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
CVPR 2024
arXiv
7
citations
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
ICCV 2025
arXiv
6
citations
Cached Transformers: Improving Transformers with Differentiable Memory Cached
AAAI 2024
arXiv
5
citations
Cross-Subject Mind Decoding from Inaccurate Representations
ICCV 2025
arXiv
3
citations
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis
NEURIPS 2025
arXiv
3
citations
JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data
CVPR 2025
arXiv
2
citations
Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space
ICCV 2025
arXiv
1
citations
ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity
ICCV 2025
0
citations
Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation
ICCV 2025
arXiv
0
citations
Rethinking the Pruning Criteria for Convolutional Neural Network
NEURIPS 2021
0
citations