α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Xiongkuo Min
Xiongkuo Min
21
papers
948
total citations
papers (21)
Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels
ICML 2024
arXiv
393
citations
Blurry Video Frame Interpolation
CVPR 2020
arXiv
101
citations
End-to-End Human-Gaze-Target Detection With Transformers
CVPR 2022
arXiv
76
citations
MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos
CVPR 2023
arXiv
62
citations
Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop
NEURIPS 2022
arXiv
46
citations
A-Bench: Are LMMs Masters at Evaluating AI-generated Images?
ICLR 2025
arXiv
40
citations
Self-Conditioned Probabilistic Learning of Video Rescaling
ICCV 2021
arXiv
34
citations
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows
ECCV 2022
arXiv
33
citations
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs
CVPR 2025
arXiv
30
citations
FineVQ: Fine-Grained User Generated Content Video Quality Assessment
CVPR 2025
arXiv
26
citations
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM
CVPR 2025
arXiv
25
citations
Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content
CVPR 2025
arXiv
23
citations
Video-based Human-Object Interaction Detection from Tubelet Tokens
NEURIPS 2022
arXiv
19
citations
LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs
ICCV 2025
arXiv
16
citations
Image Quality Assessment: From Human to Machine Preference
CVPR 2025
arXiv
7
citations
Textured Mesh Saliency: Bridging Geometry and Texture for Human Perception in 3D Graphics
AAAI 2025
arXiv
5
citations
Information Density Principle for MLLM Benchmarks
ICCV 2025
arXiv
5
citations
Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes
CVPR 2025
arXiv
3
citations
Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads
ICCV 2025
arXiv
3
citations
FPEM: Face Prior Enhanced Facial Attractiveness Prediction for Live Videos with Face Retouching
ICCV 2025
1
citations
Learning Invisible Markers for Hidden Codes in Offline-to-Online Photography
CVPR 2022
0
citations