α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Hang Hua
Hang Hua
1
Affiliations
Affiliations
University of Rochester
8
papers
140
total citations
papers (8)
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
AAAI 2025
arXiv
50
citations
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
AAAI 2025
arXiv
25
citations
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
CVPR 2025
arXiv
18
citations
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
CVPR 2025
arXiv
16
citations
FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction
ECCV 2024
arXiv
14
citations
Latent Chain-of-Thought for Visual Reasoning
NEURIPS 2025
arXiv
13
citations
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
NEURIPS 2025
arXiv
4
citations
PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3
ICCV 2023
0
citations