ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Hang Hua

Hang Hua

1

Affiliations

Affiliations

University of Rochester

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 15, 2026, 3:13 AM AMS

8

papers

140

total citations

papers (8)

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning

Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity

VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Latent Chain-of-Thought for Visual Reasoning

NEURIPS 2025arXiv

MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness

NEURIPS 2025arXiv

PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3