α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Qin Jin
Qin Jin
16
papers
1,225
total citations
papers (16)
Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning
CVPR 2020
arXiv
361
citations
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
CVPR 2023
arXiv
259
citations
Say As You Wish: Fine-Grained Control of Image Caption Generation With Abstract Scene Graphs
CVPR 2020
arXiv
242
citations
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
ECCV 2022
arXiv
172
citations
Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding
NEURIPS 2025
arXiv
49
citations
WritingBench: A Comprehensive Benchmark for Generative Writing
NEURIPS 2025
arXiv
46
citations
Towards Diverse Paragraph Captioning for Untrimmed Videos
CVPR 2021
arXiv
42
citations
Unifying Event Detection and Captioning as Sequence Generation via Pre-training
ECCV 2022
arXiv
32
citations
Better Captioning With Sequence-Level Exploration
CVPR 2020
arXiv
12
citations
Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation
NEURIPS 2023
arXiv
7
citations
Explore and Tell: Embodied Visual Captioning in 3D Environments
ICCV 2023
arXiv
3
citations
Open-Category Human-Object Interaction Pre-Training via Language Modeling Framework
CVPR 2023
0
citations
VRDFormer: End-to-End Video Visual Relation Detection With Transformers
CVPR 2022
0
citations
MotionCtrl: A Real-time Controllable Vision-Language-Motion Model
ICCV 2025
0
citations
Multi-Lingual Acquisition on Multimodal Pre-training for Cross-modal Retrieval
NEURIPS 2022
0
citations
Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning
ECCV 2022
0
citations