α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Paul Hongsuck Seo
Paul Hongsuck Seo
14
papers
1,053
total citations
papers (14)
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
CVPR 2023
arXiv
332
citations
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
CVPR 2024
arXiv
193
citations
End-to-End Generative Pretraining for Multimodal Video Captioning
CVPR 2022
arXiv
187
citations
Learning Audio-Video Modalities from Image Captions
ECCV 2022
arXiv
96
citations
Zero-Shot Referring Image Segmentation With Global-Local Context Features
CVPR 2023
arXiv
82
citations
Look Before You Speak: Visually Contextualized Utterances
CVPR 2021
arXiv
71
citations
Learning Correlation Structures for Vision Transformers
CVPR 2024
arXiv
27
citations
AVFormer: Injecting Vision Into Frozen Speech Models for Zero-Shot AV-ASR
CVPR 2023
arXiv
25
citations
IFSeg: Image-Free Semantic Segmentation via Vision-Language Model
CVPR 2023
arXiv
20
citations
Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation
ECCV 2024
arXiv
12
citations
Seg4Diff: Unveiling Open-Vocabulary Semantic Segmentation in Text-to-Image Diffusion Transformers
NEURIPS 2025
6
citations
Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression
CVPR 2025
arXiv
1
citations
DialNav: Multi-turn Dialog Navigation with a Remote Guide
ICCV 2025
arXiv
1
citations
Multi-Granularity Video Object Segmentation
AAAI 2025
arXiv
0
citations