"spatio-temporal representation" Papers
4 papers found
Conference
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
Miran Heo, Min-Hung Chen, De-An Huang et al.
CVPR 2025arXiv:2501.08326
9
citations
Understanding Emotional Body Expressions via Large Language Models
Haifeng Lu, Jiuyi Chen, Feng Liang et al.
AAAI 2025paperarXiv:2412.12581
11
citations
VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Tianxiong Zhong, Xingye Tian, Boyuan Jiang et al.
NEURIPS 2025oralarXiv:2505.12053
3
citations
Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos
Shankhanil Mitra, Rajiv Soundararajan
AAAI 2024paperarXiv:2312.15425
9
citations