"temporal alignment" Papers
17 papers found
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Edward LOO, Tianyu HUANG, Peng Li et al.
CVPR 2025 | highlight | arXiv:2412.03079
59 citations
Aligned Better, Listen Better for Audio-Visual Large Language Models
Yuxin Guo, Shuailei Ma, Shijie Ma et al.
ICLR 2025 | oral | arXiv:2504.02061
9 citations
CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning
Eloy Geenjaar, Lie Lu
AAAI 2025 | paper | arXiv:2412.11695
1 citation
DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception
Chengchang Tian, Jianwei Ma, Yan Huang et al.
ICCV 2025 | arXiv:2507.18237
DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models
Ziyi Wu, Anil Kag, Ivan Skorokhodov et al.
NeurIPS 2025 | oral | arXiv:2506.03517
14 citations
SmokeViz: A Large-Scale Satellite Dataset for Wildfire Smoke Detection and Segmentation
Rey Koki, Michael McCabe, Dhruv Kedar et al.
NeurIPS 2025 | oral
SMTPD: A New Benchmark for Temporal Prediction of Social Media Popularity
Yijie Xu, Bolun Zheng, Wei Zhu et al.
CVPR 2025 | arXiv:2503.04446
3 citations
Synchronization of Multiple Videos
Avihai Naaman, Ron Shapira Weber, Oren Freifeld
ICCV 2025 | arXiv:2510.14051
2 citations
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better
Zihang Lai, Andrea Vedaldi
CVPR 2025 | highlight | arXiv:2503.19904
4 citations
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
Hang Hua, Yunlong Tang, Chenliang Xu et al.
AAAI 2025 | paper | arXiv:2404.12353
50 citations
Efficient and Effective Time-Series Forecasting with Spiking Neural Networks
Changze Lv, Yansen Wang, Dongqi Han et al.
ICML 2024 | oral | arXiv:2402.01533
24 citations
EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
Dachun Kai, Jiayao Lu, Yueyi Zhang et al.
ICML 2024 | oral | arXiv:2406.13457
13 citations
FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition
Ishan Rajendrakumar Dave, Mamshad Nayeem Rizve, Shah Mubarak
ECCV 2024 | arXiv:2409.01448
5 citations
HowToCaption: Prompting LLMs to Transform Video Annotations at Scale
Nina Shvetsova, Anna Kukleva, Xudong Hong et al.
ECCV 2024 | arXiv:2310.04900
33 citations
Multi-Sentence Grounding for Long-term Instructional Video
Zeqian Li, QIRUI CHEN, Tengda Han et al.
ECCV 2024 | arXiv:2312.14055
12 citations
Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs
Camillo Quattrocchi, Antonino Furnari, Daniele Di Mauro et al.
ECCV 2024 | arXiv:2312.02638
18 citations
VideoLLM-online: Online Video Large Language Model for Streaming Video
Joya Chen, Zhaoyang Lv, Shiwei Wu et al.
CVPR 2024 | arXiv:2406.11816
116 citations