"temporal alignment" Papers

17 papers found

Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Edward Loo, Tianyu Huang, Peng Li et al.

CVPR 2025 | highlight | arXiv:2412.03079
59 citations

Aligned Better, Listen Better for Audio-Visual Large Language Models

Yuxin Guo, Shuailei Ma, Shijie Ma et al.

ICLR 2025 | oral | arXiv:2504.02061
9 citations

CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning

Eloy Geenjaar, Lie Lu

AAAI 2025 | paper | arXiv:2412.11695
1 citation

DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception

Chengchang Tian, Jianwei Ma, Yan Huang et al.

ICCV 2025 | arXiv:2507.18237

DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models

Ziyi Wu, Anil Kag, Ivan Skorokhodov et al.

NeurIPS 2025 | oral | arXiv:2506.03517
14 citations

SmokeViz: A Large-Scale Satellite Dataset for Wildfire Smoke Detection and Segmentation

Rey Koki, Michael McCabe, Dhruv Kedar et al.

NeurIPS 2025 | oral

SMTPD: A New Benchmark for Temporal Prediction of Social Media Popularity

Yijie Xu, Bolun Zheng, Wei Zhu et al.

CVPR 2025 | arXiv:2503.04446
3 citations

Synchronization of Multiple Videos

Avihai Naaman, Ron Shapira Weber, Oren Freifeld

ICCV 2025 | arXiv:2510.14051
2 citations

Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better

Zihang Lai, Andrea Vedaldi

CVPR 2025 | highlight | arXiv:2503.19904
4 citations

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning

Hang Hua, Yunlong Tang, Chenliang Xu et al.

AAAI 2025 | paper | arXiv:2404.12353
50 citations

Efficient and Effective Time-Series Forecasting with Spiking Neural Networks

Changze Lv, Yansen Wang, Dongqi Han et al.

ICML 2024 | oral | arXiv:2402.01533
24 citations

EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Dachun Kai, Jiayao Lu, Yueyi Zhang et al.

ICML 2024 | oral | arXiv:2406.13457
13 citations

FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition

Ishan Rajendrakumar Dave, Mamshad Nayeem Rizve, Mubarak Shah

ECCV 2024 | arXiv:2409.01448
5 citations

HowToCaption: Prompting LLMs to Transform Video Annotations at Scale

Nina Shvetsova, Anna Kukleva, Xudong Hong et al.

ECCV 2024 | arXiv:2310.04900
33 citations

Multi-Sentence Grounding for Long-term Instructional Video

Zeqian Li, Qirui Chen, Tengda Han et al.

ECCV 2024 | arXiv:2312.14055
12 citations

Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs

Camillo Quattrocchi, Antonino Furnari, Daniele Di Mauro et al.

ECCV 2024 | arXiv:2312.02638
18 citations

VideoLLM-online: Online Video Large Language Model for Streaming Video

Joya Chen, Zhaoyang Lv, Shiwei Wu et al.

CVPR 2024 | arXiv:2406.11816
116 citations