Oral "temporal modeling" Papers
8 papers found
Conference
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
Wenhao Chai, Enxin Song, Yilun Du et al.
ICLR 2025oralarXiv:2410.03051
105
citations
CSV-Occ: Fusing Multi-frame Alignment for Occupancy Prediction with Temporal Cross State Space Model and Central Voting Mechanism
Ziming Zhu, Yu Zhu, Jiahao Chen et al.
ICML 2025oral
Dual-Path Temporal Decoder for End-to-End Multi-Object Tracking
Hyunseop Kim, Juheon Jeong, Hanul Kim et al.
NEURIPS 2025oral
FLAME: Fast Long-context Adaptive Memory for Event-based Vision
Biswadeep Chakraborty, Saibal Mukhopadhyay
NEURIPS 2025oral
Kronecker Mask and Interpretive Prompts are Language-Action Video Learners
Jingyi Yang, Zitong YU, Nixiuming et al.
ICLR 2025oralarXiv:2502.03549
3
citations
STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking
Sicheng Shen, Dongcheng Zhao, Linghao Feng et al.
NEURIPS 2025oralarXiv:2505.11151
3
citations
Unhackable Temporal Reward for Scalable Video MLLMs
En Yu, Kangheng Lin, Liang Zhao et al.
ICLR 2025oralarXiv:2502.12081
22
citations
Video-R1: Reinforcing Video Reasoning in MLLMs
Kaituo Feng, Kaixiong Gong, Bohao Li et al.
NEURIPS 2025oralarXiv:2503.21776
257
citations