Oral "temporal consistency" Papers

20 papers found

3D StreetUnveiler with Semantic-aware 2DGS - a simple baseline

Jingwei Xu, Yikai Wang, Yiqun Zhao et al.

ICLR 2025, oral, arXiv:2405.18416
4 citations

ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation

Zongyi Li, Shujie HU, Shujie LIU et al.

ICLR 2025, oral, arXiv:2410.20502
28 citations

A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking

Zixiang Zhao, Haowen Bai, Bingxin Ke et al.

NeurIPS 2025, oral, arXiv:2505.19858
7 citations

Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Han Lin, Jaemin Cho, Abhay Zala et al.

ICLR 2025, oral, arXiv:2404.09967
50 citations

Depth Any Video with Scalable Synthetic Data

Honghui Yang, Di Huang, Wei Yin et al.

ICLR 2025, oral, arXiv:2410.10815
46 citations

Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models

Zeyu Yang, Zijie Pan, Chun Gu et al.

ICLR 2025, oral, arXiv:2404.02148
20 citations

EG4D: Explicit Generation of 4D Object without Score Distillation

Qi Sun, Zhiyang Guo, Ziyu Wan et al.

ICLR 2025, oral, arXiv:2405.18132
40 citations

Emergent Temporal Correspondences from Video Diffusion Transformers

Jisu Nam, Soowon Son, Dahyun Chung et al.

NeurIPS 2025, oral, arXiv:2506.17220
11 citations

FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation

Ariel Shaulov, Itay Hazan, Lior Wolf et al.

NeurIPS 2025, oral, arXiv:2506.01144
8 citations

Glad: A Streaming Scene Generator for Autonomous Driving

Bin Xie, Yingfei Liu, Tiancai Wang et al.

ICLR 2025, oral, arXiv:2503.00045
11 citations

Image as a World: Generating Interactive World from Single Image via Panoramic Video Generation

Dongnan Gui, Xun Guo, Wengang Zhou et al.

NeurIPS 2025, oral
1 citation

Incremental Sequence Classification with Temporal Consistency

Lucas Maystre, Gabriel Barello, Tudor Berariu et al.

NeurIPS 2025, oral, arXiv:2505.16548

Infinite-Resolution Integral Noise Warping for Diffusion Models

Yitong Deng, Winnie Lin, Lingxiao Li et al.

ICLR 2025, oral, arXiv:2411.01212
4 citations

PPMStereo: Pick-and-Play Memory Construction for Consistent Dynamic Stereo Matching

WANG Yun, Qiaole Dong, Yongjian Zhang et al.

NeurIPS 2025, oral, arXiv:2510.20178

Rationalizing and Augmenting Dynamic Graph Neural Networks

Guibin Zhang, Yiyan Qi, Ziyang Cheng et al.

ICLR 2025, oral

ReCon-GS: Continuum-Preserved Gaussian Streaming for Fast and Compact Reconstruction of Dynamic Scenes

Jiaye Fu, Qiankun Gao, Chengxiang Wen et al.

NeurIPS 2025, oral

ST$^2$360D: Spatial-to-Temporal Consistency for Training-free 360 Monocular Depth Estimation

Zidong Cao, Jinjing Zhu, Hao Ai et al.

NeurIPS 2025, oral

WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception

Zhiheng Liu, Xueqing Deng, Shoufa Chen et al.

NeurIPS 2025, oral, arXiv:2508.15720
5 citations

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

Jianxiong Li, Jinliang Zheng, Yinan Zheng et al.

ICML 2024, oral, arXiv:2402.18137
14 citations

Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices

Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner et al.

ICML 2024, oral, arXiv:2405.12211
36 citations