Oral "transformer architecture" Papers
15 papers found
Conference
DAWP: A framework for global observation forecasting via Data Assimilation and Weather Prediction in satellite observation space
Junchao Gong, Jingyi Xu, Ben Fei et al.
NEURIPS 2025oralarXiv:2510.15978
InterMask: 3D Human Interaction Generation via Collaborative Masked Modeling
Muhammad Gohar Javed, chuan guo, Li Cheng et al.
ICLR 2025oralarXiv:2410.10010
27
citations
MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition
Hao Zhang, Zhan Zhuang, Xuehao Wang et al.
NEURIPS 2025oralarXiv:2505.20744
3
citations
Non-Markovian Discrete Diffusion with Causal Language Models
Yangtian Zhang, Sizhuang He, Daniel Levine et al.
NEURIPS 2025oralarXiv:2502.09767
1
citations
Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think
Ge Wu, Shen Zhang, Ruijing Shi et al.
NEURIPS 2025oralarXiv:2507.01467
33
citations
SCENT: Robust Spatiotemporal Learning for Continuous Scientific Data via Scalable Conditioned Neural Fields
David K Park, Xihaier Luo, Guang Zhao et al.
ICML 2025oralarXiv:2504.12262
1
citations
STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking
Sicheng Shen, Dongcheng Zhao, Linghao Feng et al.
NEURIPS 2025oralarXiv:2505.11151
3
citations
STORM: Spatio-TempOral Reconstruction Model For Large-Scale Outdoor Scenes
Jiawei Yang, Jiahui Huang, Boris Ivanovic et al.
ICLR 2025oralarXiv:2501.00602
25
citations
The emergence of sparse attention: impact of data distribution and benefits of repetition
Nicolas Zucchet, Francesco D'Angelo, Andrew Lampinen et al.
NEURIPS 2025oralarXiv:2505.17863
7
citations
Towards Provable Emergence of In-Context Reinforcement Learning
Jiuqi Wang, Rohan Chandra, Shangtong Zhang
NEURIPS 2025oralarXiv:2509.18389
1
citations
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
Jiuqi Wang, Ethan Blaser, Hadi Daneshmand et al.
ICLR 2025oralarXiv:2405.13861
15
citations
UniMotion: A Unified Motion Framework for Simulation, Prediction and Planning
Nan Song, Junzhe Jiang, jingyu li et al.
NEURIPS 2025oral
ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data
Carmen Martin-Turrero, Maxence Bouvier, Manuel Breitenstein et al.
ICML 2024oralarXiv:2402.01393
6
citations
Longitudinal Targeted Minimum Loss-based Estimation with Temporal-Difference Heterogeneous Transformer
Toru Shirakawa, Yi Li, Yulun Wu et al.
ICML 2024oralarXiv:2404.04399
7
citations
Translation Equivariant Transformer Neural Processes
Matthew Ashman, Cristiana Diaconu, Junhyuck Kim et al.
ICML 2024oralarXiv:2406.12409
10
citations