"model architecture design" Papers
3 papers found
Conference
DeltaFormer: Unlock the state space of Transformer
Mingyu Xu, Tenglong Ao, Jiaao He et al.
NEURIPS 2025
Goku: Flow Based Video Generative Foundation Models
Shoufa Chen, Chongjian GE, Yuqi Zhang et al.
CVPR 2025highlightarXiv:2502.04896
54
citations
Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
Wang Yang, Zirui Liu, Hongye Jin et al.
NEURIPS 2025arXiv:2505.17315
3
citations