"decoder-only transformers" Papers
5 papers found
Conference
Mimir: Improving Video Diffusion Models for Precise Text Understanding
Shuai Tan, Biao Gong, Yutong Feng et al.
CVPR 2025arXiv:2412.03085
16
citations
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Ziqi Pang, Tianyuan Zhang, Fujun Luan et al.
CVPR 2025arXiv:2412.01827
63
citations
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Md Rifat Arefin, Gopeshh Raaj Subbaraj, Nicolas Gontier et al.
ICLR 2025arXiv:2411.02344
5
citations
Towards Neural Scaling Laws for Time Series Foundation Models
Qingren Yao, Chao-Han Huck Yang, Renhe Jiang et al.
ICLR 2025arXiv:2410.12360
26
citations
Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate Predictions
Sanjay Kariyappa, Freddy Lecue, Saumitra Mishra et al.
ICML 2024arXiv:2406.02625
6
citations