"decoder-only transformer" Papers
6 papers found
Auditing Prompt Caching in Language Model APIs
Chenchen Gu, Xiang Li, Rohith Kuditipudi et al.
ICML 2025, arXiv:2502.07776
7 citations
EAReranker: Efficient Embedding Adequacy Assessment for Retrieval Augmented Generation
Dongyang Zeng, Yaping Liu, Wei Zhang et al.
NeurIPS 2025
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
Xiaoming Shi, Shiyu Wang, Yuqi Nie et al.
ICLR 2025, arXiv:2409.16040
194 citations
Denoising Autoregressive Representation Learning
Yazhe Li, Jorg Bornschein, Ting Chen
ICML 2024, arXiv:2403.05196
7 citations
StableMask: Refining Causal Masking in Decoder-only Transformer
Qingyu Yin, Xuzheng He, Xiang Zhuang et al.
ICML 2024, arXiv:2402.04779
20 citations
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk, Lijun Yu, Xiuye Gu et al.
ICML 2024, arXiv:2312.14125
420 citations