Poster "sequence compression" Papers
2 papers found
Conference
Building, Reusing, and Generalizing Abstract Representations from Concrete Sequences
Shuchen Wu, Mirko Thalmann, Peter Dayan et al.
ICLR 2025, arXiv:2410.21332
1 citation
Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models
Pit Neitemeier, Björn Deiseroth, Constantin Eichenberg et al.
ICLR 2025, arXiv:2501.10322
13 citations