Papers by Hinrich Schuetze
3 papers found
Conference
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Clemencia Siro, Guy Gur-Ari, Gaurav Mishra et al.
ICLR 2025 (oral), arXiv:2206.04615
2226 citations
NoLiMa: Long-Context Evaluation Beyond Literal Matching
Ali Modarressi, Hanieh Deilamsalehy, Franck Dernoncourt et al.
ICML 2025, arXiv:2502.05167
57 citations
Refusal Direction is Universal Across Safety-Aligned Languages
Xinpeng Wang, Mingyang Wang, Yihong Liu et al.
NeurIPS 2025, arXiv:2505.17306
5 citations