Poster "model activations analysis" Papers
2 papers found
On Reasoning Strength Planning in Large Reasoning Models
Leheng Sheng, An Zhang, Zijian Wu et al.
NeurIPS 2025, arXiv:2506.08390
15 citations
Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension
Fan Yin, Jayanth Srinivasa, Kai-Wei Chang
ICML 2024, arXiv:2402.18048
40 citations