Poster "post-hoc explanation" Papers
2 papers found
Conference
Efficient and Accurate Explanation Estimation with Distribution Compression
Hubert Baniecki, Giuseppe Casalicchio, Bernd Bischl et al.
ICLR 2025arXiv:2406.18334
4
citations
Explainable Reinforcement Learning from Human Feedback to Improve Alignment
Shicheng Liu, Siyuan Xu, Wenjie Qiu et al.
NEURIPS 2025arXiv:2512.13837