Poster "safety-critical tasks" Papers
3 papers found
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?
Egor Zverev, Sahar Abdelnabi, Soroush Tabesh et al.
ICLR 2025, arXiv:2403.06833
48 citations
Enforcing Hard Linear Constraints in Deep Learning Models with Decision Rules
Gonzalo E. Constante, Hao Chen, Can Li
NeurIPS 2025, arXiv:2505.13858
5 citations
Fat-to-Thin Policy Optimization: Offline Reinforcement Learning with Sparse Policies
Lingwei Zhu, Han Wang, Yukie Nagai
ICLR 2025