α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Mona T. Diab
Mona T. Diab
2
papers
188
total citations
papers (2)
Can Large Language Models Infer Causation from Correlation?
ICLR 2024
arXiv
171
citations
SAEs Can Improve Unlearning: Dynamic Sparse Autoencoder Guardrails for Precision Unlearning in LLMs
COLM 2025
17
citations