Rada Mihalcea
3 papers, 454 total citations
Papers (3)
Can Large Language Models Infer Causation from Correlation? ICLR 2024 (arXiv). 171 citations.
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ICML 2024 (arXiv). 165 citations.
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment. NeurIPS 2022 (arXiv). 118 citations.