Sanmi Koyejo
16 papers, 1,430 total citations
Papers (16)
Are Emergent Abilities of Large Language Models a Mirage? (NeurIPS 2023, arXiv): 585 citations
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models (NeurIPS 2023, arXiv): 571 citations
Diagnosing failures of fairness transfer across distribution shift in real-world medical settings (NeurIPS 2022, arXiv): 70 citations
CSER: Communication-efficient SGD with Error Reset (NeurIPS 2020, arXiv): 44 citations
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? (ICML 2025, arXiv): 35 citations
Self-Supervised Learning of Representations for Space Generates Multi-Modular Grid Cells (NeurIPS 2023, arXiv): 34 citations
Transforming and Combining Rewards for Aligning Large Language Models (ICML 2024, arXiv): 26 citations
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models (ICLR 2025, arXiv): 24 citations
Fair Performance Metric Elicitation (NeurIPS 2020, arXiv): 19 citations
A Reduction to Binary Approach for Debiasing Multiclass Datasets (NeurIPS 2022, arXiv): 11 citations
Fair Wrapping for Black-box Predictions (NeurIPS 2022, arXiv): 7 citations
Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy and Research (NeurIPS 2025, arXiv): 2 citations
Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF (COLM 2025, arXiv): 2 citations
CoPur: Certifiably Robust Collaborative Inference via Feature Purification (NeurIPS 2022): 0 citations
Implicit Regularization in Feedback Alignment Learning Mechanisms for Neural Networks (ICML 2024, arXiv): 0 citations
Fairness with Overlapping Groups; a Probabilistic Perspective (NeurIPS 2020): 0 citations