Nika Haghtalab

1,755 total citations

papers (12)

Calibrated Stackelberg Games: Learning Optimal Commitments Against Calibrated Agents

NEURIPS 2023 (arXiv)

A Unifying Perspective on Multi-Calibration: Game Dynamics for Multi-Objective Learning

NEURIPS 2023 (arXiv)

Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition

NEURIPS 2023 (arXiv)

Smoothed Analysis of Sequential Probability Assignment

NEURIPS 2023 (arXiv)

From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning

NEURIPS 2025 (arXiv)

Learning With Multi-Group Guarantees For Clusterable Subpopulations

ICML 2025 (arXiv)

Oracle-Efficient Online Learning for Smoothed Adversaries

NEURIPS 2022


Sample-Adaptivity Tradeoff in On-Demand Sampling

NEURIPS 2025 (arXiv)

Jailbroken: How Does LLM Safety Training Fail?

Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation

Smoothed Analysis of Online and Differentially Private Learning

On-Demand Sampling: Learning Optimally from Multiple Distributions