α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Maarten Sap
Maarten Sap
1
Affiliations
Affiliations
Carnegie Mellon University, Allen Institute for AI
14
papers
3,006
total citations
papers (14)
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
ICLR 2025
arXiv
2,226
citations
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
ICLR 2024
arXiv
239
citations
Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory
ICLR 2024
arXiv
166
citations
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment
NEURIPS 2022
arXiv
118
citations
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
AAAI 2024
arXiv
93
citations
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
ICML 2025
arXiv
45
citations
AutoPresent: Designing Structured Visuals from Scratch
CVPR 2025
arXiv
28
citations
PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages
COLM 2025
arXiv
19
citations
Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)
NEURIPS 2025
arXiv
16
citations
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models
ICLR 2024
arXiv
15
citations
ALFA: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning
COLM 2025
arXiv
15
citations
Fluid Language Model Benchmarking
COLM 2025
arXiv
10
citations
The Delta Learning Hypothesis: Preference Tuning on Weak Data can Yield Strong Gains
COLM 2025
arXiv
8
citations
SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior
ICML 2025
arXiv
8
citations