Values in the Wild: Discovering and Mapping Values in Real-World Language Model Interactions

31citations

PDF Project

citations

#48

in COLM 2025

of 418 papers

Top Authors

Data Points

Top Authors

Saffron Huang Esin DURMUS Kunal Handa Miles McCain Alex Tamkin Michael Stern Jerry Hong Deep Ganguli

Topics

language models values AI ethics AI values empirical analysis human-AI interaction value alignment privacy-preserving analysis value pluralism AI and society

Abstract

AI assistants interact with millions of real users everyday, imparting normative judgments that can have significant personal and societal impact—but little is known about what values guide these interactions in practice. To address this, we develop a method to empirically analyze values expressed in hundreds of thousands of real-world conversations with Claude models. We empirically discover and taxonomize 3,308 AI values, and study how model values and responses depend on context. We find that Claude expresses many professional and intellectual values, and typically supports prosocial human values while resisting values like "moral nihilism." While some values appear consistently (e.g. "professionalism"), most are highly context-dependent—"harm prevention" emerges when the model resists users, "historical accuracy" when discussing controversial events, "healthy boundaries" in relationship advice, and "human agency" in technology ethics discussions. By providing the first large-scale empirical mapping of AI values in deployment, this work creates a foundation for more grounded evaluation and design of values in increasingly influential AI systems.

Citation History

Feb 12, 2026