"ai safety risks" Papers
2 papers found
Conference
AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement
J Rosser, Jakob Foerster
NEURIPS 2025spotlightarXiv:2502.00757
6
citations
Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)
Liwei Jiang, Yuanjun Chai, Margaret Li et al.
NEURIPS 2025oralarXiv:2510.22954
16
citations