Universal Guidance for Diffusion Models

399 citations
arXiv:2302.07121
#60 of 2297 papers in ICLR 2024

Abstract

Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining. In this work, we propose a universal guidance algorithm that enables diffusion models to be controlled by arbitrary guidance modalities without the need to retrain any use-specific components. We show that our algorithm successfully generates quality images with guidance functions including segmentation, face recognition, object detection, style guidance and classifier signals.
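The abstract does not spell out the update rule, so the following is only a minimal sketch of one way a frozen diffusion model might be steered by an external guidance function. It assumes the usual noise-prediction parameterization: estimate the clean sample from the noisy one, score that estimate with a guidance loss, and shift the noise prediction along the loss gradient. The names `predict_clean`, `guided_noise`, `loss_grad_wrt_z0`, and `strength` are hypothetical. The quadratic loss is a toy stand-in for a real guidance network such as a segmenter or face recognizer. Treating the denoiser output as a constant when differentiating is a further simplifying assumption, not a claim about the authors' implementation.

```python
import numpy as np

def predict_clean(z_t, eps, alpha_bar_t):
    """Estimate the clean sample z0 from noisy z_t and a noise prediction eps."""
    return (z_t - np.sqrt(1.0 - alpha_bar_t) * eps) / np.sqrt(alpha_bar_t)

def guided_noise(z_t, eps, alpha_bar_t, loss_grad_wrt_z0, strength):
    """Shift the noise prediction along the gradient of a guidance loss.

    loss_grad_wrt_z0 is a hypothetical callable returning d(loss)/d(z0_hat).
    Assumption: eps is held constant, so d(z0_hat)/d(z_t) = 1/sqrt(alpha_bar_t).
    """
    z0_hat = predict_clean(z_t, eps, alpha_bar_t)
    grad_z_t = loss_grad_wrt_z0(z0_hat) / np.sqrt(alpha_bar_t)
    return eps + strength * np.sqrt(1.0 - alpha_bar_t) * grad_z_t

# Toy guidance: pull the clean estimate toward a fixed target vector.
# The loss 0.5 * ||z0 - target||^2 has gradient (z0 - target).
rng = np.random.default_rng(0)
target = np.ones(4)
z_t = rng.normal(size=4)   # stand-in for a noisy latent
eps = rng.normal(size=4)   # stand-in for the frozen model's noise prediction
alpha_bar_t = 0.5

eps_guided = guided_noise(z_t, eps, alpha_bar_t, lambda z0: z0 - target, strength=0.5)

before = np.linalg.norm(predict_clean(z_t, eps, alpha_bar_t) - target)
after = np.linalg.norm(predict_clean(z_t, eps_guided, alpha_bar_t) - target)
print(after < before)
```

In this toy setting the guided noise prediction yields a clean estimate closer to the target, and the diffusion model itself is never updated. Only the guidance loss changes between modalities, which is the property the abstract emphasizes.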

Citation History

Jan 28, 2026: 380
Feb 13, 2026: 399 (+19)