Arrow Research search
Back to ICLR

ICLR 2024

Universal Guidance for Diffusion Models

Conference Paper Accept (poster) Artificial Intelligence ยท Machine Learning

Abstract

Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining. In this work, we propose a universal guidance algorithm that enables diffusion models to be controlled by arbitrary guidance modalities without the need to retrain any use-specific components. We show that our algorithm successfully generates quality images with guidance functions including segmentation, face recognition, object detection, style guidance and classifier signals.

Authors

Keywords

  • Generative Models
  • Computer Vision
  • Diffusion Models

Context

Venue
International Conference on Learning Representations
Archive span
2013-2025
Indexed papers
10294
Paper id
747136457449942268