Universal Guidance for Diffusion Models

Arpit Bansal; Hong-Min Chu; Avi Schwarzschild; Soumyadip Sengupta; Micah Goldblum; Jonas Geiping; Tom Goldstein

Back to ICLR

ICLR 2024

Universal Guidance for Diffusion Models

Conference Paper Accept (poster) Artificial Intelligence · Machine Learning

Details

Abstract

Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining. In this work, we propose a universal guidance algorithm that enables diffusion models to be controlled by arbitrary guidance modalities without the need to retrain any use-specific components. We show that our algorithm successfully generates quality images with guidance functions including segmentation, face recognition, object detection, style guidance and classifier signals.

Authors

Keywords

Generative Models
Computer Vision
Diffusion Models

Context

Venue: International Conference on Learning Representations
Archive span: 2013-2025
Indexed papers: 10294
Paper id: 747136457449942268