Text Diffusion with Reinforced Conditioning

Yuxuan Liu; Tianchi Yang; Shaohan Huang; Zihan Zhang; Haizhen Huang; Furu Wei; Weiwei Deng; Feng Sun; Qi Zhang

doi:10.1609/aaai.v38i12.29316

Back to AAAI

AAAI 2024

Text Diffusion with Reinforced Conditioning

Conference Paper AAAI Technical Track on Machine Learning III Artificial Intelligence

PDF Details DOI

Abstract

Diffusion models have demonstrated exceptional capability in generating high-quality images, videos, and audio. Due to their adaptiveness in iterative refinement, they provide a strong potential for achieving better non-autoregressive sequence generation. However, existing text diffusion models still fall short in their performance due to a challenge in handling the discreteness of language. This paper thoroughly analyzes text diffusion models and uncovers two significant limitations: degradation of self-conditioning during training and misalignment between training and sampling. Motivated by our findings, we propose a novel Text Diffusion model called TReC, which mitigates the degradation with Reinforced Conditioning and the misalignment by Time-Aware Variance Scaling. Our extensive experiments demonstrate the competitiveness of TReC against autoregressive, non-autoregressive, and diffusion baselines. Moreover, qualitative analysis shows its advanced ability to fully utilize the diffusion process in refining samples.

Authors

Keywords

ML: Deep Generative Models & Autoencoders
ML: Reinforcement Learning
NLP: (Large) Language Models
NLP: Generation

Context

Venue: AAAI Conference on Artificial Intelligence
Archive span: 1980-2026
Indexed papers: 28718
Paper id: 805288530126606456