Arrow Research search
Back to AAAI

AAAI 2023

Layout Generation as Intermediate Action Sequence Prediction

Conference Paper AAAI Technical Track on Machine Learning IV Artificial Intelligence

Abstract

Layout generation plays a crucial role in graphic design intelligence. One important characteristic of the graphic layouts is that they usually follow certain design principles. For example, the principle of repetition emphasizes the reuse of similar visual elements throughout the design. To generate a layout, previous works mainly attempt at predicting the absolute value of bounding box for each element, where such target representation has hidden the information of higher-order design operations like repetition (e.g. copy the size of the previously generated element). In this paper, we introduce a novel action schema to encode these operations for better modeling the generation process. Instead of predicting the bounding box values, our approach autoregressively outputs the intermediate action sequence, which can then be deterministically converted to the final layout. We achieve state-of-the-art performances on three datasets. Both automatic and human evaluations show that our approach generates high-quality and diverse layouts. Furthermore, we revisit the commonly used evaluation metric FID adapted in this task, and observe that previous works use different settings to train the feature extractor for obtaining real/generated data distribution, which leads to inconsistent conclusions. We conduct an in-depth analysis on this metric and settle for a more robust and reliable evaluation setting. Code is available at this website.

Authors

Keywords

  • CV: Applications
  • CV: Computational Photography, Image & Video Synthesis
  • ML: Applications
  • ML: Deep Generative Models & Autoencoders
  • ML: Deep Neural Network Algorithms
  • ML: Evaluation and Analysis (Machine Learning)

Context

Venue
AAAI Conference on Artificial Intelligence
Archive span
1980-2026
Indexed papers
28718
Paper id
442548551722536634