Arrow Research search
Back to ICRA

ICRA 2024

CVAE-SM: A Conditional Variational Autoencoder with Style Modulation for Efficient Uncertainty Quantification

Conference Paper Accepted Paper Artificial Intelligence ยท Robotics

Abstract

Deep learning has brought transformative advancements to object segmentation, especially in marine robotics contexts such as waste management and subaquatic infrastructure oversight. However, a central challenge persists: calibrating the prediction confidence of the model to ensure robust and reliable outcomes, especially within the demanding underwater environment. Existing solutions for estimating uncertainty are often computationally intensive and have largely centered around Bayesian neural networks or ensemble methods. In this paper, we present a Conditional Variational Autoencoder-based framework (CVAE-SM), which is capable of generating diverse latent codes for improved uncertainty quantification in image segmentation. Our method, enhanced by a style modulator, merges content features, and latent codes more effectively, leading to refined prediction of uncertainty levels. We further introduce a dataset of perturbed underwater images to benchmark uncertainty quantification in this domain. The proposed model not only surpasses peers in segmentation metrics but also matches ensemble models in uncertainty predictions, all while being 2. 5 times faster.

Authors

Keywords

  • Waste management
  • Image segmentation
  • Uncertainty
  • Codes
  • Accuracy
  • Modulation
  • Stochastic processes
  • Variational Autoencoder
  • Uncertainty Quantification
  • Conditional Variational Autoencoder
  • Benchmark
  • Ensemble Model
  • Ensemble Method
  • Content Features
  • Object Segmentation
  • Model Confidence
  • Undersea
  • Latent Code
  • Bayesian Neural Network
  • Underwater Image
  • Posterior Probability
  • Decoding
  • Latent Variables
  • Feature Maps
  • Training Phase
  • Kullback-Leibler
  • Latent Space
  • Segmentation Map
  • Entropy Values
  • Brier Score
  • Stochastic Character
  • Image X
  • Ground Truth Segmentation
  • Epistemic Uncertainty
  • Class Dimension
  • Latent Vector

Context

Venue
IEEE International Conference on Robotics and Automation
Archive span
1984-2025
Indexed papers
30179
Paper id
1024662410131708006