Arrow Research search
Back to IROS

IROS 2024

Diff-Control: A Stateful Diffusion-based Policy for Imitation Learning

Conference Paper Accepted Paper Artificial Intelligence ยท Robotics

Abstract

While imitation learning provides a simple and effective framework for policy learning, acquiring consistent action during robot execution remains a challenging task. Existing approaches primarily focus on either modifying the action representation at data curation stage or altering the model itself, both of which do not fully address the scalability of consistent action generation. To overcome this limitation, we introduce the Diff-Control policy, which utilizes a diffusion-based model to learn action representation from a state-space modeling viewpoint. We demonstrate that diffusion-based policies can acquire statefulness through a Bayesian formulation facilitated by ControlNet, leading to improved robustness and success rates. Our experimental results demonstrate the significance of incorporating action statefulness in policy learning, where Diff-Control shows improved performance across various tasks. Specifically, Diff-Control achieves an average success rate of 72% and 84% on stateful and dynamic tasks, respectively. Notably, Diff-Control also shows consistent performance in the presence of perturbations, outperforming other state-of-the-art methods that falter under similar conditions. Project page: https://diff-control.github.io/

Authors

Keywords

  • Training
  • Torque
  • Imitation learning
  • Scalability
  • Perturbation methods
  • Process control
  • Sensor systems and applications
  • Robustness
  • Intelligent sensors
  • Intelligent robots
  • State-space Model
  • Action Representation
  • Policy Learning
  • Presence Of Perturbations
  • Bayesian Formulation
  • Neural Network
  • Denoising
  • Transition State
  • Bimodal
  • Convolutional Layers
  • Sequence Of Actions
  • Lidocaine
  • Diffusion Model
  • Baseline Methods
  • Transition Model
  • Language Status
  • Policy Network
  • Temporal Consistency
  • Diverse Tasks
  • Expert Demonstrations
  • Efficacy Of Policies
  • Distractor Objects
  • Robot Learning
  • Differential Filter
  • Improve Success Rates
  • Task Duration
  • Previous Activity

Context

Venue
IEEE/RSJ International Conference on Intelligent Robots and Systems
Archive span
1988-2025
Indexed papers
26578
Paper id
1125783400582284562