Diff-Control: A Stateful Diffusion-based Policy for Imitation Learning

Xiao Liu; Yifan Zhou; Fabian Clemens Weigend; Shubham D. Sonawani; Shuhei Ikemoto; Heni Ben Amor

Back to IROS

IROS 2024

Diff-Control: A Stateful Diffusion-based Policy for Imitation Learning

Conference Paper Accepted Paper Artificial Intelligence · Robotics

Details

Abstract

While imitation learning provides a simple and effective framework for policy learning, acquiring consistent action during robot execution remains a challenging task. Existing approaches primarily focus on either modifying the action representation at data curation stage or altering the model itself, both of which do not fully address the scalability of consistent action generation. To overcome this limitation, we introduce the Diff-Control policy, which utilizes a diffusion-based model to learn action representation from a state-space modeling viewpoint. We demonstrate that diffusion-based policies can acquire statefulness through a Bayesian formulation facilitated by ControlNet, leading to improved robustness and success rates. Our experimental results demonstrate the significance of incorporating action statefulness in policy learning, where Diff-Control shows improved performance across various tasks. Specifically, Diff-Control achieves an average success rate of 72% and 84% on stateful and dynamic tasks, respectively. Notably, Diff-Control also shows consistent performance in the presence of perturbations, outperforming other state-of-the-art methods that falter under similar conditions. Project page: https://diff-control.github.io/

Authors

Keywords

Training
Torque
Imitation learning
Scalability
Perturbation methods
Process control
Sensor systems and applications
Robustness
Intelligent sensors
Intelligent robots
State-space Model
Action Representation
Policy Learning
Presence Of Perturbations
Bayesian Formulation
Neural Network
Denoising
Transition State
Bimodal
Convolutional Layers
Sequence Of Actions
Lidocaine
Diffusion Model
Baseline Methods
Transition Model
Language Status
Policy Network
Temporal Consistency
Diverse Tasks
Expert Demonstrations
Efficacy Of Policies
Distractor Objects
Robot Learning
Differential Filter
Improve Success Rates
Task Duration
Previous Activity

Context

Venue: IEEE/RSJ International Conference on Intelligent Robots and Systems
Archive span: 1988-2025
Indexed papers: 26578
Paper id: 1125783400582284562