Arrow Research search
Back to IROS

IROS 2018

Multi-Agent Imitation Learning for Driving Simulation

Conference Paper Accepted Paper Artificial Intelligence ยท Robotics

Abstract

Simulation is an appealing option for validating the safety of autonomous vehicles. Generative Adversarial Imitation Learning (GAIL) has recently been shown to learn representative human driver models. These human driver models were learned through training in single-agent environments, but they have difficulty in generalizing to multi-agent driving scenarios. We argue these difficulties arise because observations at training and test time are sampled from different distributions. This difference makes such models unsuitable for the simulation of driving scenes, where multiple agents must interact realistically over long time horizons. We extend GAIL to address these shortcomings through a parameter-sharing approach grounded in curriculum learning. Compared with single-agent GAIL policies, policies generated by our PS-GAIL method prove superior at interacting stably in a multi-agent setting and capturing the emergent behavior of human drivers.

Authors

Keywords

  • Vehicles
  • Training
  • Trajectory
  • Optimization
  • Biological system modeling
  • Testing
  • Markov processes
  • Imitation Learning
  • Horizon
  • Human Model
  • Autonomous Vehicles
  • Emergent Behavior
  • Vehicle Safety
  • Human Drivers
  • Driver Model
  • Neural Network
  • Root Mean Square Error
  • Mean Square Error
  • Environmental Policy
  • Recurrent Neural Network
  • Feed-forward Network
  • Optimal Policy
  • Number Of Agents
  • Transition Model
  • Reward Function
  • Markov Decision Process
  • Policy Learning
  • Observation Space
  • Covariate Shift
  • Multi-agent Reinforcement Learning
  • Inverse Reinforcement Learning
  • Partial Observation
  • Gated Recurrent Unit
  • State-action Pair
  • Step Size Parameter

Context

Venue
IEEE/RSJ International Conference on Intelligent Robots and Systems
Archive span
1988-2025
Indexed papers
26578
Paper id
281243018705276200