Arrow Research search
Back to IROS

IROS 2018

Teaching Robots to Predict Human Motion

Conference Paper Accepted Paper Artificial Intelligence · Robotics

Abstract

Teaching a robot to predict and mimic how a human moves or acts in the near future by observing a series of historical human movements is a crucial first step in human-robot interaction and collaboration. In this paper, we instrument a robot with such a prediction ability by leveraging recent deep learning and computer vision techniques. First, our system takes images from the robot camera as input to produce the corresponding human skeleton based on real-time human pose estimation obtained with the OpenPose library. Then, conditioning on this historical sequence, the robot forecasts plausible motion through a motion predictor, generating a corresponding demonstration. Because of a lack of high-level fidelity validation, existing forecasting algorithms suffer from error accumulation and inaccurate prediction. Inspired by generative adversarial networks (GANs), we introduce a global discriminator that examines whether the predicted sequence is smooth and realistic. Our resulting motion GAN model achieves superior prediction performance to state-of-the-art approaches when evaluated on the standard H3. 6M dataset. Based on this motion GAN model, the robot demonstrates its ability to replay the predicted motion in a human-like manner when interacting with a person.

Authors

Keywords

  • Robots
  • Gallium nitride
  • Generative adversarial networks
  • Predictive models
  • Cameras
  • Skeleton
  • Decoding
  • Human Motion
  • Deep Learning
  • Near Future
  • Pose Estimation
  • Human-robot Interaction
  • Error Accumulation
  • Inaccurate Predictions
  • Motion Prediction
  • Human Pose Estimation
  • Human Pose
  • Historical Sequence
  • Series Of Movements
  • Generation Sequencing
  • Recurrent Network
  • Long Short-term Memory
  • Recurrent Neural Network
  • Point Cloud
  • Variety Of Tasks
  • Input Sequence
  • Gated Recurrent Unit
  • Latent Representation
  • 3D Point
  • Encoder-decoder Network
  • Inference Stage
  • Competitive Game
  • Global Perspective
  • Depth Camera
  • Gated Recurrent Unit Layer
  • Hidden Size

Context

Venue
IEEE/RSJ International Conference on Intelligent Robots and Systems
Archive span
1988-2025
Indexed papers
26578
Paper id
140402950646105596