Arrow Research search
Back to IROS

IROS 2023

Object Goal Navigation with Recursive Implicit Maps

Conference Paper Accepted Paper Artificial Intelligence ยท Robotics

Abstract

Object goal navigation aims to navigate an agent to locations of a given object category in unseen environments. Classical methods explicitly build maps of environments and require extensive engineering while lacking semantic information for object-oriented exploration. On the other hand, end-to-end learning methods alleviate manual map design and predict actions using implicit representations. Such methods, however, lack an explicit notion of geometry and may have limited ability to encode navigation history. In this work, we propose an implicit spatial map for object goal navigation. Our implicit map is recursively updated with new observations at each step using a transformer. To encourage spatial reasoning, we introduce auxiliary tasks and train our model to reconstruct explicit maps as well as to predict visual features, semantic labels and actions. Our method significantly outperforms the state of the art on the challenging MP3D dataset and generalizes well to the HM3D dataset. We successfully deploy our model on a real robot and achieve encouraging object goal navigation results in real scenes using only a few real-world demonstrations. Code, trained models and videos are available at https://www.di.ens.fr/willow/research/onav_rim/.

Authors

Keywords

  • Learning systems
  • Visualization
  • Navigation
  • Object oriented modeling
  • Semantics
  • Manuals
  • Predictive models
  • Goal Navigation
  • Visual Features
  • Implicit Function
  • Explicit Function
  • Real Robot
  • Auxiliary Task
  • Implicit Representation
  • Map Design
  • Unseen Environments
  • Navigation Map
  • Long-term Memory
  • Number Of Steps
  • Recurrent Neural Network
  • Feed-forward Network
  • Target Object
  • Depth Images
  • Map Representation
  • Machine Vision
  • Relative Gain
  • Efficient Exploration
  • Occupancy Map
  • Simultaneous Localization And Mapping
  • Beginning Of Episode
  • Vertical Field Of View
  • Navigation In Environments
  • Semantic Prediction
  • Position Of Agent
  • Semantic Map
  • Recurrence Status

Context

Venue
IEEE/RSJ International Conference on Intelligent Robots and Systems
Archive span
1988-2025
Indexed papers
26578
Paper id
47092951964763977