ICRA 2025
Closed Loop Interactive Embodied Reasoning for Robot Manipulation
Abstract
Embodied reasoning systems integrate robotic hardware and cognitive processes to perform complex tasks, typically in response to a natural language query about a specific physical environment. This usually involves changing the belief about the scene or physically interacting and changing the scene (e. g. sort the objects from lightest to heaviest). In order to facilitate the development of such systems we introduce a new modular Closed Loop Interactive Embodied Reasoning (CLIER) approach that takes into account the measurements of non-visual object properties, changes in the scene caused by external disturbances as well as uncertain outcomes of robotic actions. CLIER performs multi-modal reasoning and action planning and generates a sequence of primitive actions that can be executed by a robot manipulator. Our method operates in a closed loop, responding to changes in the environment. Our approach is developed with the use of MuBle simulation environment and tested in $\mathbf{1 0}$ interactive benchmark scenarios. We extensively evaluate our reasoning approach in simulation and in real-world manipulation tasks with a success rate above $\mathbf{7 6 \%}$ and 64%, respectively.
Authors
Keywords
Context
- Venue
- IEEE International Conference on Robotics and Automation
- Archive span
- 1984-2025
- Indexed papers
- 30179
- Paper id
- 567273527886728470