Arrow Research search
Back to IROS

IROS 2022

CloudAttention: Efficient Multi-Scale Attention Scheme For 3D Point Cloud Learning

Conference Paper Accepted Paper Artificial Intelligence ยท Robotics

Abstract

Processing 3D data efficiently has always been a challenge. Spatial operations on large-scale point clouds, stored as sparse data, require extra cost. Attracted by the success of transformers, researchers are using multi-head attention for vision tasks. However, attention calculations in transformers come with quadratic complexity in the number of inputs and miss spatial intuition on sets like point clouds. We redesign set transformers in this work and incorporate them into a hierarchical framework for shape classification and part and scene segmentation. We propose our local attention unit, which captures features in a spatial neighborhood. We also compute efficient and dynamic global cross attentions by leveraging sampling and grouping at each iteration. Finally, to mitigate the non-heterogeneity of point clouds, we propose an efficient Multi-Scale Tokenization (MST), which extracts scale-invariant tokens for attention operations. The proposed hierarchical model achieves state-of-the-art shape classification in mean accuracy and yields results on par with the previous segmentation methods while requiring significantly fewer computations. Our proposed architecture predicts segmentation labels with around half the latency and parameter count of the previous most effi-cient method with comparable performance. The code is available at https://github.com/YigeWang-WHU/CloudAttention.

Authors

Keywords

  • Point cloud compression
  • Three-dimensional displays
  • Shape
  • Pipelines
  • Semantics
  • Transformers
  • Tokenization
  • Point Cloud
  • 3D Point Cloud
  • Point Cloud Learning
  • Attention Mechanism
  • Mean Accuracy
  • Global Attention
  • Local Units
  • Shape Classification
  • Local Attention
  • Quadratic Complexity
  • Scene Segmentation
  • Intersection Over Union
  • Receptive Field
  • Large-scale Applications
  • Semantic Segmentation
  • Segmentation Task
  • 3D Coordinates
  • Self-similarity
  • Multi-scale Features
  • Linear Layer
  • Raw Point
  • Tokenized
  • Input Point Cloud
  • Mean Intersection Over Union
  • Point Cloud Data
  • Graph Neural Networks
  • Millions Of Points
  • Part Segmentation
  • CAD Model
  • Input Point

Context

Venue
IEEE/RSJ International Conference on Intelligent Robots and Systems
Archive span
1988-2025
Indexed papers
26578
Paper id
1047173661168519964