Search | arXiv e-print repository

ConvBKI: Real-Time Probabilistic Semantic Map** Network with Quantifiable Uncertainty

Authors: Joey Wilson, Yuewei Fu, Joshua Friesen, Parker Ewen, Andrew Capodieci, Paramsothy Jayakumar, Kira Barton, Maani Ghaffari

Abstract: In this paper, we develop a modular neural network for real-time semantic map** in uncertain environments, which explicitly updates per-voxel probabilistic distributions within a neural network layer. Our approach combines the reliability of classical probabilistic algorithms with the performance and efficiency of modern neural networks. Although robotic perception is often divided between moder… ▽ More In this paper, we develop a modular neural network for real-time semantic map** in uncertain environments, which explicitly updates per-voxel probabilistic distributions within a neural network layer. Our approach combines the reliability of classical probabilistic algorithms with the performance and efficiency of modern neural networks. Although robotic perception is often divided between modern differentiable methods and classical explicit methods, a union of both is necessary for real-time and trustworthy performance. We introduce a novel Convolutional Bayesian Kernel Inference (ConvBKI) layer which incorporates semantic segmentation predictions online into a 3D map through a depthwise convolution layer by leveraging conjugate priors. We compare ConvBKI against state-of-the-art deep learning approaches and probabilistic algorithms for map** to evaluate reliability and performance. We also create a Robot Operating System (ROS) package of ConvBKI and test it on real-world perceptually challenging off-road driving data. △ Less

Submitted 26 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

Comments: arXiv admin note: text overlap with arXiv:2209.10663

arXiv:2209.10663 [pdf, other]

Convolutional Bayesian Kernel Inference for 3D Semantic Map**

Authors: Joey Wilson, Yuewei Fu, Arthur Zhang, **gyu Song, Andrew Capodieci, Paramsothy Jayakumar, Kira Barton, Maani Ghaffari

Abstract: Robotic perception is currently at a cross-roads between modern methods, which operate in an efficient latent space, and classical methods, which are mathematically founded and provide interpretable, trustworthy results. In this paper, we introduce a Convolutional Bayesian Kernel Inference (ConvBKI) layer which learns to perform explicit Bayesian inference within a depthwise separable convolution… ▽ More Robotic perception is currently at a cross-roads between modern methods, which operate in an efficient latent space, and classical methods, which are mathematically founded and provide interpretable, trustworthy results. In this paper, we introduce a Convolutional Bayesian Kernel Inference (ConvBKI) layer which learns to perform explicit Bayesian inference within a depthwise separable convolution layer to maximize efficency while maintaining reliability simultaneously. We apply our layer to the task of real-time 3D semantic map**, where we learn semantic-geometric probability distributions for LiDAR sensor information and incorporate semantic predictions into a global map. We evaluate our network against state-of-the-art semantic map** algorithms on the KITTI data set, demonstrating improved latency with comparable semantic label inference results. △ Less

Submitted 31 May, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

arXiv:2203.07060 [pdf, other]

MotionSC: Data Set and Network for Real-Time Semantic Map** in Dynamic Environments

Authors: Joey Wilson, **gyu Song, Yuewei Fu, Arthur Zhang, Andrew Capodieci, Paramsothy Jayakumar, Kira Barton, Maani Ghaffari

Abstract: This work addresses a gap in semantic scene completion (SSC) data by creating a novel outdoor data set with accurate and complete dynamic scenes. Our data set is formed from randomly sampled views of the world at each time step, which supervises generalizability to complete scenes without occlusions or traces. We create SSC baselines from state-of-the-art open source networks and construct a bench… ▽ More This work addresses a gap in semantic scene completion (SSC) data by creating a novel outdoor data set with accurate and complete dynamic scenes. Our data set is formed from randomly sampled views of the world at each time step, which supervises generalizability to complete scenes without occlusions or traces. We create SSC baselines from state-of-the-art open source networks and construct a benchmark real-time dense local semantic map** algorithm, MotionSC, by leveraging recent 3D deep learning architectures to enhance SSC with temporal information. Our network shows that the proposed data set can quantify and supervise accurate scene completion in the presence of dynamic objects, which can lead to the development of improved dynamic map** algorithms. All software is available at https://github.com/UMich-CURLY/3DMap**. △ Less

Submitted 30 June, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

arXiv:2108.03180 [pdf, other]

Dynamic Semantic Occupancy Map** using 3D Scene Flow and Closed-Form Bayesian Inference

Authors: Aishwarya Unnikrishnan, Joey Wilson, Lu Gan, Andrew Capodieci, Paramsothy Jayakumar, Kira Barton, Maani Ghaffari

Abstract: This paper reports on a dynamic semantic map** framework that incorporates 3D scene flow measurements into a closed-form Bayesian inference model. Existence of dynamic objects in the environment can cause artifacts and traces in current map** algorithms, leading to an inconsistent map posterior. We leverage state-of-the-art semantic segmentation and 3D flow estimation using deep learning to pr… ▽ More This paper reports on a dynamic semantic map** framework that incorporates 3D scene flow measurements into a closed-form Bayesian inference model. Existence of dynamic objects in the environment can cause artifacts and traces in current map** algorithms, leading to an inconsistent map posterior. We leverage state-of-the-art semantic segmentation and 3D flow estimation using deep learning to provide measurements for map inference. We develop a Bayesian model that propagates the scene with flow and infers a 3D continuous (i.e., can be queried at arbitrary resolution) semantic occupancy map outperforming its static counterpart. Extensive experiments using publicly available data sets show that the proposed framework improves over its predecessors and input measurements from deep neural networks consistently. △ Less

Submitted 6 September, 2022; v1 submitted 6 August, 2021; originally announced August 2021.

Showing 1–4 of 4 results for author: Capodieci, A