Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection

Yang, Mingyu; Chen, Yu; Kim, Hun-Seok

Computer Science > Computer Vision and Pattern Recognition

arXiv:2205.06187 (cs)

[Submitted on 12 May 2022 (v1), last revised 19 Oct 2022 (this version, v2)]

Title:Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection

Authors:Mingyu Yang, Yu Chen, Hun-Seok Kim

View PDF

Abstract:In recent years, deep learning-based approaches for visual-inertial odometry (VIO) have shown remarkable performance outperforming traditional geometric methods. Yet, all existing methods use both the visual and inertial measurements for every pose estimation incurring potential computational redundancy. While visual data processing is much more expensive than that for the inertial measurement unit (IMU), it may not always contribute to improving the pose estimation accuracy. In this paper, we propose an adaptive deep-learning based VIO method that reduces computational redundancy by opportunistically disabling the visual modality. Specifically, we train a policy network that learns to deactivate the visual feature extractor on the fly based on the current motion state and IMU readings. A Gumbel-Softmax trick is adopted to train the policy network to make the decision process differentiable for end-to-end system training. The learned strategy is interpretable, and it shows scenario-dependent decision patterns for adaptive complexity reduction. Experiment results show that our method achieves a similar or even better performance than the full-modality baseline with up to 78.8% computational complexity reduction for KITTI dataset evaluation. The code is available at this https URL.

Comments:	Accepted to ECCV 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.06187 [cs.CV]
	(or arXiv:2205.06187v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2205.06187

Submission history

From: Mingyu Yang [view email]
[v1] Thu, 12 May 2022 16:17:49 UTC (671 KB)
[v2] Wed, 19 Oct 2022 18:51:53 UTC (1,982 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators