The Treachery of Images: Bayesian Scene Keypoints for Deep Policy Learning in Robotic Manipulation

von Hartz, Jan Ole; Chisari, Eugenio; Welschehold, Tim; Burgard, Wolfram; Boedecker, Joschka; Valada, Abhinav

doi:10.1109/LRA.2023.3313917

Computer Science > Robotics

arXiv:2305.04718 (cs)

[Submitted on 8 May 2023 (v1), last revised 20 Sep 2023 (this version, v3)]

Title:The Treachery of Images: Bayesian Scene Keypoints for Deep Policy Learning in Robotic Manipulation

Authors:Jan Ole von Hartz, Eugenio Chisari, Tim Welschehold, Wolfram Burgard, Joschka Boedecker, Abhinav Valada

View PDF

Abstract:In policy learning for robotic manipulation, sample efficiency is of paramount importance. Thus, learning and extracting more compact representations from camera observations is a promising avenue. However, current methods often assume full observability of the scene and struggle with scale invariance. In many tasks and settings, this assumption does not hold as objects in the scene are often occluded or lie outside the field of view of the camera, rendering the camera observation ambiguous with regard to their location. To tackle this problem, we present BASK, a Bayesian approach to tracking scale-invariant keypoints over time. Our approach successfully resolves inherent ambiguities in images, enabling keypoint tracking on symmetrical objects and occluded and out-of-view objects. We employ our method to learn challenging multi-object robot manipulation tasks from wrist camera observations and demonstrate superior utility for policy learning compared to other representation learning techniques. Furthermore, we show outstanding robustness towards disturbances such as clutter, occlusions, and noisy depth measurements, as well as generalization to unseen objects both in simulation and real-world robotic experiments.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.04718 [cs.RO]
	(or arXiv:2305.04718v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2305.04718
Journal reference:	IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 6931-6938, Nov. 2023
Related DOI:	https://doi.org/10.1109/LRA.2023.3313917

Submission history

From: Jan Ole von Hartz [view email]
[v1] Mon, 8 May 2023 14:05:38 UTC (14,047 KB)
[v2] Sat, 5 Aug 2023 12:09:19 UTC (18,807 KB)
[v3] Wed, 20 Sep 2023 13:24:51 UTC (18,807 KB)

Computer Science > Robotics

Title:The Treachery of Images: Bayesian Scene Keypoints for Deep Policy Learning in Robotic Manipulation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:The Treachery of Images: Bayesian Scene Keypoints for Deep Policy Learning in Robotic Manipulation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators