Skip to main content

Showing 1–10 of 10 results for author: Thalhammer, S

.
  1. arXiv:2406.14385  [pdf, other

    cs.RO

    Semi-Autonomous Mobile Search and Rescue Robot for Radiation Disaster Scenarios

    Authors: Simon Schwaiger, Lucas Muster, Georg Novotny, Michael Schebek, Wilfried Wöber, Stefan Thalhammer, Christoph Böhm

    Abstract: This paper describes a novel semi-autonomous mobile robot system designed to assist search and rescue (SAR) first responders in disaster scenarios. While robots offer significant potential in SAR missions, current solutions are limited in their ability to handle a diverse range of tasks. This gap is addressed by presenting a system capable of (1) autonomous navigation and map**, allowing the rob… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2402.06436  [pdf, other

    cs.CV

    Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation

    Authors: Peter Hönig, Stefan Thalhammer, Markus Vincze

    Abstract: Estimating 2D-3D correspondences between RGB images and 3D space is a fundamental problem in 6D object pose estimation. Recent pose estimators use dense correspondence maps and Point-to-Point algorithms to estimate object poses. The accuracy of pose estimation depends heavily on the quality of the dense correspondence maps and their ability to withstand occlusion, clutter, and challenging material… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: Submitted to the First Austrian Symposium on AI, Robotics, and Vision 2024

  3. arXiv:2402.04878  [pdf, other

    cs.CV

    STAR: Shape-focused Texture Agnostic Representations for Improved Object Detection and 6D Pose Estimation

    Authors: Peter Hönig, Stefan Thalhammer, Jean-Baptiste Weibel, Matthias Hirschmanner, Markus Vincze

    Abstract: Recent advances in machine learning have greatly benefited object detection and 6D pose estimation for robotic gras**. However, textureless and metallic objects still pose a significant challenge due to fewer visual cues and the texture bias of CNNs. To address this issue, we propose a texture-agnostic approach that focuses on learning from CAD models and emphasizes object shape features. To ach… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Submitted to IEEE Robotics and Automation Letters

  4. arXiv:2309.11986  [pdf, other

    cs.CV

    ZS6D: Zero-shot 6D Object Pose Estimation using Vision Transformers

    Authors: Philipp Ausserlechner, David Haberger, Stefan Thalhammer, Jean-Baptiste Weibel, Markus Vincze

    Abstract: As robotic systems increasingly encounter complex and unconstrained real-world scenarios, there is a demand to recognize diverse objects. The state-of-the-art 6D object pose estimation methods rely on object-specific training and therefore do not generalize to unseen objects. Recent novel object pose estimation methods are solving this issue using task-specific fine-tuned CNNs for deep template ma… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  5. arXiv:2307.12172  [pdf, ps, other

    cs.RO cs.CV

    Challenges for Monocular 6D Object Pose Estimation in Robotics

    Authors: Stefan Thalhammer, Dominik Bauer, Peter Hönig, Jean-Baptiste Weibel, José García-Rodríguez, Markus Vincze

    Abstract: Object pose estimation is a core perception task that enables, for example, object gras** and scene understanding. The widely available, inexpensive and high-resolution RGB sensors and CNNs that allow for fast inference based on this modality make monocular approaches especially well suited for robotics applications. We observe that previous surveys on object pose estimation establish the state… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.11827

  6. arXiv:2306.00129  [pdf, ps, other

    cs.CV

    Self-supervised Vision Transformers for 3D Pose Estimation of Novel Objects

    Authors: Stefan Thalhammer, Jean-Baptiste Weibel, Markus Vincze, Jose Garcia-Rodriguez

    Abstract: Object pose estimation is important for object manipulation and scene understanding. In order to improve the general applicability of pose estimators, recent research focuses on providing estimates for novel objects, that is objects unseen during training. Such works use deep template matching strategies to retrieve the closest template connected to a query image. This template retrieval implicitl… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  7. arXiv:2302.11827   

    cs.CV

    Open Challenges for Monocular Single-shot 6D Object Pose Estimation

    Authors: Stefan Thalhammer, Peter Hönig, Jean-Baptiste Weibel, Markus Vincze

    Abstract: Object pose estimation is a non-trivial task that enables robotic manipulation, bin picking, augmented reality, and scene understanding, to name a few use cases. Monocular object pose estimation gained considerable momentum with the rise of high-performing deep learning-based solutions and is particularly interesting for the community since sensors are inexpensive and inference is fast. Prior work… ▽ More

    Submitted 20 July, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Revised version in the making

  8. arXiv:2211.08182  [pdf, other

    cs.CV cs.RO

    Gras** the Inconspicuous

    Authors: Hrishikesh Gupta, Stefan Thalhammer, Markus Leitner, Markus Vincze

    Abstract: Transparent objects are common in day-to-day life and hence find many applications that require robot gras**. Many solutions toward object gras** exist for non-transparent objects. However, due to the unique visual properties of transparent objects, standard 3D sensors produce noisy or distorted measurements. Modern approaches tackle this problem by either refining the noisy depth measurements… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  9. arXiv:2208.08807  [pdf, other

    cs.CV

    COPE: End-to-end trainable Constant Runtime Object Pose Estimation

    Authors: Stefan Thalhammer, Timothy Patten, Markus Vincze

    Abstract: State-of-the-art object pose estimation handles multiple instances in a test image by using multi-model formulations: detection as a first stage and then separately trained networks per object for 2D-3D geometric correspondence prediction as a second stage. Poses are subsequently estimated using the Perspective-n-Points algorithm at runtime. Unfortunately, multi-model formulations are slow and do… ▽ More

    Submitted 22 August, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

  10. arXiv:2010.16117  [pdf, other

    cs.CV

    PyraPose: Feature Pyramids for Fast and Accurate Object Pose Estimation under Domain Shift

    Authors: Stefan Thalhammer, Markus Leitner, Timothy Patten, Markus Vincze

    Abstract: Object pose estimation enables robots to understand and interact with their environments. Training with synthetic data is necessary in order to adapt to novel situations. Unfortunately, pose estimation under domain shift, i.e., training on synthetic data and testing in the real world, is challenging. Deep learning-based approaches currently perform best when using encoder-decoder networks but typi… ▽ More

    Submitted 30 October, 2020; originally announced October 2020.