Skip to main content

Showing 1–16 of 16 results for author: Sundermeyer, M

.
  1. arXiv:2403.09799  [pdf, other

    cs.CV cs.RO

    BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects

    Authors: Tomas Hodan, Martin Sundermeyer, Yann Labbe, Van Nguyen Nguyen, Gu Wang, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Jiri Matas

    Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2023, the fifth in a series of public competitions organized to capture the state of the art in model-based 6D object pose estimation from an RGB/RGB-D image and related tasks. Besides the three tasks from 2022 (model-based 2D detection, 2D segmentation, and 6D localization of objects seen during training), the 2023 c… ▽ More

    Submitted 16 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.13075

  2. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2311.12588  [pdf, other

    cs.CV

    HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation

    Authors: Yongliang Lin, Yongzhi Su, Praveen Nathan, Sandeep Inuganti, Yan Di, Martin Sundermeyer, Fabian Manhardt, Didier Stricker, Jason Rambach, Yu Zhang

    Abstract: In this work, we present a novel dense-correspondence method for 6DoF object pose estimation from a single RGB-D image. While many existing data-driven methods achieve impressive performance, they tend to be time-consuming due to their reliance on rendering-based refinement approaches. To circumvent this limitation, we present HiPose, which establishes 3D-3D correspondences in a coarse-to-fine man… ▽ More

    Submitted 7 April, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: CVPR 2024

  4. arXiv:2303.13241  [pdf, other

    cs.CV cs.RO

    6D Object Pose Estimation from Approximate 3D Models for Orbital Robotics

    Authors: Maximilian Ulmer, Maximilian Durner, Martin Sundermeyer, Manuel Stoiber, Rudolph Triebel

    Abstract: We present a novel technique to estimate the 6D pose of objects from single images where the 3D geometry of the object is only given approximately and not as a precise 3D model. To achieve this, we employ a dense 2D-to-3D correspondence predictor that regresses 3D model coordinates for every pixel. In addition to the 3D coordinates, our model also estimates the pixel-wise coordinate error to disca… ▽ More

    Submitted 31 August, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  5. arXiv:2302.13075  [pdf, other

    cs.CV

    BOP Challenge 2022 on Detection, Segmentation and Pose Estimation of Specific Rigid Objects

    Authors: Martin Sundermeyer, Tomas Hodan, Yann Labbe, Gu Wang, Eric Brachmann, Bertram Drost, Carsten Rother, Jiri Matas

    Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2022, the fourth in a series of public competitions organized with the goal to capture the status quo in the field of 6D object pose estimation from an RGB/RGB-D image. In 2022, we witnessed another significant improvement in the pose estimation accuracy -- the state of the art, which was 56.9 AR$_C$ in 2019 (Vidal et… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2009.07378

  6. arXiv:2208.01502  [pdf, other

    cs.CV

    A Multi-body Tracking Framework - From Rigid Objects to Kinematic Structures

    Authors: Manuel Stoiber, Martin Sundermeyer, Wout Boerdijk, Rudolph Triebel

    Abstract: Kinematic structures are very common in the real world. They range from simple articulated objects to complex mechanical systems. However, despite their relevance, most model-based 3D tracking methods only consider rigid objects. To overcome this limitation, we propose a flexible framework that allows the extension of existing 6DoF algorithms to kinematic structures. Our approach focuses on method… ▽ More

    Submitted 14 February, 2023; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  7. arXiv:2203.05334  [pdf, other

    cs.CV

    Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects

    Authors: Manuel Stoiber, Martin Sundermeyer, Rudolph Triebel

    Abstract: Tracking objects in 3D space and predicting their 6DoF pose is an essential task in computer vision. State-of-the-art approaches often rely on object texture to tackle this problem. However, while they achieve impressive results, many objects do not contain sufficient texture, violating the main underlying assumption. In the following, we thus propose ICG, a novel probabilistic tracker that fuses… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  8. arXiv:2103.14127  [pdf, other

    cs.RO cs.CV

    Contact-GraspNet: Efficient 6-DoF Grasp Generation in Cluttered Scenes

    Authors: Martin Sundermeyer, Arsalan Mousavian, Rudolph Triebel, Dieter Fox

    Abstract: Gras** unseen objects in unconstrained, cluttered environments is an essential skill for autonomous robotic manipulation. Despite recent progress in full 6-DoF grasp learning, existing approaches often consist of complex sequential pipelines that possess several potential failure points and run-times unsuitable for closed-loop gras**. Therefore, we propose an end-to-end network that efficientl… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: ICRA 2021. Video of the real world experiments and code are available at https://research.nvidia.com/publication/2021-03_Contact-GraspNet%3A--Efficient

  9. arXiv:2103.06796  [pdf, other

    cs.CV cs.LG

    Unknown Object Segmentation from Stereo Images

    Authors: Maximilian Durner, Wout Boerdijk, Martin Sundermeyer, Werner Friedl, Zoltan-Csaba Marton, Rudolph Triebel

    Abstract: Although instance-aware perception is a key prerequisite for many autonomous robotic applications, most of the methods only partially solve the problem by focusing solely on known object categories. However, for robots interacting in dynamic and cluttered environments, this is not realistic and severely limits the range of potential applications. Therefore, we propose a novel object instance segme… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: 8 pages, 5 figures, 6 tables, code will be made available

  10. arXiv:2011.03279  [pdf, other

    cs.CV

    "What's This?" -- Learning to Segment Unknown Objects from Manipulation Sequences

    Authors: Wout Boerdijk, Martin Sundermeyer, Maximilian Durner, Rudolph Triebel

    Abstract: We present a novel framework for self-supervised grasped object segmentation with a robotic manipulator. Our method successively learns an agnostic foreground segmentation followed by a distinction between manipulator and object solely by observing the motion between consecutive RGB frames. In contrast to previous approaches, we propose a single, end-to-end trainable architecture which jointly inc… ▽ More

    Submitted 17 June, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: 8 pages, 6 figures,in Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2021

  11. arXiv:2009.07378  [pdf, other

    cs.CV cs.GR cs.LG cs.RO

    BOP Challenge 2020 on 6D Object Localization

    Authors: Tomas Hodan, Martin Sundermeyer, Bertram Drost, Yann Labbe, Eric Brachmann, Frank Michel, Carsten Rother, Jiri Matas

    Abstract: This paper presents the evaluation methodology, datasets, and results of the BOP Challenge 2020, the third in a series of public competitions organized with the goal to capture the status quo in the field of 6D object pose estimation from an RGB-D image. In 2020, to reduce the domain gap between synthetic training and real test RGB images, the participants were provided 350K photorealistic trainin… ▽ More

    Submitted 13 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: In ECCV 2020 Workshops Proceedings

  12. arXiv:2002.06267  [pdf, other

    physics.flu-dyn physics.ao-ph

    A generalized wave-vortex decomposition for rotating Boussinesq flows with arbitrary stratification

    Authors: Jeffrey J. Early, M. Pascale Lelong, Miles A. Sundermeyer

    Abstract: The linear wave and geostrophic (vortex) solutions are shown to be a complete basis for physical variables $(u,v,w,ρ)$ in a rotating non-hydrostatic Boussinesq model with arbitrary stratification. As a consequence, the fluid can be unambiguously separated into linear wave and geostrophic components at each instant in time, without the need for temporal filtering. The fluid can then be diagnosed fo… ▽ More

    Submitted 31 July, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

  13. arXiv:2002.04487  [pdf, other

    cs.CV cs.RO

    Self-Supervised Object-in-Gripper Segmentation from Robotic Motions

    Authors: Wout Boerdijk, Martin Sundermeyer, Maximilian Durner, Rudolph Triebel

    Abstract: Accurate object segmentation is a crucial task in the context of robotic manipulation. However, creating sufficient annotated training data for neural networks is particularly time consuming and often requires manual labeling. To this end, we propose a simple, yet robust solution for learning to segment unknown objects grasped by a robot. Specifically, we exploit motion and temporal cues in RGB vi… ▽ More

    Submitted 6 November, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: 15 pages, 11 figures. Video: https://www.youtube.com/watch?v=srEwuuIIgzI

  14. arXiv:1911.01911  [pdf, other

    cs.CV cs.GR cs.LG cs.RO

    BlenderProc

    Authors: Maximilian Denninger, Martin Sundermeyer, Dominik Winkelbauer, Youssef Zidan, Dmitry Olefir, Mohamad Elbadrawy, Ahsan Lodhi, Harinandan Katam

    Abstract: BlenderProc is a modular procedural pipeline, which helps in generating real looking images for the training of convolutional neural networks. These can be used in a variety of use cases including segmentation, depth, normal and pose estimation and many others. A key feature of our extension of blender is the simple to use modular pipeline, which was designed to be easily extendable. By offering s… ▽ More

    Submitted 25 October, 2019; originally announced November 2019.

    Comments: 7 pages, 8 figures

  15. arXiv:1908.00151  [pdf, other

    cs.CV stat.ML

    Multi-path Learning for Object Pose Estimation Across Domains

    Authors: Martin Sundermeyer, Maximilian Durner, En Yen Puang, Zoltan-Csaba Marton, Narunas Vaskevicius, Kai O. Arras, Rudolph Triebel

    Abstract: We introduce a scalable approach for object pose estimation trained on simulated RGB views of multiple 3D models together. We learn an encoding of object views that does not only describe an implicit orientation of all objects seen during training, but can also relate views of untrained objects. Our single-encoder-multi-decoder network is trained using a technique we denote "multi-path learning":… ▽ More

    Submitted 3 April, 2020; v1 submitted 31 July, 2019; originally announced August 2019.

    Comments: To appear at CVPR 2020; Code will be available here: https://github.com/DLR-RM/AugmentedAutoencoder/tree/multipath

  16. arXiv:1902.01275  [pdf, other

    cs.CV

    Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

    Authors: Martin Sundermeyer, Zoltan-Csaba Marton, Maximilian Durner, Manuel Brucker, Rudolph Triebel

    Abstract: We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalize… ▽ More

    Submitted 17 July, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

    Comments: Code available at: https://github.com/DLR-RM/AugmentedAutoencoder