Skip to main content

Showing 1–18 of 18 results for author: Rambach, J

.
  1. arXiv:2405.10557  [pdf, other

    cs.CV

    Resolving Symmetry Ambiguity in Correspondence-based Methods for Instance-level Object Pose Estimation

    Authors: Yongliang Lin, Yongzhi Su, Sandeep Inuganti, Yan Di, Naeem Ajilforoushan, Hanqing Yang, Yu Zhang, Jason Rambach

    Abstract: Estimating the 6D pose of an object from a single RGB image is a critical task that becomes additionally challenging when dealing with symmetric objects. Recent approaches typically establish one-to-one correspondences between image pixels and 3D object surface vertices. However, the utilization of one-to-one correspondences introduces ambiguity for symmetric objects. To address this, we propose S… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 8 pages,10 figures

  2. arXiv:2311.12588  [pdf, other

    cs.CV

    HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation

    Authors: Yongliang Lin, Yongzhi Su, Praveen Nathan, Sandeep Inuganti, Yan Di, Martin Sundermeyer, Fabian Manhardt, Didier Stricker, Jason Rambach, Yu Zhang

    Abstract: In this work, we present a novel dense-correspondence method for 6DoF object pose estimation from a single RGB-D image. While many existing data-driven methods achieve impressive performance, they tend to be time-consuming due to their reliance on rendering-based refinement approaches. To circumvent this limitation, we present HiPose, which establishes 3D-3D correspondences in a coarse-to-fine man… ▽ More

    Submitted 7 April, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: CVPR 2024

  3. arXiv:2309.15465  [pdf, other

    cs.CV

    Cross-Dataset Experimental Study of Radar-Camera Fusion in Bird's-Eye View

    Authors: Lukas Stäcker, Philipp Heidenreich, Jason Rambach, Didier Stricker

    Abstract: By exploiting complementary sensor information, radar and camera fusion systems have the potential to provide a highly robust and reliable perception system for advanced driver assistance systems and automated driving functions. Recent advances in camera-based object detection offer new radar-camera fusion possibilities with bird's eye view feature maps. In this work, we propose a novel and flexib… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: EUSIPCO 2023

  4. arXiv:2308.09369  [pdf, other

    cs.CV

    Single Frame Semantic Segmentation Using Multi-Modal Spherical Images

    Authors: Suresh Guttikonda, Jason Rambach

    Abstract: In recent years, the research community has shown a lot of interest to panoramic images that offer a 360-degree directional perspective. Multiple data modalities can be fed, and complimentary characteristics can be utilized for more robust and rich scene interpretation based on semantic segmentation, to fully realize the potential. Existing research, however, mostly concentrated on pinhole RGB-X s… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted at WACV 2024

  5. arXiv:2308.06383  [pdf, other

    cs.CV

    U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds

    Authors: Yan Di, Chenyangguang Zhang, Ruida Zhang, Fabian Manhardt, Yongzhi Su, Jason Rambach, Didier Stricker, Xiangyang Ji, Federico Tombari

    Abstract: In this paper, we propose U-RED, an Unsupervised shape REtrieval and Deformation pipeline that takes an arbitrary object observation as input, typically captured by RGB images or scans, and jointly retrieves and deforms the geometrically similar CAD models from a pre-established database to tightly match the target. Considering existing methods typically fail to handle noisy partial observations,… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: ICCV2023

  6. arXiv:2306.17636  [pdf, other

    cs.CV cs.AI cs.LG

    Achieving RGB-D level Segmentation Performance from a Single ToF Camera

    Authors: Pranav Sharma, Jigyasa Singh Katrolia, Jason Rambach, Bruno Mirbach, Didier Stricker, Juergen Seiler

    Abstract: Depth is a very important modality in computer vision, typically used as complementary information to RGB, provided by RGB-D cameras. In this work, we show that it is possible to obtain the same level of accuracy as RGB-D cameras on a semantic segmentation task using infrared (IR) and depth images from a single Time-of-Flight (ToF) camera. In order to fuse the IR and depth modalities of the ToF ca… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

  7. arXiv:2305.15883  [pdf, other

    cs.CV

    RC-BEVFusion: A Plug-In Module for Radar-Camera Bird's Eye View Feature Fusion

    Authors: Lukas Stäcker, Shashank Mishra, Philipp Heidenreich, Jason Rambach, Didier Stricker

    Abstract: Radars and cameras belong to the most frequently used sensors for advanced driver assistance systems and automated driving research. However, there has been surprisingly little research on radar-camera fusion with neural networks. One of the reasons is a lack of large-scale automotive datasets with radar and unmasked camera data, with the exception of the nuScenes dataset. Another reason is the di… ▽ More

    Submitted 28 September, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: GCPR 2023

  8. arXiv:2211.01142  [pdf, other

    cs.CV

    OPA-3D: Occlusion-Aware Pixel-Wise Aggregation for Monocular 3D Object Detection

    Authors: Yongzhi Su, Yan Di, Fabian Manhardt, Guangyao Zhai, Jason Rambach, Benjamin Busam, Didier Stricker, Federico Tombari

    Abstract: Despite monocular 3D object detection having recently made a significant leap forward thanks to the use of pre-trained depth estimators for pseudo-LiDAR recovery, such two-stage methods typically suffer from overfitting and are incapable of explicitly encapsulating the geometric relation between depth and object bounding box. To overcome this limitation, we instead propose OPA-3D, a single-stage,… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  9. arXiv:2203.09418  [pdf, other

    cs.CV

    ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation

    Authors: Yongzhi Su, Mahdi Saleh, Torben Fetzer, Jason Rambach, Nassir Navab, Benjamin Busam, Didier Stricker, Federico Tombari

    Abstract: Establishing correspondences from image to 3D has been a key task of 6DoF object pose estimation for a long time. To predict pose more accurately, deeply learned dense maps replaced sparse templates. Dense methods also improved pose estimation in the presence of occlusion. More recently researchers have shown improvements by learning object fragments as segmentation. In this work, we present a dis… ▽ More

    Submitted 29 March, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: CVPR2022 camera ready

  10. arXiv:2203.01052  [pdf, other

    cs.CV

    Unsupervised Anomaly Detection from Time-of-Flight Depth Images

    Authors: Pascal Schneider, Jason Rambach, Bruno Mirbach, Didier Stricker

    Abstract: Video anomaly detection (VAD) addresses the problem of automatically finding anomalous events in video data. The primary data modalities on which current VAD systems work on are monochrome or RGB images. Using depth data in this context instead is still hardly explored in spite of depth images being a popular choice in many other computer vision research areas and the increasing availability of in… ▽ More

    Submitted 12 April, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  11. arXiv:2111.08614  [pdf, other

    cs.CV

    IKEA Object State Dataset: A 6DoF object pose estimation dataset and benchmark for multi-state assembly objects

    Authors: Yongzhi Su, Mingxin Liu, Jason Rambach, Antonia Pehrson, Anton Berg, Didier Stricker

    Abstract: Utilizing 6DoF(Degrees of Freedom) pose information of an object and its components is critical for object state detection tasks. We present IKEA Object State Dataset, a new dataset that contains IKEA furniture 3D models, RGBD video of the assembly process, the 6DoF pose of furniture parts and their bounding box. The proposed dataset will be available at https://github.com/mxllmx/IKEAObjectStateDa… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

  12. arXiv:2110.11219  [pdf, other

    cs.CV

    PlaneRecNet: Multi-Task Learning with Cross-Task Consistency for Piece-Wise Plane Detection and Reconstruction from a Single RGB Image

    Authors: Yaxu Xie, Fangwen Shu, Jason Rambach, Alain Pagani, Didier Stricker

    Abstract: Piece-wise 3D planar reconstruction provides holistic scene understanding of man-made environments, especially for indoor scenarios. Most recent approaches focused on improving the segmentation and reconstruction results by introducing advanced network architectures but overlooked the dual characteristics of piece-wise planes as objects and geometric models. Different from other existing approache… ▽ More

    Submitted 30 January, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: accepted to BMVC 2021, code opensource: https://github.com/EryiXie/PlaneRecNet

  13. arXiv:2108.12196  [pdf, other

    cs.CV

    TIMo -- A Dataset for Indoor Building Monitoring with a Time-of-Flight Camera

    Authors: Pascal Schneider, Yuriy Anisimov, Raisul Islam, Bruno Mirbach, Jason Rambach, Frédéric Grandidier, Didier Stricker

    Abstract: We present TIMo (Time-of-flight Indoor Monitoring), a dataset for video-based monitoring of indoor spaces captured using a time-of-flight (ToF) camera. The resulting depth videos feature people performing a set of different predefined actions, for which we provide detailed annotations. Person detection for people counting and anomaly detection are the two targeted applications. Most existing surve… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

  14. arXiv:2108.08166  [pdf, other

    cs.CV cs.RO

    Deployment of Deep Neural Networks for Object Detection on Edge AI Devices with Runtime Optimization

    Authors: Lukas Stäcker, Juncong Fei, Philipp Heidenreich, Frank Bonarens, Jason Rambach, Didier Stricker, Christoph Stiller

    Abstract: Deep neural networks have proven increasingly important for automotive scene understanding with new algorithms offering constant improvements of the detection performance. However, there is little emphasis on experiences and needs for deployment in embedded environments. We therefore perform a case study of the deployment of two representative object detection networks on an edge AI platform. In p… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: To present in ICCV 2021 (ERCVAD Workshop)

  15. arXiv:2108.04281  [pdf, other

    cs.CV cs.RO

    Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction

    Authors: Fangwen Shu, Yaxu Xie, Jason Rambach, Alain Pagani, Didier Stricker

    Abstract: This paper presents a semantic planar SLAM system that improves pose estimation and map** using cues from an instance planar segmentation network. While the mainstream approaches are using RGB-D sensors, employing a monocular camera with such a system still faces challenges such as robust data association and precise geometric model fitting. In the majority of existing work, geometric model esti… ▽ More

    Submitted 21 June, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: accepted to ISMAR 2021 (Poster), v2 fixed some typos and minor errors

  16. arXiv:2103.15428  [pdf, other

    cs.CV cs.RO

    PlaneSegNet: Fast and Robust Plane Estimation Using a Single-stage Instance Segmentation CNN

    Authors: Yaxu Xie, Jason Rambach, Fangwen Shu, Didier Stricker

    Abstract: Instance segmentation of planar regions in indoor scenes benefits visual SLAM and other applications such as augmented reality (AR) where scene understanding is required. Existing methods built upon two-stage frameworks show satisfactory accuracy but are limited by low frame rates. In this work, we propose a real-time deep neural architecture that estimates piece-wise planar regions from a single… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: accepted to ICRA 2021

  17. arXiv:2103.11719  [pdf, other

    cs.CV

    TICaM: A Time-of-flight In-car Cabin Monitoring Dataset

    Authors: Jigyasa Singh Katrolia, Bruno Mirbach, Ahmed El-Sherif, Hartmut Feld, Jason Rambach, Didier Stricker

    Abstract: We present TICaM, a Time-of-flight In-car Cabin Monitoring dataset for vehicle interior monitoring using a single wide-angle depth camera. Our dataset addresses the deficiencies of currently available in-car cabin datasets in terms of the ambit of labeled classes, recorded scenarios and provided annotations; all at the same time. We record an exhaustive list of actions performed while driving and… ▽ More

    Submitted 23 March, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

  18. arXiv:2008.12024  [pdf, other

    cs.HC cs.CV cs.CY cs.GT

    A survey on applications of augmented, mixed and virtual reality for nature and environment

    Authors: Jason Rambach, Gergana Lilligreen, Alexander Schäfer, Ramya Bankanal, Alexander Wiebel, Didier Stricker

    Abstract: Augmented reality (AR), virtual reality (VR) and mixed reality (MR) are technologies of great potential due to the engaging and enriching experiences they are capable of providing. Their use is rapidly increasing in diverse fields such as medicine, manufacturing or entertainment. However, the possibilities that AR, VR and MR offer in the area of environmental applications are not yet widely explor… ▽ More

    Submitted 28 August, 2020; v1 submitted 27 August, 2020; originally announced August 2020.