Skip to main content

Showing 1–14 of 14 results for author: Kehl, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.12682  [pdf, other

    cs.CV cs.RO

    Photo-realistic Neural Domain Randomization

    Authors: Sergey Zakharov, Rares Ambrus, Vitor Guizilini, Wadim Kehl, Adrien Gaidon

    Abstract: Synthetic data is a scalable alternative to manual supervision, but it requires overcoming the sim-to-real domain gap. This discrepancy between virtual and real worlds is addressed by two seemingly opposed approaches: improving the realism of simulation or foregoing realism entirely via domain randomization. In this paper, we show that the recent progress in neural rendering enables a new unified… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted to European Conference on Computer Vision (ECCV), 2022

  2. arXiv:2009.14524  [pdf, other

    cs.CV cs.LG

    Monocular Differentiable Rendering for Self-Supervised 3D Object Detection

    Authors: Deniz Beker, Hiroharu Kato, Mihai Adrian Morariu, Takahiro Ando, Toru Matsuoka, Wadim Kehl, Adrien Gaidon

    Abstract: 3D object detection from monocular images is an ill-posed problem due to the projective entanglement of depth and scale. To overcome this ambiguity, we present a novel self-supervised method for textured 3D shape reconstruction and pose estimation of rigid objects with the help of strong shape priors and 2D instance masks. Our method predicts the 3D location and meshes of each object in an image u… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

    Comments: 20 pages, Supplementary material included, Published in ECCV 2020

  3. arXiv:2006.12057  [pdf, other

    cs.CV cs.GR

    Differentiable Rendering: A Survey

    Authors: Hiroharu Kato, Deniz Beker, Mihai Morariu, Takahiro Ando, Toru Matsuoka, Wadim Kehl, Adrien Gaidon

    Abstract: Deep neural networks (DNNs) have shown remarkable performance improvements on vision-related tasks such as object detection or image segmentation. Despite their success, they generally lack the understanding of 3D objects which form the image, as it is not always possible to collect 3D information about the scene or to easily annotate it. Differentiable rendering is a novel field which allows the… ▽ More

    Submitted 30 July, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

  4. arXiv:1911.11288  [pdf, other

    cs.CV

    Autolabeling 3D Objects with Differentiable Rendering of SDF Shape Priors

    Authors: Sergey Zakharov, Wadim Kehl, Arjun Bhargava, Adrien Gaidon

    Abstract: We present an automatic annotation pipeline to recover 9D cuboids and 3D shapes from pre-trained off-the-shelf 2D detectors and sparse LIDAR data. Our autolabeling method solves an ill-posed inverse problem by considering learned shape priors and optimizing geometric and physical parameters. To address this challenging problem, we apply a novel differentiable shape renderer to signed distance fiel… ▽ More

    Submitted 2 April, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: CVPR 2020 (Oral). 8 pages + supplementary material. The first two authors contributed equally to this work

  5. arXiv:1911.10249  [pdf, other

    cs.CV

    Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core

    Authors: Wadim Kehl, Federico Tombari, Slobodan Ilic, Nassir Navab

    Abstract: We present a novel method to track 3D models in color and depth data. To this end, we introduce approximations that accelerate the state-of-the-art in region-based tracking by an order of magnitude while retaining similar accuracy. Furthermore, we show how the method can be made more robust in the presence of depth data and consequently formulate a new joint contour and ICP tracking energy. We pre… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: CVPR 2017

  6. 3D Object Instance Recognition and Pose Estimation Using Triplet Loss with Dynamic Margin

    Authors: Sergey Zakharov, Wadim Kehl, Benjamin Planche, Andreas Hutter, Slobodan Ilic

    Abstract: In this paper, we address the problem of 3D object instance recognition and pose estimation of localized objects in cluttered environments using convolutional neural networks. Inspired by the descriptor learning approach of Wohlhart et al., we propose a method that introduces the dynamic margin in the manifold learning triplet loss function. Such a loss function is designed to map images of differ… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 552-559. IEEE, 2017

  7. arXiv:1904.02750  [pdf, other

    cs.CV

    DeceptionNet: Network-Driven Domain Randomization

    Authors: Sergey Zakharov, Wadim Kehl, Slobodan Ilic

    Abstract: We present a novel approach to tackle domain adaptation between synthetic and real data. Instead, of employing "blind" domain randomization, i.e., augmenting synthetic renderings with random backgrounds or changing illumination and colorization, we leverage the task network as its own adversarial guide toward useful augmentations that maximize the uncertainty of the output. To this end, we design… ▽ More

    Submitted 20 August, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

    Comments: ICCV 2019

  8. arXiv:1812.02781  [pdf, other

    cs.CV

    ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape

    Authors: Fabian Manhardt, Wadim Kehl, Adrien Gaidon

    Abstract: We present a deep learning method for end-to-end monocular 3D object detection and metric shape retrieval. We propose a novel loss formulation by lifting 2D detection, orientation, and scale estimation into 3D space. Instead of optimizing these quantities separately, the 3D instantiation allows to properly measure the metric misalignment of boxes. We experimentally show that our 10D lifting of spa… ▽ More

    Submitted 10 April, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: CVPR 2019

  9. arXiv:1810.03065  [pdf, other

    cs.CV

    Deep Model-Based 6D Pose Refinement in RGB

    Authors: Fabian Manhardt, Wadim Kehl, Nassir Navab, Federico Tombari

    Abstract: We present a novel approach for model-based 6D pose refinement in color data. Building on the established idea of contour-based pose tracking, we teach a deep neural network to predict a translational and rotational update. At the core, we propose a new visual loss that drives the pose update by aligning object contours, thus avoiding the definition of any explicit appearance model. In contrast to… ▽ More

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: The first two authors contributed equally to this work

  10. arXiv:1808.08319  [pdf, other

    cs.CV cs.AI cs.RO

    BOP: Benchmark for 6D Object Pose Estimation

    Authors: Tomas Hodan, Frank Michel, Eric Brachmann, Wadim Kehl, Anders Glent Buch, Dirk Kraft, Bertram Drost, Joel Vidal, Stephan Ihrke, Xenophon Zabulis, Caner Sahin, Fabian Manhardt, Federico Tombari, Tae-Kyun Kim, Jiri Matas, Carsten Rother

    Abstract: We propose a benchmark for 6D pose estimation of a rigid object from a single RGB-D input image. The training data consists of a texture-mapped 3D object model or images of the object in known 6D poses. The benchmark comprises of: i) eight datasets in a unified format that cover different practical scenarios, including two new datasets focusing on varying lighting conditions, ii) an evaluation met… ▽ More

    Submitted 24 August, 2018; originally announced August 2018.

    Comments: ECCV 2018

  11. arXiv:1711.10006  [pdf, other

    cs.CV

    SSD-6D: Making RGB-based 3D detection and 6D pose estimation great again

    Authors: Wadim Kehl, Fabian Manhardt, Federico Tombari, Slobodan Ilic, Nassir Navab

    Abstract: We present a novel method for detecting 3D model instances and estimating their 6D poses from RGB data in a single shot. To this end, we extend the popular SSD paradigm to cover the full 6D pose space and train on synthetic model data only. Our approach competes or surpasses current state-of-the-art methods that leverage RGB-D data on multiple challenging datasets. Furthermore, our method produces… ▽ More

    Submitted 27 November, 2017; originally announced November 2017.

    Comments: The first two authors contributed equally to this work

  12. arXiv:1608.07411  [pdf, other

    cs.CV

    An Octree-Based Approach towards Efficient Variational Range Data Fusion

    Authors: Wadim Kehl, Tobias Holl, Federico Tombari, Slobodan Ilic, Nassir Navab

    Abstract: Volume-based reconstruction is usually expensive both in terms of memory consumption and runtime. Especially for sparse geometric structures, volumetric representations produce a huge computational overhead. We present an efficient way to fuse range data via a variational Octree-based minimization approach by taking the actual range data geometry into account. We transform the data into Octree-bas… ▽ More

    Submitted 26 August, 2016; originally announced August 2016.

    Comments: BMVC 2016

  13. arXiv:1607.06062  [pdf, other

    cs.CV

    Hashmod: A Hashing Method for Scalable 3D Object Detection

    Authors: Wadim Kehl, Federico Tombari, Nassir Navab, Slobodan Ilic, Vincent Lepetit

    Abstract: We present a scalable method for detecting objects and estimating their 3D poses in RGB-D data. To this end, we rely on an efficient representation of object views and employ hashing techniques to match these views against the input frame in a scalable way. While a similar approach already exists for 2D detection, we show how to extend it to estimate the 3D pose of the detected objects. In particu… ▽ More

    Submitted 20 July, 2016; originally announced July 2016.

    Comments: BMVC 2015

  14. arXiv:1607.06038  [pdf, other

    cs.CV

    Deep Learning of Local RGB-D Patches for 3D Object Detection and 6D Pose Estimation

    Authors: Wadim Kehl, Fausto Milletari, Federico Tombari, Slobodan Ilic, Nassir Navab

    Abstract: We present a 3D object detection method that uses regressed descriptors of locally-sampled RGB-D patches for 6D vote casting. For regression, we employ a convolutional auto-encoder that has been trained on a large collection of random local patches. During testing, scene patch descriptors are matched against a database of synthetic model view patches and cast 6D object votes which are subsequently… ▽ More

    Submitted 20 July, 2016; originally announced July 2016.

    Comments: To appear at ECCV 2016