Skip to main content

Showing 151–200 of 226 results for author: Cremers, D

.
  1. arXiv:1902.01785  [pdf, other

    cs.LG stat.ML

    Homogeneous Linear Inequality Constraints for Neural Network Activations

    Authors: Thomas Frerix, Matthias Nießner, Daniel Cremers

    Abstract: We propose a method to impose homogeneous linear inequality constraints of the form $Ax\leq 0$ on neural network activations. The proposed method allows a data-driven training approach to be combined with modeling prior knowledge about the task. One way to achieve this task is by means of a projection step at test time after unconstrained training. However, this is an expensive operation. By direc… ▽ More

    Submitted 28 May, 2020; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: CVPR 2020 DeepVision Workshop

  2. arXiv:1902.00057  [pdf, other

    cs.LG stat.ML

    Probabilistic Discriminative Learning with Layered Graphical Models

    Authors: Yuesong Shen, Tao Wu, Csaba Domokos, Daniel Cremers

    Abstract: Probabilistic graphical models are traditionally known for their successes in generative modeling. In this work, we advocate layered graphical models (LGMs) for probabilistic discriminative learning. To this end, we design LGMs in close analogy to neural networks (NNs), that is, they have deep hierarchical structures and convolutional or local connections between layers. Equipped with tensorized t… ▽ More

    Submitted 31 January, 2019; originally announced February 2019.

  3. arXiv:1809.10097  [pdf, other

    cs.CV

    Photometric Depth Super-Resolution

    Authors: Bjoern Haefner, Songyou Peng, Alok Verma, Yvain Quéau, Daniel Cremers

    Abstract: This study explores the use of photometric techniques (shape-from-shading and uncalibrated photometric stereo) for upsampling the low-resolution depth map from an RGB-D sensor to the higher resolution of the companion RGB image. A single-shot variational approach is first put forward, which is effective as long as the target's reflectance is piecewise-constant. It is then shown that this dependenc… ▽ More

    Submitted 25 June, 2019; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), 2019. First three authors contribute equally

  4. arXiv:1808.03417  [pdf, other

    cs.CV

    DeepWrinkles: Accurate and Realistic Clothing Modeling

    Authors: Zorah Laehner, Daniel Cremers, Tony Tung

    Abstract: We present a novel method to generate accurate and realistic clothing deformation from real data capture. Previous methods for realistic cloth modeling mainly rely on intensive computation of physics-based simulation (with numerous heuristic parameters), while models reconstructed from visual observations typically suffer from lack of geometric details. Here, we propose an original framework consi… ▽ More

    Submitted 10 August, 2018; originally announced August 2018.

    Comments: 18 pages, 12 figures, 15th European Conference on Computer Vision (ECCV) 2018, Oral Presentation

  5. Omnidirectional DSO: Direct Sparse Odometry with Fisheye Cameras

    Authors: Hidenobu Matsuki, Lukas von Stumberg, Vladyslav Usenko, Jörg Stückler, Daniel Cremers

    Abstract: We propose a novel real-time direct monocular visual odometry for omnidirectional cameras. Our method extends direct sparse odometry (DSO) by using the unified omnidirectional model as a projection function, which can be applied to fisheye cameras with a field-of-view (FoV) well above 180 degrees. This formulation allows for using the full area of the input image even with strong distortion, while… ▽ More

    Submitted 8 August, 2018; originally announced August 2018.

    Comments: Accepted by IEEE Robotics and Automation Letters (RA-L), 2018 and IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018

  6. arXiv:1808.01834  [pdf, other

    cs.CV

    Detailed Dense Inference with Convolutional Neural Networks via Discrete Wavelet Transform

    Authors: Lingni Ma, Jörg Stückler, Tao Wu, Daniel Cremers

    Abstract: Dense pixelwise prediction such as semantic segmentation is an up-to-date challenge for deep convolutional neural networks (CNNs). Many state-of-the-art approaches either tackle the loss of high-resolution information due to pooling in the encoder stage, or use dilated convolutions or high-resolution lanes to maintain detailed feature maps and predictions. Motivated by the structural analogy betwe… ▽ More

    Submitted 6 August, 2018; originally announced August 2018.

    Comments: This work was first submitted to NIPS 2017, May 2017

  7. arXiv:1808.01111  [pdf, other

    cs.CV

    LDSO: Direct Sparse Odometry with Loop Closure

    Authors: Xiang Gao, Rui Wang, Nikolaus Demmel, Daniel Cremers

    Abstract: In this paper we present an extension of Direct Sparse Odometry (DSO) to a monocular visual SLAM system with loop closure detection and pose-graph optimization (LDSO). As a direct technique, DSO can utilize any image pixel with sufficient intensity gradient, which makes it robust even in featureless areas. LDSO retains this robustness, while at the same time ensuring repeatability of some of these… ▽ More

    Submitted 3 August, 2018; originally announced August 2018.

  8. Direct Sparse Odometry with Rolling Shutter

    Authors: David Schubert, Nikolaus Demmel, Vladyslav Usenko, Jörg Stückler, Daniel Cremers

    Abstract: Neglecting the effects of rolling-shutter cameras for visual odometry (VO) severely degrades accuracy and robustness. In this paper, we propose a novel direct monocular VO method that incorporates a rolling-shutter model. Our approach extends direct sparse odometry which performs direct bundle adjustment of a set of recent keyframe poses and the depths of a sparse set of image points. We estimate… ▽ More

    Submitted 1 August, 2018; originally announced August 2018.

  9. The Double Sphere Camera Model

    Authors: Vladyslav Usenko, Nikolaus Demmel, Daniel Cremers

    Abstract: Vision-based motion estimation and 3D reconstruction, which have numerous applications (e.g., autonomous driving, navigation systems for airborne devices and augmented reality) are receiving significant research attention. To increase the accuracy and robustness, several researchers have recently demonstrated the benefit of using large field-of-view cameras for such applications. In this paper, we… ▽ More

    Submitted 29 October, 2018; v1 submitted 24 July, 2018; originally announced July 2018.

  10. arXiv:1807.02570  [pdf, other

    cs.CV

    Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse Odometry

    Authors: Nan Yang, Rui Wang, Jörg Stückler, Daniel Cremers

    Abstract: Monocular visual odometry approaches that purely rely on geometric cues are prone to scale drift and require sufficient motion parallax in successive frames for motion estimation and 3D reconstruction. In this paper, we propose to leverage deep monocular depth prediction to overcome limitations of geometry-based monocular visual odometry. To this end, we incorporate deep depth predictions into Dir… ▽ More

    Submitted 25 July, 2018; v1 submitted 6 July, 2018; originally announced July 2018.

    Comments: To appear in ECCV 2018, Munich. 17 pages including references, 7 figures, 4 tables. Supplementary material: https://vision.in.tum.de/members/yangn

  11. A Region-based Gauss-Newton Approach to Real-Time Monocular Multiple Object Tracking

    Authors: Henning Tjaden, Ulrich Schwanecke, Elmar Schömer, Daniel Cremers

    Abstract: We propose an algorithm for real-time 6DOF pose tracking of rigid 3D objects using a monocular RGB camera. The key idea is to derive a region-based cost function using temporally consistent local color histograms. While such region-based cost functions are commonly optimized using first-order gradient descent techniques, we systematically derive a Gauss-Newton optimization scheme which gives rise… ▽ More

    Submitted 19 December, 2018; v1 submitted 5 July, 2018; originally announced July 2018.

  12. arXiv:1807.01001  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Modular Vehicle Control for Transferring Semantic Information Between Weather Conditions Using GANs

    Authors: Patrick Wenzel, Qadeer Khan, Daniel Cremers, Laura Leal-Taixé

    Abstract: Even though end-to-end supervised learning has shown promising results for sensorimotor control of self-driving cars, its performance is greatly affected by the weather conditions under which it was trained, showing poor generalization to unseen conditions. In this paper, we show how knowledge can be transferred using semantic maps to new weather conditions without the need to obtain new ground tr… ▽ More

    Submitted 1 October, 2018; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 2nd Conference on Robot Learning (CoRL 2018), Zürich, Switzerland

  13. arXiv:1806.10417  [pdf, other

    cs.CV cs.GR

    Divergence-Free Shape Interpolation and Correspondence

    Authors: Marvin Eisenberger, Zorah Lähner, Daniel Cremers

    Abstract: We present a novel method to model and calculate deformation fields between shapes embedded in $\mathbb{R}^D$. Our framework combines naturally interpolating the two input shapes and calculating correspondences at the same time. The key idea is to compute a divergence-free deformation field represented in a coarse-to-fine basis using the Karhunen-Loève expansion. The advantages are that there is n… ▽ More

    Submitted 16 October, 2018; v1 submitted 27 June, 2018; originally announced June 2018.

  14. arXiv:1806.02997  [pdf, other

    stat.ML cs.AI cs.CV cs.LG cs.NE

    q-Space Novelty Detection with Variational Autoencoders

    Authors: Aleksei Vasilev, Vladimir Golkov, Marc Meissner, Ilona Lipp, Eleonora Sgarlata, Valentina Tomassini, Derek K. Jones, Daniel Cremers

    Abstract: In machine learning, novelty detection is the task of identifying novel unseen data. During training, only samples from the normal class are available. Test samples are classified as normal or abnormal by assignment of a novelty score. Here we propose novelty detection methods based on training variational autoencoders (VAEs) on normal data. Since abnormal samples are not used during training, we… ▽ More

    Submitted 25 October, 2018; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: 11 pages, 2 figures

    MSC Class: 62F15; 62G07; 62M45; 68T30 ACM Class: G.3; H.3.3; I.2.4; I.2.6; I.4.6; I.5; I.5.4; J.3

  15. arXiv:1805.00613  [pdf, other

    cs.CV

    Deep Perm-Set Net: Learn to predict sets with unknown permutation and cardinality using deep neural networks

    Authors: S. Hamid Rezatofighi, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Daniel Cremers, Laura Leal-Taixé, Ian Reid

    Abstract: Many real-world problems, e.g. object detection, have outputs that are naturally expressed as sets of entities. This creates a challenge for traditional deep neural networks which naturally deal with structured outputs such as vectors, matrices or tensors. We present a novel approach for learning to predict sets with unknown permutation and cardinality using deep neural networks. Specifically, in… ▽ More

    Submitted 2 October, 2018; v1 submitted 1 May, 2018; originally announced May 2018.

  16. The TUM VI Benchmark for Evaluating Visual-Inertial Odometry

    Authors: David Schubert, Thore Goll, Nikolaus Demmel, Vladyslav Usenko, Jörg Stückler, Daniel Cremers

    Abstract: Visual odometry and SLAM methods have a large variety of applications in domains such as augmented reality or robotics. Complementing vision sensors with inertial measurements tremendously improves tracking accuracy and robustness, and thus has spawned large interest in the development of visual-inertial (VI) odometry approaches. In this paper, we propose the TUM VI benchmark, a novel dataset with… ▽ More

    Submitted 9 March, 2020; v1 submitted 17 April, 2018; originally announced April 2018.

    Comments: Updates compared to previous version include additional evaluations and DOI

  17. Direct Sparse Visual-Inertial Odometry using Dynamic Marginalization

    Authors: Lukas von Stumberg, Vladyslav Usenko, Daniel Cremers

    Abstract: We present VI-DSO, a novel approach for visual-inertial odometry, which jointly estimates camera poses and sparse scene geometry by minimizing photometric and IMU measurement errors in a combined energy functional. The visual part of the system performs a bundle-adjustment like optimization on a sparse set of points, but unlike key-point based systems it directly minimizes a photometric error. Thi… ▽ More

    Submitted 16 April, 2018; originally announced April 2018.

    MSC Class: 68T45

  18. arXiv:1803.02487   

    math.OC

    A Nonlinear Bregman Primal-Dual Framework for Optimizing Nonconvex Infimal Convolutions

    Authors: Emanuel Laude, Daniel Cremers

    Abstract: This work is concerned with the optimization of nonconvex, nonsmooth composite optimization problems, whose objective is a composition of a nonlinear map** and a nonsmooth nonconvex function, that can be written as an infimal convolution (inf-conv). To tackle this problem class we propose to reformulate the problem exploiting its inf-conv structure and derive a block coordinate descent scheme on… ▽ More

    Submitted 27 March, 2018; v1 submitted 6 March, 2018; originally announced March 2018.

    Comments: We withdraw the pre-print as it contains some technical flaws that we hope to resolve in the future

  19. arXiv:1801.07648  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Clustering with Deep Learning: Taxonomy and New Methods

    Authors: Elie Aljalbout, Vladimir Golkov, Yawar Siddiqui, Maximilian Strobel, Daniel Cremers

    Abstract: Clustering methods based on deep neural networks have proven promising for clustering real-world data because of their high representational power. In this paper, we propose a systematic taxonomy of clustering methods that utilize deep neural networks. We base our taxonomy on a comprehensive review of recent work and validate the taxonomy in a case study. In this case study, we show that the taxon… ▽ More

    Submitted 13 September, 2018; v1 submitted 23 January, 2018; originally announced January 2018.

    MSC Class: 62H30; 62M45; 91C20 ACM Class: H.3.3; I.2.6; I.5; I.5.3; I.5.4

  20. What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?

    Authors: Nikolaus Mayer, Eddy Ilg, Philipp Fischer, Caner Hazirbas, Daniel Cremers, Alexey Dosovitskiy, Thomas Brox

    Abstract: The finding that very large networks can be trained efficiently and reliably has led to a paradigm shift in computer vision from engineered solutions to learning formulations. As a result, the research challenge shifts from devising algorithms to creating suitable and abundant training data for supervised learning. How to efficiently create such training data? The dominant data acquisition method… ▽ More

    Submitted 22 March, 2018; v1 submitted 19 January, 2018; originally announced January 2018.

    Comments: added references (UCL dataset); added IJCV copyright information

  21. arXiv:1801.05413  [pdf, other

    math.OC cs.LG stat.ML

    Combinatorial Preconditioners for Proximal Algorithms on Graphs

    Authors: Thomas Möllenhoff, Zhenzhang Ye, Tao Wu, Daniel Cremers

    Abstract: We present a novel preconditioning technique for proximal optimization methods that relies on graph algorithms to construct effective preconditioners. Such combinatorial preconditioners arise from partitioning the graph into forests. We prove that certain decompositions lead to a theoretically optimal condition number. We also show how ideal decompositions can be realized using matroid partitionin… ▽ More

    Submitted 21 February, 2018; v1 submitted 16 January, 2018; originally announced January 2018.

    Comments: Published as a conference paper at AISTATS 2018

  22. arXiv:1711.10824  [pdf, other

    cs.CV cs.GR

    Compression for Smooth Shape Analysis

    Authors: V. Estellers, F. R. Schmidt, D. Cremers

    Abstract: Most 3D shape analysis methods use triangular meshes to discretize both the shape and functions on it as piecewise linear functions. With this representation, shape analysis requires fine meshes to represent smooth shapes and geometric operators like normals, curvatures, or Laplace-Beltrami eigenfunctions at large computational and memory costs. We avoid this bottleneck with a compression techni… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

  23. arXiv:1710.10686  [pdf, ps, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Regularization for Deep Learning: A Taxonomy

    Authors: Jan Kukačka, Vladimir Golkov, Daniel Cremers

    Abstract: Regularization is one of the crucial ingredients of deep learning, yet the term regularization has various definitions, and regularization methods are often studied separately from each other. In our work we present a systematic, unifying taxonomy to categorize existing methods. We distinguish methods that affect data, network architectures, error terms, regularization terms, and optimization proc… ▽ More

    Submitted 29 October, 2017; originally announced October 2017.

    MSC Class: 62M45 ACM Class: I.2.6; I.5

  24. arXiv:1710.06623  [pdf, other

    math.OC

    A Nonconvex Proximal Splitting Algorithm under Moreau-Yosida Regularization

    Authors: Emanuel Laude, Tao Wu, Daniel Cremers

    Abstract: We tackle highly nonconvex, nonsmooth composite optimization problems whose objectives comprise a Moreau-Yosida regularized term. Classical nonconvex proximal splitting algorithms, such as nonconvex ADMM, suffer from lack of convergence for such a problem class. To overcome this difficulty, in this work we consider a lifted variant of the Moreau-Yosida regularized model and propose a novel multibl… ▽ More

    Submitted 26 February, 2018; v1 submitted 18 October, 2017; originally announced October 2017.

    Comments: Accepted to International Conference on Artificial Intelligence and Statistics (AISTATS) 2018

  25. arXiv:1710.02081  [pdf, other

    cs.CV

    Online Photometric Calibration for Auto Exposure Video for Realtime Visual Odometry and SLAM

    Authors: Paul Bergmann, Rui Wang, Daniel Cremers

    Abstract: Recent direct visual odometry and SLAM algorithms have demonstrated impressive levels of precision. However, they require a photometric camera calibration in order to achieve competitive results. Hence, the respective algorithm cannot be directly applied to an off-the-shelf-camera or to a video sequence acquired with an unknown camera. In this work we propose a method for online photometric calibr… ▽ More

    Submitted 5 October, 2017; originally announced October 2017.

    Comments: 7 pages

  26. arXiv:1709.10354  [pdf, other

    cs.CV

    A Variational Approach to Shape-from-shading Under Natural Illumination

    Authors: Yvain Quéau, Jean Mélou, Fabien Castan, Daniel Cremers, Jean-Denis Durou

    Abstract: A numerical solution to shape-from-shading under natural illumination is presented. It builds upon an augmented Lagrangian approach for solving a generic PDE-based shape-from-shading model which handles directional or spherical harmonic lighting, orthographic or perspective projection, and greylevel or multi-channel images. Real-world applications to shading-aware depth map denoising, refinement a… ▽ More

    Submitted 3 December, 2017; v1 submitted 29 September, 2017; originally announced September 2017.

    Comments: Presented at EMMCVPR 2017 conference

  27. arXiv:1709.08378  [pdf, other

    cs.CV

    Variational Reflectance Estimation from Multi-view Images

    Authors: Jean Mélou, Yvain Quéau, Jean-Denis Durou, Fabien Castan, Daniel Cremers

    Abstract: We tackle the problem of reflectance estimation from a set of multi-view images, assuming known geometry. The approach we put forward turns the input images into reflectance maps, through a robust variational method. The variational model comprises an image-driven fidelity term and a term which enforces consistency of the reflectance estimates with respect to each view. If illumination is fixed ac… ▽ More

    Submitted 23 January, 2018; v1 submitted 25 September, 2017; originally announced September 2017.

  28. arXiv:1709.06031  [pdf, other

    cs.CV

    Video Object Segmentation Without Temporal Information

    Authors: Kevis-Kokitsi Maninis, Sergi Caelles, Yuhua Chen, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc Van Gool

    Abstract: Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames. When the temporal smoothness is suddenly broken, such as when an object is occluded, or some frames are missing in a sequence, the result of these methods can deteriorate significantly or they may not even produce a… ▽ More

    Submitted 16 May, 2018; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: Accepted to T-PAMI. Extended version of "One-Shot Video Object Segmentation", CVPR 2017 (arXiv:1611.05198). Project page: http://www.vision.ee.ethz.ch/~cvlsegmentation/osvos/

  29. arXiv:1709.03763  [pdf, other

    cs.CV

    Efficient Online Surface Correction for Real-time Large-Scale 3D Reconstruction

    Authors: Robert Maier, Raphael Schaller, Daniel Cremers

    Abstract: State-of-the-art methods for large-scale 3D reconstruction from RGB-D sensors usually reduce drift in camera tracking by globally optimizing the estimated camera poses in real-time without simultaneously updating the reconstructed surface on pose changes. We propose an efficient on-the-fly surface correction method for globally consistent dense 3D reconstruction of large-scale scenes. Our approach… ▽ More

    Submitted 12 September, 2017; originally announced September 2017.

    Comments: British Machine Vision Conference (BMVC), London, September 2017

  30. arXiv:1708.07878  [pdf, other

    cs.CV

    Stereo DSO: Large-Scale Direct Sparse Visual Odometry with Stereo Cameras

    Authors: Rui Wang, Martin Schwörer, Daniel Cremers

    Abstract: We propose Stereo Direct Sparse Odometry (Stereo DSO) as a novel method for highly accurate real-time visual odometry estimation of large-scale environments from stereo cameras. It jointly optimizes for all the model parameters within the active window, including the intrinsic/extrinsic camera parameters of all keyframes and the depth values of all selected pixels. In particular, we propose a nove… ▽ More

    Submitted 25 August, 2017; originally announced August 2017.

    Comments: ICCV 2017

  31. arXiv:1708.01670  [pdf, other

    cs.CV

    Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting

    Authors: Robert Maier, Kihwan Kim, Daniel Cremers, Jan Kautz, Matthias Nießner

    Abstract: We introduce a novel method to obtain high-quality 3D reconstructions from consumer RGB-D sensors. Our core idea is to simultaneously optimize for geometry encoded in a signed distance field (SDF), textures from automatically-selected keyframes, and their camera poses along with material and scene lighting. To this end, we propose a joint surface reconstruction approach that is based on Shape-from… ▽ More

    Submitted 4 August, 2017; originally announced August 2017.

  32. arXiv:1708.00938  [pdf, other

    cs.CV

    Associative Domain Adaptation

    Authors: Philip Haeusser, Thomas Frerix, Alexander Mordvintsev, Daniel Cremers

    Abstract: We propose associative domain adaptation, a novel technique for end-to-end domain adaptation with neural networks, the task of inferring class labels for an unlabeled target domain based on the statistical properties of a labeled source domain. Our training scheme follows the paradigm that in order to effectively derive class labels for the target domain, a network should produce statistically dom… ▽ More

    Submitted 2 August, 2017; originally announced August 2017.

    Comments: In IEEE International Conference on Computer Vision (ICCV), 2017

  33. arXiv:1708.00411  [pdf, other

    cs.CV

    Depth Super-Resolution Meets Uncalibrated Photometric Stereo

    Authors: Songyou Peng, Bjoern Haefner, Yvain Quéau, Daniel Cremers

    Abstract: A novel depth super-resolution approach for RGB-D sensors is presented. It disambiguates depth super-resolution through high-resolution photometric clues and, symmetrically, it disambiguates uncalibrated photometric stereo through low-resolution depth cues. To this end, an RGB-D sequence is acquired from the same viewing angle, while illuminating the scene from various uncalibrated directions. Thi… ▽ More

    Submitted 24 August, 2017; v1 submitted 1 August, 2017; originally announced August 2017.

    Comments: International Conference on Computer Vision (ICCV) Workshop, 2017

  34. arXiv:1707.08991  [pdf, other

    cs.CV

    Efficient Deformable Shape Correspondence via Kernel Matching

    Authors: Zorah Lähner, Matthias Vestner, Amit Boyarski, Or Litany, Ron Slossberg, Tal Remez, Emanuele Rodolà, Alex Bronstein, Michael Bronstein, Ron Kimmel, Daniel Cremers

    Abstract: We present a method to match three dimensional shapes under non-isometric deformations, topology changes and partiality. We formulate the problem as matching between a set of pair-wise and point-wise descriptors, imposing a continuity prior on the map**, and propose a projected descent optimization procedure inspired by difference of convex functions (DC) programming. Surprisingly, in spite of t… ▽ More

    Submitted 15 September, 2017; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: Accepted for oral presentation at 3DV 2017, including supplementary material

  35. arXiv:1707.01018  [pdf, other

    cs.CV

    LED-based Photometric Stereo: Modeling, Calibration and Numerical Solution

    Authors: Yvain Quéau, Bastien Durix, Tao Wu, Daniel Cremers, François Lauze, Jean-Denis Durou

    Abstract: We conduct a thorough study of photometric stereo under nearby point light source illumination, from modeling to numerical solution, through calibration. In the classical formulation of photometric stereo, the luminous fluxes are assumed to be directional, which is very difficult to achieve in practice. Rather, we use light-emitting diodes (LEDs) to illuminate the scene to reconstruct. Such point… ▽ More

    Submitted 4 September, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

  36. arXiv:1706.04638  [pdf, ps, other

    cs.LG

    Proximal Backpropagation

    Authors: Thomas Frerix, Thomas Möllenhoff, Michael Moeller, Daniel Cremers

    Abstract: We propose proximal backpropagation (ProxProp) as a novel algorithm that takes implicit instead of explicit gradient steps to update the network parameters during neural network training. Our algorithm is motivated by the step size limitation of explicit gradient descent, which poses an impediment for optimization. ProxProp is developed from a general point of view on the backpropagation algorithm… ▽ More

    Submitted 20 February, 2018; v1 submitted 14 June, 2017; originally announced June 2017.

    Comments: Published as a conference paper at ICLR 2018

  37. arXiv:1706.00909  [pdf, other

    cs.CV cs.LG

    Learning by Association - A versatile semi-supervised training method for neural networks

    Authors: Philip Häusser, Alexander Mordvintsev, Daniel Cremers

    Abstract: In many real-world scenarios, labeled data for a specific machine learning task is costly to obtain. Semi-supervised training methods make use of abundantly available unlabeled data and a smaller number of labeled examples. We propose a new framework for semi-supervised training of deep neural networks inspired by learning in humans. "Associations" are made from embeddings of labeled samples to th… ▽ More

    Submitted 3 June, 2017; originally announced June 2017.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017

  38. arXiv:1705.08314  [pdf, other

    cs.CV

    Fusion of Head and Full-Body Detectors for Multi-Object Tracking

    Authors: Roberto Henschel, Laura Leal-Taixé, Daniel Cremers, Bodo Rosenhahn

    Abstract: In order to track all persons in a scene, the tracking-by-detection paradigm has proven to be a very effective approach. Yet, relying solely on a single detector is also a major limitation, as useful image information might be ignored. Consequently, this work demonstrates how to fuse two detectors into a tracking system. To obtain the trajectories, we propose to formulate tracking as a weighted gr… ▽ More

    Submitted 24 April, 2018; v1 submitted 23 May, 2017; originally announced May 2017.

    Comments: 10 pages, 4 figures; Winner of the MOT17 challenge; CVPRW 2018

  39. arXiv:1705.05020  [pdf, other

    cs.LG

    Discrete-Continuous ADMM for Transductive Inference in Higher-Order MRFs

    Authors: Emanuel Laude, Jan-Hendrik Lange, Jonas Schüpfer, Csaba Domokos, Laura Leal-Taixé, Frank R. Schmidt, Bjoern Andres, Daniel Cremers

    Abstract: This paper introduces a novel algorithm for transductive inference in higher-order MRFs, where the unary energies are parameterized by a variable classifier. The considered task is posed as a joint optimization problem in the continuous classifier parameters and the discrete label variables. In contrast to prior approaches such as convex relaxations, we propose an advantageous decoupling of the ob… ▽ More

    Submitted 28 April, 2018; v1 submitted 14 May, 2017; originally announced May 2017.

  40. arXiv:1705.04300  [pdf, other

    cs.CV

    Challenges in Monocular Visual Odometry: Photometric Calibration, Motion Bias and Rolling Shutter Effect

    Authors: Nan Yang, Rui Wang, Xiang Gao, Daniel Cremers

    Abstract: Monocular visual odometry (VO) and simultaneous localization and map** (SLAM) have seen tremendous improvements in accuracy, robustness and efficiency, and have gained increasing popularity over recent years. Nevertheless, not so many discussions have been carried out to reveal the influences of three very influential yet easily overlooked aspects: photometric calibration, motion bias and rollin… ▽ More

    Submitted 7 June, 2018; v1 submitted 11 May, 2017; originally announced May 2017.

    Comments: Accepted by IEEE Robotics and Automation Letters (RA-L), 2018. The first two authors contributed equally to this paper

  41. arXiv:1704.04039  [pdf, other

    q-bio.BM cs.LG q-bio.QM stat.ML

    3D Deep Learning for Biological Function Prediction from Physical Fields

    Authors: Vladimir Golkov, Marcin J. Skwark, Atanas Mirchev, Georgi Dikov, Alexander R. Geanes, Jeffrey Mendenhall, Jens Meiler, Daniel Cremers

    Abstract: Predicting the biological function of molecules, be it proteins or drug-like compounds, from their atomic structure is an important and long-standing problem. Function is dictated by structure, since it is by spatial interactions that molecules interact with each other, both in terms of steric complementarity, as well as intermolecular forces. Thus, the electron density field and electrostatic pot… ▽ More

    Submitted 13 April, 2017; originally announced April 2017.

    ACM Class: I.2.6; J.3

  42. Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems

    Authors: Tim Meinhardt, Michael Moeller, Caner Hazirbas, Daniel Cremers

    Abstract: While variational methods have been among the most powerful tools for solving linear inverse problems in imaging, deep (convolutional) neural networks have recently taken the lead in many challenging benchmarks. A remaining drawback of deep learning approaches is their requirement for an expensive retraining whenever the specific problem, the noise level, noise type, or desired measure of fidelity… ▽ More

    Submitted 30 August, 2017; v1 submitted 11 April, 2017; originally announced April 2017.

  43. arXiv:1704.02781  [pdf, other

    cs.CV

    Tracking the Trackers: An Analysis of the State of the Art in Multiple Object Tracking

    Authors: Laura Leal-Taixé, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, Stefan Roth

    Abstract: Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore important guides for research. We present a benchmark for Multiple Object Tracking launched in the late 2014, with the goal of creating a framework for the stan… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

  44. arXiv:1704.01085  [pdf, other

    cs.CV

    Deep Depth From Focus

    Authors: Caner Hazirbas, Sebastian Georg Soyer, Maximilian Christian Staab, Laura Leal-Taixé, Daniel Cremers

    Abstract: Depth from focus (DFF) is one of the classical ill-posed inverse problems in computer vision. Most approaches recover the depth at each pixel based on the focal setting which exhibits maximal sharpness. Yet, it is not obvious how to reliably estimate the sharpness level, particularly in low-textured areas. In this paper, we propose `Deep Depth From Focus (DDFF)' as the first end-to-end learning ap… ▽ More

    Submitted 28 October, 2018; v1 submitted 4 April, 2017; originally announced April 2017.

    Comments: accepted to Asian Conference on Computer Vision (ACCV) 2018

  45. arXiv:1704.00337  [pdf, other

    cs.CV

    Dense Multi-view 3D-reconstruction Without Dense Correspondences

    Authors: Yvain Quéau, Jean Mélou, Jean-Denis Durou, Daniel Cremers

    Abstract: We introduce a variational method for multi-view shape-from-shading under natural illumination. The key idea is to couple PDE-based solutions for single-image based shape-from-shading problems across multiple images and multiple color channels by means of a variational formulation. Rather than alternatingly solving the individual SFS problems and optimizing the consistency across images and channe… ▽ More

    Submitted 2 April, 2017; originally announced April 2017.

  46. arXiv:1703.08866  [pdf, other

    cs.CV

    Multi-View Deep Learning for Consistent Semantic Map** with RGB-D Cameras

    Authors: Lingni Ma, Jörg Stückler, Christian Kerl, Daniel Cremers

    Abstract: Visual scene understanding is an important capability that enables robots to purposefully act in their environment. In this paper, we propose a novel approach to object-class segmentation from multiple RGB-D views using deep learning. We train a deep neural network to predict object-class semantics that is consistent from several view points in a semi-supervised way. At test time, the semantics pr… ▽ More

    Submitted 4 December, 2017; v1 submitted 26 March, 2017; originally announced March 2017.

    Comments: the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2017)

  47. arXiv:1703.08001  [pdf, other

    cs.CV math.NA

    Nonlinear Spectral Image Fusion

    Authors: Martin Benning, Michael Möller, Raz Z. Nossek, Martin Burger, Daniel Cremers, Guy Gilboa, Carola-Bibiane Schönlieb

    Abstract: In this paper we demonstrate that the framework of nonlinear spectral decompositions based on total variation (TV) regularization is very well suited for image fusion as well as more general image manipulation tasks. The well-localized and edge-preserving spectral TV decomposition allows to select frequencies of a certain image to transfer particular features, such as wrinkles in a face, from one… ▽ More

    Submitted 23 March, 2017; originally announced March 2017.

    Comments: 13 pages, 9 figures, submitted to SSVM conference proceedings 2017

    MSC Class: 35P30; 62H35; 65M70; 94A08 ACM Class: G.1.3; G.1.6; G.1.8; I.4.0; I.4.5

  48. Real-Time Trajectory Replanning for MAVs using Uniform B-splines and a 3D Circular Buffer

    Authors: Vladyslav Usenko, Lukas von Stumberg, Andrej Pangercic, Daniel Cremers

    Abstract: In this paper, we present a real-time approach to local trajectory replanning for microaerial vehicles (MAVs). Current trajectory generation methods for multicopters achieve high success rates in cluttered environments, but assume that the environment is static and require prior knowledge of the map. In the presented study, we use the results of such planners and extend them with a local replannin… ▽ More

    Submitted 23 July, 2017; v1 submitted 4 March, 2017; originally announced March 2017.

  49. arXiv:1701.00669  [pdf, other

    cs.CV

    Product Manifold Filter: Non-Rigid Shape Correspondence via Kernel Density Estimation in the Product Space

    Authors: Matthias Vestner, Roee Litman, Emanuele Rodolà, Alex Bronstein, Daniel Cremers

    Abstract: Many algorithms for the computation of correspondences between deformable shapes rely on some variant of nearest neighbor matching in a descriptor space. Such are, for example, various point-wise correspondence recovery algorithms used as a post-processing stage in the functional correspondence framework. Such frequently used techniques implicitly make restrictive assumptions (e.g., near-isometry)… ▽ More

    Submitted 7 April, 2017; v1 submitted 3 January, 2017; originally announced January 2017.

    Comments: To appear at CVPR 2017

  50. arXiv:1612.03653  [pdf, other

    cs.AI cs.RO

    Learning to Drive using Inverse Reinforcement Learning and Deep Q-Networks

    Authors: Sahand Sharifzadeh, Ioannis Chiotellis, Rudolph Triebel, Daniel Cremers

    Abstract: We propose an inverse reinforcement learning (IRL) approach using Deep Q-Networks to extract the rewards in problems with large state spaces. We evaluate the performance of this approach in a simulation-based autonomous driving scenario. Our results resemble the intuitive relation between the reward function and readings of distance sensors mounted at different poses on the car. We also show that,… ▽ More

    Submitted 21 September, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    Comments: NIPS workshop on Deep Learning for Action and Interaction, 2016