Showing 51–73 of 73 results for author: Rother, C

  1. Analyzing Modular CNN Architectures for Joint Depth Prediction and Semantic Segmentation

    Authors: Omid Hosseini Jafari, Oliver Groth, Alexander Kirillov, Michael Ying Yang, Carsten Rother

    Abstract: This paper addresses the task of designing a modular neural network architecture that jointly solves different tasks. As an example we use the tasks of depth estimation and semantic segmentation given a single RGB image. The main focus of this work is to analyze the cross-modality influence between depth and semantic prediction maps on their joint refinement. While most previous works solely focus…

    Submitted 26 February, 2017; originally announced February 2017.

    Comments: Accepted to ICRA 2017

  2. arXiv:1702.06461  [pdf, other]

    cs.CV

    Crowd Sourcing Image Segmentation with iaSTAPLE

    Authors: Dmitrij Schlesinger, Florian Jug, Gene Myers, Carsten Rother, Dagmar Kainmüller

    Abstract: We propose a novel label fusion technique as well as a crowdsourcing protocol to efficiently obtain accurate epithelial cell segmentations from non-expert crowd workers. Our label fusion technique simultaneously estimates the true segmentation, the performance levels of individual crowd workers, and an image segmentation model in the form of a pairwise Markov random field. We term our approach ima…

    Submitted 21 February, 2017; originally announced February 2017.

    Comments: Accepted to ISBI2017

  3. arXiv:1612.06573  [pdf, other]

    cs.CV cs.RO

    Detecting Unexpected Obstacles for Self-Driving Cars: Fusing Deep Learning and Geometric Modeling

    Authors: Sebastian Ramos, Stefan Gehrig, Peter Pinggera, Uwe Franke, Carsten Rother

    Abstract: The detection of small road hazards, such as lost cargo, is a vital capability for self-driving cars. We tackle this challenging and rarely addressed problem with a vision system that leverages appearance, contextual as well as geometric cues. To utilize the appearance and contextual cues, we propose a new deep learning-based obstacle detection framework. Here a variant of a fully convolutional ne…

    Submitted 20 December, 2016; originally announced December 2016.

    Comments: Submitted to the IEEE International Conference on Robotics and Automation (ICRA) 2017

  4. arXiv:1612.05476  [pdf, other]

    cs.CV

    A Study of Lagrangean Decompositions and Dual Ascent Solvers for Graph Matching

    Authors: Paul Swoboda, Carsten Rother, Hassan Abu Alhaija, Dagmar Kainmueller, Bogdan Savchynskyy

    Abstract: We study the quadratic assignment problem, in computer vision also known as graph matching. Two leading solvers for this problem optimize the Lagrange decomposition duals with sub-gradient and dual ascent (also known as message passing) updates. We explore this direction further and propose several additional Lagrangean relaxations of the graph matching problem along with corresponding algorithms, wh…

    Submitted 12 January, 2017; v1 submitted 16 December, 2016; originally announced December 2016.

    Comments: Added acknowledgments

  5. arXiv:1612.03779  [pdf, other]

    cs.CV

    PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

    Authors: Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother

    Abstract: State-of-the-art computer vision algorithms often achieve efficiency by making discrete choices about which hypotheses to explore next. This allows allocation of computational resources to promising candidates, however, such decisions are non-differentiable. As a result, these algorithms are hard to train in an end-to-end fashion. In this work we propose to learn an efficient algorithm for the tas…

    Submitted 11 April, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

  6. arXiv:1612.02287  [pdf, other]

    cs.CV

    Global Hypothesis Generation for 6D Object Pose Estimation

    Authors: Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother

    Abstract: This paper addresses the task of estimating the 6D pose of a known 3D object from a single RGB-D image. Most modern approaches solve this task in three steps: i) Compute local features; ii) Generate a pool of pose-hypotheses; iii) Select and refine a pose from the pool. This work focuses on the second step. While all existing approaches generate the hypotheses pool via local reasoning, e.g. RANSAC…

    Submitted 2 January, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

  7. arXiv:1612.01397  [pdf, other]

    cs.LG

    Implicit Modeling -- A Generalization of Discriminative and Generative Approaches

    Authors: Dmitrij Schlesinger, Carsten Rother

    Abstract: We propose a new modeling approach that is a generalization of generative and discriminative models. The core idea is to use an implicit parameterization of a joint probability distribution by specifying only the conditional distributions. The proposed scheme combines the advantages of both worlds -- it can use powerful complex discriminative models as its parts, having at the same time better gen…

    Submitted 5 December, 2016; originally announced December 2016.

  8. arXiv:1611.08272  [pdf, other]

    cs.CV

    InstanceCut: from Edges to Instances with MultiCut

    Authors: Alexander Kirillov, Evgeny Levinkov, Bjoern Andres, Bogdan Savchynskyy, Carsten Rother

    Abstract: This work addresses the task of instance-aware semantic segmentation. Our key motivation is to design a simple method with a new modelling-paradigm, which therefore has a different trade-off between advantages and disadvantages compared to known approaches. Our approach, we term InstanceCut, represents the problem by two output modalities: (i) an instance-agnostic semantic segmentation and (ii) al…

    Submitted 24 November, 2016; originally announced November 2016.

    Comments: The code would be released at https://github.com/alexander-kirillov/InstanceCut

  9. arXiv:1611.05705  [pdf, other]

    cs.CV

    DSAC - Differentiable RANSAC for Camera Localization

    Authors: Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother

    Abstract: RANSAC is an important algorithm in robust optimization and a central building block for many computer vision applications. In recent years, traditionally hand-crafted pipelines have been replaced by deep learning pipelines, which can be trained in an end-to-end fashion. However, RANSAC has so far not been used as part of such deep learning pipelines, because its hypothesis selection procedure is…

    Submitted 21 March, 2018; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: CVPR 2017

  10. arXiv:1611.04399  [pdf, other]

    cs.CV cs.DM

    Joint Graph Decomposition and Node Labeling: Problem, Algorithms, Applications

    Authors: Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres

    Abstract: We state a combinatorial optimization problem whose feasible solutions define both a decomposition and a node labeling of a given graph. This problem offers a common mathematical abstraction of seemingly unrelated computer vision tasks, including instance-separating semantic segmentation, articulated human body pose estimation and multiple object tracking. Conceptually, the problem we state genera…

    Submitted 21 February, 2017; v1 submitted 14 November, 2016; originally announced November 2016.

  11. arXiv:1610.00731  [pdf, other]

    cs.CV

    Can Ground Truth Label Propagation from Video help Semantic Segmentation?

    Authors: Siva Karthik Mustikovela, Michael Ying Yang, Carsten Rother

    Abstract: For state-of-the-art semantic segmentation, training convolutional neural networks (CNNs) requires dense pixelwise ground truth (GT) labeling, which is expensive and involves extensive human effort. In this work, we study the possibility of using auxiliary ground truth, so-called pseudo ground truth (PGT), to improve the performance. The PGT is obtained by propagating the labels of a…

    Submitted 3 October, 2016; originally announced October 2016.

    Comments: To appear at ECCV 2016 Workshop on Video Segmentation

  12. arXiv:1609.05797  [pdf, other]

    cs.CV cs.RO

    Random Forests versus Neural Networks - What's Best for Camera Localization?

    Authors: Daniela Massiceti, Alexander Krull, Eric Brachmann, Carsten Rother, Philip H. S. Torr

    Abstract: This work addresses the task of camera localization in a known 3D scene given a single input RGB image. State-of-the-art approaches accomplish this in two steps: firstly, regressing for every pixel in the image its 3D scene coordinate and subsequently, using these coordinates to estimate the final 6D camera pose via RANSAC. To solve the first step, Random Forests (RFs) are typically used. On the o…

    Submitted 13 July, 2017; v1 submitted 19 September, 2016; originally announced September 2016.

    Comments: 8 pages, 4 figures

  13. arXiv:1609.04653  [pdf, other]

    cs.CV cs.RO

    Lost and Found: Detecting Small Road Hazards for Self-Driving Vehicles

    Authors: Peter Pinggera, Sebastian Ramos, Stefan Gehrig, Uwe Franke, Carsten Rother, Rudolf Mester

    Abstract: Detecting small obstacles on the road ahead is a critical part of the driving task which has to be mastered by fully autonomous cars. In this paper, we present a method based on stereo vision to reliably detect such obstacles from a moving vehicle. The proposed algorithm performs statistical hypothesis tests in disparity space directly on stereo image data, assessing freespace and obstacle hypothe…

    Submitted 15 September, 2016; originally announced September 2016.

    Comments: To be presented at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2016

  14. arXiv:1607.08421  [pdf, other]

    cs.CV

    Stereo Video Deblurring

    Authors: Anita Sellent, Carsten Rother, Stefan Roth

    Abstract: Videos acquired in low-light conditions often exhibit motion blur, which depends on the motion of the objects relative to the camera. This is not only visually unpleasing, but can hamper further processing. With this paper we are the first to show how the availability of stereo video can aid the challenging video deblurring task. We leverage 3D scene flow, which can be estimated robustly even unde…

    Submitted 28 July, 2016; originally announced July 2016.

    Comments: Accepted to the 14th European Conference on Computer Vision (ECCV 2016). Includes supplemental material

    ACM Class: I.4.3

  15. arXiv:1606.07015  [pdf, other]

    cs.CV

    Joint M-Best-Diverse Labelings as a Parametric Submodular Minimization

    Authors: Alexander Kirillov, Alexander Shekhovtsov, Carsten Rother, Bogdan Savchynskyy

    Abstract: We consider the problem of jointly inferring the M-best diverse labelings for a binary (high-order) submodular energy of a graphical model. Recently, it was shown that this problem can be solved to a global optimum, for many practically interesting diversity measures. It was noted that the labelings are, so-called, nested. This nestedness property also holds for labelings of a class of parametric…

    Submitted 23 June, 2016; v1 submitted 22 June, 2016; originally announced June 2016.

  16. arXiv:1511.05067  [pdf, other]

    cs.CV

    Joint Training of Generic CNN-CRF Models with Stochastic Optimization

    Authors: Alexander Kirillov, Dmitrij Schlesinger, Shuai Zheng, Bogdan Savchynskyy, Philip H. S. Torr, Carsten Rother

    Abstract: We propose a new CNN-CRF end-to-end learning framework, which is based on joint stochastic optimization with respect to both Convolutional Neural Network (CNN) and Conditional Random Field (CRF) parameters. While stochastic gradient descent is a standard technique for CNN training, it was not used for joint models so far. We show that our learning method is (i) general, i.e. it applies to arbitrar…

    Submitted 14 September, 2016; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: ACCV2016

  17. arXiv:1509.02122  [pdf, other]

    cs.CV

    Convexity Shape Constraints for Image Segmentation

    Authors: Loic A. Royer, David L. Richmond, Carsten Rother, Bjoern Andres, Dagmar Kainmueller

    Abstract: Segmenting an image into multiple components is a central task in computer vision. In many practical scenarios, prior knowledge about plausible components is available. Incorporating such prior knowledge into models and algorithms for image segmentation is highly desirable, yet can be non-trivial. In this work, we introduce a new approach that allows, for the first time, to constrain some or all c…

    Submitted 7 September, 2015; originally announced September 2015.

  18. arXiv:1508.04546  [pdf, other]

    cs.CV

    Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images

    Authors: Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother

    Abstract: Analysis-by-synthesis has been a successful approach for many tasks in computer vision, such as 6D pose estimation of an object in an RGB-D image which is the topic of this work. The idea is to compare the observation with the output of a forward process, such as a rendered image of the object of interest in a particular pose. Due to occlusion or complicated sensor noise, it can be difficult to pe…

    Submitted 19 August, 2015; originally announced August 2015.

    Comments: 16 pages, 8 figures

    MSC Class: 65-XX

  19. arXiv:1507.07583  [pdf, other]

    cs.CV

    Mapping Auto-context Decision Forests to Deep ConvNets for Semantic Segmentation

    Authors: David L. Richmond, Dagmar Kainmueller, Michael Y. Yang, Eugene W. Myers, Carsten Rother

    Abstract: We consider the task of pixel-wise semantic segmentation given a small set of labeled training images. Two of the most popular techniques to address this task are Decision Forests (DF) and Neural Networks (NN). In this work, we explore the relationship between two special forms of these techniques: stacked DFs (namely Auto-context) and deep Convolutional Neural Networks (ConvNet). Our main c…

    Submitted 13 August, 2018; v1 submitted 27 July, 2015; originally announced July 2015.

  20. Cascades of Regression Tree Fields for Image Restoration

    Authors: Uwe Schmidt, Jeremy Jancsary, Sebastian Nowozin, Stefan Roth, Carsten Rother

    Abstract: Conditional random fields (CRFs) are popular discriminative models for computer vision and have been successfully applied in the domain of image restoration, especially to image denoising. For image deblurring, however, discriminative approaches have been mostly lacking. We posit two reasons for this: First, the blur kernel is often only known at test time, requiring any discriminative approach to…

    Submitted 20 November, 2014; v1 submitted 8 April, 2014; originally announced April 2014.

    Comments: Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  21. arXiv:1404.0533  [pdf, other]

    cs.CV

    A Comparative Study of Modern Inference Techniques for Structured Discrete Energy Minimization Problems

    Authors: Jörg H. Kappes, Bjoern Andres, Fred A. Hamprecht, Christoph Schnörr, Sebastian Nowozin, Dhruv Batra, Sungwoong Kim, Bernhard X. Kausler, Thorben Kröger, Jan Lellmann, Nikos Komodakis, Bogdan Savchynskyy, Carsten Rother

    Abstract: Szeliski et al. published an influential study in 2006 on energy minimization methods for Markov Random Fields (MRF). This study provided valuable insights in choosing the best optimization technique for certain classes of problems. While these insights remain generally useful today, the phenomenal success of random field models means that the kinds of inference problems that have to be solved cha…

    Submitted 2 April, 2014; originally announced April 2014.

  22. arXiv:1109.1480  [pdf, other]

    cs.CV

    Curvature Prior for MRF-based Segmentation and Shape Inpainting

    Authors: Alexander Shekhovtsov, Pushmeet Kohli, Carsten Rother

    Abstract: Most image labeling problems such as segmentation and image reconstruction are fundamentally ill-posed and suffer from ambiguities and noise. Higher order image priors encode high level structural dependencies between pixels and are key to overcoming these problems. However, these priors in general lead to computationally intractable models. This paper addresses the problem of discovering compact…

    Submitted 7 September, 2011; originally announced September 2011.

    Comments: 17 pages, 16 figures

    Report number: CTU--CMP--2011--11

  23. arXiv:0912.2492  [pdf, other]

    stat.ML cs.CV stat.ME

    Learning an Interactive Segmentation System

    Authors: Hannes Nickisch, Pushmeet Kohli, Carsten Rother

    Abstract: Many successful applications of computer vision to image or video manipulation are interactive by nature. However, parameters of such systems are often trained neglecting the user. Traditionally, interactive systems have been treated in the same manner as their fully automatic counterparts. Their performance is evaluated by computing the accuracy of their solutions under some fixed set of user i…

    Submitted 13 December, 2009; originally announced December 2009.

    Comments: 11 pages, 7 figures, 4 tables