Skip to main content

Showing 51–79 of 79 results for author: Leibe, B

.
  1. arXiv:1904.04552  [pdf, other

    cs.CV

    BoLTVOS: Box-Level Tracking for Video Object Segmentation

    Authors: Paul Voigtlaender, Jonathon Luiten, Bastian Leibe

    Abstract: We approach video object segmentation (VOS) by splitting the task into two sub-tasks: bounding box level tracking, followed by bounding box segmentation. Following this paradigm, we present BoLTVOS (Box-Level Tracking for VOS), which consists of an R-CNN detector conditioned on the first-frame bounding box to detect the object of interest, a temporal consistency rescoring algorithm, and a Box2Seg… ▽ More

    Submitted 29 December, 2019; v1 submitted 9 April, 2019; originally announced April 2019.

  2. 3D-BEVIS: Bird's-Eye-View Instance Segmentation

    Authors: Cathrin Elich, Francis Engelmann, Theodora Kontogianni, Bastian Leibe

    Abstract: Recent deep learning models achieve impressive results on 3D scene analysis tasks by operating directly on unstructured point clouds. A lot of progress was made in the field of object classification and semantic segmentation. However, the task of instance segmentation is less explored. In this work, we present 3D-BEVIS, a deep learning framework for 3D semantic instance segmentation on point cloud… ▽ More

    Submitted 1 August, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: camera-ready version for GCPR '19

  3. arXiv:1903.00362  [pdf, other

    cs.CV

    Large-Scale Object Mining for Object Discovery from Unlabeled Video

    Authors: Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe

    Abstract: This paper addresses the problem of object discovery from unlabeled driving videos captured in a realistic automotive setting. Identifying recurring object categories in such raw video streams is a very challenging problem. Not only do object candidates first have to be localized in the input images, but many interesting object categories occur relatively infrequently. Object discovery will theref… ▽ More

    Submitted 29 April, 2019; v1 submitted 28 February, 2019; originally announced March 2019.

    Comments: Updated version of ICRA'19 paper (additional qualitative results); arXiv admin note: text overlap with arXiv:1712.08832

  4. arXiv:1902.09513  [pdf, other

    cs.CV

    FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation

    Authors: Paul Voigtlaender, Yuning Chai, Florian Schroff, Hartwig Adam, Bastian Leibe, Liang-Chieh Chen

    Abstract: Many of the recent successful methods for video object segmentation (VOS) are overly complicated, heavily rely on fine-tuning on the first frame, and/or are slow, and are hence of limited practical use. In this work, we propose FEELVOS as a simple and fast method which does not rely on fine-tuning. In order to segment a video, for each frame FEELVOS uses a semantic pixel-wise embedding together wi… ▽ More

    Submitted 8 April, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: CVPR 2019 camera-ready version

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019

  5. arXiv:1902.03604  [pdf, other

    cs.CV

    MOTS: Multi-Object Tracking and Segmentation

    Authors: Paul Voigtlaender, Michael Krause, Aljosa Osep, Jonathon Luiten, Berin Balachandar Gnana Sekar, Andreas Geiger, Bastian Leibe

    Abstract: This paper extends the popular task of multi-object tracking to multi-object tracking and segmentation (MOTS). Towards this goal, we create dense pixel-level annotations for two existing tracking datasets using a semi-automatic annotation procedure. Our new annotations comprise 65,213 pixel masks for 977 distinct objects (cars and pedestrians) in 10,870 video frames. For evaluation, we extend exis… ▽ More

    Submitted 8 April, 2019; v1 submitted 10 February, 2019; originally announced February 2019.

    Comments: CVPR 2019 camera-ready version

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019

  6. arXiv:1901.09260  [pdf, other

    cs.CV cs.RO

    4D Generic Video Object Proposals

    Authors: Aljosa Osep, Paul Voigtlaender, Mark Weber, Jonathon Luiten, Bastian Leibe

    Abstract: Many high-level video understanding methods require input in the form of object proposals. Currently, such proposals are predominantly generated with the help of networks that were trained for detecting and segmenting a set of known object classes, which limits their applicability to cases where all objects of interest are represented in the training set. This is a restriction for automotive scena… ▽ More

    Submitted 20 May, 2020; v1 submitted 26 January, 2019; originally announced January 2019.

    Comments: ICRA 2020

  7. Know What Your Neighbors Do: 3D Semantic Segmentation of Point Clouds

    Authors: Francis Engelmann, Theodora Kontogianni, Jonas Schult, Bastian Leibe

    Abstract: In this paper, we present a deep learning architecture which addresses the problem of 3D semantic segmentation of unstructured point clouds. Compared to previous work, we introduce grou** techniques which define point neighborhoods in the initial world space and the learned feature space. Neighborhoods are important as they allow to compute local or global point features depending on the spatial… ▽ More

    Submitted 8 December, 2018; v1 submitted 2 October, 2018; originally announced October 2018.

  8. arXiv:1809.07357  [pdf, other

    cs.CV

    Combined Image- and World-Space Tracking in Traffic Scenes

    Authors: Aljosa Osep, Wolfgang Mehner, Markus Mathias, Bastian Leibe

    Abstract: Tracking in urban street scenes plays a central role in autonomous systems such as self-driving cars. Most of the current vision-based tracking methods perform tracking in the image domain. Other approaches, eg based on LIDAR and radar, track purely in 3D. While some vision-based tracking methods invoke 3D information in parts of their pipeline, and some 3D-based methods utilize image-based inform… ▽ More

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: 8 pages, 7 figures, 2 tables. ICRA 2017 paper

  9. arXiv:1809.07316  [pdf, other

    cs.CV

    Towards Large-Scale Video Video Object Mining

    Authors: Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe

    Abstract: We propose to leverage a generic object tracker in order to perform object mining in large-scale unlabeled videos, captured in a realistic automotive setting. We present a dataset of more than 360'000 automatically mined object tracks from 10+ hours of video data (560'000 frames) and propose a method for automated novel category discovery and detector learning. In addition, we show preliminary res… ▽ More

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: 4 pages, 3 figures, 1 table. ECCV 2018 Workshop on Interactive and Adaptive Learning in an Open World

  10. arXiv:1809.04987  [pdf, other

    cs.CV cs.RO

    Synthetic Occlusion Augmentation with Volumetric Heatmaps for the 2018 ECCV PoseTrack Challenge on 3D Human Pose Estimation

    Authors: István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe

    Abstract: In this paper we present our winning entry at the 2018 ECCV PoseTrack Challenge on 3D human pose estimation. Using a fully-convolutional backbone architecture, we obtain volumetric heatmaps per body joint, which we convert to coordinates using soft-argmax. Absolute person center depth is estimated by a 1D heatmap prediction head. The coordinates are back-projected to 3D camera space, where we mini… ▽ More

    Submitted 6 November, 2018; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: Extended abstract for the 2018 ECCV PoseTrack Workshop, updated with full result tables

  11. arXiv:1808.09316  [pdf, other

    cs.CV cs.RO

    How Robust is 3D Human Pose Estimation to Occlusion?

    Authors: István Sárándi, Timm Linder, Kai O. Arras, Bastian Leibe

    Abstract: Occlusion is commonplace in realistic human-robot shared environments, yet its effects are not considered in standard 3D human pose estimation benchmarks. This leaves the question open: how robust are state-of-the-art 3D pose estimation methods against partial occlusions? We study several types of synthetic occlusions over the Human3.6M dataset and find a method with state-of-the-art benchmark per… ▽ More

    Submitted 29 August, 2018; v1 submitted 28 August, 2018; originally announced August 2018.

    Comments: Accepted for IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'18) - Workshop on Robotic Co-workers 4.0: Human Safety and Comfort in Human-Robot Interactive Social Environments

  12. arXiv:1807.09190  [pdf, other

    cs.CV

    PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation

    Authors: Jonathon Luiten, Paul Voigtlaender, Bastian Leibe

    Abstract: We address semi-supervised video object segmentation, the task of automatically generating accurate and consistent pixel masks for objects in a video sequence, given the first-frame ground truth annotations. Towards this goal, we present the PReMVOS algorithm (Proposal-generation, Refinement and Merging for Video Object Segmentation). Our method separates this problem into two steps, first generat… ▽ More

    Submitted 3 November, 2018; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: Accepted for publication in ACCV18

  13. arXiv:1805.04398  [pdf, other

    cs.CV

    Iteratively Trained Interactive Segmentation

    Authors: Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe

    Abstract: Deep learning requires large amounts of training data to be effective. For the task of object segmentation, manually labeling data is very expensive, and hence interactive methods are needed. Following recent approaches, we develop an interactive object segmentation system which uses user input in the form of clicks as the input to a convolutional network. While previous methods use heuristic clic… ▽ More

    Submitted 11 May, 2018; originally announced May 2018.

  14. arXiv:1804.10134  [pdf, other

    cs.RO cs.CV

    Detection-Tracking for Efficient Person Analysis: The DetTA Pipeline

    Authors: Stefan Breuers, Lucas Beyer, Umer Rafi, Bastian Leibe

    Abstract: In the past decade many robots were deployed in the wild, and people detection and tracking is an important component of such deployments. On top of that, one often needs to run modules which analyze persons and extract higher level attributes such as age and gender, or dynamic information like gaze and pose. The latter ones are especially necessary for building a reactive, social robot-person int… ▽ More

    Submitted 28 July, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

    Comments: Code available at: https://github.com/sbreuers/detta

  15. arXiv:1804.02463  [pdf, other

    cs.RO cs.CV

    Deep Person Detection in 2D Range Data

    Authors: Lucas Beyer, Alexander Hermans, Timm Linder, Kai O. Arras, Bastian Leibe

    Abstract: Detecting humans is a key skill for mobile robots and intelligent vehicles in a large variety of applications. While the problem is well studied for certain sensory modalities such as image data, few works exist that address this detection task using 2D range data. However, a widespread sensory setup for many mobile robots in service and domestic applications contains a horizontally mounted 2D las… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

  16. Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds

    Authors: Francis Engelmann, Theodora Kontogianni, Alexander Hermans, Bastian Leibe

    Abstract: Deep learning approaches have made tremendous progress in the field of semantic segmentation over the past few years. However, most current approaches operate in the 2D image space. Direct semantic segmentation of unstructured 3D point clouds is still an open research problem. The recently proposed PointNet architecture presents an interesting step ahead in that it can operate on unstructured poin… ▽ More

    Submitted 18 December, 2019; v1 submitted 5 February, 2018; originally announced February 2018.

  17. arXiv:1712.08832  [pdf, other

    cs.CV

    Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video

    Authors: Aljoša Ošep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe

    Abstract: We explore object discovery and detector adaptation based on unlabeled video sequences captured from a mobile platform. We propose a fully automatic approach for object mining from video which builds upon a generic object tracking approach. By applying this method to three large video datasets from autonomous driving and mobile robotics scenarios, we demonstrate its robustness and generality. Base… ▽ More

    Submitted 23 December, 2017; originally announced December 2017.

    Comments: CVPR'18 submission

  18. arXiv:1712.07920  [pdf, other

    cs.CV

    Track, then Decide: Category-Agnostic Vision-based Multi-Object Tracking

    Authors: Aljoša Ošep, Wolfgang Mehner, Paul Voigtlaender, Bastian Leibe

    Abstract: The most common paradigm for vision-based multi-object tracking is tracking-by-detection, due to the availability of reliable detectors for several important object categories such as cars and pedestrians. However, future mobile systems will need a capability to cope with rich human-made environments, in which obtaining detectors for every possible object category would be infeasible. In this pape… ▽ More

    Submitted 21 December, 2017; originally announced December 2017.

    Comments: ICRA'18 submission

  19. arXiv:1706.09364  [pdf, other

    cs.CV

    Online Adaptation of Convolutional Neural Networks for Video Object Segmentation

    Authors: Paul Voigtlaender, Bastian Leibe

    Abstract: We tackle the task of semi-supervised video object segmentation, i.e. segmenting the pixels belonging to an object in the video using the ground truth pixel mask for the first frame. We build on the recently introduced one-shot video object segmentation (OSVOS) approach which uses a pretrained network and fine-tunes it on the first frame. While achieving impressive performance, at test time OSVOS… ▽ More

    Submitted 1 August, 2017; v1 submitted 28 June, 2017; originally announced June 2017.

    Comments: Accepted at BMVC 2017. This version contains minor changes for the camera ready version

  20. arXiv:1705.10998  [pdf, other

    cs.AI

    The Atari Grand Challenge Dataset

    Authors: Vitaly Kurin, Sebastian Nowozin, Katja Hofmann, Lucas Beyer, Bastian Leibe

    Abstract: Recent progress in Reinforcement Learning (RL), fueled by its combination, with Deep Learning has enabled impressive results in learning to interact with complex virtual environments, yet real-world applications of RL are still scarce. A key limitation is data efficiency, with current state-of-the-art approaches requiring millions of training samples. A promising way to tackle this problem is to a… ▽ More

    Submitted 31 May, 2017; originally announced May 2017.

  21. arXiv:1705.04608  [pdf, other

    cs.CV

    Towards a Principled Integration of Multi-Camera Re-Identification and Tracking through Optimal Bayes Filters

    Authors: Lucas Beyer, Stefan Breuers, Vitaly Kurin, Bastian Leibe

    Abstract: With the rise of end-to-end learning through deep learning, person detectors and re-identification (ReID) models have recently become very strong. Multi-camera multi-target (MCMT) tracking has not fully gone through this transformation yet. We intend to take another step in this direction by presenting a theoretically principled way of integrating ReID with tracking formulated as an optimal Bayes… ▽ More

    Submitted 16 May, 2017; v1 submitted 12 May, 2017; originally announced May 2017.

    Comments: First two authors have equal contribution. This is initial work into a new direction, not a benchmark-beating method. v2 only adds acknowledgements and fixes a typo in e-mail

  22. arXiv:1703.07737  [pdf, other

    cs.CV cs.NE

    In Defense of the Triplet Loss for Person Re-Identification

    Authors: Alexander Hermans, Lucas Beyer, Bastian Leibe

    Abstract: In the past few years, the field of computer vision has gone through a revolution fueled mainly by the advent of large datasets and the adoption of deep convolutional neural networks for end-to-end learning. The person re-identification subfield is no exception to this. Unfortunately, a prevailing belief in the community seems to be that the triplet loss is inferior to using surrogate losses (clas… ▽ More

    Submitted 21 November, 2017; v1 submitted 22 March, 2017; originally announced March 2017.

    Comments: Lucas Beyer and Alexander Hermans contributed equally. Updates: Minor fixes, new SOTA comparisons, add CUHK03 results

  23. arXiv:1702.02706  [pdf, other

    cs.CV

    Semi-Supervised Deep Learning for Monocular Depth Map Prediction

    Authors: Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe

    Abstract: Supervised deep learning often suffers from the lack of sufficient training data. Specifically in the context of monocular depth map prediction, it is barely possible to determine dense ground truth depth images in realistic dynamic outdoor environments. When using LiDAR sensors, for instance, noise is present in the distance measurements, the calibration between sensors cannot be perfect, and the… ▽ More

    Submitted 9 May, 2017; v1 submitted 9 February, 2017; originally announced February 2017.

    Comments: CVPR 2017 Spotlight

  24. Keyframe-Based Visual-Inertial Online SLAM with Relocalization

    Authors: Anton Kasyanov, Francis Engelmann, Jörg Stückler, Bastian Leibe

    Abstract: Complementing images with inertial measurements has become one of the most popular approaches to achieve highly accurate and robust real-time camera pose tracking. In this paper, we present a keyframe-based approach to visual-inertial simultaneous localization and map** (SLAM) for monocular and stereo cameras. Our visual-inertial SLAM system is based on a real-time capable visual-inertial odomet… ▽ More

    Submitted 2 March, 2017; v1 submitted 7 February, 2017; originally announced February 2017.

    Report number: RWTH-2018-221873

  25. Superpixels: An Evaluation of the State-of-the-Art

    Authors: David Stutz, Alexander Hermans, Bastian Leibe

    Abstract: Superpixels group perceptually similar pixels to create visually meaningful entities while heavily reducing the number of primitives for subsequent processing steps. As of these properties, superpixel algorithms have received much attention since their naming in 2003. By today, publicly available superpixel algorithms have turned into standard tools in low-level vision. As such, and due to their q… ▽ More

    Submitted 19 April, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

  26. arXiv:1611.08323  [pdf, other

    cs.CV

    Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

    Authors: Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe

    Abstract: Semantic image segmentation is an essential component of modern autonomous driving systems, as an accurate understanding of the surrounding scene is crucial to navigation and action planning. Current state-of-the-art approaches in semantic image segmentation rely on pre-trained networks that were initially developed for classifying images as a whole. While these networks exhibit outstanding recogn… ▽ More

    Submitted 6 December, 2016; v1 submitted 24 November, 2016; originally announced November 2016.

    Comments: Changes in v2: Fixed equation (10), fixed legend of Figure 6, fixed legend of Figure 9, added page numbers, fixed minor spelling mistakes

  27. The STRANDS Project: Long-Term Autonomy in Everyday Environments

    Authors: Nick Hawes, Chris Burbridge, Ferdian Jovan, Lars Kunze, Bruno Lacerda, Lenka Mudrová, Jay Young, Jeremy Wyatt, Denise Hebesberger, Tobias Körtner, Rares Ambrus, Nils Bore, John Folkesson, Patric Jensfelt, Lucas Beyer, Alexander Hermans, Bastian Leibe, Aitor Aldoma, Thomas Fäulhammer, Michael Zillich, Markus Vincze, Eris Chinellato, Muhannad Al-Omari, Paul Duckworth, Yiannis Gatsoulis , et al. (8 additional authors not shown)

    Abstract: Thanks to the efforts of the robotics and autonomous systems community, robots are becoming ever more capable. There is also an increasing demand from end-users for autonomous service robots that can operate in real environments for extended periods. In the STRANDS project we are tackling this demand head-on by integrating state-of-the-art artificial intelligence and robotics research into mobile… ▽ More

    Submitted 14 October, 2016; v1 submitted 15 April, 2016; originally announced April 2016.

  28. arXiv:1603.02636  [pdf, other

    cs.RO cs.CV cs.LG cs.NE

    DROW: Real-Time Deep Learning based Wheelchair Detection in 2D Range Data

    Authors: Lucas Beyer, Alexander Hermans, Bastian Leibe

    Abstract: We introduce the DROW detector, a deep learning based detector for 2D range data. Laser scanners are lighting invariant, provide accurate range data, and typically cover a large field of view, making them interesting sensors for robotics applications. So far, research on detection in laser range data has been dominated by hand-crafted features and boosted classifiers, potentially losing performanc… ▽ More

    Submitted 5 December, 2016; v1 submitted 8 March, 2016; originally announced March 2016.

    Comments: Lucas Beyer and Alexander Hermans contributed equally

  29. Visual Landmark Recognition from Internet Photo Collections: A Large-Scale Evaluation

    Authors: Tobias Weyand, Bastian Leibe

    Abstract: The task of a visual landmark recognition system is to identify photographed buildings or objects in query photos and to provide the user with relevant information on them. With their increasing coverage of the world's landmark buildings and objects, Internet photo collections are now being used as a source for building such systems in a fully automatic fashion. This process typically consists of… ▽ More

    Submitted 18 September, 2014; originally announced September 2014.