Skip to main content

Showing 1–25 of 25 results for author: Hermans, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11835  [pdf, other

    cs.CV

    OoDIS: Anomaly Instance Segmentation Benchmark

    Authors: Alexey Nekrasov, Rui Zhou, Miriam Ackermann, Alexander Hermans, Bastian Leibe, Matthias Rottmann

    Abstract: Autonomous vehicles require a precise understanding of their environment to navigate safely. Reliable identification of unknown objects, especially those that are absent during training, such as wild animals, is critical due to their potential to cause serious accidents. Significant progress in semantic segmentation of anomalies has been driven by the availability of out-of-distribution (OOD) benc… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted at the VAND 2.0 Workshop at CVPR 2024. Project page: https://vision.rwth-aachen.de/oodis

  2. arXiv:2402.05685  [pdf, ps, other

    cs.CV

    An Ordinal Regression Framework for a Deep Learning Based Severity Assessment for Chest Radiographs

    Authors: Patrick Wienholt, Alexander Hermans, Firas Khader, Behrus Puladi, Bastian Leibe, Christiane Kuhl, Sven Nebelung, Daniel Truhn

    Abstract: This study investigates the application of ordinal regression methods for categorizing disease severity in chest radiographs. We propose a framework that divides the ordinal regression problem into three parts: a model, a target function, and a classification function. Different encoding methods, including one-hot, Gaussian, progress-bar, and our soft-progress-bar, are applied using ResNet50 and V… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 17 pages, 3 figures, the code is available at: https://github.com/paddyOnGithub/ordinal_regression

  3. Cyto R-CNN and CytoNuke Dataset: Towards reliable whole-cell segmentation in bright-field histological images

    Authors: Johannes Raufeisen, Kunpeng Xie, Fabian Hörst, Till Braunschweig, Jianning Li, Jens Kleesiek, Rainer Röhrig, Jan Egger, Bastian Leibe, Frank Hölzle, Alexander Hermans, Behrus Puladi

    Abstract: Background: Cell segmentation in bright-field histological slides is a crucial topic in medical image analysis. Having access to accurate segmentation allows researchers to examine the relationship between cellular morphology and clinical observations. Unfortunately, most segmentation methods known today are limited to nuclei and cannot segmentate the cytoplasm. Material & Methods: We present a… ▽ More

    Submitted 4 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  4. arXiv:2308.02046  [pdf, other

    cs.CV

    UGainS: Uncertainty Guided Anomaly Instance Segmentation

    Authors: Alexey Nekrasov, Alexander Hermans, Lars Kuhnert, Bastian Leibe

    Abstract: A single unexpected object on the road can cause an accident or may lead to injuries. To prevent this, we need a reliable mechanism for finding anomalous objects on the road. This task, called anomaly segmentation, can be a step** stone to safe and reliable autonomous driving. Current approaches tackle anomaly segmentation by assigning an anomaly score to each pixel and by grou** anomalous reg… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted for publication at GCPR 2023; Project page at https://vision.rwth-aachen.de/ugains

  5. arXiv:2304.06668  [pdf, other

    cs.CV

    DynaMITe: Dynamic Query Bootstrap** for Multi-object Interactive Segmentation Transformer

    Authors: Amit Kumar Rana, Sabarinath Mahadevan, Alexander Hermans, Bastian Leibe

    Abstract: Most state-of-the-art instance segmentation methods rely on large amounts of pixel-precise ground-truth annotations for training, which are expensive to create. Interactive segmentation networks help generate such annotations based on an image and the corresponding user interactions such as clicks. Existing methods for this task can only process a single instance at a time and each user interactio… ▽ More

    Submitted 22 August, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted to ICCV 2023

  6. arXiv:2303.16570  [pdf, other

    cs.CV

    Point2Vec for Self-Supervised Representation Learning on Point Clouds

    Authors: Karim Abou Zeid, Jonas Schult, Alexander Hermans, Bastian Leibe

    Abstract: Recently, the self-supervised learning framework data2vec has shown inspiring performance for various modalities using a masked student-teacher approach. However, it remains open whether such a framework generalizes to the unique challenges of 3D point clouds. To answer this question, we extend data2vec to the point cloud domain and report encouraging results on several downstream tasks. In an in-… ▽ More

    Submitted 11 October, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted at GCPR 2023. Project page at https://vision.rwth-aachen.de/point2vec

  7. arXiv:2301.02657  [pdf, other

    cs.CV cs.AI cs.LG

    TarViS: A Unified Approach for Target-based Video Segmentation

    Authors: Ali Athar, Alexander Hermans, Jonathon Luiten, Deva Ramanan, Bastian Leibe

    Abstract: The general domain of video segmentation is currently fragmented into different tasks spanning multiple benchmarks. Despite rapid progress in the state-of-the-art, current methods are overwhelmingly task-specific and cannot conceptually generalize to other tasks. Inspired by recent approaches with multi-task capability, we propose TarViS: a novel, unified network architecture that can be applied t… ▽ More

    Submitted 10 May, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: Accepted to CVPR'23 (Highlight). Code is available at: https://github.com/Ali2500/TarViS

    ACM Class: I.4.6; I.4.8; I.4.10

  8. arXiv:2212.14474  [pdf, other

    cs.CV

    Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats

    Authors: István Sárándi, Alexander Hermans, Bastian Leibe

    Abstract: Deep learning-based 3D human pose estimation performs best when trained on large amounts of labeled data, making combined learning from many datasets an important research direction. One obstacle to this endeavor are the different skeleton formats provided by different datasets, i.e., they do not label the same set of anatomical landmarks. There is little prior research on how to best supervise on… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: Accepted at the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV'23)

    ACM Class: I.2.10; I.4.8

  9. arXiv:2210.03105  [pdf, other

    cs.CV

    Mask3D: Mask Transformer for 3D Semantic Instance Segmentation

    Authors: Jonas Schult, Francis Engelmann, Alexander Hermans, Or Litany, Siyu Tang, Bastian Leibe

    Abstract: Modern 3D semantic instance segmentation approaches predominantly rely on specialized voting mechanisms followed by carefully designed geometric clustering techniques. Building on the successes of recent Transformer-based methods for object detection and image segmentation, we propose the first Transformer-based approach for 3D semantic instance segmentation. We show that we can leverage generic T… ▽ More

    Submitted 12 April, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: ICRA 2023 camera-ready version

  10. arXiv:2208.03791  [pdf, other

    cs.CV

    Global Hierarchical Attention for 3D Point Cloud Analysis

    Authors: Dan Jia, Alexander Hermans, Bastian Leibe

    Abstract: We propose a new attention mechanism, called Global Hierarchical Attention (GHA), for 3D point cloud analysis. GHA approximates the regular global dot-product attention via a series of coarsening and interpolation operations over multiple hierarchy levels. The advantage of GHA is two-fold. First, it has linear complexity with respect to the number of points, enabling the processing of large point… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted to the German Conference on Pattern Recognition (GCPR) 2022

  11. arXiv:2206.00182  [pdf, other

    cs.CV

    Differentiable Soft-Masked Attention

    Authors: Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe

    Abstract: Transformers have become prevalent in computer vision due to their performance and flexibility in modelling complex operations. Of particular significance is the 'cross-attention' operation, which allows a vector representation (e.g. of an object in an image) to be learned by attending to an arbitrarily sized set of input features. Recently, "Masked Attention" was proposed in which a given object… ▽ More

    Submitted 5 August, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2112.09131

    ACM Class: I.4.6; I.4.8; I.4.10

  12. HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images

    Authors: Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe

    Abstract: Existing state-of-the-art methods for Video Object Segmentation (VOS) learn low-level pixel-to-pixel correspondences between frames to propagate object masks across video. This requires a large amount of densely annotated video data, which is costly to annotate, and largely redundant since frames within a video are highly correlated. In light of this, we propose HODOR: a novel method that tackles… ▽ More

    Submitted 15 July, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    ACM Class: I.4.6; I.4.8; I.4.10

  13. arXiv:2106.11239  [pdf, other

    cs.RO cs.CV

    2D vs. 3D LiDAR-based Person Detection on Mobile Robots

    Authors: Dan Jia, Alexander Hermans, Bastian Leibe

    Abstract: Person detection is a crucial task for mobile robots navigating in human-populated environments. LiDAR sensors are promising for this task, thanks to their accurate depth measurements and large field of view. Two types of LiDAR sensors exist: the 2D LiDAR sensors, which scan a single plane, and the 3D LiDAR sensors, which scan multiple planes, thus forming a volume. How do they compare for the tas… ▽ More

    Submitted 25 July, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Shortened version accepted at the International Conference on Intelligent Robots and Systems (IROS) 2022

  14. arXiv:2012.08890  [pdf, other

    cs.CV cs.RO

    Self-Supervised Person Detection in 2D Range Data using a Calibrated Camera

    Authors: Dan Jia, Mats Steinweg, Alexander Hermans, Bastian Leibe

    Abstract: Deep learning is the essential building block of state-of-the-art person detectors in 2D range data. However, only a few annotated datasets are available for training and testing these deep networks, potentially limiting their performance when deployed in new environments or with different LiDAR models. We propose a method, which uses bounding boxes from an image-based detector (e.g. Faster R-CNN)… ▽ More

    Submitted 3 June, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: 2021 IEEE International Conference on Robotics and Automation (ICRA)

  15. arXiv:2004.14079  [pdf, other

    cs.RO cs.CV

    DR-SPAAM: A Spatial-Attention and Auto-regressive Model for Person Detection in 2D Range Data

    Authors: Dan Jia, Alexander Hermans, Bastian Leibe

    Abstract: Detecting persons using a 2D LiDAR is a challenging task due to the low information content of 2D range data. To alleviate the problem caused by the sparsity of the LiDAR points, current state-of-the-art methods fuse multiple previous scans and perform detection using the combined scans. The downside of such a backward looking fusion is that all the scans need to be aligned explicitly, and the nec… ▽ More

    Submitted 31 July, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

  16. arXiv:2003.10886  [pdf

    cs.HC

    Assessment of Empathy in an Affective VR Environment using EEG Signals

    Authors: Maryam Alimardani, Annabella Hermans, Angelica M. Tinga

    Abstract: With the advancements in social robotics and virtual avatars, it becomes increasingly important that these agents adapt their behavior to the mood, feelings and personality of their users. One such aspect of the user is empathy. Whereas many studies measure empathy through offline measures that are collected after empathic stimulation (e.g. post-hoc questionnaires), the current study aimed to meas… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: 13 pages, 3 figures, 3 tables

  17. Visual Person Understanding through Multi-Task and Multi-Dataset Learning

    Authors: Kilian Pfeiffer, Alexander Hermans, István Sárándi, Mark Weber, Bastian Leibe

    Abstract: We address the problem of learning a single model for person re-identification, attribute classification, body part segmentation, and pose estimation. With predictions for these tasks we gain a more holistic understanding of persons, which is valuable for many applications. This is a classical multi-task learning problem. However, no dataset exists that these tasks could be jointly learned from. H… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  18. arXiv:1804.02463  [pdf, other

    cs.RO cs.CV

    Deep Person Detection in 2D Range Data

    Authors: Lucas Beyer, Alexander Hermans, Timm Linder, Kai O. Arras, Bastian Leibe

    Abstract: Detecting humans is a key skill for mobile robots and intelligent vehicles in a large variety of applications. While the problem is well studied for certain sensory modalities such as image data, few works exist that address this detection task using 2D range data. However, a widespread sensory setup for many mobile robots in service and domestic applications contains a horizontally mounted 2D las… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

  19. Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds

    Authors: Francis Engelmann, Theodora Kontogianni, Alexander Hermans, Bastian Leibe

    Abstract: Deep learning approaches have made tremendous progress in the field of semantic segmentation over the past few years. However, most current approaches operate in the 2D image space. Direct semantic segmentation of unstructured 3D point clouds is still an open research problem. The recently proposed PointNet architecture presents an interesting step ahead in that it can operate on unstructured poin… ▽ More

    Submitted 18 December, 2019; v1 submitted 5 February, 2018; originally announced February 2018.

  20. arXiv:1712.04837  [pdf, other

    cs.CV

    MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features

    Authors: Liang-Chieh Chen, Alexander Hermans, George Papandreou, Florian Schroff, Peng Wang, Hartwig Adam

    Abstract: In this work, we tackle the problem of instance segmentation, the task of simultaneously solving object detection and semantic segmentation. Towards this goal, we present a model, called MaskLab, which produces three outputs: box detection, semantic segmentation, and direction prediction. Building on top of the Faster-RCNN object detector, the predicted boxes provide accurate localization of objec… ▽ More

    Submitted 13 December, 2017; originally announced December 2017.

    Comments: 10 pages including reference

  21. arXiv:1703.07737  [pdf, other

    cs.CV cs.NE

    In Defense of the Triplet Loss for Person Re-Identification

    Authors: Alexander Hermans, Lucas Beyer, Bastian Leibe

    Abstract: In the past few years, the field of computer vision has gone through a revolution fueled mainly by the advent of large datasets and the adoption of deep convolutional neural networks for end-to-end learning. The person re-identification subfield is no exception to this. Unfortunately, a prevailing belief in the community seems to be that the triplet loss is inferior to using surrogate losses (clas… ▽ More

    Submitted 21 November, 2017; v1 submitted 22 March, 2017; originally announced March 2017.

    Comments: Lucas Beyer and Alexander Hermans contributed equally. Updates: Minor fixes, new SOTA comparisons, add CUHK03 results

  22. Superpixels: An Evaluation of the State-of-the-Art

    Authors: David Stutz, Alexander Hermans, Bastian Leibe

    Abstract: Superpixels group perceptually similar pixels to create visually meaningful entities while heavily reducing the number of primitives for subsequent processing steps. As of these properties, superpixel algorithms have received much attention since their naming in 2003. By today, publicly available superpixel algorithms have turned into standard tools in low-level vision. As such, and due to their q… ▽ More

    Submitted 19 April, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

  23. arXiv:1611.08323  [pdf, other

    cs.CV

    Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes

    Authors: Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe

    Abstract: Semantic image segmentation is an essential component of modern autonomous driving systems, as an accurate understanding of the surrounding scene is crucial to navigation and action planning. Current state-of-the-art approaches in semantic image segmentation rely on pre-trained networks that were initially developed for classifying images as a whole. While these networks exhibit outstanding recogn… ▽ More

    Submitted 6 December, 2016; v1 submitted 24 November, 2016; originally announced November 2016.

    Comments: Changes in v2: Fixed equation (10), fixed legend of Figure 6, fixed legend of Figure 9, added page numbers, fixed minor spelling mistakes

  24. The STRANDS Project: Long-Term Autonomy in Everyday Environments

    Authors: Nick Hawes, Chris Burbridge, Ferdian Jovan, Lars Kunze, Bruno Lacerda, Lenka Mudrová, Jay Young, Jeremy Wyatt, Denise Hebesberger, Tobias Körtner, Rares Ambrus, Nils Bore, John Folkesson, Patric Jensfelt, Lucas Beyer, Alexander Hermans, Bastian Leibe, Aitor Aldoma, Thomas Fäulhammer, Michael Zillich, Markus Vincze, Eris Chinellato, Muhannad Al-Omari, Paul Duckworth, Yiannis Gatsoulis , et al. (8 additional authors not shown)

    Abstract: Thanks to the efforts of the robotics and autonomous systems community, robots are becoming ever more capable. There is also an increasing demand from end-users for autonomous service robots that can operate in real environments for extended periods. In the STRANDS project we are tackling this demand head-on by integrating state-of-the-art artificial intelligence and robotics research into mobile… ▽ More

    Submitted 14 October, 2016; v1 submitted 15 April, 2016; originally announced April 2016.

  25. arXiv:1603.02636  [pdf, other

    cs.RO cs.CV cs.LG cs.NE

    DROW: Real-Time Deep Learning based Wheelchair Detection in 2D Range Data

    Authors: Lucas Beyer, Alexander Hermans, Bastian Leibe

    Abstract: We introduce the DROW detector, a deep learning based detector for 2D range data. Laser scanners are lighting invariant, provide accurate range data, and typically cover a large field of view, making them interesting sensors for robotics applications. So far, research on detection in laser range data has been dominated by hand-crafted features and boosted classifiers, potentially losing performanc… ▽ More

    Submitted 5 December, 2016; v1 submitted 8 March, 2016; originally announced March 2016.

    Comments: Lucas Beyer and Alexander Hermans contributed equally