Skip to main content

Showing 1–9 of 9 results for author: Schön, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09288  [pdf, other

    cs.CV

    WSESeg: Introducing a Dataset for the Segmentation of Winter Sports Equipment with a Baseline for Interactive Segmentation

    Authors: Robin Schön, Daniel Kienzle, Rainer Lienhart

    Abstract: In this paper we introduce a new dataset containing instance segmentation masks for ten different categories of winter sports equipment, called WSESeg (Winter Sports Equipment Segmentation). Furthermore, we carry out interactive segmentation experiments on said dataset to explore possibilities for efficient further labeling. The SAM and HQ-SAM models are conceptualized as foundation models for per… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 7 pages, 1 figure, 3 tables, Accepted at CBMI 2024

  2. arXiv:2405.14467  [pdf, other

    cs.CV cs.AI cs.LG

    Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation

    Authors: Daniel Kienzle, Marco Kantonis, Robin Schön, Rainer Lienhart

    Abstract: Utilizing transformer architectures for semantic segmentation of high-resolution images is hindered by the attention's quadratic computational complexity in the number of tokens. A solution to this challenge involves decreasing the number of tokens through token merging, which has exhibited remarkable enhancements in inference speed, training efficiency, and memory utilization for image classifica… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 7 pages, to be published in IEEE International Conference on Multimedia Information Processing and Retrieval (MIPR) 2024

  3. arXiv:2404.09616  [pdf, other

    cs.CV cs.LG

    A Review and Efficient Implementation of Scene Graph Generation Metrics

    Authors: Julian Lorenz, Robin Schön, Katja Ludwig, Rainer Lienhart

    Abstract: Scene graph generation has emerged as a prominent research field in computer vision, witnessing significant advancements in the recent years. However, despite these strides, precise and thorough definitions for the metrics used to evaluate scene graph generation models are lacking. In this paper, we address this gap in the literature by providing a review and precise definition of commonly used me… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2404.08421  [pdf, other

    cs.CV

    Adapting the Segment Anything Model During Usage in Novel Situations

    Authors: Robin Schön, Julian Lorenz, Katja Ludwig, Rainer Lienhart

    Abstract: The interactive segmentation task consists in the creation of object segmentation masks based on user interactions. The most common way to guide a model towards producing a correct segmentation consists in clicks on the object and background. The recently published Segment Anything Model (SAM) supports a generalized version of the interactive segmentation problem and has been trained on an object… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 11 pages, 2 figures, 4 tables

  5. arXiv:2306.10484  [pdf, other

    eess.IV cs.CV

    The STOIC2021 COVID-19 AI challenge: applying reusable training methodologies to private data

    Authors: Luuk H. Boulogne, Julian Lorenz, Daniel Kienzle, Robin Schon, Katja Ludwig, Rainer Lienhart, Simon Jegou, Guang Li, Cong Chen, Qi Wang, Derik Shi, Mayug Maniparambil, Dominik Muller, Silvan Mertes, Niklas Schroter, Fabio Hellmann, Miriam Elia, Ine Dirks, Matias Nicolas Bossa, Abel Diaz Berenguer, Tanmoy Mukherjee, Jef Vandemeulebroucke, Hichem Sahli, Nikos Deligiannis, Panagiotis Gonidakis , et al. (13 additional authors not shown)

    Abstract: Challenges drive the state-of-the-art of automated medical image analysis. The quantity of public training data that they provide can limit the performance of their solutions. Public access to the training methodology for these solutions remains absent. This study implements the Type Three (T3) challenge format, which allows for training solutions on private data and guarantees reusable training m… ▽ More

    Submitted 25 June, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

  6. arXiv:2304.05716  [pdf, other

    cs.CV

    Impact of Pseudo Depth on Open World Object Segmentation with Minimal User Guidance

    Authors: Robin Schön, Katja Ludwig, Rainer Lienhart

    Abstract: Pseudo depth maps are depth map predicitions which are used as ground truth during training. In this paper we leverage pseudo depth maps in order to segment objects of classes that have never been seen during training. This renders our object segmentation task an open world task. The pseudo depth maps are generated using pretrained networks, which have either been trained with the full intention t… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to L3D-IVU Workshop at CVPR 2023

  7. arXiv:2304.02939  [pdf, other

    cs.CV

    All Keypoints You Need: Detecting Arbitrary Keypoints on the Body of Triple, High, and Long Jump Athletes

    Authors: Katja Ludwig, Julian Lorenz, Robin Schön, Rainer Lienhart

    Abstract: Performance analyses based on videos are commonly used by coaches of athletes in various sports disciplines. In individual sports, these analyses mainly comprise the body posture. This paper focuses on the disciplines of triple, high, and long jump, which require fine-grained locations of the athlete's body. Typical human pose estimation datasets provide only a very limited set of keypoints, which… ▽ More

    Submitted 10 May, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted at CVSports23 (Workshop at CVPR 23)

  8. arXiv:2210.10426  [pdf, other

    cs.CV

    Pseudo-Label Noise Suppression Techniques for Semi-Supervised Semantic Segmentation

    Authors: Sebastian Scherer, Robin Schön, Rainer Lienhart

    Abstract: Semi-supervised learning (SSL) can reduce the need for large labelled datasets by incorporating unlabelled data into the training. This is particularly interesting for semantic segmentation, where labelling data is very costly and time-consuming. Current SSL approaches use an initially supervised trained model to generate predictions for unlabelled images, called pseudo-labels, which are subsequen… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to BMVC 2022

  9. arXiv:2206.15073  [pdf, other

    eess.IV cs.CV

    COVID Detection and Severity Prediction with 3D-ConvNeXt and Custom Pretrainings

    Authors: Daniel Kienzle, Julian Lorenz, Robin Schön, Katja Ludwig, Rainer Lienhart

    Abstract: Since COVID strongly affects the respiratory system, lung CT-scans can be used for the analysis of a patients health. We introduce a neural network for the prediction of the severity of lung damage and the detection of a COVID-infection using three-dimensional CT-data. Therefore, we adapt the recent ConvNeXt model to process three-dimensional data. Furthermore, we design and analyze different pret… ▽ More

    Submitted 17 August, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: 17 pages, 3 figures, informations about challenge submission