Skip to main content

Showing 1–13 of 13 results for author: Schmarje, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.12189  [pdf, other

    cs.CV

    Annotating Ambiguous Images: General Annotation Strategy for High-Quality Data with Real-World Biomedical Validation

    Authors: Lars Schmarje, Vasco Grossmann, Claudius Zelenka, Johannes Brünger, Reinhard Koch

    Abstract: In the field of image classification, existing methods often struggle with biased or ambiguous data, a prevalent issue in real-world scenarios. Current strategies, including semi-supervised learning and class blending, offer partial solutions but lack a definitive resolution. Addressing this gap, our paper introduces a novel strategy for generating high-quality labels in challenging datasets. Cent… ▽ More

    Submitted 29 April, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: Accepted at ICLR 2024, DMLR Workshop

  2. arXiv:2305.12811  [pdf, other

    cs.CV

    Label Smarter, Not Harder: CleverLabel for Faster Annotation of Ambiguous Image Classification with Higher Quality

    Authors: Lars Schmarje, Vasco Grossmann, Tim Michels, Jakob Nazarenus, Monty Santarossa, Claudius Zelenka, Reinhard Koch

    Abstract: High-quality data is crucial for the success of machine learning, but labeling large datasets is often a time-consuming and costly process. While semi-supervised learning can help mitigate the need for labeled data, label quality remains an open issue due to ambiguity and disagreement among annotators. Thus, we use proposal-guided annotations as one option which leads to more consistency between a… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  3. Opportunistic hip fracture risk prediction in Men from X-ray: Findings from the Osteoporosis in Men (MrOS) Study

    Authors: Lars Schmarje, Stefan Reinhold, Timo Damm, Eric Orwoll, Claus-C. Glüer, Reinhard Koch

    Abstract: Osteoporosis is a common disease that increases fracture risk. Hip fractures, especially in elderly people, lead to increased morbidity, decreased quality of life and increased mortality. Being a silent disease before fracture, osteoporosis often remains undiagnosed and untreated. Areal bone mineral density (aBMD) assessed by dual-energy X-ray absorptiometry (DXA) is the gold-standard method for o… ▽ More

    Submitted 6 October, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Oral Presentation at MICCAI 2022 Workshop (PRIME), Considered for best paper award Predictive Intelligence in Medicine. PRIME 2022. Lecture Notes in Computer Science, vol 13564

  4. arXiv:2207.06224  [pdf, other

    cs.CV cs.LG

    Beyond Hard Labels: Investigating data label distributions

    Authors: Vasco Grossmann, Lars Schmarje, Reinhard Koch

    Abstract: High-quality data is a key aspect of modern machine learning. However, labels generated by humans suffer from issues like label noise and class ambiguities. We raise the question of whether hard labels are sufficient to represent the underlying ground truth distribution in the presence of these inherent imprecision. Therefore, we compare the disparity of learning with hard and soft labels quantita… ▽ More

    Submitted 6 October, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: https://icml.cc/virtual/2022/workshop/13477

    Journal ref: ICML 2022 Workshop DataPerf: Benchmarking Data for Data-Centric AI

  5. arXiv:2207.06214  [pdf, other

    cs.CV

    Is one annotation enough? A data-centric image classification benchmark for noisy and ambiguous label estimation

    Authors: Lars Schmarje, Vasco Grossmann, Claudius Zelenka, Sabine Dippel, Rainer Kiko, Mariusz Oszust, Matti Pastell, Jenny Stracke, Anna Valros, Nina Volkmann, Reinhard Koch

    Abstract: High-quality data is necessary for modern machine learning. However, the acquisition of such data is difficult due to noisy and ambiguous annotations of humans. The aggregation of such annotations to determine the label of an image leads to a lower data quality. We propose a data-centric image classification benchmark with ten real-world datasets and multiple annotations per image to allow researc… ▽ More

    Submitted 4 November, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at NeurIPS 2022, Benchmark and Dataset Track, Code and Link to data available at https://github.com/Emprime/dcic

  6. Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy

    Authors: Lars Schmarje, Johannes Brünger, Monty Santarossa, Simon-Martin Schröder, Rainer Kiko, Reinhard Koch

    Abstract: Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct cla… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: Source code: https://github.com/Emprime/FuzzyOverclustering Datasets: https://doi.org/10.5281/zenodo.5550918. arXiv admin note: substantial text overlap with arXiv:2012.01768

    Journal ref: Sensors 2021, 21(19), 6661

  7. arXiv:2110.06592  [pdf, other

    cs.CV cs.AI

    Life is not black and white -- Combining Semi-Supervised Learning with fuzzy labels

    Authors: Lars Schmarje, Reinhard Koch

    Abstract: The required amount of labeled data is one of the biggest issues in deep learning. Semi-Supervised Learning can potentially solve this issue by using additional unlabeled data. However, many datasets suffer from variability in the annotations. The aggregated labels from these annotation are not consistent between different annotators and thus are considered fuzzy. These fuzzy labels are often not… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted at LWDA 21: Lernen, Wissen, Daten, Analysen September 2021, Munich, Germany

  8. arXiv:2107.03070  [pdf, other

    cs.CV

    Learning Stixel-based Instance Segmentation

    Authors: Monty Santarossa, Lukas Schneider, Claudius Zelenka, Lars Schmarje, Reinhard Koch, Uwe Franke

    Abstract: Stixels have been successfully applied to a wide range of vision tasks in autonomous driving, recently including instance segmentation. However, due to their sparse occurrence in the image, until now Stixels seldomly served as input for Deep Learning algorithms, restricting their utility for such approaches. In this work we present StixelPointNet, a novel method to perform fast instance segmentati… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in IEEE Intelligent Vehicles Symposium

  9. arXiv:2106.16209  [pdf, other

    cs.CV

    A data-centric approach for improving ambiguous labels with combined semi-supervised classification and clustering

    Authors: Lars Schmarje, Monty Santarossa, Simon-Martin Schröder, Claudius Zelenka, Rainer Kiko, Jenny Stracke, Nina Volkmann, Reinhard Koch

    Abstract: Consistently high data quality is essential for the development of novel loss functions and architectures in the field of deep learning. The existence of such data and labels is usually presumed, while acquiring high-quality datasets is still a major issue in many cases. In real-world datasets we often encounter ambiguous labels due to subjective annotations by annotators. In our data-centric appr… ▽ More

    Submitted 6 October, 2022; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: Source code is available at https://github.com/Emprime/dc3, Datasets available at https://doi.org/10.5281/zenodo.5550916

    Journal ref: Proceedings of the European Conference on Computer Vision (ECCV 2022)

  10. Beyond Cats and Dogs: Semi-supervised Classification of fuzzy labels with overclustering

    Authors: Lars Schmarje, Johannes Brünger, Monty Santarossa, Simon-Martin Schröder, Rainer Kiko, Reinhard Koch

    Abstract: A long-standing issue with deep learning is the need for large and consistently labeled datasets. Although the current research in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes like cats and dogs. However, in the real-world we often encounter problems where different experts have differe… ▽ More

    Submitted 19 October, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Reworked version available at arXiv:2110.06630, Published in Sensors 2021 (see DOI link)

  11. A survey on Semi-, Self- and Unsupervised Learning for Image Classification

    Authors: Lars Schmarje, Monty Santarossa, Simon-Martin Schröder, Reinhard Koch

    Abstract: While deep learning strategies achieve outstanding results in computer vision tasks, one issue remains: The current strategies rely heavily on a huge amount of labeled data. In many real-world problems, it is not feasible to create such an amount of labeled training data. Therefore, it is common to incorporate unlabeled data into the training process to reach equal results with fewer labels. Due t… ▽ More

    Submitted 25 May, 2021; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted to IEEE Access 2021

    Journal ref: IEEE Access 2021

  12. 2D and 3D Segmentation of uncertain local collagen fiber orientations in SHG microscopy

    Authors: Lars Schmarje, Claudius Zelenka, Ulf Geisen, Claus-C. Glüer, Reinhard Koch

    Abstract: Collagen fiber orientations in bones, visible with Second Harmonic Generation (SHG) microscopy, represent the inner structure and its alteration due to influences like cancer. While analyses of these orientations are valuable for medical research, it is not feasible to analyze the needed large amounts of local orientations manually. Since we have uncertain borders for these local orientations only… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Journal ref: DAGM GCPR 2019

  13. arXiv:1705.04587  [pdf, ps, other

    cs.CC

    Complexity and Inapproximability Results for Parallel Task Scheduling and Strip Packing

    Authors: Sören Henning, Klaus Jansen, Malin Rau, Lars Schmarje

    Abstract: We study the Parallel Task Scheduling problem $Pm|size_j|C_{\max}$ with a constant number of machines. This problem is known to be strongly NP-complete for each $m \geq 5$, while it is solvable in pseudo-polynomial time for each $m \leq 3$. We give a positive answer to the long-standing open question whether this problem is strongly $NP$-complete for $m=4$. As a second result, we improve the lower… ▽ More

    Submitted 12 May, 2017; originally announced May 2017.