Skip to main content

Showing 1–13 of 13 results for author: Huseljic, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18621  [pdf, other

    cs.SD cs.AI eess.AS

    Towards Deep Active Learning in Avian Bioacoustics

    Authors: Lukas Rauch, Denis Huseljic, Moritz Wirth, Jens Decke, Bernhard Sick, Christoph Scholz

    Abstract: Passive acoustic monitoring (PAM) in avian bioacoustics enables cost-effective and extensive data collection with minimal disruption to natural habitats. Despite advancements in computational avian bioacoustics, deep learning models continue to encounter challenges in adapting to diverse environments in practical PAM scenarios. This is primarily due to the scarcity of annotations, which requires l… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: preprint, under review IAL@ECML-PKDD24

  2. arXiv:2405.03386  [pdf, other

    cs.LG

    Annot-Mix: Learning with Noisy Class Labels from Multiple Annotators via a Mixup Extension

    Authors: Marek Herde, Lukas Lührs, Denis Huseljic, Bernhard Sick

    Abstract: Training with noisy class labels impairs neural networks' generalization performance. In this context, mixup is a popular regularization technique to improve training robustness by making memorizing false class labels more difficult. However, mixup neglects that, typically, multiple annotators, e.g., crowdworkers, provide class labels. Therefore, we propose an extension of mixup, which handles mul… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Under review

    ACM Class: I.2.6; I.5.1

  3. arXiv:2404.08981  [pdf, other

    cs.CV cs.LG

    Fast Fishing: Approximating BAIT for Efficient and Scalable Deep Active Image Classification

    Authors: Denis Huseljic, Paul Hahn, Marek Herde, Lukas Rauch, Bernhard Sick

    Abstract: Deep active learning (AL) seeks to minimize the annotation costs for training deep neural networks. BAIT, a recently proposed AL strategy based on the Fisher Information, has demonstrated impressive performance across various datasets. However, BAIT's high computational and memory requirements hinder its applicability on large-scale classification tasks, resulting in current research neglecting BA… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  4. arXiv:2403.10380  [pdf, other

    cs.SD cs.AI eess.AS

    BirdSet: A Dataset and Benchmark for Classification in Avian Bioacoustics

    Authors: Lukas Rauch, Raphael Schwinger, Moritz Wirth, René Heinrich, Denis Huseljic, Jonas Lange, Stefan Kahl, Bernhard Sick, Sven Tomforde, Christoph Scholz

    Abstract: Deep learning (DL) models have emerged as a powerful tool in avian bioacoustics to assess environmental health. To maximize the potential of cost-effective and minimal-invasive passive acoustic monitoring (PAM), DL models must analyze bird vocalizations across a wide range of species and environmental conditions. However, data fragmentation challenges a comprehensive evaluation of generalization p… ▽ More

    Submitted 17 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Under review @NeurIPS2024 Datasets & Benchmarks

  5. arXiv:2309.06159  [pdf, other

    cs.CV

    Active Label Refinement for Semantic Segmentation of Satellite Images

    Authors: Tuan Pham Minh, Jayan Wijesingha, Daniel Kottke, Marek Herde, Denis Huseljic, Bernhard Sick, Michael Wachendorf, Thomas Esch

    Abstract: Remote sensing through semantic segmentation of satellite images contributes to the understanding and utilisation of the earth's surface. For this purpose, semantic segmentation networks are typically trained on large sets of labelled satellite images. However, obtaining expert labels for these images is costly. Therefore, we propose to rely on a low-cost approach, e.g. crowdsourcing or pretrained… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  6. arXiv:2306.10087  [pdf, other

    cs.LG cs.AI

    ActiveGLAE: A Benchmark for Deep Active Learning with Transformers

    Authors: Lukas Rauch, Matthias Aßenmacher, Denis Huseljic, Moritz Wirth, Bernd Bischl, Bernhard Sick

    Abstract: Deep active learning (DAL) seeks to reduce annotation costs by enabling the model to actively query instance annotations from which it expects to learn the most. Despite extensive research, there is currently no standardized evaluation protocol for transformer-based language models in the field of DAL. Diverse experimental settings lead to difficulties in comparing research and deriving recommenda… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted @ ECML PKDD 2023. This is the author's version of the work. The definitive Version of Record will be published in the Proceedings of ECML PKDD 2023

  7. arXiv:2304.02539  [pdf, other

    cs.LG

    Multi-annotator Deep Learning: A Probabilistic Framework for Classification

    Authors: Marek Herde, Denis Huseljic, Bernhard Sick

    Abstract: Solving complex classification tasks using deep neural networks typically requires large amounts of annotated data. However, corresponding class labels are noisy when provided by error-prone annotators, e.g., crowdworkers. Training standard deep neural networks leads to subpar performances in such multi-annotator supervised learning settings. We address this issue by presenting a probabilistic tra… ▽ More

    Submitted 23 October, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Transactions on Machine Learning Research, see https://openreview.net/forum?id=MgdoxzImlK

    ACM Class: I.2.6; I.5.1

    Journal ref: Transactions on Machine Learning Research, 2023

  8. arXiv:2210.06112  [pdf, other

    cs.LG

    Fast Bayesian Updates for Deep Learning with a Use Case in Active Learning

    Authors: Marek Herde, Zhixin Huang, Denis Huseljic, Daniel Kottke, Stephan Vogt, Bernhard Sick

    Abstract: Retraining deep neural networks when new data arrives is typically computationally expensive. Moreover, certain applications do not allow such costly retraining due to time or computational constraints. Fast Bayesian updates are a possible solution to this issue. Therefore, we propose a Bayesian update based on Monte-Carlo samples and a last-layer Laplace approximation for different Bayesian neura… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 25 pages, 10 figures, submitted to ICLR

  9. arXiv:2210.02935  [pdf, other

    cs.CV

    A Review of Uncertainty Calibration in Pretrained Object Detectors

    Authors: Denis Huseljic, Marek Herde, Mehmet Muejde, Bernhard Sick

    Abstract: In the field of deep learning based computer vision, the development of deep object detection has led to unique paradigms (e.g., two-stage or set-based) and architectures (e.g., Faster-RCNN or DETR) which enable outstanding performance on challenging benchmark datasets. Despite this, the trained object detectors typically do not reliably assess uncertainty regarding their own knowledge, and the qu… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: 17 pages, 6 figures, submitted to IJCV

    ACM Class: I.4.0; I.5.0

  10. A Survey on Cost Types, Interaction Schemes, and Annotator Performance Models in Selection Algorithms for Active Learning in Classification

    Authors: Marek Herde, Denis Huseljic, Bernhard Sick, Adrian Calma

    Abstract: Pool-based active learning (AL) aims to optimize the annotation process (i.e., labeling) as the acquisition of annotations is often time-consuming and therefore expensive. For this purpose, an AL strategy queries annotations intelligently from annotators to train a high-performance classification model at a low annotation cost. Traditional AL strategies operate in an idealized framework. They assu… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Journal ref: IEEE Access 9 (2021) 166970-166989

  11. arXiv:2105.02965  [pdf, other

    cs.LG cs.AI

    Out-of-distribution Detection and Generation using Soft Brownian Offset Sampling and Autoencoders

    Authors: Felix Möller, Diego Botache, Denis Huseljic, Florian Heidecker, Maarten Bieshaar, Bernhard Sick

    Abstract: Deep neural networks often suffer from overconfidence which can be partly remedied by improved out-of-distribution detection. For this purpose, we propose a novel approach that allows for the generation of out-of-distribution datasets based on a given in-distribution dataset. This new dataset can then be used to improve out-of-distribution detection for the given dataset and machine learning task… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: 10 pages, 7 figures, accepted for publication at CVPR 2021 Workshop Safe Artificial Intelligence for Automated Driving (SAIAD)

  12. arXiv:2006.01732  [pdf, other

    cs.LG stat.ML

    Toward Optimal Probabilistic Active Learning Using a Bayesian Approach

    Authors: Daniel Kottke, Marek Herde, Christoph Sandrock, Denis Huseljic, Georg Krempl, Bernhard Sick

    Abstract: Gathering labeled data to train well-performing machine learning models is one of the critical challenges in many applications. Active learning aims at reducing the labeling costs by an efficient and effective allocation of costly labeling resources. In this article, we propose a decision-theoretic selection strategy that (1) directly optimizes the gain in misclassification error, and (2) uses a B… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: 11 pages, 8 figures, appendix

    MSC Class: 68T05 ACM Class: I.2.6

  13. arXiv:1901.10338  [pdf, other

    cs.LG stat.ML

    Limitations of Assessing Active Learning Performance at Runtime

    Authors: Daniel Kottke, Jim Schellinger, Denis Huseljic, Bernhard Sick

    Abstract: Classification algorithms aim to predict an unknown label (e.g., a quality class) for a new instance (e.g., a product). Therefore, training samples (instances and labels) are used to deduct classification hypotheses. Often, it is relatively easy to capture instances but the acquisition of the corresponding labels remain difficult or expensive. Active learning algorithms select the most beneficial… ▽ More

    Submitted 29 January, 2019; originally announced January 2019.