Showing 1–9 of 9 results for author: Beluch, W

Searching in archive cs.

  1. arXiv:2403.13501  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis

    Authors: Yumeng Li, William Beluch, Margret Keuper, Dan Zhang, Anna Khoreva

    Abstract: Despite tremendous progress in the field of text-to-video (T2V) synthesis, open-sourced T2V diffusion models struggle to generate longer videos with dynamically varying and evolving content. They tend to synthesize quasi-static videos, ignoring the necessary visual change-over-time implied in the text prompt. At the same time, scaling these models to enable longer, more dynamic video synthesis oft…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Project page: https://yumengli007.github.io/VSTAR

  2. arXiv:2308.09965  [pdf, other]

    cs.CV cs.AI cs.LG

    Anomaly-Aware Semantic Segmentation via Style-Aligned OoD Augmentation

    Authors: Dan Zhang, Kaspar Sakmann, William Beluch, Robin Hutmacher, Yumeng Li

    Abstract: Within the context of autonomous driving, encountering unknown objects becomes inevitable during deployment in the open world. Therefore, it is crucial to equip standard semantic segmentation models with anomaly awareness. Many previous approaches have utilized synthetic out-of-distribution (OoD) data augmentation to tackle this problem. In this work, we advance the OoD synthesis process by reduci…

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV2023 Workshop on Robustness and Reliability of Autonomous Vehicles in the Open-world (BRAVO)

  3. arXiv:2210.04675  [pdf, other]

    cs.CL cs.AI

    A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing

    Authors: Sophie Henning, William Beluch, Alexander Fraser, Annemarie Friedrich

    Abstract: Many natural language processing (NLP) tasks are naturally imbalanced, as some target categories occur much more frequently than others in the real world. In such scenarios, current NLP models still tend to perform poorly on less frequent classes. Addressing class imbalance in NLP is an active research topic, yet, finding a good approach for a particular task and imbalance scenario is difficult.…

    Submitted 22 February, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Camera-ready version for EACL 2023

  4. arXiv:2109.12851  [pdf, other]

    cs.LG eess.SP

    Improving Uncertainty of Deep Learning-based Object Classification on Radar Spectra using Label Smoothing

    Authors: Kanil Patel, William Beluch, Kilian Rambach, Michael Pfeiffer, Bin Yang

    Abstract: Object type classification for automotive radar has greatly improved with recent deep learning (DL) solutions; however, these developments have mostly focused on the classification accuracy. Before employing DL solutions in safety-critical applications, such as automated driving, an indispensable prerequisite is the accurate quantification of the classifiers' reliability. Unfortunately, DL classifi…

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: Submitted to IEEE Radar Conference 2022

  5. arXiv:2108.05779  [pdf, other]

    cs.CV

    DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities

    Authors: Elias Eulig, Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Kilian Rambach, William Beluch, Xiahan Shi, Volker Fischer

    Abstract: Common deep neural networks (DNNs) for image classification have been shown to rely on shortcut opportunities (SO) in the form of predictive and easy-to-represent visual factors. This is known as shortcut learning and leads to impaired generalization. In this work, we show that common DNNs also suffer from shortcut learning when predicting only basic visual object factors of variation (FoV) such a…

    Submitted 8 October, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: Accepted for publication at IEEE International Conference on Computer Vision (ICCV) 2021; updated affiliations & corrected typo

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 10655-10664

  6. arXiv:2106.05870  [pdf, other]

    cs.LG cs.AI

    Investigation of Uncertainty of Deep Learning-based Object Classification on Radar Spectra

    Authors: Kanil Patel, William Beluch, Kilian Rambach, Adriana-Eliza Cozma, Michael Pfeiffer, Bin Yang

    Abstract: Deep learning (DL) has recently attracted increasing interest to improve object type classification for automotive radar. In addition to high accuracy, it is crucial for decision making in autonomous vehicles to evaluate the reliability of the predictions; however, decisions of DL networks are non-transparent. Current DL research has investigated how uncertainties of predictions can be quantified,…

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 6 pages

    Journal ref: IEEE Radar Conference 2021

  7. arXiv:2006.13092  [pdf, other]

    cs.LG stat.ML

    Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

    Authors: Kanil Patel, William Beluch, Bin Yang, Michael Pfeiffer, Dan Zhang

    Abstract: Post-hoc multi-class calibration is a common approach for providing high-quality confidence estimates of deep neural network predictions. Recent work has shown that widely used scaling methods underestimate their calibration error, while alternative Histogram Binning (HB) methods often fail to preserve classification accuracy. When classes have small prior probabilities, HB also faces the issue of…

    Submitted 24 November, 2022; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Accepted at International Conference on Learning Representations, ICLR 2021. Openreview link: https://openreview.net/forum?id=AICNpd8ke-m

  8. arXiv:1912.07458  [pdf, other]

    cs.LG stat.ML

    On-manifold Adversarial Data Augmentation Improves Uncertainty Calibration

    Authors: Kanil Patel, William Beluch, Dan Zhang, Michael Pfeiffer, Bin Yang

    Abstract: Uncertainty estimates help to identify ambiguous, novel, or anomalous inputs, but the reliable quantification of uncertainty has proven to be challenging for modern deep networks. In order to improve uncertainty estimation, we propose On-Manifold Adversarial Data Augmentation or OMADA, which specifically attempts to generate the most challenging examples by following an on-manifold adversarial att…

    Submitted 14 January, 2021; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: Accepted for oral at International Conference on Pattern Recognition, ICPR 2020. Nominated (top 4) for Best Industry Related Paper Award (BIRPA)

  9. arXiv:1906.11876  [pdf, other]

    cs.CV cs.LG stat.ML

    Uncertainty Based Detection and Relabeling of Noisy Image Labels

    Authors: Jan M. Köhler, Maximilian Autenrieth, William H. Beluch

    Abstract: Deep neural networks (DNNs) are powerful tools in computer vision tasks. However, in many realistic scenarios label noise is prevalent in the training images, and overfitting to these noisy labels can significantly harm the generalization performance of DNNs. We propose a novel technique to identify data with noisy labels based on the different distributions of the predictive uncertainties from a…

    Submitted 29 May, 2019; originally announced June 2019.

    Comments: Uncertainty and Robustness in Deep Visual Learning Workshop at CVPR 2019