Skip to main content

Showing 1–29 of 29 results for author: Metzen, J H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01790  [pdf, other

    cs.CV cs.AI cs.LG

    Label-free Neural Semantic Image Synthesis

    Authors: Jiayi Wang, Kevin Alexander Laube, Yumeng Li, Jan Hendrik Metzen, Shin-I Cheng, Julio Borges, Anna Khoreva

    Abstract: Recent work has shown great progress in integrating spatial conditioning to control large, pre-trained text-to-image diffusion models. Despite these advances, existing methods describe the spatial image content using hand-crafted conditioning inputs, which are either semantically ambiguous (e.g., edges) or require expensive manual annotations (e.g., semantic segmentation). To address these limitat… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2404.16637  [pdf, other

    cs.CV

    Zero-Shot Distillation for Image Encoders: How to Make Effective Use of Synthetic Data

    Authors: Niclas Popp, Jan Hendrik Metzen, Matthias Hein

    Abstract: Multi-modal foundation models such as CLIP have showcased impressive zero-shot capabilities. However, their applicability in resource-constrained environments is limited due to their large number of parameters and high inference time. While existing approaches have scaled down the entire CLIP architecture, we focus on training smaller variants of the image encoder, which suffices for efficient zer… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  3. arXiv:2404.07045  [pdf, other

    cs.CV

    Identification of Fine-grained Systematic Errors via Controlled Scene Generation

    Authors: Valentyn Boreiko, Matthias Hein, Jan Hendrik Metzen

    Abstract: Many safety-critical applications, especially in autonomous driving, require reliable object detectors. They can be very effectively assisted by a method to search for and identify potential failures and systematic errors before these detectors are deployed. Systematic errors are characterized by combinations of attributes such as object location, scale, orientation, and color, as well as the comp… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  4. arXiv:2309.16414  [pdf, other

    cs.CV cs.AI cs.LG

    AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models

    Authors: Jan Hendrik Metzen, Piyapat Saranrittichai, Chaithanya Kumar Mummadi

    Abstract: Classifiers built upon vision-language models such as CLIP have shown remarkable zero-shot performance across a broad range of image classification tasks. Prior work has studied different ways of automatically creating descriptor sets for every class based on prompt templates, ranging from manually engineered templates over templates obtained from a large language model to templates built from ran… ▽ More

    Submitted 29 September, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

  5. arXiv:2309.13489  [pdf, other

    cs.CV

    Identifying Systematic Errors in Object Detectors with the SCROD Pipeline

    Authors: Valentyn Boreiko, Matthias Hein, Jan Hendrik Metzen

    Abstract: The identification and removal of systematic errors in object detectors can be a prerequisite for their deployment in safety-critical applications like automated driving and robotics. Such systematic errors can for instance occur under very specific object poses (location, scale, orientation), object colors/textures, and backgrounds. Real images alone are unlikely to cover all relevant combination… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

  6. arXiv:2303.05072  [pdf, other

    cs.CV cs.AI cs.LG

    Identification of Systematic Errors of Image Classifiers on Rare Subgroups

    Authors: Jan Hendrik Metzen, Robin Hutmacher, N. Grace Hua, Valentyn Boreiko, Dan Zhang

    Abstract: Despite excellent average-case performance of many image classifiers, their performance can substantially deteriorate on semantically coherent subgroups of the data that were under-represented in the training data. These systematic errors can impact both fairness for demographic minority groups as well as robustness and safety under domain shift. A major challenge is to identify such subgroups wit… ▽ More

    Submitted 12 April, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  7. arXiv:2209.05980  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Certified Defences Against Adversarial Patch Attacks on Semantic Segmentation

    Authors: Maksym Yatsura, Kaspar Sakmann, N. Grace Hua, Matthias Hein, Jan Hendrik Metzen

    Abstract: Adversarial patch attacks are an emerging security threat for real world deep learning applications. We present Demasked Smoothing, the first approach (up to our knowledge) to certify the robustness of semantic segmentation models against this threat model. Previous work on certifiably defending against patch attacks has mostly focused on image classification task and often required changes in the… ▽ More

    Submitted 21 February, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: accepted at ICLR 2023

  8. arXiv:2203.13639  [pdf, other

    cs.CV

    Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness

    Authors: Giulio Lovisotto, Nicole Finnie, Mauricio Munoz, Chaithanya Kumar Mummadi, Jan Hendrik Metzen

    Abstract: Neural architectures based on attention such as vision transformers are revolutionizing image recognition. Their main benefit is that attention allows reasoning about all parts of a scene jointly. In this paper, we show how the global reasoning of (scaled) dot-product attention can be the source of a major vulnerability when confronted with adversarial patch attacks. We provide a theoretical under… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: to be published in IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022, CVPR22

    MSC Class: 68T07 ACM Class: I.4

  9. arXiv:2202.07242  [pdf, other

    cs.CV cs.LG

    Neural Architecture Search for Dense Prediction Tasks in Computer Vision

    Authors: Thomas Elsken, Arber Zela, Jan Hendrik Metzen, Benedikt Staffler, Thomas Brox, Abhinav Valada, Frank Hutter

    Abstract: The success of deep learning in recent years has lead to a rising demand for neural network architecture engineering. As a consequence, neural architecture search (NAS), which aims at automatically designing neural network architectures in a data-driven manner rather than manually, has evolved as a popular field of research. With the advent of weight sharing strategies across architectures, NAS ha… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  10. arXiv:2111.01714  [pdf, other

    cs.LG cs.AI cs.CV

    Meta-Learning the Search Distribution of Black-Box Random Search Based Adversarial Attacks

    Authors: Maksym Yatsura, Jan Hendrik Metzen, Matthias Hein

    Abstract: Adversarial attacks based on randomized search schemes have obtained state-of-the-art results in black-box robustness evaluation recently. However, as we demonstrate in this work, their efficiency in different query budget regimes depends on manual design and heuristic tuning of the underlying proposal distributions. We study how this issue can be addressed by adapting the proposal distribution on… ▽ More

    Submitted 22 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: accepted at NeurIPS 2021; updated the numbers in Table 5 and added references; added acknowledgements

  11. arXiv:2107.03719  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Bag of Tricks for Neural Architecture Search

    Authors: Thomas Elsken, Benedikt Staffler, Arber Zela, Jan Hendrik Metzen, Frank Hutter

    Abstract: While neural architecture search methods have been successful in previous years and led to new state-of-the-art performance on various problems, they have also been criticized for being unstable, being highly sensitive with respect to their hyperparameters, and often not performing better than random search. To shed some light on this issue, we discuss some practical considerations that help impro… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  12. arXiv:2106.14999  [pdf, other

    stat.ML cs.LG

    Test-Time Adaptation to Distribution Shift by Confidence Maximization and Input Transformation

    Authors: Chaithanya Kumar Mummadi, Robin Hutmacher, Kilian Rambach, Evgeny Levinkov, Thomas Brox, Jan Hendrik Metzen

    Abstract: Deep neural networks often exhibit poor performance on data that is unlikely under the train-time data distribution, for instance data affected by corruptions. Previous works demonstrate that test-time adaptation to data shift, for instance using entropy minimization, effectively improves performance on such shifted distributions. This paper focuses on the fully test-time adaptation setting, where… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: 16 pages, 5 figures, 7 tables

  13. arXiv:2104.09789  [pdf, other

    cs.CV

    Does enhanced shape bias improve neural network robustness to common corruptions?

    Authors: Chaithanya Kumar Mummadi, Ranjitha Subramaniam, Robin Hutmacher, Julien Vitay, Volker Fischer, Jan Hendrik Metzen

    Abstract: Convolutional neural networks (CNNs) learn to extract representations of complex features, such as object shapes and textures to solve image recognition tasks. Recent work indicates that CNNs trained on ImageNet are biased towards features that encode textures and that these alone are sufficient to generalize to unseen test data from the same distribution as the training data but often fail to gen… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: 20 pages, 9 figures, 12 tables, accepted at ICLR 2021

  14. arXiv:2102.04154  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Efficient Certified Defenses Against Patch Attacks on Image Classifiers

    Authors: Jan Hendrik Metzen, Maksym Yatsura

    Abstract: Adversarial patches pose a realistic threat model for physical world attacks on autonomous systems via their perception component. Autonomous systems in safety-critical domains such as automated driving should thus contain a fail-safe fallback component that combines certifiable robustness against patches with efficient inference while maintaining high performance on clean inputs. We propose BagCe… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: accepted at ICLR 2021

  15. arXiv:2101.11453  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Meta Adversarial Training against Universal Patches

    Authors: Jan Hendrik Metzen, Nicole Finnie, Robin Hutmacher

    Abstract: Recently demonstrated physical-world adversarial attacks have exposed vulnerabilities in perception systems that pose severe risks for safety-critical applications such as autonomous driving. These attacks place adversarial artifacts in the physical world that indirectly cause the addition of a universal patch to inputs of a model that can fool it in a variety of contexts. Adversarial training is… ▽ More

    Submitted 22 June, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted by the ICML 2021 workshop on "A Blessing in Disguise: The Prospects and Perils of Adversarial Machine Learning"

  16. Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers

    Authors: Christoph Kamann, Burkhard Güssefeld, Robin Hutmacher, Jan Hendrik Metzen, Carsten Rother

    Abstract: For safety-critical applications such as autonomous driving, CNNs have to be robust with respect to unavoidable image corruptions, such as image noise. While previous works addressed the task of robust prediction in the context of full-image classification, we consider it for dense semantic segmentation. We build upon an insight from image classification that output robustness can be improved by i… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  17. arXiv:2010.01401  [pdf, other

    cs.CV cs.AI cs.LG

    Adversarial and Natural Perturbations for General Robustness

    Authors: Sadaf Gulshad, Jan Hendrik Metzen, Arnold Smeulders

    Abstract: In this paper we aim to explore the general robustness of neural network classifiers by utilizing adversarial as well as natural perturbations. Different from previous works which mainly focus on studying the robustness of neural networks against adversarial perturbations, we also evaluate their robustness on natural perturbations before and after robustification. After standardizing the compariso… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: Currently under review

  18. arXiv:1911.11090  [pdf, other

    cs.LG stat.ML

    Meta-Learning of Neural Architectures for Few-Shot Learning

    Authors: Thomas Elsken, Benedikt Staffler, Jan Hendrik Metzen, Frank Hutter

    Abstract: The recent progress in neural architecture search (NAS) has allowed scaling the automated design of neural architectures to real-world domains, such as object detection and semantic segmentation. However, one prerequisite for the application of NAS are large amounts of labeled data and compute resources. This renders its application challenging in few-shot learning scenarios, where many related ta… ▽ More

    Submitted 14 June, 2021; v1 submitted 25 November, 2019; originally announced November 2019.

    Journal ref: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

  19. arXiv:1910.07416  [pdf, other

    cs.CV

    Understanding Misclassifications by Attributes

    Authors: Sadaf Gulshad, Zeynep Akata, Jan Hendrik Metzen, Arnold Smeulders

    Abstract: In this paper, we aim to understand and explain the decisions of deep neural networks by studying the behavior of predicted attributes when adversarial examples are introduced. We study the changes in attributes for clean as well as adversarial images in both standard and adversarially robust networks. We propose a metric to quantify the robustness of an adversarially robust network against advers… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.08279

  20. arXiv:1904.08279  [pdf, other

    cs.CV

    Interpreting Adversarial Examples with Attributes

    Authors: Sadaf Gulshad, Jan Hendrik Metzen, Arnold Smeulders, Zeynep Akata

    Abstract: Deep computer vision systems being vulnerable to imperceptible and carefully crafted noise have raised questions regarding the robustness of their decisions. We take a step back and approach this problem from an orthogonal direction. We propose to enable black-box neural networks to justify their reasoning both for clean and for adversarial examples by leveraging attributes, i.e. visually discrimi… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

  21. arXiv:1812.03705  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    Defending Against Universal Perturbations With Shared Adversarial Training

    Authors: Chaithanya Kumar Mummadi, Thomas Brox, Jan Hendrik Metzen

    Abstract: Classifiers such as deep neural networks have been shown to be vulnerable against adversarial perturbations on problems with high-dimensional input space. While adversarial training improves the robustness of image classifiers against such adversarial perturbations, it leaves them sensitive to perturbations on a non-negligible fraction of the inputs. In this work, we show that adversarial training… ▽ More

    Submitted 13 August, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: ICCV 2019, 8 main pages, 9 appendix pages, 16 figures, 2 tables

  22. arXiv:1808.05377  [pdf, other

    stat.ML cs.LG cs.NE

    Neural Architecture Search: A Survey

    Authors: Thomas Elsken, Jan Hendrik Metzen, Frank Hutter

    Abstract: Deep Learning has enabled remarkable progress over the last years on a variety of tasks, such as image recognition, speech recognition, and machine translation. One crucial aspect for this progress are novel neural architectures. Currently employed architectures have mostly been developed manually by human experts, which is a time-consuming and error-prone process. Because of this, there is growin… ▽ More

    Submitted 26 April, 2019; v1 submitted 16 August, 2018; originally announced August 2018.

    Journal ref: Journal of Machine Learning Research 20 (2019) 1-21

  23. arXiv:1805.12514  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Scaling provable adversarial defenses

    Authors: Eric Wong, Frank R. Schmidt, Jan Hendrik Metzen, J. Zico Kolter

    Abstract: Recent work has developed methods for learning deep network classifiers that are provably robust to norm-bounded adversarial perturbation; however, these methods are currently only possible for relatively small feedforward networks. In this paper, in an effort to scale these approaches to substantially larger models, we extend previous work in three main directions. First, we present a technique f… ▽ More

    Submitted 21 November, 2018; v1 submitted 31 May, 2018; originally announced May 2018.

  24. arXiv:1804.09081  [pdf, other

    stat.ML cs.LG

    Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution

    Authors: Thomas Elsken, Jan Hendrik Metzen, Frank Hutter

    Abstract: Neural Architecture Search aims at automatically finding neural architectures that are competitive with architectures designed by human experts. While recent approaches have achieved state-of-the-art predictive performance for image recognition, they are problematic under resource constraints for two reasons: (1)the neural architectures found are solely optimized for high predictive performance, w… ▽ More

    Submitted 26 February, 2019; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Published as a conference paper at ICLR, International Conference on Learning Representations, 2019

  25. arXiv:1704.05712  [pdf, other

    stat.ML cs.AI cs.CV cs.LG cs.NE

    Universal Adversarial Perturbations Against Semantic Image Segmentation

    Authors: Jan Hendrik Metzen, Mummadi Chaithanya Kumar, Thomas Brox, Volker Fischer

    Abstract: While deep learning is remarkably successful on perceptual tasks, it was also shown to be vulnerable to adversarial perturbations of the input. These perturbations denote noise added to the input that was generated specifically to fool the system while being quasi-imperceptible for humans. More severely, there even exist universal perturbations that are input-agnostic but fool the network on the m… ▽ More

    Submitted 31 July, 2017; v1 submitted 19 April, 2017; originally announced April 2017.

    Comments: Final version for ICCV including supplementary material

  26. arXiv:1703.01101  [pdf, other

    stat.ML cs.CR cs.CV cs.LG cs.NE

    Adversarial Examples for Semantic Image Segmentation

    Authors: Volker Fischer, Mummadi Chaithanya Kumar, Jan Hendrik Metzen, Thomas Brox

    Abstract: Machine learning methods in general and Deep Neural Networks in particular have shown to be vulnerable to adversarial perturbations. So far this phenomenon has mainly been studied in the context of whole-image classification. In this contribution, we analyse how adversarial perturbations can affect the task of semantic segmentation. We show how existing adversarial attackers can be transferred to… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: ICLR 2017 workshop submission

  27. arXiv:1702.04267  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    On Detecting Adversarial Perturbations

    Authors: Jan Hendrik Metzen, Tim Genewein, Volker Fischer, Bastian Bischoff

    Abstract: Machine learning and deep learning in particular has advanced tremendously on perceptual tasks in recent years. However, it remains vulnerable against adversarial perturbations of the input that have been crafted specifically to fool the system while being quasi-imperceptible to a human. In this work, we propose to augment deep neural networks with a small "detector" subnetwork which is trained on… ▽ More

    Submitted 21 February, 2017; v1 submitted 14 February, 2017; originally announced February 2017.

    Comments: Final version for ICLR2017 (see https://openreview.net/forum?id=SJzCSf9xg&noteId=SJzCSf9xg)

  28. arXiv:1602.01064  [pdf, other

    stat.ML cs.IT cs.LG cs.RO

    Minimum Regret Search for Single- and Multi-Task Optimization

    Authors: Jan Hendrik Metzen

    Abstract: We propose minimum regret search (MRS), a novel acquisition function for Bayesian optimization. MRS bears similarities with information-theoretic approaches such as entropy search (ES). However, while ES aims in each query at maximizing the information gain with respect to the global maximum, MRS aims at minimizing the expected simple regret of its ultimate recommendation for the optimum. While em… ▽ More

    Submitted 24 May, 2016; v1 submitted 2 February, 2016; originally announced February 2016.

    Comments: Final version for ICML 2016

  29. arXiv:1511.04211  [pdf, other

    stat.ML cs.LG

    Active Contextual Entropy Search

    Authors: Jan Hendrik Metzen

    Abstract: Contextual policy search allows adapting robotic movement primitives to different situations. For instance, a locomotion primitive might be adapted to different terrain inclinations or desired walking speeds. Such an adaptation is often achievable by modifying a small number of hyperparameters. However, learning, when performed on real robotic systems, is typically restricted to a small number of… ▽ More

    Submitted 16 November, 2015; v1 submitted 13 November, 2015; originally announced November 2015.

    Comments: Corrected title of reference #19