Showing 1–14 of 14 results for author: Zimmermann, R S

  1. arXiv:2407.00143

    cs.LG cs.CV stat.ML

    InfoNCE: Identifying the Gap Between Theory and Practice

    Authors: Evgenia Rusak, Patrik Reizinger, Attila Juhos, Oliver Bringmann, Roland S. Zimmermann, Wieland Brendel

    Abstract: Previous theoretical work on contrastive learning (CL) with InfoNCE showed that, under certain assumptions, the learned representations uncover the ground-truth latent factors. We argue these theories overlook crucial aspects of how CL is deployed in practice. Specifically, they assume that within a positive pair, all latent factors either vary to a similar extent, or that some do not vary at all.…

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2307.05471

    cs.CV

    Scale Alone Does not Improve Mechanistic Interpretability in Vision Models

    Authors: Roland S. Zimmermann, Thomas Klein, Wieland Brendel

    Abstract: In light of the recent widespread adoption of AI systems, understanding the internal information processing of neural networks has become increasingly critical. Most recently, machine vision has seen remarkable progress by scaling neural networks to unprecedented levels in dataset and model size. We here ask whether this extraordinary increase in scale also positively impacts the field of mechanis…

    Submitted 30 March, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Spotlight at NeurIPS 2023. The first two authors contributed equally. Code available at https://brendel-group.github.io/imi/

  3. arXiv:2306.04719

    cs.CV cs.AI cs.HC cs.LG q-bio.NC

    Don't trust your eyes: on the (un)reliability of feature visualizations

    Authors: Robert Geirhos, Roland S. Zimmermann, Blair Bilodeau, Wieland Brendel, Been Kim

    Abstract: How do neural networks extract patterns from pixels? Feature visualizations attempt to answer this important question by visualizing highly activating patterns through optimization. Today, visualization methods form the foundation of our knowledge about the internal workings of neural networks, as a type of mechanistic interpretability. Here we ask: How reliable are feature visualizations? We star…

    Submitted 6 June, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: ICML 2024 camera ready version

  4. arXiv:2305.18890

    cs.CV cs.LG

    Sensitivity of Slot-Based Object-Centric Models to their Number of Slots

    Authors: Roland S. Zimmermann, Sjoerd van Steenkiste, Mehdi S. M. Sajjadi, Thomas Kipf, Klaus Greff

    Abstract: Self-supervised methods for learning object-centric representations have recently been applied successfully to various datasets. This progress is largely fueled by slot-based methods, whose ability to cluster visual scenes into meaningful objects holds great promise for compositional generalization and downstream learning. In these methods, the number of slots (clusters) $K$ is typically chosen to…

    Submitted 30 May, 2023; originally announced May 2023.

  5. arXiv:2305.14229

    cs.LG cs.CV

    Provably Learning Object-Centric Representations

    Authors: Jack Brady, Roland S. Zimmermann, Yash Sharma, Bernhard Schölkopf, Julius von Kügelgen, Wieland Brendel

    Abstract: Learning structured representations of the visual world in terms of objects promises to significantly improve the generalization abilities of current machine learning models. While recent efforts to this end have shown promising empirical progress, a theoretical account of when unsupervised object-centric representation learning is possible is still lacking. Consequently, understanding the reasons…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Oral at ICML 2023. The first two authors as well as the last two authors contributed equally. Code is available at https://brendel-group.github.io/objects-identifiability

  6. arXiv:2206.13991

    cs.LG cs.CR cs.CV

    Increasing Confidence in Adversarial Robustness Evaluations

    Authors: Roland S. Zimmermann, Wieland Brendel, Florian Tramer, Nicholas Carlini

    Abstract: Hundreds of defenses have been proposed to make deep neural networks robust against minimal (adversarial) input perturbations. However, only a handful of these defenses held up their claims because correctly evaluating robustness is extremely challenging: Weak attacks often fail to find adversarial examples even if they unknowingly exist, thereby making a vulnerable network look robust. In this pa…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Oral at CVPR 2022 Workshop (Art of Robustness). Project website https://zimmerrol.github.io/active-tests/

  7. arXiv:2110.00473

    stat.ML cs.CV cs.LG

    Score-Based Generative Classifiers

    Authors: Roland S. Zimmermann, Lukas Schott, Yang Song, Benjamin A. Dunn, David A. Klindt

    Abstract: The tremendous success of generative models in recent years raises the question whether they can also be used to perform classification. Generative models have been used as adversarially robust classifiers on simple datasets such as MNIST, but this robustness has not been observed on more complex datasets like CIFAR-10. Additionally, on natural image datasets, previous results have suggested a tra…

    Submitted 11 December, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: Published at https://dgms-and-applications.github.io/2021/. Project website: https://zimmerrol.github.io/SBGC/

  8. arXiv:2106.12447

    cs.CV cs.AI cs.HC cs.LG

    How Well do Feature Visualizations Support Causal Understanding of CNN Activations?

    Authors: Roland S. Zimmermann, Judy Borowski, Robert Geirhos, Matthias Bethge, Thomas S. A. Wallis, Wieland Brendel

    Abstract: A precise understanding of why units in an artificial network respond to certain stimuli would constitute a big step towards explainable artificial intelligence. One widely used approach towards this goal is to visualize unit responses via activation maximization. These synthetic feature visualizations are purported to provide humans with precise information about the image features that cause a u…

    Submitted 12 November, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: Presented at NeurIPS 2021. Shared first and last authorship. Project website at https://brendel-group.github.io/causal-understanding-via-visualizations/

  9. arXiv:2102.08850

    cs.LG cs.CV

    Contrastive Learning Inverts the Data Generating Process

    Authors: Roland S. Zimmermann, Yash Sharma, Steffen Schneider, Matthias Bethge, Wieland Brendel

    Abstract: Contrastive learning has recently seen tremendous success in self-supervised learning. So far, however, it is largely unclear why the learned representations generalize so effectively to a large variety of downstream tasks. We here prove that feedforward models trained with objectives belonging to the commonly used InfoNCE family learn to implicitly invert the underlying generative model of the ob…

    Submitted 7 April, 2022; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: Presented at ICML 2021. The first three authors, as well as the last two authors, contributed equally. Code is available at https://brendel-group.github.io/cl-ica

  10. arXiv:2010.12606

    cs.CV cs.AI cs.HC cs.LG

    Exemplary Natural Images Explain CNN Activations Better than State-of-the-Art Feature Visualization

    Authors: Judy Borowski, Roland S. Zimmermann, Judith Schepers, Robert Geirhos, Thomas S. A. Wallis, Matthias Bethge, Wieland Brendel

    Abstract: Feature visualizations such as synthetic maximally activating images are a widely used explanation method to better understand the information processing of convolutional neural networks (CNNs). At the same time, there are concerns that these visualizations might not accurately represent CNNs' inner workings. Here, we measure how much extremely activating images help humans to predict CNN activati…

    Submitted 2 May, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Published at ICLR 2021. Joint first and last authors. Code is available at https://bethgelab.github.io/testing_visualizations/

  11. arXiv:2001.06057

    cs.CV cs.LG stat.ML

    A simple way to make neural networks robust against diverse image corruptions

    Authors: Evgenia Rusak, Lukas Schott, Roland S. Zimmermann, Julian Bitterwolf, Oliver Bringmann, Matthias Bethge, Wieland Brendel

    Abstract: The human visual system is remarkably robust against a wide range of naturally occurring variations and corruptions like rain or snow. In contrast, the performance of modern image recognition models strongly degrades when evaluated on previously unseen corruptions. Here, we demonstrate that a simple but properly tuned training with additive Gaussian and Speckle noise generalizes surprisingly well…

    Submitted 22 July, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: Oral presentation at the European Conference for Computer Vision (ECCV 2020)

  12. arXiv:1907.00895

    cs.LG stat.ML

    Comment on "Adv-BNN: Improved Adversarial Defense through Robust Bayesian Neural Network"

    Authors: Roland S. Zimmermann

    Abstract: A recent paper by Liu et al. combines the topics of adversarial training and Bayesian Neural Networks (BNN) and suggests that adversarially trained BNNs are more robust against adversarial attacks than their non-Bayesian counterparts. Here, I analyze the proposed defense and suggest that one needs to adjust the adversarial attack to incorporate the stochastic nature of a Bayesian network to perfor…

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 3 pages

  13. arXiv:1904.07817

    cs.LG cs.AI

    Simion Zoo: A Workbench for Distributed Experimentation with Reinforcement Learning for Continuous Control Tasks

    Authors: Borja Fernandez-Gauna, Manuel Graña, Roland S. Zimmermann

    Abstract: We present Simion Zoo, a Reinforcement Learning (RL) workbench that provides a complete set of tools to design, run, and analyze the results, both statistically and visually, of RL control applications. The main features that set apart Simion Zoo from similar software packages are its easy-to-use GUI, its support for distributed execution including deployment over graphics processing units (GPUs),…

    Submitted 16 April, 2019; originally announced April 2019.

  14. Faster Training of Mask R-CNN by Focusing on Instance Boundaries

    Authors: Roland S. Zimmermann, Julien N. Siems

    Abstract: We present an auxiliary task to Mask R-CNN, an instance segmentation network, which leads to faster training of the mask head. Our addition to Mask R-CNN is a new prediction head, the Edge Agreement Head, which is inspired by the way human annotators perform instance segmentation. Human annotators copy the contour of an object instance and only indirectly the occupied instance area. Hence, the edg…

    Submitted 10 August, 2019; v1 submitted 19 September, 2018; originally announced September 2018.

    Comments: 9 pages, 7 figures, 5 tables

    MSC Class: 68T45