Skip to main content

Showing 1–21 of 21 results for author: Stütz, D

.
  1. arXiv:2405.01563  [pdf, other

    cs.LG cs.AI cs.CL

    Mitigating LLM Hallucinations via Conformal Abstention

    Authors: Yasin Abbasi Yadkori, Ilja Kuzborskij, David Stutz, András György, Adam Fisch, Arnaud Doucet, Iuliya Beloshapka, Wei-Hung Weng, Yao-Yuan Yang, Csaba Szepesvári, Ali Taylan Cemgil, Nenad Tomasev

    Abstract: We develop a principled procedure for determining when a large language model (LLM) should abstain from responding (e.g., by saying "I don't know") in a general domain, instead of resorting to possibly "hallucinating" a non-sensical or incorrect answer. Building on earlier approaches that use self-consistency as a more reliable measure of model confidence, we propose using the LLM itself to self-e… ▽ More

    Submitted 4 April, 2024; originally announced May 2024.

  2. arXiv:2404.18416  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby , et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  3. arXiv:2402.10723  [pdf, other

    stat.ML cs.LG

    Conformalized Credal Set Predictors

    Authors: Alireza Javanmardi, David Stutz, Eyke Hüllermeier

    Abstract: Credal sets are sets of probability distributions that are considered as candidates for an imprecisely known ground-truth distribution. In machine learning, they have recently attracted attention as an appealing formalism for uncertainty representation, in particular due to their ability to represent both the aleatoric and epistemic uncertainty in a prediction. However, the design of methods for l… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  4. arXiv:2309.06166  [pdf, other

    cs.LG cs.CV stat.ML

    Certified Robust Models with Slack Control and Large Lipschitz Constants

    Authors: Max Losch, David Stutz, Bernt Schiele, Mario Fritz

    Abstract: Despite recent success, state-of-the-art learning-based models remain highly vulnerable to input changes such as adversarial examples. In order to obtain certifiable robustness against such perturbations, recent work considers Lipschitz-based regularizers or constraints while at the same time increasing prediction margin. Unfortunately, this comes at the cost of significantly decreased accuracy. I… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: To be published at GCPR 2023

  5. arXiv:2308.10888  [pdf, other

    cs.LG cs.CV cs.CY

    Unlocking Accuracy and Fairness in Differentially Private Image Classification

    Authors: Leonard Berrada, Soham De, Judy Hanwen Shen, Jamie Hayes, Robert Stanforth, David Stutz, Pushmeet Kohli, Samuel L. Smith, Borja Balle

    Abstract: Privacy-preserving machine learning aims to train models on private data without leaking sensitive information. Differential privacy (DP) is considered the gold standard framework for privacy-preserving training, as it provides formal privacy guarantees. However, compared to their non-private counterparts, models trained with DP often have significantly reduced accuracy. Private classifiers are al… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  6. Automatic registration with continuous pose updates for marker-less surgical navigation in spine surgery

    Authors: Florentin Liebmann, Marco von Atzigen, Dominik Stütz, Julian Wolf, Lukas Zingg, Daniel Suter, Laura Leoty, Hooman Esfandiari, Jess G. Snedeker, Martin R. Oswald, Marc Pollefeys, Mazda Farshad, Philipp Fürnstahl

    Abstract: Established surgical navigation systems for pedicle screw placement have been proven to be accurate, but still reveal limitations in registration or surgical guidance. Registration of preoperative data to the intraoperative anatomy remains a time-consuming, error-prone task that includes exposure to harmful radiation. Surgical guidance through conventional displays has well-known drawbacks, as inf… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

  7. arXiv:2307.09302  [pdf, other

    cs.LG cs.CV stat.ME stat.ML

    Conformal prediction under ambiguous ground truth

    Authors: David Stutz, Abhijit Guha Roy, Tatiana Matejovicova, Patricia Strachan, Ali Taylan Cemgil, Arnaud Doucet

    Abstract: Conformal Prediction (CP) allows to perform rigorous uncertainty quantification by constructing a prediction set $C(X)$ satisfying $\mathbb{P}(Y \in C(X))\geq 1-α$ for a user-chosen $α\in [0,1]$ by relying on calibration data $(X_1,Y_1),...,(X_n,Y_n)$ from $\mathbb{P}=\mathbb{P}^{X} \otimes \mathbb{P}^{Y|X}$. It is typically implicitly assumed that $\mathbb{P}^{Y|X}$ is the "true" posterior label… ▽ More

    Submitted 24 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  8. arXiv:2307.02191  [pdf, other

    cs.LG cs.CV stat.ME stat.ML

    Evaluating AI systems under uncertain ground truth: a case study in dermatology

    Authors: David Stutz, Ali Taylan Cemgil, Abhijit Guha Roy, Tatiana Matejovicova, Melih Barsbey, Patricia Strachan, Mike Schaekermann, Jan Freyberg, Rajeev Rikhye, Beverly Freeman, Javier Perez Matos, Umesh Telang, Dale R. Webster, Yuan Liu, Greg S. Corrado, Yossi Matias, Pushmeet Kohli, Yun Liu, Arnaud Doucet, Alan Karthikesalingam

    Abstract: For safety, AI systems in health undergo thorough evaluations before deployment, validating their predictions against a ground truth that is assumed certain. However, this is actually not the case and the ground truth may be uncertain. Unfortunately, this is largely ignored in standard evaluation of AI models but can have severe consequences such as overestimating the future performance. To avoid… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  9. arXiv:2303.11126  [pdf, other

    cs.CV

    Robustifying Token Attention for Vision Transformers

    Authors: Yong Guo, David Stutz, Bernt Schiele

    Abstract: Despite the success of vision transformers (ViTs), they still suffer from significant drops in accuracy in the presence of common corruptions, such as noise or blur. Interestingly, we observe that the attention mechanism of ViTs tends to rely on few important tokens, a phenomenon we call token overfocusing. More critically, these tokens are not robust to corruptions, often leading to highly diverg… ▽ More

    Submitted 6 September, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: To appear in ICCV 2023

  10. arXiv:2204.12393  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    On Fragile Features and Batch Normalization in Adversarial Training

    Authors: Nils Philipp Walter, David Stutz, Bernt Schiele

    Abstract: Modern deep learning architecture utilize batch normalization (BN) to stabilize training and improve accuracy. It has been shown that the BN layers alone are surprisingly expressive. In the context of robustness against adversarial examples, however, BN is argued to increase vulnerability. That is, BN helps to learn fragile features. Nevertheless, BN is still used in adversarial training, which is… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

  11. arXiv:2201.12765  [pdf, other

    cs.CV

    Improving Robustness by Enhancing Weak Subnets

    Authors: Yong Guo, David Stutz, Bernt Schiele

    Abstract: Despite their success, deep networks have been shown to be highly susceptible to perturbations, often causing significant drops in accuracy. In this paper, we investigate model robustness on perturbed inputs by studying the performance of internal sub-networks (subnets). Interestingly, we observe that most subnets show particularly poor robustness against perturbations. More importantly, these wea… ▽ More

    Submitted 20 July, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: To appear in ECCV 2022

  12. arXiv:2110.09192  [pdf, other

    cs.LG cs.CV stat.ME stat.ML

    Learning Optimal Conformal Classifiers

    Authors: David Stutz, Krishnamurthy, Dvijotham, Ali Taylan Cemgil, Arnaud Doucet

    Abstract: Modern deep learning based classifiers show very high accuracy on test data but this does not provide sufficient guarantees for safe deployment, especially in high-stake AI applications such as medical diagnosis. Usually, predictions are obtained without a reliable uncertainty estimate or a formal guarantee. Conformal prediction (CP) addresses these issues by using the classifier's predictions, e.… ▽ More

    Submitted 6 May, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: ICLR 2022

  13. arXiv:2107.05712  [pdf, other

    cs.LG

    A Closer Look at the Adversarial Robustness of Information Bottleneck Models

    Authors: Iryna Korshunova, David Stutz, Alexander A. Alemi, Olivia Wiles, Sven Gowal

    Abstract: We study the adversarial robustness of information bottleneck models for classification. Previous works showed that the robustness of models trained with information bottlenecks can improve upon adversarial training. Our evaluation under a diverse range of white-box $l_{\infty}$ attacks suggests that information bottlenecks alone are not a strong defense strategy, and that previous results were li… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

  14. arXiv:2104.08323  [pdf, other

    cs.LG cs.AR cs.CR cs.CV

    Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators

    Authors: David Stutz, Nandhini Chandramoorthy, Matthias Hein, Bernt Schiele

    Abstract: Deep neural network (DNN) accelerators received considerable attention in recent years due to the potential to save energy compared to mainstream hardware. Low-voltage operation of DNN accelerators allows to further reduce energy consumption, however, causes bit-level failures in the memory storing the quantized weights. Furthermore, DNN accelerators are vulnerable to adversarial attacks on voltag… ▽ More

    Submitted 7 June, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

  15. arXiv:2104.04448  [pdf, other

    cs.LG cs.CV stat.ML

    Relating Adversarially Robust Generalization to Flat Minima

    Authors: David Stutz, Matthias Hein, Bernt Schiele

    Abstract: Adversarial training (AT) has become the de-facto standard to obtain models robust against adversarial examples. However, AT exhibits severe robust overfitting: cross-entropy loss on adversarial examples, so-called robust loss, decreases continuously on training examples, while eventually increasing on test examples. In practice, this leads to poor robust generalization, i.e., adversarial robustne… ▽ More

    Submitted 6 October, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: ICCV'21

  16. arXiv:2006.13977  [pdf, other

    cs.LG cs.AR cs.CR cs.CV stat.ML

    Bit Error Robustness for Energy-Efficient DNN Accelerators

    Authors: David Stutz, Nandhini Chandramoorthy, Matthias Hein, Bernt Schiele

    Abstract: Deep neural network (DNN) accelerators received considerable attention in past years due to saved energy compared to mainstream hardware. Low-voltage operation of DNN accelerators allows to further reduce energy consumption significantly, however, causes bit-level failures in the memory storing the quantized DNN weights. In this paper, we show that a combination of robust fixed-point quantization,… ▽ More

    Submitted 9 April, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  17. arXiv:2005.02313  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    Adversarial Training against Location-Optimized Adversarial Patches

    Authors: Sukrut Rao, David Stutz, Bernt Schiele

    Abstract: Deep neural networks have been shown to be susceptible to adversarial examples -- small, imperceptible changes constructed to cause mis-classification in otherwise highly accurate image classifiers. As a practical alternative, recent work proposed so-called adversarial patches: clearly visible, but adversarially crafted rectangular patches in images. These patches can easily be printed and applied… ▽ More

    Submitted 14 December, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: 20 pages, 6 tables, 4 figures, 2 algorithms, European Conference on Computer Vision Workshops 2020

  18. arXiv:1910.06259  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Confidence-Calibrated Adversarial Training: Generalizing to Unseen Attacks

    Authors: David Stutz, Matthias Hein, Bernt Schiele

    Abstract: Adversarial training yields robust models against a specific threat model, e.g., $L_\infty$ adversarial examples. Typically robustness does not generalize to previously unseen threat models, e.g., other $L_p$ norms, or larger perturbations. Our confidence-calibrated adversarial training (CCAT) tackles this problem by biasing the model towards low confidence predictions on adversarial examples. By… ▽ More

    Submitted 30 June, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

  19. arXiv:1812.00740  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    Disentangling Adversarial Robustness and Generalization

    Authors: David Stutz, Matthias Hein, Bernt Schiele

    Abstract: Obtaining deep networks that are robust against adversarial examples and generalize well is an open problem. A recent hypothesis even states that both robust and accurate models are impossible, i.e., adversarial robustness and generalization are conflicting goals. In an effort to clarify the relationship between robustness and generalization, we assume an underlying, low-dimensional data manifold… ▽ More

    Submitted 10 April, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: Conference on Computer Vision and Pattern Recognition 2019

  20. Learning 3D Shape Completion under Weak Supervision

    Authors: David Stutz, Andreas Geiger

    Abstract: We address the problem of 3D shape completion from sparse and noisy point clouds, a fundamental problem in computer vision and robotics. Recent approaches are either data-driven or learning-based: Data-driven approaches rely on a shape model whose parameters are optimized to fit the observations; Learning-based approaches, in contrast, avoid the expensive optimization step by learning to directly… ▽ More

    Submitted 27 November, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

    Journal ref: David Stutz, Andreas Geiger. Learning 3D Shape Completion under Weak Supervision. International Journal of Computer Vision (2018)

  21. Superpixels: An Evaluation of the State-of-the-Art

    Authors: David Stutz, Alexander Hermans, Bastian Leibe

    Abstract: Superpixels group perceptually similar pixels to create visually meaningful entities while heavily reducing the number of primitives for subsequent processing steps. As of these properties, superpixel algorithms have received much attention since their naming in 2003. By today, publicly available superpixel algorithms have turned into standard tools in low-level vision. As such, and due to their q… ▽ More

    Submitted 19 April, 2017; v1 submitted 5 December, 2016; originally announced December 2016.