Skip to main content

Showing 1–24 of 24 results for author: Müller, M N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13433  [pdf, other

    cs.LG cs.AI

    Certificates of Differential Privacy and Unlearning for Gradient-Based Training

    Authors: Matthew Wicker, Philip Sosnin, Adrianna Janik, Mark N. Müller, Adrian Weller, Calvin Tsay

    Abstract: Proper data stewardship requires that model owners protect the privacy of individuals' data used during training. Whether through anonymization with differential privacy or the use of unlearning in non-anonymized settings, the gold-standard techniques for providing privacy guarantees can come with significant performance penalties or be too weak to provide practical assurances. In part, this is du… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 15 pages, 14 figures

  2. arXiv:2406.12952  [pdf, other

    cs.SE cs.AI cs.LG

    Code Agents are State of the Art Software Testers

    Authors: Niels Mündler, Mark Niklas Müller, **gxuan He, Martin Vechev

    Abstract: Rigorous software testing is crucial for develo** and maintaining high-quality code, making automated test generation a promising avenue for both improving software quality and boosting the effectiveness of code generation methods. However, while code generation with Large Language Models (LLMs) is an extraordinarily active research area, test generation remains relatively unexplored. We address… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 20 pages, 14 figures, 7 tables

  3. arXiv:2406.05670  [pdf, other

    cs.LG cs.CR cs.CV

    Certified Robustness to Data Poisoning in Gradient-Based Training

    Authors: Philip Sosnin, Mark N. Müller, Maximilian Baader, Calvin Tsay, Matthew Wicker

    Abstract: Modern machine learning pipelines leverage large amounts of public data, making it infeasible to guarantee data quality and leaving models open to poisoning and backdoor attacks. However, provably bounding model behavior under such attacks remains an open problem. In this work, we address this challenge and develop the first framework providing provable guarantees on the behavior of models trained… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 15 pages, 5 figures

  4. arXiv:2405.16281  [pdf, other

    cs.CL

    ConStat: Performance-Based Contamination Detection in Large Language Models

    Authors: Jasper Dekoninck, Mark Niklas Müller, Martin Vechev

    Abstract: Public benchmarks play an essential role in the evaluation of large language models. However, data contamination can lead to inflated performance, rendering them unreliable for model comparison. It is therefore crucial to detect contamination and estimate its impact on measured performance. Unfortunately, existing detection methods can be easily evaded and fail to quantify contamination. To overco… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  5. arXiv:2405.15586  [pdf, other

    cs.LG cs.DC

    DAGER: Exact Gradient Inversion for Large Language Models

    Authors: Ivo Petrov, Dimitar I. Dimitrov, Maximilian Baader, Mark Niklas Müller, Martin Vechev

    Abstract: Federated learning works by aggregating locally computed gradients from multiple clients, thus enabling collaborative training without sharing private client data. However, prior work has shown that the data can actually be recovered by the server using so-called gradient inversion attacks. While these attacks perform well when applied on images, they are limited in the text domain and only permit… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    ACM Class: I.2.7; I.2.11

  6. arXiv:2403.07095  [pdf, other

    cs.LG

    Overcoming the Paradox of Certified Training with Gaussian Smoothing

    Authors: Stefan Balauca, Mark Niklas Müller, Yuhao Mao, Maximilian Baader, Marc Fischer, Martin Vechev

    Abstract: Training neural networks with high certified accuracy against adversarial examples remains an open problem despite significant efforts. While certification methods can effectively leverage tight convex relaxations for bound computation, in training, these methods perform worse than looser relaxations. Prior work hypothesized that this is caused by the discontinuity and perturbation sensitivity of… ▽ More

    Submitted 25 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  7. arXiv:2403.03945  [pdf, other

    cs.LG cs.CR cs.DC

    SPEAR:Exact Gradient Inversion of Batches in Federated Learning

    Authors: Dimitar I. Dimitrov, Maximilian Baader, Mark Niklas Müller, Martin Vechev

    Abstract: Federated learning is a framework for collaborative machine learning where clients only share gradient updates and not their private data with a server. However, it was recently shown that gradient inversion attacks can reconstruct this data from the shared gradients. In the important honest-but-curious setting, existing attacks enable exact reconstruction only for a batch size of $b=1$, with larg… ▽ More

    Submitted 3 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    ACM Class: I.2.11

  8. arXiv:2402.02823  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    Evading Data Contamination Detection for Language Models is (too) Easy

    Authors: Jasper Dekoninck, Mark Niklas Müller, Maximilian Baader, Marc Fischer, Martin Vechev

    Abstract: Large language models are widespread, with their performance on benchmarks frequently guiding user preferences for one model over another. However, the vast amount of data these models are trained on can inadvertently lead to contamination with public benchmarks, thus compromising performance measurements. While recently developed contamination detection methods try to address this issue, they ove… ▽ More

    Submitted 12 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  9. arXiv:2401.02430  [pdf, other

    cs.CV cs.AI cs.LG

    Automated Classification of Model Errors on ImageNet

    Authors: Momchil Peychev, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: While the ImageNet dataset has been driving computer vision research over the past decade, significant label noise and ambiguity have made top-1 accuracy an insufficient measure of further progress. To address this, new label-sets and evaluation protocols have been proposed for ImageNet showing that state-of-the-art models already achieve over 95% accuracy and shifting the focus on investigating w… ▽ More

    Submitted 13 November, 2023; originally announced January 2024.

    Comments: NeurIPS 2023

  10. arXiv:2311.04954  [pdf, other

    cs.CL cs.AI

    Prompt Sketching for Large Language Models

    Authors: Luca Beurer-Kellner, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Many recent prompting strategies for large language models (LLMs) query the model multiple times sequentially -- first to produce intermediate results and then the final answer. However, using these methods, both decoder and model are unaware of potential follow-up prompts, leading to disconnected and undesirably wordy intermediate responses. In this work, we address this issue by proposing prompt… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  11. arXiv:2311.04015  [pdf, ps, other

    cs.LG cs.AI

    Expressivity of ReLU-Networks under Convex Relaxations

    Authors: Maximilian Baader, Mark Niklas Müller, Yuhao Mao, Martin Vechev

    Abstract: Convex relaxations are a key component of training and certifying provably safe neural networks. However, despite substantial progress, a wide and poorly understood accuracy gap to standard networks remains, raising the question of whether this is due to fundamental limitations of convex relaxations. Initial work investigating this question focused on the simple and widely used IBP relaxation. It… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  12. arXiv:2306.10426  [pdf, other

    cs.LG cs.AI

    Understanding Certified Training with Interval Bound Propagation

    Authors: Yuhao Mao, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: As robustness verification methods are becoming more precise, training certifiably robust neural networks is becoming ever more relevant. To this end, certified training methods compute and then optimize an upper bound on the worst-case loss over a robustness specification. Curiously, training methods based on the imprecise interval bound propagation (IBP) consistently outperform those leveraging… ▽ More

    Submitted 27 February, 2024; v1 submitted 17 June, 2023; originally announced June 2023.

    Comments: ICLR'24

  13. arXiv:2305.04574  [pdf, other

    cs.LG

    TAPS: Connecting Certified and Adversarial Training

    Authors: Yuhao Mao, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Training certifiably robust neural networks remains a notoriously hard problem. On one side, adversarial training optimizes under-approximations of the worst-case loss, which leads to insufficient regularization for certification, while on the other, sound certified training methods optimize loose over-approximations, leading to over-regularization and poor (standard) accuracy. In this work we pro… ▽ More

    Submitted 25 October, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: NeuIPS'23

  14. arXiv:2303.05246  [pdf, other

    cs.LG cs.AI

    Efficient Certified Training and Robustness Verification of Neural ODEs

    Authors: Mustafa Zeqiri, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Neural Ordinary Differential Equations (NODEs) are a novel neural architecture, built around initial value problems with learned dynamics which are solved during inference. Thought to be inherently more robust against adversarial perturbations, they were recently shown to be vulnerable to strong adversarial attacks, highlighting the need for formal guarantees. However, despite significant progress… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted at ICLR23

  15. arXiv:2301.05815  [pdf, other

    cs.LG cs.AI cs.SE

    First Three Years of the International Verification of Neural Networks Competition (VNN-COMP)

    Authors: Christopher Brix, Mark Niklas Müller, Stanley Bak, Taylor T. Johnson, Changliu Liu

    Abstract: This paper presents a summary and meta-analysis of the first three iterations of the annual International Verification of Neural Networks Competition (VNN-COMP) held in 2020, 2021, and 2022. In the VNN-COMP, participants submit software tools that analyze whether given neural networks satisfy specifications describing their input-output behavior. These neural networks and specifications cover a va… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: 12 pages, journal preprint

  16. arXiv:2212.10376  [pdf, other

    cs.LG cs.AI cs.SE

    The Third International Verification of Neural Networks Competition (VNN-COMP 2022): Summary and Results

    Authors: Mark Niklas Müller, Christopher Brix, Stanley Bak, Changliu Liu, Taylor T. Johnson

    Abstract: This report summarizes the 3rd International Verification of Neural Networks Competition (VNN-COMP 2022), held as a part of the 5th Workshop on Formal Methods for ML-Enabled Autonomous Systems (FoMLAS), which was collocated with the 34th International Conference on Computer-Aided Verification (CAV). VNN-COMP is held annually to facilitate the fair and objective comparison of state-of-the-art neura… ▽ More

    Submitted 16 February, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Corrected a small error in instance-wise results; 54 pages, 27 tables, and 16 figures

  17. arXiv:2210.04871  [pdf, other

    cs.LG cs.CR

    Certified Training: Small Boxes are All You Need

    Authors: Mark Niklas Müller, Franziska Eckert, Marc Fischer, Martin Vechev

    Abstract: To obtain, deterministic guarantees of adversarial robustness, specialized training methods are used. We propose, SABR, a novel such certified training method, based on the key insight that propagating interval bounds for a small but carefully selected subset of the adversarial input region is sufficient to approximate the worst-case loss over the whole region while significantly reducing approxim… ▽ More

    Submitted 9 March, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted at ICLR23 as Spotlight

  18. arXiv:2205.13909  [pdf, other

    cs.LG cs.AI cs.CR

    (De-)Randomized Smoothing for Decision Stump Ensembles

    Authors: Miklós Z. Horváth, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Tree-based models are used in many high-stakes application domains such as finance and medicine, where robustness and interpretability are of utmost importance. Yet, methods for improving and certifying their robustness are severely under-explored, in contrast to those focusing on neural networks. Targeting this important challenge, we propose deterministic smoothing for decision stump ensembles.… ▽ More

    Submitted 14 November, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022 Paper

  19. arXiv:2205.00263  [pdf, other

    cs.LG cs.SE

    Complete Verification via Multi-Neuron Relaxation Guided Branch-and-Bound

    Authors: Claudio Ferrari, Mark Niklas Muller, Nikola Jovanovic, Martin Vechev

    Abstract: State-of-the-art neural network verifiers are fundamentally based on one of two paradigms: either encoding the whole verification problem via tight multi-neuron convex relaxations or applying a Branch-and-Bound (BaB) procedure leveraging imprecise but fast bounding methods on a large number of easier subproblems. The former can capture complex multi-neuron dependencies but sacrifices completeness… ▽ More

    Submitted 30 April, 2022; originally announced May 2022.

  20. arXiv:2204.00487  [pdf, other

    cs.LG cs.AI cs.CR

    Robust and Accurate -- Compositional Architectures for Randomized Smoothing

    Authors: Miklós Z. Horváth, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Randomized Smoothing (RS) is considered the state-of-the-art approach to obtain certifiably robust models for challenging tasks. However, current RS approaches drastically decrease standard accuracy on unperturbed data, severely limiting their real-world utility. To address this limitation, we propose a compositional architecture, ACES, which certifiably decides on a per-sample basis whether to us… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: Presented at the ICLR 2022 Workshop on Socially Responsible Machine Learning

  21. arXiv:2110.08260  [pdf, other

    cs.LG cs.AI

    Abstract Interpretation of Fixpoint Iterators with Applications to Neural Networks

    Authors: Mark Niklas Müller, Marc Fischer, Robin Staab, Martin Vechev

    Abstract: We present a new abstract interpretation framework for the precise over-approximation of numerical fixpoint iterators. Our key observation is that unlike in standard abstract interpretation (AI), typically used to over-approximate all reachable program states, in this setting, one only needs to abstract the concrete fixpoints, i.e., the final program states. Our framework targets numerical fixpoin… ▽ More

    Submitted 26 April, 2023; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: PLDI'23: 44th ACM SIGPLAN International Conference on Programming Language Design and Implementation

  22. arXiv:2106.06946  [pdf, other

    cs.LG cs.AI cs.CV

    Boosting Randomized Smoothing with Variance Reduced Classifiers

    Authors: Miklós Z. Horváth, Mark Niklas Müller, Marc Fischer, Martin Vechev

    Abstract: Randomized Smoothing (RS) is a promising method for obtaining robustness certificates by evaluating a base model under noise. In this work, we: (i) theoretically motivate why ensembles are a particularly suitable choice as base models for RS, and (ii) empirically confirm this choice, obtaining state-of-the-art results in multiple settings. The key insight of our work is that the reduced variance o… ▽ More

    Submitted 30 March, 2022; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: ICLR 2022 Spotlight Paper

  23. arXiv:2103.03638  [pdf, other

    cs.AI cs.LG

    PRIMA: General and Precise Neural Network Certification via Scalable Convex Hull Approximations

    Authors: Mark Niklas Müller, Gleb Makarchuk, Gagandeep Singh, Markus Püschel, Martin Vechev

    Abstract: Formal verification of neural networks is critical for their safe adoption in real-world applications. However, designing a precise and scalable verifier which can handle different activation functions, realistic network architectures and relevant specifications remains an open and difficult challenge. In this paper, we take a major step forward in addressing this challenge and present a new verif… ▽ More

    Submitted 28 February, 2022; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: 29 pages, 18 figures, 6 tables

    Journal ref: Proceedings of the ACM on Programming Languages, Volume 6, Issue POPL, January 2022, Article No.: 43, pp 1-33

  24. arXiv:1102.3866  [pdf

    cs.IR

    Treatment of Semantic Heterogeneity in Information Retrieval

    Authors: Heiko Hellweg, Jürgen Krause, Thomas Mandl, Jutta Marx, Matthias N. O. Müller, Peter Mutschke, Robert Strötgen

    Abstract: The first step to handle semantic heterogeneity should be the attempt to enrich the semantic information about documents, i.e. to fill up the gaps in the documents meta-data automatically. Section 2 describes a set of cascading deductive and heuristic extraction rules, which were developed in the project CARMEN for the domain of Social Sciences. The map** between different terminologies can be d… ▽ More

    Submitted 18 February, 2011; originally announced February 2011.

    Comments: Technical Report (Arbeitsbericht) GESIS - Leibniz Institute for the Social Sciences

    Report number: IZ-Arbeitsbericht Nr. 23 2001