Skip to main content

Showing 1–10 of 10 results for author: Staab, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07217  [pdf, other

    cs.LG cs.AI cs.CL

    A Synthetic Dataset for Personal Attribute Inference

    Authors: Hanna Yukhymenko, Robin Staab, Mark Vero, Martin Vechev

    Abstract: Recently, powerful Large Language Models (LLMs) have become easily accessible to hundreds of millions of users worldwide. However, their strong capabilities and vast world knowledge do not come without associated privacy risks. In this work, we focus on the emerging privacy threat LLMs pose - the ability to accurately infer personal information from online texts. Despite the growing importance of… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2405.18161  [pdf, other

    cs.LG cs.AI

    Back to the Drawing Board for Fair Representation Learning

    Authors: Angéline Pouget, Nikola Jovanović, Mark Vero, Robin Staab, Martin Vechev

    Abstract: The goal of Fair Representation Learning (FRL) is to mitigate biases in machine learning models by learning data representations that enable high accuracy on downstream tasks while minimizing discrimination based on sensitive attributes. The evaluation of FRL methods in many recent works primarily focuses on the tradeoff between downstream fairness and accuracy with respect to a single task that w… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.18137  [pdf, other

    cs.LG cs.AI cs.CR

    Exploiting LLM Quantization

    Authors: Kazuki Egashira, Mark Vero, Robin Staab, **gxuan He, Martin Vechev

    Abstract: Quantization leverages lower-precision weights to reduce the memory usage of large language models (LLMs) and is a key technique for enabling their deployment on commodity hardware. While LLM quantization's impact on utility has been extensively explored, this work for the first time studies its adverse effects from a security perspective. We reveal that widely used quantization methods can be exp… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  4. arXiv:2404.10618  [pdf, other

    cs.AI cs.CV cs.LG

    Private Attribute Inference from Images with Vision-Language Models

    Authors: Batuhan Tömekçe, Mark Vero, Robin Staab, Martin Vechev

    Abstract: As large language models (LLMs) become ubiquitous in our daily tasks and digital interactions, associated privacy risks are increasingly in focus. While LLM privacy research has primarily focused on the leakage of model training data, it has recently been shown that the increase in models' capabilities has enabled LLMs to make accurate privacy-infringing inferences from previously unseen texts. Wi… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  5. arXiv:2402.19361  [pdf, other

    cs.LG cs.AI cs.CR

    Watermark Stealing in Large Language Models

    Authors: Nikola Jovanović, Robin Staab, Martin Vechev

    Abstract: LLM watermarking has attracted attention as a promising way to detect AI-generated content, with some works suggesting that current schemes may already be fit for deployment. In this work we dispute this claim, identifying watermark stealing (WS) as a fundamental vulnerability of these schemes. We show that querying the API of the watermarked LLM to approximately reverse-engineer a watermark enabl… ▽ More

    Submitted 24 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  6. arXiv:2402.13846  [pdf, other

    cs.AI cs.CL cs.CR

    Large Language Models are Advanced Anonymizers

    Authors: Robin Staab, Mark Vero, Mislav Balunović, Martin Vechev

    Abstract: Recent work in privacy research on large language models has shown that they achieve near human-level performance at inferring personal data from real-world online texts. With consistently increasing model capabilities, existing text anonymization methods are currently lacking behind regulatory requirements and adversarial threats. This raises the question of how individuals can effectively protec… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    ACM Class: I.2.7

  7. arXiv:2311.10500  [pdf, other

    cs.LG cs.AI cs.CR

    From Principle to Practice: Vertical Data Minimization for Machine Learning

    Authors: Robin Staab, Nikola Jovanović, Mislav Balunović, Martin Vechev

    Abstract: Aiming to train and deploy predictive models, organizations collect large amounts of detailed client data, risking the exposure of private information in the event of a breach. To mitigate this, policymakers increasingly demand compliance with the data minimization (DM) principle, restricting data collection to only that data which is relevant and necessary for the task. Despite regulatory pressur… ▽ More

    Submitted 22 November, 2023; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE S&P 2024

  8. arXiv:2310.07298  [pdf, other

    cs.AI cs.LG

    Beyond Memorization: Violating Privacy Via Inference with Large Language Models

    Authors: Robin Staab, Mark Vero, Mislav Balunović, Martin Vechev

    Abstract: Current privacy research on large language models (LLMs) primarily focuses on the issue of extracting memorized training data. At the same time, models' inference capabilities have increased drastically. This raises the key question of whether current LLMs could violate individuals' privacy by inferring personal attributes from text given at inference time. In this work, we present the first compr… ▽ More

    Submitted 6 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    ACM Class: I.2.7

  9. arXiv:2111.04706  [pdf, other

    cs.LG cs.CR

    Bayesian Framework for Gradient Leakage

    Authors: Mislav Balunović, Dimitar I. Dimitrov, Robin Staab, Martin Vechev

    Abstract: Federated learning is an established method for training machine learning models without sharing training data. However, recent work has shown that it cannot guarantee data privacy as shared gradients can still leak sensitive information. To formalize the problem of gradient leakage, we propose a theoretical framework that enables, for the first time, analysis of the Bayes optimal adversary phrase… ▽ More

    Submitted 17 March, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

  10. arXiv:2110.08260  [pdf, other

    cs.LG cs.AI

    Abstract Interpretation of Fixpoint Iterators with Applications to Neural Networks

    Authors: Mark Niklas Müller, Marc Fischer, Robin Staab, Martin Vechev

    Abstract: We present a new abstract interpretation framework for the precise over-approximation of numerical fixpoint iterators. Our key observation is that unlike in standard abstract interpretation (AI), typically used to over-approximate all reachable program states, in this setting, one only needs to abstract the concrete fixpoints, i.e., the final program states. Our framework targets numerical fixpoin… ▽ More

    Submitted 26 April, 2023; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: PLDI'23: 44th ACM SIGPLAN International Conference on Programming Language Design and Implementation