Showing 1–9 of 9 results for author: Frikha, A

Searching in archive cs.
  1. arXiv:2407.02960  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    ObfuscaTune: Obfuscated Offsite Fine-tuning and Inference of Proprietary LLMs on Private Datasets

    Authors: Ahmed Frikha, Nassim Walha, Ricardo Mendes, Krishna Kanth Nakka, Xue Jiang, Xuebing Zhou

    Abstract: This work addresses the timely yet underexplored problem of performing inference and finetuning of a proprietary LLM owned by a model provider entity on the confidential/private data of another data owner entity, in a way that ensures the confidentiality of both the model and the data. Hereby, the finetuning is conducted offsite, i.e., on the computation infrastructure of a third-party cloud provi…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Preprint

  2. arXiv:2407.02956  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    IncogniText: Privacy-enhancing Conditional Text Anonymization via LLM-based Private Attribute Randomization

    Authors: Ahmed Frikha, Nassim Walha, Krishna Kanth Nakka, Ricardo Mendes, Xue Jiang, Xuebing Zhou

    Abstract: In this work, we address the problem of text anonymization where the goal is to prevent adversaries from correctly inferring private attributes of the author, while keeping the text utility, i.e., meaning and semantics. We propose IncogniText, a technique that anonymizes the text to mislead a potential adversary into predicting a wrong private attribute value. Our empirical evaluation shows a redu…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Preprint

  3. arXiv:2407.02943  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    PII-Compass: Guiding LLM training data extraction prompts towards the target PII via grounding

    Authors: Krishna Kanth Nakka, Ahmed Frikha, Ricardo Mendes, Xue Jiang, Xuebing Zhou

    Abstract: The latest and most impactful advances in large models stem from their increased size. Unfortunately, this translates into an improved memorization capacity, raising data privacy concerns. Specifically, it has been shown that models can output personal identifiable information (PII) contained in their training data. However, reported PII extraction performance varies widely, and there is no conse…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024

  4. arXiv:2211.10567  [pdf, other]

    cs.CV

    CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering

    Authors: Yao Zhang, Haokun Chen, Ahmed Frikha, Yezi Yang, Denis Krompass, Gengyuan Zhang, Jindong Gu, Volker Tresp

    Abstract: Visual Question Answering (VQA) is a multi-discipline research task. To produce the right answer, it requires an understanding of the visual content of images, the natural language questions, as well as commonsense reasoning over the information contained in the image and world knowledge. Recently, large-scale Vision-and-Language Pre-trained Models (VLPMs) have been the mainstream approach to VQA…

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 10 pages, 6 figures

  5. arXiv:2205.14900  [pdf, other]

    cs.LG cs.AI

    FRAug: Tackling Federated Learning with Non-IID Features via Representation Augmentation

    Authors: Haokun Chen, Ahmed Frikha, Denis Krompass, Jindong Gu, Volker Tresp

    Abstract: Federated Learning (FL) is a decentralized learning paradigm, in which multiple clients collaboratively train deep learning models without centralizing their local data, and hence preserve data privacy. Real-world applications usually involve a distribution shift across the datasets of the different clients, which hurts the generalization ability of the clients to unseen samples from their respect…

    Submitted 22 August, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: ICCV 2023

  6. arXiv:2110.04545  [pdf, other]

    cs.LG cs.CV

    Towards Data-Free Domain Generalization

    Authors: Ahmed Frikha, Haokun Chen, Denis Krompaß, Thomas Runkler, Volker Tresp

    Abstract: In this work, we investigate the unexplored intersection of domain generalization (DG) and data-free learning. In particular, we address the question: How can knowledge contained in models trained on different source domains be merged into a single model that generalizes well to unseen target domains, in the absence of source and target domain data? Machine learning models that can cope with domai…

    Submitted 14 November, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021 (DistShift Workshop) and ACML 2022

  7. arXiv:2109.04320  [pdf, other]

    cs.LG stat.ML

    Discovery of New Multi-Level Features for Domain Generalization via Knowledge Corruption

    Authors: Ahmed Frikha, Denis Krompaß, Volker Tresp

    Abstract: Machine learning models that can generalize to unseen domains are essential when applied in real-world scenarios involving strong domain shifts. We address the challenging domain generalization (DG) problem, where a model trained on a set of source domains is expected to generalize well in unseen domains without any exposure to their data. The main challenge of DG is that the features learned from…

    Submitted 3 October, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted at AAAI 2022 (AIBSD Workshop) and ICPR 2022

  8. ARCADe: A Rapid Continual Anomaly Detector

    Authors: Ahmed Frikha, Denis Krompaß, Volker Tresp

    Abstract: Although continual learning and anomaly detection have separately been well-studied in previous works, their intersection remains rather unexplored. The present work addresses a learning scenario where a model has to incrementally learn a sequence of anomaly detection tasks, i.e. tasks from which only examples from the normal (majority) class are available for training. We define this novel learni…

    Submitted 18 October, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: Accepted at ICPR 2020

  9. arXiv:2007.04146  [pdf, other]

    cs.LG stat.ML

    Few-Shot One-Class Classification via Meta-Learning

    Authors: Ahmed Frikha, Denis Krompaß, Hans-Georg Köpken, Volker Tresp

    Abstract: Although few-shot learning and one-class classification (OCC), i.e., learning a binary classifier with data from only one class, have been separately well studied, their intersection remains rather unexplored. Our work addresses the few-shot OCC problem and presents a method to modify the episodic data sampling strategy of the model-agnostic meta-learning (MAML) algorithm to learn a model initiali…

    Submitted 11 February, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Accepted at AAAI 2021