Skip to main content

Showing 1–27 of 27 results for author: Baracaldo, N

.
  1. arXiv:2406.11780  [pdf, other

    cs.LG cs.AI cs.CL

    Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs

    Authors: Swanand Ravindra Kadhe, Farhan Ahmed, Dennis Wei, Nathalie Baracaldo, Inkit Padhi

    Abstract: Large language models (LLMs) have shown to pose social and ethical risks such as generating toxic language or facilitating malicious use of hazardous knowledge. Machine unlearning is a promising approach to improve LLM safety by directly removing harmful behaviors and knowledge. In this paper, we propose "SPlit, UNlearn, MerGE" (SPUNGE), a framework that can be used with any unlearning method to a… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2402.08787  [pdf, other

    cs.LG cs.CL

    Rethinking Machine Unlearning for Large Language Models

    Authors: Sijia Liu, Yuanshun Yao, **ghan Jia, Stephen Casper, Nathalie Baracaldo, Peter Hase, Yuguang Yao, Chris Yuhao Liu, Xiaojun Xu, Hang Li, Kush R. Varshney, Mohit Bansal, Sanmi Koyejo, Yang Liu

    Abstract: We explore machine unlearning (MU) in the domain of large language models (LLMs), referred to as LLM unlearning. This initiative aims to eliminate undesirable data influence (e.g., sensitive or illegal information) and the associated model capabilities, while maintaining the integrity of essential knowledge generation and not affecting causally unrelated information. We envision LLM unlearning bec… ▽ More

    Submitted 1 July, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2401.12406  [pdf, other

    cs.CL cs.AI cs.LG

    Enhancing In-context Learning via Linear Probe Calibration

    Authors: Momin Abbas, Yi Zhou, Parikshit Ram, Nathalie Baracaldo, Horst Samulowitz, Theodoros Salonidis, Tianyi Chen

    Abstract: In-context learning (ICL) is a new paradigm for natural language processing that utilizes Generative Pre-trained Transformer (GPT)-like models. This approach uses prompts that include in-context demonstrations to generate the corresponding output for a new query input. However, applying ICL in real cases does not scale with the number of samples, and lacks robustness to different prompt templates… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted at AISTATS2024

  4. arXiv:2312.07420  [pdf, other

    cs.LG cs.CY

    FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs

    Authors: Swanand Ravindra Kadhe, Anisa Halimi, Ambrish Rawat, Nathalie Baracaldo

    Abstract: Training large language models (LLMs) is a costly endeavour in terms of time and computational resources. The large amount of training data used during the unsupervised pre-training phase makes it difficult to verify all data and, unfortunately, undesirable data may be ingested during training. Re-training from scratch is impractical and has led to the creation of the 'unlearning' discipline where… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted in NeurIPS 2023 Workshop on Socially Responsible Language Modelling Research (SoLaR)

  5. arXiv:2312.04748  [pdf, other

    cs.CR cs.AI cs.CL

    Forcing Generative Models to Degenerate Ones: The Power of Data Poisoning Attacks

    Authors: Shuli Jiang, Swanand Ravindra Kadhe, Yi Zhou, Ling Cai, Nathalie Baracaldo

    Abstract: Growing applications of large language models (LLMs) trained by a third party raise serious concerns on the security vulnerability of LLMs.It has been demonstrated that malicious actors can covertly exploit these vulnerabilities in LLMs through poisoning attacks aimed at generating undesirable outputs. While poisoning attacks have received significant attention in the image domain (e.g., object de… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 19 pages, 6 figures. Published at NeurIPS 2023 Workshop on Backdoors in Deep Learning: The Good, the Bad, and the Ugly

  6. arXiv:2310.19304  [pdf, other

    cs.CR cs.LG

    Privacy-Preserving Federated Learning over Vertically and Horizontally Partitioned Data for Financial Anomaly Detection

    Authors: Swanand Ravindra Kadhe, Heiko Ludwig, Nathalie Baracaldo, Alan King, Yi Zhou, Keith Houck, Ambrish Rawat, Mark Purcell, Naoise Holohan, Mikio Takeuchi, Ryo Kawahara, Nir Drucker, Hayim Shaul, Eyal Kushnir, Omri Soceanu

    Abstract: The effective detection of evidence of financial anomalies requires collaboration among multiple entities who own a diverse set of data, such as a payment network system (PNS) and its partner banks. Trust among these financial institutions is limited by regulation and competition. Federated learning (FL) enables entities to collaboratively train a model when data is either vertically or horizontal… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Prize Winner in the U.S. Privacy Enhancing Technologies (PETs) Prize Challenge

  7. arXiv:2305.02219  [pdf, other

    cs.LG cs.DC

    LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

    Authors: Timothy Castiglia, Yi Zhou, Shiqiang Wang, Swanand Kadhe, Nathalie Baracaldo, Stacy Patterson

    Abstract: We propose LESS-VFL, a communication-efficient feature selection method for distributed systems with vertically partitioned data. We consider a system of a server and several parties with local datasets that share a sample ID space but have different feature sets. The parties wish to collaboratively train a model for a prediction task. As part of the training, the parties wish to remove unimportan… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Published in ICML 2023

  8. arXiv:2209.01340  [pdf, other

    cs.LG cs.AI

    Federated XGBoost on Sample-Wise Non-IID Data

    Authors: Katelinh Jones, Yuya Jeremy Ong, Yi Zhou, Nathalie Baracaldo

    Abstract: Federated Learning (FL) is a paradigm for jointly training machine learning algorithms in a decentralized manner which allows for parties to communicate with an aggregator to create and train a model, without exposing the underlying raw data distribution of the local parties involved in the training process. Most research in FL has been focused on Neural Network-based approaches, however Tree-Base… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 9 Pages, 1 figure, 3 tables

  9. arXiv:2207.07779  [pdf, other

    cs.CR

    DeTrust-FL: Privacy-Preserving Federated Learning in Decentralized Trust Setting

    Authors: Runhua Xu, Nathalie Baracaldo, Yi Zhou, Ali Anwar, Swanand Kadhe, Heiko Ludwig

    Abstract: Federated learning has emerged as a privacy-preserving machine learning approach where multiple parties can train a single model without sharing their raw training data. Federated learning typically requires the utilization of multi-party computation techniques to provide strong privacy guarantees by ensuring that an untrusted or curious aggregator cannot obtain isolated replies from parties invol… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  10. arXiv:2207.05521  [pdf, other

    cs.LG cs.CR

    Federated Unlearning: How to Efficiently Erase a Client in FL?

    Authors: Anisa Halimi, Swanand Kadhe, Ambrish Rawat, Nathalie Baracaldo

    Abstract: With privacy legislation empowering the users with the right to be forgotten, it has become essential to make a model amenable for forgetting some of its training data. However, existing unlearning methods in the machine learning context can not be directly applied in the context of distributed settings like federated learning due to the differences in learning protocol and the presence of multipl… ▽ More

    Submitted 20 October, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

  11. Rings of light caused by gravitational waves

    Authors: Davide Batic, Joud Mojahed Faraji, Marek Nowakowski, Nicolas Maldonaldo Baracaldo

    Abstract: We reconsider the case of the geodesic motion of a massive and massless beam of test particles in a gravitational wave. In particular, we use a direct Lagrangian approach which simplifies the calculation. Our findings differ partly from previously performed calculations The final result can be interpreted as rings of light seen by the observer. We give a new interpretation to this picture and show… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: 11 pages, 2 figures

    Journal ref: Scientific Reports 12, 9688 (2022)

  12. arXiv:2202.12443  [pdf, other

    cs.AI cs.LG

    Towards an Accountable and Reproducible Federated Learning: A FactSheets Approach

    Authors: Nathalie Baracaldo, Ali Anwar, Mark Purcell, Ambrish Rawat, Mathieu Sinn, Bashar Altakrouri, Dian Balta, Mahdi Sellami, Peter Kuhn, Ulrich Schopp, Matthias Buchinger

    Abstract: Federated Learning (FL) is a novel paradigm for the shared training of models based on decentralized and private data. With respect to ethical guidelines, FL is promising regarding privacy, but needs to excel vis-à-vis transparency and trustworthiness. In particular, FL has to address the accountability of the parties involved and their adherence to rules, law and principles. We introduce AF^2 Fra… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 16 pages, 4 figures, 2 tables

  13. arXiv:2202.08338  [pdf, other

    cs.LG cs.DC

    Single-shot Hyper-parameter Optimization for Federated Learning: A General Algorithm & Analysis

    Authors: Yi Zhou, Parikshit Ram, Theodoros Salonidis, Nathalie Baracaldo, Horst Samulowitz, Heiko Ludwig

    Abstract: We address the relatively unexplored problem of hyper-parameter optimization (HPO) for federated learning (FL-HPO). We introduce Federated Loss SuRface Aggregation (FLoRA), a general FL-HPO solution framework that can address use cases of tabular data and any Machine Learning (ML) model including gradient boosting training algorithms and therefore further expands the scope of FL-HPO. FLoRA enables… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2112.08524

  14. arXiv:2112.08524  [pdf, ps, other

    cs.LG cs.DC

    FLoRA: Single-shot Hyper-parameter Optimization for Federated Learning

    Authors: Yi Zhou, Parikshit Ram, Theodoros Salonidis, Nathalie Baracaldo, Horst Samulowitz, Heiko Ludwig

    Abstract: We address the relatively unexplored problem of hyper-parameter optimization (HPO) for federated learning (FL-HPO). We introduce Federated Loss suRface Aggregation (FLoRA), the first FL-HPO solution framework that can address use cases of tabular data and gradient boosting training algorithms in addition to stochastic gradient descent/neural networks commonly addressed in the FL literature. The fr… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  15. arXiv:2108.04417  [pdf, other

    cs.LG cs.AI cs.CR

    Privacy-Preserving Machine Learning: Methods, Challenges and Directions

    Authors: Runhua Xu, Nathalie Baracaldo, James Joshi

    Abstract: Machine learning (ML) is increasingly being adopted in a wide variety of application domains. Usually, a well-performing ML model relies on a large volume of training data and high-powered computational resources. Such a need for and the use of huge volumes of data raise serious privacy concerns because of the potential risks of leakage of highly privacy-sensitive information; further, the evolvin… ▽ More

    Submitted 22 September, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

  16. arXiv:2107.12490  [pdf, other

    cs.LG cs.CR cs.DC

    LEGATO: A LayerwisE Gradient AggregaTiOn Algorithm for Mitigating Byzantine Attacks in Federated Learning

    Authors: Kamala Varma, Yi Zhou, Nathalie Baracaldo, Ali Anwar

    Abstract: Federated learning has arisen as a mechanism to allow multiple participants to collaboratively train a model without sharing their data. In these settings, participants (workers) may not trust each other fully; for instance, a set of competitors may collaboratively train a machine learning model to detect fraud. The workers provide local gradients that a central server uses to update a global mode… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  17. arXiv:2103.03918  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    FedV: Privacy-Preserving Federated Learning over Vertically Partitioned Data

    Authors: Runhua Xu, Nathalie Baracaldo, Yi Zhou, Ali Anwar, James Joshi, Heiko Ludwig

    Abstract: Federated learning (FL) has been proposed to allow collaborative training of machine learning (ML) models among multiple parties where each party can keep its data private. In this paradigm, only model updates, such as model weights or gradients, are shared. Many existing approaches have focused on horizontal FL, where each party has the entire feature set and labels in the training data set. Howe… ▽ More

    Submitted 16 June, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

  18. arXiv:2102.00655  [pdf, other

    cs.LG cs.DC

    Curse or Redemption? How Data Heterogeneity Affects the Robustness of Federated Learning

    Authors: Syed Zawad, Ahsan Ali, Pin-Yu Chen, Ali Anwar, Yi Zhou, Nathalie Baracaldo, Yuan Tian, Feng Yan

    Abstract: Data heterogeneity has been identified as one of the key features in federated learning but often overlooked in the lens of robustness to adversarial attacks. This paper focuses on characterizing and understanding its impact on backdooring attacks in federated learning through comprehensive experiments using synthetic and the LEAF benchmarks. The initial impression driven by our experimental resul… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted in AAAI 2021

  19. arXiv:2012.06670  [pdf, other

    cs.LG cs.DC

    Adaptive Histogram-Based Gradient Boosted Trees for Federated Learning

    Authors: Yuya Jeremy Ong, Yi Zhou, Nathalie Baracaldo, Heiko Ludwig

    Abstract: Federated Learning (FL) is an approach to collaboratively train a model across multiple parties without sharing data between parties or an aggregator. It is used both in the consumer domain to protect personal data as well as in enterprise settings, where dealing with data domicile regulation and the pragmatics of data silos are the main drivers. While gradient boosted tree implementations such as… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 11 pages with 1 figure

  20. arXiv:2012.02447  [pdf, other

    cs.LG stat.ML

    Mitigating Bias in Federated Learning

    Authors: Annie Abay, Yi Zhou, Nathalie Baracaldo, Shashank Rajamoni, Ebube Chuba, Heiko Ludwig

    Abstract: As methods to create discrimination-aware models develop, they focus on centralized ML, leaving federated learning (FL) unexplored. FL is a rising approach for collaborative ML, in which an aggregator orchestrates multiple parties to train a global model without sharing their training data. In this paper, we discuss causes of bias in FL and propose three pre-processing and in-processing methods to… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  21. arXiv:2007.10987  [pdf, other

    cs.LG cs.CR cs.DC

    IBM Federated Learning: an Enterprise Framework White Paper V0.1

    Authors: Heiko Ludwig, Nathalie Baracaldo, Gegi Thomas, Yi Zhou, Ali Anwar, Shashank Rajamoni, Yuya Ong, Jayaram Radhakrishnan, Ashish Verma, Mathieu Sinn, Mark Purcell, Ambrish Rawat, Tran Minh, Naoise Holohan, Supriyo Chakraborty, Shalisha Whitherspoon, Dean Steuer, Laura Wynter, Hifaz Hassan, Sean Laguna, Mikhail Yurochkin, Mayank Agarwal, Ebube Chuba, Annie Abay

    Abstract: Federated Learning (FL) is an approach to conduct machine learning without centralizing training data in a single place, for reasons of privacy, confidentiality or data volume. However, solving federated machine learning problems raises issues above and beyond those of centralized machine learning. These issues include setting up communication infrastructure between parties, coordinating the learn… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 17 pages

    ACM Class: I.2.6; I.2.11

  22. arXiv:2001.09249  [pdf, other

    cs.LG cs.PF stat.ML

    TiFL: A Tier-based Federated Learning System

    Authors: Zheng Chai, Ahsan Ali, Syed Zawad, Stacey Truex, Ali Anwar, Nathalie Baracaldo, Yi Zhou, Heiko Ludwig, Feng Yan, Yue Cheng

    Abstract: Federated Learning (FL) enables learning a shared model across many clients without violating the privacy requirements. One of the key attributes in FL is the heterogeneity that exists in both resource and data due to the differences in computation and communication capacity, as well as the quantity and content of data among different clients. We conduct a case study to show that heterogeneity in… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

  23. HybridAlpha: An Efficient Approach for Privacy-Preserving Federated Learning

    Authors: Runhua Xu, Nathalie Baracaldo, Yi Zhou, Ali Anwar, Heiko Ludwig

    Abstract: Federated learning has emerged as a promising approach for collaborative and privacy-preserving learning. Participants in a federated learning process cooperatively train a model by exchanging model parameters instead of the actual training data, which they might want to keep private. However, parameter interaction and the resulting model still might disclose information about the training data us… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 12 pages, AISec 2019

  24. arXiv:1909.12946  [pdf, other

    cs.CY cs.CR cs.LG cs.SI q-fin.ST

    Towards Federated Graph Learning for Collaborative Financial Crimes Detection

    Authors: Toyotaro Suzumura, Yi Zhou, Natahalie Baracaldo, Guangnan Ye, Keith Houck, Ryo Kawahara, Ali Anwar, Lucia Larise Stavarache, Yuji Watanabe, Pablo Loyola, Daniel Klyashtorny, Heiko Ludwig, Kumar Bhaskaran

    Abstract: Financial crime is a large and growing problem, in some way touching almost every financial institution. Financial institutions are the front line in the war against financial crime and accordingly, must devote substantial human and technology resources to this effort. Current processes to detect financial misconduct have limitations in their ability to effectively differentiate between malicious… ▽ More

    Submitted 2 October, 2019; v1 submitted 19 September, 2019; originally announced September 2019.

  25. arXiv:1812.03224  [pdf, other

    cs.LG stat.ML

    A Hybrid Approach to Privacy-Preserving Federated Learning

    Authors: Stacey Truex, Nathalie Baracaldo, Ali Anwar, Thomas Steinke, Heiko Ludwig, Rui Zhang, Yi Zhou

    Abstract: Federated learning facilitates the collaborative training of models without the sharing of raw data. However, recent attacks demonstrate that simply maintaining data locality during training processes does not provide sufficient privacy guarantees. Rather, we need a federated learning system capable of preventing inference over both the messages exchanged during training and the final trained mode… ▽ More

    Submitted 14 August, 2019; v1 submitted 7 December, 2018; originally announced December 2018.

  26. arXiv:1811.03728  [pdf, other

    cs.LG cs.CR stat.ML

    Detecting Backdoor Attacks on Deep Neural Networks by Activation Clustering

    Authors: Bryant Chen, Wilka Carvalho, Nathalie Baracaldo, Heiko Ludwig, Benjamin Edwards, Taesung Lee, Ian Molloy, Biplav Srivastava

    Abstract: While machine learning (ML) models are being increasingly trusted to make decisions in different and varying areas, the safety of systems using such models has become an increasing concern. In particular, ML models are often trained on data from potentially untrustworthy sources, providing adversaries with the opportunity to manipulate them by inserting carefully crafted samples into the training… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

  27. arXiv:1807.01069  [pdf, other

    cs.LG stat.ML

    Adversarial Robustness Toolbox v1.0.0

    Authors: Maria-Irina Nicolae, Mathieu Sinn, Minh Ngoc Tran, Beat Buesser, Ambrish Rawat, Martin Wistuba, Valentina Zantedeschi, Nathalie Baracaldo, Bryant Chen, Heiko Ludwig, Ian M. Molloy, Ben Edwards

    Abstract: Adversarial Robustness Toolbox (ART) is a Python library supporting developers and researchers in defending Machine Learning models (Deep Neural Networks, Gradient Boosted Decision Trees, Support Vector Machines, Random Forests, Logistic Regression, Gaussian Processes, Decision Trees, Scikit-learn Pipelines, etc.) against adversarial threats and helps making AI systems more secure and trustworthy.… ▽ More

    Submitted 15 November, 2019; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 34 pages