Skip to main content

Showing 1–18 of 18 results for author: Ludwig, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.04324  [pdf, other

    cs.AI cs.CL cs.SE

    Granite Code Models: A Family of Open Foundation Models for Code Intelligence

    Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal , et al. (21 additional authors not shown)

    Abstract: Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously. Realizing the full potential of code LLMs requires a wide range of capabili… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

  2. arXiv:2310.19304  [pdf, other

    cs.CR cs.LG

    Privacy-Preserving Federated Learning over Vertically and Horizontally Partitioned Data for Financial Anomaly Detection

    Authors: Swanand Ravindra Kadhe, Heiko Ludwig, Nathalie Baracaldo, Alan King, Yi Zhou, Keith Houck, Ambrish Rawat, Mark Purcell, Naoise Holohan, Mikio Takeuchi, Ryo Kawahara, Nir Drucker, Hayim Shaul, Eyal Kushnir, Omri Soceanu

    Abstract: The effective detection of evidence of financial anomalies requires collaboration among multiple entities who own a diverse set of data, such as a payment network system (PNS) and its partner banks. Trust among these financial institutions is limited by regulation and competition. Federated learning (FL) enables entities to collaboratively train a model when data is either vertically or horizontal… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Prize Winner in the U.S. Privacy Enhancing Technologies (PETs) Prize Challenge

  3. arXiv:2207.07779  [pdf, other

    cs.CR

    DeTrust-FL: Privacy-Preserving Federated Learning in Decentralized Trust Setting

    Authors: Runhua Xu, Nathalie Baracaldo, Yi Zhou, Ali Anwar, Swanand Kadhe, Heiko Ludwig

    Abstract: Federated learning has emerged as a privacy-preserving machine learning approach where multiple parties can train a single model without sharing their raw training data. Federated learning typically requires the utilization of multi-party computation techniques to provide strong privacy guarantees by ensuring that an untrusted or curious aggregator cannot obtain isolated replies from parties invol… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  4. arXiv:2204.02283  [pdf, other

    cs.LG cs.AI cs.CV

    Lost in Latent Space: Disentangled Models and the Challenge of Combinatorial Generalisation

    Authors: Milton L. Montero, Jeffrey S. Bowers, Rui Ponte Costa, Casimir J. H. Ludwig, Gaurav Malhotra

    Abstract: Recent research has shown that generative models with highly disentangled representations fail to generalise to unseen combination of generative factor values. These findings contradict earlier research which showed improved performance in out-of-training distribution settings when compared to entangled representations. Additionally, it is not clear if the reported failures are due to (a) encoders… ▽ More

    Submitted 14 June, 2024; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: 10 pages and 7 figures in main text (not including references). 27 pages and 31 figures in appendix. Updated to match the camera-ready version

    ACM Class: I.2.6; I.2.10; I.4.5; I.4.10; I.5.1; I.5.3

    Journal ref: Adv.Neur.Info.Proc.Sys. 35 (2022) 10136-1049

  5. arXiv:2202.08338  [pdf, other

    cs.LG cs.DC

    Single-shot Hyper-parameter Optimization for Federated Learning: A General Algorithm & Analysis

    Authors: Yi Zhou, Parikshit Ram, Theodoros Salonidis, Nathalie Baracaldo, Horst Samulowitz, Heiko Ludwig

    Abstract: We address the relatively unexplored problem of hyper-parameter optimization (HPO) for federated learning (FL-HPO). We introduce Federated Loss SuRface Aggregation (FLoRA), a general FL-HPO solution framework that can address use cases of tabular data and any Machine Learning (ML) model including gradient boosting training algorithms and therefore further expands the scope of FL-HPO. FLoRA enables… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2112.08524

  6. arXiv:2112.08524  [pdf, ps, other

    cs.LG cs.DC

    FLoRA: Single-shot Hyper-parameter Optimization for Federated Learning

    Authors: Yi Zhou, Parikshit Ram, Theodoros Salonidis, Nathalie Baracaldo, Horst Samulowitz, Heiko Ludwig

    Abstract: We address the relatively unexplored problem of hyper-parameter optimization (HPO) for federated learning (FL-HPO). We introduce Federated Loss suRface Aggregation (FLoRA), the first FL-HPO solution framework that can address use cases of tabular data and gradient boosting training algorithms in addition to stochastic gradient descent/neural networks commonly addressed in the FL literature. The fr… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  7. arXiv:2103.03918  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    FedV: Privacy-Preserving Federated Learning over Vertically Partitioned Data

    Authors: Runhua Xu, Nathalie Baracaldo, Yi Zhou, Ali Anwar, James Joshi, Heiko Ludwig

    Abstract: Federated learning (FL) has been proposed to allow collaborative training of machine learning (ML) models among multiple parties where each party can keep its data private. In this paradigm, only model updates, such as model weights or gradients, are shared. Many existing approaches have focused on horizontal FL, where each party has the entire feature set and labels in the training data set. Howe… ▽ More

    Submitted 16 June, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

  8. arXiv:2012.06670  [pdf, other

    cs.LG cs.DC

    Adaptive Histogram-Based Gradient Boosted Trees for Federated Learning

    Authors: Yuya Jeremy Ong, Yi Zhou, Nathalie Baracaldo, Heiko Ludwig

    Abstract: Federated Learning (FL) is an approach to collaboratively train a model across multiple parties without sharing data between parties or an aggregator. It is used both in the consumer domain to protect personal data as well as in enterprise settings, where dealing with data domicile regulation and the pragmatics of data silos are the main drivers. While gradient boosted tree implementations such as… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: 11 pages with 1 figure

  9. arXiv:2012.02447  [pdf, other

    cs.LG stat.ML

    Mitigating Bias in Federated Learning

    Authors: Annie Abay, Yi Zhou, Nathalie Baracaldo, Shashank Rajamoni, Ebube Chuba, Heiko Ludwig

    Abstract: As methods to create discrimination-aware models develop, they focus on centralized ML, leaving federated learning (FL) unexplored. FL is a rising approach for collaborative ML, in which an aggregator orchestrates multiple parties to train a global model without sharing their training data. In this paper, we discuss causes of bias in FL and propose three pre-processing and in-processing methods to… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  10. arXiv:2007.10987  [pdf, other

    cs.LG cs.CR cs.DC

    IBM Federated Learning: an Enterprise Framework White Paper V0.1

    Authors: Heiko Ludwig, Nathalie Baracaldo, Gegi Thomas, Yi Zhou, Ali Anwar, Shashank Rajamoni, Yuya Ong, Jayaram Radhakrishnan, Ashish Verma, Mathieu Sinn, Mark Purcell, Ambrish Rawat, Tran Minh, Naoise Holohan, Supriyo Chakraborty, Shalisha Whitherspoon, Dean Steuer, Laura Wynter, Hifaz Hassan, Sean Laguna, Mikhail Yurochkin, Mayank Agarwal, Ebube Chuba, Annie Abay

    Abstract: Federated Learning (FL) is an approach to conduct machine learning without centralizing training data in a single place, for reasons of privacy, confidentiality or data volume. However, solving federated machine learning problems raises issues above and beyond those of centralized machine learning. These issues include setting up communication infrastructure between parties, coordinating the learn… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 17 pages

    ACM Class: I.2.6; I.2.11

  11. arXiv:2001.09249  [pdf, other

    cs.LG cs.PF stat.ML

    TiFL: A Tier-based Federated Learning System

    Authors: Zheng Chai, Ahsan Ali, Syed Zawad, Stacey Truex, Ali Anwar, Nathalie Baracaldo, Yi Zhou, Heiko Ludwig, Feng Yan, Yue Cheng

    Abstract: Federated Learning (FL) enables learning a shared model across many clients without violating the privacy requirements. One of the key attributes in FL is the heterogeneity that exists in both resource and data due to the differences in computation and communication capacity, as well as the quantity and content of data among different clients. We conduct a case study to show that heterogeneity in… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

  12. HybridAlpha: An Efficient Approach for Privacy-Preserving Federated Learning

    Authors: Runhua Xu, Nathalie Baracaldo, Yi Zhou, Ali Anwar, Heiko Ludwig

    Abstract: Federated learning has emerged as a promising approach for collaborative and privacy-preserving learning. Participants in a federated learning process cooperatively train a model by exchanging model parameters instead of the actual training data, which they might want to keep private. However, parameter interaction and the resulting model still might disclose information about the training data us… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 12 pages, AISec 2019

  13. arXiv:1909.12946  [pdf, other

    cs.CY cs.CR cs.LG cs.SI q-fin.ST

    Towards Federated Graph Learning for Collaborative Financial Crimes Detection

    Authors: Toyotaro Suzumura, Yi Zhou, Natahalie Baracaldo, Guangnan Ye, Keith Houck, Ryo Kawahara, Ali Anwar, Lucia Larise Stavarache, Yuji Watanabe, Pablo Loyola, Daniel Klyashtorny, Heiko Ludwig, Kumar Bhaskaran

    Abstract: Financial crime is a large and growing problem, in some way touching almost every financial institution. Financial institutions are the front line in the war against financial crime and accordingly, must devote substantial human and technology resources to this effort. Current processes to detect financial misconduct have limitations in their ability to effectively differentiate between malicious… ▽ More

    Submitted 2 October, 2019; v1 submitted 19 September, 2019; originally announced September 2019.

  14. arXiv:1908.00073  [pdf, other

    cs.HC cs.GR

    Biased Average Position Estimates in Line and Bar Graphs: Underestimation, Overestimation, and Perceptual Pull

    Authors: Cindy Xiong, Cristina R. Ceja, Casimir J. H. Ludwig, Steven Franconeri

    Abstract: In visual depictions of data, position (i.e., the vertical height of a line or a bar) is believed to be the most precise way to encode information compared to other encodings (e.g., hue). Not only are other encodings less precise than position, but they can also be prone to systematic biases (e.g., color category boundaries can distort perceived differences between hues). By comparison, position's… ▽ More

    Submitted 31 July, 2019; originally announced August 2019.

  15. arXiv:1812.03224  [pdf, other

    cs.LG stat.ML

    A Hybrid Approach to Privacy-Preserving Federated Learning

    Authors: Stacey Truex, Nathalie Baracaldo, Ali Anwar, Thomas Steinke, Heiko Ludwig, Rui Zhang, Yi Zhou

    Abstract: Federated learning facilitates the collaborative training of models without the sharing of raw data. However, recent attacks demonstrate that simply maintaining data locality during training processes does not provide sufficient privacy guarantees. Rather, we need a federated learning system capable of preventing inference over both the messages exchanged during training and the final trained mode… ▽ More

    Submitted 14 August, 2019; v1 submitted 7 December, 2018; originally announced December 2018.

  16. arXiv:1811.03728  [pdf, other

    cs.LG cs.CR stat.ML

    Detecting Backdoor Attacks on Deep Neural Networks by Activation Clustering

    Authors: Bryant Chen, Wilka Carvalho, Nathalie Baracaldo, Heiko Ludwig, Benjamin Edwards, Taesung Lee, Ian Molloy, Biplav Srivastava

    Abstract: While machine learning (ML) models are being increasingly trusted to make decisions in different and varying areas, the safety of systems using such models has become an increasing concern. In particular, ML models are often trained on data from potentially untrustworthy sources, providing adversaries with the opportunity to manipulate them by inserting carefully crafted samples into the training… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

  17. arXiv:1811.00652  [pdf

    cs.DC

    Modeling IoT-aware Business Processes - A State of the Art Report

    Authors: Nadja Brouns, Samir Tata, Heiko Ludwig, E. Serral Asensio, Paul Grefen

    Abstract: This research report presents an analysis of the state of the art of modeling Internet of Things (IoT)-aware business processes. IOT links the physical world to the digital world. Traditionally, we would find information about events and processes in the physical world in the digital world entered by humans and humans using this information to control the physical world. In the IoT paradigm, the p… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: 42 pages

    Report number: RJ 10540

    Journal ref: IBM Research Report 2018

  18. arXiv:1807.01069  [pdf, other

    cs.LG stat.ML

    Adversarial Robustness Toolbox v1.0.0

    Authors: Maria-Irina Nicolae, Mathieu Sinn, Minh Ngoc Tran, Beat Buesser, Ambrish Rawat, Martin Wistuba, Valentina Zantedeschi, Nathalie Baracaldo, Bryant Chen, Heiko Ludwig, Ian M. Molloy, Ben Edwards

    Abstract: Adversarial Robustness Toolbox (ART) is a Python library supporting developers and researchers in defending Machine Learning models (Deep Neural Networks, Gradient Boosted Decision Trees, Support Vector Machines, Random Forests, Logistic Regression, Gaussian Processes, Decision Trees, Scikit-learn Pipelines, etc.) against adversarial threats and helps making AI systems more secure and trustworthy.… ▽ More

    Submitted 15 November, 2019; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 34 pages