Skip to main content

Showing 1–50 of 86 results for author: GuerraouI, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07215  [pdf, other

    cs.CR cs.DC

    DSig: Breaking the Barrier of Signatures in Data Centers

    Authors: Marcos K. Aguilera, Clément Burgelin, Rachid Guerraoui, Antoine Murat, Athanasios Xygkis, Igor Zablotchi

    Abstract: Data centers increasingly host mutually distrustful users on shared infrastructure. A powerful tool to safeguard such users are digital signatures. Digital signatures have revolutionized Internet-scale applications, but current signatures are too slow for the growing genre of microsecond-scale systems in modern data centers. We propose DSig, the first digital signature system to achieve single-dig… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: To appear in the proceedings of OSDI '24. Authors listed in alphabetical order

  2. arXiv:2405.14670  [pdf, other

    cs.LG

    Overcoming the Challenges of Batch Normalization in Federated Learning

    Authors: Rachid Guerraoui, Rafael Pinot, Geovani Rizk, John Stephan, François Taiani

    Abstract: Batch normalization has proven to be a very beneficial mechanism to accelerate the training and improve the accuracy of deep neural networks in centralized environments. Yet, the scheme faces significant challenges in federated learning, especially under high data heterogeneity. Essentially, the main challenges arise from external covariate shifts and inconsistent statistics across clients. We int… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.14432  [pdf, other

    cs.LG

    Boosting Robustness by Clip** Gradients in Distributed Learning

    Authors: Youssef Allouah, Rachid Guerraoui, Nirupam Gupta, Ahmed Jellouli, Geovani Rizk, John Stephan

    Abstract: Robust distributed learning consists in achieving good learning performance despite the presence of misbehaving workers. State-of-the-art (SOTA) robust distributed gradient descent (Robust-DGD) methods, relying on robust aggregation, have been proven to be optimal: Their learning error matches the lower bound established under the standard heterogeneity model of $(G, B)$-gradient dissimilarity. Th… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2405.01031  [pdf, other

    cs.LG cs.CR cs.DC math.OC stat.ML

    The Privacy Power of Correlated Noise in Decentralized Learning

    Authors: Youssef Allouah, Anastasia Koloskova, Aymane El Firdoussi, Martin Jaggi, Rachid Guerraoui

    Abstract: Decentralized learning is appealing as it enables the scalable usage of large amounts of distributed data and resources (without resorting to any central entity), while promoting privacy since every user minimizes the direct exposure of their data. Yet, without additional precautions, curious users can still leverage models obtained from their peers to violate privacy. In this paper, we propose De… ▽ More

    Submitted 3 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted as conference paper at ICML 2024

  5. arXiv:2405.00491  [pdf, ps, other

    cs.LG

    On the Relevance of Byzantine Robust Optimization Against Data Poisoning

    Authors: Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot

    Abstract: The success of machine learning (ML) has been intimately linked with the availability of large amounts of data, typically collected from heterogeneous sources and processed on vast networks of computing devices (also called {\em workers}). Beyond accuracy, the use of ML in critical domains such as healthcare and autonomous driving calls for robustness against {\em data poisoning}and some {\em faul… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 38 pages

  6. arXiv:2403.08374  [pdf, other

    cs.DC

    Efficient Signature-Free Validated Agreement

    Authors: Pierre Civit, Muhammad Ayaz Dzulfikar, Seth Gilbert, Rachid Guerraoui, Jovan Komatovic, Manuel Vidigueira, Igor Zablotchi

    Abstract: Byzantine agreement enables n processes to agree on a common L-bit value, despite up to t > 0 arbitrary failures. A long line of work has been dedicated to improving the bit complexity of Byzantine agreement in synchrony. This has culminated in COOL, an error-free (deterministically secure against a computationally unbounded adversary) solution that achieves O(nL + n^2 logn) worst-case bit complex… ▽ More

    Submitted 17 May, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  7. arXiv:2402.12780  [pdf, other

    cs.LG

    Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local Updates

    Authors: Youssef Allouah, Sadegh Farhadkhani, Rachid GuerraouI, Nirupam Gupta, Rafael Pinot, Geovani Rizk, Sasha Voitovych

    Abstract: The possibility of adversarial (a.k.a., {\em Byzantine}) clients makes federated learning (FL) prone to arbitrary manipulation. The natural approach to robustify FL against adversarial clients is to replace the simple averaging operation at the server in the standard $\mathsf{FedAvg}$ algorithm by a \emph{robust averaging rule}. While a significant amount of work has been devoted to studying the c… ▽ More

    Submitted 10 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  8. arXiv:2402.10059  [pdf, other

    cs.DC

    Partial Synchrony for Free? New Upper Bounds for Byzantine Agreement

    Authors: Pierre Civit, Muhammad Ayaz Dzulfikar, Seth Gilbert, Rachid Guerraoui, Jovan Komatovic, Manuel Vidigueira, Igor Zablotchi

    Abstract: Byzantine agreement allows n processes to decide on a common value, in spite of arbitrary failures. The seminal Dolev-Reischuk bound states that any deterministic solution to Byzantine agreement exchanges Omega(n^2) bits. In synchronous networks, solutions with optimal O(n^2) bit complexity, optimal fault tolerance, and no cryptography have been established for over three decades. However, these s… ▽ More

    Submitted 5 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  9. arXiv:2312.14712  [pdf, other

    cs.LG cs.CR cs.DC

    Robustness, Efficiency, or Privacy: Pick Two in Machine Learning

    Authors: Youssef Allouah, Rachid Guerraoui, John Stephan

    Abstract: The success of machine learning (ML) applications relies on vast datasets and distributed architectures which, as they grow, present major challenges. In real-world scenarios, where data often contains sensitive information, issues like data poisoning and hardware failures are common. Ensuring privacy and robustness is vital for the broad adoption of ML in public life. This paper examines the cost… ▽ More

    Submitted 11 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  10. arXiv:2311.08060  [pdf, other

    cs.DC

    All Byzantine Agreement Problems are Expensive

    Authors: Pierre Civit, Seth Gilbert, Rachid Guerraoui, Jovan Komatovic, Anton Paramonov, Manuel Vidigueira

    Abstract: Byzantine agreement, arguably the most fundamental problem in distributed computing, operates among n processes, out of which t < n can exhibit arbitrary failures. The problem states that all correct (non-faulty) processes must eventually decide (termination) the same value (agreement) from a set of admissible values defined by the proposals of the processes (validity). Depending on the exact vers… ▽ More

    Submitted 15 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

  11. arXiv:2310.01972  [pdf, other

    cs.LG cs.DC

    Epidemic Learning: Boosting Decentralized Learning with Randomized Communication

    Authors: Martijn de Vos, Sadegh Farhadkhani, Rachid Guerraoui, Anne-Marie Kermarrec, Rafael Pires, Rishi Sharma

    Abstract: We present Epidemic Learning (EL), a simple yet powerful decentralized learning (DL) algorithm that leverages changing communication topologies to achieve faster model convergence compared to conventional DL approaches. At each round of EL, each node sends its model updates to a random sample of $s$ other nodes (in a system of $n$ nodes). We provide an extensive theoretical analysis of EL, demonst… ▽ More

    Submitted 27 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted paper at NeurIPS 2023

  12. arXiv:2309.13591  [pdf, other

    cs.LG cs.DC math.OC

    Robust Distributed Learning: Tight Error Bounds and Breakdown Point under Data Heterogeneity

    Authors: Youssef Allouah, Rachid Guerraoui, Nirupam Gupta, Rafaël Pinot, Geovani Rizk

    Abstract: The theory underlying robust distributed learning algorithms, designed to resist adversarial machines, matches empirical observations when data is homogeneous. Under data heterogeneity however, which is the norm in practical scenarios, established lower bounds on the learning error are essentially vacuous and greatly mismatch empirical observations. This is because the heterogeneity model consider… ▽ More

    Submitted 28 October, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted to NeurIPS 2023

  13. arXiv:2309.05395  [pdf, other

    cs.LG cs.CR cs.DC

    SABLE: Secure And Byzantine robust LEarning

    Authors: Antoine Choffrut, Rachid Guerraoui, Rafael Pinot, Renaud Sirdey, John Stephan, Martin Zuber

    Abstract: Due to the widespread availability of data, machine learning (ML) algorithms are increasingly being implemented in distributed topologies, wherein various nodes collaborate to train ML models via the coordination of a central server. However, distributed learning approaches face significant vulnerabilities, primarily stemming from two potential threats. Firstly, the presence of Byzantine nodes pos… ▽ More

    Submitted 14 December, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

  14. arXiv:2308.03524  [pdf, ps, other

    cs.DC

    Strong Byzantine Agreement with Adaptive Word Complexity

    Authors: Pierre Civit, Seth Gilbert, Rachid Guerraoui, Jovan Komatovic, Manuel Vidigueira

    Abstract: The strong Byzantine agreement (SBA) problem is defined among n processes, out of which t < n can be faulty and behave arbitrarily. SBA allows correct (non-faulty) processes to agree on a common value. Moreover, if all correct processes have proposed the same value, only that value can be agreed upon. It has been known for a long time that any solution to the SBA problem incurs quadratic worst-cas… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  15. arXiv:2308.02477  [pdf, other

    cs.DC cs.CR

    On the Inherent Anonymity of Gossi**

    Authors: Rachid Guerraoui, Anne-Marie Kermarrec, Anastasiia Kucherenko, Rafael Pinot, Sasha Voitovych

    Abstract: Detecting the source of a gossip is a critical issue, related to identifying patient zero in an epidemic, or the origin of a rumor in a social network. Although it is widely acknowledged that random and local gossip communications make source identification difficult, there exists no general quantification of the level of anonymity provided to the source. This paper presents a principled method ba… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Full version of DISC2023 paper

  16. arXiv:2306.09991  [pdf, other

    cs.NE cs.LG q-bio.PE

    Evolutionary Algorithms in the Light of SGD: Limit Equivalence, Minima Flatness, and Transfer Learning

    Authors: Andrei Kucharavy, Rachid Guerraoui, Ljiljana Dolamic

    Abstract: Whenever applicable, the Stochastic Gradient Descent (SGD) has shown itself to be unreasonably effective. Instead of underperforming and getting trapped in local minima due to the batch noise, SGD leverages it to learn to generalize better and find minima that are good enough for the entire dataset. This led to numerous theoretical and experimental investigations, especially in the context of Arti… ▽ More

    Submitted 20 May, 2023; originally announced June 2023.

    Comments: To be published in ALIFE 2023; 16 pages, 10 figures, 1 listing

    ACM Class: I.2.8; G.1.6

  17. arXiv:2306.00431  [pdf, other

    cs.DC

    Every Bit Counts in Consensus

    Authors: Pierre Civit, Seth Gilbert, Rachid Guerraoui, Jovan Komatovic, Matteo Monti, Manuel Vidigueira

    Abstract: Consensus enables n processes to agree on a common valid L-bit value, despite t < n/3 processes being faulty and acting arbitrarily. A long line of work has been dedicated to improving the worst-case communication complexity of consensus in partial synchrony. This has recently culminated in the worst-case word complexity of O(n^2). However, the worst-case bit complexity of the best solution is sti… ▽ More

    Submitted 7 August, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  18. arXiv:2304.13540  [pdf, ps, other

    cs.DC cs.LG cs.NE

    Byzantine-Resilient Learning Beyond Gradients: Distributing Evolutionary Search

    Authors: Andrei Kucharavy, Matteo Monti, Rachid Guerraoui, Ljiljana Dolamic

    Abstract: Modern machine learning (ML) models are capable of impressive performances. However, their prowess is not due only to the improvements in their architecture and training algorithms but also to a drastic increase in computational power used to train them. Such a drastic increase led to a growing interest in distributed ML, which in turn made worker failures and adversarial attacks an increasingly… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 10 pages, 4 listings, 2 theorems

    ACM Class: I.2.11; D.1.3; F.1.2

  19. arXiv:2304.08968  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to Fine-Tune and Hard to Detect with other LLMs

    Authors: Da Silva Gameiro Henrique, Andrei Kucharavy, Rachid Guerraoui

    Abstract: The self-attention revolution allowed generative language models to scale and achieve increasingly impressive abilities. Such models - commonly referred to as Large Language Models (LLMs) - have recently gained prominence with the general public, thanks to conversational fine-tuning, putting their behavior in line with public expectations regarding AI. This prominence amplified prior concerns rega… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 15 pages, 6 figures; 10 pages, 7 figures Supplementary Materials; under review at ECML 2023

    ACM Class: I.2.7; K.6.5

  20. arXiv:2304.07081  [pdf, other

    cs.DC cs.CR

    Chop Chop: Byzantine Atomic Broadcast to the Network Limit

    Authors: Martina Camaioni, Rachid Guerraoui, Matteo Monti, Pierre-Louis Roman, Manuel Vidigueira, Gauthier Voron

    Abstract: At the heart of state machine replication, the celebrated technique enabling decentralized and secure universal computation, lies Atomic Broadcast, a fundamental communication primitive that orders, authenticates, and deduplicates messages. This paper presents Chop Chop, a Byzantine Atomic Broadcast system that amortizes the cost of ordering, authenticating and deduplicating messages, achieving "l… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  21. arXiv:2302.04787  [pdf, other

    cs.LG cs.CR cs.DC

    On the Privacy-Robustness-Utility Trilemma in Distributed Learning

    Authors: Youssef Allouah, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, John Stephan

    Abstract: The ubiquity of distributed machine learning (ML) in sensitive public domain applications calls for algorithms that protect data privacy, while being robust to faults and adversarial behaviors. Although privacy and robustness have been extensively studied independently in distributed ML, their synthesis remains poorly understood. We present the first tight analysis of the error incurred by any alg… ▽ More

    Submitted 29 May, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted paper at ICML

  22. arXiv:2302.01772  [pdf, other

    cs.LG cs.DC

    Fixing by Mixing: A Recipe for Optimal Byzantine ML under Heterogeneity

    Authors: Youssef Allouah, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, John Stephan

    Abstract: Byzantine machine learning (ML) aims to ensure the resilience of distributed learning algorithms to misbehaving (or Byzantine) machines. Although this problem received significant attention, prior works often assume the data held by the machines to be homogeneous, which is seldom true in practical settings. Data heterogeneity makes Byzantine ML considerably more challenging, since a Byzantine mach… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted paper at AISTATS 2023

  23. arXiv:2301.04920  [pdf, other

    cs.DC

    On the Validity of Consensus

    Authors: Pierre Civit, Seth Gilbert, Rachid Guerraoui, Jovan Komatovic, Manuel Vidigueira

    Abstract: The Byzantine consensus problem involves $n$ processes, out of which t < n could be faulty and behave arbitrarily. Three properties characterize consensus: (1) termination, requiring correct (non-faulty) processes to eventually reach a decision, (2) agreement, preventing them from deciding different values, and (3) validity, precluding ``unreasonable'' decisions. But, what is a reasonable decision… ▽ More

    Submitted 25 June, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: The extended version of the PODC 2023 paper

  24. arXiv:2210.17174  [pdf, other

    cs.DC

    uBFT: Microsecond-scale BFT using Disaggregated Memory [Extended Version]

    Authors: Marcos K. Aguilera, Naama Ben-David, Rachid Guerraoui, Antoine Murat, Athanasios Xygkis, Igor Zablotchi

    Abstract: We propose uBFT, the first State Machine Replication (SMR) system to achieve microsecond-scale latency in data centers, while using only $2f{+}1$ replicas to tolerate $f$ Byzantine failures. The Byzantine Fault Tolerance (BFT) provided by uBFT is essential as pure crashes appear to be a mere illusion with real-life systems reportedly failing in many unexpected ways. uBFT relies on a small non-tail… ▽ More

    Submitted 16 March, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

  25. arXiv:2210.13242  [pdf, other

    cs.CR cs.DC

    SurferMonkey: A Decentralized Anonymous Blockchain Intercommunication System via Zero Knowledge Proofs

    Authors: Miguel Díaz Montiel, Rachid Guerraoui, Pierre-Louis Roman

    Abstract: Blockchain intercommunication systems enable the exchanges of messages between blockchains. This interoperability promotes innovation, unlocks liquidity and access to assets. However, blockchains are isolated systems that originally were not designed for interoperability. This makes cross-chain communication, or bridges for short, insecure by nature. More precisely, cross-chain systems face securi… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  26. arXiv:2210.08650  [pdf, other

    cs.LG cs.DB

    Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores

    Authors: Arsany Guirguis, Diana Petrescu, Florin Dinu, Do Le Quoc, Javier Picorel, Rachid Guerraoui

    Abstract: Near-data computation techniques have been successfully deployed to mitigate the cloud network bottleneck between the storage and compute tiers. At Huawei, we are currently looking to get more value from these techniques by broadening their applicability. Machine learning (ML) applications are an appealing and timely target. This paper describes our experience applying near-data computation techni… ▽ More

    Submitted 9 January, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: 14 pages, 14 figures, 5 tables

  27. arXiv:2209.15259  [pdf, ps, other

    cs.LG cs.AI cs.CR

    On the Impossible Safety of Large AI Models

    Authors: El-Mahdi El-Mhamdi, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Lê-Nguyên Hoang, Rafael Pinot, Sébastien Rouault, John Stephan

    Abstract: Large AI Models (LAIMs), of which large language models are the most prominent recent example, showcase some impressive performance. However they have been empirically found to pose serious security issues. This paper systematizes our knowledge about the fundamental impossibility of building arbitrarily accurate and secure machine learning models. More precisely, we identify key challenging featur… ▽ More

    Submitted 9 May, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 40 pages

  28. arXiv:2209.13304  [pdf, other

    cs.DC

    Oracular Byzantine Reliable Broadcast [Extended Version]

    Authors: Martina Camaioni, Rachid Guerraoui, Matteo Monti, Manuel Vidigueira

    Abstract: Byzantine Reliable Broadcast (BRB) is a fundamental distributed computing primitive, with applications ranging from notifications to asynchronous payment systems. Motivated by practical consideration, we study Client-Server Byzantine Reliable Broadcast (CSB), a multi-shot variant of BRB whose interface is split between broadcasting clients and delivering servers. We present Draft, an optimally res… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  29. arXiv:2209.10931  [pdf, other

    cs.LG cs.DC

    Robust Collaborative Learning with Linear Gradient Overhead

    Authors: Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Lê Nguyên Hoang, Rafael Pinot, John Stephan

    Abstract: Collaborative learning algorithms, such as distributed SGD (or D-SGD), are prone to faulty machines that may deviate from their prescribed algorithm because of software or hardware bugs, poisoned data or malicious behaviors. While many solutions have been proposed to enhance the robustness of D-SGD to such machines, previous works either resort to strong assumptions (trusted server, homogeneous da… ▽ More

    Submitted 3 June, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: Accepted paper at ICML 2023

  30. arXiv:2209.09580  [pdf, other

    cs.DC

    Carbon: An Asynchronous Voting-Based Payment System for a Client-Server Architecture

    Authors: Martina Camaioni, Rachid Guerraoui, Jovan Komatovic, Matteo Monti, Manuel Vidigueira

    Abstract: We present Carbon, an asynchronous payment system. To the best of our knowledge, Carbon is the first asynchronous payment system designed specifically for a client-server architecture. Namely, besides being able to make payments, clients of Carbon are capable of changing the set of running servers using a novel voting mechanism -- asynchronous, balance-based voting.

    Submitted 30 September, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

  31. arXiv:2208.09262  [pdf, other

    cs.DC

    Byzantine Consensus is Θ(n^2): The Dolev-Reischuk Bound is Tight even in Partial Synchrony! [Extended Version]

    Authors: Pierre Civit, Muhammad Ayaz Dzulfikar, Seth Gilbert, Vincent Gramoli, Rachid Guerraoui, Jovan Komatovic, Manuel Vidigueira

    Abstract: The Dolev-Reischuk bound says that any deterministic Byzantine consensus protocol has (at least) quadratic communication complexity in the worst case. While it has been shown that the bound is tight in synchronous environments, it is still unknown whether a consensus protocol with quadratic communication complexity can be obtained in partial synchrony. Until now, the most efficient known solutions… ▽ More

    Submitted 6 September, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

  32. arXiv:2205.12173  [pdf, other

    cs.LG cs.DC

    Byzantine Machine Learning Made Easy by Resilient Averaging of Momentums

    Authors: Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, John Stephan

    Abstract: Byzantine resilience emerged as a prominent topic within the distributed machine learning community. Essentially, the goal is to enhance distributed optimization algorithms, such as distributed SGD, in a way that guarantees convergence despite the presence of some misbehaving (a.k.a., {\em Byzantine}) workers. Although a myriad of techniques addressing the problem have been proposed, the field arg… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted at ICML 2022

  33. arXiv:2202.08656  [pdf, other

    cs.GT econ.TH

    Robust Sparse Voting

    Authors: Youssef Allouah, Rachid Guerraoui, Lê-Nguyên Hoang, Oscar Villemaud

    Abstract: Many applications, such as content moderation and recommendation, require reviewing and scoring a large number of alternatives. Doing so robustly is however very challenging. Indeed, voters' inputs are inevitably sparse: most alternatives are only scored by a small fraction of voters. This sparsity amplifies the effects of biased voters introducing unfairness, and of malicious voters seeking to ha… ▽ More

    Submitted 25 January, 2024; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: Accepted at AISTATS 2024

  34. arXiv:2202.08578  [pdf, other

    cs.LG stat.ML

    An Equivalence Between Data Poisoning and Byzantine Gradient Attacks

    Authors: Sadegh Farhadkhani, Rachid Guerraoui, Lê-Nguyên Hoang, Oscar Villemaud

    Abstract: To study the resilience of distributed learning, the "Byzantine" literature considers a strong threat model where workers can report arbitrary gradients to the parameter server. Whereas this model helped obtain several fundamental results, it has sometimes been considered unrealistic, when the workers are mostly trustworthy machines. In this paper, we show a surprising equivalence between this mod… ▽ More

    Submitted 20 July, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2106.02398

    Journal ref: ICML 2022

  35. arXiv:2110.03991  [pdf, other

    cs.LG cs.CR

    Combining Differential Privacy and Byzantine Resilience in Distributed SGD

    Authors: Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, Sebastien Rouault, John Stephan

    Abstract: Privacy and Byzantine resilience (BR) are two crucial requirements of modern-day distributed machine learning. The two concepts have been extensively studied individually but the question of how to combine them effectively remains unanswered. This paper contributes to addressing this question by studying the extent to which the distributed SGD algorithm, in the standard parameter-server architectu… ▽ More

    Submitted 5 October, 2023; v1 submitted 8 October, 2021; originally announced October 2021.

  36. arXiv:2108.01330  [pdf, other

    cs.DC

    Frugal Byzantine Computing

    Authors: M. K. Aguilera, N. Ben-David, R. Guerraoui, D. Papuc, A. Xygkis, I. Zablotchi

    Abstract: Traditional techniques for handling Byzantine failures are expensive: digital signatures are too costly, while using $3f{+}1$ replicas is uneconomical ($f$ denotes the maximum number of Byzantine processes). We seek algorithms that reduce the number of replicas to $2f{+}1$ and minimize the number of signatures. While the first goal can be achieved in the message-and-memory model, accomplishing the… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: This paper is an extended version of the DISC 2021 paper

  37. arXiv:2106.08676  [pdf, ps, other

    cs.DC

    Velos: One-sided Paxos for RDMA applications

    Authors: Rachid Guerraoui, Antoine Murat, Athanasios Xygkis

    Abstract: Modern data centers are becoming increasingly equipped with RDMA-capable NICs. These devices enable distributed systems to rely on algorithms designed for shared memory. RDMA allows consensus to terminate within a few microsecond in failure-free scenarios, yet, RDMA-optimized algorithms still use expensive two-sided operations in case of failure. In this work, we present a new leader-based consens… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  38. arXiv:2106.02398  [pdf, other

    cs.LG cs.GT stat.ML

    Strategyproof Learning: Building Trustworthy User-Generated Datasets

    Authors: Sadegh Farhadkhani, Rachid Guerraoui, Lê-Nguyên Hoang

    Abstract: We prove in this paper that, perhaps surprisingly, incentivizing data misreporting is not a fatality. By leveraging a careful design of the loss function, we propose Licchavi, a global and personalized learning framework with provable strategyproofness guarantees. Essentially, we prove that no user can gain much by replying to Licchavi's queries with answers that deviate from their true preference… ▽ More

    Submitted 18 February, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: 31 pages

  39. arXiv:2106.02394  [pdf, other

    cs.GT cs.MA

    On the Strategyproofness of the Geometric Median

    Authors: El-Mahdi El-Mhamdi, Sadegh Farhadkhani, Rachid Guerraoui, Lê-Nguyên Hoang

    Abstract: The geometric median, an instrumental component of the secure machine learning toolbox, is known to be effective when robustly aggregating models (or gradients), gathered from potentially malicious (or strategic) users. What is less known is the extent to which the geometric median incentivizes dishonest behaviors. This paper addresses this fundamental question by quantifying its strategyproofness… ▽ More

    Submitted 2 June, 2023; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Accepted paper at AISTATS 2023

  40. arXiv:2102.08166  [pdf, other

    cs.LG cs.CR cs.DC

    Differential Privacy and Byzantine Resilience in SGD: Do They Add Up?

    Authors: Rachid Guerraoui, Nirupam Gupta, Rafaël Pinot, Sébastien Rouault, John Stephan

    Abstract: This paper addresses the problem of combining Byzantine resilience with privacy in machine learning (ML). Specifically, we study if a distributed implementation of the renowned Stochastic Gradient Descent (SGD) learning algorithm is feasible with both differential privacy (DP) and $(α,f)$-Byzantine resilience. To the best of our knowledge, this is the first work to tackle this problem from a theor… ▽ More

    Submitted 24 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  41. arXiv:2010.06288  [pdf, other

    cs.DC

    Microsecond Consensus for Microsecond Applications

    Authors: Marcos K. Aguilera, Naama Ben-David, Rachid Guerraoui, Virendra J. Marathe, Athanasios Xygkis, Igor Zablotchi

    Abstract: We consider the problem of making apps fault-tolerant through replication, when apps operate at the microsecond scale, as in finance, embedded computing, and microservices apps. These apps need a replication scheme that also operates at the microsecond scale, otherwise replication becomes a burden. We propose Mu, a system that takes less than 1.3 microseconds to replicate a (small) request in memo… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Full version of OSDI'20 paper

  42. arXiv:2010.05888  [pdf, other

    cs.LG

    Garfield: System Support for Byzantine Machine Learning

    Authors: Rachid Guerraoui, Arsany Guirguis, Jérémy Max Plassmann, Anton Alexandre Ragot, Sébastien Rouault

    Abstract: We present Garfield, a library to transparently make machine learning (ML) applications, initially built with popular (but fragile) frameworks, e.g., TensorFlow and PyTorch, Byzantine-resilient. Garfield relies on a novel object-oriented design, reducing the coding effort, and addressing the vulnerability of the shared-graph architecture followed by classical ML frameworks. Garfield encompasses va… ▽ More

    Submitted 31 December, 2020; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: 31 pages; 16 figures; 2 tables

  43. arXiv:2008.02527  [pdf, other

    cs.DC

    Efficient Multi-word Compare and Swap

    Authors: Rachid Guerraoui, Alex Kogan, Virendra J. Marathe, Igor Zablotchi

    Abstract: Atomic lock-free multi-word compare-and-swap (MCAS) is a powerful tool for designing concurrent algorithms. Yet, its widespread usage has been limited because lock-free implementations of MCAS make heavy use of expensive compare-and-swap (CAS) instructions. Existing MCAS implementations indeed use at least 2k+1 CASes per k-CAS. This leads to the natural desire to minimize the number of CASes requi… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: Full version of DISC '20 paper

  44. arXiv:2008.00742  [pdf, other

    cs.LG cs.DC stat.ML

    Collaborative Learning in the Jungle (Decentralized, Byzantine, Heterogeneous, Asynchronous and Nonconvex Learning)

    Authors: El-Mahdi El-Mhamdi, Sadegh Farhadkhani, Rachid Guerraoui, Arsany Guirguis, Lê Nguyên Hoang, Sébastien Rouault

    Abstract: We study Byzantine collaborative learning, where $n$ nodes seek to collectively learn from each others' local data. The data distribution may vary from one node to another. No node is trusted, and $f < n$ nodes can behave arbitrarily. We prove that collaborative learning is equivalent to a new form of agreement, which we call averaging agreement. In this problem, nodes start each with an initial v… ▽ More

    Submitted 1 December, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: 34 pages, 1 figure

    Journal ref: NeurIPS 2021

  45. arXiv:2006.07273  [pdf, other

    cs.LG cs.DC stat.ML

    FLeet: Online Federated Learning via Staleness Awareness and Performance Prediction

    Authors: Georgios Damaskinos, Rachid Guerraoui, Anne-Marie Kermarrec, Vlad Nitu, Rhicheek Patra, Francois Taiani

    Abstract: Federated Learning (FL) is very appealing for its privacy benefits: essentially, a global model is trained with updates computed on mobile devices while kee** the data of users local. Standard FL infrastructures are however designed to have no energy or performance impact on mobile devices, and are therefore not suitable for applications that require frequent (online) model updates, such as news… ▽ More

    Submitted 3 December, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  46. arXiv:2006.07272  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially Private Stochastic Coordinate Descent

    Authors: Georgios Damaskinos, Celestine Mendler-Dünner, Rachid Guerraoui, Nikolaos Papandreou, Thomas Parnell

    Abstract: In this paper we tackle the challenge of making the stochastic coordinate descent algorithm differentially private. Compared to the classical gradient descent algorithm where updates operate on a single model vector and controlled noise addition to this vector suffices to hide critical information about individuals, stochastic coordinate descent crucially relies on kee** auxiliary information in… ▽ More

    Submitted 14 March, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

  47. arXiv:2006.04720  [pdf, other

    cs.NE cs.LG q-bio.PE stat.ML

    Host-Pathongen Co-evolution Inspired Algorithm Enables Robust GAN Training

    Authors: Andrei Kucharavy, El Mahdi El Mhamdi, Rachid Guerraoui

    Abstract: Generative adversarial networks (GANs) are pairs of artificial neural networks that are trained one against each other. The outputs from a generator are mixed with the real-world inputs to the discriminator and both networks are trained until an equilibrium is reached, where the discriminator cannot distinguish generated inputs from real ones. Since their introduction, GANs have allowed for the ge… ▽ More

    Submitted 9 June, 2020; v1 submitted 22 May, 2020; originally announced June 2020.

    Comments: 8 pages, 10 figures

    MSC Class: 92B20; 68T05; ACM Class: I.5.2

  48. arXiv:2004.13184  [pdf, other

    cs.DC

    Online Payments by Merely Broadcasting Messages (Extended Version)

    Authors: Daniel Collins, Rachid Guerraoui, Jovan Komatovic, Matteo Monti, Athanasios Xygkis, Matej Pavlovic, Petr Kuznetsov, Yvonne-Anne Pignolet, Dragos-Adrian Seredinschi, Andrei Tonkikh

    Abstract: We address the problem of online payments, where users can transfer funds among themselves. We introduce Astro, a system solving this problem efficiently in a decentralized, deterministic, and completely asynchronous manner. Astro builds on the insight that consensus is unnecessary to prevent double-spending. Instead of consensus, Astro relies on a weaker primitive---Byzantine reliable broadcast--… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: This is an extended version of a conference article, appearing in the proceedings of the 50th IEEE/IFIP Int. Conference on Dependable Systems and Networks (DSN 2020). This work has been supported in part by the European grant 862082, AT2 -- ERC-2019-PoC, and in part by a grant from Interchain Foundation

  49. arXiv:2003.00010  [pdf, other

    cs.LG cs.CR cs.DC

    Distributed Momentum for Byzantine-resilient Learning

    Authors: El-Mahdi El-Mhamdi, Rachid Guerraoui, Sébastien Rouault

    Abstract: Momentum is a variant of gradient descent that has been proposed for its benefits on convergence. In a distributed setting, momentum can be implemented either at the server or the worker side. When the aggregation rule used by the server is linear, commutativity with addition makes both deployments equivalent. Robustness and privacy are however among motivations to abandon linear aggregation rules… ▽ More

    Submitted 9 March, 2020; v1 submitted 28 February, 2020; originally announced March 2020.

    Comments: Source code (for academic use only): https://github.com/LPD-EPFL/ByzantineMomentum

  50. arXiv:2001.06271  [pdf, other

    cs.DC

    Dynamic Byzantine Reliable Broadcast [Technical Report]

    Authors: Rachid Guerraoui, Jovan Komatovic, Petr Kuznetsov, Yvonne-Anne Pignolet, Dragos-Adrian Seredinschi, Andrei Tonkikh

    Abstract: Reliable broadcast is a communication primitive guaranteeing, intuitively, that all processes in a distributed system deliver the same set of messages. The reason why this primitive is appealing is twofold: (i) we can implement it deterministically in a completely asynchronous environment, unlike stronger primitives like consensus and total-order broadcast, and yet (ii) reliable broadcast is power… ▽ More

    Submitted 20 November, 2020; v1 submitted 17 January, 2020; originally announced January 2020.

    Comments: This is an extended version of a conference article, appearingin the proceedings of the 24th Int. Conference on Principles of Distributed Systems (OPODIS 2020). This work has been supported in part by a grant from Interchain Foundation