Skip to main content

Showing 1–17 of 17 results for author: Gouy-Pailler, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.07670  [pdf, other

    cs.LG cs.AI

    Federated Dataset Dictionary Learning for Multi-Source Domain Adaptation

    Authors: Fabiola Espinoza Castellon, Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Aurélien Mayoue, Antoine Souloumiac, Cédric Gouy-Pailler

    Abstract: In this article, we propose an approach for federated domain adaptation, a setting where distributional shift exists among clients and some have unlabeled data. The proposed framework, FedDaDiL, tackles the resulting challenge through dictionary learning of empirical distributions. In our setting, clients' distributions represent particular domains, and FedDaDiL collectively trains a federated dic… ▽ More

    Submitted 8 November, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 7 pages,2 figures; v2: fixed typos

  2. arXiv:2304.02959  [pdf, other

    cs.CR cs.LG

    When approximate design for fast homomorphic computation provides differential privacy guarantees

    Authors: Arnaud Grivet Sébert, Martin Zuber, Oana Stan, Renaud Sirdey, Cédric Gouy-Pailler

    Abstract: While machine learning has become pervasive in as diversified fields as industry, healthcare, social networks, privacy concerns regarding the training data have gained a critical importance. In settings where several parties wish to collaboratively train a common model without jeopardizing their sensitive data, the need for a private training protocol is particularly stringent and implies to prote… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 28 pages, 2 figures, 3 tables

  3. arXiv:2206.08752  [pdf, other

    cs.LG

    Federated learning with incremental clustering for heterogeneous data

    Authors: Fabiola Espinoza Castellon, Aurelien Mayoue, Jacques-Henri Sublemontier, Cedric Gouy-Pailler

    Abstract: Federated learning enables different parties to collaboratively build a global model under the orchestration of a server while kee** the training data on clients' devices. However, performance is affected when clients have heterogeneous data. To cope with this problem, we assume that despite data heterogeneity, there are groups of clients who have similar data distributions that can be clustered… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  4. arXiv:2205.04330  [pdf, other

    cs.CR cs.LG

    Protecting Data from all Parties: Combining FHE and DP in Federated Learning

    Authors: Arnaud Grivet Sébert, Renaud Sirdey, Oana Stan, Cédric Gouy-Pailler

    Abstract: This paper tackles the problem of ensuring training data privacy in a federated learning context. Relying on Homomorphic Encryption (HE) and Differential Privacy (DP), we propose a framework addressing threats on the privacy of the training data. Notably, the proposed framework ensures the privacy of the training data from all actors of the learning process, namely the data owners and the aggregat… ▽ More

    Submitted 31 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 21 pages, 2 figures, 2 tables

    ACM Class: I.2.6; E.3

  5. arXiv:2102.10875  [pdf, other

    cs.LG

    On the robustness of randomized classifiers to adversarial examples

    Authors: Rafael Pinot, Laurent Meunier, Florian Yger, Cédric Gouy-Pailler, Yann Chevaleyre, Jamal Atif

    Abstract: This paper investigates the theory of robustness against adversarial attacks. We focus on randomized classifiers (\emph{i.e.} classifiers that output random variables) and provide a thorough analysis of their behavior through the lens of statistical learning theory and information theory. To this aim, we introduce a new notion of robustness for randomized classifiers, enforcing local Lipschitzness… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

  6. SPEED: Secure, PrivatE, and Efficient Deep learning

    Authors: Arnaud Grivet Sébert, Rafael Pinot, Martin Zuber, Cédric Gouy-Pailler, Renaud Sirdey

    Abstract: We introduce a deep learning framework able to deal with strong privacy constraints. Based on collaborative learning, differential privacy and homomorphic encryption, the proposed approach advances state-of-the-art of private deep learning against a wider range of threats, in particular the honest-but-curious server assumption. We address threats from both the aggregation server, the global model… ▽ More

    Submitted 26 March, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 32 pages, 3 figures. Mach Learn (2021)

  7. arXiv:1906.07982  [pdf, ps, other

    cs.LG cs.CR stat.ML

    A unified view on differential privacy and robustness to adversarial examples

    Authors: Rafael Pinot, Florian Yger, Cédric Gouy-Pailler, Jamal Atif

    Abstract: This short note highlights some links between two lines of research within the emerging topic of trustworthy machine learning: differential privacy and robustness to adversarial examples. By abstracting the definitions of both notions, we show that they build upon the same theoretical ground and hence results obtained so far in one domain can be transferred to the other. More precisely, our analys… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

  8. arXiv:1902.01148  [pdf, other

    cs.LG cs.CR stat.ML

    Theoretical evidence for adversarial robustness through randomization

    Authors: Rafael Pinot, Laurent Meunier, Alexandre Araujo, Hisashi Kashima, Florian Yger, Cédric Gouy-Pailler, Jamal Atif

    Abstract: This paper investigates the theory of robustness against adversarial attacks. It focuses on the family of randomization techniques that consist in injecting noise in the network at inference time. These techniques have proven effective in many contexts, but lack theoretical arguments. We close this gap by presenting a theoretical analysis of these approaches, hence explaining why they perform well… ▽ More

    Submitted 11 June, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

  9. arXiv:1803.03831  [pdf, other

    cs.DS cs.LG

    Graph-based Clustering under Differential Privacy

    Authors: Rafael Pinot, Anne Morvan, Florian Yger, Cédric Gouy-Pailler, Jamal Atif

    Abstract: In this paper, we present the first differentially private clustering method for arbitrary-shaped node clusters in a graph. This algorithm takes as input only an approximate Minimum Spanning Tree (MST) $\mathcal{T}$ released under weight differential privacy constraints from the graph. Then, the underlying nonconvex clustering partition is successfully recovered from cutting optimal cuts on… ▽ More

    Submitted 10 March, 2018; originally announced March 2018.

  10. arXiv:1802.03936  [pdf, other

    cs.LG

    On the Needs for Rotations in Hypercubic Quantization Hashing

    Authors: Anne Morvan, Antoine Souloumiac, Krzysztof Choromanski, Cédric Gouy-Pailler, Jamal Atif

    Abstract: The aim of this paper is to endow the well-known family of hypercubic quantization hashing methods with theoretical guarantees. In hypercubic quantization, applying a suitable (random or learned) rotation after dimensionality reduction has been experimentally shown to improve the results accuracy in the nearest neighbors search problem. We prove in this paper that the use of these rotations is opt… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

  11. arXiv:1705.07661  [pdf, other

    cs.LG

    Streaming Binary Sketching based on Subspace Tracking and Diagonal Uniformization

    Authors: Anne Morvan, Antoine Souloumiac, Cédric Gouy-Pailler, Jamal Atif

    Abstract: In this paper, we address the problem of learning compact similarity-preserving embeddings for massive high-dimensional streams of data in order to perform efficient similarity search. We present a new online method for computing binary compressed representations -sketches- of high-dimensional real feature vectors. Given an expected code length $c$ and high-dimensional input data points, our algor… ▽ More

    Submitted 8 February, 2018; v1 submitted 22 May, 2017; originally announced May 2017.

  12. Graph sketching-based Space-efficient Data Clustering

    Authors: Anne Morvan, Krzysztof Choromanski, Cédric Gouy-Pailler, Jamal Atif

    Abstract: In this paper, we address the problem of recovering arbitrary-shaped data clusters from datasets while facing \emph{high space constraints}, as this is for instance the case in many real-world applications when analysis algorithms are directly deployed on resources-limited mobile devices collecting the data. We present DBMSTClu a new space-efficient density-based \emph{non-parametric} method worki… ▽ More

    Submitted 27 May, 2018; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: Proceedings of the 2018 SIAM International Conference on Data Mining

  13. arXiv:1610.06209  [pdf, other

    cs.LG

    Structured adaptive and random spinners for fast machine learning computations

    Authors: Mariusz Bojarski, Anna Choromanska, Krzysztof Choromanski, Francois Fagan, Cedric Gouy-Pailler, Anne Morvan, Nourhan Sakr, Tamas Sarlos, Jamal Atif

    Abstract: We consider an efficient computational framework for speeding up several machine learning algorithms with almost no loss of accuracy. The proposed framework relies on projections via structured matrices that we call Structured Spinners, which are formed as products of three structured matrix-blocks that incorporate rotations. The approach is highly generic, i.e. i) structured matrices under consid… ▽ More

    Submitted 26 November, 2016; v1 submitted 19 October, 2016; originally announced October 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1605.09046

  14. arXiv:1609.09525  [pdf, other

    cs.DS cs.CV cs.LG

    Multi-dimensional signal approximation with sparse structured priors using split Bregman iterations

    Authors: Yoann Isaac, Quentin Barthélemy, Cédric Gouy-Pailler, Michèle Sebag, Jamal Atif

    Abstract: This paper addresses the structurally-constrained sparse decomposition of multi-dimensional signals onto overcomplete families of vectors, called dictionaries. The contribution of the paper is threefold. Firstly, a generic spatio-temporal regularization term is designed and used together with the standard $\ell_1$ regularization term to enforce a sparse decomposition preserving the spatio-temporal… ▽ More

    Submitted 29 September, 2016; originally announced September 2016.

  15. arXiv:1605.09046  [pdf, other

    cs.LG stat.ML

    TripleSpin - a generic compact paradigm for fast machine learning computations

    Authors: Krzysztof Choromanski, Francois Fagan, Cedric Gouy-Pailler, Anne Morvan, Tamas Sarlos, Jamal Atif

    Abstract: We present a generic compact computational framework relying on structured random matrices that can be applied to speed up several machine learning algorithms with almost no loss of accuracy. The applications include new fast LSH-based algorithms, efficient kernel computations via random feature maps, convex optimization algorithms, quantization techniques and many more. Certain models of the pres… ▽ More

    Submitted 6 June, 2016; v1 submitted 29 May, 2016; originally announced May 2016.

  16. arXiv:1303.5197  [pdf, other

    cs.DS stat.ML

    Multi-dimensional sparse structured signal approximation using split Bregman iterations

    Authors: Yoann Isaac, Quentin Barthélemy, Jamal Atif, Cédric Gouy-Pailler, Michèle Sebag

    Abstract: The paper focuses on the sparse approximation of signals using overcomplete representations, such that it preserves the (prior) structure of multi-dimensional signals. The underlying optimization problem is tackled using a multi-dimensional split Bregman optimization approach. An extensive empirical evaluation shows how the proposed approach compares to the state of the art depending on the signal… ▽ More

    Submitted 10 March, 2015; v1 submitted 21 March, 2013; originally announced March 2013.

    Comments: 5 pages, ICASSP 2013 preprint

  17. arXiv:1303.0742  [pdf, ps, other

    cs.LG q-bio.NC stat.ML

    Multivariate Temporal Dictionary Learning for EEG

    Authors: Quentin Barthélemy, Cédric Gouy-Pailler, Yoann Isaac, Antoine Souloumiac, Anthony Larue, Jérôme I. Mars

    Abstract: This article addresses the issue of representing electroencephalographic (EEG) signals in an efficient way. While classical approaches use a fixed Gabor dictionary to analyze EEG signals, this article proposes a data-driven method to obtain an adapted dictionary. To reach an efficient dictionary learning, appropriate spatial and temporal modeling is required. Inter-channels links are taken into ac… ▽ More

    Submitted 4 March, 2013; originally announced March 2013.

    Journal ref: Published in Journal of Neuroscience Methods, vol. 215, pp. 19-28, 2013