Skip to main content

Showing 1–17 of 17 results for author: Scaman, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.04679  [pdf, ps, other

    cs.LG math.OC

    Minimax Excess Risk of First-Order Methods for Statistical Learning with Data-Dependent Oracles

    Authors: Kevin Scaman, Mathieu Even, Batiste Le Bars, Laurent Massoulié

    Abstract: In this paper, our aim is to analyse the generalization capabilities of first-order methods for statistical learning in multiple, different yet related, scenarios including supervised learning, transfer learning, robust learning and federated learning. To do so, we provide sharp upper and lower bounds for the minimax excess risk of strongly convex and smooth statistical learning when the gradient… ▽ More

    Submitted 1 July, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: 22 pages, 0 figures

  2. arXiv:2306.02939  [pdf, other

    cs.LG stat.ML

    Improved Stability and Generalization Guarantees of the Decentralized SGD Algorithm

    Authors: Batiste Le Bars, Aurélien Bellet, Marc Tommasi, Kevin Scaman, Giovanni Neglia

    Abstract: This paper presents a new generalization error analysis for Decentralized Stochastic Gradient Descent (D-SGD) based on algorithmic stability. The obtained results overhaul a series of recent works that suggested an increased instability due to decentralization and a detrimental impact of poorly-connected communication graphs on generalization. On the contrary, we show, for convex, strongly convex… ▽ More

    Submitted 13 June, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

  3. arXiv:2304.11017  [pdf, ps, other

    cs.CC

    Black-box Acceleration of Las Vegas Algorithms and Algorithmic Reverse Jensen's Inequalities

    Authors: Kevin Scaman

    Abstract: Let $\mathcal{A}$ be a Las Vegas algorithm, i.e. an algorithm whose running time $T$ is a random variable drawn according to a certain probability distribution $p$. In 1993, Luby, Sinclair and Zuckerman [LSZ93] proved that a simple universal restart strategy can, for any probability distribution $p$, provide an algorithm executing $\mathcal{A}$ and whose expected running time is… ▽ More

    Submitted 10 July, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: 13 pages, 0 figures

  4. arXiv:2301.08117  [pdf, other

    cs.LG

    Convergence beyond the over-parameterized regime using Rayleigh quotients

    Authors: David A. R. Robin, Kevin Scaman, Marc Lelarge

    Abstract: In this paper, we present a new strategy to prove the convergence of deep learning architectures to a zero training (or even testing) loss by gradient flow. Our analysis is centered on the notion of Rayleigh quotients in order to prove Kurdyka-Łojasiewicz inequalities for a broader set of neural network architectures and loss functions. We show that Rayleigh quotients provide a unified view for se… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: Published at the 36th conference on Neural Information Processing Systems (NeurIPS 2022)

  5. arXiv:2211.11656  [pdf, other

    cs.LG

    SIFU: Sequential Informed Federated Unlearning for Efficient and Provable Client Unlearning in Federated Optimization

    Authors: Yann Fraboni, Martin Van Waerebeke, Kevin Scaman, Richard Vidal, Laetitia Kameni, Marco Lorenzi

    Abstract: Machine Unlearning (MU) is an increasingly important topic in machine learning safety, aiming at removing the contribution of a given data point from a training procedure. Federated Unlearning (FU) consists in extending MU to unlearn a given client's contribution from a federated training routine. While several FU methods have been proposed, we currently lack a general approach providing formal un… ▽ More

    Submitted 15 March, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

  6. arXiv:2106.01257  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Kevin Scaman, Hoi-To Wai

    Abstract: This paper provides a non-asymptotic analysis of linear stochastic approximation (LSA) algorithms with fixed stepsize. This family of methods arises in many machine learning tasks and is used to obtain approximate solutions of a linear system $\bar{A}θ= \bar{b}$ for which $\bar{A}$ and $\bar{b}$ can only be accessed through random estimates $\{({\bf A}_n, {\bf b}_n): n \in \mathbb{N}^*\}$. Our ana… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: 21 pages

  7. arXiv:2103.04886  [pdf, other

    cs.LG stat.ML

    Lipschitz Normalization for Self-Attention Layers with Application to Graph Neural Networks

    Authors: George Dasoulas, Kevin Scaman, Aladin Virmaux

    Abstract: Attention based neural networks are state of the art in a large range of applications. However, their performance tends to degrade when the number of layers increases. In this work, we show that enforcing Lipschitz continuity by normalizing the attention scores can significantly improve the performance of deep attention models. First, we show that, for deep graph attention networks (GAT), gradient… ▽ More

    Submitted 13 September, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: 18 pages. Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021. Copyright 2021 by the author(s)

  8. arXiv:2102.09012  [pdf, other

    cs.LG

    Improving Hierarchical Adversarial Robustness of Deep Neural Networks

    Authors: Avery Ma, Aladin Virmaux, Kevin Scaman, Juwei Lu

    Abstract: Do all adversarial examples have the same consequences? An autonomous driving system misclassifying a pedestrian as a car may induce a far more dangerous -- and even potentially lethal -- behavior than, for instance, a car as a bus. In order to better tackle this important problematic, we introduce the concept of hierarchical adversarial robustness. Given a dataset whose classes can be grouped int… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

  9. arXiv:2102.08735  [pdf, other

    cs.LG cs.SI

    Ego-based Entropy Measures for Structural Representations on Graphs

    Authors: George Dasoulas, Giannis Nikolentzos, Kevin Scaman, Aladin Virmaux, Michalis Vazirgiannis

    Abstract: Machine learning on graph-structured data has attracted high research interest due to the emergence of Graph Neural Networks (GNNs). Most of the proposed GNNs are based on the node homophily, i.e neighboring nodes share similar characteristics. However, in many complex networks, nodes that lie to distant parts of the graph share structurally equivalent characteristics and exhibit similar roles (e.… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP 2021. arXiv admin note: substantial text overlap with arXiv:2003.00553

  10. arXiv:2003.00553  [pdf, other

    cs.LG cs.SI stat.ML

    Ego-based Entropy Measures for Structural Representations

    Authors: George Dasoulas, Giannis Nikolentzos, Kevin Scaman, Aladin Virmaux, Michalis Vazirgiannis

    Abstract: In complex networks, nodes that share similar structural characteristics often exhibit similar roles (e.g type of users in a social network or the hierarchical position of employees in a company). In order to leverage this relationship, a growing literature proposed latent representations that identify structurally equivalent nodes. However, most of the existing methods require high time and space… ▽ More

    Submitted 1 March, 2020; originally announced March 2020.

    Comments: 7 pages, 4 figures

  11. arXiv:1912.06058  [pdf, other

    cs.LG stat.ML

    Coloring graph neural networks for node disambiguation

    Authors: George Dasoulas, Ludovic Dos Santos, Kevin Scaman, Aladin Virmaux

    Abstract: In this paper, we show that a simple coloring scheme can improve, both theoretically and empirically, the expressive power of Message Passing Neural Networks(MPNNs). More specifically, we introduce a graph neural network called Colored Local Iterative Procedure (CLIP) that uses colors to disambiguate identical node attributes, and show that this representation is a universal approximator of contin… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 17 pages, 2 figures

  12. arXiv:1910.05104  [pdf, other

    stat.ML cs.DC cs.LG

    Theoretical Limits of Pipeline Parallel Optimization and Application to Distributed Deep Learning

    Authors: Igor Colin, Ludovic Dos Santos, Kevin Scaman

    Abstract: We investigate the theoretical limits of pipeline parallel learning of deep learning architectures, a distributed setup in which the computation is distributed per layer instead of per example. For smooth convex and non-convex objective functions, we provide matching lower and upper complexity bounds and show that a naive pipeline parallelization of Nesterov's accelerated gradient descent is optim… ▽ More

    Submitted 11 October, 2019; originally announced October 2019.

  13. arXiv:1805.10965  [pdf, other

    stat.ML cs.LG

    Lipschitz regularity of deep neural networks: analysis and efficient estimation

    Authors: Kevin Scaman, Aladin Virmaux

    Abstract: Deep neural networks are notorious for being sensitive to small well-chosen perturbations, and estimating the regularity of such architectures is of utmost importance for safe and robust practical applications. In this paper, we investigate one of the key characteristics to assess the regularity of such methods: the Lipschitz constant of deep learning architectures. First, we show that, even for t… ▽ More

    Submitted 25 October, 2019; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: 12 pages, 3 figures

  14. arXiv:1805.10014  [pdf, other

    cs.LG stat.ML

    KONG: Kernels for ordered-neighborhood graphs

    Authors: Moez Draief, Konstantin Kutzkov, Kevin Scaman, Milan Vojnovic

    Abstract: We present novel graph kernels for graphs with node and edge labels that have ordered neighborhoods, i.e. when neighbor nodes follow an order. Graphs with ordered neighborhoods are a natural data representation for evolving graphs where edges are created over time, which induces an order. Combining convolutional subgraph kernels and string kernels, we design new scalable algorithms for generation… ▽ More

    Submitted 29 May, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

  15. arXiv:1709.05231  [pdf, ps, other

    stat.ML cs.AI cs.LG cs.SI math.OC

    A Spectral Method for Activity Sha** in Continuous-Time Information Cascades

    Authors: Kevin Scaman, Argyris Kalogeratos, Luca Corinzia, Nicolas Vayatis

    Abstract: Information Cascades Model captures dynamical properties of user activity in a social network. In this work, we develop a novel framework for activity sha** under the Continuous-Time Information Cascades Model which allows the administrator for local control actions by allocating targeted resources that can alter the spread of the process. Our framework employs the optimization of the spectral r… ▽ More

    Submitted 15 September, 2017; originally announced September 2017.

    MSC Class: 93E20; 91D30 ACM Class: I.2.6

  16. arXiv:1407.4760  [pdf, other

    math.OC cs.SI physics.soc-ph

    What Makes a Good Plan? An Efficient Planning Approach to Control Diffusion Processes in Networks

    Authors: Kevin Scaman, Argyris Kalogeratos, Nicolas Vayatis

    Abstract: In this paper, we analyze the quality of a large class of simple dynamic resource allocation (DRA) strategies which we name priority planning. Their aim is to control an undesired diffusion process by distributing resources to the contagious nodes of the network according to a predefined priority-order. In our analysis, we reduce the DRA problem to the linear arrangement of the nodes of the networ… ▽ More

    Submitted 17 July, 2014; originally announced July 2014.

    Comments: 18 pages, 3 figures

  17. arXiv:1407.4744  [pdf, other

    math.PR cs.SI physics.soc-ph

    Tight Bounds for Influence in Diffusion Networks and Application to Bond Percolation and Epidemiology

    Authors: Remi Lemonnier, Kevin Scaman, Nicolas Vayatis

    Abstract: In this paper, we derive theoretical bounds for the long-term influence of a node in an Independent Cascade Model (ICM). We relate these bounds to the spectral radius of a particular matrix and show that the behavior is sub-critical when this spectral radius is lower than $1$. More specifically, we point out that, in general networks, the sub-critical regime behaves in $O(\sqrt{n})$ where $n$ is t… ▽ More

    Submitted 17 July, 2014; originally announced July 2014.

    Comments: 20 pages, 4 figures