
Showing 1–23 of 23 results for author: Rebeschini, P

  1. arXiv:2406.08399  [pdf, other]

    stat.ML cs.LG

    Differentiable Cost-Parameterized Monge Map Estimators

    Authors: Samuel Howard, George Deligiannidis, Patrick Rebeschini, James Thornton

    Abstract: Within the field of optimal transport (OT), the choice of ground cost is crucial to ensuring that the optimality of a transport map corresponds to usefulness in real-world applications. It is therefore desirable to use known information to tailor cost functions and hence learn OT maps which are adapted to the problem at hand. By considering a class of neural ground costs whose Monge maps have a kn…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2402.05187  [pdf, other]

    stat.ML cs.LG math.OC

    Learning mirror maps in policy mirror descent

    Authors: Carlo Alfano, Sebastian Towers, Silvia Sapora, Chris Lu, Patrick Rebeschini

    Abstract: Policy Mirror Descent (PMD) is a popular framework in reinforcement learning, serving as a unifying perspective that encompasses numerous algorithms. These algorithms are derived through the selection of a mirror map and enjoy finite-time convergence guarantees. Despite its popularity, the exploration of PMD's full potential is limited, with the majority of research focusing on a particular mirror…

    Submitted 7 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  3. arXiv:2311.00274  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Generalization Bounds for Label Noise Stochastic Gradient Descent

    Authors: Jung Eun Huh, Patrick Rebeschini

    Abstract: We develop generalization error bounds for stochastic gradient descent (SGD) with label noise in non-convex settings under uniform dissipativity and smoothness conditions. Under a suitable choice of semimetric, we establish a contraction in Wasserstein distance of the label noise stochastic gradient flow that depends polynomially on the parameter dimension $d$. Using the framework of algorithmic s…

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: 27 pages

  4. arXiv:2301.13139  [pdf, other]

    stat.ML cs.LG math.OC math.ST

    A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence

    Authors: Carlo Alfano, Rui Yuan, Patrick Rebeschini

    Abstract: Modern policy optimization methods in reinforcement learning, such as TRPO and PPO, owe their success to the use of parameterized policies. However, while theoretical guarantees have been established for this class of algorithms, especially in the tabular setting, the use of general parameterization schemes remains mostly unjustified. In this work, we introduce a novel framework for policy optimiz…

    Submitted 13 February, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Post-conference updates

  5. arXiv:2202.11461  [pdf, other]

    math.ST cs.LG stat.ML

    Exponential Tail Local Rademacher Complexity Risk Bounds Without the Bernstein Condition

    Authors: Varun Kanade, Patrick Rebeschini, Tomas Vaskevicius

    Abstract: The local Rademacher complexity framework is one of the most successful general-purpose toolboxes for establishing sharp excess risk bounds for statistical estimators based on the framework of empirical risk minimization. Applying this toolbox typically requires using the Bernstein condition, which often restricts applicability to convex and proper settings. Recent years have witnessed several exa…

    Submitted 23 February, 2022; originally announced February 2022.

  6. arXiv:2111.12876  [pdf, other]

    stat.ML cs.LG math.OC math.PR

    Time-independent Generalization Bounds for SGLD in Non-convex Settings

    Authors: Tyler Farghly, Patrick Rebeschini

    Abstract: We establish generalization error bounds for stochastic gradient Langevin dynamics (SGLD) with constant learning rate under the assumptions of dissipativity and smoothness, a setting that has received increased attention in the sampling/optimization literature. Unlike existing bounds for SGLD in non-convex settings, ours are time-independent and decay to zero as the sample size increases. Using th…

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 22 pages. To appear in Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  7. arXiv:2110.11258  [pdf, other]

    stat.ML cs.LG math.ST

    On Optimal Interpolation In Linear Regression

    Authors: Eduard Oravkin, Patrick Rebeschini

    Abstract: Understanding when and why interpolating methods generalize well has recently been a topic of interest in statistical learning theory. However, systematically connecting interpolating methods to achievable notions of optimality has only received partial attention. In this paper, we investigate the question of what is the optimal way to interpolate in linear regression using functions that are line…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 25 pages, 7 figures, to appear in NeurIPS 2021

  8. arXiv:2109.11692  [pdf, ps, other]

    cs.LG cs.MA eess.SY math.ST stat.ML

    Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning

    Authors: Carlo Alfano, Patrick Rebeschini

    Abstract: Cooperative multi-agent reinforcement learning is a decentralized paradigm in sequential decision making where agents distributed over a network iteratively collaborate with neighbors to maximize global (network-wide) notions of rewards. Exact computations typically involve a complexity that scales exponentially with the number of agents. To address this curse of dimensionality, we design a scalab…

    Submitted 23 September, 2021; originally announced September 2021.

  9. arXiv:2108.11872  [pdf, other]

    math.ST cs.LG math.OC stat.ML

    Comparing Classes of Estimators: When does Gradient Descent Beat Ridge Regression in Linear Models?

    Authors: Dominic Richards, Edgar Dobriban, Patrick Rebeschini

    Abstract: Methods for learning from data depend on various types of tuning parameters, such as penalization strength or step size. Since performance can depend strongly on these parameters, it is important to compare classes of estimators -- by considering prescribed finite sets of tuning parameters -- not just particularly tuned methods. In this work, we investigate classes of methods via the relative performanc…

    Submitted 12 June, 2022; v1 submitted 26 August, 2021; originally announced August 2021.

  10. arXiv:2105.13831  [pdf, other]

    stat.ML cs.LG

    Implicit Regularization in Matrix Sensing via Mirror Descent

    Authors: Fan Wu, Patrick Rebeschini

    Abstract: We study discrete-time mirror descent applied to the unregularized empirical risk in matrix sensing. In both the general case of rectangular matrices and the particular case of positive semidefinite matrices, a simple potential-based analysis in terms of the Bregman divergence allows us to establish convergence of mirror descent -- with different choices of the mirror maps -- to a matrix that, amo…

    Submitted 27 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

  11. arXiv:2105.03678  [pdf, other]

    eess.SP cs.LG stat.ML

    Nearly Minimax-Optimal Rates for Noisy Sparse Phase Retrieval via Early-Stopped Mirror Descent

    Authors: Fan Wu, Patrick Rebeschini

    Abstract: This paper studies early-stopped mirror descent applied to noisy sparse phase retrieval, which is the problem of recovering a $k$-sparse signal $\mathbf{x}^\star\in\mathbb{R}^n$ from a set of quadratic Gaussian measurements corrupted by sub-exponential noise. We consider the (non-convex) unregularized empirical risk minimization problem and show that early-stopped mirror descent, when equipped wit…

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:2010.10168

  12. arXiv:2010.10168  [pdf, other]

    stat.ML cs.LG

    A Continuous-Time Mirror Descent Approach to Sparse Phase Retrieval

    Authors: Fan Wu, Patrick Rebeschini

    Abstract: We analyze continuous-time mirror descent applied to sparse phase retrieval, which is the problem of recovering sparse signals from a set of magnitude-only measurements. We apply mirror descent to the unconstrained empirical risk minimization problem (batch setting), using the square loss and square measurements. We provide a convergence analysis of the algorithm in this non-convex setting and pro…

    Submitted 20 October, 2020; originally announced October 2020.

  13. arXiv:2007.00360  [pdf, other]

    stat.ML cs.LG math.ST

    Decentralised Learning with Random Features and Distributed Gradient Descent

    Authors: Dominic Richards, Patrick Rebeschini, Lorenzo Rosasco

    Abstract: We investigate the generalisation performance of Distributed Gradient Descent with Implicit Regularisation and Random Features in the homogeneous setting where a network of agents are given data sampled independently from the same unknown distribution. Along with reducing the memory footprint, Random Features are particularly convenient in this setting as they provide a common parameterisation acro…

    Submitted 1 July, 2020; originally announced July 2020.

  14. arXiv:2006.01065  [pdf, other]

    stat.ML cs.LG eess.SP

    Hadamard Wirtinger Flow for Sparse Phase Retrieval

    Authors: Fan Wu, Patrick Rebeschini

    Abstract: We consider the problem of reconstructing an $n$-dimensional $k$-sparse signal from a set of noiseless magnitude-only measurements. Formulating the problem as an unregularized empirical risk minimization task, we study the sample complexity performance of gradient descent with Hadamard parametrization, which we call Hadamard Wirtinger flow (HWF). Provided knowledge of the signal sparsity $k$, we p…

    Submitted 24 February, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

  15. arXiv:2002.00189  [pdf, other]

    stat.ML cs.LG

    The Statistical Complexity of Early-Stopped Mirror Descent

    Authors: Tomas Vaškevičius, Varun Kanade, Patrick Rebeschini

    Abstract: Recently there has been a surge of interest in understanding implicit regularization properties of iterative gradient-based optimization algorithms. In this paper, we study the statistical guarantees on the excess risk achieved by early-stopped unconstrained mirror descent algorithms applied to the unregularized empirical risk with the squared loss for linear models and kernel methods. By completi…

    Submitted 27 August, 2020; v1 submitted 1 February, 2020; originally announced February 2020.

  16. arXiv:1912.01417  [pdf, other]

    math.ST stat.ML

    Distributed Machine Learning with Sparse Heterogeneous Data

    Authors: Dominic Richards, Sahand N. Negahban, Patrick Rebeschini

    Abstract: Motivated by distributed machine learning settings such as Federated Learning, we consider the problem of fitting a statistical model across a distributed collection of heterogeneous data sets whose similarity structure is encoded by a graph topology. Precisely, we analyse the case where each node is associated with fitting a sparse linear model, and edges join two nodes if the difference of their…

    Submitted 27 November, 2021; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: NeurIPS 2021 camera ready

  17. arXiv:1909.05122  [pdf, other]

    stat.ML cs.LG eess.SP

    Implicit Regularization for Optimal Sparse Recovery

    Authors: Tomas Vaškevičius, Varun Kanade, Patrick Rebeschini

    Abstract: We investigate implicit regularization schemes for gradient descent methods applied to unpenalized least squares regression to solve the problem of reconstructing a sparse signal from an underdetermined system of linear measurements under the restricted isometry assumption. For a given parametrization yielding a non-convex optimization problem, we show that prescribed choices of initialization, st…

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: To appear in NeurIPS 2019

  18. arXiv:1905.03135  [pdf, ps, other]

    stat.ML cs.DC cs.LG math.OC

    Optimal Statistical Rates for Decentralised Non-Parametric Regression with Linear Speed-Up

    Authors: Dominic Richards, Patrick Rebeschini

    Abstract: We analyse the learning performance of Distributed Gradient Descent in the context of multi-agent decentralised non-parametric regression with the square loss function when i.i.d. samples are assigned to agents. We show that if agents hold sufficiently many samples with respect to the network size, then Distributed Gradient Descent achieves optimal statistical rates with a number of iterations tha…

    Submitted 13 November, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

  19. arXiv:1810.04468  [pdf, other]

    cs.LG stat.ML

    Decentralized Cooperative Stochastic Bandits

    Authors: David Martínez-Rubio, Varun Kanade, Patrick Rebeschini

    Abstract: We study a decentralized cooperative stochastic multi-armed bandit problem with $K$ arms on a network of $N$ agents. In our model, the reward distribution of each arm is the same for each agent and rewards are drawn independently across agents and time steps. In each round, each agent chooses an arm to play and subsequently sends a message to her neighbors. The goal is to minimize the overall regr…

    Submitted 24 October, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

  20. arXiv:1809.06958  [pdf, other]

    cs.LG math.OC stat.ML

    Graph-Dependent Implicit Regularisation for Distributed Stochastic Subgradient Descent

    Authors: Dominic Richards, Patrick Rebeschini

    Abstract: We propose graph-dependent implicit regularisation strategies for distributed stochastic subgradient descent (Distributed SGD) for convex problems in multi-agent learning. Under the standard assumptions of convexity, Lipschitz continuity, and smoothness, we establish statistical learning rates that retain, up to logarithmic terms, centralised statistical guarantees through implicit regularisation…

    Submitted 18 September, 2018; originally announced September 2018.

  21. arXiv:1611.07138  [pdf, other]

    math.OC cs.DS stat.ML

    A New Approach to Laplacian Solvers and Flow Problems

    Authors: Patrick Rebeschini, Sekhar Tatikonda

    Abstract: This paper investigates the behavior of the Min-Sum message passing scheme to solve systems of linear equations in the Laplacian matrices of graphs and to compute electric flows. Voltage and flow problems involve the minimization of quadratic functions and are fundamental primitives that arise in several domains. Algorithms that have been proposed are typically centralized and involve multiple gra…

    Submitted 7 March, 2019; v1 submitted 21 November, 2016; originally announced November 2016.

  22. arXiv:1602.04227  [pdf, ps, other]

    stat.ML math.OC

    Scale-free network optimization: foundations and algorithms

    Authors: Patrick Rebeschini, Sekhar Tatikonda

    Abstract: We investigate the fundamental principles that drive the development of scalable algorithms for network optimization. Despite the significant amount of work on parallel and decentralized algorithms in the optimization community, the methods that have been proposed typically rely on strict separability assumptions for objective function and constraints. Besides sparsity, these methods typically do n…

    Submitted 12 February, 2016; originally announced February 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1509.06246

  23. arXiv:1506.02194  [pdf, ps, other]

    stat.ML math.ST

    Fast Mixing for Discrete Point Processes

    Authors: Patrick Rebeschini, Amin Karbasi

    Abstract: We investigate the systematic mechanism for designing fast mixing Markov chain Monte Carlo algorithms to sample from discrete point processes under the Dobrushin uniqueness condition for Gibbs measures. Discrete point processes are defined as probability distributions $\mu(S)\propto \exp(\beta f(S))$ over all subsets $S\in 2^V$ of a finite set $V$ through a bounded set function…

    Submitted 6 June, 2015; originally announced June 2015.