Skip to main content

Showing 1–17 of 17 results for author: Dieuleveut, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.15641  [pdf, other

    stat.ME

    Predictive Uncertainty Quantification with Missing Covariates

    Authors: Margaux Zaffran, Julie Josse, Yaniv Romano, Aymeric Dieuleveut

    Abstract: Predictive uncertainty quantification is crucial in decision-making problems. We investigate how to adequately quantify predictive uncertainty with missing covariates. A bottleneck is that missing values induce heteroskedasticity on the response's predictive distribution given the observed covariates. Thus, we focus on building predictive sets for the response that are valid conditionally to the m… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2405.03449  [pdf, other

    cs.LG stat.ML

    Byzantine-Robust Gossip: Insights from a Dual Approach

    Authors: Renaud Gaucher, Hadrien Hendrikx, Aymeric Dieuleveut

    Abstract: Distributed approaches have many computational benefits, but they are vulnerable to attacks from a subset of devices transmitting incorrect information. This paper investigates Byzantine-resilient algorithms in a decentralized setting, where devices communicate directly with one another. We leverage the so-called dual approach to design a general robust decentralized optimization method. We provid… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 9 pages, 1 figure

  3. arXiv:2402.03839  [pdf, other

    math.ST stat.ML

    Random features models: a way to study the success of naive imputation

    Authors: Alexis Ayme, Claire Boyer, Aymeric Dieuleveut, Erwan Scornet

    Abstract: Constant (naive) imputation is still widely used in practice as this is a first easy-to-use technique to deal with missing data. Yet, this simple method could be expected to induce a large bias for prediction purposes, as the imputed input may strongly differ from the true underlying data. However, recent works suggest that this bias is low in the context of high-dimensional linear predictors when… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  4. arXiv:2402.01493  [pdf, other

    stat.ML cs.LG

    Sliced-Wasserstein Estimation with Spherical Harmonics as Control Variates

    Authors: Rémi Leluc, Aymeric Dieuleveut, François Portier, Johan Segers, Aigerim Zhuman

    Abstract: The Sliced-Wasserstein (SW) distance between probability measures is defined as the average of the Wasserstein distances resulting for the associated one-dimensional projections. As a consequence, the SW distance can be written as an integral with respect to the uniform measure on the sphere and the Monte Carlo framework can be employed for calculating the SW distance. Spherical harmonics are poly… ▽ More

    Submitted 15 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

    MSC Class: 65C05 (Primary) 65D30; 68Txx; 68Wxx (Secondary)

  5. arXiv:2308.01358  [pdf, other

    cs.LG math.OC stat.ML

    Compressed and distributed least-squares regression: convergence rates with applications to Federated Learning

    Authors: Constantin Philippenko, Aymeric Dieuleveut

    Abstract: In this paper, we investigate the impact of compression on stochastic gradient algorithms for machine learning, a technique widely used in distributed and federated learning. We underline differences in terms of convergence rates between several unbiased compression operators, that all satisfy the same condition on their variance, thus going beyond the classical worst-case analysis. To do so, we f… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  6. arXiv:2306.02732  [pdf, other

    stat.ML cs.LG

    Conformal Prediction with Missing Values

    Authors: Margaux Zaffran, Aymeric Dieuleveut, Julie Josse, Yaniv Romano

    Abstract: Conformal prediction is a theoretically grounded framework for constructing predictive intervals. We study conformal prediction with missing values in the covariates -- a setting that brings new challenges to uncertainty quantification. We first show that the marginal coverage guarantee of conformal prediction holds on imputed data for any missingness distribution and almost all imputation functio… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Code for our experiments can be found at https://github.com/mzaffran/ConformalPredictionMissingValues . To be published in the proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA

  7. arXiv:2302.11147  [pdf, other

    math.OC stat.ML

    Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

    Authors: Aymeric Dieuleveut, Gersende Fort, Eric Moulines, Hoi-To Wai

    Abstract: Stochastic Approximation (SA) is a classical algorithm that has had since the early days a huge impact on signal processing, and nowadays on machine learning, due to the necessity to deal with a large amount of data observed with uncertainties. An exemplar special case of SA pertains to the popular stochastic (sub)gradient algorithm which is the working horse behind many important applications. A… ▽ More

    Submitted 16 July, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: Accepted for publication at IEEE Transactions on Signal Processing; 31 pages, 7 pages of supplementary materials

  8. arXiv:2202.07282  [pdf, other

    stat.ML cs.LG

    Adaptive Conformal Predictions for Time Series

    Authors: Margaux Zaffran, Aymeric Dieuleveut, Olivier Féron, Yannig Goude, Julie Josse

    Abstract: Uncertainty quantification of predictive models is crucial in decision-making problems. Conformal prediction is a general and theoretically sound answer. However, it requires exchangeable data, excluding time series. While recent works tackled this issue, we argue that Adaptive Conformal Inference (ACI, Gibbs and Cand{è}s, 2021), developed for distribution-shift time series, is a good procedure fo… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  9. arXiv:2202.01463  [pdf, other

    stat.ML cs.LG

    Minimax rate of consistency for linear models with missing values

    Authors: Alexis Ayme, Claire Boyer, Aymeric Dieuleveut, Erwan Scornet

    Abstract: Missing values arise in most real-world data sets due to the aggregation of multiple sources and intrinsically missing information (sensor failure, unanswered questions in surveys...). In fact, the very nature of missing values usually prevents us from running standard learning algorithms. In this paper, we focus on the extensively-studied linear models, but in presence of missing values, which tu… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  10. arXiv:2106.00797  [pdf, other

    cs.LG cs.AI stat.CO stat.ME stat.ML

    QLSD: Quantised Langevin stochastic dynamics for Bayesian federated learning

    Authors: Maxime Vono, Vincent Plassier, Alain Durmus, Aymeric Dieuleveut, Eric Moulines

    Abstract: The objective of Federated Learning (FL) is to perform statistical inference for data which are decentralised and stored locally on networked clients. FL raises many constraints which include privacy and data ownership, communication overhead, statistical heterogeneity, and partial client participation. In this paper, we address these problems in the framework of the Bayesian paradigm. To this end… ▽ More

    Submitted 31 May, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

  11. arXiv:2007.00534  [pdf, ps, other

    cs.LG math.OC stat.ML

    On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent

    Authors: Scott Pesme, Aymeric Dieuleveut, Nicolas Flammarion

    Abstract: Constant step-size Stochastic Gradient Descent exhibits two phases: a transient phase during which iterates make fast progress towards the optimum, followed by a stationary phase during which iterates oscillate around the optimal point. In this paper, we show that efficiently detecting this transition and appropriately decreasing the step size can lead to fast convergence rates. We analyse the cla… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  12. arXiv:2006.14591  [pdf, other

    cs.LG stat.ML

    Bidirectional compression in heterogeneous settings for distributed or federated learning with partial participation: tight convergence guarantees

    Authors: Constantin Philippenko, Aymeric Dieuleveut

    Abstract: We introduce a framework - Artemis - to tackle the problem of learning in a distributed or federated setting with communication constraints and device partial participation. Several workers (randomly sampled) perform the optimization process using a central server to aggregate their computations. To alleviate the communication cost, Artemis allows to compress the information sent in both direction… ▽ More

    Submitted 19 June, 2022; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 54 pages, 4 theorems, 1 algorithm, code source on GitHub

  13. arXiv:1904.11325  [pdf, other

    cs.LG math.OC stat.ML

    Communication trade-offs for synchronized distributed SGD with large step size

    Authors: Kumar Kshitij Patel, Aymeric Dieuleveut

    Abstract: Synchronous mini-batch SGD is state-of-the-art for large-scale distributed machine learning. However, in practice, its convergence is bottlenecked by slow communication rounds between worker nodes. A natural solution to reduce communication is to use the \emph{`local-SGD'} model in which the workers train their model independently and synchronize every once in a while. This algorithm improves the… ▽ More

    Submitted 25 April, 2019; originally announced April 2019.

  14. arXiv:1901.10738  [pdf, other

    cs.LG cs.NE stat.ML

    Unsupervised Scalable Representation Learning for Multivariate Time Series

    Authors: Jean-Yves Franceschi, Aymeric Dieuleveut, Martin Jaggi

    Abstract: Time series constitute a challenging data type for machine learning algorithms, due to their highly variable lengths and sparse labeling in practice. In this paper, we tackle this challenge by proposing an unsupervised method to learn universal embeddings of time series. Unlike previous works, it is scalable with respect to their length and we demonstrate the quality, transferability and practicab… ▽ More

    Submitted 3 January, 2020; v1 submitted 30 January, 2019; originally announced January 2019.

    Journal ref: Thirty-third Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, Dec 2019, Vancouver, Canada

  15. arXiv:1808.09663  [pdf, other

    cs.CL cs.LG stat.ML

    Context Mover's Distance & Barycenters: Optimal Transport of Contexts for Building Representations

    Authors: Sidak Pal Singh, Andreas Hug, Aymeric Dieuleveut, Martin Jaggi

    Abstract: We present a framework for building unsupervised representations of entities and their compositions, where each entity is viewed as a probability distribution rather than a vector embedding. In particular, this distribution is supported over the contexts which co-occur with the entity and are embedded in a suitable low-dimensional space. This enables us to consider representation learning from the… ▽ More

    Submitted 29 February, 2020; v1 submitted 29 August, 2018; originally announced August 2018.

    Comments: AISTATS 2020. Also, accepted previously at ICLR 2019 DeepGenStruct Workshop

  16. arXiv:1707.06386  [pdf, ps, other

    stat.ML math.OC

    Bridging the Gap between Constant Step Size Stochastic Gradient Descent and Markov Chains

    Authors: Aymeric Dieuleveut, Alain Durmus, Francis Bach

    Abstract: We consider the minimization of an objective function given access to unbiased estimates of its gradient through stochastic gradient descent (SGD) with constant step-size. While the detailed analysis was only performed for quadratic functions, we provide an explicit asymptotic expansion of the moments of the averaged SGD iterates that outlines the dependence on initial conditions, the effect of no… ▽ More

    Submitted 11 April, 2018; v1 submitted 20 July, 2017; originally announced July 2017.

  17. arXiv:1602.05419  [pdf, ps, other

    math.OC cs.LG stat.ML

    Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression

    Authors: Aymeric Dieuleveut, Nicolas Flammarion, Francis Bach

    Abstract: We consider the optimization of a quadratic objective function whose gradients are only accessible through a stochastic oracle that returns the gradient at any given point plus a zero-mean finite variance random error. We present the first algorithm that achieves jointly the optimal prediction error rates for least-squares regression, both in terms of forgetting of initial conditions in O(1/n 2),… ▽ More

    Submitted 24 February, 2016; v1 submitted 17 February, 2016; originally announced February 2016.