
Showing 1–19 of 19 results for author: Gaïffas, S

Searching in archive cs.
  1. arXiv:2309.17316  [pdf, other]

    stat.ML cs.LG

    Robust Stochastic Optimization via Gradient Quantile Clipping

    Authors: Ibrahim Merad, Stéphane Gaïffas

    Abstract: We introduce a clipping strategy for Stochastic Gradient Descent (SGD) which uses quantiles of the gradient norm as clipping thresholds. We prove that this new strategy provides a robust and efficient optimization algorithm for smooth objectives (convex or non-convex), that tolerates heavy-tailed samples (including infinite variance) and a fraction of outliers in the data stream akin to Huber cont…

    Submitted 29 September, 2023; originally announced September 2023.

  2. arXiv:2307.06048  [pdf, other]

    math.OC cs.LG stat.ML

    Online Inventory Problems: Beyond the i.i.d. Setting with Online Convex Optimization

    Authors: Massil Hihat, Stéphane Gaïffas, Guillaume Garrigos, Simon Bussy

    Abstract: We study multi-product inventory control problems where a manager makes sequential replenishment decisions based on partial historical information in order to minimize its cumulative losses. Our motivation is to consider general demands, losses and dynamics to go beyond standard models which usually rely on newsvendor-type losses, fixed dynamics, and unrealistic i.i.d. demand assumptions. We propo…

    Submitted 12 July, 2023; originally announced July 2023.

  3. arXiv:2306.11497  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Convergence and concentration properties of constant step-size SGD through Markov chains

    Authors: Ibrahim Merad, Stéphane Gaïffas

    Abstract: We consider the optimization of a smooth and strongly convex objective using constant step-size stochastic gradient descent (SGD) and study its properties through the prism of Markov chains. We show that, for unbiased gradient estimates with mildly controlled variance, the iteration converges to an invariant distribution in total variation distance. We also establish this convergence in Wasserstei…

    Submitted 4 July, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  4. arXiv:2208.05447  [pdf, other]

    stat.ML cs.LG

    Robust Methods for High-Dimensional Linear Learning

    Authors: Ibrahim Merad, Stéphane Gaïffas

    Abstract: We propose statistically robust and computationally efficient linear learning methods in the high-dimensional batch setting, where the number of features $d$ may exceed the sample size $n$. We employ, in a generic learning setting, two algorithms depending on whether the considered loss function is gradient-Lipschitz or not. Then, we instantiate our framework on several applications including vani…

    Submitted 29 May, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: accepted version

  5. arXiv:2201.13372  [pdf, other]

    stat.ML cs.LG math.ST

    Robust supervised learning with coordinate gradient descent

    Authors: Stéphane Gaïffas, Ibrahim Merad

    Abstract: This paper considers the problem of supervised learning with linear methods when both features and labels can be corrupted, either in the form of heavy tailed data and/or corrupted rows. We introduce a combination of coordinate gradient descent as a learning algorithm together with robust estimators of the partial derivatives. This leads to robust statistical learning methods that have a numerical…

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: 57 pages, 6 figures

  6. arXiv:2109.08010  [pdf, other]

    cs.LG stat.ML

    WildWood: a new Random Forest algorithm

    Authors: Stéphane Gaïffas, Ibrahim Merad, Yiyang Yu

    Abstract: We introduce WildWood (WW), a new ensemble algorithm for supervised learning of Random Forest (RF) type. While standard RF algorithms use bootstrap out-of-bag samples to compute out-of-bag scores, WW uses these samples to produce improved predictions given by an aggregation of the predictions of all possible subtrees of each fully grown tree in the forest. This is achieved by aggregation with expo…

    Submitted 13 June, 2023; v1 submitted 16 September, 2021; originally announced September 2021.

  7. arXiv:2106.15268  [pdf, ps, other]

    cs.CV cs.LG

    Predicting the Solar Potential of Rooftops using Image Segmentation and Structured Data

    Authors: Daniel de Barros Soares, François Andrieux, Bastien Hell, Julien Lenhardt, Jordi Badosa, Sylvain Gavoille, Stéphane Gaiffas, Emmanuel Bacry

    Abstract: Estimating the amount of electricity that can be produced by rooftop photovoltaic systems is a time-consuming process that requires on-site measurements, a difficult task to achieve on a large scale. In this paper, we present an approach to estimate the solar potential of rooftops based on their location and architectural characteristics, as well as the amount of solar radiation they receive annua…

    Submitted 28 May, 2021; originally announced June 2021.

  8. arXiv:2012.01064  [pdf, other]

    cs.LG cs.AI stat.ML

    About contrastive unsupervised representation learning for classification and its convergence

    Authors: Ibrahim Merad, Yiyang Yu, Emmanuel Bacry, Stéphane Gaïffas

    Abstract: Contrastive representation learning has been recently proved to be very efficient for self-supervised training. These methods have been successfully used to train encoders which perform comparably to supervised training on downstream classification tasks. A few works have started to build a theoretical framework around contrastive learning in which guarantees for its performance can be proven. We…

    Submitted 2 December, 2020; originally announced December 2020.

  9. arXiv:1912.10784  [pdf, ps, other]

    math.ST cs.LG stat.ML

    An improper estimator with optimal excess risk in misspecified density estimation and logistic regression

    Authors: Jaouad Mourtada, Stéphane Gaïffas

    Abstract: We introduce a procedure for conditional density estimation under logarithmic loss, which we call SMP (Sample Minmax Predictor). This estimator minimizes a new general excess risk bound for statistical learning. On standard examples, this bound scales as $d/n$ with $d$ the model dimension and $n$ the sample size, and critically remains valid under model misspecification. Being an improper (out-of-…

    Submitted 8 December, 2021; v1 submitted 23 December, 2019; originally announced December 2019.

    Comments: 43 pages, minor revision

  10. arXiv:1911.05346  [pdf, other]

    cs.LG stat.ML

    ZiMM: a deep learning model for long term and blurry relapses with non-clinical claims data

    Authors: Anastasiia Kabeshova, Yiyang Yu, Bertrand Lukacs, Emmanuel Bacry, Stéphane Gaïffas

    Abstract: This paper considers the problems of modeling and predicting a long-term and "blurry" relapse that occurs after a medical act, such as a surgery. The relapse is observed only indirectly, in a "blurry" fashion, through longitudinal prescriptions of drugs over a long period of time after the medical act. We introduce a new model, called ZiMM (Zero-inflated Mixture of Multinomial distributions) i…

    Submitted 25 July, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

  11. arXiv:1910.07045  [pdf, other]

    cs.DC cs.CY

    SCALPEL3: a scalable open-source library for healthcare claims databases

    Authors: Emmanuel Bacry, Stéphane Gaïffas, Fanny Leroy, Maryan Morel, Dinh Phong Nguyen, Youcef Sebiat, Dian Sun

    Abstract: This article introduces SCALPEL3, a scalable open-source framework for studies involving Large Observational Databases (LODs). Its design eases medical observational studies thanks to abstractions allowing concept extraction, high-level cohort manipulation, and production of data formats compatible with machine learning libraries. SCALPEL3 has successfully been used on the SNDS database (see Tuppi…

    Submitted 26 August, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

  12. arXiv:1906.10529  [pdf, other]

    stat.ML cs.LG math.ST

    AMF: Aggregated Mondrian Forests for Online Learning

    Authors: Jaouad Mourtada, Stéphane Gaïffas, Erwan Scornet

    Abstract: Random Forests (RF) is one of the algorithms of choice in many supervised learning applications, be it classification or regression. The appeal of such tree-ensemble methods comes from a combination of several characteristics: a remarkable accuracy in a variety of tasks, a small number of parameters to tune, robustness with respect to features scaling, a reasonable computational cost for training…

    Submitted 15 May, 2020; v1 submitted 25 June, 2019; originally announced June 2019.

  13. arXiv:1809.01382  [pdf, other]

    stat.ML cs.LG

    On the optimality of the Hedge algorithm in the stochastic regime

    Authors: Jaouad Mourtada, Stéphane Gaïffas

    Abstract: In this paper, we study the behavior of the Hedge algorithm in the online stochastic setting. We prove that anytime Hedge with decreasing learning rate, which is one of the simplest algorithms for the problem of prediction with expert advice, is surprisingly both worst-case optimal and adaptive to the easier stochastic and adversarial with a gap problems. This shows that, in spite of its small, non…

    Submitted 8 July, 2019; v1 submitted 5 September, 2018; originally announced September 2018.

    Journal ref: Journal of Machine Learning Research, 20(83), 2019

  14. arXiv:1807.09821  [pdf, other]

    stat.ML cs.LG

    Comparison of methods for early-readmission prediction in a high-dimensional heterogeneous covariates and time-to-event outcome framework

    Authors: Simon Bussy, Raphaël Veil, Vincent Looten, Anita Burgun, Stéphane Gaïffas, Agathe Guilloux, Brigitte Ranque, Anne-Sophie Jannot

    Abstract: Background: Choosing the most performing method in terms of outcome prediction or variables selection is a recurring problem in prognosis studies, leading to many publications on methods comparison. But some aspects have received little attention. First, most comparison studies treat prediction performance and variable selection aspects separately. Second, methods are either compared within a bina…

    Submitted 25 July, 2018; originally announced July 2018.

  15. arXiv:1807.03545  [pdf, other]

    stat.ML cs.LG

    Dual optimization for convex constrained objectives without the gradient-Lipschitz assumption

    Authors: Martin Bompaire, Emmanuel Bacry, Stéphane Gaïffas

    Abstract: The minimization of convex objectives coming from linear supervised learning problems, such as penalized generalized linear models, can be formulated as finite sums of convex functions. For such problems, a large set of stochastic first-order solvers based on the idea of variance reduction are available and combine both computational efficiency and sound theoretical guarantees (linear convergence…

    Submitted 15 December, 2018; v1 submitted 10 July, 2018; originally announced July 2018.

    MSC Class: 90C25; 65K05; 65K10; 49M29

  16. arXiv:1607.06333  [pdf, other]

    stat.ML cs.LG

    Uncovering Causality from Multivariate Hawkes Integrated Cumulants

    Authors: Massil Achab, Emmanuel Bacry, Stéphane Gaïffas, Iacopo Mastromatteo, Jean-Francois Muzy

    Abstract: We design a new nonparametric method that allows one to estimate the matrix of integrated kernels of a multivariate Hawkes process. This matrix not only encodes the mutual influences of each node of the process, but also disentangles the causality relationships between them. Our approach is the first that leads to an estimation of this matrix without any parametric modeling and estimation of the…

    Submitted 29 May, 2017; v1 submitted 21 July, 2016; originally announced July 2016.

  17. arXiv:1511.01512  [pdf, other]

    cs.LG cond-mat.stat-mech

    Mean-field inference of Hawkes point processes

    Authors: Emmanuel Bacry, Stéphane Gaïffas, Iacopo Mastromatteo, Jean-François Muzy

    Abstract: We propose a fast and efficient estimation method that is able to accurately recover the parameters of a d-dimensional Hawkes point-process from a set of observations. We exploit a mean-field approximation that is valid when the fluctuations of the stochastic intensity are small. We show that this is notably the case in situations when interactions are sufficiently weak, when the dimension of the…

    Submitted 4 November, 2015; originally announced November 2015.

    Comments: 29 pages, 8 figures

  18. arXiv:1510.04822  [pdf, other]

    stat.ML cs.LG

    SGD with Variance Reduction beyond Empirical Risk Minimization

    Authors: Massil Achab, Agathe Guilloux, Stéphane Gaïffas, Emmanuel Bacry

    Abstract: We introduce a doubly stochastic proximal gradient algorithm for optimizing a finite average of smooth convex functions, whose gradients depend on numerically expensive expectations. Our main motivation is the acceleration of the optimization of the regularized Cox partial-likelihood (the core model used in survival analysis), but our algorithm can be used in different settings as well. The propos…

    Submitted 8 November, 2016; v1 submitted 16 October, 2015; originally announced October 2015.

    Comments: 17 pages

  19. arXiv:1107.1638  [pdf, other]

    cs.IT math.ST

    Weighted algorithms for compressed sensing and matrix completion

    Authors: Stéphane Gaïffas, Guillaume Lecué

    Abstract: This paper is about iteratively reweighted basis-pursuit algorithms for compressed sensing and matrix completion problems. In a first part, we give a theoretical explanation of the fact that reweighted basis pursuit can improve a lot upon basis pursuit for exact recovery in compressed sensing. We exhibit a condition that links the accuracy of the weights to the RIP and incoherency constants, which…

    Submitted 8 July, 2011; originally announced July 2011.