Skip to main content

Showing 1–7 of 7 results for author: Arlot, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2205.14613  [pdf, other

    stat.ML cs.LG math.ST

    A Conditional Randomization Test for Sparse Logistic Regression in High-Dimension

    Authors: Binh T. Nguyen, Bertrand Thirion, Sylvain Arlot

    Abstract: Identifying the relevant variables for a classification model with correct confidence levels is a central but difficult task in high-dimension. Despite the core role of sparse logistic regression in statistics and machine learning, it still lacks a good solution for accurate inference in the regime where the number of features $p$ is as large as or larger than the number of samples $n$. Here, we t… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

  2. arXiv:2011.11117  [pdf, other

    stat.ML cs.LG

    Online Orthogonal Matching Pursuit

    Authors: El Mehdi Saad, Gilles Blanchard, Sylvain Arlot

    Abstract: Greedy algorithms for feature selection are widely used for recovering sparse high-dimensional vectors in linear models. In classical procedures, the main emphasis was put on the sample complexity, with little or no consideration of the computation resources required. We present a novel online algorithm: Online Orthogonal Matching Pursuit (OOMP) for online support recovery in the random design set… ▽ More

    Submitted 10 February, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

  3. arXiv:1506.01829  [pdf, ps, other

    cs.LG

    Semidefinite and Spectral Relaxations for Multi-Label Classification

    Authors: Rémi Lajugie, Piotr Bojanowski, Sylvain Arlot, Francis Bach

    Abstract: In this paper, we address the problem of multi-label classification. We consider linear classifiers and propose to learn a prior over the space of labels to directly leverage the performance of such methods. This prior takes the form of a quadratic function of the labels and permits to encode both attractive and repulsive relations between labels. We cast this problem as a structured prediction on… ▽ More

    Submitted 5 June, 2015; originally announced June 2015.

  4. arXiv:1409.3136  [pdf, other

    cs.LG

    Metric Learning for Temporal Sequence Alignment

    Authors: Damien Garreau, Rémi Lajugie, Sylvain Arlot, Francis Bach

    Abstract: In this paper, we propose to learn a Mahalanobis distance to perform alignment of multivariate time series. The learning examples for this task are time series for which the true alignment is known. We cast the alignment problem as a structured prediction task, and propose realistic losses between alignments for which the optimization is tractable. We provide experiments on real data in the audio… ▽ More

    Submitted 10 September, 2014; originally announced September 2014.

  5. arXiv:1407.3939  [pdf, other

    math.ST cs.LG stat.ME

    Analysis of purely random forests bias

    Authors: Sylvain Arlot, Robin Genuer

    Abstract: Random forests are a very effective and commonly used statistical method, but their full theoretical analysis is still an open problem. As a first step, simplified models such as purely random forests have been introduced, in order to shed light on the good performance of random forests. In this paper, we study the approximation error (the bias) of some purely random forest models in a regression… ▽ More

    Submitted 15 July, 2014; originally announced July 2014.

  6. arXiv:1303.1280  [pdf, other

    cs.LG stat.ML

    Large-Margin Metric Learning for Partitioning Problems

    Authors: Rémi Lajugie, Sylvain Arlot, Francis Bach

    Abstract: In this paper, we consider unsupervised partitioning problems, such as clustering, image segmentation, video segmentation and other change-point detection problems. We focus on partitioning problems based explicitly or implicitly on the minimization of Euclidean distortions, which include mean-based change-point detection, K-means, spectral clustering and normalized cuts. Our main goal is to learn… ▽ More

    Submitted 6 March, 2013; originally announced March 2013.

  7. arXiv:1210.5830  [pdf, other

    math.ST cs.LG

    Choice of V for V-Fold Cross-Validation in Least-Squares Density Estimation

    Authors: Sylvain Arlot, Matthieu Lerasle

    Abstract: This paper studies V-fold cross-validation for model selection in least-squares density estimation. The goal is to provide theoretical grounds for choosing V in order to minimize the least-squares loss of the selected estimator. We first prove a non-asymptotic oracle inequality for V-fold cross-validation and its bias-corrected version (V-fold penalization). In particular, this result implies that… ▽ More

    Submitted 11 October, 2015; v1 submitted 22 October, 2012; originally announced October 2012.