Showing 1–27 of 27 results for author: Giraud, C

  1. arXiv:2406.11485  [pdf, other]

    stat.ML cs.LG

    Active clustering with bandit feedback

    Authors: Victor Thuot, Alexandra Carpentier, Christophe Giraud, Nicolas Verzelen

    Abstract: We investigate the Active Clustering Problem (ACP). A learner interacts with an $N$-armed stochastic bandit with $d$-dimensional subGaussian feedback. There exists a hidden partition of the arms into $K$ groups, such that arms within the same group share the same mean vector. The learner's task is to uncover this hidden partition with the smallest budget - i.e., the least number of observations -…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 50 pages

  2. arXiv:2405.08747  [pdf, ps, other]

    math.ST

    Minimax optimal seriation in polynomial time

    Authors: Yann Issartel, Christophe Giraud, Nicolas Verzelen

    Abstract: We consider the statistical seriation problem, where the statistician seeks to recover a hidden ordering from a noisy observation of a permuted Robinson matrix. In this paper, we tightly characterize the minimax rate for this problem of matrix reordering when the Robinson matrix is bi-Lipschitz, and we also provide a polynomial time algorithm achieving this rate; thereby answering two open questio…

    Submitted 14 May, 2024; originally announced May 2024.

  3. arXiv:2403.09755  [pdf, other]

    stat.ML cs.LG cs.SI

    Estimating the history of a random recursive tree

    Authors: Simon Briend, Christophe Giraud, Gábor Lugosi, Déborah Sulem

    Abstract: This paper studies the problem of estimating the order of arrival of the vertices in a random recursive tree. Specifically, we study two fundamental models: the uniform attachment model and the linear preferential attachment model. We propose an order estimator based on the Jordan centrality measure and define a family of risk measures to quantify the quality of the ordering procedure. Moreover, w…

    Submitted 14 March, 2024; originally announced March 2024.

  4. arXiv:2402.18378  [pdf, other]

    math.ST

    Computation-information gap in high-dimensional clustering

    Authors: Bertrand Even, Christophe Giraud, Nicolas Verzelen

    Abstract: We investigate the existence of a fundamental computation-information gap for the problem of clustering a mixture of isotropic Gaussians in the high-dimensional regime, where the ambient dimension $p$ is larger than the number $n$ of points. The existence of a computation-information gap in a specific Bayesian high-dimensional asymptotic regime has been conjectured by arXiv:1610.02918 based on the…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 53 pages

    MSC Class: 62H30

  5. arXiv:2305.19605  [pdf, other]

    stat.ML

    Parameter-free projected gradient descent

    Authors: Evgenii Chzhen, Christophe Giraud, Gilles Stoltz

    Abstract: We consider the problem of minimizing a convex function over a closed convex set, with Projected Gradient Descent (PGD). We propose a fully parameter-free version of AdaGrad, which is adaptive to the distance between the initialization and the optimum, and to the sum of the square norm of the subgradients. Our algorithm is able to handle projection steps, does not involve restarts, reweighing alo…

    Submitted 31 May, 2023; originally announced May 2023.

  6. arXiv:2305.15807  [pdf, other]

    stat.ML cs.LG

    Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness

    Authors: Evgenii Chzhen, Christophe Giraud, Zhen Li, Gilles Stoltz

    Abstract: We consider contextual bandit problems with knapsacks [CBwK], a problem where at each round, a scalar reward is obtained and vector-valued costs are suffered. The learner aims to maximize the cumulative rewards while ensuring that the cumulative costs are lower than some predetermined cost constraints. We assume that contexts come from a continuous set, that costs can be signed, and that the expe…

    Submitted 26 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Journal ref: Advances in Neural Information Processing Systems, Dec 2023, New Orleans, United States

  7. arXiv:2203.09784  [pdf, other]

    math.ST stat.ML

    The price of unfairness in linear bandits with biased feedback

    Authors: Solenne Gaucher, Alexandra Carpentier, Christophe Giraud

    Abstract: In this paper, we study the problem of fair sequential decision making with biased linear bandit feedback. At each round, a player selects an action described by a covariate and by a sensitive attribute. The perceived reward is a linear combination of the covariates of the chosen action, but the player only observes a biased evaluation of this reward, depending on the sensitive attribute. To chara…

    Submitted 3 June, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

  8. arXiv:2110.15596  [pdf, other]

    cs.LG

    Training Integrable Parameterizations of Deep Neural Networks in the Infinite-Width Limit

    Authors: Karl Hajjar, Lénaïc Chizat, Christophe Giraud

    Abstract: To theoretically understand the behavior of trained deep neural networks, it is necessary to study the dynamics induced by gradient methods from a random initialization. However, the nonlinear and compositional structure of these models makes these dynamics difficult to analyze. To overcome these challenges, large-width asymptotics have recently emerged as a fruitful viewpoint and led to practical…

    Submitted 20 December, 2021; v1 submitted 29 October, 2021; originally announced October 2021.

  9. arXiv:2108.03098  [pdf, other]

    math.ST stat.ML

    Localization in 1D non-parametric latent space models from pairwise affinities

    Authors: Christophe Giraud, Yann Issartel, Nicolas Verzelen

    Abstract: We consider the problem of estimating latent positions in a one-dimensional torus from pairwise affinities. The observed affinity between a pair of items is modeled as a noisy observation of a function $f(x^*_{i},x^*_{j})$ of the latent positions $x^*_{i},x^*_{j}$ of the two items on the torus. The affinity function $f$ is unknown, and it is only assumed to fulfill some shape constraints ensuring…

    Submitted 11 August, 2023; v1 submitted 6 August, 2021; originally announced August 2021.

  10. arXiv:2106.12242  [pdf, ps, other]

    cs.LG cs.AI cs.CY stat.ML

    A Unified Approach to Fair Online Learning via Blackwell Approachability

    Authors: Evgenii Chzhen, Christophe Giraud, Gilles Stoltz

    Abstract: We provide a setting and a general approach to fair online learning with stochastic sensitive and non-sensitive contexts. The setting is a repeated game between the Player and Nature, where at each stage both pick actions based on the contexts. Inspired by the notion of unawareness, we assume that the Player can only access the non-sensitive context before making a decision, while we discuss both…

    Submitted 7 November, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Dec 2021, Virtual conference

  11. arXiv:1905.07342  [pdf, other]

    stat.ML cs.LG math.ST

    Pair-Matching: Links Prediction with Adaptive Queries

    Authors: Christophe Giraud, Yann Issartel, Luc Lehéricy, Matthieu Lerasle

    Abstract: The pair-matching problem appears in many applications where one wants to discover good matches between pairs of entities or individuals. Formally, the set of individuals is represented by the nodes of a graph where the edges, unobserved at first, represent the good matches. The algorithm queries pairs of nodes and observes the presence/absence of edges. Its goal is to discover as many edges as po…

    Submitted 5 March, 2024; v1 submitted 17 May, 2019; originally announced May 2019.

    Comments: 78 pages

    MSC Class: 62H30; 68T05; 05C80

  12. arXiv:1807.07547  [pdf, ps, other]

    math.ST cs.LG

    Partial recovery bounds for clustering with the relaxed $K$means

    Authors: Christophe Giraud, Nicolas Verzelen

    Abstract: We investigate the clustering performance of the relaxed $K$means in the setting of sub-Gaussian Mixture Model (sGMM) and Stochastic Block Model (SBM). After identifying the appropriate signal-to-noise ratio (SNR), we prove that the misclassification error decays exponentially fast with respect to this SNR. These partial recovery bounds for the relaxed $K$means improve upon results currently known…

    Submitted 19 April, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

    Comments: 39 pages

    MSC Class: 62H30; 68T10

  13. arXiv:1706.08281  [pdf, other]

    stat.AP

    Estimation of species relative abundances and habitat preferences using opportunistic data

    Authors: Camille Coron, Clément Calenge, Christophe Giraud, Romain Julliard

    Abstract: We develop a new statistical procedure to monitor, with opportunistic data, relative species abundances and their respective preferences for different habitat types. Following Giraud et al. (2015), we combine the opportunistic data with some standardized data in order to correct the bias inherent to the opportunistic data collection. Our main contributions are (i) to tackle the bias induced by habitat…

    Submitted 26 June, 2017; originally announced June 2017.

  14. arXiv:1606.05100  [pdf, ps, other]

    math.ST

    PECOK: a convex optimization approach to variable clustering

    Authors: Florentina Bunea, Christophe Giraud, Martin Royer, Nicolas Verzelen

    Abstract: The problem of variable clustering is that of grouping similar components of a $p$-dimensional vector $X=(X_{1},\ldots,X_{p})$, and estimating these groups from $n$ independent copies of $X$. When cluster similarity is defined via $G$-latent models, in which groups of $X$-variables have a common latent generator, and groups are relative to a partition $G$ of the index set $\{1, \ldots, p\}$, the m…

    Submitted 16 June, 2016; originally announced June 2016.

  15. arXiv:1508.01939  [pdf, other]

    stat.ME math.ST stat.ML

    Model Assisted Variable Clustering: Minimax-optimal Recovery and Algorithms

    Authors: Florentina Bunea, Christophe Giraud, Xi Luo, Martin Royer, Nicolas Verzelen

    Abstract: Model-based clustering defines population level clusters relative to a model that embeds notions of similarity. Algorithms tailored to such models yield estimated clusters with a clear statistical interpretation. We take this view here and introduce the class of G-block covariance models as a background model for variable clustering. In such models, two variables in a cluster are deemed similar if…

    Submitted 12 December, 2018; v1 submitted 8 August, 2015; originally announced August 2015.

    Comments: Maintext: 38 pages; supplementary information: 37 pages

    MSC Class: 62H30; 62C20

  16. arXiv:1407.2432  [pdf, other]

    stat.AP

    Capitalising on Opportunistic Data for Monitoring Species Relative Abundances

    Authors: Christophe Giraud, Clément Calenge, Camille Coron, Romain Julliard

    Abstract: With the internet, a massive amount of information on species abundance can be collected under citizen science programs. However, these data are often difficult to use directly in statistical inference, as their collection is generally opportunistic, and the distribution of the sampling effort is often not known. In this paper, we develop a general statistical framework to combine such "opportuni…

    Submitted 26 February, 2015; v1 submitted 9 July, 2014; originally announced July 2014.

  17. arXiv:1404.6769  [pdf, ps, other]

    math.ST stat.ML

    Aggregation of predictors for nonstationary sub-linear processes and online adaptive forecasting of time varying autoregressive processes

    Authors: Christophe Giraud, François Roueff, Andres Sanchez-Perez

    Abstract: In this work, we study the problem of aggregating a finite number of predictors for nonstationary sub-linear processes. We provide oracle inequalities relying essentially on three ingredients: (1) a uniform bound of the $\ell^1$ norm of the time varying sub-linear coefficients, (2) a Lipschitz assumption on the predictors and (3) moment conditions on the noise appearing in the linear representatio…

    Submitted 17 November, 2015; v1 submitted 27 April, 2014; originally announced April 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1345 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1345

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 6, 2412-2450

  18. Discussion: Latent variable graphical model selection via convex optimization

    Authors: Christophe Giraud, Alexandre Tsybakov

    Abstract: Discussion of "Latent variable graphical model selection via convex optimization" by Venkat Chandrasekaran, Pablo A. Parrilo and Alan S. Willsky [arXiv:1008.1290].

    Submitted 5 November, 2012; originally announced November 2012.

    Comments: Published at http://dx.doi.org/10.1214/12-AOS984 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS984

    Journal ref: Annals of Statistics 2012, Vol. 40, No. 4, 1984-1988

  19. arXiv:1109.5587  [pdf, other]

    math.ST

    High-dimensional regression with unknown variance

    Authors: Christophe Giraud, Sylvie Huet, Nicolas Verzelen

    Abstract: We review recent results for high-dimensional sparse linear regression in the practical case of unknown variance. Different sparsity settings are covered, including coordinate-sparsity, group-sparsity and variation-sparsity. The emphasis is put on non-asymptotic analyses and feasible procedures. In addition, a small numerical study compares the practical performance of three schemes for tuning the…

    Submitted 20 February, 2012; v1 submitted 26 September, 2011; originally announced September 2011.

    Comments: 38 pages

  20. arXiv:1106.5599  [pdf, ps, other]

    math.ST

    A pseudo-RIP for multivariate regression

    Authors: Christophe Giraud

    Abstract: We give a suitable RI-Property under which recent results for trace regression translate into strong risk bounds for multivariate regression. This pseudo-RIP is compatible with the setting $n < p$.

    Submitted 28 June, 2011; originally announced June 2011.

    Comments: 5 pages

  21. arXiv:1009.5165  [pdf, other]

    math.ST

    Low rank Multivariate regression

    Authors: Christophe Giraud

    Abstract: We consider in this paper the multivariate regression problem, when the target regression matrix $A$ is close to a low rank matrix. Our primary interest is on the practical case where the variance of the noise is unknown. Our main contribution is to propose in this setting a criterion to select among a family of low rank estimators and prove a non-asymptotic oracle inequality for the resulting est…

    Submitted 22 June, 2011; v1 submitted 27 September, 2010; originally announced September 2010.

    Comments: 23 pages

  22. arXiv:1007.2096  [pdf, ps, other]

    math.ST

    Estimator selection in the Gaussian setting

    Authors: Yannick Baraud, Christophe Giraud, Sylvie Huet

    Abstract: We consider the problem of estimating the mean $f$ of a Gaussian vector $Y$ with independent components of common unknown variance $σ^{2}$. Our estimation procedure is based on estimator selection. More precisely, we start with an arbitrary and possibly infinite collection $\FF$ of estimators of $f$ based on $Y$ and, with the same data $Y$, aim at selecting an estimator among $\FF$ with the smalle…

    Submitted 22 June, 2011; v1 submitted 13 July, 2010; originally announced July 2010.

    Comments: 44 pages

  23. arXiv:1002.4569  [pdf, ps, other]

    cs.CR

    Atomicity Improvement for Elliptic Curve Scalar Multiplication

    Authors: Christophe Giraud, Vincent Verneuil

    Abstract: In this paper we address the problem of protecting elliptic curve scalar multiplication implementations against side-channel analysis by using the atomicity principle. First of all we reexamine classical assumptions made by scalar multiplication designers and we point out that some of them are not relevant in the context of embedded devices. We then describe the state-of-the-art of atomic scalar…

    Submitted 2 March, 2010; v1 submitted 24 February, 2010; originally announced February 2010.

    Journal ref: CARDIS 2010, Passau, Germany (2010)

  24. arXiv:0907.0619  [pdf, ps, other]

    math.ST

    Graph selection with GGMselect

    Authors: Christophe Giraud, Sylvie Huet, Nicolas Verzelen

    Abstract: Applications on inference of biological networks have raised a strong interest in the problem of graph estimation in high-dimensional Gaussian graphical models. To handle this problem, we propose a two-stage procedure which first builds a family of candidate graphs from the data, and then selects one graph among this family according to a dedicated criterion. This estimation procedure is shown to…

    Submitted 15 February, 2012; v1 submitted 3 July, 2009; originally announced July 2009.

    Comments: 44 pages

  25. arXiv:0711.0372  [pdf, ps, other]

    math.ST

    Mixing Least-Squares Estimators when the Variance is Unknown

    Authors: Christophe Giraud

    Abstract: We propose a procedure to handle the problem of Gaussian regression when the variance is unknown. We mix least-squares estimators from various models according to a procedure inspired by that of Leung and Barron (2007). We show that in some cases the resulting estimator is a simple shrinkage estimator. We then apply this procedure in various statistical settings such as linear regression or adap…

    Submitted 2 November, 2007; originally announced November 2007.

    Comments: 30 pages

    MSC Class: 62G08

  26. Estimation of Gaussian graphs by model selection

    Authors: Christophe Giraud

    Abstract: We investigate in this paper the estimation of Gaussian graphs by model selection from a non-asymptotic point of view. We start from an n-sample of a Gaussian law P_C in R^p and focus on the disadvantageous case where n is smaller than p. To estimate the graph of conditional dependences of P_C, we introduce a collection of candidate graphs and then select one of them by minimizing a penalized emp…

    Submitted 16 July, 2008; v1 submitted 10 October, 2007; originally announced October 2007.

    Comments: Published at http://dx.doi.org/10.1214/08-EJS228 in the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    MSC Class: 62G08 (Primary) 15A52; 62J05 (Secondary)

    Journal ref: Electronic Journal of Statistics 2 (2008) 542--563

  27. Gaussian model selection with an unknown variance

    Authors: Yannick Baraud, Christophe Giraud, Sylvie Huet

    Abstract: Let $Y$ be a Gaussian vector whose components are independent with a common unknown variance. We consider the problem of estimating the mean $μ$ of $Y$ by model selection. More precisely, we start with a collection $\mathcal{S}=\{S_m,m\in\mathcal{M}\}$ of linear subspaces of $\mathbb{R}^n$ and associate to each of these the least-squares estimator of $μ$ on $S_m$. Then, we use a data driven pena…

    Submitted 1 April, 2009; v1 submitted 9 January, 2007; originally announced January 2007.

    Comments: Published at http://dx.doi.org/10.1214/07-AOS573 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS573

    MSC Class: 62G08 (Primary)

    Journal ref: Annals of Statistics 2009, Vol. 37, No. 2, 630-672