Skip to main content

Showing 1–26 of 26 results for author: Klopp, O

.
  1. arXiv:2405.00619  [pdf, other

    stat.ME

    One-Bit Total Variation Denoising over Networks with Applications to Partially Observed Epidemics

    Authors: Claire Donnat, Olga Klopp, Nicolas Verzelen

    Abstract: This paper introduces a novel approach for epidemic nowcasting and forecasting over networks using total variation (TV) denoising, a method inspired by classical signal processing techniques. Considering a network that models a population as a set of $n$ nodes characterized by their infection statuses $Y_i$ and that represents contacts as edges, we prove the consistency of graph-TV denoising for e… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  2. arXiv:2404.17209  [pdf, other

    math.ST

    Generalized multi-view model: Adaptive density estimation under low-rank constraints

    Authors: Julien Chhor, Olga Klopp, Alexandre Tsybakov

    Abstract: We study the problem of bivariate discrete or continuous probability density estimation under low-rank constraints.For discrete distributions, we assume that the two-dimensional array to estimate is a low-rank probability matrix. In the continuous case, we assume that the density with respect to the Lebesgue measure satisfies a generalized multi-view model, meaning that it is $β$-H{ö}lder and can… ▽ More

    Submitted 18 June, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  3. arXiv:2305.00311  [pdf, ps, other

    math.ST

    Change point detection in low-rank VAR processes

    Authors: Farida Enikeeva, Olga Klopp, Mathilde Rousselot

    Abstract: Vector autoregressive (VAR) models are widely used in multivariate time series analysis for describing the short-time dynamics of the data. The reduced-rank VAR models are of particular interest when dealing with high-dimensional and highly correlated time series. Many results for these models are based on the stationarity assumption that does not hold in several applications when the data exhibit… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  4. arXiv:2111.03305  [pdf, other

    math.ST

    Optimality of variational inference for stochastic block model with missing links

    Authors: Solenne Gaucher, Olga Klopp

    Abstract: Variational methods are extremely popular in the analysis of network data. Statistical guarantees obtained for these methods typically provide asymptotic normality for the problem of estimation of global model parameters under the stochastic block model. In the present work, we consider the case of networks with missing links that is important in application and show that the variational approxima… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

  5. arXiv:2107.03684  [pdf, other

    math.ST

    Assigning Topics to Documents by Successive Projections

    Authors: Olga Klopp, Maxim Panov, Suzanne Sigalla, Alexandre Tsybakov

    Abstract: Topic models provide a useful tool to organize and understand the structure of large corpora of text documents, in particular, to discover hidden thematic structure. Clustering documents from big unstructured corpora into topics is an important task in various areas, such as image analysis, e-commerce, social networks, population genetics. A common approach to topic modeling is to associate each t… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  6. arXiv:2106.14470  [pdf, ps, other

    math.ST

    Change-Point Detection in Dynamic Networks with Missing Links

    Authors: Farida Enikeeva, Olga Klopp

    Abstract: Structural changes occur in dynamic networks quite frequently and its detection is an important question in many situations such as fraud detection or cybersecurity. Real-life networks are often incompletely observed due to individual non-response or network size. In the present paper we consider the problem of change-point detection at a temporal sequence of partially observed networks. The goal… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

  7. arXiv:1911.13122  [pdf, other

    stat.ML cs.LG cs.SI stat.ME

    Outliers Detection in Networks with Missing Links

    Authors: Solenne Gaucher, Olga Klopp, Geneviève Robin

    Abstract: Outliers arise in networks due to different reasons such as fraudulent behavior of malicious users or default in measurement instruments and can significantly impair network analyses. In addition, real-life networks are likely to be incompletely observed, with missing links due to individual non-response or machine failures. Identifying outliers in the presence of missing links is therefore a cruc… ▽ More

    Submitted 1 December, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

  8. arXiv:1902.10605  [pdf, other

    math.ST

    Maximum Likelihood Estimation of Sparse Networks with Missing Observations

    Authors: Solenne Gaucher, Olga Klopp

    Abstract: Estimating the matrix of connections probabilities is one of the key questions when studying sparse networks. In this work, we consider networks generated under the sparse graphon model and the in-homogeneous random graph model with missing observations. Using the Stochastic Block Model as a parametric proxy, we bound the risk of the maximum likelihood estimator of network connections probabilitie… ▽ More

    Submitted 27 April, 2021; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: We derive a variational approximation to the maximum likelihood estimator of the connection probabilities. We bound the risk of this tractable estimator

  9. arXiv:1812.08398  [pdf, other

    stat.ML cs.LG

    Low-rank Interaction with Sparse Additive Effects Model for Large Data Frames

    Authors: Geneviève Robin, Hoi-To Wai, Julie Josse, Olga Klopp, Éric Moulines

    Abstract: Many applications of machine learning involve the analysis of large data frames-matrices collecting heterogeneous measurements (binary, numerical, counts, etc.) across samples-with missing values. Low-rank models, as studied by Udell et al. [30], are popular in this framework for tasks such as visualization, clustering and missing value imputation. Yet, available methods with statistical guarantee… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

  10. arXiv:1807.09010  [pdf, other

    stat.ML cs.LG

    Collective Matrix Completion

    Authors: Mokhtar Z. Alaya, Olga Klopp

    Abstract: Matrix completion aims to reconstruct a data matrix based on observations of a small number of its entries. Usually in matrix completion a single matrix is considered, which can be, for example, a rating matrix in recommendation system. However, in practical situations, data is often obtained from multiple sources which results in a collection of matrices rather than a single one. In this work, we… ▽ More

    Submitted 21 October, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

  11. arXiv:1806.09734  [pdf, other

    stat.ME

    Main effects and interactions in mixed and incomplete data frames

    Authors: Geneviève Robin, Olga Klopp, Julie Josse, Éric Moulines, Robert Tibshirani

    Abstract: A mixed data frame (MDF) is a table collecting categorical, numerical and count observations. The use of MDF is widespread in statistics and the applications are numerous from abundance data in ecology to recommender systems. In many cases, an MDF exhibits simultaneously main effects, such as row, column or group effects and interactions, for which a low-rank model has often been suggested. Althou… ▽ More

    Submitted 26 March, 2019; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: 25 pages, 1 figure, 4 tables

  12. arXiv:1707.02090  [pdf, ps, other

    math.ST

    Structured Matrix Estimation and Completion

    Authors: Olga Klopp, Yu Lu, Alexandre B. Tsybakov, Harrison H. Zhou

    Abstract: We study the problem of matrix estimation and matrix completion under a general framework. This framework includes several important models as special cases such as the gaussian mixture model, mixed membership model, bi-clustering model and dictionary learning. We consider the optimal convergence rates in a minimax sense for estimation of the signal matrix under the Frobenius norm and under the sp… ▽ More

    Submitted 7 July, 2017; originally announced July 2017.

  13. arXiv:1704.02760  [pdf, ps, other

    math.ST

    Constructing confidence sets for the matrix completion problem

    Authors: Alexandra Carpentier, Olga Klopp, Matthias Löffler

    Abstract: In the present note we consider the problem of constructing honest and adaptive confidence sets for the matrix completion problem. For the Bernoulli model with known variance of the noise we provide a realizable method for constructing confidence sets that adapt to the unknown rank of the true matrix.

    Submitted 10 April, 2017; originally announced April 2017.

  14. arXiv:1703.05101  [pdf, ps, other

    math.ST

    Optimal graphon estimation in cut distance

    Authors: Olga Klopp, Nicolas Verzelen

    Abstract: Consider the twin problems of estimating the connection probability matrix of an inhomogeneous random graph and the graphon of a W-random graph. We establish the minimax estimation rates with respect to the cut metric for classes of block constant matrices and step function graphons. Surprisingly, our results imply that, from the minimax point of view, the raw data, that is, the adjacency matrix o… ▽ More

    Submitted 16 October, 2018; v1 submitted 15 March, 2017; originally announced March 2017.

  15. arXiv:1608.04861  [pdf, ps, other

    math.ST

    Adaptive confidence sets for matrix completion

    Authors: Alexandra Carpentier, Olga Klopp, Matthias Löffler, Richard Nickl

    Abstract: In the present paper we study the problem of existence of honest and adaptive confidence sets for matrix completion. We consider two statistical models: the trace regression model and the Bernoulli model. In the trace regression model, we show that honest confidence sets that adapt to the unknown rank of the matrix exist even when the error variance is unknown. Contrary to this, we prove that in t… ▽ More

    Submitted 6 February, 2017; v1 submitted 17 August, 2016; originally announced August 2016.

  16. arXiv:1509.00319  [pdf, ps, other

    math.ST

    Estimation of matrices with row sparsity

    Authors: O. Klopp, A. B. Tsybakov

    Abstract: An increasing number of applications is concerned with recovering a sparse matrix from noisy observations. In this paper, we consider the setting where each row of the unknown matrix is sparse. We establish minimax optimal rates of convergence for estimating matrices with row sparsity. A major focus in the present paper is on the derivation of lower bounds.

    Submitted 1 September, 2015; originally announced September 2015.

  17. arXiv:1507.04118  [pdf, ps, other

    math.ST

    Oracle inequalities for network models and sparse graphon estimation

    Authors: Olga Klopp, Alexandre B. Tsybakov, Nicolas Verzelen

    Abstract: Inhomogeneous random graph models encompass many network models such as stochastic block models and latent position models. We consider the problem of statistical estimation of the matrix of connection probabilities based on the observations of the adjacency matrix of the network. Taking the stochastic block model as an approximation, we construct estimators of network connection probabilities --… ▽ More

    Submitted 13 September, 2017; v1 submitted 15 July, 2015; originally announced July 2015.

    Comments: Annals of Statistics, Institute of Mathematical Statistics, 2017

  18. arXiv:1502.00146  [pdf, ps, other

    math.ST

    Matrix completion by singular value thresholding: sharp bounds

    Authors: Olga Klopp

    Abstract: We consider the matrix completion problem where the aim is to esti-mate a large data matrix for which only a relatively small random subset of its entries is observed. Quite popular approaches to matrix completion problem are iterative thresholding methods. In spite of their empirical success, the theoretical guarantees of such iterative thresholding methods are poorly understood. The goal of this… ▽ More

    Submitted 31 January, 2015; originally announced February 2015.

  19. arXiv:1412.8132  [pdf, ps, other

    math.ST

    Robust Matrix Completion

    Authors: Olga Klopp, Karim Lounici, Alexandre B. Tsybakov

    Abstract: This paper considers the problem of recovery of a low-rank matrix in the situation when most of its entries are not observed and a fraction of observed entries are corrupted. The observations are noisy realizations of the sum of a low rank matrix, which we wish to recover, with a second matrix having a complementary sparse structure such as element-wise or column-wise sparsity. We analyze a class… ▽ More

    Submitted 4 July, 2016; v1 submitted 28 December, 2014; originally announced December 2014.

  20. arXiv:1412.2632  [pdf, ps, other

    math.ST stat.ML

    Probabilistic low-rank matrix completion on finite alphabets

    Authors: Jean Lafond, Olga Klopp, Eric Moulines, Jospeh Salmon

    Abstract: The task of reconstructing a matrix given a sample of observedentries is known as the matrix completion problem. It arises ina wide range of problems, including recommender systems, collaborativefiltering, dimensionality reduction, image processing, quantum physics or multi-class classificationto name a few. Most works have focused on recovering an unknown real-valued low-rankmatrix from randomly… ▽ More

    Submitted 8 December, 2014; originally announced December 2014.

    Comments: arXiv admin note: text overlap with arXiv:1408.6218

    Journal ref: NIPS, Dec 2014, Montreal, Canada

  21. arXiv:1408.6218  [pdf, ps, other

    math.ST stat.ML

    Adaptive Multinomial Matrix Completion

    Authors: Olga Klopp, Jean Lafond, Eric Moulines, Joseph Salmon

    Abstract: The task of estimating a matrix given a sample of observed entries is known as the \emph{matrix completion problem}. Most works on matrix completion have focused on recovering an unknown real-valued low-rank matrix from a random sample of its entries. Here, we investigate the case of highly quantized observations when the measurements can take only a small number of values. These quantized outputs… ▽ More

    Submitted 26 August, 2014; originally announced August 2014.

  22. arXiv:1312.4087  [pdf, ps, other

    math.ST

    Sparse high-dimensional varying coefficient model: non-asymptotic minimax study

    Authors: Olga Klopp, Marianna Pensky

    Abstract: The objective of the present paper is to develop a minimax theory for the varying coefficient model in a non-asymptotic setting. We consider a high-dimensional sparse varying coefficient model where only few of the covariates are present and only some of those covariates are time dependent. Our analysis allows the time dependent covariates to have different degrees of smoothness and to be spatia… ▽ More

    Submitted 14 May, 2014; v1 submitted 14 December, 2013; originally announced December 2013.

    Comments: 27 pages

    MSC Class: 62H12; 62J05; 62C20

  23. arXiv:1211.3394  [pdf, ps, other

    math.ST

    Non-asymptotic approach to varying coefficient model

    Authors: Olga Klopp, Marianna Pensky

    Abstract: In the present paper we consider the varying coefficient model which represents a useful tool for exploring dynamic patterns in many applications. Existing methods typically provide asymptotic evaluation of precision of estimation procedures under the assumption that the number of observations tends to infinity. In practical applications, however, only a finite number of measurements are available… ▽ More

    Submitted 6 February, 2013; v1 submitted 14 November, 2012; originally announced November 2012.

  24. Noisy low-rank matrix completion with general sampling distribution

    Authors: Olga Klopp

    Abstract: In the present paper, we consider the problem of matrix completion with noise. Unlike previous works, we consider quite general sampling distribution and we do not need to know or to estimate the variance of the noise. Two new nuclear-norm penalized estimators are proposed, one of them of "square-root" type. We analyse their performance under high-dimensional scaling and provide non-asymptotic bou… ▽ More

    Submitted 5 February, 2014; v1 submitted 1 March, 2012; originally announced March 2012.

    Comments: Published in at http://dx.doi.org/10.3150/12-BEJ486 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ486

    Journal ref: Bernoulli 2014, Vol. 20, No. 1, 282-303

  25. arXiv:1112.3055  [pdf, other

    math.ST

    High dimensional matrix estimation with unknown variance of the noise

    Authors: Olga Klopp, Stéphane Gaiffas

    Abstract: We propose a new pivotal method for estimating high-dimensional matrices. Assume that we observe a small set of entries or linear combinations of entries of an unknown matrix $A\_0$ corrupted by noise. We propose a new method for estimating $A\_0$ which does not rely on the knowledge or an estimation of the standard deviation of the noise $σ$. Our estimator achieves, up to a logarithmic factor, op… ▽ More

    Submitted 31 January, 2015; v1 submitted 13 December, 2011; originally announced December 2011.

  26. arXiv:1104.1244  [pdf, ps, other

    math.ST

    Rank penalized estimators for high-dimensional matrices

    Authors: Olga Klopp

    Abstract: In this paper we consider the trace regression model. Assume that we observe a small set of entries or linear combinations of entries of an unknown matrix $A_0$ corrupted by noise. We propose a new rank penalized estimator of $A_0$. For this estimator we establish general oracle inequality for the prediction error both in probability and in expectation. We also prove upper bounds for the rank of o… ▽ More

    Submitted 12 September, 2011; v1 submitted 7 April, 2011; originally announced April 2011.

    Comments: We added a new section on matrix regression