Showing 1–50 of 50 results for author: Levina, E

  1. arXiv:2308.11218  [pdf, other]

    stat.ME

    Computational Inference for Directions in Canonical Correlation Analysis

    Authors: Daniel Kessler, Elizaveta Levina

    Abstract: Canonical Correlation Analysis (CCA) is a method for analyzing pairs of random vectors; it learns a sequence of paired linear transformations such that the resultant canonical variates are maximally correlated within pairs while uncorrelated across pairs. CCA outputs both canonical correlations as well as the canonical directions which define the transformations. While inference for canonical corr…

    Submitted 22 August, 2023; originally announced August 2023.

  2. arXiv:2305.08791  [pdf, other]

    stat.ML cs.LG

    Fair Information Spread on Social Networks with Community Structure

    Authors: Octavio Mesner, Elizaveta Levina, Ji Zhu

    Abstract: Information spread through social networks is ubiquitous. Influence maximization (IM) algorithms aim to identify individuals who will generate the greatest spread through the social network if provided with information, and have been largely developed with marketing in mind. In social networks with community structure, which are very common, IM algorithms focused solely on maximizing spread ma…

    Submitted 15 May, 2023; originally announced May 2023.

  3. arXiv:2303.05909  [pdf, other]

    stat.ME stat.ML

    A pseudo-likelihood approach to community detection in weighted networks

    Authors: Andressa Cerqueira, Elizaveta Levina

    Abstract: Community structure is common in many real networks, with nodes clustered in groups sharing the same connection patterns. While many community detection methods have been developed for networks with binary edges, few of them are applicable to networks with weighted edges, which are common in practice. We propose a pseudo-likelihood community estimation algorithm derived under the weighted stochas…

    Submitted 10 March, 2023; originally announced March 2023.

  4. arXiv:2302.10095  [pdf, other]

    stat.ME math.ST stat.ML

    Conformal Prediction for Network-Assisted Regression

    Authors: Robert Lunde, Elizaveta Levina, Ji Zhu

    Abstract: An important problem in network analysis is predicting a node attribute using both network covariates, such as graph embedding coordinates or local subgraph counts, and conventional node covariates, such as demographic characteristics. While standard regression methods that make use of both types of covariates may be used for prediction, statistical inference is complicated by the fact that the no…

    Submitted 22 February, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Typos in Appendix corrected

  5. arXiv:2210.17519  [pdf, other]

    stat.ME stat.AP

    Predicting Responses from Weighted Networks with Node Covariates in an Application to Neuroimaging

    Authors: Daniel Kessler, Keith Levin, Elizaveta Levina

    Abstract: We consider the setting where many networks are observed on a common node set, and each observation comprises edge weights of a network, covariates observed at each node, and an overall response. The goal is to use the edge weights and node covariates to predict the response while identifying an interpretable set of predictive features. Our motivating application is neuroimaging, where edge weight…

    Submitted 22 August, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

  6. arXiv:2210.07491  [pdf, other]

    stat.ME

    Latent process models for functional network data

    Authors: Peter W. MacDonald, Elizaveta Levina, Ji Zhu

    Abstract: Network data are often sampled with auxiliary information or collected through the observation of a complex system over time, leading to multiple network snapshots indexed by a continuous variable. Many methods in statistical network analysis are traditionally designed for a single network, and can be applied to an aggregated network in this setting, but that approach can miss important functional…

    Submitted 30 April, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 64 pages, 19 figures; typos corrected, literature review updated

  7. arXiv:2206.13088  [pdf, other]

    stat.ME

    Network resampling for estimating uncertainty

    Authors: Qianhua Shan, Elizaveta Levina

    Abstract: With network data becoming ubiquitous in many applications, many models and algorithms for network analysis have been proposed. Yet methods for providing uncertainty estimates in addition to point estimates of network parameters are much less common. While bootstrap and other resampling procedures have been an effective general tool for estimating uncertainty from i.i.d. samples, adapting them to…

    Submitted 27 June, 2022; originally announced June 2022.

  8. arXiv:2205.14220  [pdf, other]

    stat.ME stat.AP stat.ML

    Selective Inference for Sparse Multitask Regression with Applications in Neuroimaging

    Authors: Snigdha Panigrahi, Natasha Stewart, Chandra Sekhar Sripada, Elizaveta Levina

    Abstract: Multi-task learning is frequently used to model a set of related response variables from the same set of features, improving predictive performance and modeling accuracy relative to methods that handle each response variable separately. Despite the potential of multi-task learning to yield more powerful inference than single-task alternatives, prior work in this area has largely omitted uncertaint…

    Submitted 9 August, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: 46 Pages, 11 Figures, 3 Tables

  9. arXiv:2012.14409  [pdf, other]

    stat.ME stat.ML

    Latent space models for multiplex networks with shared structure

    Authors: Peter W. MacDonald, Elizaveta Levina, Ji Zhu

    Abstract: Latent space models are frequently used for modeling single-layer networks and include many popular special cases, such as the stochastic block model and the random dot product graph. However, they are not well-developed for more complex network structures, which are becoming increasingly common in practice. Here we propose a new latent space model for multiplex networks: multiple, heterogeneous n…

    Submitted 7 July, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: 41 pages, 8 figures

  10. arXiv:2009.10641  [pdf, other]

    cs.SI cs.LG stat.CO stat.ML

    Overlapping community detection in networks via sparse spectral decomposition

    Authors: Jesús Arroyo, Elizaveta Levina

    Abstract: We consider the problem of estimating overlapping community memberships in a network, where each node can belong to multiple communities. More than a few communities per node are difficult to both estimate and interpret, so we focus on sparse node membership vectors. Our algorithm is based on sparse principal subspace estimation with iterative thresholding. The method is computationally efficient,…

    Submitted 15 February, 2021; v1 submitted 20 September, 2020; originally announced September 2020.

  11. arXiv:2008.03652  [pdf, other]

    stat.ME stat.ML

    Community models for networks observed through edge nominations

    Authors: Tianxi Li, Elizaveta Levina, Ji Zhu

    Abstract: Communities are a common and widely studied structure in networks, typically under the assumption that the network is fully and correctly observed. In practice, network data are often collected by querying nodes about their connections. In some settings, all edges of a sampled node will be recorded, and in others, a node may be asked to name its connections. These sampling mechanisms introduce noi…

    Submitted 18 March, 2021; v1 submitted 9 August, 2020; originally announced August 2020.

  12. arXiv:2002.01645  [pdf, other]

    stat.ME stat.ML

    Simultaneous prediction and community detection for networks with application to neuroimaging

    Authors: Jesús Arroyo, Elizaveta Levina

    Abstract: Community structure in networks is observed in many different domains, and unsupervised community detection has received a lot of attention in the literature. Increasingly the focus of network analysis is shifting towards using network information in some other prediction or inference task rather than just analyzing the network itself. In particular, in neuroimaging applications brain networks are…

    Submitted 27 February, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

  13. arXiv:1910.07434  [pdf, other]

    math.ST math.PR

    Matrix Means and a Novel High-Dimensional Shrinkage Phenomenon

    Authors: Asad Lodhia, Keith Levin, Elizaveta Levina

    Abstract: Many statistical settings call for estimating a population parameter, most typically the population mean, based on a sample of matrices. The most natural estimate of the population mean is the arithmetic mean, but there are many other matrix means that may behave differently, especially in high dimensions. Here we consider the matrix harmonic mean as an alternative to the arithmetic matrix mean. W…

    Submitted 15 July, 2021; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: 29 pages, 5 figures

    MSC Class: 60B20; 62H12; 47N30

  14. arXiv:1907.10821  [pdf, other]

    math.ST stat.ME

    Bootstrapping Networks with Latent Space Structure

    Authors: Keith Levin, Elizaveta Levina

    Abstract: A core problem in statistical network analysis is to develop network analogues of classical techniques. The problem of bootstrapping network data stands out as especially challenging, since typically one observes only a single network, rather than a sample. Here we propose two methods for obtaining bootstrap samples for networks drawn from latent space models. The first method generates bootstrap…

    Submitted 11 October, 2021; v1 submitted 24 July, 2019; originally announced July 2019.

  15. arXiv:1907.02443  [pdf, other]

    stat.ML cs.LG stat.ME

    High-dimensional Gaussian graphical model for network-linked data

    Authors: Tianxi Li, Cheng Qian, Elizaveta Levina, Ji Zhu

    Abstract: Graphical models are commonly used to represent conditional dependence relationships between variables. There are multiple methods available for exploring them from high-dimensional data, but almost all of them rely on the assumption that the observations are independent and identically distributed. At the same time, observations connected by a network are becoming increasingly common, and tend to…

    Submitted 21 April, 2020; v1 submitted 4 July, 2019; originally announced July 2019.

  16. arXiv:1906.07265  [pdf, other]

    math.ST cs.LG eess.SP stat.ME stat.ML

    Recovering shared structure from multiple networks with unknown edge distributions

    Authors: Keith Levin, Asad Lodhia, Elizaveta Levina

    Abstract: In increasingly many settings, data sets consist of multiple samples from a population of networks, with vertices aligned across these networks. For example, brain connectivity networks in neuroscience consist of measures of interaction between brain regions that have been aligned to a common template. We consider the setting where the observed networks have a shared expectation, but may differ in…

    Submitted 8 May, 2021; v1 submitted 12 June, 2019; originally announced June 2019.

  17. arXiv:1903.02129  [pdf, other]

    stat.AP stat.ME stat.ML

    Graph-aware Modeling of Brain Connectivity Networks

    Authors: Yura Kim, Daniel Kessler, Elizaveta Levina

    Abstract: Functional connections in the brain are frequently represented by weighted networks, with nodes representing locations in the brain, and edges representing the strength of connectivity between these locations. One challenge in analyzing such data is that inference at the individual edge level is not particularly biologically meaningful; interpretation is more useful at the level of so-called funct…

    Submitted 26 September, 2022; v1 submitted 5 March, 2019; originally announced March 2019.

  18. arXiv:1810.01509  [pdf, other]

    stat.ME math.ST stat.ML

    Hierarchical community detection by recursive partitioning

    Authors: Tianxi Li, Lihua Lei, Sharmodeep Bhattacharyya, Koen Van den Berge, Purnamrita Sarkar, Peter J. Bickel, Elizaveta Levina

    Abstract: The problem of community detection in networks is usually formulated as finding a single partition of the network into some "correct" number of communities. We argue that it is more interpretable and in some regimes more accurate to construct a hierarchical tree of communities instead. This can be done with a simple top-down recursive partitioning algorithm, starting with a single community and se…

    Submitted 14 May, 2020; v1 submitted 2 October, 2018; originally announced October 2018.

  19. arXiv:1803.04084  [pdf, other]

    stat.CO cs.LG stat.ML

    Link prediction for egocentrically sampled networks

    Authors: Yun-Jhong Wu, Elizaveta Levina, Ji Zhu

    Abstract: Link prediction in networks is typically accomplished by estimating or ranking the probabilities of edges for all pairs of nodes. In practice, especially for social networks, the data are often collected by egocentric sampling, which means selecting a subset of nodes and recording all of their edges. This sampling mechanism requires different prediction tools than the typical assumption of links m…

    Submitted 11 March, 2018; originally announced March 2018.

  20. arXiv:1801.08724  [pdf, ps, other]

    math.ST

    Concentration of random graphs and application to community detection

    Authors: Can M. Le, Elizaveta Levina, Roman Vershynin

    Abstract: Random matrix theory has played an important role in recent work on statistical network analysis. In this paper, we review recent results on regimes of concentration of random graphs around their expectation, showing that dense graphs concentrate and sparse graphs concentrate after regularization. We also review relevant network models that may be of interest to probabilists considering directions…

    Submitted 26 January, 2018; originally announced January 2018.

    Comments: Submission for International Congress of Mathematicians, Rio de Janeiro, Brazil 2018

  21. arXiv:1710.04765  [pdf, ps, other]

    math.ST cs.SI

    Estimating a network from multiple noisy realizations

    Authors: Can M. Le, Keith Levin, Elizaveta Levina

    Abstract: Complex interactions between entities are often represented as edges in a network. In practice, the network is often constructed from noisy measurements and inevitably contains some errors. In this paper we consider the problem of estimating a network from multiple noisy observations where edges of the original network are recorded with both false positives and false negatives. This problem is mot…

    Submitted 10 December, 2018; v1 submitted 12 October, 2017; originally announced October 2017.

    MSC Class: 62H12; 62H30; 62F12

  22. arXiv:1705.06772  [pdf, other]

    stat.ME stat.ML

    Generalized linear models with low rank effects for network data

    Authors: Yun-Jhong Wu, Elizaveta Levina, Ji Zhu

    Abstract: Networks are a useful representation for data on connections between units of interest, but the observed connections are often noisy and/or include missing values. One common approach to network analysis is to treat the network as a realization from a random graph model, and estimate the underlying edge probability matrix, which is sometimes referred to as network denoising. Here we propose a gen…

    Submitted 18 May, 2017; originally announced May 2017.

  23. arXiv:1701.08140  [pdf, other]

    stat.ME stat.ML

    Network classification with applications to brain connectomics

    Authors: Jesús D. Arroyo-Relión, Daniel Kessler, Elizaveta Levina, Stephan F. Taylor

    Abstract: While statistical analysis of a single network has received a lot of attention in recent years, with a focus on social networks, analysis of a sample of networks presents its own challenges which require a different set of analytic tools. Here we study the problem of classification of networks with labeled nodes, motivated by applications in neuroimaging. Brain networks are constructed from imagin…

    Submitted 1 February, 2019; v1 submitted 27 January, 2017; originally announced January 2017.

  24. arXiv:1612.04717  [pdf, other]

    stat.ME stat.ML

    Network cross-validation by edge sampling

    Authors: Tianxi Li, Elizaveta Levina, Ji Zhu

    Abstract: While many statistical models and methods are now available for network analysis, resampling network data remains a challenging problem. Cross-validation is a useful general tool for model selection and parameter tuning, but is not directly applicable to networks since splitting network nodes into groups requires deleting edges and destroys some of the network structure. Here we propose a new netw…

    Submitted 1 May, 2020; v1 submitted 14 December, 2016; originally announced December 2016.

  25. arXiv:1602.01192  [pdf, other]

    stat.ME

    Prediction models for network-linked data

    Authors: Tianxi Li, Elizaveta Levina, Ji Zhu

    Abstract: Prediction algorithms typically assume the training data are independent samples, but in many modern applications samples come from individuals connected by a network. For example, in adolescent health studies of risk-taking behaviors, information on the subjects' social network is often available and plays an important role through network cohesion, the empirically observed phenomenon of friends…

    Submitted 25 June, 2018; v1 submitted 3 February, 2016; originally announced February 2016.

  26. arXiv:1509.08588  [pdf, other]

    stat.ML

    Estimating network edge probabilities by neighborhood smoothing

    Authors: Yuan Zhang, Elizaveta Levina, Ji Zhu

    Abstract: The estimation of probabilities of network edges from the observed adjacency matrix has important applications to predicting missing links and network denoising. It has usually been addressed by estimating the graphon, a function that determines the matrix of edge probabilities, but this is ill-defined without strong assumptions on the network structure. Here we propose a novel computationally eff…

    Submitted 8 July, 2017; v1 submitted 29 September, 2015; originally announced September 2015.

    Comments: 22 pages, 4 figures, 3 tables

  27. Estimating heterogeneous graphical models for discrete data with an application to roll call voting

    Authors: Jian Guo, Jie Cheng, Elizaveta Levina, George Michailidis, Ji Zhu

    Abstract: We consider the problem of jointly estimating a collection of graphical models for discrete data, corresponding to several categories that share some common structure. An example for such a setting is voting records of legislators on different issues, such as defense, energy, and healthcare. We develop a Markov graphical model to characterize the heterogeneous dependence structures arising from su…

    Submitted 16 September, 2015; originally announced September 2015.

    Comments: Published at http://dx.doi.org/10.1214/13-AOAS700 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS700

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 2, 821-848

  28. arXiv:1509.01173  [pdf, other]

    stat.ML cs.SI physics.soc-ph

    Community Detection in Networks with Node Features

    Authors: Yuan Zhang, Elizaveta Levina, Ji Zhu

    Abstract: Many methods have been proposed for community detection in networks, but most of them do not take into account additional information on the nodes that is often available in practice. In this paper, we propose a new joint community detection criterion that uses both the network edge information and the node features to detect community structures. One advantage our method has over existing joint d…

    Submitted 3 September, 2015; originally announced September 2015.

    Comments: 16 pages, 5 pages

    Journal ref: Electronic Journal of Statistics, Volume 10, Number 2 (2016), 3153-3178

  29. arXiv:1507.00827  [pdf, ps, other]

    stat.ML cs.SI math.ST

    Estimating the number of communities in networks by spectral methods

    Authors: Can M. Le, Elizaveta Levina

    Abstract: Community detection is a fundamental problem in network analysis with many methods available to estimate communities. Most of these methods assume that the number of communities is known, which is often not the case in practice. We study a simple and very fast method for estimating the number of communities based on the spectral properties of certain graph operators, such as the non-backtracking m…

    Submitted 14 November, 2019; v1 submitted 3 July, 2015; originally announced July 2015.

    MSC Class: 62H12; 62H30

  30. arXiv:1506.00669  [pdf, other]

    math.PR cs.SI math.ST

    Concentration and regularization of random graphs

    Authors: Can M. Le, Elizaveta Levina, Roman Vershynin

    Abstract: This paper studies how close random graphs are typically to their expectations. We interpret this question through the concentration of the adjacency and Laplacian matrices in the spectral norm. We study inhomogeneous Erdös-Rényi random graphs on $n$ vertices, where edges form independently and possibly with different probabilities $p_{ij}$. Sparse random graphs whose expected degrees are…

    Submitted 9 August, 2016; v1 submitted 1 June, 2015; originally announced June 2015.

    Comments: 21 pages. Elizaveta Levina is added as a co-author. Application to community detection of networks is expanded

    MSC Class: 05C80; 60B20; 05C85

  31. arXiv:1502.03049  [pdf, other]

    math.ST cs.SI math.PR

    Sparse random graphs: regularization and concentration of the Laplacian

    Authors: Can M. Le, Elizaveta Levina, Roman Vershynin

    Abstract: We study random graphs with possibly different edge probabilities in the challenging sparse regime of bounded expected degrees. Unlike in the dense case, neither the graph adjacency matrix nor its Laplacian concentrate around their expectations due to the highly irregular distribution of node degrees. It has been empirically observed that simply adding a constant of order $1/n$ to each entry of th…

    Submitted 23 April, 2015; v1 submitted 10 February, 2015; originally announced February 2015.

    Comments: Added references

    MSC Class: 05C80; 05C85; 60B20; 62H30

  32. arXiv:1412.3432  [pdf, other]

    stat.ML

    Detecting Overlapping Communities in Networks Using Spectral Methods

    Authors: Yuan Zhang, Elizaveta Levina, Ji Zhu

    Abstract: Community detection is a fundamental problem in network analysis which is made more challenging by overlaps between communities which often occur in practice. Here we propose a general, flexible, and interpretable generative model for overlapping communities, which can be thought of as a generalization of the degree-corrected stochastic block model. We develop an efficient spectral algorithm for e…

    Submitted 12 March, 2015; v1 submitted 10 December, 2014; originally announced December 2014.

    Comments: 29 pages, 2 figures, 3 tables

  33. arXiv:1406.5647  [pdf, ps, other]

    cs.LG cs.SI stat.ML

    On semidefinite relaxations for the block model

    Authors: Arash A. Amini, Elizaveta Levina

    Abstract: The stochastic block model (SBM) is a popular tool for community detection in networks, but fitting it by maximum likelihood (MLE) involves a computationally infeasible optimization problem. We propose a new semidefinite programming (SDP) solution to the problem of fitting the SBM, derived as a relaxation of the MLE. We put ours and previously proposed SDPs in a unified framework, as relaxations o…

    Submitted 16 March, 2016; v1 submitted 21 June, 2014; originally announced June 2014.

  34. arXiv:1406.0067  [pdf, other]

    stat.ML cs.SI math.ST physics.soc-ph

    Optimization via Low-rank Approximation for Community Detection in Networks

    Authors: Can M. Le, Elizaveta Levina, Roman Vershynin

    Abstract: Community detection is one of the fundamental problems of network analysis, for which a number of methods have been proposed. Most model-based or criteria-based methods have to solve an optimization problem over a discrete set of labels to find communities, which is computationally infeasible. Some fast spectral algorithms have been proposed for specific methods or models, but only on a case-by-ca…

    Submitted 10 May, 2015; v1 submitted 31 May, 2014; originally announced June 2014.

    Comments: 45 pages, 7 figures; added discussions about computational complexity and extension to more than two communities

    MSC Class: 62E10; 62G05

  35. arXiv:1311.0416  [pdf, ps, other]

    stat.AP

    Structured functional regression models for high-dimensional spatial spectroscopy data

    Authors: Arash A. Amini, Elizaveta Levina, Kerby A. Shedden

    Abstract: Modeling and analysis of spectroscopy data is an active area of research with applications to chemistry and biology. This paper focuses on analyzing Raman spectra obtained from a bone fracture healing experiment, although the functional regression model for predicting a scalar response from high-dimensional tensors can be applied to any spectroscopy data. The regression model is built on a sparse…

    Submitted 2 November, 2013; originally announced November 2013.

  36. arXiv:1304.2810  [pdf, other]

    stat.ML stat.ME

    High-dimensional Mixed Graphical Models

    Authors: Jie Cheng, Tianxi Li, Elizaveta Levina, Ji Zhu

    Abstract: While graphical models for continuous data (Gaussian graphical models) and discrete data (Ising models) have been extensively studied, there is little work on graphical models linking both continuous and discrete variables (mixed data), which are common in many scientific applications. We propose a novel graphical model for mixed data, which is simple enough to be suitable for high-dimensional dat…

    Submitted 19 August, 2016; v1 submitted 9 April, 2013; originally announced April 2013.

  37. arXiv:1301.7047  [pdf, ps, other]

    stat.ML cs.LG cs.SI

    Link prediction for partially observed networks

    Authors: Yunpeng Zhao, Elizaveta Levina, Ji Zhu

    Abstract: Link prediction is one of the fundamental problems in network analysis. In many applications, notably in genetics, a partially observed network may not contain any negative examples of absent edges, which creates a difficulty for many existing supervised learning approaches. We develop a new method which treats the observed network as a sample of the true network with different sampling rates for…

    Submitted 29 January, 2013; originally announced January 2013.

  38. arXiv:1209.6342  [pdf, other]

    stat.ML cs.LG

    Sparse Ising Models with Covariates

    Authors: Jie Cheng, Elizaveta Levina, Pei Wang, Ji Zhu

    Abstract: There has been a lot of work fitting Ising models to multivariate binary data in order to understand the conditional dependency relationships between the variables. However, additional covariates are frequently recorded together with the binary data, and may influence the dependence relationships. Motivated by such a dataset on genomic instability collected from tumor samples of several types, we…

    Submitted 27 September, 2012; originally announced September 2012.

    Comments: 32 pages (including 5 pages of appendix), 3 figures, 2 tables

  39. arXiv:1207.2340  [pdf, ps, other]

    cs.SI cs.LG math.ST physics.soc-ph stat.ML

    Pseudo-likelihood methods for community detection in large sparse networks

    Authors: Arash A. Amini, Aiyou Chen, Peter J. Bickel, Elizaveta Levina

    Abstract: Many algorithms have been proposed for fitting network models with communities, but most of them do not scale well to large networks, and often fail on sparse networks. Here we propose a new fast pseudo-likelihood method for fitting the stochastic block model for networks, as well as a variant that allows for an arbitrary degree distribution by conditioning on degrees. We show that the algorithms…

    Submitted 5 November, 2013; v1 submitted 10 July, 2012; originally announced July 2012.

    Comments: Published at http://dx.doi.org/10.1214/13-AOS1138 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1138

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 4, 2097-2122

  40. The method of moments and degree distributions for network models

    Authors: Peter J. Bickel, Aiyou Chen, Elizaveta Levina

    Abstract: Probability models on graphs are becoming increasingly important in many applications, but statistical tools for fitting such models are not yet well developed. Here we propose a general method of moments approach that can be used to fit a large class of probability models through empirical counts of certain patterns in a graph. We establish some general asymptotic properties of empirical graph mo…

    Submitted 23 February, 2012; originally announced February 2012.

    Comments: Published at http://dx.doi.org/10.1214/11-AOS904 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS904

    Journal ref: Annals of Statistics 2011, Vol. 39, No. 5, 2280-2301

  41. arXiv:1110.3854  [pdf, ps, other]

    math.ST cs.SI physics.soc-ph

    Consistency of community detection in networks under degree-corrected stochastic block models

    Authors: Yunpeng Zhao, Elizaveta Levina, Ji Zhu

    Abstract: Community detection is a fundamental problem in network analysis, with applications in many diverse areas. The stochastic block model is a common tool for model-based community detection, and asymptotic tools for checking consistency of community detection under the block model have been recently developed. However, the block model is limited by its assumption that all nodes within a community are…

    Submitted 17 March, 2015; v1 submitted 17 October, 2011; originally announced October 2011.

    Comments: Published at http://dx.doi.org/10.1214/12-AOS1036 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org). With Corrections

    Report number: IMS-AOS-AOS1036

    Journal ref: Annals of Statistics 2012, Vol. 40, No. 4, 2266-2292

  42. arXiv:1008.1716  [pdf, ps, other]

    math.ST math.PR

    Partial estimation of covariance matrices

    Authors: Elizaveta Levina, Roman Vershynin

    Abstract: A classical approach to accurately estimating the covariance matrix Σ of a p-variate normal distribution is to draw a sample of size n > p and form a sample covariance matrix. However, many modern applications operate with much smaller sample sizes, thus calling for estimation guarantees in the regime n << p. We show that a sample of size n = O(m log^6 p) is sufficient to accurately estimate in ope…

    Submitted 11 February, 2011; v1 submitted 10 August, 2010; originally announced August 2010.

    Comments: 15 pages, to appear in PTRF. Small changes in light of comments from the referee

    MSC Class: 62H12 (primary); 60B20 (secondary)

    Journal ref: Probability Theory and Related Fields 153 (2012), 405--419

  43. arXiv:1005.3265  [pdf, ps, other]

    stat.ME physics.data-an physics.soc-ph

    Community extraction for social networks

    Authors: Yunpeng Zhao, Elizaveta Levina, Ji Zhu

    Abstract: Analysis of networks and in particular discovering communities within networks has been a focus of recent work in several fields, with applications ranging from citation and friendship networks to food webs and gene regulatory networks. Most of the existing community detection methods focus on partitioning the entire network into communities, with the expectation of many ties within communities an…

    Submitted 18 May, 2010; originally announced May 2010.

  44. arXiv:0903.0645  [pdf, ps, other

    stat.ME

    A new approach to Cholesky-based covariance regularization in high dimensions

    Authors: Adam J. Rothman, Elizaveta Levina, Ji Zhu

    Abstract: In this paper we propose a new regression interpretation of the Cholesky factor of the covariance matrix, as opposed to the well known regression interpretation of the Cholesky factor of the inverse covariance, which leads to a new class of regularized covariance estimators suitable for high-dimensional problems. Regularizing the Cholesky factor of the covariance via this regression interpretati…

    Submitted 3 March, 2009; originally announced March 2009.

    Comments: Submitted for publication on Feb. 28, 2009

  45. Covariance regularization by thresholding

    Authors: Peter J. Bickel, Elizaveta Levina

    Abstract: This paper considers regularizing a covariance matrix of $p$ variables estimated from $n$ observations, by hard thresholding. We show that the thresholded estimate is consistent in the operator norm as long as the true covariance matrix is sparse in a suitable sense, the variables are Gaussian or sub-Gaussian, and $(\log p)/n\to0$, and obtain explicit rates. The results are uniform over families…

    Submitted 20 January, 2009; originally announced January 2009.

    Comments: Published at http://dx.doi.org/10.1214/08-AOS600 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS600

    MSC Class: 62H12 (Primary) 62F12; 62G09 (Secondary)

    Journal ref: Annals of Statistics 2008, Vol. 36, No. 6, 2577-2604

  46. Sparse estimation of large covariance matrices via a nested Lasso penalty

    Authors: Elizaveta Levina, Adam Rothman, Ji Zhu

    Abstract: The paper proposes a new covariance estimator for large covariance matrices when the variables have a natural ordering. Using the Cholesky decomposition of the inverse, we impose a banded structure on the Cholesky factor, and select the bandwidth adaptively for each row of the Cholesky factor, using a novel penalty we call nested Lasso. This structure has more flexibility than regular banding, b…

    Submitted 27 March, 2008; originally announced March 2008.

    Comments: Published at http://dx.doi.org/10.1214/07-AOAS139 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS139

    Journal ref: Annals of Applied Statistics 2008, Vol. 2, No. 1, 245-263

  47. Regularized estimation of large covariance matrices

    Authors: Peter J. Bickel, Elizaveta Levina

    Abstract: This paper considers estimating a covariance matrix of $p$ variables from $n$ observations by either banding or tapering the sample covariance matrix, or estimating a banded version of the inverse of the covariance. We show that these estimates are consistent in the operator norm as long as $(\log p)/n\to0$, and obtain explicit rates. The results are uniform over some fairly natural well-conditi…

    Submitted 13 March, 2008; originally announced March 2008.

    Comments: Published at http://dx.doi.org/10.1214/009053607000000758 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0298

    MSC Class: 62H12 (Primary) 62F12; 62G09 (Secondary)

    Journal ref: Annals of Statistics 2008, Vol. 36, No. 1, 199-227

  48. Sparse permutation invariant covariance estimation

    Authors: Adam J. Rothman, Peter J. Bickel, Elizaveta Levina, Ji Zhu

    Abstract: The paper proposes a method for constructing a sparse estimator for the inverse covariance (concentration) matrix in high-dimensional settings. The estimator uses a penalized normal likelihood approach and forces sparsity by using a lasso-type penalty. We establish a rate of convergence in the Frobenius norm as both data dimension $p$ and sample size $n$ are allowed to grow, and show that the ra…

    Submitted 26 June, 2008; v1 submitted 31 January, 2008; originally announced January 2008.

    Comments: Published at http://dx.doi.org/10.1214/08-EJS176 in the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-EJS-EJS_2008_176

    MSC Class: 62H20 (Primary) 62H12 (Secondary)

    Journal ref: Electronic Journal of Statistics 2008, Vol. 2, 494-515

  49. arXiv:0709.2108  [pdf, ps, other

    physics.data-an cond-mat.stat-mech physics.soc-ph

    Robustness of community structure in networks

    Authors: Brian Karrer, Elizaveta Levina, M. E. J. Newman

    Abstract: The discovery of community structure is a common challenge in the analysis of network data. Many methods have been proposed for finding community structure, but few have been proposed for determining whether the structure found is statistically significant or whether, conversely, it could have arisen purely as a result of chance. In this paper we show that the significance of community structure…

    Submitted 13 September, 2007; originally announced September 2007.

    Comments: 10 pages, 2 figures

    Journal ref: Phys. Rev. E 77, 046119 (2008)

  50. Texture synthesis and nonparametric resampling of random fields

    Authors: Elizaveta Levina, Peter J. Bickel

    Abstract: This paper introduces a nonparametric algorithm for bootstrapping a stationary random field and proves certain consistency properties of the algorithm for the case of mixing random fields. The motivation for this paper comes from relating a heuristic texture synthesis algorithm popular in computer vision to general nonparametric bootstrapping of stationary random fields. We give a formal resampl…

    Submitted 9 November, 2006; originally announced November 2006.

    Comments: Published at http://dx.doi.org/10.1214/009053606000000588 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0179

    MSC Class: 62M40 (Primary) 62G09 (Secondary)

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 4, 1751-1773