Skip to main content

Showing 1–25 of 25 results for author: Kolaczyk, E D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.15654  [pdf, other

    math.ST stat.ME

    Autoregressive Networks with Dependent Edges

    Authors: **yuan Chang, Qin Fang, Eric D. Kolaczyk, Peter W. MacDonald, Qiwei Yao

    Abstract: We propose an autoregressive framework for modelling dynamic networks with dependent edges. It encompasses the models which accommodate, for example, transitivity, density-dependent and other stylized features often observed in real network data. By assuming the edges of network at each time are independent conditionally on their lagged values, the models, which exhibit a close connection with tem… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 27 pages, 2 tables, 4 figures

  2. arXiv:2403.07124  [pdf, other

    stat.ME cs.SI

    Stochastic gradient descent-based inference for dynamic network models with attractors

    Authors: Hancong Pan, Xiao**g Zhu, Cantay Caliskan, Dino P. Christenson, Konstantinos Spiliopoulos, Dylan Walker, Eric D. Kolaczyk

    Abstract: In Coevolving Latent Space Networks with Attractors (CLSNA) models, nodes in a latent space represent social actors, and edges indicate their dynamic interactions. Attractors are added at the latent level to capture the notion of attractive and repulsive forces between nodes, borrowing from dynamical systems theory. However, CLSNA reliance on MCMC estimation makes scaling difficult, and the requir… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  3. arXiv:2308.00836  [pdf, other

    stat.ME cs.CR

    Differentially Private Linear Regression with Linked Data

    Authors: Shurong Lin, Elliot Paquette, Eric D. Kolaczyk

    Abstract: There has been increasing demand for establishing privacy-preserving methodologies for modern statistics and machine learning. Differential privacy, a mathematical notion from computer science, is a rising tool offering robust privacy guarantees. Recent work focuses primarily on develo** differentially private versions of individual statistical and machine learning tasks, with nontrivial upstrea… ▽ More

    Submitted 7 May, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

    MSC Class: 68P27; 62-XX ACM Class: G.3; I.0

  4. arXiv:2301.08324  [pdf, other

    stat.ME cs.CR

    Differentially Private Confidence Intervals for Proportions under Stratified Random Sampling

    Authors: Shurong Lin, Mark Bun, Marco Gaboardi, Eric D. Kolaczyk, Adam Smith

    Abstract: Confidence intervals are a fundamental tool for quantifying the uncertainty of parameters of interest. With the increase of data privacy awareness, develo** a private version of confidence intervals has gained growing attention from both statisticians and computer scientists. Differential privacy is a state-of-the-art framework for analyzing privacy loss when releasing statistics computed from s… ▽ More

    Submitted 11 April, 2024; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: 39 pages, 4 figures

    MSC Class: 68P27; 62G15; 62Dxx

    Journal ref: Electronic Journal of Statistics, Electron. J. Statist. 18(1), 1455-1494, (2024)

  5. arXiv:2202.10513  [pdf, other

    stat.ME

    Quantifying Uncertainty for Temporal Motif Estimation in Graph Streams under Sampling

    Authors: Xiao**g Zhu, Eric D. Kolaczyk

    Abstract: Dynamic networks, a.k.a. graph streams, consist of a set of vertices and a collection of timestamped interaction events (i.e., temporal edges) between vertices. Temporal motifs are defined as classes of (small) isomorphic induced subgraphs on graph streams, considering both edge ordering and duration. As with motifs in static networks, temporal motifs are the fundamental building blocks for tempor… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

  6. arXiv:2112.10151  [pdf, ps, other

    math.ST stat.ME

    Edge differentially private estimation in the $β$-model via jittering and method of moments

    Authors: **yuan Chang, Qiao Hu, Eric D. Kolaczyk, Qiwei Yao, Fengting Yi

    Abstract: A standing challenge in data privacy is the trade-off between the level of privacy and the efficiency of statistical inference. Here we conduct an in-depth study of this trade-off for parameter estimation in the $β$-model (Chatterjee, Diaconis and Sly, 2011) for edge differentially private network data released via jittering (Karwa, Krivitsky and Slavković, 2017). Unlike most previous approaches b… ▽ More

    Submitted 2 April, 2024; v1 submitted 19 December, 2021; originally announced December 2021.

    Journal ref: Annals of Statistics 2024, Vol. 52, pp. 708-728

  7. arXiv:2109.13129  [pdf, other

    stat.AP stat.ME

    Disentangling positive and negative partisanship in social media interactions using a coevolving latent space network with attractors model

    Authors: Xiao**g Zhu, Cantay Caliskan, Dino P. Christenson, Konstantinos Spiliopoulos, Dylan Walker, Eric D. Kolaczyk

    Abstract: We develop a broadly applicable class of coevolving latent space network with attractors (CLSNA) models, where nodes represent individual social actors assumed to lie in an unknown latent space, edges represent the presence of a specified interaction between actors, and attractors are added in the latent level to capture the notion of attractive and repulsive forces. We apply the CLSNA models to u… ▽ More

    Submitted 13 August, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: revised version

  8. arXiv:2105.04518  [pdf, other

    stat.ME

    Causal Inference under Network Interference with Noise

    Authors: Wenrui Li, Daniel L. Sussman, Eric D. Kolaczyk

    Abstract: Increasingly, there is a marked interest in estimating causal effects under network interference due to the fact that interference manifests naturally in networked experiments. However, network information generally is available only up to some level of error. We study the propagation of such errors to estimators of average causal effects under network interference. Specifically, assuming a four-l… ▽ More

    Submitted 31 August, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 68 pages, 1 figure

  9. Network Recovery from Unlabeled Noisy Samples

    Authors: Nathaniel Josephs, Wenrui Li, Eric D. Kolaczyk

    Abstract: There is a growing literature on the statistical analysis of multiple networks in which the network is the fundamental data object. However, most of this work requires networks on a shared set of labeled vertices. In this work, we consider the question of recovering a parent network based on noisy unlabeled samples. We identify a specific regime in the noisy network literature for recovery that is… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

  10. arXiv:2102.11948  [pdf, other

    stat.AP stat.ME

    Inferring the Type of Phase Transitions Undergone in Epileptic Seizures Using Random Graph Hidden Markov Models for Percolation in Noisy Dynamic Networks

    Authors: Xiao**g Zhu, Heather Shappell, Mark A. Kramer, Catherine J. Chu, Eric D. Kolaczyk

    Abstract: In clinical neuroscience, epileptic seizures have been associated with the sudden emergence of coupled activity across the brain. The resulting functional networks - in which edges indicate strong enough coupling between brain regions - are consistent with the notion of percolation, which is a phenomenon in complex networks corresponding to the sudden emergence of a giant connected component. Trad… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

  11. arXiv:2011.12416  [pdf, other

    stat.ME math.ST

    A spectral-based framework for hypothesis testing in populations of networks

    Authors: Li Chen, Nathaniel Josephs, Lizhen Lin, Jie Zhou, Eric D. Kolaczyk

    Abstract: In this paper, we propose a new spectral-based approach to hypothesis testing for populations of networks. The primary goal is to develop a test to determine whether two given samples of networks come from the same random model or distribution. Our test statistic is based on the trace of the third order for a centered and scaled adjacency matrix, which we prove converges to the standard normal dis… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  12. Sensor-based localization of epidemic sources on human mobility networks

    Authors: Jun Li, Juliane Manitz, Enrico Bertuzzo, Eric D. Kolaczyk

    Abstract: We investigate the source detection problem in epidemiology, which is one of the most important issues for control of epidemics. Mathematically, we reformulate the problem as one of identifying the relevant component in a multivariate Gaussian mixture model. Focusing on the study of cholera and diseases with similar modes of transmission, we calibrate the parameters of our mixture model using huma… ▽ More

    Submitted 30 October, 2020; originally announced November 2020.

  13. arXiv:2004.04765  [pdf, other

    stat.AP

    Bayesian classification, anomaly detection, and survival analysis using network inputs with application to the microbiome

    Authors: Nathaniel Josephs, Lizhen Lin, Steven Rosenberg, Eric D. Kolaczyk

    Abstract: While the study of a single network is well-established, technological advances now allow for the collection of multiple networks with relative ease. Increasingly, anywhere from several to thousands of networks can be created from brain imaging, gene co-expression data, or microbiome measurements. And these networks, in turn, are being looked to as potentially powerful features to be used in model… ▽ More

    Submitted 13 January, 2021; v1 submitted 9 April, 2020; originally announced April 2020.

  14. arXiv:2002.05763  [pdf, other

    stat.ME

    Estimation of the Epidemic Branching Factor in Noisy Contact Networks

    Authors: Wenrui Li, Daniel L. Sussman, Eric D. Kolaczyk

    Abstract: Many fundamental concepts in network-based epidemic modeling depend on the branching factor, which captures a sense of dispersion in the network connectivity and quantifies the rate of spreading across the network. Moreover, contact network information generally is available only up to some level of error. We study the propagation of such errors to the estimation of the branching factor. Specifica… ▽ More

    Submitted 12 October, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: 44 pages, 4 figures

  15. Estimation of subgraph density in noisy networks

    Authors: **yuan Chang, Eric D. Kolaczyk, Qiwei Yao

    Abstract: While it is common practice in applied network analysis to report various standard network summary statistics, these numbers are rarely accompanied by uncertainty quantification. Yet any error inherent in the measurements underlying the construction of the network, or in the network construction procedure itself, necessarily must propagate to any summary statistics reported. Here we study the prob… ▽ More

    Submitted 30 June, 2020; v1 submitted 6 March, 2018; originally announced March 2018.

    Journal ref: Journal of the American Statistical Association 2022, Vol. 117, No. 537, 361-374

  16. arXiv:1712.08586  [pdf, ps, other

    stat.ME

    Dynamic Networks with Multi-scale Temporal Structure

    Authors: Xinyu Kang, Apratim Ganguly, Eric D. Kolaczyk

    Abstract: We describe a novel method for modeling non-stationary multivariate time series, with time-varying conditional dependencies represented through dynamic networks. Our proposed approach combines traditional multi-scale modeling and network based neighborhood selection, aiming at capturing temporally local structure in the data while maintaining sparsity of the potential interactions. Our multi-scale… ▽ More

    Submitted 22 December, 2017; originally announced December 2017.

  17. arXiv:1510.03959  [pdf, other

    stat.ME

    Detection of multiple perturbations in multi-omics biological networks

    Authors: Paula J. Griffin, W. Evan Johnson, Eric D. Kolaczyk

    Abstract: Cellular mechanism-of-action is of fundamental concern in many biological studies. It is of particular interest for identifying the cause of disease and learning the way in which treatments act against disease. However, pinpointing such mechanisms is difficult, due to the fact that small perturbations to the cell can have wide-ranging downstream effects. Given a snapshot of cellular activity, it c… ▽ More

    Submitted 2 October, 2016; v1 submitted 13 October, 2015; originally announced October 2015.

    Comments: Submitted to Biometrics

  18. arXiv:1409.0503  [pdf, other

    stat.AP q-bio.MN

    Perturbation Detection Through Modeling of Gene Expression on a Latent Biological Pathway Network: A Bayesian hierarchical approach

    Authors: Lisa M. Pham, Luis Carvalho, Scott Schaus, Eric D. Kolaczyk

    Abstract: Cellular response to a perturbation is the result of a dynamic system of biological variables linked in a complex network. A major challenge in drug and disease studies is identifying the key factors of a biological network that are essential in determining the cell's fate. Here our goal is the identification of perturbed pathways from high-throughput gene expression data. We develop a three-lev… ▽ More

    Submitted 1 September, 2014; originally announced September 2014.

  19. arXiv:1407.5525  [pdf, other

    stat.AP q-bio.NC stat.ME

    Hypothesis Testing For Network Data in Functional Neuroimaging

    Authors: Cedric E. Ginestet, Jun Li, Prakash Balachandran, Steven Rosenberg, Eric D. Kolaczyk

    Abstract: In recent years, it has become common practice in neuroscience to use networks to summarize relational information in a set of measurements, typically assumed to be reflective of either functional or structural relationships between regions of interest in the brain. One of the most basic tasks of interest in the analysis of such data is the testing of hypotheses, in answer to questions such as "Is… ▽ More

    Submitted 17 March, 2017; v1 submitted 21 July, 2014; originally announced July 2014.

    Comments: 34 pages. 5 figures

  20. Percolation under Noise: Detecting Explosive Percolation Using the Second Largest Component

    Authors: Wes Viles, Cedric E. Ginestet, Ariana Tang, Mark A. Kramer, Eric D. Kolaczyk

    Abstract: We consider the problem of distinguishing classical (Erdős-Rényi) percolation from explosive (Achlioptas) percolation, under noise. A statistical model of percolation is constructed allowing for the birth and death of edges as well as the presence of noise in the observations. This graph-valued stochastic process is composed of a latent and an observed non-stationary process, where the observed gr… ▽ More

    Submitted 15 January, 2014; originally announced January 2014.

    Comments: 9 pages and 8 figures. Submitted to Physics Review, Series E

    Journal ref: Phys. Rev. E 93, 052301 (2016)

  21. Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks

    Authors: Yaonan Zhang, Eric D. Kolaczyk, Bruce D. Spencer

    Abstract: Networks are a popular tool for representing elements in a system and their interconnectedness. Many observed networks can be viewed as only samples of some true underlying network. Such is frequently the case, for example, in the monitoring and study of massive, online social networks. We study the problem of how to estimate the degree distribution - an object of fundamental interest - of a true… ▽ More

    Submitted 28 May, 2015; v1 submitted 21 May, 2013; originally announced May 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS800 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS800

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 166-199

  22. arXiv:1112.0840  [pdf, ps, other

    math.ST stat.ME

    On the Question of Effective Sample Size in Network Modeling: An Asymptotic Inquiry

    Authors: Pavel N. Krivitsky, Eric D. Kolaczyk

    Abstract: The modeling and analysis of networks and network data has seen an explosion of interest in recent years and represents an exciting direction for potential growth in statistics. Despite the already substantial amount of work done in this area to date by researchers from various disciplines, however, there remain many questions of a decidedly foundational nature - natural analogues of standard ques… ▽ More

    Submitted 5 August, 2015; v1 submitted 5 December, 2011; originally announced December 2011.

    Comments: Published at http://dx.doi.org/10.1214/14-STS502 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS502

    Journal ref: Statistical Science 2015, Vol. 30, No. 2, 184-198

  23. arXiv:1109.4408  [pdf, other

    stat.ME

    A Compressed PCA Subspace Method for Anomaly Detection in High-Dimensional Data

    Authors: Qi Ding, Eric D. Kolaczyk

    Abstract: Random projection is widely used as a method of dimension reduction. In recent years, its combination with standard techniques of regression and classification has been explored. Here we examine its use with principal component analysis (PCA) and subspace detection methods. Specifically, we show that, under appropriate conditions, with high probability the magnitude of the residuals of a PCA analy… ▽ More

    Submitted 11 April, 2012; v1 submitted 20 September, 2011; originally announced September 2011.

  24. arXiv:1109.3160  [pdf, ps, other

    stat.AP cs.SI physics.soc-ph q-bio.MN

    Inference and Characterization of Multi-Attribute Networks with Application to Computational Biology

    Authors: Natallia Katenka, Eric D. Kolaczyk

    Abstract: Our work is motivated by and illustrated with application of association networks in computational biology, specifically in the context of gene/protein regulatory networks. Association networks represent systems of interacting elements, where a link between two different elements indicates a sufficient level of similarity between element attributes. While in reality relational ties between element… ▽ More

    Submitted 27 April, 2012; v1 submitted 14 September, 2011; originally announced September 2011.

    Comments: Updated bibliography references

  25. arXiv:0902.3714  [pdf, ps, other

    stat.ME stat.AP

    Target Detection via Network Filtering

    Authors: Shu Yang, Eric D. Kolaczyk

    Abstract: A method of `network filtering' has been proposed recently to detect the effects of certain external perturbations on the interacting members in a network. However, with large networks, the goal of detection seems a priori difficult to achieve, especially since the number of observations available often is much smaller than the number of variables describing the effects of the underlying network… ▽ More

    Submitted 27 January, 2010; v1 submitted 20 February, 2009; originally announced February 2009.