-
Rejoinder to Discussion of "A Tale of Two Datasets: Representativeness and Generalisability of Inference for Samples of Networks"
Abstract: This rejoinder responds to discussions by Caimo, Niezink, and Schweinberger and Fritz of "A Tale of Two Datasets: Representativeness and Generalisability of Inference for Samples of Networks" by Krivitsky, Coletti, and Hens, all published in the Journal of the American Statistical Association in 2023.
Submitted 10 December, 2023; originally announced December 2023.
Comments: 10 pages, 3 figures, 3 tables
Journal ref: Journal of the American Statistical Association, 118(544), 2235-2238 (2023)
-
Modeling Tie Duration in ERGM-Based Dynamic Network Models
Abstract: Krivitsky and Handcock (2014) proposed a Separable Temporal ERGM (STERGM) framework for modeling social networks, which facilitates separable modeling of the tie duration distributions and the structural dynamics of tie formation. In this note, we explore the hazard structures achievable in this framework, with first- and higher-order Markov assumptions, and propose ways to model a variety of dura…
Submitted 14 March, 2022; originally announced March 2022.
Comments: Reposting of Penn State University Department of Statistics Technical Report 12-02 (April 2012), lost in a web site migration; 14 pages, 3 figures, 1 table. arXiv admin note: text overlap with arXiv:2203.06866
-
ergm 4: Computational Improvements
Abstract: The ergm package supports the statistical analysis and simulation of network data. It anchors the statnet suite of packages for network analysis in R introduced in a special issue in Journal of Statistical Software in 2008. This article provides an overview of the performance improvements in the 2021 release of ergm version 4. These include performance enhancements to the Markov chain Monte Carlo…
Submitted 15 March, 2022; originally announced March 2022.
Comments: Computational improvements discussion originally in arXiv:2106.04997v1, extracted into its own preprint; 23 pages, 2 figures, 3 tables
-
Modeling of Dynamic Networks based on Egocentric Data with Durational Information
Abstract: Modeling of dynamic networks -- networks that evolve over time -- has manifold applications in many fields. In epidemiology in particular, there is a need for data-driven modeling of human sexual relationship networks for the purpose of modeling and simulation of the spread of sexually transmitted disease. Dynamic network data about such networks are extremely difficult to collect, however, and mu…
Submitted 14 March, 2022; originally announced March 2022.
Comments: Reposting of Penn State University Department of Statistics Technical Report 12-01 (April 2012), lost in a web site migration; 36 pages, 4 figures, 1 table
-
A Tale of Two Datasets: Representativeness and Generalisability of Inference for Samples of Networks
Abstract: The last two decades have seen considerable progress in foundational aspects of statistical network analysis, but the path from theory to application is not straightforward. Two large, heterogeneous samples of small networks of within-household contacts in Belgium were collected using two different but complementary sampling designs: one smaller but with all contacts in each household observed, th…
Submitted 18 July, 2023; v1 submitted 8 February, 2022; originally announced February 2022.
Comments: 101 pages (3 front matter, 26 body, 72 appendix), 35 figures (4 body, 31 appendix), 66 tables (1 body, 61 appendix)
Journal ref: Journal of the American Statistical Association, 118(544), 2213-2224 (2023)
-
Likelihood-based Inference for Exponential-Family Random Graph Models via Linear Programming
Abstract: This article discusses the problem of determining whether a given point, or set of points, lies within the convex hull of another set of points in $d$ dimensions. This problem arises naturally in a statistical context when using a particular approximation to the loglikelihood function for an exponential family model; in particular, we discuss the application to network models here. While the conve…
Submitted 7 February, 2022; originally announced February 2022.
Comments: 26 pages, 4 figures, 1 table
Journal ref: Electronic Journal of Statistics, 17(2): 3337-3356 (2023)
-
ergm 4: New features
Abstract: The ergm package supports the statistical analysis and simulation of network data. It anchors the statnet suite of packages for network analysis in R introduced in a special issue in Journal of Statistical Software in 2008. This article provides an overview of the new functionality in the 2021 release of ergm version 4. These include more flexible handling of nodal covariates, term operators that…
Submitted 15 March, 2022; v1 submitted 9 June, 2021; originally announced June 2021.
Comments: Computational improvements discussion in the previous version was split out into another preprint; 30 pages, 2 figures
Journal ref: Journal of Statistical Software, 105(1), 1-44 (2023)
-
Revisiting Bayesian Autoencoders with MCMC
Abstract: Autoencoders gained popularity in the deep learning revolution given their ability to compress data and provide dimensionality reduction. Although prominent deep learning methods have been used to enhance autoencoders, the need to provide robust uncertainty quantification remains a challenge. This has been addressed with variational autoencoders so far. Bayesian inference via Markov Chain Monte Ca…
Submitted 28 April, 2022; v1 submitted 12 April, 2021; originally announced April 2021.
Journal ref: R. Chandra, M. Jain, M. Maharana and P. N. Krivitsky, "Revisiting Bayesian Autoencoders With MCMC," in IEEE Access, vol. 10, pp. 40482-40495, 2022, doi: 10.1109/ACCESS.2022.3163270
-
Exponential-Family Models of Random Graphs: Inference in Finite-, Super-, and Infinite Population Scenarios
Abstract: Exponential-family Random Graph Models (ERGMs) constitute a large statistical framework for modeling sparse and dense random graphs, short- and long-tailed degree distributions, covariates, and a wide range of complex dependencies. Special cases of ERGMs are generalized linear models (GLMs), Bernoulli random graphs, $β$-models, $p_1$-models, and models related to Markov random fields in spatial st…
Submitted 12 September, 2019; v1 submitted 15 July, 2017; originally announced July 2017.
Journal ref: Statistical Science 35(4): 627 - 662 (2020)
-
Sharing Social Network Data: Differentially Private Estimation of Exponential-Family Random Graph Models
Abstract: Motivated by a real-life problem of sharing social network data that contain sensitive personal information, we propose a novel approach to release and analyze synthetic graphs in order to protect privacy of individual relationships captured by the social network while maintaining the validity of statistical results. A case study using a version of the Enron e-mail corpus dataset demonstrates the…
Submitted 23 September, 2016; v1 submitted 9 November, 2015; originally announced November 2015.
Comments: Updated, 39 pages
-
arXiv:1507.08401
Capturing Multivariate Spatial Dependence: Model, Estimate and then Predict
Abstract: Physical processes rarely occur in isolation, rather they influence and interact with one another. Thus, there is great benefit in modeling potential dependence between both spatial locations and different processes. It is the interaction between these two dependencies that is the focus of Genton and Kleiber's paper under discussion. We see the problem of ensuring that any multivariate spatial cov…
Submitted 30 July, 2015; originally announced July 2015.
Comments: Published at http://dx.doi.org/10.1214/15-STS517 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-STS-STS517
Journal ref: Statistical Science 2015, Vol. 30, No. 2, 170-175
-
Exponential-Family Random Graph Models for Rank-Order Relational Data
Abstract: Rank-order relational data, in which each actor ranks the others according to some criterion, often arise from sociometric measurements of judgment (e.g., self-reported interpersonal interaction) or preference (e.g., relative liking). We propose a class of exponential-family models for rank-order relational data and derive a new class of sufficient statistics for such data, which assume no more th…
Submitted 20 June, 2015; v1 submitted 1 October, 2012; originally announced October 2012.
Comments: 50 pages, 6 figures, 4 tables, 1 algorithm. The paper has been expanded and clarified, and some terms were changed
MSC Class: 91D30 (Primary) 90B15; 62M05; 60J20 (Secondary) ACM Class: G.3
Journal ref: Krivitsky, P. N. & Butts, C. T. (2017) Exponential-Family Random Graph Models for Rank-Order Relational Data. Sociological Methodology, 47(1): 68-112
-
arXiv:1112.0840
On the Question of Effective Sample Size in Network Modeling: An Asymptotic Inquiry
Abstract: The modeling and analysis of networks and network data has seen an explosion of interest in recent years and represents an exciting direction for potential growth in statistics. Despite the already substantial amount of work done in this area to date by researchers from various disciplines, however, there remain many questions of a decidedly foundational nature - natural analogues of standard ques…
Submitted 5 August, 2015; v1 submitted 5 December, 2011; originally announced December 2011.
Comments: Published at http://dx.doi.org/10.1214/14-STS502 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-STS-STS502
Journal ref: Statistical Science 2015, Vol. 30, No. 2, 184-198
-
Exponential-Family Random Graph Models for Valued Networks
Abstract: Exponential-family random graph models (ERGMs) provide a principled and flexible way to model and simulate features common in social networks, such as propensities for homophily, mutuality, and friend-of-a-friend triad closure, through choice of model terms (sufficient statistics). However, those ERGMs modeling the more complex features have, to date, been limited to binary data: presence or absen…
Submitted 19 January, 2012; v1 submitted 7 January, 2011; originally announced January 2011.
Comments: 42 pages, including 2 appendixes (3 pages total), 5 figures, 2 tables, 1 algorithm listing; a substantial revision and reorganization: major changes include focus shifted to counts in particular, sections added on modeling actor heterogeneity, a subsection on degeneracy, another example, and an appendix on non-steepness of the CMP distribution
MSC Class: 91D30 (Primary) 60B05 (Secondary) ACM Class: G.3
Journal ref: Electron. J. Statist. 6 (2012) 1100-1128
-
A Separable Model for Dynamic Networks
Abstract: Models of dynamic networks --- networks that evolve over time --- have manifold applications. We develop a discrete-time generative model for social network evolution that inherits the richness and flexibility of the class of exponential-family random graph models. The model --- a Separable Temporal ERGM (STERGM) --- facilitates separable modeling of the tie duration distributions and the structur…
Submitted 19 August, 2012; v1 submitted 8 November, 2010; originally announced November 2010.
Comments: 28 pages (including a 4-page appendix); a substantial rewrite, with many corrections, changes in terminology, and a different analysis for the example
MSC Class: 91D30 (Primary); 62M05; 60J20 (Secondary) ACM Class: G.3
Journal ref: Journal of the Royal Statistical Society, Series B 76(1) (2014) 29-46
-
Adjusting for Network Size and Composition Effects in Exponential-Family Random Graph Models
Abstract: Exponential-family random graph models (ERGMs) provide a principled way to model and simulate features common in human social networks, such as propensities for homophily and friend-of-a-friend triad closure. We show that, without adjustment, ERGMs preserve density as network size increases. Density invariance is often not appropriate for social networks. We suggest a simple modification based on…
Submitted 27 December, 2010; v1 submitted 29 April, 2010; originally announced April 2010.
Comments: 37 pages, 2 figures, 5 tables; notation revised and clarified, some sections (particularly 4.3 and 5) made more rigorous, some derivations moved into the appendix, typos fixed, some wording changed
MSC Class: 91D30 (Primary) 62D; 62F12; 62F40; 62P25; 62M40 (Secondary)
Journal ref: Statistical Methodology 8 (2011) 319-339