-
ALAAMEE: Open-source software for fitting autologistic actor attribute models
Authors:
Alex Stivala,
Peng Wang,
Alessandro Lomi
Abstract:
The autologistic actor attribute model (ALAAM) is a model for social influence, derived from the more widely known exponential-family random graph model (ERGM). ALAAMs can be used to estimate parameters corresponding to multiple forms of social contagion associated with network structure and actor covariates. This work introduces ALAAMEE, open-source Python software for estimation, simulation, and…
▽ More
The autologistic actor attribute model (ALAAM) is a model for social influence, derived from the more widely known exponential-family random graph model (ERGM). ALAAMs can be used to estimate parameters corresponding to multiple forms of social contagion associated with network structure and actor covariates. This work introduces ALAAMEE, open-source Python software for estimation, simulation, and goodness-of-fit testing for ALAAM models. ALAAMEE implements both the stochastic approximation and equilibrium expectation (EE) algorithms for ALAAM parameter estimation, including estimation from snowball sampled network data. It implements data structures and statistics for undirected, directed, and bipartite networks. We use a simulation study to assess the accuracy of the EE algorithm for ALAAM parameter estimation and statistical inference, and demonstrate the use of ALAAMEE with empirical examples using both small (fewer than 100 nodes) and large (more than 10 000 nodes) networks.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
New network models facilitate analysis of biological networks
Authors:
Alex Stivala
Abstract:
Exponential-family random graph models (ERGMs) are a family of network models originating in social network analysis, which have also been applied to biological networks. Advances in estimation algorithms have increased the practical scope of these models to larger networks, however it is still not always possible to estimate a model without encountering problems of model near-degeneracy, particul…
▽ More
Exponential-family random graph models (ERGMs) are a family of network models originating in social network analysis, which have also been applied to biological networks. Advances in estimation algorithms have increased the practical scope of these models to larger networks, however it is still not always possible to estimate a model without encountering problems of model near-degeneracy, particularly if it is desired to use only simple model parameters, rather than more complex parameters designed to overcome the problem of near-degeneracy. Two new network models related to the ERGM, the Tapered ERGM, and the latent order logistic (LOLOG) model, have recently been proposed to overcome this problem. In this work I illustrate the application of the Tapered ERGM and the LOLOG to a set of biological networks, including protein-protein interaction (PPI) networks, gene regulatory networks, and neural networks. I find that the Tapered ERGM and the LOLOG are able to estimate models for networks for which it was not possible to estimate a conventional ERGM, and are able to do so using only simple model parameters. In the case of two neural networks where data on the spatial position of neurons is available, this allows the estimation of models including terms for spatial distance and triangle structures, allowing triangle motif statistical significance to be estimated while accounting for the effect of spatial proximity on connection probability. For some larger networks, however, Tapered ERGM and LOLOG estimation was not possible in practical time, while conventional ERGM models were able to be estimated only by using the Equilibrium Expectation (EE) algorithm.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Overcoming near-degeneracy in the autologistic actor attribute model
Authors:
Alex Stivala
Abstract:
The autologistic actor attribute model, or ALAAM, is the social influence counterpart of the better-known exponential-family random graph model (ERGM) for social selection. Extensive experience with ERGMs has shown that the problem of near-degeneracy which often occurs with simple models can be overcome by using "geometrically weighted" or "alternating" statistics. In the much more limited empiric…
▽ More
The autologistic actor attribute model, or ALAAM, is the social influence counterpart of the better-known exponential-family random graph model (ERGM) for social selection. Extensive experience with ERGMs has shown that the problem of near-degeneracy which often occurs with simple models can be overcome by using "geometrically weighted" or "alternating" statistics. In the much more limited empirical applications of ALAAMs to date, the problem of near-degeneracy, although theoretically expected, appears to have been less of an issue. In this work I present a comprehensive survey of ALAAM applications, showing that this model has to date only been used with relatively small networks, in which near-degeneracy does not appear to be a problem. I show near-degeneracy does occur in simple ALAAM models of larger empirical networks, define some geometrically weighted ALAAM statistics analogous to those for ERGM, and demonstrate that models with these statistics do not suffer from near-degeneracy and hence can be estimated where they could not be with the simple statistics.
△ Less
Submitted 21 September, 2023; v1 submitted 13 September, 2023;
originally announced September 2023.
-
Geodesic cycle length distributions in fictional character networks
Authors:
Alex Stivala
Abstract:
A geodesic cycle in a graph is a cycle with no shortcuts, so that the shortest path between any two nodes in the cycle is the path along the cycle itself. A recently published paper used random graph models to investigate the geodesic cycle length distributions of a unique set of delusional social networks, first examined in an earlier work, as well as some other publicly available social networks…
▽ More
A geodesic cycle in a graph is a cycle with no shortcuts, so that the shortest path between any two nodes in the cycle is the path along the cycle itself. A recently published paper used random graph models to investigate the geodesic cycle length distributions of a unique set of delusional social networks, first examined in an earlier work, as well as some other publicly available social networks. Here I test the hypothesis, suggested in the former work, that fictional character networks, and in particular those from works by a single author, might have geodesic cycle length distributions which are extremely unlikely under random graph models, as the delusional social networks do. The results do not show any support for this hypothesis. In addition, the recently published work is reproduced using a method for counting geodesic cycles exactly, rather than the approximate method used originally. The substantive conclusions of that work are unchanged, but some differences in the results for particular networks are described.
△ Less
Submitted 21 March, 2023;
originally announced March 2023.
-
Selecting and scheduling an optimal subset of road network upgrades
Authors:
A. D. Stivala,
P. J. Stuckey,
M. G. Wallace
Abstract:
We consider the problem of choosing a subset of proposed road network upgrades to implement within a fixed budget in order to optimize the benefit in terms of vehicle hours travelled (VHT), and show how to render the solution of this problem more tractable by reducing the number of traffic assignment problems that must be solved. This technique is extended to develop a heuristic algorithm for find…
▽ More
We consider the problem of choosing a subset of proposed road network upgrades to implement within a fixed budget in order to optimize the benefit in terms of vehicle hours travelled (VHT), and show how to render the solution of this problem more tractable by reducing the number of traffic assignment problems that must be solved. This technique is extended to develop a heuristic algorithm for finding a schedule of road upgrades over a planning period that maximizes the bet present value of the resulting VHT reductions.
△ Less
Submitted 20 April, 2022;
originally announced April 2022.
-
Using Sampled Network Data With The Autologistic Actor Attribute Model
Authors:
Alex D. Stivala,
H. Colin Gallagher,
David A. Rolls,
Peng Wang,
Garry L. Robins
Abstract:
Social science research increasingly benefits from statistical methods for understanding the structured nature of social life, including for social network data. However, the application of statistical network models within large-scale community research is hindered by too little understanding of the validity of their inferences under realistic data collection conditions, including sampled or miss…
▽ More
Social science research increasingly benefits from statistical methods for understanding the structured nature of social life, including for social network data. However, the application of statistical network models within large-scale community research is hindered by too little understanding of the validity of their inferences under realistic data collection conditions, including sampled or missing network data. The autologistic actor attribute model (ALAAM) is a statistical model based on the well-established exponential random graph model (ERGM) for social networks. ALAAMs can be regarded as a social influence model, predicting an individual-level outcome based on the actor's network ties, concurrent outcomes of his/her network partners, and attributes of the actor and his/her network partners. In particular, an ALAAM can be used to measure contagion effects, that is, the propensity of two actors connected by a social network tie to both have the same value of an attribute. We investigate the effect of using simple random samples and snowball samples of network data on ALAAM parameter inference, and find that parameter inference can still work well even with a nontrivial fraction of missing nodes. However it is safer to take a snowball sample of the network and estimate conditional on the snowball sampling structure.
△ Less
Submitted 3 February, 2020; v1 submitted 30 January, 2020;
originally announced February 2020.
-
Testing biological network motif significance with exponential random graph models
Authors:
Alex Stivala,
Alessandro Lomi
Abstract:
Analysis of the structure of biological networks often uses statistical tests to establish the over-representation of motifs, which are thought to be important building blocks of such networks, related to their biological functions. However, there is disagreement as to the statistical significance of these motifs, and there are potential problems with standard methods for estimating this significa…
▽ More
Analysis of the structure of biological networks often uses statistical tests to establish the over-representation of motifs, which are thought to be important building blocks of such networks, related to their biological functions. However, there is disagreement as to the statistical significance of these motifs, and there are potential problems with standard methods for estimating this significance. Exponential random graph models (ERGMs) are a class of statistical model that can overcome some of the shortcomings of commonly used methods for testing the statistical significance of motifs. ERGMs were first introduced into the bioinformatics literature over ten years ago but have had limited application to biological networks, possibly due to the practical difficulty of estimating model parameters. Advances in estimation algorithms now afford analysis of much larger networks in practical time. We illustrate the application of ERGM to both an undirected protein-protein interaction (PPI) network and directed gene regulatory networks. ERGM models indicate over-representation of triangles in the PPI network, and confirm results from previous research as to over-representation of transitive triangles (feed-forward loop) in an E. coli and a yeast regulatory network. We also confirm, using ERGMs, previous research showing that under-representation of the cyclic triangle (feedback loop) can be explained as a consequence of other topological features.
△ Less
Submitted 8 November, 2021; v1 submitted 29 January, 2020;
originally announced January 2020.
-
Exponential random graph model parameter estimation for very large directed networks
Authors:
Alex Stivala,
Garry Robins,
Alessandro Lomi
Abstract:
Exponential random graph models (ERGMs) are widely used for modeling social networks observed at one point in time. However the computational difficulty of ERGM parameter estimation has limited the practical application of this class of models to relatively small networks, up to a few thousand nodes at most, with usually only a few hundred nodes or fewer. In the case of undirected networks, snowba…
▽ More
Exponential random graph models (ERGMs) are widely used for modeling social networks observed at one point in time. However the computational difficulty of ERGM parameter estimation has limited the practical application of this class of models to relatively small networks, up to a few thousand nodes at most, with usually only a few hundred nodes or fewer. In the case of undirected networks, snowball sampling can be used to find ERGM parameter estimates of larger networks via network samples, and recently published improvements in ERGM network distribution sampling and ERGM estimation algorithms have allowed ERGM parameter estimates of undirected networks with over one hundred thousand nodes to be made. However the implementations of these algorithms to date have been limited in their scalability, and also restricted to undirected networks. Here we describe an implementation of the recently published Equilibrium Expectation (EE) algorithm for ERGM parameter estimation of large directed networks. We test it on some simulated networks, and demonstrate its application to an online social network with over 1.6 million nodes.
△ Less
Submitted 20 November, 2019; v1 submitted 16 April, 2019;
originally announced April 2019.
-
Fast Maximum Likelihood estimation via Equilibrium Expectation for Large Network Data
Authors:
Maksym Byshkin,
Alex Stivala,
Antonietta Mira,
Garry Robins,
Alessandro Lomi
Abstract:
A major line of contemporary research on complex networks is based on the development of statistical models that specify the local motifs associated with macro-structural properties observed in actual networks. This statistical approach becomes increasingly problematic as network size increases. In the context of current research on efficient estimation of models for large network data sets, we pr…
▽ More
A major line of contemporary research on complex networks is based on the development of statistical models that specify the local motifs associated with macro-structural properties observed in actual networks. This statistical approach becomes increasingly problematic as network size increases. In the context of current research on efficient estimation of models for large network data sets, we propose a fast algorithm for maximum likelihood estimation (MLE) that afords a signifcant increase in the size of networks amenable to direct empirical analysis. The algorithm we propose in this paper relies on properties of Markov chains at equilibrium, and for this reason it is called equilibrium expectation (EE). We demonstrate the performance of the EE algorithm in the context of exponential random graphmodels (ERGMs) a family of statistical models commonly used in empirical research based on network data observed at a single period in time. Thus far, the lack of efcient computational strategies has limited the empirical scope of ERGMs to relatively small networks with a few thousand nodes. The approach we propose allows a dramatic increase in the size of networks that may be analyzed using ERGMs. This is illustrated in an analysis of several biological networks and one social network with 104,103 nodes
△ Less
Submitted 1 August, 2018; v1 submitted 28 February, 2018;
originally announced February 2018.
-
Another phase transition in the Axelrod model
Authors:
Alex Stivala,
Paul Keeler
Abstract:
Axelrod's model of cultural dissemination, despite its apparent simplicity, demonstrates complex behavior that has been of much interest in statistical physics. Despite the many variations and extensions of the model that have been investigated, a systematic investigation of the effects of changing the size of the neighborhood on the lattice in which interactions can occur has not been made. Here…
▽ More
Axelrod's model of cultural dissemination, despite its apparent simplicity, demonstrates complex behavior that has been of much interest in statistical physics. Despite the many variations and extensions of the model that have been investigated, a systematic investigation of the effects of changing the size of the neighborhood on the lattice in which interactions can occur has not been made. Here we investigate the effect of varying the radius R of the von Neumann neighborhood in which agents can interact. We show, in addition to the well-known phase transition at the critical value of q, the number of traits, another phase transition at a critical value of R, and draw a q -- R phase diagram for the Axelrod model on a square lattice. In addition, we present a mean-field approximation of the model in which behavior on an infinite lattice can be analyzed.
△ Less
Submitted 8 December, 2016;
originally announced December 2016.