-
Human-in-the-loop: Towards Label Embeddings for Measuring Classification Difficulty
Authors:
Katharina Hechinger,
Christoph Koller,
Xiao Xiang Zhu,
Göran Kauermann
Abstract:
Uncertainty in machine learning models is a timely and vast field of research. In supervised learning, uncertainty can already occur in the first stage of the training process, the annotation phase. This scenario is particularly evident when some instances cannot be definitively classified. In other words, there is inevitable ambiguity in the annotation step and hence, not necessarily a "ground truth" associated with each instance. The main idea of this work is to drop the assumption of a ground truth label and instead embed the annotations into a multidimensional space. This embedding is derived from the empirical distribution of annotations in a Bayesian setup, modeled via a Dirichlet-Multinomial framework. We estimate the model parameters and posteriors using a stochastic Expectation Maximization algorithm with Markov Chain Monte Carlo steps. The methods developed in this paper readily extend to various situations where multiple annotators independently label instances. To showcase the generality of the proposed approach, we apply it to three benchmark datasets for image classification and Natural Language Inference. Besides the embeddings, we can investigate the resulting correlation matrices, which reflect the semantic similarities of the original classes very well for all three exemplary datasets.
Submitted 27 May, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Categorising the World into Local Climate Zones -- Towards Quantifying Labelling Uncertainty for Machine Learning Models
Authors:
Katharina Hechinger,
Xiao Xiang Zhu,
Göran Kauermann
Abstract:
Image classification is often prone to labelling uncertainty. To generate suitable training data, images are labelled according to evaluations of human experts. This can result in ambiguities, which will affect subsequent models. In this work, we aim to model the labelling uncertainty in the context of remote sensing and the classification of satellite images. We construct a multinomial mixture model given the evaluations of multiple experts. This is based on the assumption that there is no ambiguity of the image class, but apparently in the experts' opinion about it. The model parameters can be estimated by a stochastic Expectation Maximization algorithm. Analysing the estimates gives insights into sources of label uncertainty. Here, we focus on the general class ambiguity, the heterogeneity of experts, and the origin city of the images. The results are relevant for all machine learning applications where image classification is pursued and labelling is subject to humans.
Submitted 4 September, 2023;
originally announced September 2023.
-
Estimating excess mortality in high-income countries during the COVID-19 pandemic
Authors:
Giacomo De Nicola,
Göran Kauermann
Abstract:
Quantifying the number of deaths caused by the COVID-19 crisis has been an ongoing challenge for scientists, and no gold standard to do so has yet been established. We propose a principled approach to calculate age-adjusted yearly excess mortality, and apply it to obtain estimates and uncertainty bounds for 30 countries with publicly available data. The results uncover remarkable variation in pandemic outcomes across different countries. We further compare our findings with existing estimates published in other major scientific outlets, highlighting the importance of proper age adjustment to obtain unbiased figures.
Submitted 19 December, 2023; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Sources of Uncertainty in Machine Learning -- A Statisticians' View
Authors:
Cornelia Gruber,
Patrick Oliver Schenk,
Malte Schierholz,
Frauke Kreuter,
Göran Kauermann
Abstract:
Machine Learning and Deep Learning have achieved an impressive standard today, enabling us to answer questions that were inconceivable a few years ago. Besides these successes, it becomes clear that beyond pure prediction, which is the primary strength of most supervised machine learning algorithms, the quantification of uncertainty is relevant and necessary as well. While first concepts and ideas in this direction have emerged in recent years, this paper adopts a conceptual perspective and examines possible sources of uncertainty. By adopting the viewpoint of a statistician, we discuss the concepts of aleatoric and epistemic uncertainty, which are more commonly associated with machine learning. The paper aims to formalize the two types of uncertainty and demonstrates that sources of uncertainty are miscellaneous and cannot always be decomposed into aleatoric and epistemic. Drawing parallels between statistical concepts and uncertainty in machine learning, we also demonstrate the role of data and their influence on uncertainty.
Submitted 26 May, 2023;
originally announced May 2023.
-
The Skellam Distribution revisited - Estimating the unobserved incoming and outgoing ICU COVID-19 patients on a regional level in Germany
Authors:
Martje Rave,
Göran Kauermann
Abstract:
With the beginning of the COVID-19 pandemic, we became aware of the need for comprehensive data collection and its provision to scientists and experts for proper data analyses. In Germany, the Robert Koch Institute (RKI) has tried to keep up with this demand for data on COVID-19, but there were (and still are) relevant data missing that are needed to understand the whole picture of the pandemic. In this paper, we take a closer look at the severity of the course of COVID-19 in Germany, for which ideal information would be the number of incoming patients to ICU units. This information was (and still is) not available. Instead, the current occupancy of ICU units on the district level was reported daily. We demonstrate how this information can be used to predict the number of incoming as well as released COVID-19 patients using a stochastic version of the Expectation Maximisation algorithm (SEM). This, in turn, allows for estimating the influence of district-specific and age-specific infection rates as well as further covariates, including spatial effects, on the number of incoming patients. The paper demonstrates that even if relevant data are not recorded or provided officially, statistical modelling allows for reconstructing them. This also includes the quantification of uncertainty, which naturally results from the application of the SEM algorithm.
Submitted 24 May, 2023;
originally announced May 2023.
-
Weighted high dimensional data reduction of finite Element Features -- An Application on High Pressure of an Abdominal Aortic Aneurysm
Authors:
Christoph Striegel,
Göran Kauermann,
Jonas Biehler
Abstract:
In this work we propose a low rank approximation of high fidelity finite element simulations by utilizing weights corresponding to areas of high stress levels for an abdominal aortic aneurysm, i.e. a deformed blood vessel. We focus on the von Mises stress, which corresponds to the rupture risk of the aorta. This is modeled as a Gaussian Markov random field and we define our approximation as a basis of vectors that solve a series of optimization problems. Each of these problems describes the minimization of an expected weighted quadratic loss. The weights, which encapsulate the importance of each grid point of the finite elements, can be chosen freely - either data-driven or by incorporating domain knowledge. Along with a more general discussion of mathematical properties we provide an effective numerical heuristic to compute the basis under general conditions. We explicitly explore two such bases on the surface of a high fidelity finite element grid and show their efficiency for compression. We further utilize the approach to predict the von Mises stress in areas of interest using low and high fidelity simulations. Due to the high dimension of the data we have to take extra care to keep the problem numerically feasible. This is also a major concern of this work.
Submitted 2 May, 2023;
originally announced May 2023.
-
The Politics of Language Choice: How the Russian-Ukrainian War Influences Ukrainians' Language Use on Twitter
Authors:
Daniel Racek,
Brittany I. Davidson,
Paul W. Thurner,
Xiao Xiang Zhu,
Göran Kauermann
Abstract:
The use of language is innately political and often a vehicle of cultural identity as well as the basis for nation building. Here, we examine language choice and tweeting activity of Ukrainian citizens based on more than 4 million geo-tagged tweets from over 62,000 users before and during the Russian-Ukrainian War, from January 2020 to October 2022. Using statistical models, we disentangle sample effects, arising from the in- and outflux of users on Twitter, from behavioural effects, arising from behavioural changes of the users. We observe a steady shift from the Russian language towards the Ukrainian language already before the war, which drastically speeds up with its outbreak. We attribute these shifts in large part to users' behavioural changes. Notably, we find that more than half of the Russian-tweeting users shift towards Ukrainian as a result of the war.
Submitted 6 June, 2023; v1 submitted 4 May, 2023;
originally announced May 2023.
-
Nonparametric Two-Sample Test for Networks Using Joint Graphon Estimation
Authors:
Benjamin Sischka,
Göran Kauermann
Abstract:
This paper focuses on the comparison of networks on the basis of statistical inference. For that purpose, we rely on smooth graphon models as a nonparametric modeling strategy that is able to capture complex structural patterns. The graphon itself can be viewed more broadly as a density or intensity function on networks, making the model a natural choice for comparison purposes. Extending graphon estimation towards modeling multiple networks simultaneously consequently provides substantial information about the (dis-)similarity between networks. Fitting such a joint model - which can be accomplished by applying an EM-type algorithm - provides a joint graphon estimate plus a corresponding prediction of the node positions for each network. In particular, it entails a generalized network alignment, where nearby nodes play similar structural roles in their respective domains. Given that, we construct a chi-squared test on equivalence of network structures. Simulation studies and real-world examples support the applicability of our network comparison strategy.
Submitted 28 March, 2023;
originally announced March 2023.
-
Dependence matters: Statistical models to identify the drivers of tie formation in economic networks
Authors:
Giacomo De Nicola,
Cornelius Fritz,
Marius Mehrl,
Göran Kauermann
Abstract:
Networks are ubiquitous in economic research on organizations, trade, and many other areas. However, while economic theory extensively considers networks, no general framework for their empirical modeling has yet emerged. We thus introduce two different statistical models for this purpose -- the Exponential Random Graph Model (ERGM) and the Additive and Multiplicative Effects network model (AME). Both model classes can account for network interdependencies between observations, but differ in how they do so. The ERGM allows one to explicitly specify and test the influence of particular network structures, making it a natural choice if one is substantively interested in estimating endogenous network effects. In contrast, AME captures these effects by introducing actor-specific latent variables affecting their propensity to form ties. This makes the latter a good choice if the researcher is interested in capturing the effect of exogenous covariates on tie formation without having a specific theory on the endogenous dependence structures at play. After introducing the two model classes, we showcase them through real-world applications to networks stemming from international arms trade and foreign exchange activity. We further provide full replication materials to facilitate the adoption of these methods in empirical economic research.
Submitted 7 July, 2023; v1 submitted 26 October, 2022;
originally announced October 2022.
-
COVID-19 and social media: Beyond polarization
Authors:
Giacomo De Nicola,
Victor H. Tuekam Mambou,
Göran Kauermann
Abstract:
The COVID-19 pandemic brought about a massive wave of disinformation, exacerbating polarization in the increasingly divided landscape of online discourse. In this context, popular social media users play a major role, as they have the ability to broadcast messages to large audiences and influence public opinion. In this paper, we make use of openly available data to study the behavior of popular users discussing the pandemic on Twitter. We tackle the issue from a network perspective, considering users as nodes and following relationships as directed edges. The resulting network structure is modeled by embedding the actors in a latent social space, where users closer to one another have a higher probability of following each other. The results suggest the existence of two distinct communities, which can be interpreted as "generally pro" and "generally against" vaccine mandates, corroborating existing evidence on the pervasiveness of echo chambers on the platform. By focusing on a number of notable users, such as politicians, activists, and news outlets, we further show that the two groups are not entirely homogeneous, and that not just the two poles are represented. On the contrary, the latent space captures an entire spectrum of beliefs between the two extremes, demonstrating that polarization, while present, is not the only driver of the network, and that more moderate, "central" users are key players in the discussion.
Submitted 24 July, 2023; v1 submitted 27 July, 2022;
originally announced July 2022.
-
Exponential Random Graph Models for Dynamic Signed Networks: An Application to International Relations
Authors:
Cornelius Fritz,
Marius Mehrl,
Paul W. Thurner,
Göran Kauermann
Abstract:
Substantive research in the Social Sciences regularly investigates signed networks, where edges between actors are either positive or negative. For instance, schoolchildren can be friends or rivals, just as countries can cooperate or fight each other. This research often builds on structural balance theory, one of the earliest and most prominent network theories, making signed networks one of the most frequently studied matters in social network analysis. While the theorization and description of signed networks have thus made significant progress, the inferential study of tie formation within them remains limited in the absence of appropriate statistical models. In this paper we fill this gap by proposing the Signed Exponential Random Graph Model (SERGM), extending the well-known Exponential Random Graph Model (ERGM) to networks where ties are not binary but negative or positive if a tie exists. Since most networks are dynamically evolving systems, we specify the model for both cross-sectional and dynamic networks. Based on structural hypotheses derived from structural balance theory, we formulate interpretable signed network statistics, capturing dynamics such as "the enemy of my enemy is my friend". In our empirical application, we use the SERGM to analyze cooperation and conflict between countries within the international state system.
Submitted 24 May, 2022;
originally announced May 2022.
-
Actor Heterogeneity and Explained Variance in Network Models -- A Scalable Approach through Variational Approximations
Authors:
Nadja Klein,
Göran Kauermann
Abstract:
The analysis of network data has gained considerable interest in recent years. This also includes the analysis of large, high-dimensional networks with hundreds and thousands of nodes. While exponential random graph models serve as a workhorse for network data analyses, their applicability to very large networks is problematic via classical inference such as maximum likelihood or exact Bayesian estimation owing to scaling and instability issues. The latter stem from the fact that classical network statistics consider nodes as exchangeable, i.e., actors in the network are assumed to be homogeneous. This is often questionable. One way to circumvent the restrictive assumption is to include actor-specific random effects, which account for unobservable heterogeneity. However, this increases the number of unknowns considerably, thus making the model highly parameterized. As a solution even for very large networks, we propose a scalable approach based on variational approximations, which not only leads to numerically stable estimation but is also applicable to high-dimensional directed as well as undirected networks. We furthermore demonstrate that including node-specific covariates can reduce node heterogeneity, which we facilitate through versatile prior formulations and a new measure that we call posterior explained variance. We illustrate our approach in three diverse examples, covering network data from the Italian Parliament, international arms trading, and Facebook; and conduct detailed simulation studies.
Submitted 12 September, 2023; v1 submitted 29 April, 2022;
originally announced April 2022.
-
Stochastic Block Smooth Graphon Model
Authors:
Benjamin Sischka,
Göran Kauermann
Abstract:
The paper proposes the combination of stochastic blockmodels with smooth graphon models. The former allow for partitioning the set of individuals in a network into blocks which represent groups of nodes that presumably connect stochastically equivalently, therefore often also called communities. Smooth graphon models instead assume that the network's nodes can be arranged on a one-dimensional scale such that closeness implies a similar connectivity behavior. Both models belong to the model class of node-specific latent variables, entailing a natural relationship. While these model strands have developed more or less completely independently, this paper proposes their generalization towards stochastic block smooth graphon models. This approach makes it possible to exploit the advantages of both worlds. We pursue a general EM-type algorithm for estimation and demonstrate the usability by applying the model to both simulated and real-world examples.
Submitted 24 March, 2022;
originally announced March 2022.
-
Modelling the large and dynamically growing bipartite network of German patents and inventors
Authors:
Cornelius Fritz,
Giacomo De Nicola,
Sevag Kevork,
Dietmar Harhoff,
Göran Kauermann
Abstract:
We analyse the bipartite dynamic network of inventors and patents registered within the main area of electrical engineering in Germany to explore the driving forces behind innovation. The data at hand leads to a bipartite network, where an edge between an inventor and a patent is present if the inventor is a co-owner of the respective patent. Since more than a hundred thousand patents were filed by similarly as many inventors during the observational period, this amounts to a massive bipartite network, too large to be analysed as a whole. Therefore, we decompose the bipartite network by utilising an essential characteristic of the network: most inventors tend to stay active only for a relatively short period, while new ones become active at each point in time. Consequently, the adjacency matrix carries several structural zeros. To accommodate these, we propose a bipartite variant of the Temporal Exponential Random Graph Model (TERGM) in which we let the actor set vary over time, differentiate between inventors that already submitted patents and those that did not, and account for pairwise statistics of inventors. Our results corroborate the hypotheses that inventor characteristics and knowledge flows play a crucial role in the dynamics of invention.
Submitted 20 January, 2022;
originally announced January 2022.
-
Statistical modelling of COVID-19 data: Putting Generalised Additive Models to work
Authors:
Cornelius Fritz,
Giacomo De Nicola,
Martje Rave,
Maximilian Weigert,
Yeganeh Khazaei,
Ursula Berger,
Helmut Küchenhoff,
Göran Kauermann
Abstract:
Over the course of the COVID-19 pandemic, Generalised Additive Models (GAMs) have been successfully employed on numerous occasions to obtain vital data-driven insights. In this paper we further substantiate the success story of GAMs, demonstrating their flexibility by focusing on three relevant pandemic-related issues. First, we examine the interdependency among infections in different age groups, concentrating on school children. In this context, we derive the setting under which parameter estimates are independent of the (unknown) case-detection ratio, which plays an important role in COVID-19 surveillance data. Second, we model the incidence of hospitalisations, for which data is only available with a temporal delay. We illustrate how correcting for this reporting delay through a nowcasting procedure can be naturally incorporated into the GAM framework as an offset term. Third, we propose a multinomial model for the weekly occupancy of intensive care units (ICU), where we distinguish between the number of COVID-19 patients, other patients and vacant beds. With these three examples, we aim to showcase the practical and "off-the-shelf" applicability of GAMs to gain new insights from real-world data.
Submitted 4 January, 2022;
originally announced January 2022.
-
All that Glitters is not Gold: Relational Events Models with Spurious Events
Authors:
Cornelius Fritz,
Marius Mehrl,
Paul W. Thurner,
Göran Kauermann
Abstract:
As relational event models are an increasingly popular model for studying relational structures, the reliability of large-scale event data collection becomes more and more important. Automated or human-coded events often suffer from non-negligible false-discovery rates in event identification. And most sensor data is primarily based on actors' spatial proximity for predefined time windows; hence, the observed events could relate either to a social relationship or random co-location. Both examples imply spurious events that may bias estimates and inference. We propose the Relational Event Model for Spurious Events (REMSE), an extension to existing approaches for interaction data. The model provides a flexible solution for modeling data while controlling for spurious events. Estimation of our model is carried out in an empirical Bayesian approach via data augmentation. Based on a simulation study, we investigate the properties of the estimation procedure. To demonstrate its usefulness in two distinct applications, we apply this model to combat events from the Syrian civil war and student co-location data. Results from the simulation and the applications identify the REMSE as a suitable approach to modeling relational event data in the presence of spurious events.
Submitted 24 May, 2022; v1 submitted 6 September, 2021;
originally announced September 2021.
-
On assessing excess mortality in Germany during the COVID-19 pandemic
Authors:
Giacomo De Nicola,
Göran Kauermann,
Michael Höhle
Abstract:
Coronavirus disease 2019 (COVID-19) is associated with a very high number of casualties in the general population. Assessing the exact magnitude of this number is a non-trivial problem, as relying only on officially reported COVID-19 associated fatalities runs the risk of incurring several kinds of bias. One of the ways to approach the issue is to compare overall mortality during the pandemic with expected mortality computed using the observed mortality figures of previous years. In this paper, we build on existing methodology and propose two ways to compute expected as well as excess mortality, namely at the weekly and at the yearly level. Particular focus is put on the role of age, which plays a central part in both COVID-19-associated and overall mortality. We illustrate our methods by making use of age-stratified mortality data from the years 2016 to 2020 in Germany to compute age group-specific excess mortality during the COVID-19 pandemic in 2020.
Submitted 25 June, 2021;
originally announced June 2021.
-
Statistical modeling of on-street parking lot occupancy in smart cities
Authors:
Marc Schneble,
Göran Kauermann
Abstract:
Many studies suggest that searching for parking is associated with significant direct and indirect costs. Therefore, it is appealing to reduce the time that car drivers spend on finding an available parking lot, especially in urban areas where the space for all road users is limited. The prediction of on-street parking lot occupancy can guide drivers to where free parking lots are likely to be found. This field of research has gained more and more attention in the last decade through the increasing availability of real-time parking lot occupancy data. In this paper, we pursue a statistical approach for the prediction of parking lot occupancy, where we make use of time-to-event models and semi-Markov process theory. The latter involves the use of Laplace transformations as well as their inversion, which is an ambitious numerical task. We apply our methodology to data from the City of Melbourne in Australia. Our main result is that the semi-Markov model outperforms a Markov model in terms of both true negative rate and true positive rate, which is essentially achieved by accounting for the time a parking lot has already spent in its initial state.
△ Less
Submitted 11 June, 2021;
originally announced June 2021.
-
Matrix-free Penalized Spline Smoothing with Multiple Covariates
Authors:
Julian Wagner,
Göran Kauermann,
Ralf Münnich
Abstract:
The paper motivates high-dimensional smoothing with penalized splines and its efficient numerical calculation. If smoothing is carried out over three or more covariates, the classical tensor product spline bases explode in dimension, bringing the estimation to its numerical limits. A recent approach by Siebenborn and Wagner (2019) circumvents storage-expensive implementations by proposing matrix-free calculations, which allow smoothing over several covariates. We extend their approach here by linking penalized smoothing and its Bayesian formulation as a mixed model, which provides a matrix-free calculation of the smoothing parameter and avoids computationally expensive cross-validation. Further, we show how to extend the ideas towards generalized regression models. The extended approach is applied to remote sensing satellite data in combination with spatial smoothing.
Submitted 15 January, 2021;
originally announced January 2021.
-
The Role of Governmental Weapons Procurements in Forecasting Monthly Fatalities in Intrastate Conflicts: A Semiparametric Hierarchical Hurdle Model
Authors:
Cornelius Fritz,
Marius Mehrl,
Paul W. Thurner,
Göran Kauermann
Abstract:
Accurate and interpretable forecasting models predicting spatially and temporally fine-grained changes in the numbers of intrastate conflict casualties are of crucial importance for policymakers and international non-governmental organisations (NGOs). Using a count data approach, we propose a hierarchical hurdle regression model to address the corresponding prediction challenge at the monthly PRIO-grid level. More precisely, we model the intensity of local armed conflict at a specific point in time as a three-stage process. Stages one and two of our approach estimate whether we will observe any casualties at the country- and grid-cell-level, respectively, while stage three applies a regression model for truncated data to predict the number of such fatalities conditional upon the previous two stages. Within this modelling framework, we focus on the role of governmental arms imports as a processual factor allowing governments to intensify or deter from fighting. We further argue that a grid cell's geographic remoteness is bound to moderate the effects of these military buildups. Out-of-sample predictions corroborate the effectiveness of our parsimonious and theory-driven model, which enables full transparency combined with accuracy in the forecasting process.
Submitted 15 December, 2020;
originally announced December 2020.
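The three-stage structure described in the abstract above can be sketched as a simple expected-value calculation. The sketch assumes a zero-truncated Poisson distribution for the third stage; the abstract only speaks of "a regression model for truncated data", so this distributional choice, the function names, and the inputs are illustrative assumptions.

```python
import math


def zero_truncated_poisson_mean(lam):
    """Mean of a Poisson(lam) count conditioned on being at least 1."""
    return lam / (1.0 - math.exp(-lam))


def expected_fatalities(p_country, p_cell_given_country, lam):
    """Expected fatalities in a grid cell under a three-stage hurdle structure.

    p_country: probability of any casualties in the country (stage one).
    p_cell_given_country: probability of any casualties in the grid cell,
        given that the country hurdle is passed (stage two).
    lam: rate of the zero-truncated Poisson count (stage three, assumed).
    """
    return p_country * p_cell_given_country * zero_truncated_poisson_mean(lam)
```

In a fitted model each of the three inputs would itself be the output of a regression on covariates; here they are passed in directly to keep the structure visible.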
-
On the Interplay of Regional Mobility, Social Connectedness, and the Spread of COVID-19 in Germany
Authors:
Cornelius Fritz,
Göran Kauermann
Abstract:
Since the primary mode of respiratory virus transmission is person-to-person interaction, we are required to reconsider physical interaction patterns to mitigate the number of people infected with COVID-19. While research has shown that non-pharmaceutical interventions (NPI) had an evident impact on national mobility patterns, we investigate the relative regional mobility behaviour to assess the effect of human movement on the spread of COVID-19. In particular, we explore the impact of human mobility and social connectivity derived from Facebook activities on the weekly rate of new infections in Germany between March 3rd and June 22nd, 2020. Our results confirm that reduced social activity lowers the infection rate, accounting for regional and temporal patterns. The extent of social distancing, quantified by the percentage of people staying put within a federal administrative district, has an overall negative effect on the incidence of infections. Additionally, our results show spatial infection patterns based on geographic as well as social distances.
Submitted 2 July, 2021; v1 submitted 7 August, 2020;
originally announced August 2020.
-
Regional now- and forecasting for data reported with delay: Towards surveillance of COVID-19 infections
Authors:
Giacomo De Nicola,
Marc Schneble,
Göran Kauermann,
Ursula Berger
Abstract:
Governments around the world continue to act to contain and mitigate the spread of COVID-19. The rapidly evolving situation compels officials and executives to continuously adapt policies and social distancing measures depending on the current state of the spread of the disease. In this context, it is crucial for policymakers to have a firm grasp on what the current state of the pandemic is as well as to have an idea of how the infective situation is going to unfold in the coming days. However, as in many other situations of compulsorily-notifiable diseases and beyond, cases are reported with delay to a central register, with this delay deferring an up-to-date view of the state of things. We provide a stable tool for monitoring current infection levels as well as predicting infection numbers in the immediate future at the regional level. We accomplish this through nowcasting of cases that have not yet been reported as well as through predictions of future infections. We apply our model to German data, for which our focus lies in predicting and explaining infectious behavior by district.
Submitted 18 February, 2021; v1 submitted 31 July, 2020;
originally announced July 2020.
-
Mixture Models and Networks -- Overview of Stochastic Blockmodelling
Authors:
Giacomo De Nicola,
Benjamin Sischka,
Göran Kauermann
Abstract:
Mixture models are probabilistic models aimed at uncovering and representing latent subgroups within a population. In the realm of network data analysis, the latent subgroups of nodes are typically identified by their connectivity behaviour, with nodes behaving similarly belonging to the same community. In this context, mixture modelling is pursued through stochastic blockmodelling. We consider stochastic blockmodels and some of their variants and extensions from a mixture modelling perspective. We also survey some of the main classes of estimation methods available, and propose an alternative approach. In addition to the discussion of inferential properties and estimating procedures, we focus on the application of the models to several real-world network datasets, showcasing the advantages and pitfalls of different approaches.
Submitted 26 May, 2020; v1 submitted 19 May, 2020;
originally announced May 2020.
-
Nowcasting fatal COVID-19 infections on a regional level in Germany
Authors:
Marc Schneble,
Giacomo De Nicola,
Göran Kauermann,
Ursula Berger
Abstract:
We analyse the temporal and regional structure in mortality rates related to COVID-19 infections. We relate the fatality date of each deceased patient to the corresponding day of registration of the infection, leading to a nowcasting model which allows us to estimate the number of present-day infections that will, at a later date, prove to be fatal. The numbers are broken down to the district level in Germany. Given that death counts generally provide more reliable information on the spread of the disease compared to infection counts, which inevitably depend on testing strategy and capacity, the proposed model and the presented results allow us to obtain reliable insight into the current state of the pandemic in Germany.
Submitted 21 November, 2020; v1 submitted 15 May, 2020;
originally announced May 2020.
-
Separable and Semiparametric Network-based Counting Processes applied to the International Combat Aircraft Trades
Authors:
Cornelius Fritz,
Paul W. Thurner,
Göran Kauermann
Abstract:
We propose a novel tie-oriented model for longitudinal event network data. The generating mechanism is assumed to be a multivariate Poisson process that governs the onset and repetition of yearly observed events with two separate intensity functions. We apply the model to a network obtained from the number of international deliveries of combat aircraft trades between 1950 and 2017. Based on a modified trade gravity approach, we identify economic and political factors impeding or facilitating the number of transfers. Extensive dynamics as well as country heterogeneity require the specification of semiparametric time-varying effects as well as random effects.
Submitted 18 April, 2021; v1 submitted 26 March, 2020;
originally announced March 2020.
-
Intensity Estimation on Geometric Networks with Penalized Splines
Authors:
Marc Schneble,
Göran Kauermann
Abstract:
In the past decades, the growing amount of network data has led to many novel statistical models. In this paper we consider so-called geometric networks. Typical examples are road networks or other infrastructure networks. But the neurons or the blood vessels in a human body can also be interpreted as a geometric network embedded in a three-dimensional space. In all these applications a network-specific metric rather than the Euclidean metric is usually used, which makes the analysis of network data challenging. We consider network-based point processes, and our task is to estimate the intensity (or density) of the process, which allows us to detect high- and low-intensity regions of the underlying stochastic processes. Available routines that tackle this problem are commonly based on kernel smoothing methods. However, kernel-based estimation in general exhibits some drawbacks, such as suffering from boundary effects and the locality of the smoother. In a Euclidean space, the disadvantages of kernel methods can be overcome by using penalized spline smoothing. We here extend penalized spline smoothing towards smooth intensity estimation on geometric networks and apply the approach to both simulated and real-world data. The results show that penalized spline-based intensity estimation is numerically efficient and outperforms kernel-based methods. Furthermore, our approach makes it easy to incorporate covariates, so that the network geometry can be respected in a regression model framework.
Submitted 24 February, 2020;
originally announced February 2020.
-
Estimation of Latent Network Flows in Bike-Sharing Systems
Authors:
Marc Schneble,
Göran Kauermann
Abstract:
Estimation of latent network flows is a common problem in statistical network analysis. The typical setting is that we know the margins of the network, i.e. in- and outdegrees, but the flows are unobserved. In this paper, we develop a mixed regression model to estimate network flows in a bike-sharing network if only the hourly differences of in- and outdegrees at bike stations are known. We also include exogenous covariates such as weather conditions. Two different parameterizations of the model are considered to estimate 1) the whole network flow and 2) the network margins only. The estimation of the model parameters is proposed via an iterative penalized maximum likelihood approach. This is exemplified by modeling network flows in the Vienna Bike-Sharing Network. Furthermore, a simulation study is conducted to show the performance of the model. For practical purposes it is crucial to predict when and at which station there is a lack or an excess of bikes. For this application, our model proves to be well suited, providing quite accurate predictions.
Submitted 22 January, 2020;
originally announced January 2020.
-
Iterative Estimation of Mixed Exponential Random Graph Models with Nodal Random Effects
Authors:
Sevag Kevork,
Göran Kauermann
Abstract:
The presence of unobserved node-specific heterogeneity in Exponential Random Graph Models (ERGM) is a general concern, both with respect to model validity as well as estimation instability. We therefore extend the ERGM by including node-specific random effects that account for unobserved heterogeneity in the network. This leads to a mixed model with parametric as well as random coefficients, labelled as mixed ERGM. Estimation is carried out by combining approximate penalized pseudolikelihood estimation for the random effects with maximum likelihood estimation for the remaining parameters in the model. This approach provides a stable algorithm, which allows nodal heterogeneity effects to be fitted even for large-scale networks. We also propose model selection based on the AIC to check for node-specific heterogeneity.
Submitted 23 December, 2021; v1 submitted 6 November, 2019;
originally announced November 2019.
-
In Search of Lost Edges: A Case Study on Reconstructing Financial Networks
Authors:
Michael Lebacher,
Samantha Cook,
Nadja Klein,
Göran Kauermann
Abstract:
To capture the systemic complexity of international financial systems, network data is an important prerequisite. However, dyadic data is often not available, raising the need for methods that allow for reconstructing networks based on limited information. In this paper, we are reviewing different methods that are designed for the estimation of matrices from their marginals and potentially exogenous information. This includes a general discussion of the available methodology that provides edge probabilities as well as models that are focussed on the reconstruction of edge values. Besides summarizing the advantages, shortfalls and computational issues of the approaches, we put them into a competitive comparison using the SWIFT (Society for Worldwide Interbank Financial Telecommunication) MT 103 payment messages network (MT 103: Single Customer Credit Transfer). This network is not only economically meaningful but also fully observed which allows for an extensive competitive horse race of methods. The comparison concerning the binary reconstruction is divided into an evaluation of the edge probabilities and the quality of the reconstructed degree structures. Furthermore, the accuracy of the predicted edge values is investigated. To test the methods on different topologies, the application is split into two parts. The first part considers the full MT 103 network, being an illustration for the reconstruction of large, sparse financial networks. The second part is concerned with reconstructing a subset of the full network, representing a dense medium-sized network. Regarding substantial outcomes, it can be found that no method is superior in every respect and that the preferred model choice highly depends on the goal of the analysis, the presumed network structure and the availability of exogenous information.
Submitted 4 September, 2019; v1 submitted 3 September, 2019;
originally announced September 2019.
-
A smooth dynamic network model for patent collaboration data
Authors:
Verena Bauer,
Dietmar Harhoff,
Göran Kauermann
Abstract:
The development and application of models that take the evolution of network dynamics into account are receiving increasing attention. We contribute to this field and focus on a profile likelihood approach to model time-stamped event data for a large-scale dynamic network. We investigate the collaboration of inventors using EU patent data. As an event we consider the submission of a joint patent, and we explore the driving forces for collaboration between inventors. We propose a flexible semiparametric model, which includes external and internal covariates, where the latter are built from the network history.
Submitted 3 August, 2020; v1 submitted 2 September, 2019;
originally announced September 2019.
-
Tempus Volat, Hora Fugit -- A Survey of Tie-Oriented Dynamic Network Models in Discrete and Continuous Time
Authors:
Cornelius Fritz,
Michael Lebacher,
Göran Kauermann
Abstract:
Given the growing number of available tools for modeling dynamic networks, the choice of a suitable model becomes central. The goal of this survey is to provide an overview of tie-oriented dynamic network models. The survey is focused on introducing binary network models with their corresponding assumptions, advantages, and shortfalls. The models are divided according to generating processes, operating in discrete and continuous time. First, we introduce the Temporal Exponential Random Graph Model (TERGM) and the Separable TERGM (STERGM), both being time-discrete models. These models are then contrasted with continuous process models, focusing on the Relational Event Model (REM). We additionally show how the REM can handle time-clustered observations, i.e., continuous time data observed at discrete time points. Besides the discussion of theoretical properties and fitting procedures, we specifically focus on the application of the models to two networks that represent international arms transfers and email exchange. The data allow us to demonstrate the applicability and interpretation of the network models.
Submitted 28 August, 2019; v1 submitted 23 May, 2019;
originally announced May 2019.
-
Regression-based Network Reconstruction with Nodal and Dyadic Covariates and Random Effects
Authors:
Michael Lebacher,
Göran Kauermann
Abstract:
Network (or matrix) reconstruction is a general problem which occurs if the margins of a matrix are given and the matrix entries need to be predicted. In this paper we show that the predictions obtained from the iterative proportional fitting procedure (IPFP) or equivalently maximum entropy (ME) can be obtained by restricted maximum likelihood estimation relying on augmented Lagrangian optimization. Based on the equivalence we extend the framework of network reconstruction towards regression by allowing for exogenous covariates and random heterogeneity effects. The proposed estimation approach is compared with different competing methods for network reconstruction and matrix estimation. As an example, we apply the approach to interbank lending data, provided by the Bank for International Settlements (BIS). This dataset provides full knowledge of the real network and is therefore suitable to evaluate the predictions of our approach. It is shown that the inclusion of exogenous information allows for superior predictions in terms of $L_1$ and $L_2$ errors. Additionally, the approach allows prediction intervals to be obtained via bootstrap, which can be used to quantify the uncertainty attached to the predictions.
Submitted 4 September, 2019; v1 submitted 28 March, 2019;
originally announced March 2019.
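The iterative proportional fitting procedure (IPFP) named in the abstract above can be sketched in a few lines. This is a minimal textbook version of IPFP for a matrix with known row and column sums, not the restricted maximum likelihood formulation developed in the paper; the function name `ipfp`, the seed matrix, and the toy margins are hypothetical.

```python
def ipfp(seed, row_targets, col_targets, max_iter=1000, tol=1e-10):
    """Rescale a non-negative seed matrix until its margins match the targets.

    seed: list of lists; zeros in the seed stay zero (e.g. no self-loops).
    row_targets / col_targets: desired row and column sums (equal totals).
    """
    x = [row[:] for row in seed]  # work on a copy of the seed
    for _ in range(max_iter):
        # Row step: scale each row to its target sum.
        for i, row in enumerate(x):
            s = sum(row)
            if s > 0:
                x[i] = [v * row_targets[i] / s for v in row]
        # Column step: scale each column to its target sum.
        for j, target in enumerate(col_targets):
            s = sum(row[j] for row in x)
            if s > 0:
                for row in x:
                    row[j] *= target / s
        # Columns now match exactly; stop once the rows match as well.
        err = max(abs(sum(row) - r) for row, r in zip(x, row_targets))
        if err < tol:
            break
    return x


# Hypothetical 3-node network: out-degrees (rows), in-degrees (columns),
# and a seed with zero diagonal so that no self-loops are created.
seed = [[0.0, 1.0, 1.0], [1.0, 0.0, 1.0], [1.0, 1.0, 0.0]]
flows = ipfp(seed, row_targets=[5.0, 3.0, 2.0], col_targets=[4.0, 4.0, 2.0])
```

With a uniform seed, the result corresponds to the maximum entropy solution mentioned in the abstract; replacing the seed with covariate-based values is one informal way to think about bringing in exogenous information.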
-
EM-Based Smooth Graphon Estimation Using Bayesian and Spline-Based Approaches
Authors:
Benjamin Sischka,
Göran Kauermann
Abstract:
This paper proposes the estimation of a smooth graphon model for network data analysis using principles of the EM algorithm. The approach considers both variability with respect to ordering the nodes of a network and smooth estimation of the graphon by nonparametric regression. To do so, (linear) B-splines are used, which allow for smooth estimation of the graphon, conditional on the node ordering. This provides the M-step. The true ordering of the nodes arising from the graphon model remains unobserved and Bayesian ideas are employed to obtain posterior samples given the network data. This yields the E-step. Combining both steps gives an EM-based approach for smooth graphon estimation. Unlike other common methods, this procedure does not require the restriction of a monotonic marginal function. The proposed graphon estimate allows us to explore node-ordering strategies and therefore to compare the common degree-based node ranking with the ordering conditional on the network. Variability and uncertainty are taken into account using MCMC techniques. Examples and simulation studies support the applicability of the approach.
Submitted 15 September, 2021; v1 submitted 16 March, 2019;
originally announced March 2019.
-
Censored Regression for Modelling International Small Arms Trading and its "Forensic" Use for Exploring Unreported Trades
Authors:
Michael Lebacher,
Paul W. Thurner,
Göran Kauermann
Abstract:
In this paper we use a censored regression model to investigate data on the international trade of small arms and ammunition (SAA) provided by the Norwegian Initiative on Small Arms Transfers (NISAT). Taking a network based view on the transfers, we not only rely on exogenous covariates but also estimate endogenous network effects. We apply a spatial autocorrelation (SAR) model with multiple weight matrices. The likelihood is maximized employing the Monte Carlo Expectation Maximization (MCEM) algorithm. Our approach reveals strong and stable endogenous network effects. Furthermore, we find evidence for a substantial path dependence as well as a close connection between exports of civilian and military small arms. The model is then used in a "forensic" manner to analyse latent network structures and thereby to identify countries with higher or lower tendency to export or import than reflected in the data. The approach is also validated using a simulation study.
Submitted 21 August, 2019; v1 submitted 25 February, 2019;
originally announced February 2019.
-
Exploring Dependence Structures in the International Arms Trade Network
Authors:
Michael Lebacher,
Göran Kauermann
Abstract:
In the paper we analyse dependence structures among international trade flows of major conventional weapons from 1952 to 2016. We employ a Network Disturbance Model commonly used in inferential network analysis and spatial econometrics. The dependence structure is represented by pre-defined weight matrices that allow for relating the arms trade flows from the network of international arms exchange. Several different weight matrices are compared by means of the AIC in order to select the best dependence structure. It turns out that the dependence structure among the arms trade flows is rather complex and can be represented by a specification that, simply speaking, relates each arms trade flow to all exports and imports of the sending and the receiving state. By controlling for explanatory variables we are able to show the influence of political and economic variables on the volume traded.
Submitted 8 March, 2018;
originally announced March 2018.
-
A Dynamic Separable Network Model with Actor Heterogeneity: An Application to Global Weapons Transfers
Authors:
Michael Lebacher,
Paul W. Thurner,
Göran Kauermann
Abstract:
In this paper we propose to extend the separable temporal exponential random graph model (STERGM) to account for time-varying network- and actor-specific effects. Our application case is the network of international major conventional weapons transfers, based on data from the Stockholm International Peace Research Institute (SIPRI). The application is particularly suitable since it allows us to distinguish the potentially differing driving forces for creating new trade relationships and for the endurance of existing ones. In accordance with political economy models we expect security- and network-related covariates to be most important for the formation of transfers, whereas repeated transfers should prevalently be determined by the receivers' market size and military spending. Our proposed modelling approach corroborates the hypothesis and quantifies the corresponding effects. Additionally, we subject the time-varying heterogeneity effects to a functional principal component analysis. This serves as an exploratory tool and allows us to identify countries that stand out by exceptional increases or decreases of their tendency to import and export weapons.
Submitted 4 September, 2019; v1 submitted 7 March, 2018;
originally announced March 2018.
-
Stable Exponential Random Graph Models with Non-parametric Components for Large Dense Networks
Authors:
Stephanie Thiemichen,
Göran Kauermann
Abstract:
Exponential Random Graph Models (ERGM) behave peculiar in large networks with thousand(s) of actors (nodes). Standard models containing two-star or triangle counts as statistics are often unstable leading to completely full or empty networks. Moreover, numerical methods break down which makes it complicated to apply ERGMs to large networks. In this paper we propose two strategies to circumvent the…
▽ More
Exponential Random Graph Models (ERGM) behave peculiar in large networks with thousand(s) of actors (nodes). Standard models containing two-star or triangle counts as statistics are often unstable leading to completely full or empty networks. Moreover, numerical methods break down which makes it complicated to apply ERGMs to large networks. In this paper we propose two strategies to circumvent these obstacles. First, we fit a model to a subsampled network and secondly, we show how linear statistics (like two-stars etc.) can be replaced by smooth functional components. These two steps in combination allow to fit stable models to large network data, which is illustrated by a data example including a residual analysis.
Submitted 16 April, 2016;
originally announced April 2016.
-
Bayesian Exponential Random Graph Models with Nodal Random Effects
Authors:
Stephanie Thiemichen,
Nial Friel,
Alberto Caimo,
Göran Kauermann
Abstract:
We extend the well-known and widely used Exponential Random Graph Model (ERGM) by including nodal random effects to compensate for heterogeneity in the nodes of a network. The Bayesian framework for ERGMs proposed by Caimo and Friel (2011) forms the basis of our modelling algorithm. A central question in network models is that of model selection, and following the Bayesian paradigm we focus on estimating Bayes factors. To do so, we develop an approximate but feasible calculation of the Bayes factor, which allows one to pursue model selection. Two data examples and a small simulation study illustrate our mixed model approach and the corresponding model selection.
Submitted 12 January, 2015; v1 submitted 25 July, 2014;
originally announced July 2014.
-
Mixtures of g-Priors for Generalised Additive Model Selection with Penalised Splines
Authors:
Daniel Sabanés Bové,
Leonhard Held,
Göran Kauermann
Abstract:
We propose an objective Bayesian approach to the selection of covariates and their penalised spline transformations in generalised additive models. Specifying a reasonable default prior for the model parameters and combining it with a multiplicity-correction prior for the models themselves is crucial for this task. Here we use well-studied and well-behaved continuous mixtures of g-priors as default priors. We introduce the methodology in the normal model and extend it to non-normal exponential families. A simulation study and an application from the literature illustrate the proposed approach. An efficient implementation is available in the R package "hypergsplines".
Submitted 20 August, 2012; v1 submitted 17 August, 2011;
originally announced August 2011.