Human-in-the-loop: Towards Label Embeddings for Measuring Classification Difficulty

Authors: Katharina Hechinger, Christoph Koller, Xiao Xiang Zhu, Göran Kauermann

Abstract: Uncertainty in machine learning models is a timely and vast field of research. In supervised learning, uncertainty can already occur in the first stage of the training process, the annotation phase. This scenario is particularly evident when some instances cannot be definitively classified. In other words, there is inevitable ambiguity in the annotation step and hence, not necessarily a "ground truth" associated with each instance. The main idea of this work is to drop the assumption of a ground truth label and instead embed the annotations into a multidimensional space. This embedding is derived from the empirical distribution of annotations in a Bayesian setup, modeled via a Dirichlet-Multinomial framework. We estimate the model parameters and posteriors using a stochastic Expectation Maximization algorithm with Markov Chain Monte Carlo steps. The methods developed in this paper readily extend to various situations where multiple annotators independently label instances. To showcase the generality of the proposed approach, we apply our approach to three benchmark datasets for image classification and Natural Language Inference. Besides the embeddings, we can investigate the resulting correlation matrices, which reflect the semantic similarities of the original classes very well for all three exemplary datasets.

Submitted 27 May, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

arXiv:2309.01440

doi 10.1093/jrsssc/qlad089

Categorising the World into Local Climate Zones -- Towards Quantifying Labelling Uncertainty for Machine Learning Models

Authors: Katharina Hechinger, Xiao Xiang Zhu, Göran Kauermann

Abstract: Image classification is often prone to labelling uncertainty. To generate suitable training data, images are labelled according to evaluations of human experts. This can result in ambiguities, which will affect subsequent models. In this work, we aim to model the labelling uncertainty in the context of remote sensing and the classification of satellite images. We construct a multinomial mixture model given the evaluations of multiple experts. This is based on the assumption that there is no ambiguity of the image class, but apparently in the experts' opinion about it. The model parameters can be estimated by a stochastic Expectation Maximization algorithm. Analysing the estimates gives insights into sources of label uncertainty. Here, we focus on the general class ambiguity, the heterogeneity of experts, and the origin city of the images. The results are relevant for all machine learning applications where image classification is pursued and labelling is subject to humans.

Submitted 4 September, 2023; originally announced September 2023.

arXiv:2305.19139

Estimating excess mortality in high-income countries during the COVID-19 pandemic

Authors: Giacomo De Nicola, Göran Kauermann

Abstract: Quantifying the number of deaths caused by the COVID-19 crisis has been an ongoing challenge for scientists, and no gold standard to do so has yet been established. We propose a principled approach to calculate age-adjusted yearly excess mortality, and apply it to obtain estimates and uncertainty bounds for 30 countries with publicly available data. The results uncover remarkable variation in pandemic outcomes across different countries. We further compare our findings with existing estimates published in other major scientific outlets, highlighting the importance of proper age adjustment to obtain unbiased figures.

Submitted 19 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

arXiv:2305.16703

Sources of Uncertainty in Machine Learning -- A Statisticians' View

Authors: Cornelia Gruber, Patrick Oliver Schenk, Malte Schierholz, Frauke Kreuter, Göran Kauermann

Abstract: Machine Learning and Deep Learning have achieved an impressive standard today, enabling us to answer questions that were inconceivable a few years ago. Besides these successes, it becomes clear that, beyond pure prediction, which is the primary strength of most supervised machine learning algorithms, the quantification of uncertainty is relevant and necessary as well. While first concepts and ideas in this direction have emerged in recent years, this paper adopts a conceptual perspective and examines possible sources of uncertainty. By adopting the viewpoint of a statistician, we discuss the concepts of aleatoric and epistemic uncertainty, which are more commonly associated with machine learning. The paper aims to formalize the two types of uncertainty and demonstrates that sources of uncertainty are miscellaneous and cannot always be decomposed into aleatoric and epistemic. Drawing parallels between statistical concepts and uncertainty in machine learning, we also demonstrate the role of data and their influence on uncertainty.

Submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.15301

The Skellam Distribution revisited -- Estimating the unobserved incoming and outgoing ICU COVID-19 patients on a regional level in Germany

Authors: Martje Rave, Göran Kauermann

Abstract: With the beginning of the COVID-19 pandemic, we became aware of the need for comprehensive data collection and its provision to scientists and experts for proper data analyses. In Germany, the Robert Koch Institute (RKI) has tried to keep up with this demand for data on COVID-19, but there were (and still are) relevant data missing that are needed to understand the whole picture of the pandemic. In this paper, we take a closer look at the severity of the course of COVID-19 in Germany, for which ideal information would be the number of incoming patients to ICU units. This information was (and still is) not available. Instead, the current occupancy of ICU units on the district level was reported daily. We demonstrate how this information can be used to predict the number of incoming as well as released COVID-19 patients using a stochastic version of the Expectation Maximisation algorithm (SEM). This, in turn, allows for estimating the influence of district-specific and age-specific infection rates as well as further covariates, including spatial effects, on the number of incoming patients. The paper demonstrates that even if relevant data are not recorded or provided officially, statistical modelling allows for reconstructing them. This also includes the quantification of uncertainty which naturally results from the application of the SEM algorithm.

Submitted 24 May, 2023; originally announced May 2023.

Comments: 30 pages, 10 figures

arXiv:2305.03732

Weighted high dimensional data reduction of finite Element Features -- An Application on High Pressure of an Abdominal Aortic Aneurysm

Authors: Christoph Striegel, Göran Kauermann, Jonas Biehler

Abstract: In this work we propose a low rank approximation of high fidelity finite element simulations by utilizing weights corresponding to areas of high stress levels for an abdominal aortic aneurysm, i.e. a deformed blood vessel. We focus on the von Mises stress, which corresponds to the rupture risk of the aorta. This is modeled as a Gaussian Markov random field and we define our approximation as a basis of vectors that solve a series of optimization problems. Each of these problems describes the minimization of an expected weighted quadratic loss. The weights, which encapsulate the importance of each grid point of the finite elements, can be chosen freely - either data driven or by incorporating domain knowledge. Along with a more general discussion of mathematical properties we provide an effective numerical heuristic to compute the basis under general conditions. We explicitly explore two such bases on the surface of a high fidelity finite element grid and show their efficiency for compression. We further utilize the approach to predict the von Mises stress in areas of interest using low and high fidelity simulations. Due to the high dimension of the data we have to take extra care to keep the problem numerically feasible. This is also a major concern of this work.

Submitted 2 May, 2023; originally announced May 2023.

arXiv:2305.02770

The Politics of Language Choice: How the Russian-Ukrainian War Influences Ukrainians' Language Use on Twitter

Authors: Daniel Racek, Brittany I. Davidson, Paul W. Thurner, Xiao Xiang Zhu, Göran Kauermann

Abstract: The use of language is innately political and often a vehicle of cultural identity as well as the basis for nation building. Here, we examine language choice and tweeting activity of Ukrainian citizens based on more than 4 million geo-tagged tweets from over 62,000 users before and during the Russian-Ukrainian War, from January 2020 to October 2022. Using statistical models, we disentangle sample effects, arising from the in- and outflux of users on Twitter, from behavioural effects, arising from behavioural changes of the users. We observe a steady shift from the Russian language towards the Ukrainian language already before the war, which drastically speeds up with its outbreak. We attribute these shifts in large part to users' behavioural changes. Notably, we find that more than half of the Russian-tweeting users shift towards Ukrainian as a result of the war.

Submitted 6 June, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

arXiv:2303.16014

Nonparametric Two-Sample Test for Networks Using Joint Graphon Estimation

Authors: Benjamin Sischka, Göran Kauermann

Abstract: This paper focuses on the comparison of networks on the basis of statistical inference. For that purpose, we rely on smooth graphon models as a nonparametric modeling strategy that is able to capture complex structural patterns. The graphon itself can be viewed more broadly as a density or intensity function on networks, making the model a natural choice for comparison purposes. Extending graphon estimation towards modeling multiple networks simultaneously consequently provides substantial information about the (dis-)similarity between networks. Fitting such a joint model - which can be accomplished by applying an EM-type algorithm - provides a joint graphon estimate plus a corresponding prediction of the node positions for each network. In particular, it entails a generalized network alignment, where nearby nodes play similar structural roles in their respective domains. Given that, we construct a chi-squared test on equivalence of network structures. Simulation studies and real-world examples support the applicability of our network comparison strategy.

Submitted 28 March, 2023; originally announced March 2023.

Comments: 25 pages, 6 figures

arXiv:2210.14860

Dependence matters: Statistical models to identify the drivers of tie formation in economic networks

Authors: Giacomo De Nicola, Cornelius Fritz, Marius Mehrl, Göran Kauermann

Abstract: Networks are ubiquitous in economic research on organizations, trade, and many other areas. However, while economic theory extensively considers networks, no general framework for their empirical modeling has yet emerged. We thus introduce two different statistical models for this purpose -- the Exponential Random Graph Model (ERGM) and the Additive and Multiplicative Effects network model (AME). Both model classes can account for network interdependencies between observations, but differ in how they do so. The ERGM allows one to explicitly specify and test the influence of particular network structures, making it a natural choice if one is substantively interested in estimating endogenous network effects. In contrast, AME captures these effects by introducing actor-specific latent variables affecting their propensity to form ties. This makes the latter a good choice if the researcher is interested in capturing the effect of exogenous covariates on tie formation without having a specific theory on the endogenous dependence structures at play. After introducing the two model classes, we showcase them through real-world applications to networks stemming from international arms trade and foreign exchange activity. We further provide full replication materials to facilitate the adoption of these methods in empirical economic research.

Submitted 7 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

arXiv:2207.13352

COVID-19 and social media: Beyond polarization

Authors: Giacomo De Nicola, Victor H. Tuekam Mambou, Göran Kauermann

Abstract: The COVID-19 pandemic brought about a massive wave of disinformation, exacerbating polarization in the increasingly divided landscape of online discourse. In this context, popular social media users play a major role, as they have the ability to broadcast messages to large audiences and influence public opinion. In this paper, we make use of openly available data to study the behavior of popular users discussing the pandemic on Twitter. We tackle the issue from a network perspective, considering users as nodes and following relationships as directed edges. The resulting network structure is modeled by embedding the actors in a latent social space, where users closer to one another have a higher probability of following each other. The results suggest the existence of two distinct communities, which can be interpreted as "generally pro" and "generally against" vaccine mandates, corroborating existing evidence on the pervasiveness of echo chambers on the platform. By focusing on a number of notable users, such as politicians, activists, and news outlets, we further show that the two groups are not entirely homogeneous, and that not just the two poles are represented. On the contrary, the latent space captures an entire spectrum of beliefs between the two extremes, demonstrating that polarization, while present, is not the only driver of the network, and that more moderate, "central" users are key players in the discussion.

Submitted 24 July, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

arXiv:2205.13411

Exponential Random Graph Models for Dynamic Signed Networks: An Application to International Relations

Authors: Cornelius Fritz, Marius Mehrl, Paul W. Thurner, Göran Kauermann

Abstract: Substantive research in the Social Sciences regularly investigates signed networks, where edges between actors are either positive or negative. For instance, schoolchildren can be friends or rivals, just as countries can cooperate or fight each other. This research often builds on structural balance theory, one of the earliest and most prominent network theories, making signed networks one of the most frequently studied matters in social network analysis. While the theorization and description of signed networks have thus made significant progress, the inferential study of tie formation within them remains limited in the absence of appropriate statistical models. In this paper we fill this gap by proposing the Signed Exponential Random Graph Model (SERGM), extending the well-known Exponential Random Graph Model (ERGM) to networks where ties are not binary but negative or positive if a tie exists. Since most networks are dynamically evolving systems, we specify the model for both cross-sectional and dynamic networks. Based on structural hypotheses derived from structural balance theory, we formulate interpretable signed network statistics, capturing dynamics such as "the enemy of my enemy is my friend". In our empirical application, we use the SERGM to analyze cooperation and conflict between countries within the international state system.

Submitted 24 May, 2022; originally announced May 2022.

arXiv:2204.14214

Actor Heterogeneity and Explained Variance in Network Models -- A Scalable Approach through Variational Approximations

Authors: Nadja Klein, Göran Kauermann

Abstract: The analysis of network data has gained considerable interest in recent years. This also includes the analysis of large, high-dimensional networks with hundreds and thousands of nodes. While exponential random graph models serve as the workhorse for network data analyses, their applicability to very large networks is problematic via classical inference such as maximum likelihood or exact Bayesian estimation owing to scaling and instability issues. The latter stem from the fact that classical network statistics consider nodes as exchangeable, i.e., actors in the network are assumed to be homogeneous. This is often questionable. One way to circumvent the restrictive assumption is to include actor-specific random effects, which account for unobservable heterogeneity. However, this increases the number of unknowns considerably, thus making the model highly parameterized. As a solution even for very large networks, we propose a scalable approach based on variational approximations, which not only leads to numerically stable estimation but is also applicable to high-dimensional directed as well as undirected networks. We furthermore demonstrate that including node-specific covariates can reduce node heterogeneity, which we facilitate through versatile prior formulations and a new measure that we call posterior explained variance. We illustrate our approach in three diverse examples, covering network data from the Italian Parliament, international arms trading, and Facebook; and conduct detailed simulation studies.

Submitted 12 September, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

arXiv:2203.13304

Stochastic Block Smooth Graphon Model

Authors: Benjamin Sischka, Göran Kauermann

Abstract: The paper proposes the combination of stochastic blockmodels with smooth graphon models. The former allow for partitioning the set of individuals in a network into blocks which represent groups of nodes that presumably connect stochastically equivalently, therefore often also called communities. Smooth graphon models instead assume that the network's nodes can be arranged on a one-dimensional scale such that closeness implies a similar connectivity behavior. Both models belong to the model class of node-specific latent variables, entailing a natural relationship. While these model strands have developed more or less completely independently, this paper proposes their generalization towards stochastic block smooth graphon models. This approach makes it possible to exploit the advantages of both worlds. We pursue a general EM-type algorithm for estimation and demonstrate the usability by applying the model to both simulated and real-world examples.

Submitted 24 March, 2022; originally announced March 2022.

Comments: 31 pages, 8 figures

arXiv:2201.09744

Modelling the large and dynamically growing bipartite network of German patents and inventors

Authors: Cornelius Fritz, Giacomo De Nicola, Sevag Kevork, Dietmar Harhoff, Göran Kauermann

Abstract: We analyse the bipartite dynamic network of inventors and patents registered within the main area of electrical engineering in Germany to explore the driving forces behind innovation. The data at hand leads to a bipartite network, where an edge between an inventor and a patent is present if the inventor is a co-owner of the respective patent. Since more than a hundred thousand patents were filed by similarly as many inventors during the observational period, this amounts to a massive bipartite network, too large to be analysed as a whole. Therefore, we decompose the bipartite network by utilising an essential characteristic of the network: most inventors tend to stay active only for a relatively short period, while new ones become active at each point in time. Consequently, the adjacency matrix carries several structural zeros. To accommodate these, we propose a bipartite variant of the Temporal Exponential Random Graph Model (TERGM) in which we let the actor set vary over time, differentiate between inventors that already submitted patents and those that did not, and account for pairwise statistics of inventors. Our results corroborate the hypotheses that inventor characteristics and knowledge flows play a crucial role in the dynamics of invention.

Submitted 20 January, 2022; originally announced January 2022.

arXiv:2201.02182

doi 10.1177/1471082X221124628

Statistical modelling of COVID-19 data: Putting Generalised Additive Models to work

Authors: Cornelius Fritz, Giacomo De Nicola, Martje Rave, Maximilian Weigert, Yeganeh Khazaei, Ursula Berger, Helmut Küchenhoff, Göran Kauermann

Abstract: Over the course of the COVID-19 pandemic, Generalised Additive Models (GAMs) have been successfully employed on numerous occasions to obtain vital data-driven insights. In this paper we further substantiate the success story of GAMs, demonstrating their flexibility by focusing on three relevant pandemic-related issues. First, we examine the interdependency among infections in different age groups, concentrating on school children. In this context, we derive the setting under which parameter estimates are independent of the (unknown) case-detection ratio, which plays an important role in COVID-19 surveillance data. Second, we model the incidence of hospitalisations, for which data is only available with a temporal delay. We illustrate how correcting for this reporting delay through a nowcasting procedure can be naturally incorporated into the GAM framework as an offset term. Third, we propose a multinomial model for the weekly occupancy of intensive care units (ICU), where we distinguish between the number of COVID-19 patients, other patients and vacant beds. With these three examples, we aim to showcase the practical and "off-the-shelf" applicability of GAMs to gain new insights from real-world data.

Submitted 4 January, 2022; originally announced January 2022.

arXiv:2109.10348

All that Glitters is not Gold: Relational Events Models with Spurious Events

Authors: Cornelius Fritz, Marius Mehrl, Paul W. Thurner, Göran Kauermann

Abstract: As relational event models are an increasingly popular model for studying relational structures, the reliability of large-scale event data collection becomes more and more important. Automated or human-coded events often suffer from non-negligible false-discovery rates in event identification. And most sensor data is primarily based on actors' spatial proximity for predefined time windows; hence, the observed events could relate either to a social relationship or random co-location. Both examples imply spurious events that may bias estimates and inference. We propose the Relational Event Model for Spurious Events (REMSE), an extension to existing approaches for interaction data. The model provides a flexible solution for modeling data while controlling for spurious events. Estimation of our model is carried out in an empirical Bayesian approach via data augmentation. Based on a simulation study, we investigate the properties of the estimation procedure. To demonstrate its usefulness in two distinct applications, we apply this model to combat events from the Syrian civil war and to student co-location data. Results from the simulation and the applications identify the REMSE as a suitable approach to modeling relational event data in the presence of spurious events.

Submitted 24 May, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

arXiv:2106.13827

On assessing excess mortality in Germany during the COVID-19 pandemic

Authors: Giacomo De Nicola, Göran Kauermann, Michael Höhle

Abstract: Coronavirus disease 2019 (COVID-19) is associated with a very high number of casualties in the general population. Assessing the exact magnitude of this number is a non-trivial problem, as relying only on officially reported COVID-19 associated fatalities runs the risk of incurring several kinds of bias. One of the ways to approach the issue is to compare overall mortality during the pandemic with expected mortality computed using the observed mortality figures of previous years. In this paper, we build on existing methodology and propose two ways to compute expected as well as excess mortality, namely at the weekly and at the yearly level. Particular focus is put on the role of age, which plays a central part in both COVID-19-associated and overall mortality. We illustrate our methods by making use of age-stratified mortality data from the years 2016 to 2020 in Germany to compute age group-specific excess mortality during the COVID-19 pandemic in 2020.

Submitted 25 June, 2021; originally announced June 2021.

arXiv:2106.06197

Statistical modeling of on-street parking lot occupancy in smart cities

Authors: Marc Schneble, Göran Kauermann

Abstract: Many studies suggest that searching for parking is associated with significant direct and indirect costs. Therefore, it is appealing to reduce the time which car drivers spend on finding an available parking lot, especially in urban areas where the space for all road users is limited. The prediction of on-street parking lot occupancy can provide drivers a guidance where clear parking lots are likely to be found. This field of research has gained more and more attention in the last decade through the increasing availability of real-time parking lot occupancy data. In this paper, we pursue a statistical approach for the prediction of parking lot occupancy, where we make use of time to event models and semi-Markov process theory. The latter involves the employment of Laplace transformations as well as their inversion which is an ambitious numerical task. We apply our methodology to data from the City of Melbourne in Australia. Our main result is that the semi-Markov model outperforms a Markov model in terms of both true negative rate and true positive rate while this is essentially achieved by respecting the current duration which a parking lot already sojourns in its initial state.

Submitted 11 June, 2021; originally announced June 2021.

Comments: 28 pages, 8 figures

arXiv:2101.06034 [pdf, other]

Matrix-free Penalized Spline Smoothing with Multiple Covariates

Authors: Julian Wagner, Göran Kauermann, Ralf Münnich

Abstract: The paper motivates high dimensional smoothing with penalized splines and its numerical calculation in an efficient way. If smoothing is carried out over three or more covariates the classical tensor product spline bases explode in their dimension bringing the estimation to its numerical limits. A recent approach by Siebenborn and Wagner (2019) circumvents storage expensive implementations by proposing matrix-free calculations which allow smoothing over several covariates. We extend their approach here by linking penalized smoothing and its Bayesian formulation as mixed model which provides a matrix-free calculation of the smoothing parameter to avoid the use of high-computational cross validation. Further, we show how to extend the ideas towards generalized regression models. The extended approach is applied to remote sensing satellite data in combination with spatial smoothing.

Submitted 15 January, 2021; originally announced January 2021.

arXiv:2012.08246 [pdf, other]

doi 10.1080/03050629.2022.1993210

The Role of Governmental Weapons Procurements in Forecasting Monthly Fatalities in Intrastate Conflicts: A Semiparametric Hierarchical Hurdle Model

Authors: Cornelius Fritz, Marius Mehrl, Paul W. Thurner, Göran Kauermann

Abstract: Accurate and interpretable forecasting models predicting spatially and temporally fine-grained changes in the numbers of intrastate conflict casualties are of crucial importance for policymakers and international non-governmental organisations (NGOs). Using a count data approach, we propose a hierarchical hurdle regression model to address the corresponding prediction challenge at the monthly PRIO-grid level. More precisely, we model the intensity of local armed conflict at a specific point in time as a three-stage process. Stages one and two of our approach estimate whether we will observe any casualties at the country- and grid-cell-level, respectively, while stage three applies a regression model for truncated data to predict the number of such fatalities conditional upon the previous two stages. Within this modelling framework, we focus on the role of governmental arms imports as a processual factor allowing governments to intensify or deter from fighting. We further argue that a grid cell's geographic remoteness is bound to moderate the effects of these military buildups. Out-of-sample predictions corroborate the effectiveness of our parsimonious and theory-driven model, which enables full transparency combined with accuracy in the forecasting process.

Submitted 15 December, 2020; originally announced December 2020.

arXiv:2008.03013 [pdf, other]

doi 10.1111/rssa.12753

On the Interplay of Regional Mobility, Social Connectedness, and the Spread of COVID-19 in Germany

Authors: Cornelius Fritz, Göran Kauermann

Abstract: Since the primary mode of respiratory virus transmission is person-to-person interaction, we are required to reconsider physical interaction patterns to mitigate the number of people infected with COVID-19. While research has shown that non-pharmaceutical interventions (NPI) had an evident impact on national mobility patterns, we investigate the relative regional mobility behaviour to assess the effect of human movement on the spread of COVID-19. In particular, we explore the impact of human mobility and social connectivity derived from Facebook activities on the weekly rate of new infections in Germany between March 3rd and June 22nd, 2020. Our results confirm that reduced social activity lowers the infection rate, accounting for regional and temporal patterns. The extent of social distancing, quantified by the percentage of people staying put within a federal administrative district, has an overall negative effect on the incidence of infections. Additionally, our results show spatial infection patterns based on geographic as well as social distances.

Submitted 2 July, 2021; v1 submitted 7 August, 2020; originally announced August 2020.

arXiv:2007.16058 [pdf, other]

Regional now- and forecasting for data reported with delay: Towards surveillance of COVID-19 infections

Authors: Giacomo De Nicola, Marc Schneble, Göran Kauermann, Ursula Berger

Abstract: Governments around the world continue to act to contain and mitigate the spread of COVID-19. The rapidly evolving situation compels officials and executives to continuously adapt policies and social distancing measures depending on the current state of the spread of the disease. In this context, it is crucial for policymakers to have a firm grasp on what the current state of the pandemic is as well as to have an idea of how the infective situation is going to unfold in the next days. However, as in many other situations of compulsorily-notifiable diseases and beyond, cases are reported with delay to a central register, with this delay deferring an up-to-date view of the state of things. We provide a stable tool for monitoring current infection levels as well as predicting infection numbers in the immediate future at the regional level. We accomplish this through nowcasting of cases that have not yet been reported as well as through predictions of future infections. We apply our model to German data, for which our focus lies in predicting and explaining infectious behavior by district.

Submitted 18 February, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

Comments: 5 Figures, 23 pages

arXiv:2005.09396 [pdf, other]

Mixture Models and Networks -- Overview of Stochastic Blockmodelling

Authors: Giacomo De Nicola, Benjamin Sischka, Göran Kauermann

Abstract: Mixture models are probabilistic models aimed at uncovering and representing latent subgroups within a population. In the realm of network data analysis, the latent subgroups of nodes are typically identified by their connectivity behaviour, with nodes behaving similarly belonging to the same community. In this context, mixture modelling is pursued through stochastic blockmodelling. We consider stochastic blockmodels and some of their variants and extensions from a mixture modelling perspective. We also survey some of the main classes of estimation methods available, and propose an alternative approach. In addition to the discussion of inferential properties and estimating procedures, we focus on the application of the models to several real-world network datasets, showcasing the advantages and pitfalls of different approaches.

Submitted 26 May, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

Comments: 23 pages, 5 figures

arXiv:2005.07452 [pdf, other]

doi 10.1002/bimj.202000143

Nowcasting fatal COVID-19 infections on a regional level in Germany

Authors: Marc Schneble, Giacomo De Nicola, Göran Kauermann, Ursula Berger

Abstract: We analyse the temporal and regional structure in mortality rates related to COVID-19 infections. We relate the fatality date of each deceased patient to the corresponding day of registration of the infection, leading to a nowcasting model which allows us to estimate the number of present-day infections that will, at a later date, prove to be fatal. The numbers are broken down to the district level in Germany. Given that death counts generally provide more reliable information on the spread of the disease compared to infection counts, which inevitably depend on testing strategy and capacity, the proposed model and the presented results allow to obtain reliable insight into the current state of the pandemic in Germany.

Submitted 21 November, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

Comments: 22 pages, 9 Figures

arXiv:2003.12178 [pdf, other]

doi 10.1017/nws.2021.9

Separable and Semiparametric Network-based Counting Processes applied to the International Combat Aircraft Trades

Authors: Cornelius Fritz, Paul W. Thurner, Göran Kauermann

Abstract: We propose a novel tie-oriented model for longitudinal event network data. The generating mechanism is assumed to be a multivariate Poisson process that governs the onset and repetition of yearly observed events with two separate intensity functions. We apply the model to a network obtained from the number of international deliveries of combat aircraft trades between 1950 and 2017. Based on a modified trade gravity approach we identify economic and political factors impeding or lightening the number of transfers. Extensive dynamics as well as country heterogeneity require the specification of semiparametric time-varying effects as well as random effects.

Submitted 18 April, 2021; v1 submitted 26 March, 2020; originally announced March 2020.

arXiv:2002.10270 [pdf, other]

Intensity Estimation on Geometric Networks with Penalized Splines

Authors: Marc Schneble, Göran Kauermann

Abstract: In the past decades, the growing amount of network data has led to many novel statistical models. In this paper we consider so-called geometric networks. Typical examples are road networks or other infrastructure networks. But also the neurons or the blood vessels in a human body can be interpreted as a geometric network embedded in a three-dimensional space. In all these applications a network specific metric rather than the Euclidean metric is usually used, which makes the analyses on network data challenging. We consider network based point processes and our task is to estimate the intensity (or density) of the process, which allows us to detect high- and low-intensity regions of the underlying stochastic processes. Available routines that tackle this problem are commonly based on kernel smoothing methods. However, kernel based estimation in general exhibits some drawbacks such as suffering from boundary effects and the locality of the smoother. In a Euclidean space, the disadvantages of kernel methods can be overcome by using penalized spline smoothing. We here extend penalized spline smoothing towards smooth intensity estimation on geometric networks and apply the approach to both simulated and real world data. The results show that penalized spline based intensity estimation is numerically efficient and outperforms kernel based methods. Furthermore, our approach easily allows the incorporation of covariates, which makes it possible to respect the network geometry in a regression model framework.

Submitted 24 February, 2020; originally announced February 2020.

Comments: 23 pages, 8 figures

arXiv:2001.08146 [pdf, other]

Estimation of Latent Network Flows in Bike-Sharing Systems

Authors: Marc Schneble, Göran Kauermann

Abstract: Estimation of latent network flows is a common problem in statistical network analysis. The typical setting is that we know the margins of the network, i.e. in- and outdegrees, but the flows are unobserved. In this paper, we develop a mixed regression model to estimate network flows in a bike-sharing network if only the hourly differences of in- and outdegrees at bike stations are known. We also include exogenous covariates such as weather conditions. Two different parameterizations of the model are considered to estimate 1) the whole network flow and 2) the network margins only. The estimation of the model parameters is proposed via an iterative penalized maximum likelihood approach. This is exemplified by modeling network flows in the Vienna Bike-Sharing Network. Furthermore, a simulation study is conducted to show the performance of the model. For practical purposes it is crucial to predict when and at which station there is a lack or an excess of bikes. For this application, our model shows to be well suited by providing quite accurate predictions.

Submitted 22 January, 2020; originally announced January 2020.

Comments: 27 pages, 20 figures

arXiv:1911.02397 [pdf, other]

Iterative Estimation of Mixed Exponential Random Graph Models with Nodal Random Effects

Authors: Sevag Kevork, Göran Kauermann

Abstract: The presence of unobserved node specific heterogeneity in Exponential Random Graph Models (ERGM) is a general concern, both with respect to model validity as well as estimation instability. We therefore extend the ERGM by including node specific random effects that account for unobserved heterogeneity in the network. This leads to a mixed model with parametric as well as random coefficients, labelled as mixed ERGM. Estimation is carried out by combining approximate penalized pseudolikelihood estimation for the random effects with maximum likelihood estimation for the remaining parameters in the model. This approach provides a stable algorithm, which allows to fit nodal heterogeneity effects even for large scale networks. We also propose model selection based on the AIC to check for node specific heterogeneity.

Submitted 23 December, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

Comments: 19 pages, 6 figures

arXiv:1909.01274 [pdf, other]

In Search of Lost Edges: A Case Study on Reconstructing Financial Networks

Authors: Michael Lebacher, Samantha Cook, Nadja Klein, Göran Kauermann

Abstract: To capture the systemic complexity of international financial systems, network data is an important prerequisite. However, dyadic data is often not available, raising the need for methods that allow for reconstructing networks based on limited information. In this paper, we are reviewing different methods that are designed for the estimation of matrices from their marginals and potentially exogenous information. This includes a general discussion of the available methodology that provides edge probabilities as well as models that are focussed on the reconstruction of edge values. Besides summarizing the advantages, shortfalls and computational issues of the approaches, we put them into a competitive comparison using the SWIFT (Society for Worldwide Interbank Financial Telecommunication) MT 103 payment messages network (MT 103: Single Customer Credit Transfer). This network is not only economically meaningful but also fully observed which allows for an extensive competitive horse race of methods. The comparison concerning the binary reconstruction is divided into an evaluation of the edge probabilities and the quality of the reconstructed degree structures. Furthermore, the accuracy of the predicted edge values is investigated. To test the methods on different topologies, the application is split into two parts. The first part considers the full MT 103 network, being an illustration for the reconstruction of large, sparse financial networks. The second part is concerned with reconstructing a subset of the full network, representing a dense medium-sized network. Regarding substantial outcomes, it can be found that no method is superior in every respect and that the preferred model choice highly depends on the goal of the analysis, the presumed network structure and the availability of exogenous information.

Submitted 4 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

arXiv:1909.00736 [pdf, other]

A smooth dynamic network model for patent collaboration data

Authors: Verena Bauer, Dietmar Harhoff, Göran Kauermann

Abstract: The development and application of models that take the evolution of network dynamics into account are receiving increasing attention. We contribute to this field and focus on a profile likelihood approach to model time-stamped event data for a large-scale dynamic network. We investigate the collaboration of inventors using EU patent data. As an event, we consider the submission of a joint patent and we explore the driving forces for collaboration between inventors. We propose a flexible semiparametric model, which includes external and internal covariates, where the latter are built from the network history.

Submitted 3 August, 2020; v1 submitted 2 September, 2019; originally announced September 2019.

Comments: Major change: We had a discrepancy in the implementation and the notation in the paper of the covariate vector. Further changes: Wordings and combinations of some figures

arXiv:1905.10351 [pdf, other]

doi 10.1111/stan.12198

Tempus Volat, Hora Fugit -- A Survey of Tie-Oriented Dynamic Network Models in Discrete and Continuous Time

Authors: Cornelius Fritz, Michael Lebacher, Göran Kauermann

Abstract: Given the growing number of available tools for modeling dynamic networks, the choice of a suitable model becomes central. The goal of this survey is to provide an overview of tie-oriented dynamic network models. The survey is focused on introducing binary network models with their corresponding assumptions, advantages, and shortfalls. The models are divided according to generating processes, operating in discrete and continuous time. First, we introduce the Temporal Exponential Random Graph Model (TERGM) and the Separable TERGM (STERGM), both being time-discrete models. These models are then contrasted with continuous process models, focusing on the Relational Event Model (REM). We additionally show how the REM can handle time-clustered observations, i.e., continuous time data observed at discrete time points. Besides the discussion of theoretical properties and fitting procedures, we specifically focus on the application of the models on two networks that represent international arms transfers and email exchange. The data allow to demonstrate the applicability and interpretation of the network models.

Submitted 28 August, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

arXiv:1903.11886 [pdf, other]

Regression-based Network Reconstruction with Nodal and Dyadic Covariates and Random Effects

Authors: Michael Lebacher, Göran Kauermann

Abstract: Network (or matrix) reconstruction is a general problem which occurs if the margins of a matrix are given and the matrix entries need to be predicted. In this paper we show that the predictions obtained from the iterative proportional fitting procedure (IPFP) or equivalently maximum entropy (ME) can be obtained by restricted maximum likelihood estimation relying on augmented Lagrangian optimization. Based on the equivalence we extend the framework of network reconstruction towards regression by allowing for exogenous covariates and random heterogeneity effects. The proposed estimation approach is compared with different competing methods for network reconstruction and matrix estimation. As an example, we apply the approach to interbank lending data, provided by the Bank for International Settlements (BIS). This dataset provides full knowledge of the real network and is therefore suitable to evaluate the predictions of our approach. It is shown that the inclusion of exogenous information allows for superior predictions in terms of $L_1$ and $L_2$ errors. Additionally, the approach allows to obtain prediction intervals via bootstrap that can be used to quantify the uncertainty attached to the predictions.

Submitted 4 September, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

arXiv:1903.06936 [pdf, other]

doi 10.1016/j.socnet.2021.08.007

EM-Based Smooth Graphon Estimation Using Bayesian and Spline-Based Approaches

Authors: Benjamin Sischka, Göran Kauermann

Abstract: This paper proposes the estimation of a smooth graphon model for network data analysis using principles of the EM algorithm. The approach considers both variability with respect to ordering the nodes of a network and smooth estimation of the graphon by nonparametric regression. To do so, (linear) B-splines are used, which allow for smooth estimation of the graphon, conditional on the node ordering. This provides the M-step. The true ordering of the nodes arising from the graphon model remains unobserved and Bayesian ideas are employed to obtain posterior samples given the network data. This yields the E-step. Combining both steps gives an EM-based approach for smooth graphon estimation. Unlike common other methods, this procedure does not require the restriction of a monotonic marginal function. The proposed graphon estimate allows to explore node-ordering strategies and therefore to compare the common degree-based node ranking with the ordering conditional on the network. Variability and uncertainty are taken into account using MCMC techniques. Examples and simulation studies support the applicability of the approach.

Submitted 15 September, 2021; v1 submitted 16 March, 2019; originally announced March 2019.

Comments: 27 pages, 10 figures

Journal ref: Social Networks 68 (2022) 279-295

arXiv:1902.09292 [pdf, other]

Censored Regression for Modelling International Small Arms Trading and its "Forensic" Use for Exploring Unreported Trades

Authors: Michael Lebacher, Paul W. Thurner, Göran Kauermann

Abstract: In this paper we use a censored regression model to investigate data on the international trade of small arms and ammunition (SAA) provided by the Norwegian Initiative on Small Arms Transfers (NISAT). Taking a network based view on the transfers, we not only rely on exogenous covariates but also estimate endogenous network effects. We apply a spatial autocorrelation (SAR) model with multiple weight matrices. The likelihood is maximized employing the Monte Carlo Expectation Maximization (MCEM) algorithm. Our approach reveals strong and stable endogenous network effects. Furthermore, we find evidence for a substantial path dependence as well as a close connection between exports of civilian and military small arms. The model is then used in a "forensic" manner to analyse latent network structures and thereby to identify countries with higher or lower tendency to export or import than reflected in the data. The approach is also validated using a simulation study.

Submitted 21 August, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

arXiv:1803.03536 [pdf, other]

Exploring Dependence Structures in the International Arms Trade Network

Authors: Michael Lebacher, Göran Kauermann

Abstract: In the paper we analyse dependence structures among international trade flows of major conventional weapons from 1952 to 2016. We employ a Network Disturbance Model commonly used in inferential network analysis and spatial econometrics. The dependence structure is represented by pre-defined weight matrices that allow for relating the arms trade flows from the network of international arms exchange. Several different weight matrices are compared by means of the AIC in order to select the best dependence structure. It turns out that the dependence structure among the arms trade flows is rather complex and can be represented by a specification that, simply speaking, relates each arms trade flow to all exports and imports of the sending and the receiving state. By controlling for explanatory variables we are able to show the influence of political and economic variables on the volume traded.

Submitted 8 March, 2018; originally announced March 2018.

Comments: arXiv admin note: text overlap with arXiv:1803.02707

arXiv:1803.02707 [pdf, other]

A Dynamic Separable Network Model with Actor Heterogeneity: An Application to Global Weapons Transfers

Authors: Michael Lebacher, Paul W. Thurner, Göran Kauermann

Abstract: In this paper we propose to extend the separable temporal exponential random graph model (STERGM) to account for time-varying network- and actor-specific effects. Our application case is the network of international major conventional weapons transfers, based on data from the Stockholm International Peace Research Institute (SIPRI). The application is particularly suitable since it allows to distinguish the potentially differing driving forces for creating new trade relationships and for the endurance of existing ones. In accordance with political economy models we expect security- and network-related covariates to be most important for the formation of transfers, whereas repeated transfers should prevalently be determined by the receivers' market size and military spending. Our proposed modelling approach corroborates the hypothesis and quantifies the corresponding effects. Additionally, we subject the time-varying heterogeneity effects to a functional principal component analysis. This serves as exploratory tool and allows to identify countries that stand out by exceptional increases or decreases of their tendency to import and export weapons.

Submitted 4 September, 2019; v1 submitted 7 March, 2018; originally announced March 2018.

arXiv:1604.04732 [pdf, other]

Stable Exponential Random Graph Models with Non-parametric Components for Large Dense Networks

Authors: Stephanie Thiemichen, Göran Kauermann

Abstract: Exponential Random Graph Models (ERGM) behave peculiar in large networks with thousand(s) of actors (nodes). Standard models containing two-star or triangle counts as statistics are often unstable leading to completely full or empty networks. Moreover, numerical methods break down which makes it complicated to apply ERGMs to large networks. In this paper we propose two strategies to circumvent these obstacles. First, we fit a model to a subsampled network and secondly, we show how linear statistics (like two-stars etc.) can be replaced by smooth functional components. These two steps in combination allow to fit stable models to large network data, which is illustrated by a data example including a residual analysis.

Submitted 16 April, 2016; originally announced April 2016.

Comments: 26 pages, 10 figures, 3 tables

arXiv:1407.6895 [pdf, other]

Bayesian Exponential Random Graph Models with Nodal Random Effects

Authors: Stephanie Thiemichen, Nial Friel, Alberto Caimo, Göran Kauermann

Abstract: We extend the well-known and widely used Exponential Random Graph Model (ERGM) by including nodal random effects to compensate for heterogeneity in the nodes of a network. The Bayesian framework for ERGMs proposed by Caimo and Friel (2011) yields the basis of our modelling algorithm. A central question in network models is the question of model selection and following the Bayesian paradigm we focus on estimating Bayes factors. To do so we develop an approximate but feasible calculation of the Bayes factor which allows one to pursue model selection. Two data examples and a small simulation study illustrate our mixed model approach and the corresponding model selection.

Submitted 12 January, 2015; v1 submitted 25 July, 2014; originally announced July 2014.

Comments: 23 pages, 9 figures, 3 tables

arXiv:1108.3520 [pdf, ps, other]

Mixtures of g-Priors for Generalised Additive Model Selection with Penalised Splines

Authors: Daniel Sabanés Bové, Leonhard Held, Göran Kauermann

Abstract: We propose an objective Bayesian approach to the selection of covariates and their penalised splines transformations in generalised additive models. Specification of a reasonable default prior for the model parameters and combination with a multiplicity-correction prior for the models themselves is crucial for this task. Here we use well-studied and well-behaved continuous mixtures of g-priors as default priors. We introduce the methodology in the normal model and extend it to non-normal exponential families. A simulation study and an application from the literature illustrate the proposed approach. An efficient implementation is available in the R-package "hypergsplines".

Submitted 20 August, 2012; v1 submitted 17 August, 2011; originally announced August 2011.

Comments: 34 pages, 2 figures, 5 tables

Showing 1–39 of 39 results for author: Kauermann, G