-
A Heteroscedastic Bayesian Generalized Logistic Regression Model with Application to Scaling Problems
Authors:
Jack Sutton,
Golnaz Shahtahmassebi,
Quentin S. Hanley,
Haroldo V. Ribeiro
Abstract:
Power law scaling models have been used to understand the complexity of systems as diverse as cities, neurological activity, and rainfall and lightning. In the scaling framework, power laws and standard linear regression methods are widely used to estimate model parameters with assumed normality and fixed variance. Generalized linear models (GLMs) can accommodate a wider range of distributions, but the chosen distribution must match the characteristics of the data to prevent model bias. We present a widely applicable Bayesian generalized logistic regression (BGLR) framework to more flexibly model a continuous real response while addressing skew and heteroscedasticity. The Generalized Logistic Distribution (GLD) was selected to flexibly model skewed continuous data. This resulted in a nonlinear posterior distribution that may not have an analytical solution but can be sampled numerically with Markov Chain Monte Carlo (MCMC) methods. We compared the BGLR model to standard and Bayesian normal models having fixed and varying variance when fitting power laws to 759 days of COVID-19 data. The BGLR yielded information beyond existing methods about the evolution of skew and skedasticity while revealing parameter bias of widely used methods. The BGLR flexibly modelled the complex characteristics necessary for an improved understanding of the propagation and dynamics of this infectious disease. The model is generally applicable and can be used as a template for modeling complexity with other distributions.
Submitted 25 January, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
Deep Learning Criminal Networks
Authors:
Haroldo V. Ribeiro,
Diego D. Lopes,
Arthur A. B. Pessa,
Alvaro F. Martins,
Bruno R. da Cunha,
Sebastian Goncalves,
Ervin K. Lenzi,
Quentin S. Hanley,
Matjaz Perc
Abstract:
Recent advances in deep learning methods have enabled researchers to develop and apply algorithms for the analysis and modeling of complex networks. These advances have sparked a surge of interest at the interface between network science and machine learning. Despite this, the use of machine learning methods to investigate criminal networks remains surprisingly scarce. Here, we explore the potential of graph convolutional networks to learn patterns among networked criminals and to predict various properties of criminal networks. Using empirical data from political corruption, criminal police intelligence, and criminal financial networks, we develop a series of deep learning models based on the GraphSAGE framework that are able to recover missing criminal partnerships, distinguish among types of associations, predict the amount of money exchanged among criminal agents, and even anticipate partnerships and recidivism of criminals during the growth dynamics of corruption networks, all with impressive accuracy. Our deep learning models significantly outperform previous shallow learning approaches and produce high-quality embeddings for node and edge properties. Moreover, these models inherit all the advantages of the GraphSAGE framework, including the generalization to unseen nodes and scaling up to large graph structures.
Submitted 4 June, 2023; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Machine Learning Partners in Criminal Networks
Authors:
Diego D. Lopes,
Bruno R. da Cunha,
Alvaro F. Martins,
Sebastian Goncalves,
Ervin K. Lenzi,
Quentin S. Hanley,
Matjaz Perc,
Haroldo V. Ribeiro
Abstract:
Recent research has shown that criminal networks have complex organizational structures, but whether this can be used to predict static and dynamic properties of criminal networks remains little explored. Here, by combining graph representation learning and machine learning methods, we show that structural properties of political corruption, police intelligence, and money laundering networks can be used to recover missing criminal partnerships, distinguish among different types of criminal and legal associations, as well as predict the total amount of money exchanged among criminal agents, all with outstanding accuracy. We also show that our approach can anticipate future criminal associations during the dynamic growth of corruption networks with significant accuracy. Thus, similar to evidence found at crime scenes, we conclude that structural patterns of criminal networks carry crucial information about illegal activities, which allows machine learning methods to predict missing information and even anticipate future criminal behavior.
Submitted 7 September, 2022;
originally announced September 2022.
-
Universality of political corruption networks
Authors:
Alvaro F. Martins,
Bruno R. da Cunha,
Quentin S. Hanley,
Sebastian Goncalves,
Matjaz Perc,
Haroldo V. Ribeiro
Abstract:
Corruption crimes demand highly coordinated actions among criminal agents to succeed. But research dedicated to corruption networks is still in its infancy and indeed little is known about the properties of these networks. Here we present a comprehensive investigation of corruption networks related to political scandals in Spain and Brazil over nearly three decades. We show that corruption networks of both countries share universal structural and dynamical properties, including similar degree distributions, clustering and assortativity coefficients, modular structure, and a growth process that is marked by the coalescence of network components due to a few recidivist criminals. We propose a simple model that not only reproduces these empirical properties but reveals also that corruption networks operate near a critical recidivism rate below which the network is entirely fragmented and above which it is overly connected. Our research thus indicates that actions focused on decreasing corruption recidivism may substantially mitigate this type of organized crime.
Submitted 11 April, 2022;
originally announced April 2022.
-
Population Density and Spreading of COVID-19 in England and Wales
Authors:
Jack Sutton,
Golnaz Shahtahmassebi,
Haroldo V. Ribeiro,
Quentin S. Hanley
Abstract:
We investigated daily COVID-19 cases and deaths in the 337 lower tier local authority regions in England and Wales to better understand how the disease propagated over a 15-month period. Population density scaling models revealed residual variance and skewness to be sensitive indicators of the dynamics of propagation. Lockdowns and schools reopening triggered increased variance indicative of outbreaks with local impact and country scale heterogeneity. University reopening and December holidays triggered reduced variance indicative of country scale homogenisation which reached a minimum in mid-January 2021. Homogeneous propagation was associated with better correspondence with normally distributed residuals while heterogeneous propagation was more consistent with skewed models. Skewness varied from strongly negative to strongly positive revealing an unappreciated feature of community propagation. Hot spots and super-spreading events are well understood descriptors of regional disease dynamics that would be expected to be associated with positively skewed distributions. Positively skewed behaviour was observed; however, negative skewness indicative of cold-spots and super-isolation dominated for approximately 8 months during the period of study. In contrast, death metrics showed near constant behaviour in scaling, variance, and skewness metrics over the full period with rural regions preferentially affected, an observation consistent with regional age demographics in England and Wales. Regional positions relative to density scaling laws were remarkably persistent after the first 5-9 days of the available data set. The determinants of this persistent behaviour probably precede the pandemic and remain unchanged.
Submitted 9 July, 2021; v1 submitted 25 February, 2021;
originally announced February 2021.
-
City size and the spreading of COVID-19 in Brazil
Authors:
Haroldo V. Ribeiro,
Andre S. Sunahara,
Jack Sutton,
Matjaz Perc,
Quentin S. Hanley
Abstract:
The current outbreak of the coronavirus disease 2019 (COVID-19) is an unprecedented example of how fast an infectious disease can spread around the globe (especially in urban areas) and the enormous impact it causes on public health and socio-economic activities. Despite the recent surge of investigations about different aspects of the COVID-19 pandemic, we still know little about the effects of city size on the propagation of this disease in urban areas. Here we investigate how the number of cases and deaths by COVID-19 scale with the population of Brazilian cities. Our results indicate small towns are proportionally more affected by COVID-19 during the initial spread of the disease, such that the cumulative numbers of cases and deaths per capita initially decrease with population size. However, during the long-term course of the pandemic, this urban advantage vanishes and large cities start to exhibit higher incidence of cases and deaths, such that every 1% rise in population is associated with a 0.14% increase in the number of fatalities per capita after about four months since the first two daily deaths. We argue that these patterns may be related to the existence of proportionally more health infrastructure in the largest cities and a lower proportion of older adults in large urban areas. We also find the initial growth rate of cases and deaths to be higher in large cities; however, these growth rates tend to decrease in large cities and to increase in small ones over time.
Submitted 25 August, 2020; v1 submitted 28 May, 2020;
originally announced May 2020.
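The elasticity reading in the abstract above (a 1% rise in population associated with a 0.14% rise in fatalities per capita) is what a power law with exponent 0.14 for the per capita quantity implies. A minimal Python sketch of that arithmetic, assuming a pure power law y ∝ N^0.14; the function name `relative_change` is hypothetical and is not taken from the paper:

```python
def relative_change(pop_ratio, exponent=0.14):
    """Relative change in a per capita quantity y ~ N**exponent
    when the population N is multiplied by pop_ratio.

    Assumption: a pure power law with the exponent quoted in the abstract.
    """
    return pop_ratio ** exponent - 1.0


one_percent = relative_change(1.01)  # population larger by 1%
doubling = relative_change(2.0)      # population doubled

print(f"1% more people: {100 * one_percent:.3f}% more fatalities per capita")
print(f"2x more people: {100 * doubling:.1f}% more fatalities per capita")
```

Under the same assumption, doubling the population corresponds to roughly a 10% increase in fatalities per capita, which shows how a small exponent compounds across the wide range of city sizes.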
-
The hidden traits of endemic illiteracy in cities
Authors:
Luiz G. A. Alves,
Jose S. Andrade Jr.,
Quentin S. Hanley,
Haroldo V. Ribeiro
Abstract:
In spite of the considerable progress towards reducing illiteracy rates, many countries, including developed ones, have encountered difficulty achieving further reduction in these rates. This is worrying because illiteracy has been related to numerous health, social, and economic problems. Here, we show that the spatial patterns of illiteracy in urban systems have several features analogous to the spread of diseases such as dengue and obesity. Our results reveal that illiteracy rates are spatially long-range correlated, displaying non-trivial clustering structures characterized by percolation-like transitions and fractality. These patterns can be described in the context of percolation theory of long-range correlated systems at criticality. Together, these results provide evidence that the illiteracy incidence can be related to a transmissible process, in which the lack of access to minimal education propagates in a population in a similar fashion to endemic diseases.
Submitted 26 September, 2018;
originally announced September 2018.
-
Unveiling Relationships Between Crime and Property in England and Wales Via Density Scale-Adjusted Metrics and Network Tools
Authors:
Haroldo V. Ribeiro,
Quentin S. Hanley,
Dan Lewis
Abstract:
Scale-adjusted metrics (SAMs) are a significant achievement of the urban scaling hypothesis. SAMs remove the inherent biases of per capita measures computed in the absence of isometric allometries. However, this approach is limited to urban areas, while a large portion of the world's population still lives outside cities and rural areas dominate land use worldwide. Here, we extend the concept of SAMs to population density scale-adjusted metrics (DSAMs) to reveal relationships among different types of crime and property metrics. Our approach allows all human environments to be considered, avoids problems in the definition of urban areas, and accounts for the heterogeneity of population distributions within urban regions. By combining DSAMs, cross-correlation, and complex network analysis, we find that crime and property types have intricate and hierarchically organized relationships leading to some striking conclusions. Drugs and burglary had uncorrelated DSAMs and, to the extent property transaction values are indicators of affluence, twelve out of fourteen crime metrics showed no evidence of specifically targeting affluence. Burglary and robbery were the most connected in our network analysis and the modular structures suggest an alternative to "zero-tolerance" policies by unveiling the crime and/or property types most likely to affect each other.
Submitted 3 February, 2018;
originally announced February 2018.
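One common way to build a scale-adjusted metric is to fit a power law in log-log space and take each region's residual from the fitted line. The sketch below illustrates that idea for population density, assuming the DSAM is the base-10 log residual; this definition, the function names, and the synthetic numbers are assumptions for illustration and are not taken from the paper:

```python
import math


def fit_power_law(density, metric):
    """Ordinary least squares fit of log10(metric) = log_a + beta * log10(density)."""
    x = [math.log10(d) for d in density]
    y = [math.log10(m) for m in metric]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    beta = sxy / sxx
    log_a = mean_y - beta * mean_x
    return log_a, beta


def scale_adjusted(density, metric):
    """Log residual of each region from the fitted density scaling law
    (assumed DSAM definition)."""
    log_a, beta = fit_power_law(density, metric)
    return [math.log10(m) - (log_a + beta * math.log10(d))
            for d, m in zip(density, metric)]


# Synthetic illustration only: five regions on an exact power law,
# with the middle region doubled relative to the trend.
density = [0.5, 2.0, 8.0, 32.0, 128.0]
metric = [3.0 * d ** 1.2 for d in density]
metric[2] *= 2.0

log_a, beta = fit_power_law(density, metric)
dsam = scale_adjusted(density, metric)
print("fitted exponent:", round(beta, 3))
print("residuals:", [round(r, 3) for r in dsam])
```

A positive residual marks a region that sits above what its population density alone would predict, a negative one a region below it. Residuals of this kind for different metrics can then be cross-correlated across regions.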
-
Rural to urban population density scaling of crime and property transactions in English and Welsh Parliamentary Constituencies
Authors:
Quentin S. Hanley,
Dan Lewis,
Haroldo V. Ribeiro
Abstract:
Urban population scaling of resource use, creativity metrics, and human behaviors has been widely studied. These studies have not looked in detail at the full range of human environments, which represent a continuum from the most rural to heavily urban. We examined monthly police crime reports and property transaction values across all 573 Parliamentary Constituencies in England and Wales, finding that scaling models based on population density provided a framework far superior to traditional population scaling. We found four types of scaling: i) non-urban scaling, in which a single power law explained the relationship between the metrics and population density from the most rural to heavily urban environments; ii) accelerated scaling, in which high population density was associated with an increase in the power-law exponent; iii) inhibited scaling, in which the urban environment resulted in a reduction in the power-law exponent, which nevertheless remained positive; and iv) collapsed scaling, in which transition to the high density environment resulted in a negative scaling exponent. Urban scaling transitions, when observed, took place universally between 10 and 70 people per hectare. This study significantly refines our understanding of urban scaling, making clear that some of what has been previously ascribed to urban environments may simply be the high density portion of non-urban scaling. It also makes clear that some metrics undergo specific transitions in urban environments and these transitions can include negative scaling exponents indicative of collapse. This study gives promise of far more sophisticated scale adjusted metrics and indicates that studies of urban scaling represent a high density subsection of overall scaling relationships which continue into rural environments.
Submitted 17 February, 2016;
originally announced February 2016.
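The four scaling types listed in the abstract above can be read as a rule on two exponents: one fitted below the density transition and one fitted above it. A minimal Python sketch of that rule, assuming a two-segment power-law fit has already produced the exponents; the function name `classify_scaling` and the tolerance `tol` are hypothetical and are not taken from the paper:

```python
def classify_scaling(beta_rural, beta_urban, tol=0.05):
    """Label a metric by comparing the power-law exponent below the
    density transition (beta_rural) with the one above it (beta_urban).

    tol is an assumed threshold for treating the two exponents as equal,
    i.e. a single power law over the whole density range.
    """
    if abs(beta_urban - beta_rural) <= tol:
        return "non-urban"      # one power law from rural to urban
    if beta_urban > beta_rural:
        return "accelerated"    # exponent increases at high density
    if beta_urban > 0:
        return "inhibited"      # exponent reduced but still positive
    return "collapsed"          # exponent not positive at high density


# Hypothetical exponent pairs, for illustration only.
examples = [(1.00, 1.02), (0.80, 1.30), (1.00, 0.40), (1.00, -0.30)]
for low, high in examples:
    print(low, high, classify_scaling(low, high))
```

In practice the comparison would account for the uncertainty of each fitted exponent; the fixed tolerance here only keeps the sketch short.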