-
Deep Learning Criminal Networks
Authors:
Haroldo V. Ribeiro,
Diego D. Lopes,
Arthur A. B. Pessa,
Alvaro F. Martins,
Bruno R. da Cunha,
Sebastian Goncalves,
Ervin K. Lenzi,
Quentin S. Hanley,
Matjaz Perc
Abstract:
Recent advances in deep learning methods have enabled researchers to develop and apply algorithms for the analysis and modeling of complex networks. These advances have sparked a surge of interest at the interface between network science and machine learning. Despite this, the use of machine learning methods to investigate criminal networks remains surprisingly scarce. Here, we explore the potenti…
▽ More
Recent advances in deep learning methods have enabled researchers to develop and apply algorithms for the analysis and modeling of complex networks. These advances have sparked a surge of interest at the interface between network science and machine learning. Despite this, the use of machine learning methods to investigate criminal networks remains surprisingly scarce. Here, we explore the potential of graph convolutional networks to learn patterns among networked criminals and to predict various properties of criminal networks. Using empirical data from political corruption, criminal police intelligence, and criminal financial networks, we develop a series of deep learning models based on the GraphSAGE framework that are able to recover missing criminal partnerships, distinguish among types of associations, predict the amount of money exchanged among criminal agents, and even anticipate partnerships and recidivism of criminals during the growth dynamics of corruption networks, all with impressive accuracy. Our deep learning models significantly outperform previous shallow learning approaches and produce high-quality embeddings for node and edge properties. Moreover, these models inherit all the advantages of the GraphSAGE framework, including the generalization to unseen nodes and scaling up to large graph structures.
△ Less
Submitted 4 June, 2023; v1 submitted 17 April, 2023;
originally announced April 2023.
-
Machine Learning Partners in Criminal Networks
Authors:
Diego D. Lopes,
Bruno R. da Cunha,
Alvaro F. Martins,
Sebastian Goncalves,
Ervin K. Lenzi,
Quentin S. Hanley,
Matjaz Perc,
Haroldo V. Ribeiro
Abstract:
Recent research has shown that criminal networks have complex organizational structures, but whether this can be used to predict static and dynamic properties of criminal networks remains little explored. Here, by combining graph representation learning and machine learning methods, we show that structural properties of political corruption, police intelligence, and money laundering networks can b…
▽ More
Recent research has shown that criminal networks have complex organizational structures, but whether this can be used to predict static and dynamic properties of criminal networks remains little explored. Here, by combining graph representation learning and machine learning methods, we show that structural properties of political corruption, police intelligence, and money laundering networks can be used to recover missing criminal partnerships, distinguish among different types of criminal and legal associations, as well as predict the total amount of money exchanged among criminal agents, all with outstanding accuracy. We also show that our approach can anticipate future criminal associations during the dynamic growth of corruption networks with significant accuracy. Thus, similar to evidence found at crime scenes, we conclude that structural patterns of criminal networks carry crucial information about illegal activities, which allows machine learning methods to predict missing information and even anticipate future criminal behavior.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.
-
Improving Adverse Drug Event Extraction with SpanBERT on Different Text Typologies
Authors:
Beatrice Portelli,
Daniele Passabì,
Edoardo Lenzi,
Giuseppe Serra,
Enrico Santus,
Emmanuele Chersoni
Abstract:
In recent years, Internet users are reporting Adverse Drug Events (ADE) on social media, blogs and health forums. Because of the large volume of reports, pharmacovigilance is seeking to resort to NLP to monitor these outlets. We propose for the first time the use of the SpanBERT architecture for the task of ADE extraction: this new version of the popular BERT transformer showed improved capabiliti…
▽ More
In recent years, Internet users are reporting Adverse Drug Events (ADE) on social media, blogs and health forums. Because of the large volume of reports, pharmacovigilance is seeking to resort to NLP to monitor these outlets. We propose for the first time the use of the SpanBERT architecture for the task of ADE extraction: this new version of the popular BERT transformer showed improved capabilities with multi-token text spans. We validate our hypothesis with experiments on two datasets (SMM4H and CADEC) with different text typologies (tweets and blog posts), finding that SpanBERT combined with a CRF outperforms all the competitors on both of them.
△ Less
Submitted 18 May, 2021;
originally announced May 2021.
-
RECKONition: a NLP-based system for Industrial Accidents at Work Prevention
Authors:
Patrizia Agnello,
Silvia M. Ansaldi,
Emilia Lenzi,
Alessio Mongelluzzo,
Manuel Roveri
Abstract:
Extracting patterns and useful information from Natural Language datasets is a challenging task, especially when dealing with data written in a language different from English, like Italian. Machine and Deep Learning, together with Natural Language Processing (NLP) techniques have widely spread and improved lately, providing a plethora of useful methods to address both Supervised and Unsupervised…
▽ More
Extracting patterns and useful information from Natural Language datasets is a challenging task, especially when dealing with data written in a language different from English, like Italian. Machine and Deep Learning, together with Natural Language Processing (NLP) techniques have widely spread and improved lately, providing a plethora of useful methods to address both Supervised and Unsupervised problems on textual information. We propose RECKONition, a NLP-based system for Industrial Accidents at Work Prevention. RECKONition, which is meant to provide Natural Language Understanding, Clustering and Inference, is the result of a joint partnership with the Italian National Institute for Insurance against Accidents at Work (INAIL). The obtained results showed the ability to process textual data written in Italian describing industrial accidents dynamics and consequences.
△ Less
Submitted 29 April, 2021;
originally announced April 2021.
-
The dynamical structure of political corruption networks
Authors:
Haroldo V. Ribeiro,
Luiz G. A. Alves,
Alvaro F. Martins,
Ervin K. Lenzi,
Matjaz Perc
Abstract:
Corruptive behaviour in politics limits economic growth, embezzles public funds, and promotes socio-economic inequality in modern democracies. We analyse well-documented political corruption scandals in Brazil over the past 27 years, focusing on the dynamical structure of networks where two individuals are connected if they were involved in the same scandal. Our research reveals that corruption ru…
▽ More
Corruptive behaviour in politics limits economic growth, embezzles public funds, and promotes socio-economic inequality in modern democracies. We analyse well-documented political corruption scandals in Brazil over the past 27 years, focusing on the dynamical structure of networks where two individuals are connected if they were involved in the same scandal. Our research reveals that corruption runs in small groups that rarely comprise more than eight people, in networks that have hubs and a modular structure that encompasses more than one corruption scandal. We observe abrupt changes in the size of the largest connected component and in the degree distribution, which are due to the coalescence of different modules when new scandals come to light or when governments change. We show further that the dynamical structure of political corruption networks can be used for successfully predicting partners in future scandals. We discuss the important role of network science in detecting and mitigating political corruption.
△ Less
Submitted 5 January, 2018;
originally announced January 2018.
-
Engagement in the electoral processes: scaling laws and the role of the political positions
Authors:
M. C. Mantovani,
H. V. Ribeiro,
E. K. Lenzi,
S. Picoli Jr.,
R. S. Mendes
Abstract:
We report on a statistical analysis of the engagement in the electoral processes of all Brazilian cities by considering the number of party memberships and the number of candidates for mayor and councillor. By investigating the relationships between the number of party members and the population of voters, we have found that the functional form of these relationships are well described by sub-line…
▽ More
We report on a statistical analysis of the engagement in the electoral processes of all Brazilian cities by considering the number of party memberships and the number of candidates for mayor and councillor. By investigating the relationships between the number of party members and the population of voters, we have found that the functional form of these relationships are well described by sub-linear power laws (allometric scaling) surrounded by a multiplicative log-normal noise. We have observed that this pattern is quite similar to those previously-reported for the relationships between the number candidates (mayor and councillor) and population of voters [EPL 96, 48001 (2011)], suggesting that similar universal laws may be ruling the engagement in the electoral processes. We also note that the power law exponents display a clear hierarchy, where the more influential is the political position the smaller is the value of the exponent. We have also investigated the probability distributions of the number of candidates (mayor and councilor), party memberships and voters. The results indicate that the most influential positions are characterized by distributions with very short-tails, while less influential positions display an intermediate power law decay before showing an exponential-like cutoff. We discuss that, in addition to the political power of the position, limitations in the number of available seats can also be connected with this changing of behavior. We further believe that our empirical findings point out to an underrepresentation effect, where the larger city is, the larger are the obstacles for more individuals to become directly engaged in the electoral process.
△ Less
Submitted 13 August, 2013;
originally announced August 2013.
-
Complexity-entropy causality plane: a useful approach for distinguishing songs
Authors:
H. V. Ribeiro,
L. Zunino,
R. S. Mendes,
E. K. Lenzi
Abstract:
Nowadays we are often faced with huge databases resulting from the rapid growth of data storage technologies. This is particularly true when dealing with music databases. In this context, it is essential to have techniques and tools able to discriminate properties from these massive sets. In this work, we report on a statistical analysis of more than ten thousand songs aiming to obtain a complexit…
▽ More
Nowadays we are often faced with huge databases resulting from the rapid growth of data storage technologies. This is particularly true when dealing with music databases. In this context, it is essential to have techniques and tools able to discriminate properties from these massive sets. In this work, we report on a statistical analysis of more than ten thousand songs aiming to obtain a complexity hierarchy. Our approach is based on the estimation of the permutation entropy combined with an intensive complexity measure, building up the complexity-entropy causality plane. The results obtained indicate that this representation space is very promising to discriminate songs as well as to allow a relative quantitative comparison among songs. Additionally, we believe that the here-reported method may be applied in practical situations since it is simple, robust and has a fast numerical implementation.
△ Less
Submitted 10 December, 2011;
originally announced December 2011.
-
Universal patterns in sound amplitudes of songs and music genres
Authors:
R. S. Mendes,
H. V. Ribeiro,
F. C. M. Freire,
A. A. Tateishi,
E. K. Lenzi
Abstract:
We report a statistical analysis over more than eight thousand songs. Specifically, we investigate the probability distribution of the normalized sound amplitudes. Our findings seems to suggest a universal form of distribution which presents a good agreement with a one-parameter stretched Gaussian. We also argue that this parameter can give information on music complexity, and consequently it goes…
▽ More
We report a statistical analysis over more than eight thousand songs. Specifically, we investigate the probability distribution of the normalized sound amplitudes. Our findings seems to suggest a universal form of distribution which presents a good agreement with a one-parameter stretched Gaussian. We also argue that this parameter can give information on music complexity, and consequently it goes towards classifying songs as well as music genres. Additionally, we present statistical evidences that correlation aspects of the songs are directly related with the non-Gaussian nature of their sound amplitude distributions.
△ Less
Submitted 1 December, 2010;
originally announced December 2010.