Search | arXiv e-print repository

Patterns of link reciprocity in directed, signed networks

Authors: Anna Gallo, Fabio Saracco, Renaud Lambiotte, Diego Garlaschelli, Tiziano Squartini

Abstract: Most of the analyses concerning signed networks have focused on the balance theory, hence identifying frustration with undirected, triadic motifs having an odd number of negative edges; much less attention has been paid to their directed counterparts. To fill this gap, we focus on signed, directed connections, with the aim of exploring the notion of frustration in such a context. When dealing with… ▽ More Most of the analyses concerning signed networks have focused on the balance theory, hence identifying frustration with undirected, triadic motifs having an odd number of negative edges; much less attention has been paid to their directed counterparts. To fill this gap, we focus on signed, directed connections, with the aim of exploring the notion of frustration in such a context. When dealing with signed, directed edges, frustration is a multi-faceted concept, admitting different definitions at different scales: if we limit ourselves to consider cycles of length two, frustration is related to reciprocity, i.e. the tendency of edges to admit the presence of partners pointing in the opposite direction. As the reciprocity of signed networks is still poorly understood, we adopt a principled approach for its study, defining quantities and introducing models to consistently capture empirical patterns of the kind. In order to quantify the tendency of empirical networks to form either mutualistic or antagonistic cycles of length two, we extend the Exponential Random Graphs framework to binary, directed, signed networks with global and local constraints and, then, compare the empirical abundance of the aforementioned patterns with the one expected under each model. We find that the (directed extension of the) balance theory is not capable of providing a consistent explanation of the patterns characterising the directed, signed networks considered in this work. Although part of the ambiguities can be solved by adopting a coarser definition of balance, our results call for a different theory, accounting for the directionality of edges in a coherent manner. In any case, the evidence that the empirical, signed networks can be highly reciprocated leads us to recommend to explicitly account for the role played by bidirectional dyads in determining frustration at higher levels (e.g. the triadic one). △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 35 pages, 9 figures, 4 tables

arXiv:2405.04896 [pdf, other]

Verified authors shape X/Twitter discursive communities

Authors: Stefano Guarino, Ayoub Mounim, Guido Caldarelli, Fabio Saracco

Abstract: Community detection algorithms try to extract a mesoscale structure from the available network data, generally avoiding any explicit assumption regarding the quantity and quality of information conveyed by specific sets of edges. In this paper, we show that the core of ideological/discursive communities on X/Twitter can be effectively identified by uncovering the most informative interactions in a… ▽ More Community detection algorithms try to extract a mesoscale structure from the available network data, generally avoiding any explicit assumption regarding the quantity and quality of information conveyed by specific sets of edges. In this paper, we show that the core of ideological/discursive communities on X/Twitter can be effectively identified by uncovering the most informative interactions in an authors-audience bipartite network through a maximum-entropy null model. The analysis is performed considering three X/Twitter datasets related to the main political events of 2022 in Italy, using as benchmarks four state-of-the-art algorithms - three descriptive, one inferential -, and manually annotating nearly 300 verified users based on their political affiliation. In terms of information content, the communities obtained with the entropy-based algorithm are comparable to those obtained with some of the benchmarks. However, such a methodology on the authors-audience bipartite network: uses just a small sample of the available data to identify the central users of each community; returns a neater partition of the user set in just a few, easy to interpret, communities; clusters well-known political figures in a way that better matches the political alliances when compared with the benchmarks. Our results provide an important insight into online debates, highlighting that online interaction networks are mostly shaped by the activity of a small set of users who enjoy public visibility even outside social media. △ Less

Submitted 8 May, 2024; originally announced May 2024.

arXiv:2402.18664 [pdf, other]

doi 10.1140/epjds/s13688-024-00461-6

Online disinformation in the 2020 U.S. Election: swing vs. safe states

Authors: Manuel Pratelli, Marinella Petrocchi, Fabio Saracco, Rocco De Nicola

Abstract: For U.S. presidential elections, most states use the so-called winner-take-all system, in which the state's presidential electors are awarded to the winning political party in the state after a popular vote phase, regardless of the actual margin of victory. Therefore, election campaigns are especially intense in states where there is no clear direction on which party will be the winning party. The… ▽ More For U.S. presidential elections, most states use the so-called winner-take-all system, in which the state's presidential electors are awarded to the winning political party in the state after a popular vote phase, regardless of the actual margin of victory. Therefore, election campaigns are especially intense in states where there is no clear direction on which party will be the winning party. These states are often referred to as swing states. To measure the impact of such an election law on the campaigns, we analyze the Twitter activity surrounding the 2020 US preelection debate, with a particular focus on the spread of disinformation. We find that about 88% of the online traffic was associated with swing states. In addition, the sharing of links to unreliable news sources is significantly more prevalent in tweets associated with swing states: in this case, untrustworthy tweets are predominantly generated by automated accounts. Furthermore, we observe that the debate is mostly led by two main communities, one with a predominantly Republican affiliation and the other with accounts of different political orientations. Most of the disinformation comes from the former. △ Less

Submitted 12 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

Comments: arXiv admin note: text overlap with arXiv:2303.12474

arXiv:2402.18621 [pdf, other]

Unveiling News Publishers Trustworthiness Through Social Interactions

Authors: Manuel Pratelli, Fabio Saracco, Marinella Petrocchi

Abstract: With the primary goal of raising readers' awareness of misinformation phenomena, extensive efforts have been made by both academic institutions and independent organizations to develop methodologies for assessing the trustworthiness of online news publishers. Unfortunately, existing approaches are costly and face critical scalability challenges. This study presents a novel framework for assessing… ▽ More With the primary goal of raising readers' awareness of misinformation phenomena, extensive efforts have been made by both academic institutions and independent organizations to develop methodologies for assessing the trustworthiness of online news publishers. Unfortunately, existing approaches are costly and face critical scalability challenges. This study presents a novel framework for assessing the trustworthiness of online news publishers using user interactions on social media platforms. The proposed methodology provides a versatile solution that serves the dual purpose of i) identifying verifiable online publishers and ii) automatically performing an initial estimation of the trustworthiness of previously unclassified online news outlets. △ Less

Submitted 28 February, 2024; originally announced February 2024.

Comments: A pre-final version of the paper accepted at WebSci'24

arXiv:2310.01284 [pdf, other]

Pattern detection in bipartite networks: a review of terminology, applications and methods

Authors: Zachary Neal, Annabel Cadieux, Diego Garlaschelli, Nicholas J. Gotelli, Fabio Saracco, Tiziano Squartini, Shade T. Shutters, Werner Ulrich, Guanyang Wang, Giovanni Strona

Abstract: Two dimensional matrices with binary (0/1) entries are a common data structure in many research fields. Examples include ecology, economics, mathematics, physics, psychometrics and others. Because the columns and rows of these matrices represent distinct entities, they can equivalently be expressed as a pair of bipartite networks that are linked by projection. A variety of diversity statistics and… ▽ More Two dimensional matrices with binary (0/1) entries are a common data structure in many research fields. Examples include ecology, economics, mathematics, physics, psychometrics and others. Because the columns and rows of these matrices represent distinct entities, they can equivalently be expressed as a pair of bipartite networks that are linked by projection. A variety of diversity statistics and network metrics can then be used to quantify patterns in these matrices and networks. But what should these patterns be compared to? In all of these disciplines, researchers have recognized the necessity of comparing an empirical matrix to a benchmark set of "null" matrices created by randomizing certain elements of the original data. This common need has nevertheless promoted the independent development of methodologies by researchers who come from different backgrounds and use different terminology. Here, we provide a multidisciplinary review of randomization techniques for matrices representing binary, bipartite networks. We aim to translate the concepts from different technical domains into a common language that is accessible to a broad scientific audience. Specifically, after briefly reviewing examples of binary matrix structures across different fields, we introduce the major approaches and common strategies for randomizing these matrices. We then explore the details of and performance of specific techniques, and discuss their limitations and computational challenges. In particular, we focus on the conceptual importance and implementation of structural constraints on the randomization, such as preserving row or columns sums of the original matrix in each of the randomized matrices. Our review serves both as a guide for empiricists in different disciplines, as well as a reference point for researchers working on theoretical and methodological developments in matrix randomization methods. △ Less

Submitted 2 October, 2023; originally announced October 2023.

arXiv:2308.01750 [pdf, other]

doi 10.1093/pnasnexus/pgae177

Entropy-based detection of Twitter echo chambers

Authors: Manuel Pratelli, Fabio Saracco, Marinella Petrocchi

Abstract: Echo chambers, i.e. clusters of users exposed to news and opinions in line with their previous beliefs, were observed in many online debates on social platforms. We propose a completely unbiased entropy-based method for detecting echo chambers. The method is completely agnostic to the nature of the data. In the Italian Twitter debate about the Covid-19 vaccination, we find a limited presence of us… ▽ More Echo chambers, i.e. clusters of users exposed to news and opinions in line with their previous beliefs, were observed in many online debates on social platforms. We propose a completely unbiased entropy-based method for detecting echo chambers. The method is completely agnostic to the nature of the data. In the Italian Twitter debate about the Covid-19 vaccination, we find a limited presence of users in echo chambers (about 0.35% of all users). Nevertheless, their impact on the formation of a common discourse is strong, as users in echo chambers are responsible for nearly a third of the retweets in the original dataset. Moreover, in the case study observed, echo chambers appear to be a receptacle for disinformative content. △ Less

Submitted 28 February, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

Comments: 30 pages, 11 figures, 7 tables

Journal ref: PNAS Nexus, Volume 3, Issue 5, May 2024, pgae177

arXiv:2304.12245 [pdf, other]

doi 10.1088/2632-072X/ad1411

Inferring comparative advantage via entropy maximization

Authors: Matteo Bruno, Dario Mazzilli, Aurelio Patelli, Tiziano Squartini, Fabio Saracco

Abstract: We revise the procedure proposed by Balassa to infer comparative advantage, which is a standard tool, in Economics, to analyze specialization (of countries, regions, etc.). Balassa's approach compares the export of a product for each country with what would be expected from a benchmark based on the total volumes of countries and products flows. Based on results in the literature, we show that the… ▽ More We revise the procedure proposed by Balassa to infer comparative advantage, which is a standard tool, in Economics, to analyze specialization (of countries, regions, etc.). Balassa's approach compares the export of a product for each country with what would be expected from a benchmark based on the total volumes of countries and products flows. Based on results in the literature, we show that the implementation of Balassa's idea generates a bias: the prescription of the maximum likelihood used to calculate the parameters of the benchmark model conflicts with the model's definition. Moreover, Balassa's approach does not implement any statistical validation. Hence, we propose an alternative procedure to overcome such a limitation, based upon the framework of entropy maximisation and implementing a proper test of hypothesis: the `key products' of a country are, now, the ones whose production is significantly larger than expected, under a null-model constraining the same amount of information employed by Balassa's approach. What we found is that countries diversification is always observed, regardless of the strictness of the validation procedure. Besides, the ranking of countries' fitness is only partially affected by the details of the validation scheme employed for the analysis while large differences are found to affect the rankings of products Complexities. The routine for implementing the entropy-based filtering procedures employed here is freely available through the official Python Package Index PyPI. △ Less

Submitted 24 April, 2023; originally announced April 2023.

Journal ref: 2023 J. Phys. Complex. 4 045011

arXiv:2303.12474 [pdf, other]

Swinging in the States: Does disinformation on Twitter mirror the US presidential election system?

Authors: Manuel Pratelli, Marinella Petrocchi, Fabio Saracco, Rocco De Nicola

Abstract: For more than a decade scholars have been investigating the disinformation flow on social media contextually to societal events, like, e.g., elections. In this paper, we analyze the Twitter traffic related to the US 2020 pre-election debate and ask whether it mirrors the electoral system. The U.S. electoral system provides that, regardless of the actual vote gap, the premier candidate who received… ▽ More For more than a decade scholars have been investigating the disinformation flow on social media contextually to societal events, like, e.g., elections. In this paper, we analyze the Twitter traffic related to the US 2020 pre-election debate and ask whether it mirrors the electoral system. The U.S. electoral system provides that, regardless of the actual vote gap, the premier candidate who received more votes in one state `takes' that state. Criticisms of this system have pointed out that election campaigns can be more intense in particular key states to achieve victory, so-called {\it swing states}. Our intuition is that election debate may cause more traffic on Twitter-and probably be more plagued by misinformation-when associated with swing states. The results mostly confirm the intuition. About 88\% of the entire traffic can be associated with swing states, and links to non-trustworthy news are shared far more in swing-related traffic than the same type of news in safe-related traffic. Considering traffic origin instead, non-trustworthy tweets generated by automated accounts, so-called social bots, are mostly associated with swing states. Our work sheds light on the role an electoral system plays in the evolution of online debates, with, in the spotlight, disinformation and social bots. △ Less

Submitted 22 March, 2023; originally announced March 2023.

Comments: 9 pages, 2 figures; Accepted @CySoc 2023, International Workshop on Cyber Social Threats, co-located with the ACM Web conference 2023, April 30, 2023. The present version is a preprint

arXiv:2303.07023 [pdf, other]

doi 10.1038/s42005-024-01640-7

Testing structural balance theories in heterogeneous signed networks

Authors: Anna Gallo, Diego Garlaschelli, Renaud Lambiotte, Fabio Saracco, Tiziano Squartini

Abstract: The abundance of data about social relationships allows the human behavior to be analyzed as any other natural phenomenon. Here we focus on balance theory, stating that social actors tend to avoid establishing cycles with an odd number of negative links. This statement, however, can be supported only after a comparison with a benchmark. Since the existing ones disregard actors' heterogeneity, we e… ▽ More The abundance of data about social relationships allows the human behavior to be analyzed as any other natural phenomenon. Here we focus on balance theory, stating that social actors tend to avoid establishing cycles with an odd number of negative links. This statement, however, can be supported only after a comparison with a benchmark. Since the existing ones disregard actors' heterogeneity, we extend Exponential Random Graphs to signed networks with both global and local constraints and employ them to assess the significance of empirical unbalanced patterns. We find that the nature of balance crucially depends on the null model: while homogeneous benchmarks favor the weak balance theory, according to which only triangles with one negative link should be under-represented, heterogeneous benchmarks favor the strong balance theory, according to which also triangles with all negative links should be under-represented. Biological networks, instead, display strong frustration under any benchmark, confirming that structural balance inherently characterizes social networks. △ Less

Submitted 11 April, 2024; v1 submitted 13 March, 2023; originally announced March 2023.

Comments: 46 pages, 14 figures, 7 tables

Journal ref: Comm. Phys. 7 (154) (2024)

arXiv:2302.01282 [pdf]

doi 10.3390/agronomy13020576

Bibliometric and social network analysis on the use of satellite imagery in agriculture: an entropy-based approach

Authors: Riccardo Dainelli, Fabio Saracco

Abstract: Satellite imagery is gaining popularity as a valuable tool to lower the impact on natural resources and increase profits for farmers. The purpose of this study is twofold: to mine the scientific literature to reveal the structure of this research domain, and to investigate to what extent scientific results can reach a wider public audience. To meet these two objectives, a Web of Science and a Twit… ▽ More Satellite imagery is gaining popularity as a valuable tool to lower the impact on natural resources and increase profits for farmers. The purpose of this study is twofold: to mine the scientific literature to reveal the structure of this research domain, and to investigate to what extent scientific results can reach a wider public audience. To meet these two objectives, a Web of Science and a Twitter dataset were retrieved and analysed, respectively. For the academic literature, different performances of various countries were observed: the USA and China resulted as the leading actors, both in terms of published papers and employed researchers. Among the categorised keywords, "resolution", "Landsat", "yield", "wheat" and "multispectral" are the most used. Then, analysing the semantic network of the words used in the various abstracts, the different facets of the research in satellite remote sensing were detected. The importance of retrieving meteorological parameters through remote sensing and the broad use of vegetation indexes emerged from these analyses. As emerging topics, classification tasks for land use assessment and crop recognition stand out, alongside the use of hyperspectral sensors. Regarding the interaction of academia with the public, the analysis showed that it is practically absent on Twitter: most of the activity therein stems from private companies advertising their business. This shows that there is still a communication gap between academia and actors from other societal sectors. △ Less

Submitted 17 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

Comments: 30 pages, 13 figures. The version here is a draft, the final version can be found at the link: https://www.mdpi.com/2073-4395/13/2/576

Journal ref: Agronomy 2023, 13(2), 576

arXiv:2209.10439 [pdf, other]

doi 10.1038/s41598-022-22798-6

The Fitness-Corrected Block Model, or how to create maximum-entropy data-driven spatial social networks

Authors: Massimo Bernaschi, Alessandro Celestini, Stefano Guarino, Enrico Mastrostefano, Fabio Saracco

Abstract: Models of networks play a major role in explaining and reproducing empirically observed patterns. Suitable models can be used to randomize an observed network while preserving some of its features, or to generate synthetic graphs whose properties may be tuned upon the characteristics of a given population. In the present paper, we introduce the Fitness-Corrected Block Model, an adjustable-density… ▽ More Models of networks play a major role in explaining and reproducing empirically observed patterns. Suitable models can be used to randomize an observed network while preserving some of its features, or to generate synthetic graphs whose properties may be tuned upon the characteristics of a given population. In the present paper, we introduce the Fitness-Corrected Block Model, an adjustable-density variation of the well-known Degree-Corrected Block Model, and we show that the proposed construction yields a maximum entropy model. When the network is sparse, we derive an analytical expression for the degree distribution of the model that depends on just the constraints and the chosen fitness-distribution. Our model is perfectly suited to define maximum-entropy data-driven spatial social networks, where each block identifies vertices having similar position (e.g., residence) and age, and where the expected block-to-block adjacency matrix can be inferred from the available data. In this case, the sparse-regime approximation coincides with a phenomenological model where the probability of a link binding two individuals is directly proportional to their sociability and to the typical cohesion of their age-groups, whereas it decays as an inverse-power of their geographic distance. We support our analytical findings through simulations of a stylized urban area. △ Less

Submitted 21 September, 2022; originally announced September 2022.

Comments: 14 pages, 1 figure

Journal ref: Sci Rep 12, 18206 (2022)

arXiv:2207.14664 [pdf, other]

doi 10.1038/s41598-023-34024-y

Sustainable Development Goals as unifying narratives in large UK firms' Twitter discussions

Authors: Alessia Patuelli, Fabio Saracco

Abstract: To achieve sustainable development worldwide, the United Nations set 17 Sustainable Development Goals (SDGs) for humanity to reach by 2030. Society is involved in the challenge, with firms playing a crucial role. Thus, a key question is to what extent firms engage with the SDGs. Efforts to map firms' contributions have mainly focused on analysing companies' reports based on limited samples and non… ▽ More To achieve sustainable development worldwide, the United Nations set 17 Sustainable Development Goals (SDGs) for humanity to reach by 2030. Society is involved in the challenge, with firms playing a crucial role. Thus, a key question is to what extent firms engage with the SDGs. Efforts to map firms' contributions have mainly focused on analysing companies' reports based on limited samples and non-real-time data. We present a novel interdisciplinary approach based on analysing big data from an online social network (Twitter) with complex network methods from statistical physics. By doing so, we provide a comprehensive and nearly real-time picture of firms' engagement with SDGs. Results show that: 1) SDGs themes tie conversations among major UK firms together; 2) the social dimension is predominant; 3) the attention to different SDGs themes varies depending on the community and sector firms belong to; 4) stakeholder engagement is higher on posts related to global challenges compared to general ones; 5) large UK companies and stakeholders generally behave differently from Italian ones. This paper provides theoretical contributions and practical implications relevant to firms, policymakers and management education. Most importantly, it provides a novel tool and a set of keywords to monitor the influence of the private sector on the implementation of the 2030 Agenda. △ Less

Submitted 4 May, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

Comments: 22 pages, 6 figures, 8 tables

Journal ref: Sci Rep 13, 7017 (2023)

arXiv:2207.12123 [pdf, other]

Entropy-based random models for hypergraphs

Authors: Fabio Saracco, Giovanni Petri, Renaud Lambiotte, Tiziano Squartini

Abstract: Network theory has primarily focused on pairwise relationships, disregarding many-body interactions: neglecting them, however, can lead to misleading representations of complex systems. Hypergraphs represent an increasingly popular alternative for describing polyadic interactions: our innovation lies in leveraging the representation of hypergraphs based on the incidence matrix for extending the en… ▽ More Network theory has primarily focused on pairwise relationships, disregarding many-body interactions: neglecting them, however, can lead to misleading representations of complex systems. Hypergraphs represent an increasingly popular alternative for describing polyadic interactions: our innovation lies in leveraging the representation of hypergraphs based on the incidence matrix for extending the entropy-based framework to higher-order structures. In analogy with the Exponential Random Graphs, we name the members of this novel class of models Exponential Random Hypergraphs. Here, we focus on two explicit examples, i.e. the generalisations of the Erdös-Rényi Model and of the Configuration Model. After discussing their asymptotic properties, we employ them to analyse real-world configurations: more specifically, i) we extend the definition of several network quantities to hypergraphs, ii) compute their expected value under each null model and iii) compare it with the empirical one, in order to detect deviations from random behaviours. Differently from currently available techniques, ours is analytically tractable, scalable and effective in singling out the structural patterns of real-world hypergraphs differing significantly from those emerging as a consequence of simpler, structural constraints. △ Less

Submitted 14 June, 2024; v1 submitted 21 July, 2022; originally announced July 2022.

Comments: 27 pages, 11 figures, 4 tables

arXiv:2202.03316 [pdf, other]

doi 10.1038/s41598-022-16603-7

Bow-Tie Structures of Twitter Discursive Communities

Authors: Mattia Mattei, Manuel Pratelli, Guido Caldarelli, Marinella Petrocchi, Fabio Saracco

Abstract: In the analysis of Twitter debate, the recent literature focused on discursive communities, i.e. clusters of accounts interacting among themselves via retweets. In the present work, we studied discursive communities in 8 different thematic Twitter datasets in various languages. Surprisingly, we observed that almost all discursive communities therein display a bow-tie structure during political or… ▽ More In the analysis of Twitter debate, the recent literature focused on discursive communities, i.e. clusters of accounts interacting among themselves via retweets. In the present work, we studied discursive communities in 8 different thematic Twitter datasets in various languages. Surprisingly, we observed that almost all discursive communities therein display a bow-tie structure during political or societal debates. Instead, they are absent when the argument of the discussion is different as sport events, as in the case of Euro2020 Turkish and Italian datasets. We furthermore analysed the quality of the content created in the various sectors of the different discursive communities, using the domain annotation from the fact-checking website Newsguard: we observe that, when the discursive community is affected by m/disinformation, the content with the lowest quality is the ones produced and shared in SCC and, in particular, a strong incidence of low- or non-reputable messages is present in the flow of retweets between the SCC and the OUT sectors. In this sense, in discursive communities affected by m/disinformation, the greatest part of the accounts has access to a great variety of contents, but whose quality is, in general, quite low; such a situation perfectly describes the phenomenon of infodemic, i.e. the access to "an excessive amount of information about a problem, which makes it difficult to identify a solution", according to WHO). △ Less

Submitted 28 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

Comments: 47 pages, 25 figures, 7 tables

Journal ref: Sci Rep 12, 12944 (2022)

arXiv:2107.14155 [pdf, other]

doi 10.1140/epjds/s13688-022-00330-0

Brexit and bots: characterizing the behaviour of automated accounts on Twitter during the UK election

Authors: Matteo Bruno, Renaud Lambiotte, Fabio Saracco

Abstract: Online Social Networks represent a novel opportunity for political campaigns, revolutionising the paradigm of political communication. Nevertheless, many studies uncovered the presence of d/misinformation campaigns or of malicious activities by genuine or automated users, putting at severe risk the credibility of online platforms. This phenomenon is particularly evident during crucial political ev… ▽ More Online Social Networks represent a novel opportunity for political campaigns, revolutionising the paradigm of political communication. Nevertheless, many studies uncovered the presence of d/misinformation campaigns or of malicious activities by genuine or automated users, putting at severe risk the credibility of online platforms. This phenomenon is particularly evident during crucial political events, as political elections. In the present paper, we provide a comprehensive description of the structure of the networks of interactions among users and bots during the UK elections of 2019. In particular, we focus on the polarised discussion about Brexit on Twitter analysing a data set made of more than 10 million tweets posted for over a month. We found that the presence of automated accounts fostered the debate particularly in the days before the UK national elections, in which we find a steep increase of bots in the discussion; in the days after the election day, their incidence returned to values similar to the ones observed few weeks before the elections. On the other hand, we found that the number of suspended users (i.e. accounts that were removed by the platform for some violation of the Twitter policy) remained constant until the election day, after which it reached significantly higher values. Remarkably, after the TV debate between Boris Johnson and Jeremy Corbyn, we observed the injection of a large number of novel bots whose behaviour is markedly different from that of pre-existing ones. Finally, we explored the bots' stance, finding that their activity is spread across the whole political spectrum, although in different proportions, and we studied the different usage of hashtags by automated accounts and suspended users, thus targeting the formation of common narratives in different sides of the debate. △ Less

Submitted 29 July, 2021; originally announced July 2021.

Comments: 18 pages, 13 figures

Journal ref: EPJ Data Sci. 11, 17 (2022)

arXiv:2106.05815 [pdf, other]

doi 10.1140/epjds/s13688-021-00301-x

Italian Twitter semantic network during the Covid-19 epidemic

Authors: Mattia Mattei, Guido Caldarelli, Tiziano Squartini, Fabio Saracco

Abstract: The Covid-19 pandemic has had a deep impact on the lives of the entire world population, inducing a participated societal debate. As in other contexts, the debate has been the subject of several d/misinformation campaigns; in a quite unprecedented fashion, however, the presence of false information has seriously put at risk the public health. In this sense, detecting the presence of malicious narr… ▽ More The Covid-19 pandemic has had a deep impact on the lives of the entire world population, inducing a participated societal debate. As in other contexts, the debate has been the subject of several d/misinformation campaigns; in a quite unprecedented fashion, however, the presence of false information has seriously put at risk the public health. In this sense, detecting the presence of malicious narratives and identifying the kinds of users that are more prone to spread them represent the first step to limit the persistence of the former ones. In the present paper we analyse the semantic network observed on Twitter during the first Italian lockdown (induced by the hashtags contained in approximately 1.5 millions tweets published between the 23rd of March 2020 and the 23rd of April 2020) and study the extent to which various discursive communities are exposed to d/misinformation arguments. As observed in other studies, the recovered discursive communities largely overlap with traditional political parties, even if the debated topics concern different facets of the management of the pandemic. Although the themes directly related to d/misinformation are a minority of those discussed within our semantic networks, their popularity is unevenly distributed among the various discursive communities. △ Less

Submitted 10 June, 2021; originally announced June 2021.

Comments: 29 pages, 11 figures

Journal ref: EPJ Data Science 10 (47) (2021)

arXiv:2103.06705 [pdf, other]

doi 10.1371/journal.pone.0254748

Firms' Challenges and Social Responsibilities during Covid-19: a Twitter Analysis

Authors: Alessia Patuelli, Guido Caldarelli, Nicola Lattanzi, Fabio Saracco

Abstract: The Covid-19 pandemic caused disruptive effects for individuals, firms, and societies. In this paper, we offer insights on the major issues and challenges firms are facing in the Covid-19 pandemic, as well as their concerns for Corporate Social Responsibility (CSR) themes. To do so, we investigate large Italian firms' discussion on Twitter in the first nine months of the pandemic. We downloaded al… ▽ More The Covid-19 pandemic caused disruptive effects for individuals, firms, and societies. In this paper, we offer insights on the major issues and challenges firms are facing in the Covid-19 pandemic, as well as their concerns for Corporate Social Responsibility (CSR) themes. To do so, we investigate large Italian firms' discussion on Twitter in the first nine months of the pandemic. We downloaded all Twitter posts from 1st of March, 2020, to 17th of November, 2020 by the accounts of the largest Italian firms, i.e. those with 250 or more employees. We then built the bipartite network of accounts and hashtags and, using an entropy-based null model as a benchmark, we projected the information contained in the network into the accounts layers, identifying a network of accounts in which a link indicates a non trivial similarity in terms of their usage of hashtags. We find that the conversation is focused around 13 communities, 10 of which include Covid-19 themes. The core of the network is formed of 5 communities, which deal with environmental sustainability, digital innovation and safety. Firms' ownership type does not seem to influence the conversation. 10 communities out of 13 mention hashtags related to CSR, with the environmental and social dimensions as the prevalent ones. Interestingly enough, the social dimension seems more relevant in the communities dealing with digital innovation and safety. However, the relevance of CSR hashtags is very small at the single message level, but with some peculiarities arising in specific communities. Overall, our paper highlights the role of network methods on Twitter data as a tool which can support managers and policy makers to design their strategies and decision making, capturing firms' emerging issues and relevant themes. △ Less

Submitted 11 March, 2021; originally announced March 2021.

Comments: 30 pages, 4 figures

Journal ref: PLoS ONE 16(7): e0254748 (2021)

arXiv:2103.05623 [pdf, other]

doi 10.1038/s42254-021-00322-5

The Physics of Financial Networks

Authors: Marco Bardoscia, Paolo Barucca, Stefano Battiston, Fabio Caccioli, Giulio Cimini, Diego Garlaschelli, Fabio Saracco, Tiziano Squartini, Guido Caldarelli

Abstract: The field of Financial Networks is a paramount example of the novel applications of Statistical Physics that have made possible by the present data revolution. As the total value of the global financial market has vastly outgrown the value of the real economy, financial institutions on this planet have created a web of interactions whose size and topology calls for a quantitative analysis by means… ▽ More The field of Financial Networks is a paramount example of the novel applications of Statistical Physics that have made possible by the present data revolution. As the total value of the global financial market has vastly outgrown the value of the real economy, financial institutions on this planet have created a web of interactions whose size and topology calls for a quantitative analysis by means of Complex Networks. Financial Networks are not only a playground for the use of basic tools of statistical physics as ensemble representation and entropy maximization; rather, their particular dynamics and evolution triggered theoretical advancements as the definition of DebtRank to measure the impact and diffusion of shocks in the whole systems. In this review we present the state of the art in this field, starting from the different definitions of financial networks (based either on loans, on assets ownership, on contracts involving several parties -- such as credit default swaps, to multiplex representation when firms are introduced in the game and a link with real economy is drawn) and then discussing the various dynamics of financial contagion as well as applications in financial network inference and validation. We believe that this analysis is particularly timely since financial stability as well as recent innovations in climate finance, once properly analysed and understood in terms of complex network theory, can play a pivotal role in the transformation of our society towards a more sustainable world. △ Less

Submitted 9 March, 2021; originally announced March 2021.

Comments: version submitted to Nature Reviews Physics

Journal ref: Nat. Rev. Phys. 3 (7), 490-507 (2021)

arXiv:2103.04653 [pdf, ps, other]

doi 10.1371/journal.pone.0256705

Networked partisanship and framing: a socio-semantic network analysis of the Italian debate on migration

Authors: Tommaso Radicioni, Tiziano Squartini, Elena Pavan, Fabio Saracco

Abstract: The huge amount of data made available by the massive usage of social media has opened up the unprecedented possibility to carry out a data-driven study of political processes. While particular attention has been paid to phenomena like elite and mass polarization during online debates and echo-chambers formation, the interplay between online partisanship and framing practices, jointly sustaining a… ▽ More The huge amount of data made available by the massive usage of social media has opened up the unprecedented possibility to carry out a data-driven study of political processes. While particular attention has been paid to phenomena like elite and mass polarization during online debates and echo-chambers formation, the interplay between online partisanship and framing practices, jointly sustaining adversarial dynamics, still remains overlooked. With the present paper, we carry out a socio-semantic analysis of the debate about migration policies observed on the Italian Twittersphere, across the period May-November 2019. As regards the social analysis, our methodology allows us to extract relevant information about the political orientation of the communities of users - hereby called partisan communities - without resorting upon any external information. Remarkably, our community detection technique is sensitive enough to clearly highlight the dynamics characterizing the relationship among different political forces.As regards the semantic analysis, our networks of hashtags display a mesoscale structure organized in a core-periphery fashion, across the entire observation period. Taken altogether, our results point at different, yet overlap**, trajectories of conflict played out using migration issues as a backdrop. A first line opposes communities discussing substantively of migration to communities approaching this issue just to fuel hostility against political opponents; within the second line, a mechanism of distancing between partisan communities reflects shifting political alliances within the governmental coalition. Ultimately, our results contribute to shed light on the complexity of the Italian political context characterized by multiple poles of partisan alignment. △ Less

Submitted 22 June, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

Journal ref: PLoS ONE 16 (8): e0256705 (2021)

arXiv:2101.12625 [pdf, other]

doi 10.1038/s41598-021-93830-4

Fast and scalable likelihood maximization for Exponential Random Graph Models with local constraints

Authors: Nicolò Vallarano, Matteo Bruno, Emiliano Marchese, Giuseppe Trapani, Fabio Saracco, Giulio Cimini, Mario Zanon, Tiziano Squartini

Abstract: Exponential Random Graph Models (ERGMs) have gained increasing popularity over the years. Rooted into statistical physics, the ERGMs framework has been successfully employed for reconstructing networks, detecting statistically significant patterns in graphs, counting networked configurations with given properties. From a technical point of view, the ERGMs workflow is defined by two subsequent opti… ▽ More Exponential Random Graph Models (ERGMs) have gained increasing popularity over the years. Rooted into statistical physics, the ERGMs framework has been successfully employed for reconstructing networks, detecting statistically significant patterns in graphs, counting networked configurations with given properties. From a technical point of view, the ERGMs workflow is defined by two subsequent optimization steps: the first one concerns the maximization of Shannon entropy and leads to identify the functional form of the ensemble probability distribution that is maximally non-committal with respect to the missing information; the second one concerns the maximization of the likelihood function induced by this probability distribution and leads to its numerical determination. This second step translates into the resolution of a system of $O(N)$ non-linear, coupled equations (with $N$ being the total number of nodes of the network under analysis), a problem that is affected by three main issues, i.e. accuracy, speed and scalability. The present paper aims at addressing these problems by comparing the performance of three algorithms (i.e. Newton's method, a quasi-Newton method and a recently-proposed fixed-point recipe) in solving several ERGMs, defined by binary and weighted constraints in both a directed and an undirected fashion. While Newton's method performs best for relatively little networks, the fixed-point recipe is to be preferred when large configurations are considered, as it ensures convergence to the solution within seconds for networks with hundreds of thousands of nodes (e.g. the Internet, Bitcoin). We attach to the paper a Python code implementing the three aforementioned algorithms on all the ERGMs considered in the present work. △ Less

Submitted 22 July, 2021; v1 submitted 29 January, 2021; originally announced January 2021.

Comments: Python code available at the following URL: https://pypi.org/project/NEMtropy/

Journal ref: Sci. Rep. 11 (15227) (2021)

arXiv:2011.05933 [pdf, other]

doi 10.1371/journal.pone.0249634

A model for the Twitter sentiment curve

Authors: Giacomo Aletti, Irene Crimaldi, Fabio Saracco

Abstract: Twitter is among the most used online platforms for the political communications, due to the concision of its messages (which is particularly suitable for political slogans) and the quick diffusion of messages. Especially when the argument stimulate the emotionality of users, the content on Twitter is shared with extreme speed and thus studying the tweet sentiment if of utmost importance to predic… ▽ More Twitter is among the most used online platforms for the political communications, due to the concision of its messages (which is particularly suitable for political slogans) and the quick diffusion of messages. Especially when the argument stimulate the emotionality of users, the content on Twitter is shared with extreme speed and thus studying the tweet sentiment if of utmost importance to predict the evolution of the discussions and the register of the relative narratives. In this article, we present a model able to reproduce the dynamics of the sentiments of tweets related to specific topics and periods and to provide a prediction of the sentiment of the future posts based on the observed past. The model is a recent variant of the Pólya urn, introduced and studied in arXiv:1906.10951 and arXiv:2010.06373, which is characterized by a "local" reinforcement, i.e. a reinforcement mechanism mainly based on the most recent observations, and by a random persistent fluctuation of the predictive mean. In particular, this latter feature is capable of capturing the trend fluctuations in the sentiment curve. While the proposed model is extremely general and may be also employed in other contexts, it has been tested on several Twitter data sets and demonstrated greater performances compared to the standard Pólya urn model. Moreover, the different performances on different data sets highlight different emotional sensitivities respect to a public event. △ Less

Submitted 11 November, 2020; originally announced November 2020.

Comments: 19 pages, 12 figures

Journal ref: PLoS ONE 16(4): e0249634 (2021)

arXiv:2010.01913 [pdf, other]

doi 10.1140/epjds/s13688-021-00289-4

Flow of online misinformation during the peak of the COVID-19 pandemic in Italy

Authors: Guido Caldarelli, Rocco de Nicola, Marinella Petrocchi, Manuel Pratelli, Fabio Saracco

Abstract: The COVID-19 pandemic has impacted on every human activity and, because of the urgency of finding the proper responses to such an unprecedented emergency, it generated a diffused societal debate. The online version of this discussion was not exempted by the presence of d/misinformation campaigns, but differently from what already witnessed in other debates, the COVID-19 -- intentional or not -- fl… ▽ More The COVID-19 pandemic has impacted on every human activity and, because of the urgency of finding the proper responses to such an unprecedented emergency, it generated a diffused societal debate. The online version of this discussion was not exempted by the presence of d/misinformation campaigns, but differently from what already witnessed in other debates, the COVID-19 -- intentional or not -- flow of false information put at severe risk the public health, reducing the effectiveness of governments' countermeasures. In the present manuscript, we study the effective impact of misinformation in the Italian societal debate on Twitter during the pandemic, focusing on the various discursive communities. In order to extract the discursive communities, we focus on verified users, i.e. accounts whose identity is officially certified by Twitter. We thus infer the various discursive communities based on how verified users are perceived by standard ones: if two verified accounts are considered as similar by non unverified ones, we link them in the network of certified accounts. We first observe that, beside being a mostly scientific subject, the COVID-19 discussion show a clear division in what results to be different political groups. At this point, by using a commonly available fact-checking software (NewsGuard), we assess the reputation of the pieces of news exchanged. We filter the network of retweets (i.e. users re-broadcasting the same elementary piece of information, or tweet) from random noise and check the presence of messages displaying an url. The impact of misinformation posts reaches the 22.1% in the right and center-right wing community and its contribution is even stronger in absolute numbers, due to the activity of this group: 96% of all non reputable urls shared by political groups come from this community. △ Less

Submitted 23 February, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

Comments: 25 pages, 4 figures. The Abstract, the Introduction, the Results, the Conclusions and the Methods were substantially rewritten. The plot of the network have been changed, as well as tables

Journal ref: EPJ Data Sci. 10, 34 (2021)

arXiv:2009.02960 [pdf, ps, other]

doi 10.1038/s41598-021-92337-2

Analysing Twitter Semantic Networks: the case of 2018 Italian Elections

Authors: Tommaso Radicioni, Fabio Saracco, Elena Pavan, Tiziano Squartini

Abstract: Social media play a key role in sha** citizens' political opinion. According to the Eurobarometer, the percentage of EU citizens employing online social networks on a daily basis has increased from 18% in 2010 to 48% in 2019. The entwinement between social media and the unfolding of political dynamics has motivated the interest of researchers for the analysis of users online behavior - with part… ▽ More Social media play a key role in sha** citizens' political opinion. According to the Eurobarometer, the percentage of EU citizens employing online social networks on a daily basis has increased from 18% in 2010 to 48% in 2019. The entwinement between social media and the unfolding of political dynamics has motivated the interest of researchers for the analysis of users online behavior - with particular emphasis on group polarization during debates and echo-chambers formation. In this context, attention has been predominantly directed towards the study of online relations between users while semantic aspects have remained under-explored. In the present paper, we aim at filling this gap by adopting a two-steps approach. First, we identify the discursive communities animating the political debate in the run up of the 2018 Italian Elections as groups of users with a significantly-similar retweeting behavior. Second, we study the semantic mechanisms that shape their internal discussions by monitoring, on a daily basis, the structural evolution of the semantic networks they induce. Above and beyond specifying the semantic peculiarities of the Italian electoral competition, our approach innovates studies of online political discussions in two main ways. On the one hand, it grounds semantic analysis within users' behaviors by implementing a method, rooted in statistical theory, that guarantees that our inference of socio-semantic structures is not biased by any unsupported assumption about missing information; on the other, it is completely automated as it does not rest upon any manual labelling (either based on the users' features or on their sharing patterns). These elements make our method applicable to any Twitter discussion regardless of the language or the topic addressed. △ Less

Submitted 24 June, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

Journal ref: Sci. Rep. 11 (13207) (2021)

arXiv:2003.02911 [pdf, other]

doi 10.1103/PhysRevE.101.062148

Towards a generalization of information theory for hierarchical partitions

Authors: Juan I. Perotti, Nahuel Almeira, Fabio Saracco

Abstract: Complex systems often exhibit multiple levels of organization covering a wide range of physical scales, so the study of the hierarchical decomposition of their structure and function is frequently convenient. To better understand this phenomenon, we introduce a generalization of information theory that works with hierarchical partitions. We begin revisiting the recently introduced Hierarchical Mut… ▽ More Complex systems often exhibit multiple levels of organization covering a wide range of physical scales, so the study of the hierarchical decomposition of their structure and function is frequently convenient. To better understand this phenomenon, we introduce a generalization of information theory that works with hierarchical partitions. We begin revisiting the recently introduced Hierarchical Mutual Information (HMI), and show that it can be written as a level by level summation of classical conditional mutual information terms. Then, we prove that the HMI is bounded from above by the corresponding hierarchical joint entropy. In this way, in analogy to the classical case, we derive hierarchical generalizations of many other classical information-theoretic quantities. In particular, we prove that, as opposed to its classical counterpart, the hierarchical generalization of the Variation of Information is not a metric distance, but it admits a transformation into one. Moreover, focusing on potential applications of the existing developments of the theory, we show how to adjust by chance the HMI. We also corroborate and analyze all the presented theoretical results with exhaustive numerical computations, and include an illustrative application example of the introduced formalism. Finally, we mention some open problems that should be eventually addressed for the proposed generalization of information theory to reach maturity. △ Less

Submitted 30 June, 2020; v1 submitted 27 February, 2020; originally announced March 2020.

Comments: 6 figures

Journal ref: Phys. Rev. E 101, 062148 (2020)

arXiv:2001.11805 [pdf, other]

doi 10.1038/s41598-020-76300-1

The ambiguity of nestedness under soft and hard constraints

Authors: Matteo Bruno, Fabio Saracco, Diego Garlaschelli, Claudio J. Tessone, Guido Caldarelli

Abstract: Many real networks feature the property of nestedness, i.e. the neighbours of nodes with a few connections are hierarchically nested within the neighbours of nodes with more connections. Despite the abstract simplicity of this notion, different mathematical definitions of nestedness have been proposed, sometimes giving contrasting results. Moreover, there is an ongoing debate on the statistical si… ▽ More Many real networks feature the property of nestedness, i.e. the neighbours of nodes with a few connections are hierarchically nested within the neighbours of nodes with more connections. Despite the abstract simplicity of this notion, different mathematical definitions of nestedness have been proposed, sometimes giving contrasting results. Moreover, there is an ongoing debate on the statistical significance of nestedness, since even random networks where the number of connections (degree) of each node is fixed to its empirical value are typically as nested as real-world ones. Here we propose a clarification that exploits the recent finding that random networks where the degrees are enforced as hard constraints (microcanonical ensembles) are thermodynamically different from random networks where the degrees are enforced as soft constraints (canonical ensembles). We show that if the real network is perfectly nested, then the two ensembles are trivially equivalent and the observed nestedness, independently of its definition, is indeed an unavoidable consequence of the empirical degrees. On the other hand, if the real network is not perfectly nested, then the two ensembles are not equivalent and alternative definitions of nestedness can be even positively correlated in the canonical ensemble and negatively correlated in the microcanonical one. This result disentangles distinct notions of nestedness captured by different metrics and highlights the importance of making a principled choice between hard and soft constraints in null models of ecological networks. △ Less

Submitted 17 November, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

Comments: 16 pages, 12 figures

Journal ref: Sci Rep 10, 19903 (2020)

arXiv:1905.12687 [pdf, other]

doi 10.1038/s42005-020-0340-4

The role of bot squads in the political propaganda on Twitter

Authors: Guido Caldarelli, Rocco De Nicola, Fabio Del Vigna, Marinella Petrocchi, Fabio Saracco

Abstract: Social Media are nowadays the privileged channel for information spreading and news checking. Unexpectedly for most of the users, automated accounts, also known as social bots, contribute more and more to this process of news spreading. Using Twitter as a benchmark, we consider the traffic exchanged, over one month of observation, on a specific topic, namely the migration flux from Northern Africa… ▽ More Social Media are nowadays the privileged channel for information spreading and news checking. Unexpectedly for most of the users, automated accounts, also known as social bots, contribute more and more to this process of news spreading. Using Twitter as a benchmark, we consider the traffic exchanged, over one month of observation, on a specific topic, namely the migration flux from Northern Africa to Italy. We measure the significant traffic of tweets only, by implementing an entropy-based null model that discounts the activity of users and the virality of tweets. Results show that social bots play a central role in the exchange of significant content. Indeed, not only the strongest hubs have a number of bots among their followers higher than expected, but furthermore a group of them, that can be assigned to the same political tendency, share a common set of bots as followers. The retwitting activity of such automated accounts amplifies the presence on the platform of the hubs' messages. △ Less

Submitted 29 May, 2019; originally announced May 2019.

Comments: Under Submission

Journal ref: Commun Phys 3, 81 (2020)

arXiv:1901.07933 [pdf, other]

doi 10.1057/s41599-019-0300-3

Extracting significant signal of news consumption from social networks: the case of Twitter in Italian political elections

Authors: Carolina Becatti, Guido Caldarelli, Renaud Lambiotte, Fabio Saracco

Abstract: According to the Eurobarometer report about EU media use of May 2018, the number of European citizens who consult on-line social networks for accessing information is considerably increasing. In this work we analyze approximately $10^6$ tweets exchanged during the last Italian elections. By using an entropy-based null model discounting the activity of the users, we first identify potential politic… ▽ More According to the Eurobarometer report about EU media use of May 2018, the number of European citizens who consult on-line social networks for accessing information is considerably increasing. In this work we analyze approximately $10^6$ tweets exchanged during the last Italian elections. By using an entropy-based null model discounting the activity of the users, we first identify potential political alliances within the group of verified accounts: if two verified users are retweeted more than expected by the non-verified ones, they are likely to be related. Then, we derive the users' affiliation to a coalition measuring the polarization of unverified accounts. Finally, we study the bipartite directed representation of the tweets and retweets network, in which tweets and users are collected on the two layers. Users with the highest out-degree identify the most popular ones, whereas highest out-degree posts are the most "viral". We identify significant content spreaders by statistically validating the connections that cannot be explained by users' tweeting activity and posts' virality by using an entropy-based null model as benchmark. The analysis of the directed network of validated retweets reveals signals of the alliances formed after the elections, highlighting commonalities of interests before the event of the national elections. △ Less

Submitted 23 January, 2019; originally announced January 2019.

Journal ref: Palgrave Commun 5, 91 (2019)

arXiv:1811.00418 [pdf, other]

doi 10.1371/journal.pone.0223768

Collaboration and followership: a stochastic model for activities in social networks

Authors: Carolina Becatti, Irene Crimaldi, Fabio Saracco

Abstract: In this work we investigate how future actions are influenced by the previous ones, in the specific contexts of scientific collaborations and friendships on social networks. We are not interested in modeling the process of link formation between the agents themselves, we instead describe the activity of the agents, providing a model for the formation of the bipartite network of actions and their f… ▽ More In this work we investigate how future actions are influenced by the previous ones, in the specific contexts of scientific collaborations and friendships on social networks. We are not interested in modeling the process of link formation between the agents themselves, we instead describe the activity of the agents, providing a model for the formation of the bipartite network of actions and their features. Therefore we only require to know the chronological order in which the actions are performed, and not the order in which the agents are observed. Moreover, the total number of possible features is not specified a priori but is allowed to increase along time, and new actions can independently show some new entry features or exhibit some of the old ones. The choice of the old features is driven by a degree-fitness method. With this term we mean that the probability that a new action shows one of the old features does not solely depend on the "popularity" of that feature (i.e. the number of previous actions showing it), but is also affected by some individual traits of the agents or the features themselves, synthesized in certain quantities, called "fitnesses" or "weights", that can have different forms and different meaning according to the specific setting considered. We show some theoretical properties of the model and provide statistical tools for the parameters' estimation. The model has been tested on three different datasets and the numerical results are provided and discussed. △ Less

Submitted 12 March, 2019; v1 submitted 27 October, 2018; originally announced November 2018.

Journal ref: PLoS ONE 14(10): e0223768 (2019)

arXiv:1810.05095 [pdf, other]

doi 10.1038/s42254-018-0002-6

The Statistical Physics of Real-World Networks

Authors: Giulio Cimini, Tiziano Squartini, Fabio Saracco, Diego Garlaschelli, Andrea Gabrielli, Guido Caldarelli

Abstract: In the last 15 years, statistical physics has been a very successful framework to model complex networks. On the theoretical side, this approach has brought novel insights into a variety of physical phenomena, such as self-organisation, scale invariance, emergence of mixed distributions and ensemble non-equivalence, that display unconventional features on heterogeneous networks. At the same time,… ▽ More In the last 15 years, statistical physics has been a very successful framework to model complex networks. On the theoretical side, this approach has brought novel insights into a variety of physical phenomena, such as self-organisation, scale invariance, emergence of mixed distributions and ensemble non-equivalence, that display unconventional features on heterogeneous networks. At the same time, thanks to their deep connection with information theory, statistical physics and the principle of maximum entropy have led to the definition of null models for networks reproducing some features of real-world systems, but otherwise as random as possible. We review here the statistical physics approach and the various null models for complex networks, focusing in particular on the analytic frameworks reproducing the local network features. We then show how these models have been used to detect statistically significant and predictive structural patterns in real-world networks, as well as to reconstruct the network structure in case of incomplete information. We further survey the statistical physics models that reproduce more complex, semi-local network features using Markov chain Monte Carlo sampling, as well as the models of generalised network structures such as multiplex networks, interacting networks and simplicial complexes. △ Less

Submitted 22 July, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

Comments: accepted version (after revision)

Journal ref: Nat. Rev. Phys. 1 (1), 58-71 (2019)

arXiv:1809.03222 [pdf, other]

doi 10.3390/e20100785

Colombian export capabilities: building the firms-products network

Authors: Matteo Bruno, Fabio Saracco, Tiziano Squartini, Marco Dueñas

Abstract: In this paper we analyse the bipartite Colombian firms-products network, throughout a period of five years, from 2010 to 2014. Our analysis depicts a strongly modular system, with several groups of firms specializing in the export of specific categories of products. These clusters have been detected by running the bipartite variant of the traditional modularity maximization, revealing a bi-modular… ▽ More In this paper we analyse the bipartite Colombian firms-products network, throughout a period of five years, from 2010 to 2014. Our analysis depicts a strongly modular system, with several groups of firms specializing in the export of specific categories of products. These clusters have been detected by running the bipartite variant of the traditional modularity maximization, revealing a bi-modular structure. Interestingly, this finding is refined by applying a recently-proposed algorithm for projecting bipartite networks on the layer of interest and, then, running the Louvain algorithm on the resulting monopartite representations. Important structural differences emerge upon comparing the Colombian firms-products network with the World Trade Web, in particular, the bipartite representation of the latter is not characterized by a similar block-structure, as the modularity maximization fails in revealing (bipartite) nodes clusters. This points out that economic systems behave differently at different scales: while countries tend to diversify their production --potentially exporting a large number of different products-- firms specialize in exporting (substantially very limited) baskets of basically homogeneous products. △ Less

Submitted 2 October, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

Journal ref: Entropy, 20 (10), 785 (2018)

arXiv:1805.06005 [pdf, ps, other]

doi 10.1155/2019/5120581

Reconstructing mesoscale network structures

Authors: Jeroen van Lidth de Jeude, Riccardo Di Clemente, Guido Caldarelli, Fabio Saracco, Tiziano Squartini

Abstract: When facing complex mesoscale network structures, it is generally believed that (null) models encoding the modular organization of nodes must be employed. The present paper focuses on two block structures that characterize the mesoscale organization of many real-world networks, i.e. the bow-tie and the core-periphery ones. Our analysis shows that constraining the network degree sequence is often e… ▽ More When facing complex mesoscale network structures, it is generally believed that (null) models encoding the modular organization of nodes must be employed. The present paper focuses on two block structures that characterize the mesoscale organization of many real-world networks, i.e. the bow-tie and the core-periphery ones. Our analysis shows that constraining the network degree sequence is often enough to reproduce such structures, as confirmed by model selection criteria as AIC or BIC. As a byproduct, our paper enriches the toolbox for the analysis of bipartite networks - still far from being complete. The aforementioned structures, in fact, partition the networks into asymmetric blocks characterized by binary, directed connections, thus calling for the extension of a recently-proposed method to randomize undirected, bipartite networks to the directed case. △ Less

Submitted 23 December, 2018; v1 submitted 15 May, 2018; originally announced May 2018.

Comments: 13 pages, 7 figures, accepted by the journal Complexity

Journal ref: Complexity Volume 2019, Article ID 5120581

arXiv:1805.04307 [pdf, other]

Maximum entropy approach to link prediction in bipartite networks

Authors: M. Baltakiene, K. Baltakys, D. Cardamone, F. Parisi, T. Radicioni, M. Torricelli, J. A. van Lidth de Jeude, F. Saracco

Abstract: Within network analysis, the analytical maximum entropy framework has been very successful for different tasks as network reconstruction and filtering. In a recent paper, the same framework was used for link-prediction for monopartite networks: link probabilities for all unobserved links in a graph are provided and the most probable links are selected. Here we propose the extension of such an appr… ▽ More Within network analysis, the analytical maximum entropy framework has been very successful for different tasks as network reconstruction and filtering. In a recent paper, the same framework was used for link-prediction for monopartite networks: link probabilities for all unobserved links in a graph are provided and the most probable links are selected. Here we propose the extension of such an approach to bipartite graphs. We test our method on two real world networks with different topological characteristics. Our performances are compared to state-of-the-art methods, and the results show that our entropy-based approach has a good overall performance. △ Less

Submitted 11 May, 2018; originally announced May 2018.

Comments: 7 pages, 3 figures. This work is the output of the Complexity72h workshop (https://complexity72h.weebly.com/), held at IMT School for Advanced Studies in Lucca, 7-11 May 2018

arXiv:1805.00717 [pdf, other]

doi 10.1103/PhysRevE.99.022306

Entropy-based randomisation of rating networks

Authors: Carolina Becatti, Guido Caldarelli, Fabio Saracco

Abstract: In the last years, due to the great diffusion of e-commerce, online rating platforms quickly became a common tool for purchase recommendations. However, instruments for their analysis did not evolve at the same speed. Indeed, interesting information about users' habits and tastes can be recovered just considering the bipartite network of users and products, in which links have different weights du… ▽ More In the last years, due to the great diffusion of e-commerce, online rating platforms quickly became a common tool for purchase recommendations. However, instruments for their analysis did not evolve at the same speed. Indeed, interesting information about users' habits and tastes can be recovered just considering the bipartite network of users and products, in which links have different weights due to the score assigned to items. With respect to other weighted bipartite networks, in these systems we observe a maximum possible weight per link, that limits the variability of the outcomes. In the present article we propose an entropy-based randomisation of (bipartite) rating networks by extending the Configuration Model framework: the randomised network satisfies the constraints of the degree per rating, i.e. the number of given ratings received by the specified product or assigned by the single user. We first show that such a null model is able to reproduce several non-trivial features of the real network better than other null models. Then, using it as a benchmark, we project the information contained in the real system on one of the layers, showing, for instance, the division in communities of music albums due to the taste of customers, or, in movies due the audience. △ Less

Submitted 2 May, 2018; originally announced May 2018.

Comments: 12 pages, 30 figures

Journal ref: Phys. Rev. E 99, 022306 (2019)

arXiv:1710.10143 [pdf, other]

doi 10.1007/s10955-018-2039-4

From Ecology to Finance (and Back?): Recent Advancements in the Analysis of Bipartite Networks

Authors: Mika J. Straka, Guido Caldarelli, Tiziano Squartini, Fabio Saracco

Abstract: Bipartite networks provide an insightful representation of many systems, ranging from mutualistic networks of species interactions to investment networks in finance. The analysis of their topological structures has revealed the ubiquitous presence of properties which seem to characterize many - apparently different - systems. Nestedness, for example, has been observed in plants-pollinator as well… ▽ More Bipartite networks provide an insightful representation of many systems, ranging from mutualistic networks of species interactions to investment networks in finance. The analysis of their topological structures has revealed the ubiquitous presence of properties which seem to characterize many - apparently different - systems. Nestedness, for example, has been observed in plants-pollinator as well as in country-product trade networks. This has raised questions about the significance of these patterns, which are often believed to constitute a genuine signature of self-organization. Here, we review several methods that have been developed for the analysis of such evidence. Due to the interdisciplinary character of complex networks, tools developed in one field, for example ecology, can greatly enrich other areas of research, such as economy and finance, and vice versa. With this in mind, we briefly review several entropy-based bipartite null models that have been recently proposed and discuss their application to several real-world systems. The focus on these models is motivated by the fact that they show three very desirable features: analytical character, general applicability and versatility. In this respect, entropy-based methods have been proven to perform satisfactorily both in providing benchmarks for testing evidence-based null hypotheses and in reconstructing unknown network configurations from partial information. On top of that, entropy-based models have been successfully employed to analyze ecological as well as economic systems, thus representing an ideal, interdisciplinary tool to approach the study of bipartite complex systems. [...] △ Less

Submitted 27 October, 2017; originally announced October 2017.

Comments: 26 pages, 12 Figures

Journal ref: J. Stat. Phys. (2018)

arXiv:1703.04090 [pdf, other]

doi 10.1103/PhysRevE.96.022306

Grand canonical validation of the bipartite International Trade Network

Authors: Mika J. Straka, Guido Caldarelli, Fabio Saracco

Abstract: Devising strategies for economic development in a globally competitive landscape requires a solid and unbiased understanding of countries technological advancements and similarities among export products. Both can be addressed through the bipartite representation of the International Trade Network. In the present paper, we apply the recently proposed grand canonical projection algorithm to uncover… ▽ More Devising strategies for economic development in a globally competitive landscape requires a solid and unbiased understanding of countries technological advancements and similarities among export products. Both can be addressed through the bipartite representation of the International Trade Network. In the present paper, we apply the recently proposed grand canonical projection algorithm to uncover country and product communities. Contrary to past endeavors, our methodology, based on information theory, creates monopartite projections in an unbiased and analytically tractable way. Single links between countries or products represent statistically significant signals, which are not accounted for by null-models such as the Bipartite Configuration Model. We find stable country communities reflecting the socioeconomic distinction in developed, newly industrialized, and develo** countries. Furthermore, we observe product clusters based on the aforementioned country groups. Our analysis reveals the existence of a complicate structure in the bipartite International Trade Network: apart from the diversification of export baskets from the most basic to the most exclusive products, we observe a statistically significant signal of an export specialization mechanism towards more sophisticated products. △ Less

Submitted 31 August, 2017; v1 submitted 12 March, 2017; originally announced March 2017.

Comments: 15 pages, 10 figures

Journal ref: Phys. Rev. E 96, 022306 (2017)

arXiv:1607.02481 [pdf, other]

doi 10.1088/1367-2630/aa6b38

Inferring monopartite projections of bipartite networks: an entropy-based approach

Authors: Fabio Saracco, Mika J. Straka, Riccardo Di Clemente, Andrea Gabrielli, Guido Caldarelli, Tiziano Squartini

Abstract: Bipartite networks are currently regarded as providing a major insight into the organization of many real-world systems, unveiling the mechanisms driving the interactions occurring between distinct groups of nodes. One of the most important issues encountered when modeling bipartite networks is devising a way to obtain a (monopartite) projection on the layer of interest, which preserves as much as… ▽ More Bipartite networks are currently regarded as providing a major insight into the organization of many real-world systems, unveiling the mechanisms driving the interactions occurring between distinct groups of nodes. One of the most important issues encountered when modeling bipartite networks is devising a way to obtain a (monopartite) projection on the layer of interest, which preserves as much as possible the information encoded into the original bipartite structure. In the present paper we propose an algorithm to obtain statistically-validated projections of bipartite networks, according to which any two nodes sharing a statistically-significant number of neighbors are linked. Since assessing the statistical significance of nodes similarity requires a proper statistical benchmark, here we consider a set of four null models, defined within the exponential random graph framework. Our algorithm outputs a matrix of link-specific p-values, from which a validated projection is straightforwardly obtainable, upon running a multiple hypothesis testing procedure. Finally, we test our method on an economic network (i.e. the countries-products World Trade Web representation) and a social network (i.e. MovieLens, collecting the users' ratings of a list of movies). In both cases non-trivial communities are detected: while projecting the World Trade Web on the countries layer reveals modules of similarly-industrialized nations, projecting it on the products layer allows communities characterized by an increasing level of complexity to be detected; in the second case, projecting MovieLens on the films layer allows clusters of movies whose affinity cannot be fully accounted for by genre similarity to be individuated. △ Less

Submitted 17 May, 2017; v1 submitted 8 July, 2016; originally announced July 2016.

Comments: 16 pages, 9 figures

Journal ref: New J. Phys. 19, 053022 (2017)

arXiv:1508.03571 [pdf, other]

doi 10.1371/journal.pone.0140420

From innovation to diversification: a simple competitive model

Authors: Fabio Saracco, Riccardo Di Clemente, Andrea Gabrielli, Luciano Pietronero

Abstract: Few attempts have been proposed in order to describe the statistical features and historical evolution of the export bipartite matrix countries/products. An important standpoint is the introduction of a products network, namely a hierarchical forest of products that models the formation and the evolution of commodities. In the present article, we propose a simple dynamical model where countries co… ▽ More Few attempts have been proposed in order to describe the statistical features and historical evolution of the export bipartite matrix countries/products. An important standpoint is the introduction of a products network, namely a hierarchical forest of products that models the formation and the evolution of commodities. In the present article, we propose a simple dynamical model where countries compete with each other to acquire the ability to produce and export new products. Countries will have two possibilities to expand their export: innovating, i.e. introducing new goods, namely new nodes in the product networks, or copying the productive process of others, i.e. occupying a node already present in the same network. In this way, the topology of the products network and the country-product matrix evolve simultaneously, driven by the countries push toward innovation. △ Less

Submitted 6 November, 2015; v1 submitted 14 August, 2015; originally announced August 2015.

Comments: 8 figures, 8 tables

Journal ref: PloS One 10(11): e0140420 (2015)

arXiv:1508.03533 [pdf, other]

doi 10.1038/srep30286

Detecting early signs of the 2007-2008 crisis in the world trade

Authors: Fabio Saracco, Riccardo Di Clemente, Andrea Gabrielli, Tiziano Squartini

Abstract: Since 2007, several contributions have tried to identify early-warning signals of the financial crisis. However, the vast majority of analyses has focused on financial systems and little theoretical work has been done on the economic counterpart. In the present paper we fill this gap and employ the theoretical tools of network theory to shed light on the response of world trade to the financial cr… ▽ More Since 2007, several contributions have tried to identify early-warning signals of the financial crisis. However, the vast majority of analyses has focused on financial systems and little theoretical work has been done on the economic counterpart. In the present paper we fill this gap and employ the theoretical tools of network theory to shed light on the response of world trade to the financial crisis of 2007 and the economic recession of 2008-2009. We have explored the evolution of the bipartite World Trade Web (WTW) across the years 1995-2010, monitoring the behavior of the system both before and after 2007. Our analysis shows early structural changes in the WTW topology: since 2003, the WTW becomes increasingly compatible with the picture of a network where correlations between countries and products are progressively lost. Moreover, the WTW structural modification can be considered as concluded in 2010, after a seemingly stationary phase of three years. We have also refined our analysis by considering specific subsets of countries and products: the most statistically significant early-warning signals are provided by the most volatile macrosectors, especially when measured on develo** countries, suggesting the emerging economies as being the most sensitive ones to the global economic cycles. △ Less

Submitted 8 July, 2016; v1 submitted 31 July, 2015; originally announced August 2015.

Comments: 18 pages, 9 figures

Journal ref: Sci. Rep. 6 (30286) (2016)

arXiv:1503.05098 [pdf, other]

doi 10.1038/srep10595

Randomizing bipartite networks: the case of the World Trade Web

Authors: Fabio Saracco, Riccardo Di Clemente, Andrea Gabrielli, Tiziano Squartini

Abstract: Within the last fifteen years, network theory has been successfully applied both to natural sciences and to socioeconomic disciplines. In particular, bipartite networks have been recognized to provide a particularly insightful representation of many systems, ranging from mutualistic networks in ecology to trade networks in economy, whence the need of a pattern detection-oriented analysis in order… ▽ More Within the last fifteen years, network theory has been successfully applied both to natural sciences and to socioeconomic disciplines. In particular, bipartite networks have been recognized to provide a particularly insightful representation of many systems, ranging from mutualistic networks in ecology to trade networks in economy, whence the need of a pattern detection-oriented analysis in order to identify statistically-significant structural properties. Such an analysis rests upon the definition of suitable null models, i.e. upon the choice of the portion of network structure to be preserved while randomizing everything else. However, quite surprisingly, little work has been done so far to define null models for real bipartite networks. The aim of the present work is to fill this gap, extending a recently-proposed method to randomize monopartite networks to bipartite networks. While the proposed formalism is perfectly general, we apply our method to the binary, undirected, bipartite representation of the World Trade Web, comparing the observed values of a number of structural quantities of interest with the expected ones, calculated via our randomization procedure. Interestingly, the behavior of the World Trade Web in this new representation is strongly different from the monopartite analogue, showing highly non-trivial patterns of self-organization. △ Less

Submitted 6 June, 2015; v1 submitted 17 March, 2015; originally announced March 2015.

Comments: 22 pages, 13 figures

Journal ref: Sci. Rep. 5 (10595) (2015)

arXiv:1305.2929 [pdf, other]

doi 10.1103/PhysRevD.88.045018

Topological resolution of gauge theory singularities

Authors: Fabio Saracco, Alessandro Tomasiello, Gonzalo Torroba

Abstract: Some gauge theories with Coulomb branches exhibit singularities in perturbation theory, which are usually resolved by nonperturbative physics. In string theory this corresponds to the resolution of timelike singularities near the core of orientifold planes by effects from F or M theory. We propose a new mechanism for resolving Coulomb branch singularities in three dimensional gauge theories, based… ▽ More Some gauge theories with Coulomb branches exhibit singularities in perturbation theory, which are usually resolved by nonperturbative physics. In string theory this corresponds to the resolution of timelike singularities near the core of orientifold planes by effects from F or M theory. We propose a new mechanism for resolving Coulomb branch singularities in three dimensional gauge theories, based on Chern-Simons interactions. This is illustrated in a supersymmetric SU(2) Yang-Mills-Chern-Simons theory. We calculate the one loop corrections to the Coulomb branch of this theory and find a result that interpolates smoothly between the high energy metric (that would exhibit the singularity) and a regular singularity-free low energy result. We suggest possible applications to singularity resolution in string theory and speculate a relationship to a similar phenomenon for the orientifold six-plane in massive IIA supergravity. △ Less

Submitted 11 June, 2013; v1 submitted 13 May, 2013; originally announced May 2013.

Comments: 24 pages, 1 figure. Revised version, references added

arXiv:1201.5378 [pdf, other]

doi 10.1007/JHEP07(2012)077

Localized O6-plane solutions with Romans mass

Authors: Fabio Saracco, Alessandro Tomasiello

Abstract: Orientifold solutions have an unphysical region around their source; for the O6, the singularity is resolved in M-theory by the Atiyah-Hitchin metric. Massive IIA, however, does not admit an eleven-dimensional lift, and one wonders what happens to the O6 there. In this paper, we find evidence for the existence of localized (unsmeared) O6 solutions in presence of Romans mass, in the context of four… ▽ More Orientifold solutions have an unphysical region around their source; for the O6, the singularity is resolved in M-theory by the Atiyah-Hitchin metric. Massive IIA, however, does not admit an eleven-dimensional lift, and one wonders what happens to the O6 there. In this paper, we find evidence for the existence of localized (unsmeared) O6 solutions in presence of Romans mass, in the context of four-dimensional compactifications. As a first step, we show that for generic supersymmetric compactifications, the Bianchi identity for the F_4 RR field follows from constancy of F_0. Using this, we find a procedure to deform any O6-D6 Minkowski compactification at first order in F_0. For a single O6, some of the symmetries of the massless solution are broken, but what is left is still enough to obtain a system of ODEs with as many variables as equations. Numerical analysis indicates that Romans mass makes the unphysical region disappear. △ Less

Submitted 25 January, 2012; originally announced January 2012.

Comments: 38 pages, 1 figure

arXiv:0911.5396 [pdf, ps, other]

doi 10.1103/PhysRevD.82.023528

Non-linear Matter Spectra in Coupled Quintessence

Authors: F. Saracco, M. Pietroni, N. Tetradis, V. Pettorino, G. Robbers

Abstract: We consider cosmologies in which a dark-energy scalar field interacts with cold dark matter. The growth of perturbations is followed beyond the linear level by means of the time-renormalization-group method, which is extended to describe a multi-component matter sector. Even in the absence of the extra interaction, a scale-dependent bias is generated as a consequence of the different initial condi… ▽ More We consider cosmologies in which a dark-energy scalar field interacts with cold dark matter. The growth of perturbations is followed beyond the linear level by means of the time-renormalization-group method, which is extended to describe a multi-component matter sector. Even in the absence of the extra interaction, a scale-dependent bias is generated as a consequence of the different initial conditions for baryons and dark matter after decoupling. The effect is enhanced significantly by the extra coupling and can be at the 2-3 percent level in the range of scales of baryonic acoustic oscillations. We compare our results with N-body simulations, finding very good agreement. △ Less

Submitted 10 April, 2014; v1 submitted 28 November, 2009; originally announced November 2009.

Comments: 20 pages, 6 figures, typo corrected

Journal ref: Phys.Rev.D82:023528,2010

Showing 1–42 of 42 results for author: Saracco, F