-
Anatomy of Elite and Mass Polarization in Social Networks
Authors:
Ali Salloum,
Ted Hsuan Yun Chen,
Mikko Kivelä
Abstract:
Existing methods for quantifying polarization in social networks typically report a single value describing the amount of polarization in a social system. While this approach can be used to confirm the observation that many societies have witnessed an increase in political polarization in recent years, it misses the complexities that could be used to understand the reasons behind this phenomenon.…
▽ More
Existing methods for quantifying polarization in social networks typically report a single value describing the amount of polarization in a social system. While this approach can be used to confirm the observation that many societies have witnessed an increase in political polarization in recent years, it misses the complexities that could be used to understand the reasons behind this phenomenon. Notably, opposing groups can have unequal impact on polarization, and the elites are often understood to be more divided than the masses, making it critical to differentiate their roles in polarized systems. We propose a method to characterize these distinct hierarchies in polarized networks, enabling separate polarization measurements for these groups within a single social system. Applied to polarized topics in the Finnish Twittersphere surrounding the 2019 and 2023 parliamentary elections, our analysis reveals valuable insights: 1) The impact of opposing groups on observed polarization is rarely balanced, and 2) while the elite strongly contributes to structural polarization and consistently display greater alignment across various topics, the masses have also recently experienced a surge in issue alignment, a special form of polarization. Our findings suggest that the masses may not be as immune to an increasingly polarized environment as previously thought.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Complex coalitions: political alliances across relational contexts
Authors:
Arttu Malkamäki,
Ted Hsuan Yun Chen,
Antti Gronow,
Mikko Kivelä,
Juho Vesa,
Tuomas Ylä-Anttila
Abstract:
Coalitions are central to politics, including government formation, international relations, and public policy. Coalitions emerge when actors engage one another across multiple relational contexts, but existing literature often approaches coalitions in singular contexts. We introduce complex coalitions, a theoretical-methodological framework that emphasises the relevance of multiple contexts and c…
▽ More
Coalitions are central to politics, including government formation, international relations, and public policy. Coalitions emerge when actors engage one another across multiple relational contexts, but existing literature often approaches coalitions in singular contexts. We introduce complex coalitions, a theoretical-methodological framework that emphasises the relevance of multiple contexts and cross-context dependencies in coalition politics. We also implement tools to statistically infer such coalition structures using multilayer networks. To demonstrate the usefulness of our approach, we compare coalitions among Finnish organisations engaging in climate politics across three con-texts: resource coordination, legacy media discourse, and social media communication. We show that considering coalitions as complex and accounting for cross-context dependencies improves the empirical validity of coalition studies. In our case study, the three contexts represent complementary, but not congruent, channels for enacting coalitions. In conclusion, we argue that the complex coalitions approach is useful for advancing understanding of coalitions in different political realms.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Subnetwork enumeration algorithms for multilayer networks
Authors:
Tarmo Nurmi,
Mikko Kivelä
Abstract:
To understand the structure of a network, it can be useful to break it down into its constituent pieces. This is the approach taken in a multitude of successful network analysis methods, such as motif analysis. These methods require one to enumerate or sample small connected subgraphs of a network, which can be computationally intractable if naive methods are used. Efficient algorithms exists for…
▽ More
To understand the structure of a network, it can be useful to break it down into its constituent pieces. This is the approach taken in a multitude of successful network analysis methods, such as motif analysis. These methods require one to enumerate or sample small connected subgraphs of a network, which can be computationally intractable if naive methods are used. Efficient algorithms exists for both enumeration and uniform sampling of subgraphs, and here we generalize the ESU algorithm for a very general notion of multilayer networks. We show that multilayer network subnetwork enumeration introduces nontrivial complications to the existing algorithm, and present two different generalized algorithms that preserve the desired features of unbiased sampling and trivial parallelization. We evaluate these algorithms in synthetic networks and with real-world data, and show that neither of the algorithms is strictly more efficient but rather the choice depends on the features of the data. Having a general algorithm for finding subnetworks makes advanced multilayer network analysis possible, and enables researchers to apply a variety of methods to previously difficult-to-handle multilayer networks in a variety of domains and across many different types of multilayer networks.
△ Less
Submitted 31 July, 2023;
originally announced August 2023.
-
OVNS: Opportunistic Variable Neighborhood Search for Heaviest Subgraph Problem in Social Networks
Authors:
Ville P. Saarinen,
Ted Hsuan Yun Chen,
Mikko Kivelä
Abstract:
We propose a hybrid heuristic algorithm for solving the Heaviest k-Subgraph Problem in online social networks -- a combinatorial graph optimization problem central to many important applications in weighted social networks, including detection of coordinated behavior, maximizing diversity of a group of users, and detecting social groups. Our approach builds upon an existing metaheuristic framework…
▽ More
We propose a hybrid heuristic algorithm for solving the Heaviest k-Subgraph Problem in online social networks -- a combinatorial graph optimization problem central to many important applications in weighted social networks, including detection of coordinated behavior, maximizing diversity of a group of users, and detecting social groups. Our approach builds upon an existing metaheuristic framework known as Variable Neighborhood Search and takes advantage of empirical insights about social network structures to derive an improved optimization heuristic. We conduct benchmarks in both real life social networks as well as synthetic networks and demonstrate that the proposed modifications match and in the majority of cases supersede those of the current state-of-the-art approaches.
△ Less
Submitted 31 May, 2023;
originally announced May 2023.
-
The Russian invasion of Ukraine selectively depolarized the Finnish NATO discussion
Authors:
Yan Xia,
Antti Gronow,
Arttu Malkamäki,
Tuomas Ylä-Anttila,
Barbara Keller,
Mikko Kivelä
Abstract:
The Russian invasion of Ukraine in 2022 dramatically reshaped the European security landscape. In Finland, public opinion on NATO had long been polarized along the left-right partisan axis, but the invasion led to a rapid convergence of the opinion toward joining NATO. We investigate whether and how this depolarization took place among polarized actors on Finnish Twitter. By analyzing retweeting p…
▽ More
The Russian invasion of Ukraine in 2022 dramatically reshaped the European security landscape. In Finland, public opinion on NATO had long been polarized along the left-right partisan axis, but the invasion led to a rapid convergence of the opinion toward joining NATO. We investigate whether and how this depolarization took place among polarized actors on Finnish Twitter. By analyzing retweeting patterns, we find three separated user groups before the invasion: a pro-NATO, a left-wing anti-NATO, and a conspiracy-charged anti-NATO group. After the invasion, the left-wing anti-NATO group members broke out of their retweeting bubble and connected with the pro-NATO group despite their difference in partisanship, while the conspiracy-charged anti-NATO group mostly remained a separate cluster. Our content analysis reveals that the left-wing anti-NATO group and the pro-NATO group were bridged by a shared condemnation of Russia's actions and shared democratic norms, while the other anti-NATO group, mainly built around conspiracy theories and disinformation, consistently demonstrated a clear anti-NATO attitude. We show that an external threat can bridge partisan divides in issues linked to the threat, but bubbles upheld by conspiracy theories and disinformation may persist even under dramatic external threats.
△ Less
Submitted 24 July, 2023; v1 submitted 15 December, 2022;
originally announced December 2022.
-
Reticula: A temporal network and hypergraph analysis software package
Authors:
Arash Badie-Modiri,
Mikko Kivelä
Abstract:
In the last decade, temporal networks and static and temporal hypergraphs have enabled modelling connectivity and spreading processes in a wide array of real-world complex systems such as economic transactions, information spreading, brain activity and disease spreading. In this manuscript, we present the Reticula C++ library and Python package: A comprehensive suite of tools for working with real…
▽ More
In the last decade, temporal networks and static and temporal hypergraphs have enabled modelling connectivity and spreading processes in a wide array of real-world complex systems such as economic transactions, information spreading, brain activity and disease spreading. In this manuscript, we present the Reticula C++ library and Python package: A comprehensive suite of tools for working with real-world and synthetic static and temporal networks and hypergraphs. This includes various methods of creating synthetic networks and randomised null models based on real-world data, calculating reachability and simulating compartmental models on networks. The library is designed principally on an extensible, cache-friendly representation of networks, with an aim of easing multi-thread use in the high-performance computing environment.
△ Less
Submitted 11 June, 2023; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Communication Now and Then: Analyzing the Republic of Letters as a Communication Network
Authors:
Javier Ureña-Carrion,
Petri Leskinen,
Jouni Tuominen,
Charles van den Heuvel,
Eero Hyvönen,
Mikko Kivelä
Abstract:
Huge advances in understanding patterns of human communication, and the underlying social networks where it takes place, have been made recently using massive automatically recorded data sets from digital communication, such as emails and phone calls. However, it is not clear to what extent these results on human behaviour are artefacts of contemporary communication technology and culture and if t…
▽ More
Huge advances in understanding patterns of human communication, and the underlying social networks where it takes place, have been made recently using massive automatically recorded data sets from digital communication, such as emails and phone calls. However, it is not clear to what extent these results on human behaviour are artefacts of contemporary communication technology and culture and if the fundamental patterns in communication have changed over history. This paper presents an analysis of historical epistolary metadata with the aim of comparing the underlying historical communication patterns with those of contemporary communication. Our work uses a new epistolary dataset containing metadata on over 150 000 letters sent between the 16th and 19th centuries. The analyses indicate striking resemblances between contemporary and epistolary communication network patterns, including dyadic interactions and ego-level behaviour. Despite these positive findings, certain aspects of the letter datasets are insufficient to corroborate other similarities or differences for these communication networks.
△ Less
Submitted 8 December, 2021;
originally announced December 2021.
-
Applicability of Multilayer Diffusion Network Inference to Social Media Data
Authors:
Yan Xia,
Ted Hsuan Yun Chen,
Mikko Kivelä
Abstract:
Information on social media spreads through an underlying diffusion network that connects people of common interests and opinions. This diffusion network often comprises multiple layers, each capturing the spreading dynamics of a certain type of information characterized by, for example, topic, attitude, or language. Researchers have previously proposed methods to infer these underlying multilayer…
▽ More
Information on social media spreads through an underlying diffusion network that connects people of common interests and opinions. This diffusion network often comprises multiple layers, each capturing the spreading dynamics of a certain type of information characterized by, for example, topic, attitude, or language. Researchers have previously proposed methods to infer these underlying multilayer diffusion networks from observed spreading patterns, but little is known about how well these methods perform across the range of realistic spreading data. In this paper, we first introduce an effective implementation of the inference method that can achieve higher accuracy than existing implementations in comparable runtime. Then, we conduct an extensive series of synthetic data experiments to systematically analyze the performance of the method, under varied network structure (e.g. density, number of layers) and information diffusion settings (e.g. cascade size, layer mixing) that are designed to mimic real-world spreading on social media. Our findings include that the inference accuracy varies extremely with network density, and that the method fails to decompose the diffusion network correctly when most cascades in the data reach a limited audience. In demonstrating the conditions under which the inference accuracy is extremely low, our paper highlights the need to carefully evaluate the applicability of the method before running the inference on real data. Practically, our results serve as a reference for this evaluation, and our publicly available implementation supports further testing under personalized settings.
△ Less
Submitted 11 November, 2021;
originally announced November 2021.
-
Directed Percolation in Random Temporal Network Models with Heterogeneities
Authors:
Arash Badie-Modiri,
Abbas K. Rizi,
Márton Karsai,
Mikko Kivelä
Abstract:
The event graph representation of temporal networks suggests that the connectivity of temporal structures can be mapped to a directed percolation problem. However, similar to percolation theory on static networks, this map** is valid under the approximation that the structure and interaction dynamics of the temporal network are determined by its local properties, and otherwise, it is maximally r…
▽ More
The event graph representation of temporal networks suggests that the connectivity of temporal structures can be mapped to a directed percolation problem. However, similar to percolation theory on static networks, this map** is valid under the approximation that the structure and interaction dynamics of the temporal network are determined by its local properties, and otherwise, it is maximally random. We challenge these conditions and demonstrate the robustness of this map** in case of more complicated systems. We systematically analyze random and regular network topologies and heterogeneous link-activation processes driven by bursty renewal or self-exciting processes using numerical simulation and finite-size scaling methods. We find that the critical percolation exponents characterizing the temporal network are not sensitive to many structural and dynamical network heterogeneities, while they recover known scaling exponents characterizing directed percolation on low dimensional lattices. While it is not possible to demonstrate the validity of this map** for all temporal network models, our results establish the first batch of evidence supporting the robustness of the scaling relationships in the limited-time reachability of temporal networks.
△ Less
Submitted 11 June, 2023; v1 submitted 14 October, 2021;
originally announced October 2021.
-
Directed Percolation in Temporal Networks
Authors:
Arash Badie-Modiri,
Abbas K. Rizi,
Márton Karsai,
Mikko Kivelä
Abstract:
Connectivity and reachability on temporal networks, which can describe the spreading of a disease, decimation of information or the accessibility of a public transport system over time, have been among the main contemporary areas of study in complex systems for the last decade. However, while isotropic percolation theory successfully describes connectivity in static networks, a similar description…
▽ More
Connectivity and reachability on temporal networks, which can describe the spreading of a disease, decimation of information or the accessibility of a public transport system over time, have been among the main contemporary areas of study in complex systems for the last decade. However, while isotropic percolation theory successfully describes connectivity in static networks, a similar description has not been yet developed for temporal networks. Here address this problem and formalize a map** of the concept of temporal network reachability to percolation theory. We show that the limited-waiting-time reachability, a generic notion of constrained connectivity in temporal networks, displays directed percolation phase transition in connectivity. Consequently, the critical percolation properties of spreading processes on temporal networks can be estimated by a set of known exponents characterising the directed percolation universality class. This result is robust across a diverse set of temporal network models with different temporal and topological heterogeneities, while by using our methodology we uncover similar reachability phase transitions in real temporal networks too. These findings open up an avenue to apply theory, concepts and methodology from the well-developed directed percolation literature to temporal networks.
△ Less
Submitted 11 June, 2023; v1 submitted 3 July, 2021;
originally announced July 2021.
-
Graphlets in multilayer networks
Authors:
Sallamari Sallmen,
Tarmo Nurmi,
Mikko Kivelä
Abstract:
Representing various networked data as multiplex networks, networks of networks and other multilayer networks can reveal completely new types of structures in these system. We introduce a general and principled graphlet framework for multilayer networks which allows one to break any multilayer network into small multilayered building blocks. These multilayer graphlets can be either analyzed themse…
▽ More
Representing various networked data as multiplex networks, networks of networks and other multilayer networks can reveal completely new types of structures in these system. We introduce a general and principled graphlet framework for multilayer networks which allows one to break any multilayer network into small multilayered building blocks. These multilayer graphlets can be either analyzed themselves or used to do tasks such as comparing different systems. The method is flexible in terms of multilayer isomorphism, automorphism orbit definition, and the type of multilayer network. We illustrate our method for multiplex networks and show how it can be used to distinguish networks produced with multiple models from each other in an unsupervised way. In addition, we include an automatic way of generating the hundreds of dependency equations between the orbit counts needed to remove redundant orbit counts. The framework introduced here allows one to analyze multilayer networks with versatile semantics, and these methods can thus be used to analyze the structural building blocks of myriad multilayer networks.
△ Less
Submitted 24 June, 2021;
originally announced June 2021.
-
Epidemic Spreading and Digital Contact Tracing: Effects of Heterogeneous Mixing and Quarantine Failures
Authors:
Abbas K. Rizi,
Ali Faqeeh,
Arash Badie-Modiri,
Mikko Kivelä
Abstract:
Contact tracing via digital tracking applications installed on mobile phones is an important tool for controlling epidemic spreading. Its effectivity can be quantified by modifying the standard methodology for analyzing percolation and connectivity of contact networks. We apply this framework to networks with varying degree distributions, numbers of application users, and probabilities of quaranti…
▽ More
Contact tracing via digital tracking applications installed on mobile phones is an important tool for controlling epidemic spreading. Its effectivity can be quantified by modifying the standard methodology for analyzing percolation and connectivity of contact networks. We apply this framework to networks with varying degree distributions, numbers of application users, and probabilities of quarantine failures. Further, we study structured populations with homophily and heterophily and the possibility of degree-targeted application distribution. Our results are based on a combination of explicit simulations and mean-field analysis. They indicate that there can be major differences in the epidemic size and epidemic probabilities which are equivalent in the normal SIR processes. Further, degree heterogeneity is seen to be especially important for the epidemic threshold but not as much for the epidemic size. The probability that tracing leads to quarantines is not as important as the application adoption rate. Finally, both strong homophily and especially heterophily with regard to application adoption can be detrimental. Overall, epidemic dynamics are very sensitive to all of the parameter values we tested out, which makes the problem of estimating the effect of digital contact tracing an inherently multidimensional problem.
△ Less
Submitted 19 April, 2022; v1 submitted 23 March, 2021;
originally announced March 2021.
-
Separating Polarization from Noise: Comparison and Normalization of Structural Polarization Measures
Authors:
Ali Salloum,
Ted Hsuan Yun Chen,
Mikko Kivelä
Abstract:
Quantifying the amount of polarization is crucial for understanding and studying political polarization in political and social systems. Several methods are used commonly to measure polarization in social networks by purely inspecting their structure. We analyse eight of such methods and show that all of them yield high polarization scores even for random networks with similar density and degree d…
▽ More
Quantifying the amount of polarization is crucial for understanding and studying political polarization in political and social systems. Several methods are used commonly to measure polarization in social networks by purely inspecting their structure. We analyse eight of such methods and show that all of them yield high polarization scores even for random networks with similar density and degree distributions to typical real-world networks. Further, some of the methods are sensitive to degree distributions and relative sizes of the polarized groups. We propose normalization to the existing scores and a minimal set of tests that a score should pass in order for it to be suitable for separating polarized networks from random noise. The performance of the scores increased by 38%-220% after normalization in a classification task of 203 networks. Further, we find that the choice of method is not as important as normalization, after which most of the methods have better performance than the best-performing method before normalization. This work opens up the possibility to critically assess and compare the features and performance of different methods for measuring structural polarization.
△ Less
Submitted 9 December, 2021; v1 submitted 18 January, 2021;
originally announced January 2021.
-
Spread of Tweets in Climate Discussions
Authors:
Yan Xia,
Ted Hsuan Yun Chen,
Mikko Kivelä
Abstract:
Characterising the spreading of ideas within echo chambers is essential for understanding polarisation. In this paper, we explore the characteristics of popular and viral content in climate change discussions on Twitter around the 2019 announcement of the Nobel Peace Prize, where we find the retweet network of users to be polarised into two well-separated groups of activists and sceptics. Operatio…
▽ More
Characterising the spreading of ideas within echo chambers is essential for understanding polarisation. In this paper, we explore the characteristics of popular and viral content in climate change discussions on Twitter around the 2019 announcement of the Nobel Peace Prize, where we find the retweet network of users to be polarised into two well-separated groups of activists and sceptics. Operationalising popularity as the number of retweets and virality as the spreading probability inferred using an independent cascade model, we find that the viral themes echo and differ from the popular themes in interesting ways. Most importantly, we find that the most viral themes in the two groups reflect different types of bonds that tie the community together, yet both function to enhance ingroup connections while repulsing outgroup engagement. With this, our study sheds light, from an information spreading perspective, on the formation and upkeep of echo chambers in climate discussions.
△ Less
Submitted 28 August, 2021; v1 submitted 19 October, 2020;
originally announced October 2020.
-
Privacy and Uniqueness of Neighborhoods in Social Networks
Authors:
Daniele Romanini,
Sune Lehmann,
Mikko Kivelä
Abstract:
The ability to share social network data at the level of individual connections is beneficial to science: not only for reproducing results, but also for researchers who may wish to use it for purposes not foreseen by the data releaser. Sharing such data, however, can lead to serious privacy issues, because individuals could be re-identified, not only based on possible nodes' attributes, but also f…
▽ More
The ability to share social network data at the level of individual connections is beneficial to science: not only for reproducing results, but also for researchers who may wish to use it for purposes not foreseen by the data releaser. Sharing such data, however, can lead to serious privacy issues, because individuals could be re-identified, not only based on possible nodes' attributes, but also from the structure of the network around them. The risk associated with re-identification can be measured and it is more serious in some networks than in others. Various optimization algorithms have been proposed to anonymize the network while kee** the number of changes minimal. However, existing algorithms do not provide guarantees on where the changes will be made, making it difficult to quantify their effect on various measures. Using network models and real data, we show that the average degree of networks is a crucial parameter for the severity of re-identification risk from nodes' neighborhoods. Dense networks are more at risk, and, apart from a small band of average degree values, either almost all nodes are re-identifiable or they are all safe. Our results allow researchers to assess the privacy risk based on a small number of network statistics which are available even before the data is collected. As a rule-of-thumb, the privacy risks are high if the average degree is above 10. Guided by these results we propose a simple method based on edge sampling to mitigate the re-identification risk of nodes. Our method can be implemented already at the data collection phase. Its effect on various network measures can be estimated and corrected using sampling theory. These properties are in contrast with previous methods arbitrarily biasing the data. In this sense, our work could help in sharing network data in a statistically tractable way.
△ Less
Submitted 21 September, 2020;
originally announced September 2020.
-
Going beyond communication intensity for estimating tie strengths in social networks
Authors:
Javier Ureña-Carrion,
Jari Saramäki,
Mikko Kivelä
Abstract:
Even though the concept of tie strength is central in social network analysis, it is difficult to quantify how strong social ties are. One typical way of estimating tie strength in data-driven studies has been to simply count the total number or duration of contacts between two people. This, however, disregards many features that can be extracted from the rich data sets used for social network rec…
▽ More
Even though the concept of tie strength is central in social network analysis, it is difficult to quantify how strong social ties are. One typical way of estimating tie strength in data-driven studies has been to simply count the total number or duration of contacts between two people. This, however, disregards many features that can be extracted from the rich data sets used for social network reconstruction. Here, we focus on contact data with temporal information. We systematically study how features of the contact time series are related to topological features usually associated with tie strength. We analyze a large mobile-phone dataset and measure a number of properties of the call time series for each tie, and use these to predict the so-called neighbourhood overlap, a feature related to strong ties in the sociological literature. We observe a strong relationship between temporal features and the neighbourhood overlap, with many features outperforming simple contact counts. Features that stand out include the number of days with calls, number of bursty cascades, typical times of contacts, and temporal stability. Our results suggest that these measures could be adapted for use in social network construction and indicate that the best results can be achieved by combining multiple temporal features.
△ Less
Submitted 28 July, 2020;
originally announced July 2020.
-
Polarization of Climate Politics Results from Partisan Sorting: Evidence from Finnish Twittersphere
Authors:
Ted Hsuan Yun Chen,
Ali Salloum,
Antti Gronow,
Tuomas Ylä-Anttila,
Mikko Kivelä
Abstract:
Prior research shows that public opinion on climate politics sorts along partisan lines. However, they leave open the question of whether climate politics and other politically salient issues exhibit tendencies for issue alignment, which the political polarization literature identifies as among the most deleterious aspects of polarization. Using a network approach and social media data from the Tw…
▽ More
Prior research shows that public opinion on climate politics sorts along partisan lines. However, they leave open the question of whether climate politics and other politically salient issues exhibit tendencies for issue alignment, which the political polarization literature identifies as among the most deleterious aspects of polarization. Using a network approach and social media data from the Twitter platform, we study polarization of public opinion toward climate politics and ten other politically salient topics during the 2019 Finnish elections as the emergence of opposing groups in a public forum. We find that while climate politics is not particularly polarized compared to the other topics, it is subject to partisan sorting and issue alignment within the universalist-communitarian dimension of European politics that arose following the growth of right-wing populism. Notably, climate politics is consistently aligned with the immigration issue, and temporal trends indicate that this phenomenon will likely persist.
△ Less
Submitted 6 July, 2020;
originally announced July 2020.
-
Weighted temporal event graphs
Authors:
Jari Saramäki,
Mikko Kivelä,
Márton Karsai
Abstract:
The times of temporal-network events and their correlations contain information on the function of the network and they influence dynamical processes taking place on it. To extract information out of correlated event times, techniques such as the analysis of temporal motifs have been developed. We discuss a recently-introduced, more general framework that maps temporal-network structure into stati…
▽ More
The times of temporal-network events and their correlations contain information on the function of the network and they influence dynamical processes taking place on it. To extract information out of correlated event times, techniques such as the analysis of temporal motifs have been developed. We discuss a recently-introduced, more general framework that maps temporal-network structure into static graphs while retaining information on time-respecting paths and the time differences between their consequent events. This framework builds on weighted temporal event graphs: directed, acyclic graphs (DAGs) that contain a superposition of all temporal paths. We introduce the reader to the temporal event-graph map** and associated computational methods and illustrate its use by applying the framework to temporal-network percolation.
△ Less
Submitted 9 December, 2019;
originally announced December 2019.
-
Efficient limited-time reachability estimation in temporal networks
Authors:
Arash Badie-Modiri,
Márton Karsai,
Mikko Kivelä
Abstract:
Time-limited states characterise many dynamical processes on networks: disease infected individuals recover after some time, people forget news spreading on social networks, or passengers may not wait forever for a connection. These dynamics can be described as limited waiting-time processes, and they are particularly important for systems modelled as temporal networks. These processes have been s…
▽ More
Time-limited states characterise many dynamical processes on networks: disease infected individuals recover after some time, people forget news spreading on social networks, or passengers may not wait forever for a connection. These dynamics can be described as limited waiting-time processes, and they are particularly important for systems modelled as temporal networks. These processes have been studied via simulations, which is equivalent to repeatedly finding all limited-waiting time temporal paths from a source node and time. We propose a method yielding orders of magnitude more efficient way of tracking the reachability of such temporal paths. Our method gives simultaneous estimates of the in- or out-reachability (with any chosen waiting-time limit) from every possible starting point and time. It works on very large temporal networks with hundreds of millions of events on current commodity computing hardware. This opens up the possibility to analyse reachability and dynamics of spreading processes on large temporal networks in completely new ways. For example, one can now compute centralities based on global reachability for all events or can find with high probability the infected node and time, which would lead to the largest epidemic outbreak.
△ Less
Submitted 11 June, 2023; v1 submitted 30 August, 2019;
originally announced August 2019.
-
Going beneath the shoulders of giants: tracking the cumulative knowledge spreading in a comprehensive citation network
Authors:
Pietro della Briotta Parolo,
Rainer Kujala,
Kimmo Kaski,
Mikko Kivelä
Abstract:
In all of science, the authors of publications depend on the knowledge presented by the previous publications. Thus they "stand on the shoulders of giants" and there is a flow of knowledge from previous publications to more recent ones. The dominating paradigm for tracking this flow of knowledge is to count the number of direct citations, but this neglects the fact that beneath the first layer of…
▽ More
In all of science, the authors of publications depend on the knowledge presented by the previous publications. Thus they "stand on the shoulders of giants" and there is a flow of knowledge from previous publications to more recent ones. The dominating paradigm for tracking this flow of knowledge is to count the number of direct citations, but this neglects the fact that beneath the first layer of citations there is a full body of literature. In this study, we go underneath the "shoulders" by investigating the cumulative knowledge creation process in a citation network of around 35 million publications. In particular, we study stylized models of persistent influence and diffusion that take into account all the possible chains of citations. When we study the persistent influence values of publications and their citation counts, we find that the publications related to Nobel Prizes i.e. Nobel papers have higher ranks in terms of persistent influence than that due to citations, and that the most outperforming publications are typically early works leading to hot research topics of their time. The diffusion model reveals a significant variation in the rates at which different fields of research share knowledge. We find that these rates have been increasing systematically for several decades, which can be explained by the increase in the publication volumes. Overall, our results suggest that analyzing cumulative knowledge creation on a global scale can be useful in estimating the type and scale of scientific influence of individual publications and entire research areas as well as yielding insights which could not be discovered by using only the direct citation counts.
△ Less
Submitted 29 August, 2019;
originally announced August 2019.
-
Cumulative effects of triadic closure and homophily in social networks
Authors:
Aili Asikainen,
Gerardo Iñiguez,
Kimmo Kaski,
Mikko Kivelä
Abstract:
Much of the structure in social networks has been explained by two seemingly independent network evolution mechanisms: triadic closure and homophily. While it is common to consider these mechanisms separately or in the frame of a static model, empirical studies suggest that their dynamic interplay is the very process responsible for the homophilous patterns of association seen in off- and online s…
▽ More
Much of the structure in social networks has been explained by two seemingly independent network evolution mechanisms: triadic closure and homophily. While it is common to consider these mechanisms separately or in the frame of a static model, empirical studies suggest that their dynamic interplay is the very process responsible for the homophilous patterns of association seen in off- and online social networks. By combining these two mechanisms in a minimal solvable dynamic model, we confirm theoretically the long-held and empirically established hypothesis that homophily can be amplified by the triadic closure mechanism. This research approach allows us to estimate how much of the observed homophily in various friendship and communication networks is due to amplification for a given amount of triadic closure. We find that the cumulative advantage-like process leading to homophily amplification can, under certain circumstances, also lead to the widely documented core-periphery structure of social networks, as well as to the emergence of memory of previous homophilic constraints (equivalent to hysteresis phenomena in physics). The theoretical understanding provided by our results highlights the importance of early intervention in managing at the societal level the most adverse effects of homophilic decision-making, such as inequality, segregation and online echo chambers.
△ Less
Submitted 17 September, 2018;
originally announced September 2018.
-
Randomized reference models for temporal networks
Authors:
Laetitia Gauvin,
Mathieu Génois,
Márton Karsai,
Mikko Kivelä,
Taro Takaguchi,
Eugenio Valdano,
Christian L. Vestergaard
Abstract:
Many dynamical systems can be successfully analyzed by representing them as networks. Empirically measured networks and dynamic processes that take place in these situations show heterogeneous, non-Markovian, and intrinsically correlated topologies and dynamics. This makes their analysis particularly challenging. Randomized reference models (RRMs) have emerged as a general and versatile toolbox fo…
▽ More
Many dynamical systems can be successfully analyzed by representing them as networks. Empirically measured networks and dynamic processes that take place in these situations show heterogeneous, non-Markovian, and intrinsically correlated topologies and dynamics. This makes their analysis particularly challenging. Randomized reference models (RRMs) have emerged as a general and versatile toolbox for studying such systems. Defined as random networks with given features constrained to match those of an input (empirical) network, they may, for example, be used to identify important features of empirical networks and their effects on dynamical processes unfolding in the network. RRMs are typically implemented as procedures that reshuffle an empirical network, making them very generally applicable. However, the effects of most shuffling procedures on network features remain poorly understood, rendering their use nontrivial and susceptible to misinterpretation. Here we propose a unified framework for classifying and understanding microcanonical RRMs (MRRMs) that sample networks with uniform probability. Focusing on temporal networks, we survey applications of MRRMs found in the literature, and we use this framework to build a taxonomy of MRRMs that proposes a canonical naming convention, classifies them, and deduces their effects on a range of important network features. We furthermore show that certain classes of MRRMs may be applied in sequential composition to generate new MRRMs from the existing ones surveyed in this article. We finally provide a tutorial showing how to apply a series of MRRMs to analyze how different network features affect a dynamic process in an empirical temporal network.
△ Less
Submitted 15 December, 2022; v1 submitted 11 June, 2018;
originally announced June 2018.
-
Map** temporal-network percolation to weighted, static event graphs
Authors:
Mikko Kivelä,
Jordan Cambe,
Jari Saramäki,
Márton Karsai
Abstract:
Many processes of spreading and diffusion take place on temporal networks, and their outcomes are influenced by correlations in the times of contact. These correlations have a particularly strong influence on processes where the spreading agent has a limited lifetime at nodes: disease spreading (recovery time), diffusion of rumors (lifetime of information), and passenger routing (maximum acceptabl…
▽ More
Many processes of spreading and diffusion take place on temporal networks, and their outcomes are influenced by correlations in the times of contact. These correlations have a particularly strong influence on processes where the spreading agent has a limited lifetime at nodes: disease spreading (recovery time), diffusion of rumors (lifetime of information), and passenger routing (maximum acceptable time between transfers). Here, we introduce weighted event graphs as a powerful and fast framework for studying connectivity determined by time-respecting paths where the allowed waiting times between contacts have an upper limit. We study percolation on the weighted event graphs and in the underlying temporal networks, with simulated and real-world networks. We show that this type of temporal-network percolation is analogous to directed percolation, and that it can be characterized by multiple order parameters.
△ Less
Submitted 17 September, 2017;
originally announced September 2017.
-
Stochastic Block Model Reveals the Map of Citation Patterns and Their Evolution in Time
Authors:
Darko Hric,
Kimmo Kaski,
Mikko Kivelä
Abstract:
In this study we map out the large-scale structure of citation networks of science journals and follow their evolution in time by using stochastic block models (SBMs). The SBM fitting procedures are principled methods that can be used to find hierarchical grou** of journals into blocks that show similar incoming and outgoing citations patterns. These methods work directly on the citation network…
▽ More
In this study we map out the large-scale structure of citation networks of science journals and follow their evolution in time by using stochastic block models (SBMs). The SBM fitting procedures are principled methods that can be used to find hierarchical grou** of journals into blocks that show similar incoming and outgoing citations patterns. These methods work directly on the citation network without the need to construct auxiliary networks based on similarity of nodes. We fit the SBMs to the networks of journals we have constructed from the data set of around 630 million citations and find a variety of different types of blocks, such as clusters, bridges, sources, and sinks. In addition we use a recent generalization of SBMs to determine how much a manually curated classification of journals into subfields of science is related to the block structure of the journal network and how this relationship changes in time. The SBM method tries to find a network of blocks that is the best high-level representation of the network of journals, and we illustrate how these block networks (at various levels of resolution) can be used as maps of science.
△ Less
Submitted 28 April, 2017;
originally announced May 2017.
-
Isomorphisms in Multilayer Networks
Authors:
Mikko Kivelä,
Mason A. Porter
Abstract:
We extend the concept of graph isomorphisms to multilayer networks with any number of "aspects" (i.e., types of layering). In develo** this generalization, we identify multiple types of isomorphisms. For example, in multilayer networks with a single aspect, permuting vertex labels, layer labels, and both vertex labels and layer labels each yield different isomorphism relations between multilayer…
▽ More
We extend the concept of graph isomorphisms to multilayer networks with any number of "aspects" (i.e., types of layering). In develo** this generalization, we identify multiple types of isomorphisms. For example, in multilayer networks with a single aspect, permuting vertex labels, layer labels, and both vertex labels and layer labels each yield different isomorphism relations between multilayer networks. Multilayer network isomorphisms lead naturally to defining isomorphisms in any of the numerous types of networks that can be represented as a multilayer network, and we thereby obtain isomorphisms for multiplex networks, temporal networks, networks with both of these features, and more. We reduce each of the multilayer network isomorphism problems to a graph isomorphism problem, where the size of the graph isomorphism problem grows linearly with the size of the multilayer network isomorphism problem. One can thus use software that has been developed to solve graph isomorphism problems as a practical means for solving multilayer network isomorphism problems. Our theory lays a foundation for extending many network analysis methods --- including motifs, graphlets, structural roles, and network alignment --- to any multilayer network.
△ Less
Submitted 16 February, 2017; v1 submitted 1 June, 2015;
originally announced June 2015.
-
Estimating inter-event time distributions from finite observation periods in communication networks
Authors:
Mikko Kivelä,
Mason A. Porter
Abstract:
A diverse variety of processes --- including recurrent disease episodes, neuron firing, and communication patterns among humans --- can be described using inter-event time (IET) distributions. Many such processes are ongoing, although event sequences are only available during a finite observation window. Because the observation time window is more likely to begin or end during long IETs than durin…
▽ More
A diverse variety of processes --- including recurrent disease episodes, neuron firing, and communication patterns among humans --- can be described using inter-event time (IET) distributions. Many such processes are ongoing, although event sequences are only available during a finite observation window. Because the observation time window is more likely to begin or end during long IETs than during short ones, the analysis of such data is susceptible to a bias induced by the finite observation period. In this paper, we illustrate how this length bias is born and how it can be corrected without assuming any particular shape for the IET distribution. To do this, we model event sequences using stationary renewal processes, and we formulate simple heuristics for determining the severity of the bias. To illustrate our results, we focus on the example of empirical communication networks, which are temporal networks that are constructed from communication events. The IET distributions of such systems guide efforts to build models of human behavior, and the variance of IETs is very important for estimating the spreading rate of information in networks of temporal interactions. We analyze several well-known data sets from the literature, and we find that the resulting bias can lead to systematic underestimates of the variance in the IET distributions and that correcting for the bias can lead to qualitatively different results for the tails of the IET distributions.
△ Less
Submitted 29 July, 2015; v1 submitted 29 December, 2014;
originally announced December 2014.
-
Multilayer Networks
Authors:
Mikko Kivelä,
Alexandre Arenas,
Marc Barthelemy,
James P. Gleeson,
Yamir Moreno,
Mason A. Porter
Abstract:
In most natural and engineered systems, a set of entities interact with each other in complicated patterns that can encompass multiple types of relationships, change in time, and include other types of complications. Such systems include multiple subsystems and layers of connectivity, and it is important to take such "multilayer" features into account to try to improve our understanding of complex…
▽ More
In most natural and engineered systems, a set of entities interact with each other in complicated patterns that can encompass multiple types of relationships, change in time, and include other types of complications. Such systems include multiple subsystems and layers of connectivity, and it is important to take such "multilayer" features into account to try to improve our understanding of complex systems. Consequently, it is necessary to generalize "traditional" network theory by develo** (and validating) a framework and associated tools to study multilayer systems in a comprehensive fashion. The origins of such efforts date back several decades and arose in multiple disciplines, and now the study of multilayer networks has become one of the most important directions in network science. In this paper, we discuss the history of multilayer networks (and related concepts) and review the exploding body of work on such networks. To unify the disparate terminology in the large body of recent work, we discuss a general framework for multilayer networks, construct a dictionary of terminology to relate the numerous existing concepts to each other, and provide a thorough discussion that compares, contrasts, and translates between related notions such as multilayer networks, multiplex networks, interdependent networks, networks of networks, and many others. We also survey and discuss existing data sets that can be represented as multilayer networks. We review attempts to generalize single-layer-network diagnostics to multilayer networks. We also discuss the rapidly expanding research on multilayer-network models and notions like community structure, connected components, tensor decompositions, and various types of dynamical processes on multilayer networks. We conclude with a summary and an outlook.
△ Less
Submitted 3 March, 2014; v1 submitted 27 September, 2013;
originally announced September 2013.
-
Structure of Triadic Relations in Multiplex Networks
Authors:
Emanuele Cozzo,
Mikko Kivelä,
Manlio De Domenico,
Albert Solé,
Alex Arenas,
Sergio Gómez,
Mason A. Porter,
Yamir Moreno
Abstract:
Recent advances in the study of networked systems have highlighted that our interconnected world is composed of networks that are coupled to each other through different "layers" that each represent one of many possible subsystems or types of interactions. Nevertheless, it is traditional to aggregate multilayer networks into a single weighted network in order to take advantage of existing tools. T…
▽ More
Recent advances in the study of networked systems have highlighted that our interconnected world is composed of networks that are coupled to each other through different "layers" that each represent one of many possible subsystems or types of interactions. Nevertheless, it is traditional to aggregate multilayer networks into a single weighted network in order to take advantage of existing tools. This is admittedly convenient, but it is also extremely problematic, as important information can be lost as a result. It is therefore important to develop multilayer generalizations of network concepts. In this paper, we analyze triadic relations and generalize the idea of transitivity to multiplex networks. By focusing on triadic relations, which yield the simplest type of transitivity, we generalize the concept and computation of clustering coefficients to multiplex networks. We show how the layered structure of such networks introduces a new degree of freedom that has a fundamental effect on transitivity. We compute multiplex clustering coefficients for several real multiplex networks and illustrate why one must take great care when generalizing standard network concepts to multiplex networks. We also derive analytical expressions for our clustering coefficients for ensemble averages of networks in a family of random multiplex networks. Our analysis illustrates that social networks have a strong tendency to promote redundancy by closing triads at every layer and that they thereby have a different type of multiplex transitivity from transportation networks, which do not exhibit such a tendency. These insights are invisible if one only studies aggregated networks.
△ Less
Submitted 12 August, 2015; v1 submitted 25 July, 2013;
originally announced July 2013.
-
Multiscale Analysis of Spreading in a Large Communication Network
Authors:
Mikko Kivelä,
Raj Kumar Pan,
Kimmo Kaski,
János Kertész,
Jari Saramäki,
Márton Karsai
Abstract:
In temporal networks, both the topology of the underlying network and the timings of interaction events can be crucial in determining how some dynamic process mediated by the network unfolds. We have explored the limiting case of the speed of spreading in the SI model, set up such that an event between an infectious and susceptible individual always transmits the infection. The speed of this proce…
▽ More
In temporal networks, both the topology of the underlying network and the timings of interaction events can be crucial in determining how some dynamic process mediated by the network unfolds. We have explored the limiting case of the speed of spreading in the SI model, set up such that an event between an infectious and susceptible individual always transmits the infection. The speed of this process sets an upper bound for the speed of any dynamic process that is mediated through the interaction events of the network. With the help of temporal networks derived from large scale time-stamped data on mobile phone calls, we extend earlier results that point out the slowing-down effects of burstiness and temporal inhomogeneities. In such networks, links are not permanently active, but dynamic processes are mediated by recurrent events taking place on the links at specific points in time. We perform a multi-scale analysis and pinpoint the importance of the timings of event sequences on individual links, their correlations with neighboring sequences, and the temporal pathways taken by the network-scale spreading process. This is achieved by studying empirically and analytically different characteristic relay times of links, relevant to the respective scales, and a set of temporal reference models that allow for removing selected time-domain correlations one by one.
△ Less
Submitted 19 December, 2011;
originally announced December 2011.
-
Using explosive percolation in analysis of real-world networks
Authors:
Raj Kumar Pan,
Mikko Kivelä,
Jari Saramäki,
Kimmo Kaski,
János Kertész
Abstract:
We apply a variant of the explosive percolation procedure to large real-world networks, and show with finite-size scaling that the university class, ordinary or explosive, of the resulting percolation transition depends on the structural properties of the network as well as the number of unoccupied links considered for comparison in our procedure. We observe that in our social networks, the percol…
▽ More
We apply a variant of the explosive percolation procedure to large real-world networks, and show with finite-size scaling that the university class, ordinary or explosive, of the resulting percolation transition depends on the structural properties of the network as well as the number of unoccupied links considered for comparison in our procedure. We observe that in our social networks, the percolation clusters close to the critical point are related to the community structure. This relationship is further highlighted by applying the procedure to model networks with pre-defined communities.
△ Less
Submitted 18 April, 2011; v1 submitted 15 October, 2010;
originally announced October 2010.
-
Small But Slow World: How Network Topology and Burstiness Slow Down Spreading
Authors:
M. Karsai,
M. Kivelä,
R. K. Pan,
K. Kaski,
J. Kertész,
A. -L. Barabási,
J. Saramäki
Abstract:
Communication networks show the small-world property of short paths, but the spreading dynamics in them turns out slow. We follow the time evolution of information propagation through communication networks by using the SI model with empirical data on contact sequences. We introduce null models where the sequences are randomly shuffled in different ways, enabling us to distinguish between the cont…
▽ More
Communication networks show the small-world property of short paths, but the spreading dynamics in them turns out slow. We follow the time evolution of information propagation through communication networks by using the SI model with empirical data on contact sequences. We introduce null models where the sequences are randomly shuffled in different ways, enabling us to distinguish between the contributions of different impeding effects. The slowing down of spreading is found to be caused mostly by weight-topology correlations and the bursty activity patterns of individuals.
△ Less
Submitted 22 August, 2010; v1 submitted 10 June, 2010;
originally announced June 2010.
-
Characterizing the community structure of complex networks
Authors:
Andrea Lancichinetti,
Mikko Kivela,
Jari Saramaki,
Santo Fortunato
Abstract:
Community structure is one of the key properties of complex networks and plays a crucial role in their topology and function. While an impressive amount of work has been done on the issue of community detection, very little attention has been so far devoted to the investigation of communities in real networks. We present a systematic empirical analysis of the statistical properties of communities…
▽ More
Community structure is one of the key properties of complex networks and plays a crucial role in their topology and function. While an impressive amount of work has been done on the issue of community detection, very little attention has been so far devoted to the investigation of communities in real networks. We present a systematic empirical analysis of the statistical properties of communities in large information, communication, technological, biological, and social networks. We find that the mesoscopic organization of networks of the same category is remarkably similar. This is reflected in several characteristics of community structure, which can be used as ``fingerprints'' of specific network categories. While community size distributions are always broad, certain categories of networks consist mainly of tree-like communities, while others have denser modules. Average path lengths within communities initially grow logarithmically with community size, but the growth saturates or slows down for communities larger than a characteristic size. This behaviour is related to the presence of hubs within communities, whose roles differ across categories. Also the community embeddedness of nodes, measured in terms of the fraction of links within their communities, has a characteristic distribution for each category. Our findings are verified by the use of two fundamentally different community detection methods.
△ Less
Submitted 24 May, 2010;
originally announced May 2010.