-
Initialisation and Topology Effects in Decentralised Federated Learning
Authors:
Arash Badie-Modiri,
Chiara Boldrini,
Lorenzo Valerio,
János Kertész,
Márton Karsai
Abstract:
Fully decentralised federated learning enables collaborative training of individual machine learning models on distributed devices on a communication network while kee** the training data localised. This approach enhances data privacy and eliminates both the single point of failure and the necessity for central coordination. Our research highlights that the effectiveness of decentralised federat…
▽ More
Fully decentralised federated learning enables collaborative training of individual machine learning models on distributed devices on a communication network while kee** the training data localised. This approach enhances data privacy and eliminates both the single point of failure and the necessity for central coordination. Our research highlights that the effectiveness of decentralised federated learning is significantly influenced by the network topology of connected devices. We propose a strategy for uncoordinated initialisation of the artificial neural networks, which leverages the distribution of eigenvector centralities of the nodes of the underlying communication network, leading to a radically improved training efficiency. Additionally, our study explores the scaling behaviour and choice of environmental parameters under our proposed initialisation strategy. This work paves the way for more efficient and scalable artificial neural network training in a distributed and uncoordinated environment, offering a deeper understanding of the intertwining roles of network structure and learning dynamics.
△ Less
Submitted 22 May, 2024; v1 submitted 23 March, 2024;
originally announced March 2024.
-
Crowdsourcing subjective annotations using pairwise comparisons reduces bias and error compared to the majority-vote method
Authors:
Hasti Narimanzadeh,
Arash Badie-Modiri,
Iuliia Smirnova,
Ted Hsuan Yun Chen
Abstract:
How to better reduce measurement variability and bias introduced by subjectivity in crowdsourced labelling remains an open question. We introduce a theoretical framework for understanding how random error and measurement bias enter into crowdsourced annotations of subjective constructs. We then propose a pipeline that combines pairwise comparison labelling with Elo scoring, and demonstrate that it…
▽ More
How to better reduce measurement variability and bias introduced by subjectivity in crowdsourced labelling remains an open question. We introduce a theoretical framework for understanding how random error and measurement bias enter into crowdsourced annotations of subjective constructs. We then propose a pipeline that combines pairwise comparison labelling with Elo scoring, and demonstrate that it outperforms the ubiquitous majority-voting method in reducing both types of measurement error. To assess the performance of the labelling approaches, we constructed an agent-based model of crowdsourced labelling that lets us introduce different types of subjectivity into the tasks. We find that under most conditions with task subjectivity, the comparison approach produced higher $f_1$ scores. Further, the comparison approach is less susceptible to inflating bias, which majority voting tends to do. To facilitate applications, we show with simulated and real-world data that the number of required random comparisons for the same classification accuracy scales log-linearly $O(N \log N)$ with the number of labelled items. We also implemented the Elo system as an open-source Python package.
△ Less
Submitted 1 June, 2023; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Reticula: A temporal network and hypergraph analysis software package
Authors:
Arash Badie-Modiri,
Mikko Kivelä
Abstract:
In the last decade, temporal networks and static and temporal hypergraphs have enabled modelling connectivity and spreading processes in a wide array of real-world complex systems such as economic transactions, information spreading, brain activity and disease spreading. In this manuscript, we present the Reticula C++ library and Python package: A comprehensive suite of tools for working with real…
▽ More
In the last decade, temporal networks and static and temporal hypergraphs have enabled modelling connectivity and spreading processes in a wide array of real-world complex systems such as economic transactions, information spreading, brain activity and disease spreading. In this manuscript, we present the Reticula C++ library and Python package: A comprehensive suite of tools for working with real-world and synthetic static and temporal networks and hypergraphs. This includes various methods of creating synthetic networks and randomised null models based on real-world data, calculating reachability and simulating compartmental models on networks. The library is designed principally on an extensible, cache-friendly representation of networks, with an aim of easing multi-thread use in the high-performance computing environment.
△ Less
Submitted 11 June, 2023; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Directed Percolation in Random Temporal Network Models with Heterogeneities
Authors:
Arash Badie-Modiri,
Abbas K. Rizi,
Márton Karsai,
Mikko Kivelä
Abstract:
The event graph representation of temporal networks suggests that the connectivity of temporal structures can be mapped to a directed percolation problem. However, similar to percolation theory on static networks, this map** is valid under the approximation that the structure and interaction dynamics of the temporal network are determined by its local properties, and otherwise, it is maximally r…
▽ More
The event graph representation of temporal networks suggests that the connectivity of temporal structures can be mapped to a directed percolation problem. However, similar to percolation theory on static networks, this map** is valid under the approximation that the structure and interaction dynamics of the temporal network are determined by its local properties, and otherwise, it is maximally random. We challenge these conditions and demonstrate the robustness of this map** in case of more complicated systems. We systematically analyze random and regular network topologies and heterogeneous link-activation processes driven by bursty renewal or self-exciting processes using numerical simulation and finite-size scaling methods. We find that the critical percolation exponents characterizing the temporal network are not sensitive to many structural and dynamical network heterogeneities, while they recover known scaling exponents characterizing directed percolation on low dimensional lattices. While it is not possible to demonstrate the validity of this map** for all temporal network models, our results establish the first batch of evidence supporting the robustness of the scaling relationships in the limited-time reachability of temporal networks.
△ Less
Submitted 11 June, 2023; v1 submitted 14 October, 2021;
originally announced October 2021.
-
Directed Percolation in Temporal Networks
Authors:
Arash Badie-Modiri,
Abbas K. Rizi,
Márton Karsai,
Mikko Kivelä
Abstract:
Connectivity and reachability on temporal networks, which can describe the spreading of a disease, decimation of information or the accessibility of a public transport system over time, have been among the main contemporary areas of study in complex systems for the last decade. However, while isotropic percolation theory successfully describes connectivity in static networks, a similar description…
▽ More
Connectivity and reachability on temporal networks, which can describe the spreading of a disease, decimation of information or the accessibility of a public transport system over time, have been among the main contemporary areas of study in complex systems for the last decade. However, while isotropic percolation theory successfully describes connectivity in static networks, a similar description has not been yet developed for temporal networks. Here address this problem and formalize a map** of the concept of temporal network reachability to percolation theory. We show that the limited-waiting-time reachability, a generic notion of constrained connectivity in temporal networks, displays directed percolation phase transition in connectivity. Consequently, the critical percolation properties of spreading processes on temporal networks can be estimated by a set of known exponents characterising the directed percolation universality class. This result is robust across a diverse set of temporal network models with different temporal and topological heterogeneities, while by using our methodology we uncover similar reachability phase transitions in real temporal networks too. These findings open up an avenue to apply theory, concepts and methodology from the well-developed directed percolation literature to temporal networks.
△ Less
Submitted 11 June, 2023; v1 submitted 3 July, 2021;
originally announced July 2021.
-
Epidemic Spreading and Digital Contact Tracing: Effects of Heterogeneous Mixing and Quarantine Failures
Authors:
Abbas K. Rizi,
Ali Faqeeh,
Arash Badie-Modiri,
Mikko Kivelä
Abstract:
Contact tracing via digital tracking applications installed on mobile phones is an important tool for controlling epidemic spreading. Its effectivity can be quantified by modifying the standard methodology for analyzing percolation and connectivity of contact networks. We apply this framework to networks with varying degree distributions, numbers of application users, and probabilities of quaranti…
▽ More
Contact tracing via digital tracking applications installed on mobile phones is an important tool for controlling epidemic spreading. Its effectivity can be quantified by modifying the standard methodology for analyzing percolation and connectivity of contact networks. We apply this framework to networks with varying degree distributions, numbers of application users, and probabilities of quarantine failures. Further, we study structured populations with homophily and heterophily and the possibility of degree-targeted application distribution. Our results are based on a combination of explicit simulations and mean-field analysis. They indicate that there can be major differences in the epidemic size and epidemic probabilities which are equivalent in the normal SIR processes. Further, degree heterogeneity is seen to be especially important for the epidemic threshold but not as much for the epidemic size. The probability that tracing leads to quarantines is not as important as the application adoption rate. Finally, both strong homophily and especially heterophily with regard to application adoption can be detrimental. Overall, epidemic dynamics are very sensitive to all of the parameter values we tested out, which makes the problem of estimating the effect of digital contact tracing an inherently multidimensional problem.
△ Less
Submitted 19 April, 2022; v1 submitted 23 March, 2021;
originally announced March 2021.
-
Efficient limited-time reachability estimation in temporal networks
Authors:
Arash Badie-Modiri,
Márton Karsai,
Mikko Kivelä
Abstract:
Time-limited states characterise many dynamical processes on networks: disease infected individuals recover after some time, people forget news spreading on social networks, or passengers may not wait forever for a connection. These dynamics can be described as limited waiting-time processes, and they are particularly important for systems modelled as temporal networks. These processes have been s…
▽ More
Time-limited states characterise many dynamical processes on networks: disease infected individuals recover after some time, people forget news spreading on social networks, or passengers may not wait forever for a connection. These dynamics can be described as limited waiting-time processes, and they are particularly important for systems modelled as temporal networks. These processes have been studied via simulations, which is equivalent to repeatedly finding all limited-waiting time temporal paths from a source node and time. We propose a method yielding orders of magnitude more efficient way of tracking the reachability of such temporal paths. Our method gives simultaneous estimates of the in- or out-reachability (with any chosen waiting-time limit) from every possible starting point and time. It works on very large temporal networks with hundreds of millions of events on current commodity computing hardware. This opens up the possibility to analyse reachability and dynamics of spreading processes on large temporal networks in completely new ways. For example, one can now compute centralities based on global reachability for all events or can find with high probability the infected node and time, which would lead to the largest epidemic outbreak.
△ Less
Submitted 11 June, 2023; v1 submitted 30 August, 2019;
originally announced August 2019.