-
Distinguishing mechanisms of social contagion from local network view
Authors:
Elsa Andres,
Gergely Ódor,
Iacopo Iacopini,
Márton Karsai
Abstract:
The adoption of individual behavioural patterns is largely determined by stimuli arriving from peers via social interactions or from external sources. Based on these influences, individuals are commonly assumed to follow simple or complex adoption rules, inducing social contagion processes. In reality, multiple adoption rules may coexist even within the same social contagion process, introducing a…
▽ More
The adoption of individual behavioural patterns is largely determined by stimuli arriving from peers via social interactions or from external sources. Based on these influences, individuals are commonly assumed to follow simple or complex adoption rules, inducing social contagion processes. In reality, multiple adoption rules may coexist even within the same social contagion process, introducing additional complexity into the spreading phenomena. Our goal is to understand whether coexisting adoption mechanisms can be distinguished from a microscopic view, at the egocentric network level, without requiring global information about the underlying network, or the unfolding spreading process. We formulate this question as a classification problem, and study it through a Bayesian likelihood approach and with random forest classifiers in various synthetic and data-driven experiments. This study offers a novel perspective on the observations of propagation processes at the egocentric level and a better understanding of landmark contagion mechanisms from a local view.
△ Less
Submitted 27 June, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Epidemic-induced local awareness behavior inferred from surveys and genetic sequence data
Authors:
Gergely Ódor,
Márton Karsai
Abstract:
Behavior-disease models suggest that if individuals are aware and take preventive actions when the prevalence of the disease increases among their close contacts, then the pandemic can be contained in a cost-effective way. To measure the true impact of local awareness behavior on epidemic spreading, we propose an efficient approach to identify superspreading events and assign corresponding Event C…
▽ More
Behavior-disease models suggest that if individuals are aware and take preventive actions when the prevalence of the disease increases among their close contacts, then the pandemic can be contained in a cost-effective way. To measure the true impact of local awareness behavior on epidemic spreading, we propose an efficient approach to identify superspreading events and assign corresponding Event Containment Scores (ECSs) in clinical genetic sequence data. We validate ECS as a measure of local awareness in simulation experiments, and we find that ECS was correlated positively with policy stringency during the COVID-19 pandemic. Finally, we observe a temporary drop in ECS during the Omicron wave in most European countries, matching a survey experiment we carried out at the same time. Our findings bring important insight into the field of awareness modeling through the analysis of large-scale genetic sequence data, one of the most promising data sources in epidemics research.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Source Detection via Contact Tracing in the Presence of Asymptomatic Patients
Authors:
Gergely Ódor,
Jana Vuckovic,
Miguel-Angel Sanchez Ndoye,
Patrick Thiran
Abstract:
Inferring the source of a diffusion in a large network of agents is a difficult but feasible task, if a few agents act as sensors revealing the time at which they got hit by the diffusion. A main limitation of current source detection algorithms is that they assume full knowledge of the contact network, which is rarely the case, especially for epidemics, where the source is called patient zero. In…
▽ More
Inferring the source of a diffusion in a large network of agents is a difficult but feasible task, if a few agents act as sensors revealing the time at which they got hit by the diffusion. A main limitation of current source detection algorithms is that they assume full knowledge of the contact network, which is rarely the case, especially for epidemics, where the source is called patient zero. Inspired by recent contact tracing algorithms, we propose a new framework, which we call Source Detection via Contact Tracing Framework (SDCTF). In the SDCTF, the source detection task starts at the time of the first hospitalization, and initially we have no knowledge about the contact network other than the identity of the first hospitalized agent. We may then explore the network by contact queries, and obtain symptom onset times by test queries in an adaptive way. We also assume that some of the agents may be asymptomatic, and therefore cannot reveal their symptom onset time. Our goal is to find patient zero with as few contact and test queries as possible.
We propose two local search algorithms for the SDCTF: the LS algorithm is more data-efficient, but can fail to find the true source if many asymptomatic agents are present, whereas the LS+ algorithm is more robust to asymptomatic agents. By simulations we show that both LS and LS+ outperform state of the art adaptive and non-adaptive source detection algorithms adapted to the SDCTF, even though these baseline algorithms have full access to the contact network. Extending the theory of random exponential trees, we analytically approximate the probability of success of the LS/ LS+ algorithms, and we show that our analytic results match the simulations. Finally, we benchmark our algorithms on the Data-driven COVID-19 Simulator developed by Lorch et al., which is the first time source detection algorithms are tested on such a complex dataset.
△ Less
Submitted 29 December, 2021;
originally announced December 2021.
-
The power of adaptivity in source identification with time queries on the path
Authors:
Victor Lecomte,
Gergely Ódor,
Patrick Thiran
Abstract:
We study the problem of identifying the source of a stochastic diffusion process spreading on a graph based on the arrival times of the diffusion at a few queried nodes. In a graph $G=(V,E)$, an unknown source node $v^* \in V$ is drawn uniformly at random, and unknown edge weights $w(e)$ for $e\in E$, representing the propagation delays along the edges, are drawn independently from a Gaussian dist…
▽ More
We study the problem of identifying the source of a stochastic diffusion process spreading on a graph based on the arrival times of the diffusion at a few queried nodes. In a graph $G=(V,E)$, an unknown source node $v^* \in V$ is drawn uniformly at random, and unknown edge weights $w(e)$ for $e\in E$, representing the propagation delays along the edges, are drawn independently from a Gaussian distribution of mean $1$ and variance $σ^2$. An algorithm then attempts to identify $v^*$ by querying nodes $q \in V$ and being told the length of the shortest path between $q$ and $v^*$ in graph $G$ weighted by $w$. We consider two settings: non-adaptive, in which all query nodes must be decided in advance, and adaptive, in which each query can depend on the results of the previous ones. Both settings are motivated by an application of the problem to epidemic processes (where the source is called patient zero), which we discuss in detail.
We characterize the query complexity when $G$ is an $n$-node path. In the non-adaptive setting, $Θ(nσ^2)$ queries are needed for $σ^2 \leq 1$, and $Θ(n)$ for $σ^2 \geq 1$. In the adaptive setting, somewhat surprisingly, only $Θ(\log\log_{1/σ}n)$ are needed when $σ^2 \leq 1/2$, and $Θ(\log \log n)+O_σ(1)$ when $σ^2 \geq 1/2$. This is the first mathematical study of source identification with time queries in a non-deterministic diffusion process.
△ Less
Submitted 29 December, 2021; v1 submitted 17 February, 2020;
originally announced February 2020.
-
Sequential metric dimension for random graphs
Authors:
Gergely Ódor,
Patrick Thiran
Abstract:
In the localization game on a graph, the goal is to find a fixed but unknown target node $v^\star$ with the least number of distance queries possible. In the $j^{th}$ step of the game, the player queries a single node $v_j$ and receives, as an answer to their query, the distance between the nodes $v_j$ and $v^\star$. The sequential metric dimension (SMD) is the minimal number of queries that the p…
▽ More
In the localization game on a graph, the goal is to find a fixed but unknown target node $v^\star$ with the least number of distance queries possible. In the $j^{th}$ step of the game, the player queries a single node $v_j$ and receives, as an answer to their query, the distance between the nodes $v_j$ and $v^\star$. The sequential metric dimension (SMD) is the minimal number of queries that the player needs to guess the target with absolute certainty, no matter where the target is.
The term SMD originates from the related notion of metric dimension (MD), which can be defined the same way as the SMD, except that the player's queries are non-adaptive. In this work, we extend the results of \cite{bollobas2012metric} on the MD of Erdős-Rényi graphs to the SMD. We find that, in connected Erdős-Rényi graphs, the MD and the SMD are a constant factor apart. For the lower bound we present a clean analysis by combining tools developed for the MD and a novel coupling argument. For the upper bound we show that a strategy that greedily minimizes the number of candidate targets in each step uses asymptotically optimal queries in Erdős-Rényi graphs. Connections with source localization, binary search on graphs and the birthday problem are discussed.
△ Less
Submitted 15 November, 2021; v1 submitted 22 October, 2019;
originally announced October 2019.
-
A Multi-Pass Approach to Large-Scale Connectomics
Authors:
Yaron Meirovitch,
Alexander Matveev,
Hayk Saribekyan,
David Budden,
David Rolnick,
Gergely Odor,
Seymour Knowles-Barley,
Thouis Raymond Jones,
Hanspeter Pfister,
Jeff William Lichtman,
Nir Shavit
Abstract:
The field of connectomics faces unprecedented "big data" challenges. To reconstruct neuronal connectivity, automated pixel-level segmentation is required for petabytes of streaming electron microscopy data. Existing algorithms provide relatively good accuracy but are unacceptably slow, and would require years to extract connectivity graphs from even a single cubic millimeter of neural tissue. Here…
▽ More
The field of connectomics faces unprecedented "big data" challenges. To reconstruct neuronal connectivity, automated pixel-level segmentation is required for petabytes of streaming electron microscopy data. Existing algorithms provide relatively good accuracy but are unacceptably slow, and would require years to extract connectivity graphs from even a single cubic millimeter of neural tissue. Here we present a viable real-time solution, a multi-pass pipeline optimized for shared-memory multicore systems, capable of processing data at near the terabyte-per-hour pace of multi-beam electron microscopes. The pipeline makes an initial fast-pass over the data, and then makes a second slow-pass to iteratively correct errors in the output of the fast-pass. We demonstrate the accuracy of a sparse slow-pass reconstruction algorithm and suggest new methods for detecting morphological errors. Our fast-pass approach provided many algorithmic challenges, including the design and implementation of novel shallow convolutional neural nets and the parallelization of watershed and object-merging techniques. We use it to reconstruct, from image stack to skeletons, the full dataset of Kasthuri et al. (463 GB capturing 120,000 cubic microns) in a matter of hours on a single multicore machine rather than the weeks it has taken in the past on much larger distributed systems.
△ Less
Submitted 7 December, 2016;
originally announced December 2016.
-
Hierarchical Manifold Clustering on Diffusion Maps for Connectomics (MIT 18.S096 final project)
Authors:
Gergely Odor
Abstract:
In this paper, we introduce a novel algorithm for segmentation of imperfect boundary probability maps (BPM) in connectomics. Our algorithm can be a considered as an extension of spectral clustering. Instead of clustering the diffusion maps with traditional clustering algorithms, we learn the manifold and compute an estimate of the minimum normalized cut. We proceed by divide and conquer. We also i…
▽ More
In this paper, we introduce a novel algorithm for segmentation of imperfect boundary probability maps (BPM) in connectomics. Our algorithm can be a considered as an extension of spectral clustering. Instead of clustering the diffusion maps with traditional clustering algorithms, we learn the manifold and compute an estimate of the minimum normalized cut. We proceed by divide and conquer. We also introduce a novel criterion for determining if further splits are necessary in a component based on it's topological properties. Our algorithm complements the currently popular agglomeration approaches in connectomics, which overlook the geometrical aspects of this segmentation problem.
△ Less
Submitted 19 July, 2016;
originally announced July 2016.
-
Bit-Vectorized GPU Implementation of a Stochastic Cellular Automaton Model for Surface Growth
Authors:
Jeffrey Kelling,
Géza Ódor,
Sibylle Gemming
Abstract:
Stochastic surface growth models aid in studying properties of universality classes like the Kardar--Paris--Zhang class. High precision results obtained from large scale computational studies can be transferred to many physical systems. Many properties, such as roughening and some two-time functions can be studied using stochastic cellular automaton (SCA) variants of stochastic models. Here we pre…
▽ More
Stochastic surface growth models aid in studying properties of universality classes like the Kardar--Paris--Zhang class. High precision results obtained from large scale computational studies can be transferred to many physical systems. Many properties, such as roughening and some two-time functions can be studied using stochastic cellular automaton (SCA) variants of stochastic models. Here we present a highly efficient SCA implementation of a surface growth model capable of simulating billions of lattice sites on a single GPU. We also provide insight into cases requiring arbitrary random probabilities which are not accessible through bit-vectorization.
△ Less
Submitted 1 June, 2016;
originally announced June 2016.
-
Universality of (2+1)-dimensional restricted solid-on-solid models
Authors:
Jeffrey Kelling,
Géza Ódor,
Sibylle Gemming
Abstract:
Extensive dynamical simulations of Restricted Solid on Solid models in $D=2+1$ dimensions have been done using parallel multisurface algorithms implemented on graphics cards. Numerical evidence is presented that these models exhibit KPZ surface growth scaling, irrespective of the step heights $N$. We show that by increasing $N$ the corrections to scaling increase, thus smaller step-sized models de…
▽ More
Extensive dynamical simulations of Restricted Solid on Solid models in $D=2+1$ dimensions have been done using parallel multisurface algorithms implemented on graphics cards. Numerical evidence is presented that these models exhibit KPZ surface growth scaling, irrespective of the step heights $N$. We show that by increasing $N$ the corrections to scaling increase, thus smaller step-sized models describe better the asymptotic, long-wave-scaling behavior.
△ Less
Submitted 29 July, 2016; v1 submitted 9 May, 2016;
originally announced May 2016.
-
Ageing of the 2+1 dimensional Kardar-Parisi-Zhang model
Authors:
Géza Ódor,
Jeffrey Kelling,
Sibylle Gemming
Abstract:
Extended dynamical simulations have been performed on a 2+1 dimensional driven dimer lattice gas model to estimate ageing properties. The auto-correlation and the auto-response functions are determined and the corresponding scaling exponents are tabulated. Since this model can be mapped onto the 2+1 dimensional Kardar-Parisi-Zhang surface growth model, our results contribute to the understanding o…
▽ More
Extended dynamical simulations have been performed on a 2+1 dimensional driven dimer lattice gas model to estimate ageing properties. The auto-correlation and the auto-response functions are determined and the corresponding scaling exponents are tabulated. Since this model can be mapped onto the 2+1 dimensional Kardar-Parisi-Zhang surface growth model, our results contribute to the understanding of the universality class of that basic system.
△ Less
Submitted 17 March, 2014; v1 submitted 20 December, 2013;
originally announced December 2013.
-
Comparison of Different Parallel Implementations of the 2+1-Dimensional KPZ Model and the 3-Dimensional KMC Model
Authors:
Jeffrey Kelling,
Géza Ódor,
Máté Ferenc Nagy,
Henrik Schulz,
Karl-Heinz Heinig
Abstract:
We show that efficient simulations of the Kardar-Parisi-Zhang interface growth in 2 + 1 dimensions and of the 3-dimensional Kinetic Monte Carlo of thermally activated diffusion can be realized both on GPUs and modern CPUs. In this article we present results of different implementations on GPUs using CUDA and OpenCL and also on CPUs using OpenCL and MPI. We investigate the runtime and scaling behav…
▽ More
We show that efficient simulations of the Kardar-Parisi-Zhang interface growth in 2 + 1 dimensions and of the 3-dimensional Kinetic Monte Carlo of thermally activated diffusion can be realized both on GPUs and modern CPUs. In this article we present results of different implementations on GPUs using CUDA and OpenCL and also on CPUs using OpenCL and MPI. We investigate the runtime and scaling behavior on different architectures to find optimal solutions for solving current simulation problems in the field of statistical physics and materials science.
△ Less
Submitted 25 July, 2012; v1 submitted 23 April, 2012;
originally announced April 2012.
-
Extremely large scale simulation of a Kardar-Parisi-Zhang model using graphics cards
Authors:
Jeffrey Kelling,
Géza Ódor
Abstract:
The octahedron model introduced recently has been implemented onto graphics cards, which permits extremely large scale simulations via binary lattice gases and bit coded algorithms. We confirm scaling behaviour belonging to the 2d Kardar-Parisi-Zhang universality class and find a surface growth exponent: beta=0.2415(15) on 2^17 x 2^17 systems, ruling out beta=1/4 suggested by field theory. The max…
▽ More
The octahedron model introduced recently has been implemented onto graphics cards, which permits extremely large scale simulations via binary lattice gases and bit coded algorithms. We confirm scaling behaviour belonging to the 2d Kardar-Parisi-Zhang universality class and find a surface growth exponent: beta=0.2415(15) on 2^17 x 2^17 systems, ruling out beta=1/4 suggested by field theory. The maximum speed-up with respect to a single CPU is 240. The steady state has been analysed by finite size scaling and a growth exponent alpha=0.393(4) is found. Correction to scaling exponents are computed and the power-spectrum density of the steady state is determined. We calculate the universal scaling functions, cumulants and show that the limit distribution can be obtained by the sizes considered. We provide numerical fitting for the small and large tail behaviour of the steady state scaling function of the interface width.
△ Less
Submitted 5 January, 2012; v1 submitted 31 October, 2011;
originally announced October 2011.
-
Simulation of 1+1 dimensional surface growth and lattices gases using GPUs
Authors:
Henrik Schulz,
Géza Ódor,
Gergely Ódor,
Máté Ferenc Nagy
Abstract:
Restricted solid on solid surface growth models can be mapped onto binary lattice gases. We show that efficient simulation algorithms can be realized on GPUs either by CUDA or by OpenCL programming. We consider a deposition/evaporation model following Kardar-Parisi-Zhang growth in 1+1 dimensions related to the Asymmetric Simple Exclusion Process and show that for sizes, that fit into the shared me…
▽ More
Restricted solid on solid surface growth models can be mapped onto binary lattice gases. We show that efficient simulation algorithms can be realized on GPUs either by CUDA or by OpenCL programming. We consider a deposition/evaporation model following Kardar-Parisi-Zhang growth in 1+1 dimensions related to the Asymmetric Simple Exclusion Process and show that for sizes, that fit into the shared memory of GPUs one can achieve the maximum parallelization speedup ~ x100 for a Quadro FX 5800 graphics card with respect to a single CPU of 2.67 GHz). This permits us to study the effect of quenched columnar disorder, requiring extremely long simulation times. We compare the CUDA realization with an OpenCL implementation designed for processor clusters via MPI. A two-lane traffic model with randomized turning points is also realized and the dynamical behavior has been investigated.
△ Less
Submitted 30 March, 2011; v1 submitted 2 December, 2010;
originally announced December 2010.