-
Single-mode emission by phase-delayed coupling between nano-lasers
Authors:
T. V. Raziman,
Anna Fischer,
Riccardo Nori,
Anthony Chan,
Wai Kit Ng,
Dhruv Saxena,
Ortwin Hess,
Korneel Molkens,
Ivo Tanghe,
Pieter Geiregat,
Dries Van Thourhout,
Mauricio Barahona,
Riccardo Sapienza
Abstract:
Near-field coupling between nanolasers enables collective high-power lasing but leads to complex spectral resha** and multimode operation, limiting the emission brightness, spatial coherence and temporal stability. Many lasing architectures have been proposed to circumvent this limitation, based on symmetries, topology, or interference. We show that a much simpler and robust method exploiting ph…
▽ More
Near-field coupling between nanolasers enables collective high-power lasing but leads to complex spectral resha** and multimode operation, limiting the emission brightness, spatial coherence and temporal stability. Many lasing architectures have been proposed to circumvent this limitation, based on symmetries, topology, or interference. We show that a much simpler and robust method exploiting phase-delayed coupling, where light exchanged by the lasers carries a phase, can enable stable single-mode operation. Phase-delayed coupling changes the modal amplification: for pump powers close to the anyonic parity-time (PT) symmetric exceptional point, a high phase delay completely separates the mode thresholds, leading to single mode operation. This is shown by stability analysis with nonlinear coupled mode theory and stochastic differential equations for two coupled nanolasers and confirmed by realistic semi-analytical treatment of a dimer of lasing nanospheres. Finally, we extend the mode control to large arrays of nanolasers, featuring lowered thresholds and higher power. Our work promises a novel solution to engineer bright and stable single-mode lasing from nanolaser arrays with important applications in photonic chips for communication and lidars.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Mode visualisation and control of complex lasers using neural networks
Authors:
Wai Kit Ng,
T. V. Raziman,
Dhruv Saxena,
Korneel Molkens,
Ivo Tanghe,
Zhenghe Xuan,
Pieter Geiregat,
Dries Van Thourhout,
Mauricio Barahona,
Riccardo Sapienza
Abstract:
Understanding the behaviour of complex laser systems is an outstanding challenge, especially in the presence of nonlinear interactions between modes. Hidden features, such as the gain distributions and spatial localisation of lasing modes, often cannot be revealed experimentally, yet they are crucial to determining the laser action. Here, we introduce a lasing spectroscopy method that can visualis…
▽ More
Understanding the behaviour of complex laser systems is an outstanding challenge, especially in the presence of nonlinear interactions between modes. Hidden features, such as the gain distributions and spatial localisation of lasing modes, often cannot be revealed experimentally, yet they are crucial to determining the laser action. Here, we introduce a lasing spectroscopy method that can visualise the gain profiles of the modes in complex lasers using an artificial neural network. The spatial gain distributions of different lasing modes in a disorderly coupled microring array are reconstructed without prior knowledge of the laser topology. We further extend the neural network to a tandem neural network that can control the laser emission by matching the modal gain/loss profile to selectively enhance the targeted modes. This mode visualisation method offers a new approach to extracting hidden spatial mode features from photonic structures, which could improve our understanding and control of complex photonic systems.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Moment-based parameter inference with error guarantees for stochastic reaction networks
Authors:
Zekai Li,
Mauricio Barahona,
Philipp Thomas
Abstract:
Inferring parameters of models of biochemical kinetics from single-cell data remains challenging because of the uncertainty arising from the intractability of the likelihood function of stochastic reaction networks. Such uncertainty falls beyond current error quantification measures, which focus on the effects of finite sample size and identifiability but lack theoretical guarantees when likelihoo…
▽ More
Inferring parameters of models of biochemical kinetics from single-cell data remains challenging because of the uncertainty arising from the intractability of the likelihood function of stochastic reaction networks. Such uncertainty falls beyond current error quantification measures, which focus on the effects of finite sample size and identifiability but lack theoretical guarantees when likelihood approximations are needed. Here, we propose an inference method for stochastic reaction networks with nonlinear and rational propensities at steady state that provides bounds on the parameters via convex optimisation over sets constrained by moment equations and moment matrices. Our approach takes observations from the stochastic reaction network and forms moment intervals, which are then used to constrain parameters through convex sets. The bounds on the parameters contain the true parameters under the condition that the moment intervals contain the true stationary moments, thus providing uncertainty quantification and error guarantees. Our approach does not need to predict moments and distributions for given parameters (i.e., it avoids solving or simulating the forward problem), and hence circumvents intractable likelihood computations or computationally expensive simulations. We demonstrate its use for uncertainty quantification, data integration and prediction of latent species statistics through synthetic data from common nonlinear biochemical models including the Schlögl model, the toggle switch and post-transcriptional regulation.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Patterns of co-occurrent skills in UK job adverts
Authors:
Zhaolu Liu,
Jonathan M. Clarke,
Bertha Rohenkohl,
Mauricio Barahona
Abstract:
A job usually involves the application of several complementary or synergistic skills to perform its required tasks. Such relationships are implicitly recognised by employers in the skills they demand when recruiting new employees. Here we construct a skills network based on their co-occurrence in a national level data set of 65 million job postings from the UK spanning 2016 to 2022. We then apply…
▽ More
A job usually involves the application of several complementary or synergistic skills to perform its required tasks. Such relationships are implicitly recognised by employers in the skills they demand when recruiting new employees. Here we construct a skills network based on their co-occurrence in a national level data set of 65 million job postings from the UK spanning 2016 to 2022. We then apply multiscale graph-based community detection to obtain data-driven skill clusters at different levels of resolution that reveal a modular structure across scales. Skill clusters display diverse levels of demand and occupy varying roles within the skills network: some have broad reach across the network (high closeness centrality) while others have higher levels of within-cluster containment, yet with high interconnection across clusters and no skill silos. The skill clusters also display varying levels of semantic similarity, highlighting the difference between co-occurrence in adverts and intrinsic thematic consistency. Clear geographic variation is evident in the demand for each skill cluster across the UK, broadly reflecting the industrial characteristics of each region, e.g., London appears as an outlier as an international hub for finance, education and business. Comparison of data from 2016 and 2022 reveals employers are demanding a broader range of skills over time, with more adverts featuring skills spanning different clusters. We also show that our data-driven clusters differ from expert-authored categorisations of skills, indicating that important relationships between skills are not captured by expert assessment alone.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
LGDE: Local Graph-based Dictionary Expansion
Authors:
Dominik J. Schindler,
Sneha Jha,
Xixuan Zhang,
Kilian Buehling,
Annett Heft,
Mauricio Barahona
Abstract:
Expanding a dictionary of pre-selected keywords is crucial for tasks in information retrieval, such as database query and online data collection. Here we propose Local Graph-based Dictionary Expansion (LGDE), a method that uses tools from manifold learning and network science for the data-driven discovery of keywords starting from a seed dictionary. At the heart of LGDE lies the creation of a word…
▽ More
Expanding a dictionary of pre-selected keywords is crucial for tasks in information retrieval, such as database query and online data collection. Here we propose Local Graph-based Dictionary Expansion (LGDE), a method that uses tools from manifold learning and network science for the data-driven discovery of keywords starting from a seed dictionary. At the heart of LGDE lies the creation of a word similarity graph derived from word embeddings and the application of local community detection based on graph diffusion to discover semantic neighbourhoods of pre-defined seed keywords. The diffusion in the local graph manifold allows the exploration of the complex nonlinear geometry of word embeddings and can capture word similarities based on paths of semantic association. We validate our method on a corpus of hate speech-related posts from Reddit and Gab and show that LGDE enriches the list of keywords and achieves significantly better performance than threshold methods based on direct word similarities. We further demonstrate the potential of our method through a real-world use case from communication science, where LGDE is evaluated quantitatively on data collected and analysed by domain experts by expanding a conspiracy-related dictionary.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
RamanSPy: An open-source Python package for integrative Raman spectroscopy data analysis
Authors:
Dimitar Georgiev,
Simon Vilms Pedersen,
Ruoxiao Xie,
Álvaro Fernández-Galiana,
Molly M. Stevens,
Mauricio Barahona
Abstract:
Raman spectroscopy is a non-destructive and label-free chemical analysis technique, which plays a key role in the analysis and discovery cycle of various branches of science. Nonetheless, progress in Raman spectroscopic analysis is still impeded by the lack of software, methodological and data standardisation, and the ensuing fragmentation and lack of reproducibility of analysis workflows thereof.…
▽ More
Raman spectroscopy is a non-destructive and label-free chemical analysis technique, which plays a key role in the analysis and discovery cycle of various branches of science. Nonetheless, progress in Raman spectroscopic analysis is still impeded by the lack of software, methodological and data standardisation, and the ensuing fragmentation and lack of reproducibility of analysis workflows thereof. To address these issues, we introduce RamanSPy, an open-source Python package for Raman spectroscopic research and analysis. RamanSPy provides a comprehensive library of ready-to-use tools for spectroscopic analysis, which streamlines day-to-day tasks, integrative analyses, as well as novel research and algorithmic development. RamanSPy is modular and open source, not tied to a particular technology or data format, and can be readily interfaced with the burgeoning ecosystem for data science, statistical analysis and machine learning in Python.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Sensitivity and spectral control of network lasers
Authors:
Dhruv Saxena,
Alexis Arnaudon,
Oscar Cipolato,
Michele Gaio,
Alain Quentel,
Sophia Yaliraki,
Dario Pisignano,
Andrea Camposeo,
Mauricio Barahona,
Riccardo Sapienza
Abstract:
Recently, random lasing in complex networks has shown efficient lasing over more than 50 localised modes, promoted by multiple scattering over the underlying graph. If controlled, these network lasers can lead to fast-switching multifunctional light sources with synthesised spectrum. Here, we observe both in experiment and theory high sensitivity of the network laser to the spatial shape of the pu…
▽ More
Recently, random lasing in complex networks has shown efficient lasing over more than 50 localised modes, promoted by multiple scattering over the underlying graph. If controlled, these network lasers can lead to fast-switching multifunctional light sources with synthesised spectrum. Here, we observe both in experiment and theory high sensitivity of the network laser to the spatial shape of the pump profile, with mode intensity variation of up to 280% for a non-homogeneous 7% pump decrease. We solve the nonlinear equations within the steady state ab-initio laser theory (SALT) approximation over a graph and we show selective lasing of around 90% of the top modes, effectively programming the spectrum of the lasing networks. In our experiments with polymer networks, this high sensitivity enables control of the lasing spectrum through non-uniform pump patterns. We propose the underlying complexity of the network modes as the key element behind efficient spectral control opening the way for the development of optical devices with wide impact for on-chip photonics for communication, sensing and computation.
△ Less
Submitted 31 March, 2022;
originally announced March 2022.
-
Multiscale mobility patterns and the restriction of human movement
Authors:
Dominik J. Schindler,
Jonathan Clarke,
Mauricio Barahona
Abstract:
From the perspective of human mobility, the COVID-19 pandemic constituted a natural experiment of enormous reach in space and time. Here, we analyse the inherent multiple scales of human mobility using Facebook Movement Maps collected before and during the first UK lockdown. First, we obtain the pre-lockdown UK mobility graph, and employ multiscale community detection to extract, in an unsupervise…
▽ More
From the perspective of human mobility, the COVID-19 pandemic constituted a natural experiment of enormous reach in space and time. Here, we analyse the inherent multiple scales of human mobility using Facebook Movement Maps collected before and during the first UK lockdown. First, we obtain the pre-lockdown UK mobility graph, and employ multiscale community detection to extract, in an unsupervised manner, a set of robust partitions into flow communities at different levels of coarseness. The partitions so obtained capture intrinsic mobility scales with better coverage than NUTS regions, which suffer from mismatches between human mobility and administrative divisions. Furthermore, the flow communities in the fine scale partition match well the UK Travel to Work Areas (TTWAs) but also capture mobility patterns beyond commuting to work. We also examine the evolution of mobility under lockdown, and show that mobility first reverted towards fine scale flow communities already found in the pre-lockdown data, and then expanded back towards coarser flow communities as restrictions were lifted. The improved coverage induced by lockdown is well captured by a linear decay shock model, which allows us to quantify regional differences both in the strength of the effect and the recovery time from the lockdown shock.
△ Less
Submitted 14 August, 2023; v1 submitted 17 January, 2022;
originally announced January 2022.
-
Relative, local and global dimension in complex networks
Authors:
Robert L. Peach,
Alexis Arnaudon,
Mauricio Barahona
Abstract:
Dimension is a fundamental property of objects and the space in which they are embedded. Yet ideal notions of dimension, as in Euclidean spaces, do not always translate to physical spaces, which can be constrained by boundaries and distorted by inhomogeneities, or to intrinsically discrete systems such as networks. To take into account locality, finiteness and discreteness, dynamical processes can…
▽ More
Dimension is a fundamental property of objects and the space in which they are embedded. Yet ideal notions of dimension, as in Euclidean spaces, do not always translate to physical spaces, which can be constrained by boundaries and distorted by inhomogeneities, or to intrinsically discrete systems such as networks. To take into account locality, finiteness and discreteness, dynamical processes can be used to probe the space geometry and define its dimension. Here we show that each point in space can be assigned a relative dimension with respect to the source of a diffusive process, a concept that provides a scale-dependent definition for local and global dimension also applicable to networks. To showcase its application to physical systems, we demonstrate that the local dimension of structural protein graphs correlates with structural flexibility, and the relative dimension with respect to the active site uncovers regions involved in allosteric communication. In simple models of epidemics on networks, the relative dimension is predictive of the spreading capability of nodes, and identifies scales at which the graph structure is predictive of infectivity. We further apply our dimension measures to neuronal networks, economic trade, social networks, ocean flows, and to the comparison of random graphs.
△ Less
Submitted 19 May, 2022; v1 submitted 9 June, 2021;
originally announced June 2021.
-
Spatial shifts in productivity of the coastal ocean over the past two decades induced by migration of the Pacific Anticyclone and Bakun effect in the Humboldt Upwelling Ecosystem
Authors:
Nicolas Weidberg,
Andres Ospina-Alvarez,
Jessica Bonicelli,
Mario Barahona,
Christopher M. Aiken,
Bernardo R. Broitman,
Sergio A. Navarrete
Abstract:
Intensification and poleward expansion of upwelling favourable winds have been predicted as a response to anthropogenic global climate change and have recently been documented in most Eastern Boundary Upwelling Ecosystems of the world. To identify how these processes are impacting nearshore oceanographic habitats and, especially, long term trends of primary productivity in the Humboldt Upwelling E…
▽ More
Intensification and poleward expansion of upwelling favourable winds have been predicted as a response to anthropogenic global climate change and have recently been documented in most Eastern Boundary Upwelling Ecosystems of the world. To identify how these processes are impacting nearshore oceanographic habitats and, especially, long term trends of primary productivity in the Humboldt Upwelling Ecosystem (HUE), we analysed time series of sea level pressure, wind stress, sea surface and atmospheric surface temperatures, and Chlorophyll-a, as a proxy for primary productivity, along 26° - 36° S. We show that climate induced trends in primary productivity are highly heterogeneous across the region. On the one hand, the well documented poleward migration of the South Pacific Anticyclone (SPA) has led to decreased spring upwelling winds in the region between ca. 30° and 34° S, and to their intensification to the south. Decreased winds have produced slight increases in sea surface temperature and a pronounced and meridionally extensive decrease in surface Chlorophyll-a in this region of central Chile. To the north of 30° S, significant increases in upwelling winds, decreased SST, and enhanced Chlorophyll-a concentration are observed in the nearshore. We show that this increased in upwelling driven coastal productivity is probably produced by the increased land-sea pressure gradients (Bakun's effect) that have occurred over the past two decades north of 30° S. Thus, climate drivers along the HUE are inducing contrasting trends in oceanographic conditions and primary productivity, which can have far-reaching consequences for coastal pelagic and benthic ecosystems and lead to geographic displacements of the major fisheries.
△ Less
Submitted 16 April, 2021;
originally announced April 2021.
-
Network memory in the movement of hospital patients carrying drug-resistant bacteria
Authors:
Ashleigh C. Myall,
Robert L. Peach,
Andrea Y. Weiße,
Frances Davies,
Siddharth Mookerjee,
Alison Holmes,
Mauricio Barahona
Abstract:
Hospitals constitute highly interconnected systems that bring into contact an abundance of infectious pathogens and susceptible individuals, thus making infection outbreaks both common and challenging. In recent years, there has been a sharp incidence of antimicrobial-resistance amongst healthcare-associated infections, a situation now considered endemic in many countries. Here we present network-…
▽ More
Hospitals constitute highly interconnected systems that bring into contact an abundance of infectious pathogens and susceptible individuals, thus making infection outbreaks both common and challenging. In recent years, there has been a sharp incidence of antimicrobial-resistance amongst healthcare-associated infections, a situation now considered endemic in many countries. Here we present network-based analyses of a data set capturing the movement of patients harbouring drug-resistant bacteria across three large London hospitals. We show that there are substantial memory effects in the movement of hospital patients colonised with drug-resistant bacteria. Such memory effects break first-order Markovian transitive assumptions and substantially alter the conclusions from the analysis, specifically on node rankings and the evolution of diffusive processes. We capture variable length memory effects by constructing a lumped-state memory network, which we then use to identify overlap** communities of wards. We find that these communities of wards display a quasi-hierarchical structure at different levels of granularity which is consistent with different aspects of patient flows related to hospital locations and medical specialties.
△ Less
Submitted 7 October, 2020; v1 submitted 30 September, 2020;
originally announced September 2020.
-
Severability of mesoscale components and local time scales in dynamical networks
Authors:
Yun William Yu,
Jean-Charles Delvenne,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
A major goal of dynamical systems theory is the search for simplified descriptions of the dynamics of a large number of interacting states. For overwhelmingly complex dynamical systems, the derivation of a reduced description on the entire dynamics at once is computationally infeasible. Other complex systems are so expansive that despite the continual onslaught of new data only partial information…
▽ More
A major goal of dynamical systems theory is the search for simplified descriptions of the dynamics of a large number of interacting states. For overwhelmingly complex dynamical systems, the derivation of a reduced description on the entire dynamics at once is computationally infeasible. Other complex systems are so expansive that despite the continual onslaught of new data only partial information is available. To address this challenge, we define and optimise for a local quality function severability for measuring the dynamical coherency of a set of states over time. The theoretical underpinnings of severability lie in our local adaptation of the Simon-Ando-Fisher time-scale separation theorem, which formalises the intuition of local wells in the Markov landscape of a dynamical process, or the separation between a microscopic and a macroscopic dynamics. Finally, we demonstrate the practical relevance of severability by applying it to examples drawn from power networks, image segmentation, social networks, metabolic networks, and word association.
△ Less
Submitted 4 June, 2020;
originally announced June 2020.
-
Geometric graphs from data to aid classification tasks with graph convolutional networks
Authors:
Yifan Qian,
Paul Expert,
Pietro Panzarasa,
Mauricio Barahona
Abstract:
Traditional classification tasks learn to assign samples to given classes based solely on sample features. This paradigm is evolving to include other sources of information, such as known relations between samples. Here we show that, even if additional relational information is not available in the data set, one can improve classification by constructing geometric graphs from the features themselv…
▽ More
Traditional classification tasks learn to assign samples to given classes based solely on sample features. This paradigm is evolving to include other sources of information, such as known relations between samples. Here we show that, even if additional relational information is not available in the data set, one can improve classification by constructing geometric graphs from the features themselves, and using them within a Graph Convolutional Network. The improvement in classification accuracy is maximized by graphs that capture sample similarity with relatively low edge density. We show that such feature-derived graphs increase the alignment of the data to the ground truth while improving class separation. We also demonstrate that the graphs can be made more efficient using spectral sparsification, which reduces the number of edges while still improving classification performance. We illustrate our findings using synthetic and real-world data sets from various scientific domains.
△ Less
Submitted 13 April, 2021; v1 submitted 8 May, 2020;
originally announced May 2020.
-
An edge-based formulation of elastic network models
Authors:
Maxwell Hodges,
Sophia N Yaliraki,
Mauricio Barahona
Abstract:
We present an edge-based framework for the study of geometric elastic network models to model mechanical interactions in physical systems. We use a formulation in the edge space, instead of the usual node-centric approach, to characterise edge fluctuations of geometric networks defined in d- dimensional space and define the edge mechanical embeddedness, an edge mechanical susceptibility measuring…
▽ More
We present an edge-based framework for the study of geometric elastic network models to model mechanical interactions in physical systems. We use a formulation in the edge space, instead of the usual node-centric approach, to characterise edge fluctuations of geometric networks defined in d- dimensional space and define the edge mechanical embeddedness, an edge mechanical susceptibility measuring the force felt on each edge given a force applied on the whole system. We further show that this formulation can be directly related to the infinitesimal rigidity of the network, which additionally permits three- and four-centre forces to be included in the network description. We exemplify the approach in protein systems, at both the residue and atomistic levels of description.
△ Less
Submitted 14 November, 2019;
originally announced November 2019.
-
Allostery and cooperativity in multimeric proteins: bond-to-bond propensities in ATCase
Authors:
Maxwell Hodges,
Mauricio Barahona,
Sophia N. Yaliraki
Abstract:
Aspartate carbamoyltransferase (ATCase) is a large dodecameric enzyme with six active sites that exhibits allostery: its catalytic rate is modulated by the binding of various substrates at distal points from the active sites. A recently developed method, bond-to-bond propensity analysis, has proven capable of predicting allosteric sites in a wide range of proteins using an energy-weighted atomisti…
▽ More
Aspartate carbamoyltransferase (ATCase) is a large dodecameric enzyme with six active sites that exhibits allostery: its catalytic rate is modulated by the binding of various substrates at distal points from the active sites. A recently developed method, bond-to-bond propensity analysis, has proven capable of predicting allosteric sites in a wide range of proteins using an energy-weighted atomistic graph obtained from the protein structure and given knowledge only of the location of the active site. Bond-to-bond propensity establishes if energy fluctuations at given bonds have significant effects on any other bond in the protein, by considering their propagation through the protein graph. In this work, we use bond-to-bond propensity analysis to study different aspects of ATCase activity using three different protein structures and sources of fluctuations. First, we predict key residues and bonds involved in the transition between inactive (T) and active (R) states of ATCase by analysing allosteric substrate binding as a source of energy perturbations in the protein graph. Our computational results also indicate that the effect of multiple allosteric binding is non linear: a switching effect is observed after a particular number and arrangement of substrates is bound suggesting a form of long range communication between the distantly arranged allosteric sites. Second, cooperativity is explored by considering a bisubstrate analogue as the source of energy fluctuations at the active site, also leading to the identification of highly significant residues to the T-R transition that enhance cooperativity across active sites. Finally, the inactive (T) structure is shown to exhibit a strong, non linear communication between the allosteric sites and the interface between catalytic subunits, rather than the active site.
△ Less
Submitted 27 September, 2019;
originally announced September 2019.
-
Semi-supervised classification on graphs using explicit diffusion dynamics
Authors:
Robert L. Peach,
Alexis Arnaudon,
Mauricio Barahona
Abstract:
Classification tasks based on feature vectors can be significantly improved by including within deep learning a graph that summarises pairwise relationships between the samples. Intuitively, the graph acts as a conduit to channel and bias the inference of class labels. Here, we study classification methods that consider the graph as the originator of an explicit graph diffusion. We show that appen…
▽ More
Classification tasks based on feature vectors can be significantly improved by including within deep learning a graph that summarises pairwise relationships between the samples. Intuitively, the graph acts as a conduit to channel and bias the inference of class labels. Here, we study classification methods that consider the graph as the originator of an explicit graph diffusion. We show that appending graph diffusion to feature-based learning as an \textit{a posteriori} refinement achieves state-of-the-art classification accuracy. This method, which we call Graph Diffusion Reclassification (GDR), uses overshooting events of a diffusive graph dynamics to reclassify individual nodes. The method uses intrinsic measures of node influence, which are distinct for each node, and allows the evaluation of the relationship and importance of features and graph for classification. We also present diff-GCN, a simple extension of Graph Convolutional Neural Network (GCN) architectures that leverages explicit diffusion dynamics, and allows the natural use of directed graphs. To showcase our methods, we use benchmark datasets of documents with associated citation data.
△ Less
Submitted 24 September, 2019;
originally announced September 2019.
-
Graph-based data clustering via multiscale community detection
Authors:
Zi**g Liu,
Mauricio Barahona
Abstract:
We present a graph-theoretical approach to data clustering, which combines the creation of a graph from the data with Markov Stability, a multiscale community detection framework. We show how the multiscale capabilities of the method allow the estimation of the number of clusters, as well as alleviating the sensitivity to the parameters in graph construction. We use both synthetic and benchmark re…
▽ More
We present a graph-theoretical approach to data clustering, which combines the creation of a graph from the data with Markov Stability, a multiscale community detection framework. We show how the multiscale capabilities of the method allow the estimation of the number of clusters, as well as alleviating the sensitivity to the parameters in graph construction. We use both synthetic and benchmark real datasets to compare and evaluate several graph construction methods and clustering algorithms, and show that multiscale graph-based clustering achieves improved performance compared to popular clustering methods without the need to set externally the number of clusters.
△ Less
Submitted 13 January, 2020; v1 submitted 6 September, 2019;
originally announced September 2019.
-
Cellular memory enhances bacterial chemotactic navigation in rugged environments
Authors:
Adam Gosztolai,
Mauricio Barahona
Abstract:
The response of microbes to external signals is mediated by biochemical networks with intrinsic time scales. These time scales give rise to a memory that impacts cellular behaviour. Here we study theoretically the role of cellular memory in Escherichia coli chemotaxis. Using an agent-based model, we show that cells with memory navigating rugged chemoattractant landscapes can enhance their drift sp…
▽ More
The response of microbes to external signals is mediated by biochemical networks with intrinsic time scales. These time scales give rise to a memory that impacts cellular behaviour. Here we study theoretically the role of cellular memory in Escherichia coli chemotaxis. Using an agent-based model, we show that cells with memory navigating rugged chemoattractant landscapes can enhance their drift speed by extracting information from environmental correlations. Maximal advantage is achieved when the memory is comparable to the time scale of fluctuations as perceived during swimming. We derive an analytical approximation for the drift velocity in rugged landscapes that explains the enhanced velocity, and recovers standard Keller-Segel gradient-sensing results in the limits when memory and fluctuation time scales are well separated. Our numerics also show that cellular memory can induce bet-hedging at the population level resulting in long-lived multi-modal distributions in heterogeneous landscapes.
△ Less
Submitted 13 February, 2020; v1 submitted 12 August, 2019;
originally announced August 2019.
-
Learning spatiotemporal signals using a recurrent spiking network that discretizes time
Authors:
Amadeus Maes,
Mauricio Barahona,
Claudia Clopath
Abstract:
Learning to produce spatiotemporal sequences is a common task that the brain has to solve. The same neural substrate may be used by the brain to produce different sequential behaviours. The way the brain learns and encodes such tasks remains unknown as current computational models do not typically use realistic biologically-plausible learning. Here, we propose a model where a spiking recurrent net…
▽ More
Learning to produce spatiotemporal sequences is a common task that the brain has to solve. The same neural substrate may be used by the brain to produce different sequential behaviours. The way the brain learns and encodes such tasks remains unknown as current computational models do not typically use realistic biologically-plausible learning. Here, we propose a model where a spiking recurrent network of excitatory and inhibitory biophysical neurons drives a read-out layer: the dynamics of the driver recurrent network is trained to encode time which is then mapped through the read-out neurons to encode another dimension, such as space or a phase. Different spatiotemporal patterns can be learned and encoded through the synaptic weights to the read-out neurons that follow common Hebbian learning rules. We demonstrate that the model is able to learn spatiotemporal dynamics on time scales that are behaviourally relevant and we show that the learned sequences are robustly replayed during a regime of spontaneous activity.
△ Less
Submitted 19 December, 2019; v1 submitted 20 July, 2019;
originally announced July 2019.
-
Scale-dependent measure of network centrality from diffusion dynamics
Authors:
Alexis Arnaudon,
Robert L. Peach,
Mauricio Barahona
Abstract:
Classic measures of graph centrality capture distinct aspects of node importance, from the local (e.g., degree) to the global (e.g., closeness). Here we exploit the connection between diffusion and geometry to introduce a multiscale centrality measure. A node is defined to be central if it breaks the metricity of the diffusion as a consequence of the effective boundaries and inhomogeneities in the…
▽ More
Classic measures of graph centrality capture distinct aspects of node importance, from the local (e.g., degree) to the global (e.g., closeness). Here we exploit the connection between diffusion and geometry to introduce a multiscale centrality measure. A node is defined to be central if it breaks the metricity of the diffusion as a consequence of the effective boundaries and inhomogeneities in the graph. Our measure is naturally multiscale, as it is computed relative to graph neighbourhoods within the varying time horizon of the diffusion. We find that the centrality of nodes can differ widely at different scales. In particular, our measure correlates with degree (i.e., hubs) at small scales and with closeness (i.e., bridges) at large scales, and also reveals the existence of multi-centric structures in complex networks. By examining centrality across scales, our measure thus provides an evaluation of node importance relative to local and global processes on the network.
△ Less
Submitted 21 July, 2020; v1 submitted 19 July, 2019;
originally announced July 2019.
-
Quantifying the Alignment of Graph and Features in Deep Learning
Authors:
Yifan Qian,
Paul Expert,
Tom Rieu,
Pietro Panzarasa,
Mauricio Barahona
Abstract:
We show that the classification performance of graph convolutional networks (GCNs) is related to the alignment between features, graph, and ground truth, which we quantify using a subspace alignment measure (SAM) corresponding to the Frobenius norm of the matrix of pairwise chordal distances between three subspaces associated with features, graph, and ground truth. The proposed measure is based on…
▽ More
We show that the classification performance of graph convolutional networks (GCNs) is related to the alignment between features, graph, and ground truth, which we quantify using a subspace alignment measure (SAM) corresponding to the Frobenius norm of the matrix of pairwise chordal distances between three subspaces associated with features, graph, and ground truth. The proposed measure is based on the principal angles between subspaces and has both spectral and geometrical interpretations. We showcase the relationship between the SAM and the classification performance through the study of limiting cases of GCNs and systematic randomizations of both features and graph structure applied to a constructive example and several examples of citation networks of different origins. The analysis also reveals the relative importance of the graph and features for classification purposes.
△ Less
Submitted 26 January, 2021; v1 submitted 30 May, 2019;
originally announced May 2019.
-
Data-driven unsupervised clustering of online learner behaviour
Authors:
Robert L. Peach,
Sophia N. Yaliraki,
David Lefevre,
Mauricio Barahona
Abstract:
The widespread adoption of online courses opens opportunities for the analysis of learner behaviour and for the optimisation of web-based material adapted to observed usage. Here we introduce a mathematical framework for the analysis of time series collected from online engagement of learners, which allows the identification of clusters of learners with similar online behaviour directly from the d…
▽ More
The widespread adoption of online courses opens opportunities for the analysis of learner behaviour and for the optimisation of web-based material adapted to observed usage. Here we introduce a mathematical framework for the analysis of time series collected from online engagement of learners, which allows the identification of clusters of learners with similar online behaviour directly from the data, i.e., the groups of learners are not pre-determined subjectively but emerge algorithmically from the analysis and the data.The method uses a dynamic time war** kernel to create a pairwise similarity between time series of learner actions, and combines it with an unsupervised multiscale graph clustering algorithm to cluster groups of learners with similar patterns of behaviour. We showcase our approach on online engagement data of adult learners taking six web-based courses as part of a post-graduate degree at Imperial Business School. Our analysis identifies clusters of learners with statistically distinct patterns of engagement, ranging from distributed to massed learning, with different levels of adherence to pre-planned course structure and/or task completion, and also revealing outlier learners with highly sporadic behaviour. A posteriori comparison with performance showed that, although the majority of low-performing learners are part of in the massed learning cluster, the high performing learners are distributed across clusters with different traits of online engagement. We also show that our methodology is able to identify low performing learners more accurately than common classification methods based on raw statistics extracted from the data.
△ Less
Submitted 16 July, 2019; v1 submitted 11 February, 2019;
originally announced February 2019.
-
Collective search with finite perception: transient dynamics and search efficiency
Authors:
Adam Gosztolai,
Jose A. Carrillo,
Mauricio Barahona
Abstract:
Motile organisms often use finite spatial perception of their surroundings to navigate and search their habitats. Yet standard models of search are usually based on purely local sensory information. To model how a finite perceptual horizon affects ecological search, we propose a framework for optimal navigation that combines concepts from random walks and optimal control theory. We show that, whil…
▽ More
Motile organisms often use finite spatial perception of their surroundings to navigate and search their habitats. Yet standard models of search are usually based on purely local sensory information. To model how a finite perceptual horizon affects ecological search, we propose a framework for optimal navigation that combines concepts from random walks and optimal control theory. We show that, while local strategies are optimal on asymptotically long and short search times, finite perception yields faster convergence and increased search efficiency over transient time scales relevant in biological systems. The benefit of the finite horizon can be maintained by the searchers tuning their response sensitivity to the length scale of the stimulant in the environment, and is enhanced when the agents interact as a result of increased consensus within subpopulations. Our framework sheds light on the role of spatial perception and transients in search movement and collective sensing of the environment.
△ Less
Submitted 13 December, 2018; v1 submitted 17 September, 2018;
originally announced September 2018.
-
Global Network Prediction from Local Node Dynamics
Authors:
Neave O'Clery,
Ye Yuan,
Guy-Bart Stan,
Mauricio Barahona
Abstract:
The study of dynamical systems on networks, describing complex interactive processes, provides insight into how network structure affects global behaviour. Yet many methods for network dynamics fail to cope with large or partially-known networks, a ubiquitous situation in real-world applications. Here we propose a localised method, applicable to a broad class of dynamical models on networks, where…
▽ More
The study of dynamical systems on networks, describing complex interactive processes, provides insight into how network structure affects global behaviour. Yet many methods for network dynamics fail to cope with large or partially-known networks, a ubiquitous situation in real-world applications. Here we propose a localised method, applicable to a broad class of dynamical models on networks, whereby individual nodes monitor and store the evolution of their own state and use these values to approximate, via a simple computation, their own steady state solution. Hence the nodes predict their own final state without actually reaching it. Furthermore, the localised formulation enables nodes to compute global network metrics without knowledge of the full network structure. The method can be used to compute global rankings in the network from local information; to detect community detection from fast, local transient dynamics; and to identify key nodes that compute global network metrics ahead of others. We illustrate some of the applications of the algorithm by efficiently performing web-page ranking for a large internet network and identifying the dynamic roles of inter-neurons in the C. Elegans neural network. The mathematical formulation is simple, widely applicable and easily scalable to real-world datasets suggesting how local computation can provide an approach to the study of large-scale network dynamics.
△ Less
Submitted 2 September, 2018;
originally announced September 2018.
-
Structured networks and coarse-grained descriptions: a dynamical perspective
Authors:
Michael T. Schaub,
Jean-Charles Delvenne,
Renaud Lambiotte,
Mauricio Barahona
Abstract:
This chapter discusses the interplay between structure and dynamics in complex networks. Given a particular network with an endowed dynamics, our goal is to find partitions aligned with the dynamical process acting on top of the network. We thus aim to gain a reduced description of the system that takes into account both its structure and dynamics. In the first part, we introduce the general mathe…
▽ More
This chapter discusses the interplay between structure and dynamics in complex networks. Given a particular network with an endowed dynamics, our goal is to find partitions aligned with the dynamical process acting on top of the network. We thus aim to gain a reduced description of the system that takes into account both its structure and dynamics. In the first part, we introduce the general mathematical setup for the types of dynamics we consider throughout the chapter. We provide two guiding examples, namely consensus dynamics and diffusion processes (random walks), motivating their connection to social network analysis, and provide a brief discussion on the general dynamical framework and its possible extensions. In the second part, we focus on the influence of graph structure on the dynamics taking place on the network, focusing on three concepts that allow us to gain insight into this notion. First, we describe how time scale separation can appear in the dynamics on a network as a consequence of graph structure. Second, we discuss how the presence of particular symmetries in the network give rise to invariant dynamical subspaces that can be precisely described by graph partitions. Third, we show how this dynamical viewpoint can be extended to study dynamics on networks with signed edges, which allow us to discuss connections to concepts in social network analysis, such as structural balance. In the third part, we discuss how to use dynamical processes unfolding on the network to detect meaningful network substructures. We then show how such dynamical measures can be related to seemingly different algorithm for community detection and coarse-graining proposed in the literature. We conclude with a brief summary and highlight interesting open future directions.
△ Less
Submitted 17 April, 2018;
originally announced April 2018.
-
Multiscale dynamical embeddings of complex networks
Authors:
Michael T. Schaub,
Jean-Charles Delvenne,
Renaud Lambiotte,
Mauricio Barahona
Abstract:
Complex systems and relational data are often abstracted as dynamical processes on networks. To understand, predict and control their behavior, a crucial step is to extract reduced descriptions of such networks. Inspired by notions from Control Theory, we propose a time-dependent dynamical similarity measure between nodes, which quantifies the effect a node-input has on the network. This dynamical…
▽ More
Complex systems and relational data are often abstracted as dynamical processes on networks. To understand, predict and control their behavior, a crucial step is to extract reduced descriptions of such networks. Inspired by notions from Control Theory, we propose a time-dependent dynamical similarity measure between nodes, which quantifies the effect a node-input has on the network. This dynamical similarity induces an embedding that can be employed for several analysis tasks. Here we focus on (i)~dimensionality reduction, i.e., projecting nodes onto a low dimensional space that captures dynamic similarity at different time scales, and (ii)~how to exploit our embeddings to uncover functional modules. We exemplify our ideas through case studies focusing on directed networks without strong connectivity, and signed networks. We further highlight how certain ideas from community detection can be generalized and linked to Control Theory, by using the here developed dynamical perspective.
△ Less
Submitted 24 June, 2019; v1 submitted 10 April, 2018;
originally announced April 2018.
-
Geometric Multiscale Community Detection: Markov Stability and Vector Partitioning
Authors:
Zi**g Liu,
Mauricio Barahona
Abstract:
Multiscale community detection can be viewed from a dynamical perspective within the Markov Stability framework, which uses the diffusion of a Markov process on the graph to uncover intrinsic network substructures across all scales. Here we reformulate multiscale community detection as a max-sum length vector partitioning problem with respect to the set of time-dependent node vectors expressed in…
▽ More
Multiscale community detection can be viewed from a dynamical perspective within the Markov Stability framework, which uses the diffusion of a Markov process on the graph to uncover intrinsic network substructures across all scales. Here we reformulate multiscale community detection as a max-sum length vector partitioning problem with respect to the set of time-dependent node vectors expressed in terms of eigenvectors of the transition matrix. This formulation provides a geometric interpretation of Markov Stability in terms of a time-dependent spectral embedding, where the Markov time acts as an inhomogeneous geometric resolution factor that zooms the components of the node vectors at different rates. Our geometric formulation encompasses both modularity and the multi-resolution Potts model, which are shown to correspond to vector partitioning in a pseudo-Euclidean space, and is also linked to spectral partitioning methods, where the number of eigenvectors used corresponds to the dimensionality of the underlying embedding vector space. Inspired by the Louvain optimisation for community detection, we then propose an algorithm based on a graph-theoretical heuristic for the vector partitioning problem. We apply the algorithm to the spectral optimisation of modularity and Markov Stability community detection. The spectral embedding based on the transition matrix eigenvectors leads to improved partitions with higher information content and higher modularity than the eigen-decomposition of the modularity matrix. We illustrate the results with random network benchmarks.
△ Less
Submitted 10 August, 2017;
originally announced August 2017.
-
Graph partitions and cluster synchronization in networks of oscillators
Authors:
Michael T. Schaub,
Neave O'Clery,
Yazan N. Billeh,
Jean-Charles Delvenne,
Renaud Lambiotte,
Mauricio Barahona
Abstract:
Synchronization over networks depends strongly on the structure of the coupling between the oscillators. When the coupling presents certain regularities, the dynamics can be coarse-grained into clusters by means of External Equitable Partitions of the network graph and their associated quotient graphs. We exploit this graph-theoretical concept to study the phenomenon of cluster synchronization, in…
▽ More
Synchronization over networks depends strongly on the structure of the coupling between the oscillators. When the coupling presents certain regularities, the dynamics can be coarse-grained into clusters by means of External Equitable Partitions of the network graph and their associated quotient graphs. We exploit this graph-theoretical concept to study the phenomenon of cluster synchronization, in which different groups of nodes converge to distinct behaviors. We derive conditions and properties of networks in which such clustered behavior emerges, and show that the ensuing dynamics is the result of the localization of the eigenvectors of the associated graph Laplacians linked to the existence of invariant subspaces. The framework is applied to both linear and non-linear models, first for the standard case of networks with positive edges, before being generalized to the case of signed networks with both positive and negative interactions. We illustrate our results with examples of both signed and unsigned graphs for consensus dynamics and for partial synchronization of oscillator networks under the master stability function as well as Kuramoto oscillators.
△ Less
Submitted 16 August, 2016; v1 submitted 15 August, 2016;
originally announced August 2016.
-
Prediction of allosteric sites and mediating interactions through bond-to-bond propensities
Authors:
Benjamin R. C. Amor,
Michael T. Schaub,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
Allosteric regulation is central to many biochemical processes. Allosteric sites provide a target to fine-tune protein activity, yet we lack computational methods to predict them. Here, we present an efficient graph-theoretical approach for identifying allosteric sites and the mediating interactions that connect them to the active site. Using an atomistic graph with edges weighted by covalent and…
▽ More
Allosteric regulation is central to many biochemical processes. Allosteric sites provide a target to fine-tune protein activity, yet we lack computational methods to predict them. Here, we present an efficient graph-theoretical approach for identifying allosteric sites and the mediating interactions that connect them to the active site. Using an atomistic graph with edges weighted by covalent and non-covalent bond energies, we obtain a bond-to-bond propensity that quantifies the effect of instantaneous bond fluctuations propagating through the protein. We use this propensity to detect the sites and communication pathways most strongly linked to the active site, assessing their significance through quantile regression and comparison against a reference set of 100 generic proteins. We exemplify our method in detail with three well-studied allosteric proteins: caspase-1, CheY, and h-Ras, correctly predicting the location of the allosteric site and identifying key allosteric interactions. Consistent prediction of allosteric sites is then attained in a further set of 17 proteins known to exhibit allostery. Because our propensity measure runs in almost linear time, it offers a scalable approach to high-throughput searches for candidate allosteric sites.
△ Less
Submitted 31 May, 2016;
originally announced May 2016.
-
Flux-dependent graphs for metabolic networks
Authors:
Mariano Beguerisse-Díaz,
Gabriel Bosque,
Diego Oyarzún,
Jesús Picó,
Mauricio Barahona
Abstract:
Cells adapt their metabolic fluxes in response to changes in the environment. We present a framework for the systematic construction of flux-based graphs derived from organism-wide metabolic networks. Our graphs encode the directionality of metabolic fluxes via edges that represent the flow of metabolites from source to target reactions. The methodology can be applied in the absence of a specific…
▽ More
Cells adapt their metabolic fluxes in response to changes in the environment. We present a framework for the systematic construction of flux-based graphs derived from organism-wide metabolic networks. Our graphs encode the directionality of metabolic fluxes via edges that represent the flow of metabolites from source to target reactions. The methodology can be applied in the absence of a specific biological context by modelling fluxes probabilistically, or can be tailored to different environmental conditions by incorporating flux distributions computed through constraint-based approaches such as Flux Balance Analysis. We illustrate our approach on the central carbon metabolism of Escherichia coli and on a metabolic model of human hepatocytes. The flux-dependent graphs under various environmental conditions and genetic perturbations exhibit systemic changes in their topological and community structure, which capture the re-routing of metabolic fluxes and the varying importance of specific reactions and pathways. By integrating constraint-based models and tools from network science, our framework allows the study of context-specific metabolic responses at a system level beyond standard pathway descriptions.
△ Less
Submitted 28 March, 2018; v1 submitted 5 May, 2016;
originally announced May 2016.
-
Flow-based network analysis of the Caenorhabditis elegans connectome
Authors:
Karol A. Bacik,
Michael T. Schaub,
Mariano Beguerisse-Díaz,
Yazan N. Billeh,
Mauricio Barahona
Abstract:
We exploit flow propagation on the directed neuronal network of the nematode Caenorhabditis elegans to reveal dynamically relevant features of its connectome. We find flow-based grou**s of neurons at different levels of granularity, which we relate to functional and anatomical constituents of its nervous system. A systematic in silico evaluation of the full set of single and double neuron ablati…
▽ More
We exploit flow propagation on the directed neuronal network of the nematode Caenorhabditis elegans to reveal dynamically relevant features of its connectome. We find flow-based grou**s of neurons at different levels of granularity, which we relate to functional and anatomical constituents of its nervous system. A systematic in silico evaluation of the full set of single and double neuron ablations is used to identify deletions that induce the most severe disruptions of the multi-resolution flow structure. Such ablations are linked to functionally relevant neurons, and suggest potential candidates for further in vivo investigation. In addition, we use the directional patterns of incoming and outgoing network flows at all scales to identify flow profiles for the neurons in the connectome, without pre-imposing a priori categories. The four flow roles identified are linked to signal propagation motivated by biological input-response scenarios.
△ Less
Submitted 8 August, 2016; v1 submitted 2 November, 2015;
originally announced November 2015.
-
The 'who' and 'what' of #diabetes on Twitter
Authors:
Mariano Beguerisse-Díaz,
Amy K. McLennan,
Guillermo Garduño-Hernández,
Mauricio Barahona,
Stanley J. Ulijaszek
Abstract:
Social media are being increasingly used for health promotion, yet the landscape of users, messages and interactions in such fora is poorly understood. Studies of social media and diabetes have focused mostly on patients, or public agencies addressing it, but have not looked broadly at all the participants or the diversity of content they contribute. We study Twitter conversations about diabetes t…
▽ More
Social media are being increasingly used for health promotion, yet the landscape of users, messages and interactions in such fora is poorly understood. Studies of social media and diabetes have focused mostly on patients, or public agencies addressing it, but have not looked broadly at all the participants or the diversity of content they contribute. We study Twitter conversations about diabetes through the systematic analysis of 2.5 million tweets collected over 8 months and the interactions between their authors. We address three questions: (1) what themes arise in these tweets?, (2) who are the most influential users?, (3) which type of users contribute to which themes? We answer these questions using a mixed-methods approach, integrating techniques from anthropology, network science and information retrieval such as thematic coding, temporal network analysis, and community and topic detection. Diabetes-related tweets fall within broad thematic groups: health information, news, social interaction, and commercial. At the same time, humorous messages and references to popular culture appear consistently, more than any other type of tweet. We classify authors according to their temporal 'hub' and 'authority' scores. Whereas the hub landscape is diffuse and fluid over time, top authorities are highly persistent across time and comprise bloggers, advocacy groups and NGOs related to diabetes, as well as for-profit entities without specific diabetes expertise. Top authorities fall into seven interest communities as derived from their Twitter follower network. Our findings have implications for public health professionals and policy makers who seek to use social media as an engagement tool and to inform policy design.
△ Less
Submitted 30 January, 2017; v1 submitted 24 August, 2015;
originally announced August 2015.
-
Community detection and role identification in directed networks: understanding the Twitter network of the care.data debate
Authors:
B. Amor,
S. Vuik,
R. Callahan,
A. Darzi,
S. N. Yaliraki,
M. Barahona
Abstract:
With the rise of social media as an important channel for the debate and discussion of public affairs, online social networks such as Twitter have become important platforms for public information and engagement by policy makers. To communicate effectively through Twitter, policy makers need to understand how influence and interest propagate within its network of users. In this chapter we use grap…
▽ More
With the rise of social media as an important channel for the debate and discussion of public affairs, online social networks such as Twitter have become important platforms for public information and engagement by policy makers. To communicate effectively through Twitter, policy makers need to understand how influence and interest propagate within its network of users. In this chapter we use graph-theoretic methods to analyse the Twitter debate surrounding NHS England's controversial care.data scheme. Directionality is a crucial feature of the Twitter social graph - information flows from the followed to the followers - but is often ignored in social network analyses; our methods are based on the behaviour of dynamic processes on the network and can be applied naturally to directed networks. We uncover robust communities of users and show that these communities reflect how information flows through the Twitter network. We are also able to classify users by their differing roles in directing the flow of information through the network. Our methods and results will be useful to policy makers who would like to use Twitter effectively as a communication medium.
△ Less
Submitted 13 August, 2015;
originally announced August 2015.
-
Great cities look small
Authors:
Aaron Sim,
Sophia N Yaliraki,
Mauricio Barahona,
Michael P H Stumpf
Abstract:
Great cities connect people; failed cities isolate people. Despite the fundamental importance of physical, face-to-face social-ties in the functioning of cities, these connectivity networks are not explicitly observed in their entirety. Attempts at estimating them often rely on unrealistic over-simplifications such as the assumption of spatial homogeneity. Here we propose a mathematical model of h…
▽ More
Great cities connect people; failed cities isolate people. Despite the fundamental importance of physical, face-to-face social-ties in the functioning of cities, these connectivity networks are not explicitly observed in their entirety. Attempts at estimating them often rely on unrealistic over-simplifications such as the assumption of spatial homogeneity. Here we propose a mathematical model of human interactions in terms of a local strategy of maximising the number of beneficial connections attainable under the constraint of limited individual travelling-time budgets. By incorporating census and openly-available online multi-modal transport data, we are able to characterise the connectivity of geometrically and topologically complex cities. Beyond providing a candidate measure of greatness, this model allows one to quantify and assess the impact of transport developments, population growth, and other infrastructure and demographic changes on a city. Supported by validations of GDP and HIV infection rates across United States metropolitan areas, we illustrate the effect of changes in local and city-wide connectivities by considering the economic impact of two contemporary inter- and intra-city transport developments in the United Kingdom: High Speed Rail 2 and London Crossrail. This derivation of the model suggests that the scaling of different urban indicators with population size has an explicitly mechanistic origin.
△ Less
Submitted 20 July, 2015;
originally announced July 2015.
-
Random Walks, Markov Processes and the Multiscale Modular Organization of Complex Networks
Authors:
Renaud Lambiotte,
Jean-Charles Delvenne,
Mauricio Barahona
Abstract:
Most methods proposed to uncover communities in complex networks rely on combinatorial graph properties. Usually an edge-counting quality function, such as modularity, is optimized over all partitions of the graph compared against a null random graph model. Here we introduce a systematic dynamical framework to design and analyze a wide variety of quality functions for community detection. The qual…
▽ More
Most methods proposed to uncover communities in complex networks rely on combinatorial graph properties. Usually an edge-counting quality function, such as modularity, is optimized over all partitions of the graph compared against a null random graph model. Here we introduce a systematic dynamical framework to design and analyze a wide variety of quality functions for community detection. The quality of a partition is measured by its Markov Stability, a time-parametrized function defined in terms of the statistical properties of a Markov process taking place on the graph. The Markov process provides a dynamical swee** across all scales in the graph, and the time scale is an intrinsic parameter that uncovers communities at different resolutions.
This dynamic-based community detection leads to a compound optimization, which favours communities of comparable centrality (as defined by the stationary distribution), and provides a unifying framework for spectral algorithms, as well as different heuristics for community detection, including versions of modularity and Potts model. Our dynamic framework creates a systematic link between different stochastic dynamics and their corresponding notions of optimal communities under distinct (node and edge) centralities. We show that the Markov Stability can be computed efficiently to find multi-scale community structure in large networks.
△ Less
Submitted 15 February, 2015;
originally announced February 2015.
-
Revealing cell assemblies at multiple levels of granularity
Authors:
Yazan N. Billeh,
Michael T. Schaub,
Costas A. Anastassiou,
Mauricio Barahona,
Christof Koch
Abstract:
Background: Current neuronal monitoring techniques, such as calcium imaging and multi-electrode arrays, enable recordings of spiking activity from hundreds of neurons simultaneously. Of primary importance in systems neuroscience is the identification of cell assemblies: groups of neurons that cooperate in some form within the recorded population.
New Method: We introduce a simple, integrated fra…
▽ More
Background: Current neuronal monitoring techniques, such as calcium imaging and multi-electrode arrays, enable recordings of spiking activity from hundreds of neurons simultaneously. Of primary importance in systems neuroscience is the identification of cell assemblies: groups of neurons that cooperate in some form within the recorded population.
New Method: We introduce a simple, integrated framework for the detection of cell-assemblies from spiking data without a priori assumptions about the size or number of groups present. We define a biophysically-inspired measure to extract a directed functional connectivity matrix between both excitatory and inhibitory neurons based on their spiking history. The resulting network representation is analyzed using the Markov Stability framework, a graph theoretical method for community detection across scales, to reveal groups of neurons that are significantly related in the recorded time-series at different levels of granularity.
Results and comparison with existing methods: Using synthetic spike-trains, including simulated data from leaky-integrate-and-fire networks, our method is able to identify important patterns in the data such as hierarchical structure that are missed by other standard methods. We further apply the method to experimental data from retinal ganglion cells of mouse and salamander, in which we identify cell-groups that correspond to known functional types, and to hippocampal recordings from rats exploring a linear track, where we detect place cells with high fidelity.
Conclusions: We present a versatile method to detect neural assemblies in spiking data applicable across a spectrum of relevant scales that contributes to understanding spatio-temporal information gathered from systems neuroscience experiments.
△ Less
Submitted 8 November, 2014;
originally announced November 2014.
-
Interest communities and flow roles in directed networks: the Twitter network of the UK riots
Authors:
Mariano Beguerisse-Díaz,
Guillermo Garduño-Hernández,
Borislav Vangelov,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
Directionality is a crucial ingredient in many complex networks in which information, energy or influence are transmitted. In such directed networks, analysing flows (and not only the strength of connections) is crucial to reveal important features of the network that might go undetected if the orientation of connections is ignored. We showcase here a flow-based approach for community detection in…
▽ More
Directionality is a crucial ingredient in many complex networks in which information, energy or influence are transmitted. In such directed networks, analysing flows (and not only the strength of connections) is crucial to reveal important features of the network that might go undetected if the orientation of connections is ignored. We showcase here a flow-based approach for community detection in networks through the study of the network of the most influential Twitter users during the 2011 riots in England. Firstly, we use directed Markov Stability to extract descriptions of the network at different levels of coarseness in terms of interest communities, i.e., groups of nodes within which flows of information are contained and reinforced. Such interest communities reveal user grou**s according to location, profession, employer, and topic. The study of flows also allows us to generate an interest distance, which affords a personalised view of the attention in the network as viewed from the vantage point of any given user. Secondly, we analyse the profiles of incoming and outgoing long-range flows with a combined approach of role-based similarity and the novel relaxed minimum spanning tree algorithm to reveal that the users in the network can be classified into five roles. These flow roles go beyond the standard leader/follower dichotomy and differ from classifications based on regular/structural equivalence. We then show that the interest communities fall into distinct informational organigrams characterised by a different mix of user roles reflecting the quality of dialogue within them. Our generic framework can be used to provide insight into how flows are generated, distributed, preserved and consumed in directed networks.
△ Less
Submitted 8 October, 2014; v1 submitted 26 November, 2013;
originally announced November 2013.
-
Finding role communities in directed networks using Role-Based Similarity, Markov Stability and the Relaxed Minimum Spanning Tree
Authors:
Mariano Beguerisse-Díaz,
Borislav Vangelov,
Mauricio Barahona
Abstract:
We present a framework to cluster nodes in directed networks according to their roles by combining Role-Based Similarity (RBS) and Markov Stability, two techniques based on flows. First we compute the RBS matrix, which contains the pairwise similarities between nodes according to the scaled number of in- and out-directed paths of different lengths. The weighted RBS similarity matrix is then transf…
▽ More
We present a framework to cluster nodes in directed networks according to their roles by combining Role-Based Similarity (RBS) and Markov Stability, two techniques based on flows. First we compute the RBS matrix, which contains the pairwise similarities between nodes according to the scaled number of in- and out-directed paths of different lengths. The weighted RBS similarity matrix is then transformed into an undirected similarity network using the Relaxed Minimum-Spanning Tree (RMST) algorithm, which uses the geometric structure of the RBS matrix to unblur the network, such that edges between nodes with high, direct RBS are preserved. Finally, we partition the RMST similarity network into role-communities of nodes at all scales using Markov Stability to find a robust set of roles in the network. We showcase our framework through a biological and a man-made network.
△ Less
Submitted 6 September, 2013;
originally announced September 2013.
-
The stability of a graph partition: A dynamics-based framework for community detection
Authors:
Jean-Charles Delvenne,
Michael T. Schaub,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
Recent years have seen a surge of interest in the analysis of complex networks, facilitated by the availability of relational data and the increasingly powerful computational resources that can be employed for their analysis. Naturally, the study of real-world systems leads to highly complex networks and a current challenge is to extract intelligible, simplified descriptions from the network in te…
▽ More
Recent years have seen a surge of interest in the analysis of complex networks, facilitated by the availability of relational data and the increasingly powerful computational resources that can be employed for their analysis. Naturally, the study of real-world systems leads to highly complex networks and a current challenge is to extract intelligible, simplified descriptions from the network in terms of relevant subgraphs, which can provide insight into the structure and function of the overall system.
Sparked by seminal work by Newman and Girvan, an interesting line of research has been devoted to investigating modular community structure in networks, revitalising the classic problem of graph partitioning.
However, modular or community structure in networks has notoriously evaded rigorous definition. The most accepted notion of community is perhaps that of a group of elements which exhibit a stronger level of interaction within themselves than with the elements outside the community. This concept has resulted in a plethora of computational methods and heuristics for community detection. Nevertheless a firm theoretical understanding of most of these methods, in terms of how they operate and what they are supposed to detect, is still lacking to date.
Here, we will develop a dynamical perspective towards community detection enabling us to define a measure named the stability of a graph partition. It will be shown that a number of previously ad-hoc defined heuristics for community detection can be seen as particular cases of our method providing us with a dynamic reinterpretation of those measures. Our dynamics-based approach thus serves as a unifying framework to gain a deeper understanding of different aspects and problems associated with community detection and allows us to propose new dynamically-inspired criteria for community structure.
△ Less
Submitted 7 August, 2013;
originally announced August 2013.
-
Structure of complex networks: Quantifying edge-to-edge relations by failure-induced flow redistribution
Authors:
Michael T. Schaub,
Jörg Lehmann,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
The analysis of complex networks has so far revolved mainly around the role of nodes and communities of nodes. However, the dynamics of interconnected systems is commonly focalised on edge processes, and a dual edge-centric perspective can often prove more natural. Here we present graph-theoretical measures to quantify edge-to-edge relations inspired by the notion of flow redistribution induced by…
▽ More
The analysis of complex networks has so far revolved mainly around the role of nodes and communities of nodes. However, the dynamics of interconnected systems is commonly focalised on edge processes, and a dual edge-centric perspective can often prove more natural. Here we present graph-theoretical measures to quantify edge-to-edge relations inspired by the notion of flow redistribution induced by edge failures. Our measures, which are related to the pseudo-inverse of the Laplacian of the network, are global and reveal the dynamical interplay between the edges of a network, including potentially non-local interactions. Our framework also allows us to define the embeddedness of an edge, a measure of how strongly an edge features in the weighted cuts of the network. We showcase the general applicability of our edge-centric framework through analyses of the Iberian Power grid, traffic flow in road networks, and the C. elegans neuronal network.
△ Less
Submitted 7 April, 2014; v1 submitted 25 March, 2013;
originally announced March 2013.
-
Tracking the 2011 Student-led Collective Movement in Chile through Social Media Use
Authors:
Cristobal Garcia,
Pedro Parraguez,
Matias Barahona,
Peter Gloor
Abstract:
Using social media archives of the 2011 Chilean student unrest and dynamic social network analysis, we study how leaders and participants use social media such as Twitter, and the Web to self-organize and communicate with each other, and thus generate one of the biggest "smart movements" in the history of Chile. In this paper we i) describe the basic network topology of the 2011 student-led social…
▽ More
Using social media archives of the 2011 Chilean student unrest and dynamic social network analysis, we study how leaders and participants use social media such as Twitter, and the Web to self-organize and communicate with each other, and thus generate one of the biggest "smart movements" in the history of Chile. In this paper we i) describe the basic network topology of the 2011 student-led social movement in Chile; ii) explore how the student leaders are connected to, and how are they seen by (a) political leaders, and (b) University authorities; iii) hypothesize about key success factors and risk variables for the Student Network Movement's organization process and sustainability over time. We contend that this social media enabled massive movement is yet another manifestation of the network era, which leverages agents' socio-technical networks, and thus accelerates how agents coordinate, mobilize resources and enact collective intelligence.
△ Less
Submitted 17 April, 2012;
originally announced April 2012.
-
Encoding dynamics for multiscale community detection: Markov time swee** for the Map equation
Authors:
Michael T. Schaub,
Renaud Lambiotte,
Mauricio Barahona
Abstract:
The detection of community structure in networks is intimately related to finding a concise description of the network in terms of its modules. This notion has been recently exploited by the Map equation formalism (M. Rosvall and C.T. Bergstrom, PNAS, 105(4), pp.1118--1123, 2008) through an information-theoretic description of the process of coding inter- and intra-community transitions of a rando…
▽ More
The detection of community structure in networks is intimately related to finding a concise description of the network in terms of its modules. This notion has been recently exploited by the Map equation formalism (M. Rosvall and C.T. Bergstrom, PNAS, 105(4), pp.1118--1123, 2008) through an information-theoretic description of the process of coding inter- and intra-community transitions of a random walker in the network at stationarity. However, a thorough study of the relationship between the full Markov dynamics and the coding mechanism is still lacking. We show here that the original Map coding scheme, which is both block-averaged and one-step, neglects the internal structure of the communities and introduces an upper scale, the `field-of-view' limit, in the communities it can detect. As a consequence, Map is well tuned to detect clique-like communities but can lead to undesirable overpartitioning when communities are far from clique-like. We show that a signature of this behavior is a large compression gap: the Map description length is far from its ideal limit. To address this issue, we propose a simple dynamic approach that introduces time explicitly into the Map coding through the analysis of the weighted adjacency matrix of the time-dependent multistep transition matrix of the Markov process. The resulting Markov time swee** induces a dynamical zooming across scales that can reveal (potentially multiscale) community structure above the field-of-view limit, with the relevant partitions indicated by a small compression gap.
△ Less
Submitted 22 August, 2012; v1 submitted 29 September, 2011;
originally announced September 2011.
-
Markov dynamics as a zooming lens for multiscale community detection: non clique-like communities and the field-of-view limit
Authors:
Michael T. Schaub,
Jean-Charles Delvenne,
Sophia N. Yaliraki,
Mauricio Barahona
Abstract:
In recent years, there has been a surge of interest in community detection algorithms for complex networks. A variety of computational heuristics, some with a long history, have been proposed for the identification of communities or, alternatively, of good graph partitions. In most cases, the algorithms maximize a particular objective function, thereby finding the `right' split into communities. A…
▽ More
In recent years, there has been a surge of interest in community detection algorithms for complex networks. A variety of computational heuristics, some with a long history, have been proposed for the identification of communities or, alternatively, of good graph partitions. In most cases, the algorithms maximize a particular objective function, thereby finding the `right' split into communities. Although a thorough comparison of algorithms is still lacking, there has been an effort to design benchmarks, i.e., random graph models with known community structure against which algorithms can be evaluated. However, popular community detection methods and benchmarks normally assume an implicit notion of community based on clique-like subgraphs, a form of community structure that is not always characteristic of real networks. Specifically, networks that emerge from geometric constraints can have natural non clique-like substructures with large effective diameters, which can be interpreted as long-range communities. In this work, we show that long-range communities escape detection by popular methods, which are blinded by a restricted `field-of-view' limit, an intrinsic upper scale on the communities they can detect. The field-of-view limit means that long-range communities tend to be overpartitioned. We show how by adopting a dynamical perspective towards community detection (Delvenne et al. (2010) PNAS:107: 12755-12760; Lambiotte et al. (2008) arXiv:0812.1770), in which the evolution of a Markov process on the graph is used as a zooming lens over the structure of the network at all scales, one can detect both clique- or non clique-like communities without imposing an upper scale to the detection. Consequently, the performance of algorithms on inherently low-diameter, clique-like benchmarks may not always be indicative of equally good results in real networks with local, sparser connectivity.
△ Less
Submitted 17 January, 2012; v1 submitted 26 September, 2011;
originally announced September 2011.
-
Role-similarity based comparison of directed networks
Authors:
Kathryn Cooper,
Mauricio Barahona
Abstract:
The widespread relevance of complex networks is a valuable tool in the analysis of a broad range of systems. There is a demand for tools which enable the extraction of meaningful information and allow the comparison between different systems. We present a novel measure of similarity between nodes in different networks as a generalization of the concept of self-similarity. A similarity matrix is as…
▽ More
The widespread relevance of complex networks is a valuable tool in the analysis of a broad range of systems. There is a demand for tools which enable the extraction of meaningful information and allow the comparison between different systems. We present a novel measure of similarity between nodes in different networks as a generalization of the concept of self-similarity. A similarity matrix is assembled as the distance between feature vectors that contain the in and out paths of all lengths for each node. Hence, nodes operating in a similar flow environment are considered similar regardless of network membership. We demonstrate that this method has the potential to be influential in tasks such as assigning identity or function to uncharacterized nodes. In addition an innovative application of graph partitioning to the raw results extends the concept to the comparison of networks in terms of their underlying role-structure.
△ Less
Submitted 29 March, 2011;
originally announced March 2011.
-
Role-based similarity in directed networks
Authors:
Kathryn Cooper,
Mauricio Barahona
Abstract:
The widespread relevance of increasingly complex networks requires methods to extract meaningful coarse-grained representations of such systems. For undirected graphs, standard community detection methods use criteria largely based on density of connections to provide such representations. We propose a method for grou** nodes in directed networks based on the role of the nodes in the network, un…
▽ More
The widespread relevance of increasingly complex networks requires methods to extract meaningful coarse-grained representations of such systems. For undirected graphs, standard community detection methods use criteria largely based on density of connections to provide such representations. We propose a method for grou** nodes in directed networks based on the role of the nodes in the network, understood in terms of patterns of incoming and outgoing flows. The role grou**s are obtained through the clustering of a similarity matrix, formed by the distances between feature vectors that contain the number of in and out paths of all lengths for each node. Hence nodes operating in a similar flow environment are grouped together although they may not themselves be densely connected. Our method, which includes a scale factor that reveals robust grou**s based on increasingly global structure, provides an alternative criterion to uncover structure in networks where there is an implicit flow transfer in the system. We illustrate its application to a variety of data from ecology, world trade and cellular metabolism.
△ Less
Submitted 13 December, 2010;
originally announced December 2010.
-
Flow graphs: interweaving dynamics and structure
Authors:
R. Lambiotte,
R. Sinatra,
J. -C. Delvenne,
T. S. Evans,
M. Barahona,
V. Latora
Abstract:
The behavior of complex systems is determined not only by the topological organization of their interconnections but also by the dynamical processes taking place among their constituents. A faithful modeling of the dynamics is essential because different dynamical processes may be affected very differently by network topology. A full characterization of such systems thus requires a formalization t…
▽ More
The behavior of complex systems is determined not only by the topological organization of their interconnections but also by the dynamical processes taking place among their constituents. A faithful modeling of the dynamics is essential because different dynamical processes may be affected very differently by network topology. A full characterization of such systems thus requires a formalization that encompasses both aspects simultaneously, rather than relying only on the topological adjacency matrix. To achieve this, we introduce the concept of flow graphs, namely weighted networks where dynamical flows are embedded into the link weights. Flow graphs provide an integrated representation of the structure and dynamics of the system, which can then be analyzed with standard tools from network theory. Conversely, a structural network feature of our choice can also be used as the basis for the construction of a flow graph that will then encompass a dynamics biased by such a feature. We illustrate the ideas by focusing on the mathematical properties of generic linear processes on complex networks that can be represented as biased random walks and also explore their dual consensus dynamics.
△ Less
Submitted 6 December, 2010;
originally announced December 2010.
-
Robustness of Random Graphs Based on Natural Connectivity
Authors:
Jun Wu,
Mauricio Barahona,
Yue** Tan,
Hongzhong Deng
Abstract:
Recently, it has been proposed that the natural connectivity can be used to efficiently characterise the robustness of complex networks. Natural connectivity quantifies the redundancy of alternative routes in a network by evaluating the weighted number of closed walks of all lengths and can be regarded as the average eigenvalue obtained from the graph spectrum. In this article, we explore the natu…
▽ More
Recently, it has been proposed that the natural connectivity can be used to efficiently characterise the robustness of complex networks. Natural connectivity quantifies the redundancy of alternative routes in a network by evaluating the weighted number of closed walks of all lengths and can be regarded as the average eigenvalue obtained from the graph spectrum. In this article, we explore the natural connectivity of random graphs both analytically and numerically and show that it increases linearly with the average degree. By comparing with regular ring lattices and random regular graphs, we show that random graphs are more robust than random regular graphs; however, the relationship between random graphs and regular ring lattices depends on the average degree and graph size. We derive the critical graph size as a function of the average degree, which can be predicted by our analytical results. When the graph size is less than the critical value, random graphs are more robust than regular ring lattices, whereas regular ring lattices are more robust than random graphs when the graph size is greater than the critical value.
△ Less
Submitted 17 September, 2010;
originally announced September 2010.
-
Robustness of Regular Graphs Based on Natural Connectivity
Authors:
Jun Wu,
Mauricio Barahona,
Yue** Tan,
Hongzhong Deng
Abstract:
It has been recently proposed that the natural connectivity can be used to characterize efficiently the robustness of complex networks. The natural connectivity quantifies the redundancy of alternative routes in the network by evaluating the weighted number of closed walks of all lengths and can be seen as an average eigenvalue obtained from the graph spectrum. In this paper, we explore both ana…
▽ More
It has been recently proposed that the natural connectivity can be used to characterize efficiently the robustness of complex networks. The natural connectivity quantifies the redundancy of alternative routes in the network by evaluating the weighted number of closed walks of all lengths and can be seen as an average eigenvalue obtained from the graph spectrum. In this paper, we explore both analytically and numerically the natural connectivity of regular ring lattices and regular random graphs obtained through degree-preserving random rewirings from regular ring lattices. We reformulate the natural connectivity of regular ring lattices in terms of generalized Bessel functions and show that the natural connectivity of regular ring lattices is independent of network size and increases with monotonically. We also show that random regular graphs have lower natural connectivity, and are thus less robust, than regular ring lattices.
△ Less
Submitted 10 December, 2009;
originally announced December 2009.
-
Stability of graph communities across time scales
Authors:
J. -C. Delvenne,
S. N. Yaliraki,
M. Barahona
Abstract:
The complexity of biological, social and engineering networks makes it desirable to find natural partitions into communities that can act as simplified descriptions and provide insight into the structure and function of the overall system. Although community detection methods abound, there is a lack of consensus on how to quantify and rank the quality of partitions. We show here that the quality…
▽ More
The complexity of biological, social and engineering networks makes it desirable to find natural partitions into communities that can act as simplified descriptions and provide insight into the structure and function of the overall system. Although community detection methods abound, there is a lack of consensus on how to quantify and rank the quality of partitions. We show here that the quality of a partition can be measured in terms of its stability, defined in terms of the clustered autocovariance of a Markov process taking place on the graph. Because the stability has an intrinsic dependence on time scales of the graph, it allows us to compare and rank partitions at each time and also to establish the time spans over which partitions are optimal. Hence the Markov time acts effectively as an intrinsic resolution parameter that establishes a hierarchy of increasingly coarser clusterings. Within our framework we can then provide a unifying view of several standard partitioning measures: modularity and normalized cut size can be interpreted as one-step time measures, whereas Fiedler's spectral clustering emerges at long times. We apply our method to characterize the relevance and persistence of partitions over time for constructive and real networks, including hierarchical graphs and social networks. We also obtain reduced descriptions for atomic level protein structures over different time scales.
△ Less
Submitted 11 March, 2009; v1 submitted 9 December, 2008;
originally announced December 2008.
-
Laplacian Dynamics and Multiscale Modular Structure in Networks
Authors:
R. Lambiotte,
J. -C. Delvenne,
M. Barahona
Abstract:
Most methods proposed to uncover communities in complex networks rely on their structural properties. Here we introduce the stability of a network partition, a measure of its quality defined in terms of the statistical properties of a dynamical process taking place on the graph. The time-scale of the process acts as an intrinsic parameter that uncovers community structures at different resolutio…
▽ More
Most methods proposed to uncover communities in complex networks rely on their structural properties. Here we introduce the stability of a network partition, a measure of its quality defined in terms of the statistical properties of a dynamical process taking place on the graph. The time-scale of the process acts as an intrinsic parameter that uncovers community structures at different resolutions. The stability extends and unifies standard notions for community detection: modularity and spectral partitioning can be seen as limiting cases of our dynamic measure. Similarly, recently proposed multi-resolution methods correspond to linearisations of the stability at short times. The connection between community detection and Laplacian dynamics enables us to establish dynamically motivated stability measures linked to distinct null models. We apply our method to find multi-scale partitions for different networks and show that the stability can be computed efficiently for large networks with extended versions of current algorithms.
△ Less
Submitted 9 October, 2009; v1 submitted 9 December, 2008;
originally announced December 2008.