-
Evaluating Dynamic Environment Difficulty for Obstacle Avoidance Benchmarking
Authors:
Moji Shi,
Gang Chen,
Álvaro Serra Gómez,
Siyuan Wu,
Javier Alonso-Mora
Abstract:
Dynamic obstacle avoidance is a popular research topic for autonomous systems, such as micro aerial vehicles and service robots. Accurately evaluating the performance of dynamic obstacle avoidance methods necessitates the establishment of a metric to quantify the environment's difficulty, a crucial aspect that remains unexplored. In this paper, we propose four metrics to measure the difficulty of dynamic environments. These metrics aim to comprehensively capture the influence of obstacles' number, size, velocity, and other factors on the difficulty. We compare the proposed metrics with existing static environment difficulty metrics and validate them through over 1.5 million trials in a customized simulator. This simulator excludes the effects of perception and control errors and supports different motion and gaze planners for obstacle avoidance. The results indicate that the survivability metric outperforms the others and establishes a monotonic relationship with the success rate, with a Spearman's Rank Correlation Coefficient (SRCC) of over 0.9. Specifically, for every planner, lower survivability leads to a lower success rate. This metric not only facilitates fair and comprehensive benchmarking but also provides insights for refining collision avoidance methods, thereby furthering the evolution of autonomous systems in dynamic environments.
Submitted 23 April, 2024;
originally announced April 2024.
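As an illustration of the SRCC mentioned in the abstract above, the following is a minimal sketch of Spearman's rank correlation in plain Python. The data values and the names `metric_values` and `success_rate` are hypothetical and are not taken from the paper; the sketch only shows how a perfectly monotonic relationship yields a coefficient of magnitude 1.

```python
def ranks(values):
    """Return 1-based average ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    std_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical difficulty-metric values and planner success rates (not from the paper).
metric_values = [0.1, 0.3, 0.5, 0.7, 0.9]
success_rate = [0.98, 0.91, 0.80, 0.62, 0.35]
rho = spearman(metric_values, success_rate)
print(round(rho, 6))
```

Because the coefficient depends only on ranks, any strictly monotonic relationship gives a magnitude of 1 regardless of its shape, which is why it suits a difficulty metric whose scale is arbitrary.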
-
ASR advancements for indigenous languages: Quechua, Guarani, Bribri, Kotiria, and Wa'ikhana
Authors:
Monica Romero,
Sandra Gomez,
Iván G. Torre
Abstract:
Indigenous languages are a fundamental legacy in the development of human communication, embodying the unique identity and culture of local communities of America. The Second AmericasNLP Competition Track 1 of NeurIPS 2022 proposed developing automatic speech recognition (ASR) systems for five indigenous languages: Quechua, Guarani, Bribri, Kotiria, and Wa'ikhana. In this paper, we propose a reliable ASR model for each target language by crawling speech corpora spanning diverse sources and applying data augmentation methods that resulted in the winning approach in this competition. To achieve this, we systematically investigated the impact of different hyperparameters by a Bayesian search on the performance of the language models, specifically focusing on the variants of the Wav2vec2.0 XLS-R model: 300M and 1B parameters. Moreover, we performed a global sensitivity analysis to assess the contribution of various hyperparametric configurations to the performances of our best models. Importantly, our results show that freeze fine-tuning updates and dropout rate are more vital parameters than the total number of epochs or the learning rate (lr). Additionally, we release our best models -- with no other ASR model reported until now for two of the languages, Wa'ikhana and Kotiria -- and the many experiments performed, to pave the way for other researchers to continue improving ASR in minority languages. This insight opens up interesting avenues for future work, allowing for the advancement of ASR techniques in the preservation of minority indigenous languages and acknowledging the complexities involved in this important endeavour.
Submitted 12 April, 2024;
originally announced April 2024.
-
Utilizing Low-Dimensional Molecular Embeddings for Rapid Chemical Similarity Search
Authors:
Kathryn E. Kirchoff,
James Wellnitz,
Joshua E. Hochuli,
Travis Maxfield,
Konstantin I. Popov,
Shawn Gomez,
Alexander Tropsha
Abstract:
Nearest neighbor-based similarity searching is a common task in chemistry, with notable use cases in drug discovery. Yet, some of the most commonly used approaches for this task still leverage a brute-force approach. In practice this can be computationally costly and overly time-consuming, due in part to the sheer size of modern chemical databases. Previous computational advancements for this task have generally relied on improvements to hardware or dataset-specific tricks that lack generalizability. Approaches that leverage lower-complexity searching algorithms remain relatively underexplored. However, many of these algorithms are approximate solutions and/or struggle with typical high-dimensional chemical embeddings. Here we evaluate whether a combination of low-dimensional chemical embeddings and a k-d tree data structure can achieve fast nearest neighbor queries while maintaining performance on standard chemical similarity search benchmarks. We examine different dimensionality reductions of standard chemical embeddings as well as a learned, structurally-aware embedding -- SmallSA -- for this task. With this framework, searches on over one billion chemicals execute in less than a second on a single CPU core, five orders of magnitude faster than the brute-force approach. We also demonstrate that SmallSA achieves competitive performance on chemical similarity benchmarks.
Submitted 12 February, 2024;
originally announced February 2024.
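To illustrate the idea in the abstract above of pairing low-dimensional embeddings with a k-d tree, here is a minimal sketch of an exact nearest-neighbor query in plain Python. The three-dimensional `embeddings` and the `query` are made-up values, not SmallSA outputs, and the functions `build` and `nearest` are hypothetical helpers written for this example rather than the authors' implementation.

```python
import math


def build(points, depth=0):
    """Build a k-d tree as nested tuples: (point, axis, left, right)."""
    if not points:
        return None
    axis = depth % len(points[0])  # cycle through the coordinates
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2  # the median point splits the space
    return (
        points[mid],
        axis,
        build(points[:mid], depth + 1),
        build(points[mid + 1:], depth + 1),
    )


def nearest(node, query, best=None):
    """Return (distance, point) of the exact nearest neighbor of query."""
    if node is None:
        return best
    point, axis, left, right = node
    distance = math.dist(point, query)
    if best is None or distance < best[0]:
        best = (distance, point)
    diff = query[axis] - point[axis]
    near, far = (left, right) if diff < 0 else (right, left)
    best = nearest(near, query, best)
    # Visit the far side only if the splitting plane is closer than the best hit.
    if abs(diff) < best[0]:
        best = nearest(far, query, best)
    return best


# Hypothetical low-dimensional embeddings (illustrative values only).
embeddings = [
    (0.1, 0.2, 0.9),
    (0.8, 0.1, 0.3),
    (0.4, 0.4, 0.4),
    (0.9, 0.9, 0.1),
    (0.2, 0.7, 0.6),
]
tree = build(embeddings)
query = (0.35, 0.45, 0.5)
dist, match = nearest(tree, query)

# Brute-force scan over every point, for comparison.
brute = min(embeddings, key=lambda p: math.dist(p, query))
print(match, match == brute)
```

The pruning test on the splitting plane is what lets the tree skip most points in low dimensions; in high dimensions that test rarely succeeds, which is one reason the abstract stresses low-dimensional embeddings.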
-
Lessons Learned: Reproducibility, Replicability, and When to Stop
Authors:
Milton S. Gomez,
Tom Beucler
Abstract:
While extensive guidance exists for ensuring the reproducibility of one's own study, there is little discussion regarding the reproduction and replication of external studies within one's own research. To initiate this discussion, drawing lessons from our experience reproducing an operational product for predicting tropical cyclogenesis, we present a two-dimensional framework to offer guidance on reproduction and replication. Our framework, representing model fitting on one axis and its use in inference on the other, builds upon three key aspects: the dataset, the metrics, and the model itself. By assessing the trajectories of our studies on this 2D plane, we can better inform the claims made using our research. Additionally, we use this framework to contextualize the utility of benchmark datasets in the atmospheric sciences. Our two-dimensional framework provides a tool for researchers, especially early career researchers, to incorporate prior work in their own research and to inform the claims they can make in this context.
Submitted 9 January, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
SALSA: Semantically-Aware Latent Space Autoencoder
Authors:
Kathryn E. Kirchoff,
Travis Maxfield,
Alexander Tropsha,
Shawn M. Gomez
Abstract:
In deep learning for drug discovery, chemical data are often represented as simplified molecular-input line-entry system (SMILES) sequences which allow for straightforward implementation of natural language processing methodologies, one being the sequence-to-sequence autoencoder. However, we observe that training an autoencoder solely on SMILES is insufficient to learn molecular representations that are semantically meaningful, where semantics are defined by the structural (graph-to-graph) similarities between molecules. We demonstrate by example that autoencoders may map structurally similar molecules to distant codes, resulting in an incoherent latent space that does not respect the structural similarities between molecules. To address this shortcoming we propose Semantically-Aware Latent Space Autoencoder (SALSA), a transformer-autoencoder modified with a contrastive task, tailored specifically to learn graph-to-graph similarity between molecules. Formally, the contrastive objective is to map structurally similar molecules (separated by a single graph edit) to nearby codes in the latent space. To accomplish this, we generate a novel dataset comprised of sets of structurally similar molecules and opt for a supervised contrastive loss that is able to incorporate full sets of positive samples. We compare SALSA to its ablated counterparts, and show empirically that the composed training objective (reconstruction and contrastive task) leads to a higher quality latent space that is more 1) structurally-aware, 2) semantically continuous, and 3) property-aware.
Submitted 4 October, 2023;
originally announced October 2023.
-
APIS: A paired CT-MRI dataset for ischemic stroke segmentation challenge
Authors:
Santiago Gómez,
Daniel Mantilla,
Gustavo Garzón,
Edgar Rangel,
Andrés Ortiz,
Franklin Sierra-Jerez,
Fabio Martínez
Abstract:
Stroke is the second leading cause of mortality worldwide. Immediate attention and diagnosis play a crucial role regarding patient prognosis. The key to diagnosis consists in localizing and delineating brain lesions. Standard stroke examination protocols include the initial evaluation from a non-contrast CT scan to discriminate between hemorrhage and ischemia. However, non-contrast CTs may lack sensitivity in detecting subtle ischemic changes in the acute phase. As a result, complementary diffusion-weighted MRI studies are captured to provide valuable insights, allowing to recover and quantify stroke lesions. This work introduced APIS, the first paired public dataset with NCCT and ADC studies of acute ischemic stroke patients. APIS was presented as a challenge at the 20th IEEE International Symposium on Biomedical Imaging 2023, where researchers were invited to propose new computational strategies that leverage paired data and deal with lesion segmentation over CT sequences. Despite all the teams employing specialized deep learning tools, the results suggest that the ischemic stroke segmentation task from NCCT remains challenging. The annotated dataset remains accessible to the public upon registration, inviting the scientific community to deal with stroke characterization from NCCT but guided with paired DWI information.
Submitted 26 September, 2023;
originally announced September 2023.
-
mdendro: An R package for extended agglomerative hierarchical clustering
Authors:
Alberto Fernández,
Sergio Gómez
Abstract:
"mdendro" is an R package that provides a comprehensive collection of linkage methods for agglomerative hierarchical clustering on a matrix of proximity data (distances or similarities), returning a multifurcated dendrogram or multidendrogram. Multidendrograms can group more than two clusters at the same time, solving the nonuniqueness problem that arises when there are ties in the data. Because of this problem, different binary dendrograms are possible depending both on the order of the input data and on the criterion used to break ties. Weighted and unweighted versions of the most common linkage methods are included in the package, which also implements two parametric linkage methods. In addition, the "mdendro" package provides five descriptive measures to analyze the resulting dendrograms: cophenetic correlation coefficient, space distortion ratio, agglomeration coefficient, chaining coefficient and tree balance.
Submitted 1 June, 2024; v1 submitted 23 September, 2023;
originally announced September 2023.
-
Pattern formation and bifurcation analysis of delay induced fractional-order epidemic spreading on networks
Authors:
Jiaying Zhou,
Yong Ye,
Alex Arenas,
Sergio Gómez,
Yi Zhao
Abstract:
The spontaneous emergence of ordered structures, known as Turing patterns, in complex networks is a phenomenon that holds potential applications across diverse scientific fields, including biology, chemistry, and physics. Here, we present a novel delayed fractional-order susceptible-infected-recovered-susceptible (SIRS) reaction-diffusion model functioning on a network, which is typically used to simulate disease transmission but can also model rumor propagation in social contexts. Our theoretical analysis establishes the Turing instability resulting from delay, and we support our conclusions through numerical experiments. We identify the unique impacts of delay, average network degree, and diffusion rate on pattern formation. The primary outcomes of our study are: (i) Delays cause system instability, mainly evidenced by periodic temporal fluctuations; (ii) The average network degree produces periodic oscillatory states in uneven spatial distributions; (iii) The combined influence of diffusion rate and delay results in irregular oscillations in both time and space. However, we also find that fractional-order can suppress the formation of spatiotemporal patterns. These findings are crucial for comprehending the impact of network structure on the dynamics of fractional-order systems.
Submitted 5 July, 2023;
originally announced July 2023.
-
Selecting Robust Features for Machine Learning Applications using Multidata Causal Discovery
Authors:
Saranya Ganesh S.,
Tom Beucler,
Frederick Iat-Hin Tam,
Milton S. Gomez,
Jakob Runge,
Andreas Gerhardus
Abstract:
Robust feature selection is vital for creating reliable and interpretable Machine Learning (ML) models. When designing statistical prediction models in cases where domain knowledge is limited and underlying interactions are unknown, choosing the optimal set of features is often difficult. To mitigate this issue, we introduce a Multidata (M) causal feature selection approach that simultaneously processes an ensemble of time series datasets and produces a single set of causal drivers. This approach uses the causal discovery algorithms PC1 or PCMCI that are implemented in the Tigramite Python package. These algorithms utilize conditional independence tests to infer parts of the causal graph. Our causal feature selection approach filters out causally-spurious links before passing the remaining causal features as inputs to ML models (Multiple linear regression, Random Forest) that predict the targets. We apply our framework to the statistical intensity prediction of Western Pacific Tropical Cyclones (TC), for which it is often difficult to accurately choose drivers and their dimensionality reduction (time lags, vertical levels, and area-averaging). Using more stringent significance thresholds in the conditional independence tests helps eliminate spurious causal relationships, thus helping the ML model generalize better to unseen TC cases. M-PC1 with a reduced number of features outperforms M-PCMCI, non-causal ML, and other feature selection methods (lagged correlation, random), even slightly outperforming feature selection based on eXplainable Artificial Intelligence. The optimal causal drivers obtained from our causal feature selection help improve our understanding of underlying relationships and suggest new potential drivers of TC intensification.
Submitted 30 June, 2023; v1 submitted 11 April, 2023;
originally announced April 2023.
-
Spreading dynamics in networks under context-dependent behavior
Authors:
Giulio Burgio,
Sergio Gómez,
Alex Arenas
Abstract:
In some systems, the behavior of the constituent units can create a 'context' that modifies the direct interactions among them. This mechanism of indirect modification inspired us to develop a minimal model of context-dependent spreading. In our model, agents actively impede (favor) or not diffusion during an interaction, depending on the behavior they observe among all the peers in the group within which that interaction occurs. We divide the population into two behavioral types and provide a mean-field theory to parametrize mixing patterns of arbitrary type-assortativity within groups of any size. As an application, we examine an epidemic spreading model with context-dependent adoption of prophylactic tools such as face-masks. By analyzing the distributions of groups' size and type-composition, we uncover a rich phenomenology for the basic reproduction number and the endemic state. We analytically show how changing the group organization of contacts can either facilitate or hinder epidemic spreading, eventually moving the system from the subcritical to the supercritical phase and vice versa, depending mainly on sociological factors, such as whether the prophylactic behavior is hardly or easily induced. More generally, our work provides a theoretical foundation to model higher-order contexts and analyze their dynamical implications, envisioning a broad theory of context-dependent interactions that would allow for a new systematic investigation of a variety of complex systems.
Submitted 11 June, 2023; v1 submitted 1 November, 2022;
originally announced November 2022.
-
Bifurcation analysis of the Microscopic Markov Chain Approach to contact-based epidemic spreading in networks
Authors:
Alex Arenas,
Antonio Garijo,
Sergio Gómez,
Jordi Villadelprat
Abstract:
The dynamics of many epidemic compartmental models for infectious diseases that spread in a single host population present a second-order phase transition. This transition occurs as a function of the infectivity parameter, from the absence of infected individuals to an endemic state. Here, we study this transition, from the perspective of dynamical systems, for a discrete-time compartmental epidemic model known as the Microscopic Markov Chain Approach, which proved very useful for forecasting scenarios of epidemic spreading during the COVID-19 pandemic. We show that there is an endemic state which is stable and a global attractor and that its existence is a consequence of a transcritical bifurcation. This mathematical analysis grounds the results of the model in practical applications.
Submitted 31 October, 2022;
originally announced October 2022.
-
A deep learning approach to halo merger tree construction
Authors:
Sandra Robles,
Jonathan S. Gómez,
Adín Ramírez Rivera,
Nelson D. Padilla,
Diego Dujovne
Abstract:
A key ingredient for semi-analytic models (SAMs) of galaxy formation is the mass assembly history of haloes, encoded in a tree structure. The most commonly used method to construct halo merger histories is based on the outcomes of high-resolution, computationally intensive N-body simulations. We show that machine learning (ML) techniques, in particular Generative Adversarial Networks (GANs), are a promising new tool to tackle this problem with a modest computational cost and retaining the best features of merger trees from simulations. We train our GAN model with a limited sample of merger trees from the Evolution and Assembly of GaLaxies and their Environments (EAGLE) simulation suite, constructed using two halo finders-tree builder algorithms: SUBFIND-D-TREES and ROCKSTAR-ConsistentTrees. Our GAN model successfully learns to generate well-constructed merger tree structures with high temporal resolution, and to reproduce the statistical features of the sample of merger trees used for training, when considering up to three variables in the training process. These inputs, whose representations are also learned by our GAN model, are mass of the halo progenitors and the final descendant, progenitor type (main halo or satellite) and distance of a progenitor to that in the main branch. The inclusion of the latter two inputs greatly improves the final learned representation of the halo mass growth history, especially for SUBFIND-like ML trees. When comparing equally sized samples of ML merger trees with those of the EAGLE simulation, we find better agreement for SUBFIND-like ML trees. Finally, our GAN-based framework can be utilised to construct merger histories of low- and intermediate-mass haloes, the most abundant in cosmological simulations.
Submitted 27 June, 2022; v1 submitted 31 May, 2022;
originally announced May 2022.
-
Diffusion and synchronization dynamics reveal the multi-scale patterns of spatial segregation
Authors:
Aleix Bassolas,
Sergio Gómez,
Alex Arenas
Abstract:
Urban systems are characterized by populations with heterogeneous characteristics, and whose spatial distribution is crucial to understand inequalities in life expectancy or education level. Traditional studies on spatial segregation indicators focus often on first-neighbour correlations but fail to capture complex multi-scale patterns. In this work, we aim at characterizing the spatial distribution heterogeneity of socioeconomic features through diffusion and synchronization dynamics. In particular, we use the time needed to reach the synchronization as a proxy for the spatial heterogeneity of a socioeconomic feature, as for example, the income. Our analysis for 16 income categories in cities from the United States reveals that the spatial distribution of the most deprived and affluent citizens leads to higher diffusion and synchronization times. By measuring the time needed for a neighborhood to reach the global phase we are able to detect those that suffer from a steeper segregation. Overall, the present manuscript exemplifies how diffusion and synchronization dynamics can be used to assess the heterogeneity in the presence of node information.
Submitted 18 May, 2022;
originally announced May 2022.
-
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Authors:
Chris Cummins,
Bram Wasti,
Jiadong Guo,
Brandon Cui,
Jason Ansel,
Sahir Gomez,
Somya Jain,
Jia Liu,
Olivier Teytaud,
Benoit Steiner,
Yuandong Tian,
Hugh Leather
Abstract:
Interest in applying Artificial Intelligence (AI) techniques to compiler optimizations is increasing rapidly, but compiler research has a high entry barrier. Unlike in other domains, compiler and AI researchers do not have access to the datasets and frameworks that enable fast iteration and development of ideas, and getting started requires a significant engineering investment. What is needed is an easy, reusable experimental infrastructure for real world compiler optimization tasks that can serve as a common benchmark for comparing techniques, and as a platform to accelerate progress in the field.
We introduce CompilerGym, a set of environments for real world compiler optimization tasks, and a toolkit for exposing new optimization tasks to compiler researchers. CompilerGym enables anyone to experiment on production compiler optimization problems through an easy-to-use package, regardless of their experience with compilers. We build upon the popular OpenAI Gym interface enabling researchers to interact with compilers using Python and a familiar API.
We describe the CompilerGym architecture and implementation, characterize the optimization spaces and computational efficiencies of three included compiler environments, and provide extensive empirical evaluations. Compared to prior works, CompilerGym offers larger datasets and optimization spaces, is 27x more computationally efficient, is fault-tolerant, and capable of detecting reproducibility bugs in the underlying compilers.
In making it easy for anyone to experiment with compilers - irrespective of their background - we aim to accelerate progress in the AI and compiler research domains.
Submitted 22 December, 2021; v1 submitted 16 September, 2021;
originally announced September 2021.
-
Beyond Expertise and Roles: A Framework to Characterize the Stakeholders of Interpretable Machine Learning and their Needs
Authors:
Harini Suresh,
Steven R. Gomez,
Kevin K. Nam,
Arvind Satyanarayan
Abstract:
To ensure accountability and mitigate harm, it is critical that diverse stakeholders can interrogate black-box automated systems and find information that is understandable, relevant, and useful to them. In this paper, we eschew prior expertise- and role-based categorizations of interpretability stakeholders in favor of a more granular framework that decouples stakeholders' knowledge from their interpretability needs. We characterize stakeholders by their formal, instrumental, and personal knowledge and how it manifests in the contexts of machine learning, the data domain, and the general milieu. We additionally distill a hierarchical typology of stakeholder needs that distinguishes higher-level domain goals from lower-level interpretability tasks. In assessing the descriptive, evaluative, and generative powers of our framework, we find our more nuanced treatment of stakeholders reveals gaps and opportunities in the interpretability literature, adds precision to the design and comparison of user studies, and facilitates a more reflexive approach to conducting this research.
Submitted 24 January, 2021;
originally announced January 2021.
-
Network clique cover approximation to analyze complex contagions through group interactions
Authors:
Giulio Burgio,
Alex Arenas,
Sergio Gómez,
Joan T. Matamalas
Abstract:
Contagion processes have been proven to fundamentally depend on the structural properties of the interaction networks conveying them. Many real networked systems are characterized by clustered substructures representing either collections of all-to-all pair-wise interactions (cliques) and/or group interactions, involving many of their members at once. In this work, focusing on interaction structures represented as simplicial complexes, we present a discrete-time microscopic model of complex contagion for a susceptible-infected-susceptible dynamics. Introducing a particular edge clique cover and a heuristic to find it, the model accounts for the higher-order dynamical correlations among the members of the substructures (cliques/simplices). The analytical computation of the critical point reveals that higher-order correlations are responsible for its dependence on the higher-order couplings. While such dependence eludes any mean-field model, the possibility of a bi-stable region is extended to structured populations.
Submitted 16 May, 2021; v1 submitted 10 January, 2021;
originally announced January 2021.
-
Evolution of Cooperation in the Presence of Higher-Order Interactions: from Networks to Hypergraphs
Authors:
Giulio Burgio,
Joan T. Matamalas,
Sergio Gómez,
Alex Arenas
Abstract:
Many real systems are strongly characterized by collective cooperative phenomena whose existence and properties still need a satisfactory explanation. Consistent with their collective nature, they call for new and more accurate descriptions going beyond pairwise models, such as graphs, in which all the interactions are considered as involving only two individuals at a time. Hypergraphs respond to this need, providing a mathematical representation of a system that allows interactions from pairs to larger groups. In this work, through the use of different hypergraphs, we study how group interactions influence the evolution of cooperation in a structured population, by analyzing the evolutionary dynamics of the public goods game. Here we show that, like network reciprocity, group interactions also promote cooperation. More importantly, by means of an invasion analysis in which the conditions for a strategy to survive are studied, we show how, in heterogeneously-structured populations, reciprocity among players is expected to grow as the order of the interactions increases. This is due to the heterogeneity of connections and, particularly, to the presence of individuals standing out as hubs in the population. Our analysis represents a first step towards the study of evolutionary dynamics through higher-order interactions, and gives insights into why cooperation in heterogeneous higher-order structures is enhanced. Lastly, it also gives clues about the co-existence of cooperative and non-cooperative behaviors related to the structural properties of the interaction patterns.
Submitted 6 June, 2020;
originally announced June 2020.
-
Multiple abrupt phase transitions in urban transport congestion
Authors:
Aniello Lampo,
Javier Borge-Holthoefer,
Sergio Gómez,
Albert Solé-Ribalta
Abstract:
During the last decades, the study of cities has been transformed by new approaches combining engineering and complexity sciences. Network theory is playing a central role, facilitating the quantitative analysis of crucial urban dynamics, such as mobility, city growth or urban planning. In this work, we focus on the spatial aspects of congestion. Analyzing a large number of real city networks, we show that the location of the onset of congestion changes according to the considered urban area, defining, in turn, a set of congestion regimes separated by abrupt transitions. To help unveil these spatial dependencies of congestion (in terms of network betweenness analysis), we introduce a family of planar road network models composed of a dense urban center connected to an arboreal periphery. These models, named GT and DT-MST models, allow us to analytically, numerically and experimentally describe how and why congestion emerges in particular geographical areas of monocentric cities and, subsequently, to describe the congestion regimes and the factors that promote the appearance of their abrupt transitions. We show that the fundamental ingredient behind the observed abrupt transitions is the spatial separation between the urban center and the periphery, and the number of separate areas that form the periphery. Elaborating on the implications of our results, we show that they may have an influence on the design and optimization of road networks regarding urban growth and the management of daily traffic dynamics.
Submitted 11 January, 2021; v1 submitted 26 May, 2020;
originally announced May 2020.
-
Committee Draft of JPEG XL Image Coding System
Authors:
Alexander Rhatushnyak,
Jan Wassenberg,
Jon Sneyers,
Jyrki Alakuijala,
Lode Vandevenne,
Luca Versari,
Robert Obryk,
Zoltan Szabadka,
Evgenii Kliuchnikov,
Iulia-Maria Comsa,
Krzysztof Potempa,
Martin Bruse,
Moritz Firsching,
Renata Khasanova,
Ruud van Asseldonk,
Sami Boukortt,
Sebastian Gomez,
Thomas Fischbacher
Abstract:
JPEG XL is a practical approach focused on scalable web distribution and efficient compression of high-quality images. It provides various benefits compared to existing image formats: 60% size reduction at equivalent subjective quality; fast, parallelizable decoding and encoding configurations; features such as progressive, lossless, animation, and reversible transcoding of existing JPEG with 22% size reduction; support for high-quality applications including wide gamut, higher resolution/bit depth/dynamic range, and visually lossless coding. The JPEG XL architecture is traditional block-transform coding with upgrades to each component.
Submitted 13 August, 2019; v1 submitted 12 August, 2019;
originally announced August 2019.
-
Simplicial degree in complex networks. Applications of Topological Data Analysis to Network Science
Authors:
Daniel Hernández Serrano,
Juan Hernández Serrano,
Darío Sánchez Gómez
Abstract:
Network Science provides a universal formalism for modelling and studying complex systems based on pairwise interactions between agents. However, many real networks in the social, biological or computer sciences involve interactions among more than two agents, having thus an inherent structure of a simplicial complex. We propose new notions of higher-order degrees of adjacency for simplices in a simplicial complex, allowing any dimensional comparison among them and their faces, which, as far as we know, was lacking in the literature. We introduce multi-parameter boundary and coboundary operators in an oriented simplicial complex and define a novel multi-combinatorial Laplacian, which generalises the graph and combinatorial Laplacian. To illustrate the potential applications of these theoretical results, we perform a structural analysis of higher-order connectivity in simplicial-complex networks by studying the distributions associated with these simplicial degrees in 17 real-world datasets coming from different domains such as coauthor networks, cosponsoring Congress bills, contacts in schools, drug abuse warning networks, e-mail networks or publications and users in online forums. We find rich and diverse higher-order connectivity structures and observe that datasets of the same type reflect similar higher-order collaboration patterns. Furthermore, we show that if we use what we have called the maximal simplicial degree (which counts the distinct maximal communities in which our simplex and all its strict sub-communities are contained), then its degree distribution is, in general, surprisingly different from the classical node degree distribution.
Submitted 14 April, 2020; v1 submitted 2 August, 2019;
originally announced August 2019.
-
A Halo Merger Tree Generation and Evaluation Framework
Authors:
Sandra Robles,
Jonathan S. Gómez,
Adín Ramírez Rivera,
Jenny A. González,
Nelson D. Padilla,
Diego Dujovne
Abstract:
Semi-analytic models are best suited to compare galaxy formation and evolution theories with observations. These models rely heavily on halo merger trees, and their realistic features (i.e., no drastic changes on halo mass or jumps on physical locations). Our aim is to provide a new framework for halo merger tree generation that takes advantage of the results of large volume simulations, with a modest computational cost. We treat halo merger tree construction as a matrix generation problem, and propose a Generative Adversarial Network that learns to generate realistic halo merger trees. We evaluate our proposal on merger trees from the EAGLE simulation suite, and show the quality of the generated trees.
Submitted 21 June, 2019;
originally announced June 2019.
-
Versatile linkage: a family of space-conserving strategies for agglomerative hierarchical clustering
Authors:
Alberto Fernández,
Sergio Gómez
Abstract:
Agglomerative hierarchical clustering can be implemented with several strategies that differ in the way elements of a collection are grouped together to build a hierarchy of clusters. Here we introduce versatile linkage, a new infinite system of agglomerative hierarchical clustering strategies based on generalized means, which go from single linkage to complete linkage, passing through arithmetic average linkage and other clustering methods yet unexplored such as geometric linkage and harmonic linkage. We compare the different clustering strategies in terms of cophenetic correlation, mean absolute error, and also tree balance and space distortion, two new measures proposed to describe hierarchical trees. Unlike the $β$-flexible clustering system, we show that the versatile linkage family is space-conserving.
Submitted 21 June, 2019;
originally announced June 2019.
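Illustrative note on the entry above: the abstract says the linkage family is built on generalized means that run from single linkage to complete linkage. The sketch below shows only that general idea, not the authors' algorithm. It assumes the inter-cluster distance is an unweighted power mean of all pairwise distances between two clusters; the names `power_mean` and `linkage_distance` and the exponent `p` are hypothetical and do not come from the paper.

```python
import math


def power_mean(values, p):
    """Generalized (power) mean of strictly positive values with exponent p."""
    values = list(values)
    if p == float("-inf"):
        return min(values)  # limiting case: single linkage
    if p == float("inf"):
        return max(values)  # limiting case: complete linkage
    if p == 0:
        # The limit p -> 0 is the geometric mean.
        return math.exp(sum(math.log(v) for v in values) / len(values))
    return (sum(v ** p for v in values) / len(values)) ** (1.0 / p)


def linkage_distance(cluster_a, cluster_b, dist, p):
    """Assumed inter-cluster distance: power mean of all pairwise distances."""
    return power_mean((dist(a, b) for a in cluster_a for b in cluster_b), p)
```

Under this assumption, `p = 1` gives arithmetic average linkage, `p = 0` the geometric mean, and `p = -1` the harmonic mean, matching the named cases in the abstract.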
-
Hermite-Gaussian model for quantum states
Authors:
Marcelo Losada,
Ignacio S. Gomez,
Federico Holik
Abstract:
In order to characterize quantum states within the context of information geometry, we propose a generalization of the Gaussian model, which we call the Hermite-Gaussian model. We obtain the Fisher-Rao metric and the scalar curvature for this model, and we show its relation with the one-dimensional quantum harmonic oscillator. Moreover, using this model we characterize some families of states of the quantum harmonic oscillator. We find that for the eigenstates of the Hamiltonian, mixtures of eigenstates, and even or odd superpositions of eigenstates, the associated Fisher-Rao metrics are diagonal.
Submitted 3 November, 2018;
originally announced November 2018.
-
Effect of shortest path multiplicity on congestion of multiplex networks
Authors:
Albert Solé-Ribalta,
Alex Arenas,
Sergio Gómez
Abstract:
Shortest paths are representative of discrete geodesic distances in graphs, and many descriptors of networks depend on their counting. In multiplex networks, this counting is radically important to quantify the switch between layers and it has crucial implications in the transportation efficiency and congestion processes. Here we present a mathematical approach to the computation of the joint distribution of distance and multiplicity (degeneration) of shortest paths in multiplex networks, and exploit its relation to congestion processes. The results allow us to approximate semi-analytically the onset of congestion in multiplex networks as a function of the congestion of its layers.
Submitted 5 February, 2019; v1 submitted 30 October, 2018;
originally announced October 2018.
-
Neonatal EEG Interpretation and Decision Support Framework for Mobile Platforms
Authors:
Mark O'Sullivan,
Sergi Gomez,
Alison O'Shea,
Eduard Salgado,
Kevin Huillca,
Sean Mathieson,
Geraldine Boylan,
Emanuel Popovici,
Andriy Temko
Abstract:
This paper proposes and implements an intuitive and pervasive solution for neonatal EEG monitoring assisted by sonification and deep learning AI that provides information about neonatal brain health to all neonatal healthcare professionals, particularly those without EEG interpretation expertise. The system aims to increase the demographic of clinicians capable of diagnosing abnormalities in neonatal EEG. The proposed system uses a low-cost and low-power EEG acquisition system. An Android app provides single-channel EEG visualization, traffic-light indication of the presence of neonatal seizures provided by a trained, deep convolutional neural network and an algorithm for EEG sonification, designed to facilitate the perception of changes in EEG morphology specific to neonatal seizures. The multifaceted EEG interpretation framework is presented and the implemented mobile platform architecture is analyzed with respect to its power consumption and accuracy.
Submitted 8 June, 2018;
originally announced June 2018.
-
On sound-based interpretation of neonatal EEG
Authors:
Sergi Gomez,
Mark O'Sullivan,
Emanuel Popovici,
Sean Mathieson,
Geraldine Boylan,
Andriy Temko
Abstract:
Significant training is required to visually interpret neonatal EEG signals. This study explores alternative sound-based methods for EEG interpretation which are designed to allow for intuitive and quick differentiation between healthy background activity and abnormal activity such as seizures. A novel method based on frequency and amplitude modulation (FM/AM) is presented. The algorithm is tuned to facilitate the audio domain perception of rhythmic activity which is specific to neonatal seizures. The method is compared with the previously developed phase vocoder algorithm for different time compressing factors. A survey is conducted amongst a cohort of non-EEG experts to quantitatively and qualitatively examine the performance of sound-based methods in comparison with the visual interpretation. It is shown that both sonification methods perform similarly well, with a smaller inter-observer variability in comparison with visual. A post-survey analysis of results is performed by examining the sensitivity of the ear to frequency evolution in audio.
Submitted 8 June, 2018;
originally announced June 2018.
-
Impact of origin-destination information in epidemic spreading
Authors:
Sergio Gómez,
Alberto Fernández,
Sandro Meloni,
Alex Arenas
Abstract:
The networked structure of contacts shapes the spreading of epidemic processes. Recent advances in network theory have improved our understanding of epidemic processes at large scale. The relevance of several considerations still needs to be evaluated in the study of epidemic spreading. One of them is that of accounting for the influence of origin and destination patterns in the flow of the carriers of an epidemic. Here we compute origin-destination patterns compatible with empirical data of coarse-grained flows in the air transportation network. We study the incidence of epidemic processes in a metapopulation approach considering different alternatives to prior knowledge of the flows. The data-driven scenario where the estimation of origin and destination flows is considered turns out to be relevant to assess the impact of the epidemics at a microscopic level (in our scenario, which populations are infected). However, this information is irrelevant to assess its macroscopic incidence (fraction of infected populations). These results are of interest to implement even better computational platforms to forecast epidemic incidence.
Submitted 24 December, 2018; v1 submitted 7 April, 2018;
originally announced April 2018.
-
Effective approach to epidemic containment using link equations in complex networks
Authors:
Joan T. Matamalas,
Alex Arenas,
Sergio Gómez
Abstract:
Epidemic containment is a major concern when confronting large-scale infections in complex networks. Many works have been devoted to analytically understand how to restructure the network to minimize the impact of major outbreaks of infections at large scale. In many cases, the strategies consist in the isolation of certain nodes, while less attention has been paid to the intervention on links. In epidemic spreading, links inform about the probability of carrying the contagion of the disease from infected to susceptible individuals. Note that these states depend on the full structure of the network, and its determination is not straightforward from the knowledge of nodes' states. Here, we confront this challenge and propose a set of discrete-time governing equations which can be closed and analyzed, assessing the contribution of links to spreading processes in complex networks. Our approach allows a scheme for the containment of epidemics, based on deactivating the most important links in transmitting the disease. The model is validated in synthetic and real networks, obtaining an accurate determination of the epidemic incidence and the critical thresholds. Epidemic containment based on links' deactivation promises to be an effective tool to maintain functionality on networks while controlling the spread of diseases, as for example in air transportation networks.
Submitted 24 December, 2018; v1 submitted 28 November, 2017;
originally announced November 2017.
-
Understanding a Version of Multivariate Symmetric Uncertainty to assist in Feature Selection
Authors:
Gustavo Sosa-Cabrera,
Miguel García-Torres,
Santiago Gómez,
Christian Schaerer,
Federico Divina
Abstract:
In this paper, we analyze the behavior of the multivariate symmetric uncertainty (MSU) measure through the use of statistical simulation techniques under various mixes of informative and non-informative randomly generated features. Experiments show how the number of attributes, their cardinalities, and the sample size affect the MSU. We discovered a condition that preserves good quality in the MSU under different combinations of these three factors, providing a new useful criterion to help drive the process of dimension reduction.
Submitted 25 September, 2017;
originally announced September 2017.
-
Notions of the ergodic hierarchy for curved statistical manifolds
Authors:
Ignacio S. Gomez
Abstract:
We present an extension of the ergodic, mixing, and Bernoulli levels of the ergodic hierarchy for statistical models on curved manifolds, making use of elements of information geometry. This extension focuses on the notion of statistical independence between the microscopic variables of the system. Moreover, we establish an intimate relationship between statistical models and families of probability distributions belonging to the canonical ensemble, which for the case of quadratic Hamiltonian systems provides a closed form for the correlations between the microvariables in terms of the temperature of the heat bath as a power law. From this we obtain an information geometric method for studying Hamiltonian dynamics in the canonical ensemble. We illustrate the results with two examples: a pair of interacting harmonic oscillators presenting phase transitions and the 2x2 Gaussian ensembles. In both examples the scalar curvature turns out to be a global indicator of the dynamics.
Submitted 9 March, 2017;
originally announced March 2017.
-
An empirical study on the impact of IDE tool support in pair and solo programming
Authors:
Omar S. Gómez
Abstract:
Agile software development has been widely adopted. One well-known agile approach is eXtreme Programming (XP) where pair programming (PP) is a relevant practice. Although various aspects of PP have been studied, we have not found, under a traditional model of PP, studies that examine the impact of using an IDE tool support. In an attempt to obtain a better understanding of the impact of using an IDE, we present the results of a controlled experiment that exposes the influence on quality, measured as the number of defects injected per hour, and cost, measured as the time necessary to complete programming assignments, of pair and solo programming with and without the use of an IDE. For quality, our findings suggest that the use of an IDE results in significantly higher defect injection rates (for both pairs and solos) when the programming assignment is not very complicated. Nevertheless, defect injection rates seem to decrease when pairs work on more complicated programming assignments irrespective of the tool support used. For cost, the programming assignment significantly affects the time necessary to complete the assignment. Finally, both aspects (quality and cost) are affected in a similar manner when either pair or solo programming is used.
Submitted 18 July, 2016;
originally announced July 2016.
-
Learning to learn by gradient descent by gradient descent
Authors:
Marcin Andrychowicz,
Misha Denil,
Sergio Gomez,
Matthew W. Hoffman,
David Pfau,
Tom Schaul,
Brendan Shillingford,
Nando de Freitas
Abstract:
The move from hand-designed features to learned features in machine learning has been wildly successful. In spite of this, optimization algorithms are still designed by hand. In this paper we show how the design of an optimization algorithm can be cast as a learning problem, allowing the algorithm to learn to exploit structure in the problems of interest in an automatic way. Our learned algorithms, implemented by LSTMs, outperform generic, hand-designed competitors on the tasks for which they are trained, and also generalize well to new tasks with similar structure. We demonstrate this on a number of tasks, including simple convex problems, training neural networks, and styling images with neural art.
Submitted 30 November, 2016; v1 submitted 14 June, 2016;
originally announced June 2016.
-
Influence of trust in the spreading of information
Authors:
Hongrun Wu,
Alex Arenas,
Sergio Gómez
Abstract:
The understanding and prediction of information diffusion processes on networks is a major challenge in network theory with many implications in social sciences. Many theoretical advances occurred due to stochastic spreading models. Nevertheless, these stochastic models overlooked the influence of rational decisions on the outcome of the process. For instance, different levels of trust in acquaintances do play a role in information spreading, and actors may change their spreading decisions during the information diffusion process accordingly. Here, we study an information-spreading model in which the decision to transmit or not is based on trust. We explore the interplay between the propagation of information and the trust dynamics happening on a two-layer multiplex network. Actors' trustable or untrustable states are defined as accumulated cooperation or defection behaviors, respectively, in a Prisoner's Dilemma set up, and they are controlled by a memory span. The propagation of information is abstracted as a threshold model on the information-spreading layer, where the threshold depends on the trustability of agents. The analysis of the model is performed using a tree approximation and validated on homogeneous and heterogeneous networks. The results show that the memory of previous actions has a significant effect on the spreading of information. For example, the less memory that is considered, the higher is the diffusion. Information is highly promoted by the emergence of trustable acquaintances. These results provide insight into the effect of plausible biases on spreading dynamics in a multilevel networked system.
Submitted 28 December, 2016; v1 submitted 6 June, 2016;
originally announced June 2016.
-
Decongestion of urban areas with hotspot-pricing
Authors:
Albert Solé-Ribalta,
Sergio Gómez,
Alex Arenas
Abstract:
The rapid growth of population in urban areas is jeopardizing mobility and air quality worldwide. One of the most notable problems arising is that of traffic congestion, which in turn affects air pollution. With the advent of technologies able to sense real-time data about cities, and its public distribution for analysis, we are in a position to forecast scenarios valuable to ameliorate and control congestion. Here, we analyze a local congestion pricing scheme, hotspot pricing, that surcharges vehicles traversing congested junctions. The proposed tax is computed from the estimation of the evolution of congestion at local level, and the expected response of users to the tax (elasticity). Results on cities' road networks, considering real-traffic data, show that the proposed hotspot pricing scheme would be more effective than current mechanisms to decongest urban areas, and paves the way towards sustainable congestion in urban areas.
Submitted 26 April, 2018; v1 submitted 26 April, 2016;
originally announced April 2016.
-
Cournot-Nash Equilibria for Bandwidth Allocation under Base-Station Cooperation
Authors:
J S Gomez,
A Vergne,
P Martins,
Laurent Decreusefond,
Wei Chen
Abstract:
-In this paper, a novel resource allocation scheme based on discrete Cournot-Nash equilibria and optimal transport theory is proposed. The originality of this framework lies in the joint optimization of downlink bandwidth allocation and cooperation between base stations. A tractable formalization is given in the form of a quadratic optimization problem. A low complexity approximate solution is der…
▽ More
-In this paper, a novel resource allocation scheme based on discrete Cournot-Nash equilibria and optimal transport theory is proposed. The originality of this framework lies in the joint optimization of downlink bandwidth allocation and cooperation between base stations. A tractable formalization is given in the form of a quadratic optimization problem. A low complexity approximate solution is derived and theoretically characterized. Simulations highlight the existence of an optimal working point, that maximizes user satisfaction ratio and network load. The impact of the network deployment on the optimum is numerically investigated, thanks to the $β$-Ginibre model. Indeed, base stations are assumed to be drawn according to $β$-Ginibre point processes. Numerical analysis shows that the network performance increases with $β$ going to one.
Submitted 6 April, 2016;
originally announced April 2016.
-
Detection of timescales in evolving complex systems
Authors:
Richard K. Darst,
Clara Granell,
Alex Arenas,
Sergio Gómez,
Jari Saramäki,
Santo Fortunato
Abstract:
Most complex systems are intrinsically dynamic in nature. The evolution of a dynamic complex system is typically represented as a sequence of snapshots, where each snapshot describes the configuration of the system at a particular instant of time. Then, one may directly follow how the snapshots evolve in time, or aggregate the snapshots within some time intervals to form representative "slices" of the evolution of the system configuration. This is often done with constant intervals, whose duration is based on arguments on the nature of the system and of its dynamics. A more refined approach would be to consider the rate of activity in the system to perform a separation of timescales. However, an even better alternative would be to define dynamic intervals that match the evolution of the system's configuration. To this end, we propose a method that aims at detecting evolutionary changes in the configuration of a complex system, and generates intervals accordingly. We show that evolutionary timescales can be identified by looking for peaks in the similarity between the sets of events on consecutive time intervals of data. Tests on simple toy models reveal that the technique is able to detect evolutionary timescales of time-varying data both when the evolution is smooth as well as when it changes sharply. This is further corroborated by analyses of several real datasets. Our method is scalable to extremely large datasets and is computationally efficient. This allows a quick, parameter-free detection of multiple timescales in the evolution of a complex system.
Submitted 4 April, 2016;
originally announced April 2016.
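As a rough illustration of the idea in the abstract above (similarity between the sets of events on consecutive time intervals), the sketch below bins timestamped events into fixed-width windows and computes the Jaccard similarity of neighbouring windows. The Jaccard index, the fixed window width, the `(timestamp, identifier)` event format, and both function names are assumptions made for this example; the paper's method generates intervals adaptively and is not reproduced here.

```python
from typing import Dict, Hashable, List, Sequence, Set, Tuple

# Hypothetical event format: (timestamp, event identifier).
Event = Tuple[float, Hashable]


def jaccard(a: Set[Hashable], b: Set[Hashable]) -> float:
    """Jaccard similarity |a & b| / |a | b|; taken as 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def consecutive_similarity(events: Sequence[Event], width: float) -> List[float]:
    """Similarity between the event sets of consecutive fixed-width windows."""
    if not events:
        return []
    start = min(t for t, _ in events)
    windows: Dict[int, Set[Hashable]] = {}
    for t, ident in events:
        # Index of the window this event falls into.
        windows.setdefault(int((t - start) // width), set()).add(ident)
    # Windows without events are kept as empty sets so the sequence stays contiguous.
    sets = [windows.get(i, set()) for i in range(max(windows) + 1)]
    return [jaccard(sets[i], sets[i + 1]) for i in range(len(sets) - 1)]


if __name__ == "__main__":
    toy = [(0.0, "a"), (0.5, "b"), (1.0, "a"), (1.5, "b"), (2.0, "c"), (2.5, "d")]
    print(consecutive_similarity(toy, width=1.0))
```

In this toy input the first two windows contain the same events and the third contains entirely different ones, so the similarity drops between the second and third windows, which is the kind of change such a measure is meant to expose.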
-
Congestion induced by the structure of multiplex networks
Authors:
Albert Solé-Ribalta,
Sergio Gómez,
Alex Arenas
Abstract:
Multiplex networks are representations of multilayer interconnected complex networks where the nodes are the same at every layer. They turn out to be good abstractions of the intricate connectivity of multimodal transportation networks, among other types of complex systems. One of the most important critical phenomena arising in such networks is the emergence of congestion in transportation flows. Here we prove analytically that the structure of multiplex networks can induce congestion for flows that would otherwise remain uncongested if the individual layers were not interconnected. We provide explicit equations for the onset of congestion and approximations that allow this onset to be computed from descriptors of the individual layers. The observed cooperative phenomenon is reminiscent of Braess' paradox, in which adding extra capacity to a network when the moving entities selfishly choose their route can in some cases reduce overall performance. Similarly, in the multiplex structure, the efficiency in transportation can unbalance the transportation loads, resulting in unexpected congestion.
Submitted 24 February, 2016;
originally announced February 2016.
-
Approximate Hubel-Wiesel Modules and the Data Structures of Neural Computation
Authors:
Joel Z. Leibo,
Julien Cornebise,
Sergio Gómez,
Demis Hassabis
Abstract:
This paper describes a framework for modeling the interface between perception and memory on the algorithmic level of analysis. It is consistent with phenomena associated with many different brain regions. These include view-dependence (and invariance) effects in visual psychophysics and inferotemporal cortex physiology, as well as episodic memory recall interference effects associated with the medial temporal lobe. The perspective developed here relies on a novel interpretation of Hubel and Wiesel's conjecture for how receptive fields tuned to complex objects, and invariant to details, could be achieved. It complements existing accounts of two-speed learning systems in neocortex and hippocampus (e.g., McClelland et al. 1995) while significantly expanding their scope to encompass a unified view of the entire pathway from V1 to hippocampus.
Submitted 28 December, 2015;
originally announced December 2015.
-
Modeling and Querying Data Cubes on the Semantic Web
Authors:
Lorena Etcheverry,
Silvia Silvia Gomez,
Alejandro Vaisman
Abstract:
The web is changing the way in which data warehouses are designed, used, and queried. With the advent of initiatives such as Open Data and Open Government, organizations want to share their multidimensional data cubes and make them available to be queried online. The RDF data cube vocabulary (QB), the W3C standard to publish statistical data in RDF, presents several limitations to fully support the multidimensional model. The QB4OLAP vocabulary extends QB to overcome these limitations, making it possible to implement the typical OLAP operations, such as rollup, slice, dice, and drill-across, using standard SPARQL queries. In this paper we introduce a formal data model where the main object is the data cube, and define OLAP operations using this model, independent of the underlying representation of the cube. We then show that a cube expressed using our model can be represented using the QB4OLAP vocabulary, and finally we provide a SPARQL implementation of OLAP operations over data cubes in QB4OLAP.
Submitted 18 December, 2015;
originally announced December 2015.
-
Bond percolation on multiplex networks
Authors:
A. Hackett,
D. Cellai,
S. Gómez,
A. Arenas,
J. P. Gleeson
Abstract:
We present an analytical approach for bond percolation on multiplex networks and use it to determine the expected size of the giant connected component and the value of the critical bond occupation probability in these networks. We advocate the relevance of these tools to the modeling of multilayer robustness and contribute to the debate on whether any benefit is to be yielded from studying a full multiplex structure as opposed to its monoplex projection, especially in the seemingly irrelevant case of a bond occupation probability that does not depend on the layer. Although we find that in many cases the predictions of our theory for multiplex networks coincide with previously derived results for monoplex networks, we also uncover the remarkable result that for a certain class of multiplex networks, well described by our theory, new critical phenomena occur as multiple percolation phase transitions are present. We provide an instance of this phenomenon in a multiplex network constructed from London rail and European air transportation datasets.
Submitted 3 April, 2016; v1 submitted 30 September, 2015;
originally announced September 2015.
-
Information transfer in community structured multiplex networks
Authors:
Albert Solé-Ribalta,
Clara Granell,
Sergio Gómez,
Alex Arenas
Abstract:
The study of complex networks that account for different types of interactions has become a subject of interest in the last few years, especially because of their representational power in describing user interactions in diverse online social platforms (Facebook, Twitter, Instagram, etc.). The mathematical description of these interacting networks has been coined under the name of multilayer networks, where each layer accounts for a type of interaction. It has been shown that diffusive processes on top of these networks present a phenomenology that cannot be explained by the naive superposition of single-layer diffusive phenomena but requires the whole structure of interconnected layers. Nevertheless, the description of diffusive phenomena on multilayer networks has overlooked the fact that social networks have strong mesoscopic structure represented by different communities of individuals driven by common interests, or any other social aspect. In this work, we study the transfer of information in multilayer networks with community structure. The final goal is to understand and quantify whether the existence of well-defined community structure at the level of individual layers, together with the multilayer structure of the whole network, enhances or deteriorates the diffusion of packets of information.
△ Less
Submitted 15 September, 2015;
originally announced September 2015.
-
Layer-layer competition in multiplex complex networks
Authors:
Jesús Gómez-Gardeñes,
Manlio De Domenico,
Gerardo Gutiérrez,
Alex Arenas,
Sergio Gómez
Abstract:
The coexistence of multiple types of interactions within social, technological and biological networks has moved the focus of the physics of complex systems towards a multiplex description of the interactions between their constituents. This novel approach has unveiled that the multiplex nature of complex systems has a strong influence on the emergence of collective states and their critical properties. Here we address an important issue that is intrinsic to the coexistence of multiple means of interaction within a network: their competition. To this aim, we study a two-layer multiplex in which the activity of users can be localized in either of the layers or shared between them, favoring that neighboring nodes within a layer focus their activity on the same layer. This framework mimics the coexistence and competition of multiple communication channels, in a way that the prevalence of a particular communication platform emerges as a result of the localization of users' activity in one single interaction layer. Our results indicate that there is a transition from localization (use of a preferred layer) to delocalization (combined usage of both layers) and that the prevalence of a particular layer (in the localized state) depends on its structural properties.
△ Less
Submitted 1 September, 2015;
originally announced September 2015.
-
Random walk centrality in interconnected multilayer networks
Authors:
Albert Solé-Ribalta,
Manlio De Domenico,
Sergio Gómez,
Alex Arenas
Abstract:
Real-world complex systems exhibit multiple levels of relationships. In many cases they need to be modeled as interconnected multilayer networks, characterizing interactions of several types simultaneously. It is of crucial importance in many fields, from economics to biology and from urban planning to social sciences, to identify the most (or the least) influential nodes in a network using centrality measures. However, defining the centrality of actors in interconnected complex networks is not trivial. In this paper, we rely on the tensorial formalism recently proposed to characterize and investigate this kind of complex topology, and extend two well-known random walk centrality measures, the random walk betweenness and closeness centrality, to interconnected multilayer networks. For each of the measures we provide analytical expressions that completely agree with numerical results.
Submitted 23 June, 2015;
originally announced June 2015.
-
Efectividad de técnicas de prueba de software aplicadas por sujetos novicios de pregrado
Authors:
Omar S. Gómez,
Raúl A. Aguilar,
Juan P. Ucán
Abstract:
The main objective of this work is to examine possible effects of using freshman student subjects in software engineering experiments. Particularly in this work we report the effectiveness measured as percentage of observed and observable defects of two software testing techniques: Black-box and white-box.
Regarding observed defects, both techniques show an effectiveness of around 4%. With respect to defects observable by test cases, black-box testing is slightly more effective (21%) than white-box testing (16%), although this difference is not significant. We observe a considerable lack of technical skills among subjects in applying both software testing techniques. Given these findings, we suggest employing students with more technical skills for carrying out software engineering experiments.
-----
El objetivo de este trabajo se centra en investigar los efectos que conlleva realizar experimentos en ingeniería de software (IS) empleando como sujetos experimentales a estudiantes de pregrado cursando su primer año de estudios de la carrera en ingeniería de software. De manera particular en este trabajo se investiga la efectividad medida en porcentaje de defectos observados y observables de las técnicas de prueba de software funcional (caja negra) y estructural (caja blanca).
Con respecto a los defectos observados por los sujetos, ambas técnicas obtuvieron una efectividad del 4%. Con respecto a los defectos observables por los casos de prueba, la técnica funcional es ligeramente superior (21%) que la técnica estructural (16%), aunque esta diferencia no es significativa. Se observa un nivel de inexperiencia considerable en los sujetos para aplicar las técnicas. Dado los hallazgos encontrados, se sugiere emplear sujetos de pregrado con un nivel mayor de experiencia.
Submitted 29 April, 2015;
originally announced April 2015.
-
Strategical incoherence regulates cooperation in social dilemmas on multiplex networks
Authors:
Joan T. Matamalas,
Julia Poncela-Casasnovas,
Sergio Gómez,
Alex Arenas
Abstract:
Cooperation is a very common, yet not fully understood phenomenon in natural and human systems. The introduction of a network within the population is known to affect the outcome of cooperative dynamics, allowing for the survival of cooperation in adverse scenarios. Recently, the introduction of multiplex networks has yet again modified the expectations for the outcome of the Prisoner's Dilemma game, compared to the monoplex case. However, much remains unstudied regarding other social dilemmas on multiplex networks, as well as their unexplored microscopic underpinnings. In this paper, we systematically study the evolution of cooperation in all four games in the $T-S$ plane on multiplex networks. More importantly, we find some remarkable and previously unknown features in the microscopic organization of the strategies that are responsible for the important differences between cooperative dynamics in monoplex and multiplex networks. Specifically, we find that in the stationary state there are individuals that play the same strategy in all layers (coherent), and others that don't (incoherent). This second group of players is responsible for the surprising absence of full cooperation in the Harmony Game on multiplex networks, never observed before, as well as for higher-than-expected cooperation rates in some regions of the other three social dilemmas.
Submitted 18 April, 2015;
originally announced April 2015.
-
Benchmark model to assess community structure in evolving networks
Authors:
Clara Granell,
Richard K. Darst,
Alex Arenas,
Santo Fortunato,
Sergio Gómez
Abstract:
Detecting the time evolution of the community structure of networks is crucial to identify major changes in the internal organization of many complex systems, which may undergo important endogenous or exogenous events. This analysis can be done in two ways: considering each snapshot as an independent community detection problem or taking into account the whole evolution of the network. In the first case, one can apply static methods on the temporal snapshots, which correspond to configurations of the system in short time windows, and match afterwards the communities across layers. Alternatively, one can develop dedicated dynamic procedures, so that multiple snapshots are simultaneously taken into account while detecting communities, which allows us to keep memory of the flow. To check how well a method of any kind could capture the evolution of communities, suitable benchmarks are needed. Here we propose a model for generating simple dynamic benchmark graphs, based on stochastic block models. In them, the time evolution consists of a periodic oscillation of the system's structure between configurations with built-in community structure. We also propose the extension of quality comparison indices to the dynamic scenario.
Submitted 19 July, 2015; v1 submitted 23 January, 2015;
originally announced January 2015.
-
Competing spreading processes on multiplex networks: awareness and epidemics
Authors:
Clara Granell,
Sergio Gomez,
Alex Arenas
Abstract:
Epidemic-like spreading processes on top of multilayered interconnected complex networks reveal a rich phase diagram of intertwined competition effects. A recent study by the authors [Granell et al. Phys. Rev. Lett. 111, 128701 (2013)] presented the analysis of the interrelation between two processes accounting for the spreading of an epidemic, and the spreading of information awareness to prevent its infection, on top of multiplex networks. The results in the case in which awareness implies total immunization to the disease revealed the existence of a metacritical point at which the critical onset of the epidemic starts depending on the reach of the awareness process. Here we present a full analysis of these critical properties in the more general scenario where the awareness spreading does not imply total immunization, and where infection does not imply immediate awareness of it. We find the critical relation between both competing processes for a wide spectrum of parameters representing the interaction between them. We also analyze the consequences of a massive broadcast of awareness (mass media) on the final outcome of the epidemic incidence. Importantly, the mass media makes the metacritical point disappear. The results reveal that the main finding, i.e. the existence of a metacritical point, is rooted in the competition principle and holds for a large set of scenarios.
Submitted 18 May, 2014;
originally announced May 2014.
-
Structural patterns in complex systems using multidendrograms
Authors:
Sergio Gomez,
Alberto Fernandez,
Clara Granell,
Alex Arenas
Abstract:
Complex systems are usually represented as an intricate set of relations between their components forming a complex graph or network. The understanding of their functioning and emergent properties is strongly related to their structural properties. The finding of structural patterns is of utmost importance to reduce the problem of understanding the structure-function relationships. Here we propose the analysis of similarity measures between nodes using hierarchical clustering methods. The discrete nature of the networks usually leads to a small set of different similarity values, making standard hierarchical clustering algorithms ambiguous. We propose the use of "multidendrograms", an algorithm that computes agglomerative hierarchical clusterings implementing a variable-group technique that solves the non-uniqueness problem found in the standard pair-group algorithm. This problem arises when there are more than two clusters separated by the same maximum similarity (or minimum distance) during the agglomerative process. Forcing binary trees in this case means breaking ties in some way, thus giving rise to different output clusterings depending on the criterion used. Multidendrograms solves this problem by grouping more than two clusters at the same time when ties occur.
Submitted 6 January, 2014;
originally announced January 2014.
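The tie problem described in the abstract above can be illustrated with a small sketch of a single agglomeration step: instead of merging one arbitrary pair at the minimum distance, every cluster connected through a minimum-distance pair is collected into the same group. The function name `tied_groups`, the dictionary-of-frozensets distance format, and the use of exact float equality to detect ties are assumptions made for this example; this is not the authors' implementation, and it covers only one merge step.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List


def tied_groups(dist: Dict[FrozenSet[str], float], clusters: List[str]) -> List[List[str]]:
    """Return the groups of clusters linked by pairs at the minimum distance.

    `dist` maps frozenset({a, b}) to the distance between clusters a and b.
    Ties are detected with exact equality (an assumption of this sketch).
    """
    d_min = min(dist[frozenset(pair)] for pair in combinations(clusters, 2))
    parent = {c: c for c in clusters}  # union-find forest over cluster labels

    def find(c: str) -> str:
        while parent[c] != c:
            parent[c] = parent[parent[c]]  # path halving
            c = parent[c]
        return c

    # Join every pair that sits exactly at the minimum distance.
    for a, b in combinations(clusters, 2):
        if dist[frozenset((a, b))] == d_min:
            parent[find(a)] = find(b)

    groups: Dict[str, List[str]] = {}
    for c in clusters:
        groups.setdefault(find(c), []).append(c)
    # Only groups with at least two members are merged at this step.
    return [sorted(g) for g in groups.values() if len(g) > 1]


if __name__ == "__main__":
    d = {
        frozenset(("A", "B")): 1.0, frozenset(("B", "C")): 1.0,
        frozenset(("A", "C")): 2.0, frozenset(("A", "D")): 3.0,
        frozenset(("B", "D")): 3.0, frozenset(("C", "D")): 3.0,
    }
    print(tied_groups(d, ["A", "B", "C", "D"]))
```

In the toy input, A-B and B-C are tied at the minimum distance, so a pair-group method would have to choose one of them, whereas the sketch returns A, B and C as a single group.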
-
Centrality in Interconnected Multilayer Networks
Authors:
Manlio De Domenico,
Albert Solé-Ribalta,
Elisa Omodei,
Sergio Gómez,
Alex Arenas
Abstract:
Real-world complex systems exhibit multiple levels of relationships. In many cases, they need to be modeled by interconnected multilayer networks, characterizing interactions on several levels simultaneously. It is of crucial importance in many fields, from economics to biology, from urban planning to social sciences, to identify the most (or the least) influential nodes in a network. However, defining the centrality of actors in an interconnected structure is not trivial.
In this paper, we capitalize on the tensorial formalism, recently proposed to characterize and investigate this kind of complex topologies, to show how several centrality measures -- well-known in the case of standard ("monoplex") networks -- can be extended naturally to the realm of interconnected multiplexes. We consider diagnostics widely used in different fields, e.g., computer science, biology, communication and social sciences, to cite only some of them. We show, both theoretically and numerically, that using the weighted monoplex obtained by aggregating the multilayer network leads, in general, to relevant differences in ranking the nodes by their importance.
Submitted 12 November, 2013;
originally announced November 2013.
-
Structure of Triadic Relations in Multiplex Networks
Authors:
Emanuele Cozzo,
Mikko Kivelä,
Manlio De Domenico,
Albert Solé,
Alex Arenas,
Sergio Gómez,
Mason A. Porter,
Yamir Moreno
Abstract:
Recent advances in the study of networked systems have highlighted that our interconnected world is composed of networks that are coupled to each other through different "layers" that each represent one of many possible subsystems or types of interactions. Nevertheless, it is traditional to aggregate multilayer networks into a single weighted network in order to take advantage of existing tools. This is admittedly convenient, but it is also extremely problematic, as important information can be lost as a result. It is therefore important to develop multilayer generalizations of network concepts. In this paper, we analyze triadic relations and generalize the idea of transitivity to multiplex networks. By focusing on triadic relations, which yield the simplest type of transitivity, we generalize the concept and computation of clustering coefficients to multiplex networks. We show how the layered structure of such networks introduces a new degree of freedom that has a fundamental effect on transitivity. We compute multiplex clustering coefficients for several real multiplex networks and illustrate why one must take great care when generalizing standard network concepts to multiplex networks. We also derive analytical expressions for our clustering coefficients for ensemble averages of networks in a family of random multiplex networks. Our analysis illustrates that social networks have a strong tendency to promote redundancy by closing triads at every layer and that they thereby have a different type of multiplex transitivity from transportation networks, which do not exhibit such a tendency. These insights are invisible if one only studies aggregated networks.
Submitted 12 August, 2015; v1 submitted 25 July, 2013;
originally announced July 2013.