Search | arXiv e-print repository

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

Authors: Bernal Jiménez Gutiérrez, Yiheng Shu, Yu Gu, Michihiro Yasunaga, Yu Su

Abstract: In order to thrive in hostile and ever-changing natural environments, mammalian brains evolved to store large amounts of knowledge about the world and continually integrate new information while avoiding catastrophic forgetting. Despite the impressive accomplishments, large language models (LLMs), even with retrieval-augmented generation (RAG), still struggle to efficiently and effectively integra… ▽ More In order to thrive in hostile and ever-changing natural environments, mammalian brains evolved to store large amounts of knowledge about the world and continually integrate new information while avoiding catastrophic forgetting. Despite the impressive accomplishments, large language models (LLMs), even with retrieval-augmented generation (RAG), still struggle to efficiently and effectively integrate a large amount of new experiences after pre-training. In this work, we introduce HippoRAG, a novel retrieval framework inspired by the hippocampal indexing theory of human long-term memory to enable deeper and more efficient knowledge integration over new experiences. HippoRAG synergistically orchestrates LLMs, knowledge graphs, and the Personalized PageRank algorithm to mimic the different roles of neocortex and hippocampus in human memory. We compare HippoRAG with existing RAG methods on multi-hop question answering and show that our method outperforms the state-of-the-art methods remarkably, by up to 20%. Single-step retrieval with HippoRAG achieves comparable or better performance than iterative retrieval like IRCoT while being 10-30 times cheaper and 6-13 times faster, and integrating HippoRAG into IRCoT brings further substantial gains. Finally, we show that our method can tackle new types of scenarios that are out of reach of existing methods. Code and data are available at https://github.com/OSU-NLP-Group/HippoRAG. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2403.01622 [pdf, other]

A Human-Centered Approach for Bootstrap** Causal Graph Creation

Authors: Minh Q. Tram, Nolan B. Gutierrez, William J. Beksi

Abstract: Causal inference, a cornerstone in disciplines such as economics, genomics, and medicine, is increasingly being recognized as fundamental to advancing the field of robotics. In particular, the ability to reason about cause and effect from observational data is crucial for robust generalization in robotic systems. However, the construction of a causal graphical model, a mechanism for representing c… ▽ More Causal inference, a cornerstone in disciplines such as economics, genomics, and medicine, is increasingly being recognized as fundamental to advancing the field of robotics. In particular, the ability to reason about cause and effect from observational data is crucial for robust generalization in robotic systems. However, the construction of a causal graphical model, a mechanism for representing causal relations, presents an immense challenge. Currently, a nuanced grasp of causal inference, coupled with an understanding of causal relationships, must be manually programmed into a causal graphical model. To address this difficulty, we present initial results towards a human-centered augmented reality framework for creating causal graphical models. Concretely, our system bootstraps the causal discovery process by involving humans in selecting variables, establishing relationships, performing interventions, generating counterfactual explanations, and evaluating the resulting causal graph at every step. We highlight the potential of our framework via a physical robot manipulator on a pick-and-place task. △ Less

Submitted 3 March, 2024; originally announced March 2024.

Comments: To be presented at the 2024 ACM/IEEE International Conference on Human-Robot Interaction (HRI) Workshop on Causal Learning for Human-Robot Interaction (Causal-HRI)

arXiv:2311.15106 [pdf, other]

Solving the Right Problem is Key for Translational NLP: A Case Study in UMLS Vocabulary Insertion

Authors: Bernal Jimenez Gutierrez, Yuqing Mao, Vinh Nguyen, Kin Wah Fung, Yu Su, Olivier Bodenreider

Abstract: As the immense opportunities enabled by large language models become more apparent, NLP systems will be increasingly expected to excel in real-world settings. However, in many instances, powerful models alone will not yield translational NLP solutions, especially if the formulated problem is not well aligned with the real-world task. In this work, we study the case of UMLS vocabulary insertion, an… ▽ More As the immense opportunities enabled by large language models become more apparent, NLP systems will be increasingly expected to excel in real-world settings. However, in many instances, powerful models alone will not yield translational NLP solutions, especially if the formulated problem is not well aligned with the real-world task. In this work, we study the case of UMLS vocabulary insertion, an important real-world task in which hundreds of thousands of new terms, referred to as atoms, are added to the UMLS, one of the most comprehensive open-source biomedical knowledge bases. Previous work aimed to develop an automated NLP system to make this time-consuming, costly, and error-prone task more efficient. Nevertheless, practical progress in this direction has been difficult to achieve due to a problem formulation and evaluation gap between research output and the real-world task. In order to address this gap, we introduce a new formulation for UMLS vocabulary insertion which mirrors the real-world task, datasets which faithfully represent it and several strong baselines we developed through re-purposing existing solutions. Additionally, we propose an effective rule-enhanced biomedical language model which enables important new model behavior, outperforms all strong baselines and provides measurable qualitative improvements to editors who carry out the UVI task. We hope this case study provides insight into the considerable importance of problem formulation for the success of translational NLP solutions. △ Less

Submitted 25 November, 2023; originally announced November 2023.

Comments: EMNLP 2023 Findings; Code is available at https://github.com/OSU-NLP-Group/UMLS-Vocabulary-Insertion

arXiv:2307.16193 [pdf, other]

Minimal numerical ingredients describe chemical microswimmers's 3D motion

Authors: Maximilian R. Bailey, C. Miguel Barriuso Gutiérrez, José Martín-Roca, Vincent Niggel, Virginia Carrasco-Fadanelli, Ivo Buttinoni, Ignacio Pagonabarraga, Lucio Isa, Chantal Valeriani

Abstract: The underlying mechanisms and physics of catalytic Janus microswimmers is highly complex, requiring details of the associated phoretic fields and the physiochemical properties of catalyst, particle, boundaries, and the fuel used. Therefore, develo** a minimal (and more general) model capable of capturing the overall dynamics of these autonomous particles is highly desirable. In the presented wor… ▽ More The underlying mechanisms and physics of catalytic Janus microswimmers is highly complex, requiring details of the associated phoretic fields and the physiochemical properties of catalyst, particle, boundaries, and the fuel used. Therefore, develo** a minimal (and more general) model capable of capturing the overall dynamics of these autonomous particles is highly desirable. In the presented work, we demonstrate that a coarse-grained dissipative particle-hydrodynamics model is capable of describing the behaviour of various chemical microswimmer systems. Specifically, we show how a competing balance between hydrodynamic interactions experienced by a squirmer in the presence of a substrate, gravity, and mass and shape asymmetries can reproduce a range of dynamics seen in different experimental systems. We hope that our general model will inspire further synthetic work where various modes of swimmer motion can be encoded via shape and mass during fabrication, hel** to realise the still outstanding goal of microswimmers capable of complex 3-D behaviour △ Less

Submitted 30 July, 2023; originally announced July 2023.

arXiv:2306.17649 [pdf, other]

Biomedical Language Models are Robust to Sub-optimal Tokenization

Authors: Bernal Jiménez Gutiérrez, Huan Sun, Yu Su

Abstract: As opposed to general English, many concepts in biomedical terminology have been designed in recent history by biomedical professionals with the goal of being precise and concise. This is often achieved by concatenating meaningful biomedical morphemes to create new semantic units. Nevertheless, most modern biomedical language models (LMs) are pre-trained using standard domain-specific tokenizers d… ▽ More As opposed to general English, many concepts in biomedical terminology have been designed in recent history by biomedical professionals with the goal of being precise and concise. This is often achieved by concatenating meaningful biomedical morphemes to create new semantic units. Nevertheless, most modern biomedical language models (LMs) are pre-trained using standard domain-specific tokenizers derived from large scale biomedical corpus statistics without explicitly leveraging the agglutinating nature of biomedical language. In this work, we first find that standard open-domain and biomedical tokenizers are largely unable to segment biomedical terms into meaningful components. Therefore, we hypothesize that using a tokenizer which segments biomedical terminology more accurately would enable biomedical LMs to improve their performance on downstream biomedical NLP tasks, especially ones which involve biomedical terms directly such as named entity recognition (NER) and entity linking. Surprisingly, we find that pre-training a biomedical LM using a more accurate biomedical tokenizer does not improve the entity representation quality of a language model as measured by several intrinsic and extrinsic measures such as masked language modeling prediction (MLM) accuracy as well as NER and entity linking performance. These quantitative findings, along with a case study which explores entity representation quality more directly, suggest that the biomedical pre-training process is quite robust to instances of sub-optimal tokenization. △ Less

Submitted 10 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

Comments: BioNLP @ ACL 2023

arXiv:2306.05314 [pdf, other]

Mode-locked laser in nanophotonic lithium niobate

Authors: Qiushi Guo, Ryoto Sekine, James A. Williams, Benjamin K. Gutierrez, Robert M. Gray, Luis Ledezma, Luis Costa, Arkadev Roy, Selina Zhou, Mingchen Liu, Alireza Marandi

Abstract: Mode-locked lasers (MLLs) have enabled ultrafast sciences and technologies by generating ultrashort pulses with peak powers substantially exceeding their average powers. Recently, tremendous efforts have been focused on realizing integrated MLLs not only to address the challenges associated with their size and power demand, but also to enable transforming the ultrafast technologies into nanophoton… ▽ More Mode-locked lasers (MLLs) have enabled ultrafast sciences and technologies by generating ultrashort pulses with peak powers substantially exceeding their average powers. Recently, tremendous efforts have been focused on realizing integrated MLLs not only to address the challenges associated with their size and power demand, but also to enable transforming the ultrafast technologies into nanophotonic chips, and ultimately to unlock their potential for a plethora of applications. However, till now the prospect of integrated MLLs driving ultrafast nanophotonic circuits has remained elusive because of their typically low peak powers, lack of controllability, and challenges with integration with appropriate nanophotonic platforms. Here, we overcome these limitations by demonstrating an electrically-pumped actively MLL in nanophotonic lithium niobate based on its hybrid integration with a III-V semiconductor optical amplifier. Our MLL generates $\sim$4.8 ps optical pulses around 1065 nm at a repetition rate of $\sim$10 GHz, with pulse energy exceeding 2.6 pJ and a high peak power beyond 0.5 W. We show that both the repetition rate and the carrier-envelope-offset of the resulting frequency comb can be flexibly controlled in a wide range using the RF driving frequency and the pump current, paving the way for fully-stabilized on-chip frequency combs in nanophotonics. Our work marks an important step toward fully-integrated nonlinear and ultrafast photonic systems in nanophotonic lithium niobate. △ Less

Submitted 9 June, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

arXiv:2305.11159 [pdf, other]

Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors

Authors: Kai Zhang, Bernal Jiménez Gutiérrez, Yu Su

Abstract: Recent work has shown that fine-tuning large language models (LLMs) on large-scale instruction-following datasets substantially improves their performance on a wide range of NLP tasks, especially in the zero-shot setting. However, even advanced instruction-tuned LLMs still fail to outperform small LMs on relation extraction (RE), a fundamental information extraction task. We hypothesize that instr… ▽ More Recent work has shown that fine-tuning large language models (LLMs) on large-scale instruction-following datasets substantially improves their performance on a wide range of NLP tasks, especially in the zero-shot setting. However, even advanced instruction-tuned LLMs still fail to outperform small LMs on relation extraction (RE), a fundamental information extraction task. We hypothesize that instruction-tuning has been unable to elicit strong RE capabilities in LLMs due to RE's low incidence in instruction-tuning datasets, making up less than 1% of all tasks (Wang et al., 2022). To address this limitation, we propose QA4RE, a framework that aligns RE with question answering (QA), a predominant task in instruction-tuning datasets. Comprehensive zero-shot RE experiments over four datasets with two series of instruction-tuned LLMs (six LLMs in total) demonstrate that our QA4RE framework consistently improves LLM performance, strongly verifying our hypothesis and enabling LLMs to outperform strong zero-shot baselines by a large margin. Additionally, we provide thorough experiments and discussions to show the robustness, few-shot effectiveness, and strong transferability of our QA4RE framework. This work illustrates a promising way of adapting LLMs to challenging and underrepresented tasks by aligning these tasks with more common instruction-tuning tasks like QA. △ Less

Submitted 18 May, 2023; originally announced May 2023.

Comments: ACL 2023 Findings; The code is available at https://github.com/OSU-NLP-Group/QA4RE

arXiv:2301.13256 [pdf, other]

Spatial scales of COVID-19 transmission in Mexico

Authors: Brennan Klein, Harrison Hartle, Munik Shrestha, Ana Cecilia Zenteno, David Barros Sierra Cordera, José R. Nicolas-Carlock, Ana I. Bento, Benjamin M. Althouse, Bernardo Gutierrez, Marina Escalera-Zamudio, Arturo Reyes-Sandoval, Oliver G. Pybus, Alessandro Vespignani, Jose Alberto Diaz-Quiñonez, Samuel V. Scarpino, Moritz U. G. Kraemer

Abstract: During outbreaks of emerging infectious diseases, internationally connected cities often experience large and early outbreaks, while rural regions follow after some delay. This hierarchical structure of disease spread is influenced primarily by the multiscale structure of human mobility. However, during the COVID-19 epidemic, public health responses typically did not take into consideration the ex… ▽ More During outbreaks of emerging infectious diseases, internationally connected cities often experience large and early outbreaks, while rural regions follow after some delay. This hierarchical structure of disease spread is influenced primarily by the multiscale structure of human mobility. However, during the COVID-19 epidemic, public health responses typically did not take into consideration the explicit spatial structure of human mobility when designing non-pharmaceutical interventions (NPIs). NPIs were applied primarily at national or regional scales. Here we use weekly anonymized and aggregated human mobility data and spatially highly resolved data on COVID-19 cases, deaths and hospitalizations at the municipality level in Mexico to investigate how behavioural changes in response to the pandemic have altered the spatial scales of transmission and interventions during its first wave (March - June 2020). We find that the epidemic dynamics in Mexico were initially driven by SARS-CoV-2 exports from Mexico State and Mexico City, where early outbreaks occurred. The mobility network shifted after the implementation of interventions in late March 2020, and the mobility network communities became more disjointed while epidemics in these communities became increasingly synchronised. Our results provide actionable and dynamic insights into how to use network science and epidemiological modelling to inform the spatial scale at which interventions are most impactful in mitigating the spread of COVID-19 and infectious diseases in general. △ Less

Submitted 30 January, 2023; originally announced January 2023.

arXiv:2211.10872 [pdf, other]

MetaMax: Improved Open-Set Deep Neural Networks via Weibull Calibration

Authors: Zongyao Lyu, Nolan B. Gutierrez, William J. Beksi

Abstract: Open-set recognition refers to the problem in which classes that were not seen during training appear at inference time. This requires the ability to identify instances of novel classes while maintaining discriminative capability for closed-set classification. OpenMax was the first deep neural network-based approach to address open-set recognition by calibrating the predictive scores of a standard… ▽ More Open-set recognition refers to the problem in which classes that were not seen during training appear at inference time. This requires the ability to identify instances of novel classes while maintaining discriminative capability for closed-set classification. OpenMax was the first deep neural network-based approach to address open-set recognition by calibrating the predictive scores of a standard closed-set classification network. In this paper we present MetaMax, a more effective post-processing technique that improves upon contemporary methods by directly modeling class activation vectors. MetaMax removes the need for computing class mean activation vectors (MAVs) and distances between a query image and a class MAV as required in OpenMax. Experimental results show that MetaMax outperforms OpenMax and is comparable in performance to other state-of-the-art approaches. △ Less

Submitted 20 November, 2022; originally announced November 2022.

Comments: To be presented at the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshop on Dealing with Novelty in Open Worlds (DNOW)

arXiv:2207.12291 [pdf, other]

On the convergence and sampling of randomized primal-dual algorithms and their application to parallel MRI reconstruction

Authors: Eric B Gutierrez, Claire Delplancke, Matthias J Ehrhardt

Abstract: Stochastic Primal-Dual Hybrid Gradient (SPDHG) is an algorithm proposed by Chambolle et al. (2018) to efficiently solve a wide class of nonsmooth large-scale optimization problems. In this paper we contribute to its theoretical foundations and prove its almost sure convergence for convex but neither necessarily strongly convex nor smooth functionals, as well as for any random sampling. In addition… ▽ More Stochastic Primal-Dual Hybrid Gradient (SPDHG) is an algorithm proposed by Chambolle et al. (2018) to efficiently solve a wide class of nonsmooth large-scale optimization problems. In this paper we contribute to its theoretical foundations and prove its almost sure convergence for convex but neither necessarily strongly convex nor smooth functionals, as well as for any random sampling. In addition, we study SPDHG for parallel Magnetic Resonance Imaging reconstruction, where data from different coils are randomly selected at each iteration. We apply SPDHG using a wide range of random sampling methods and compare its performance across a range of settings, including mini-batch size and step size parameters. We show that the sampling can significantly affect the convergence speed of SPDHG and for many cases an optimal sampling can be identified. △ Less

Submitted 24 November, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

arXiv:2205.07160 [pdf, other]

Evaluating Uncertainty Calibration for Open-Set Recognition

Authors: Zongyao Lyu, Nolan B. Gutierrez, William J. Beksi

Abstract: Despite achieving enormous success in predictive accuracy for visual classification problems, deep neural networks (DNNs) suffer from providing overconfident probabilities on out-of-distribution (OOD) data. Yet, accurate uncertainty estimation is crucial for safe and reliable robot autonomy. In this paper, we evaluate popular calibration techniques for open-set conditions in a way that is distinct… ▽ More Despite achieving enormous success in predictive accuracy for visual classification problems, deep neural networks (DNNs) suffer from providing overconfident probabilities on out-of-distribution (OOD) data. Yet, accurate uncertainty estimation is crucial for safe and reliable robot autonomy. In this paper, we evaluate popular calibration techniques for open-set conditions in a way that is distinctly different from the conventional evaluation of calibration methods on OOD data. Our results show that closed-set DNN calibration approaches are much less effective for open-set recognition, which highlights the need to develop new DNN calibration methods to address this problem. △ Less

Submitted 14 May, 2022; originally announced May 2022.

Comments: To be presented at the 2022 IEEE International Conference on Robotics and Automation (ICRA) Workshop on Safe and Reliable Robot Autonomy under Uncertainty

arXiv:2204.10612 [pdf, other]

Simulating active agents under confinement with Dissipative Particles (hydro)Dynamics

Authors: Carlos Miguel Barriuso Gutierrez, Jose Martin-Roca, Valentino Bianco, Ignacio Pagonabarraga, Chantal Valeriani

Abstract: We study active agents embedded in bulk or in confinement explicitly considering hydrodynamics and simulating the swimmers via an implementation inspired by the squirmer model. We develop a Dissipative Particle Dynamics scheme for the solvent. This approach allows us to properly deal not only with hydrodynamics but also with thermal fluctuations. On the other side, this approach enables us to stud… ▽ More We study active agents embedded in bulk or in confinement explicitly considering hydrodynamics and simulating the swimmers via an implementation inspired by the squirmer model. We develop a Dissipative Particle Dynamics scheme for the solvent. This approach allows us to properly deal not only with hydrodynamics but also with thermal fluctuations. On the other side, this approach enables us to study active agents with complex shapes, ranging from spherical colloids to polymers. To start with, we study a simple spherical colloid. We analyze the features of the velocity fields of the surrounding solvent, when the colloid is a pusher, a puller or a neutral swimmer either in bulk or confined in a cylindrical channel. Next, we characterise its dynamical behaviour by computing the mean square displacement and the long time diffusion when the active colloid is in bulk or in a channel (varying its radius) and analyze the orientation autocorrelation function in the latter case. While the three studied squirmer types are characterised by the same bulk diffusion, the cylindrical confinement considerably modulates the diffusion and the orientation autocorrelation function. Finally, we focus our attention on a more complex shape: an active polymer. We first characterise the structural features computing its radius of gyration when in bulk or in cylindrical confinement, and compare to known results obtained without hydrodynamics. Next, we characterise the dynamical behaviour of the active polymer by computing its mean square displacement and the long time diffusion. On the one hand, both diffusion and radius of gyration decrease due to the hydrodynamic interaction when the system is in bulk. On the other hand, the effect of confinement is to decrease the radius of gyration, disturbing the motion of the polymer and thus reducing its diffusion. △ Less

Submitted 22 April, 2022; originally announced April 2022.

arXiv:2203.14846 [pdf, other]

Discovering dynamic laws from observations: the case of self-propelled, interacting colloids

Authors: Miguel Ruiz-Garcia, C. Miguel Barriuso Gutierrez, Lachlan C. Alexander, Dirk G. A. L. Aarts, Luca Ghiringhelli, Chantal Valeriani

Abstract: Active matter spans a wide range of time and length scales, from groups of cells and synthetic self-propelled particles to schools of fish, flocks of birds, or even human crowds. The theoretical framework describing these systems has shown tremendous success at finding universal phenomenology. However, further progress is often burdened by the difficulty of determining the forces that control the… ▽ More Active matter spans a wide range of time and length scales, from groups of cells and synthetic self-propelled particles to schools of fish, flocks of birds, or even human crowds. The theoretical framework describing these systems has shown tremendous success at finding universal phenomenology. However, further progress is often burdened by the difficulty of determining the forces that control the dynamics of the individual elements within each system. Accessing this local information is key to understanding the physics dominating the system and to create the models that can explain the observed collective phenomena. In this work, we present a machine-learning model, a graph neural network, that uses the collective movement of the system to learn the active and two-body forces controlling the individual dynamics of the particles. We verify our approach using numerical simulations of active brownian particles, considering different interaction potentials and levels of activity. Finally, we apply our model to experiments of electrophoretic Janus particles, extracting the active and two-body forces that control the dynamics of the colloids. Due to this, we can uncover the physics dominating the behavior of the system. We extract an active force that depends on the electric field and also area fraction. We also discover a dependence of the two-body interaction with the electric field that leads us to propose that the dominant force between these colloids is a screened electrostatic interaction with a constant length scale. We expect that this methodology can open a new avenue for the study and modeling of experimental systems of active particles. △ Less

Submitted 19 July, 2023; v1 submitted 28 March, 2022; originally announced March 2022.

arXiv:2203.08410 [pdf, other]

Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again

Authors: Bernal Jiménez Gutiérrez, Nikolas McNeal, Clay Washington, You Chen, Lang Li, Huan Sun, Yu Su

Abstract: The strong few-shot in-context learning capability of large pre-trained language models (PLMs) such as GPT-3 is highly appealing for application domains such as biomedicine, which feature high and diverse demands of language technologies but also high data annotation costs. In this paper, we present the first systematic and comprehensive study to compare the few-shot performance of GPT-3 in-contex… ▽ More The strong few-shot in-context learning capability of large pre-trained language models (PLMs) such as GPT-3 is highly appealing for application domains such as biomedicine, which feature high and diverse demands of language technologies but also high data annotation costs. In this paper, we present the first systematic and comprehensive study to compare the few-shot performance of GPT-3 in-context learning with fine-tuning smaller (i.e., BERT-sized) PLMs on two highly representative biomedical information extraction tasks, named entity recognition and relation extraction. We follow the true few-shot setting to avoid overestimating models' few-shot performance by model selection over a large validation set. We also optimize GPT-3's performance with known techniques such as contextual calibration and dynamic in-context example retrieval. However, our results show that GPT-3 still significantly underperforms compared to simply fine-tuning a smaller PLM. In addition, GPT-3 in-context learning also yields smaller gains in accuracy when more training data becomes available. Our in-depth analyses further reveal issues of the in-context learning setting that may be detrimental to information extraction tasks in general. Given the high cost of experimenting with GPT-3, we hope our study provides guidance for biomedical researchers and practitioners towards more promising directions such as fine-tuning small PLMs. △ Less

Submitted 5 November, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: EMNLP-Findings 2022

arXiv:2108.00094 [pdf, other]

Thermal Image Super-Resolution Using Second-Order Channel Attention with Varying Receptive Fields

Authors: Nolan B. Gutierrez, William J. Beksi

Abstract: Thermal images model the long-infrared range of the electromagnetic spectrum and provide meaningful information even when there is no visible illumination. Yet, unlike imagery that represents radiation from the visible continuum, infrared images are inherently low-resolution due to hardware constraints. The restoration of thermal images is critical for applications that involve safety, search and… ▽ More Thermal images model the long-infrared range of the electromagnetic spectrum and provide meaningful information even when there is no visible illumination. Yet, unlike imagery that represents radiation from the visible continuum, infrared images are inherently low-resolution due to hardware constraints. The restoration of thermal images is critical for applications that involve safety, search and rescue, and military operations. In this paper, we introduce a system to efficiently reconstruct thermal images. Specifically, we explore how to effectively attend to contrasting receptive fields (RFs) where increasing the RFs of a network can be computationally expensive. For this purpose, we introduce a deep attention to varying receptive fields network (AVRFN). We supply a gated convolutional layer with higher-order information extracted from disparate RFs, whereby an RF is parameterized by a dilation rate. In this way, the dilation rate can be tuned to use fewer parameters thus increasing the efficacy of AVRFN. Our experimental results show an improvement over the state of the art when compared against competing thermal image super-resolution methods. △ Less

Submitted 30 July, 2021; originally announced August 2021.

Comments: To be published in the 2021 13th International Conference on Computer Vision Systems (ICVS)

arXiv:2106.15007 [pdf, other]

An Uncertainty Estimation Framework for Probabilistic Object Detection

Authors: Zongyao Lyu, Nolan B. Gutierrez, William J. Beksi

Abstract: In this paper, we introduce a new technique that combines two popular methods to estimate uncertainty in object detection. Quantifying uncertainty is critical in real-world robotic applications. Traditional detection models can be ambiguous even when they provide a high-probability output. Robot actions based on high-confidence, yet unreliable predictions, may result in serious repercussions. Our… ▽ More In this paper, we introduce a new technique that combines two popular methods to estimate uncertainty in object detection. Quantifying uncertainty is critical in real-world robotic applications. Traditional detection models can be ambiguous even when they provide a high-probability output. Robot actions based on high-confidence, yet unreliable predictions, may result in serious repercussions. Our framework employs deep ensembles and Monte Carlo dropout for approximating predictive uncertainty, and it improves upon the uncertainty estimation quality of the baseline method. The proposed approach is evaluated on publicly available synthetic image datasets captured from sequences of video. △ Less

Submitted 28 June, 2021; originally announced June 2021.

Comments: To be published in the 2021 International Conference on Automation Science and Engineering (CASE)

arXiv:2102.07249 [pdf, ps, other]

Effectual Topological Complexity

Authors: Natalia Cadavid-Aguilar, Jesús González, Bárbara Gutiérrez, Cesar A. Ipanaque-Zapata

Abstract: We introduce the effectual topological complexity (ETC) of a $G$-space $X$. This is a $G$-equivariant homotopy invariant sitting in between the effective topological complexity of the pair $(X,G)$ and the (regular) topological complexity of the orbit space $X/G$. We study ETC for spheres and surfaces with antipodal involution, obtaining a full computation in the case of the torus. This allows us t… ▽ More We introduce the effectual topological complexity (ETC) of a $G$-space $X$. This is a $G$-equivariant homotopy invariant sitting in between the effective topological complexity of the pair $(X,G)$ and the (regular) topological complexity of the orbit space $X/G$. We study ETC for spheres and surfaces with antipodal involution, obtaining a full computation in the case of the torus. This allows us to prove the vanishing of twice the non-trivial obstruction responsible for the fact that the topological complexity of the Klein bottle is 4. In addition, this gives a counterexample to the possibility -- suggested in Pavešić's work on the topological complexity of a map -- that ETC of $(X,G)$ would agree with Farber's $TC(X)$ whenever the projection map $X\to X/G$ is finitely sheeted. We conjecture that ETC of spheres with antipodal action recasts the Hopf invariant one problem, and describe (conjecturally optimal) effectual motion planners. △ Less

Submitted 14 February, 2021; originally announced February 2021.

Comments: 19 pages

MSC Class: 55M30; 57S25; 68T40; 93C85

arXiv:2012.01255 [pdf, other]

Convergence Properties of a Randomized Primal-Dual Algorithm with Applications to Parallel MRI

Authors: Eric B. Gutierrez, Claire Delplancke, Matthias J. Ehrhardt

Abstract: The Stochastic Primal-Dual Hybrid Gradient (SPDHG) was proposed by Chambolle et al. (2018) and is an efficient algorithm to solve some nonsmooth large-scale optimization problems. In this paper we prove its almost sure convergence for convex but not necessarily strongly convex functionals. We also look into its application to parallel Magnetic Resonance Imaging reconstruction in order to test perf… ▽ More The Stochastic Primal-Dual Hybrid Gradient (SPDHG) was proposed by Chambolle et al. (2018) and is an efficient algorithm to solve some nonsmooth large-scale optimization problems. In this paper we prove its almost sure convergence for convex but not necessarily strongly convex functionals. We also look into its application to parallel Magnetic Resonance Imaging reconstruction in order to test performance of SPDHG. Our numerical results show that for a range of settings SPDHG converges significantly faster than its deterministic counterpart. △ Less

Submitted 31 March, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

arXiv:2006.13816 [pdf, other]

Document Classification for COVID-19 Literature

Authors: Bernal Jiménez Gutiérrez, Juncheng Zeng, Dongdong Zhang, ** Zhang, Yu Su

Abstract: The global pandemic has made it more important than ever to quickly and accurately retrieve relevant scientific literature for effective consumption by researchers in a wide range of fields. We provide an analysis of several multi-label document classification models on the LitCovid dataset, a growing collection of 23,000 research papers regarding the novel 2019 coronavirus. We find that pre-train… ▽ More The global pandemic has made it more important than ever to quickly and accurately retrieve relevant scientific literature for effective consumption by researchers in a wide range of fields. We provide an analysis of several multi-label document classification models on the LitCovid dataset, a growing collection of 23,000 research papers regarding the novel 2019 coronavirus. We find that pre-trained language models fine-tuned on this dataset outperform all other baselines and that BioBERT surpasses the others by a small margin with micro-F1 and accuracy scores of around 86% and 75% respectively on the test set. We evaluate the data efficiency and generalizability of these models as essential features of any system prepared to deal with an urgent situation like the current health crisis. Finally, we explore 50 errors made by the best performing models on LitCovid documents and find that they often (1) correlate certain labels too closely together and (2) fail to focus on discriminative sections of the articles; both of which are important issues to address in future work. Both data and code are available on GitHub. △ Less

Submitted 9 September, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

Comments: 8 pages, 9 figures

arXiv:2006.08546 [pdf, other]

Derivative couplings in gravitational production in the early universe

Authors: Daniel E. Borrajo Gutiérrez, Jose A. R. Cembranos, Luis J. Garay, Jose M. Sánchez Velázquez

Abstract: Gravitational particle production in the early universe is due to the coupling of matter fields to curvature. This coupling may include derivative terms that modify the kinetic term. The most general first order action contains derivative couplings to the curvature scalar and to the traceless Ricci tensor, which can be dominant in the case of (pseudo-)Nambu-Goldstone bosons or disformal scalars, s… ▽ More Gravitational particle production in the early universe is due to the coupling of matter fields to curvature. This coupling may include derivative terms that modify the kinetic term. The most general first order action contains derivative couplings to the curvature scalar and to the traceless Ricci tensor, which can be dominant in the case of (pseudo-)Nambu-Goldstone bosons or disformal scalars, such as branons. In the presence of these derivative couplings, the density of produced particles for the adiabatic regime in the de Sitter phase (which mimics inflation) is constant in time and decays with the inverse effective mass (which in turn depends on the coupling to the curvature scalar). In the reheating phase following inflation, the presence of derivative couplings to the background curvature modifies in a nontrivial way the gravitational production even in the perturbative regime. We also show that the two couplings -- to the curvature scalar and to the traceless Ricci tensor -- are drastically different, specially for large masses. In this regime, the production becomes highly sensitive to the former coupling while it becomes independent of the latter. △ Less

Submitted 15 June, 2020; originally announced June 2020.

Comments: 24 pages, 6 figures

arXiv:2005.00574 [pdf, other]

Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset

Authors: Xiang Yue, Bernal Jimenez Gutierrez, Huan Sun

Abstract: Machine reading comprehension has made great progress in recent years owing to large-scale annotated datasets. In the clinical domain, however, creating such datasets is quite difficult due to the domain expertise required for annotation. Recently, Pampari et al. (EMNLP'18) tackled this issue by using expert-annotated question templates and existing i2b2 annotations to create emrQA, the first larg… ▽ More Machine reading comprehension has made great progress in recent years owing to large-scale annotated datasets. In the clinical domain, however, creating such datasets is quite difficult due to the domain expertise required for annotation. Recently, Pampari et al. (EMNLP'18) tackled this issue by using expert-annotated question templates and existing i2b2 annotations to create emrQA, the first large-scale dataset for question answering (QA) based on clinical notes. In this paper, we provide an in-depth analysis of this dataset and the clinical reading comprehension (CliniRC) task. From our qualitative analysis, we find that (i) emrQA answers are often incomplete, and (ii) emrQA questions are often answerable without using domain knowledge. From our quantitative experiments, surprising results include that (iii) using a small sampled subset (5%-20%), we can obtain roughly equal performance compared to the model trained on the entire dataset, (iv) this performance is close to human expert's performance, and (v) BERT models do not beat the best performing base model. Following our analysis of the emrQA, we further explore two desired aspects of CliniRC systems: the ability to utilize clinical domain knowledge and to generalize to unseen questions and contexts. We argue that both should be considered when creating future datasets. △ Less

Submitted 1 May, 2020; originally announced May 2020.

Comments: Accepted by ACL 2020

arXiv:1912.01282 [pdf, other]

Collective motion of run-and-tumble particles drives aggregation in one-dimensional systems

Authors: C. Miguel Barriuso Gutierrez, Christian Vanhille Campos, Francisco Alarcon Oseguera, Ignacio Pagonabarraga, Ricardo Brito, Chantal Valeriani

Abstract: Active matter deals with systems whose particles consume energy at the individual level in order to move. To unravel features such as the emergence of collective structures several models have been suggested, such as the on-lattice model of run-and-tumble particles implemented via the Persistent Exclusion Process (PEP). In our work, we study a one dimensional system of run-and-tumble repulsive or… ▽ More Active matter deals with systems whose particles consume energy at the individual level in order to move. To unravel features such as the emergence of collective structures several models have been suggested, such as the on-lattice model of run-and-tumble particles implemented via the Persistent Exclusion Process (PEP). In our work, we study a one dimensional system of run-and-tumble repulsive or attractive particles, both on and off lattice. Additionally, we implement a cluster motility dynamics in the on-lattice case (since in the off-lattice case cluster motility arises from the individual particle dynamics). While we observe important differences between discrete and continuous dynamics, few common features are of particular importance. Increasing particle density drives aggregation across all different systems explored. For non-attractive particles, the effects of particle activity on aggregation are largely independent of the details of the dynamics. On the contrary, once attractive interactions are introduced, the steady state, which is completely determined by the interplay between these and the particles' activity, becomes highly dependent on the details of the dynamics. △ Less

Submitted 14 October, 2021; v1 submitted 3 December, 2019; originally announced December 2019.

arXiv:1812.06018 [pdf, other]

doi 10.23731/CYRM-2018-002

The Compact Linear Collider (CLIC) - 2018 Summary Report

Authors: The CLIC, CLICdp collaborations, :, T. K. Charles, P. J. Giansiracusa, T. G. Lucas, R. P. Rassool, M. Volpi, C. Balazs, K. Afanaciev, V. Makarenko, A. Patapenka, I. Zhuk, C. Collette, M. J. Boland, A. C. Abusleme Hoffman, M. A. Diaz, F. Garay, Y. Chi, X. He, G. Pei, S. Pei, G. Shu, X. Wang, J. Zhang , et al. (671 additional authors not shown)

Abstract: The Compact Linear Collider (CLIC) is a TeV-scale high-luminosity linear $e^+e^-$ collider under development at CERN. Following the CLIC conceptual design published in 2012, this report provides an overview of the CLIC project, its current status, and future developments. It presents the CLIC physics potential and reports on design, technology, and implementation aspects of the accelerator and the… ▽ More The Compact Linear Collider (CLIC) is a TeV-scale high-luminosity linear $e^+e^-$ collider under development at CERN. Following the CLIC conceptual design published in 2012, this report provides an overview of the CLIC project, its current status, and future developments. It presents the CLIC physics potential and reports on design, technology, and implementation aspects of the accelerator and the detector. CLIC is foreseen to be built and operated in stages, at centre-of-mass energies of 380 GeV, 1.5 TeV and 3 TeV, respectively. CLIC uses a two-beam acceleration scheme, in which 12 GHz accelerating structures are powered via a high-current drive beam. For the first stage, an alternative with X-band klystron powering is also considered. CLIC accelerator optimisation, technical developments and system tests have resulted in an increased energy efficiency (power around 170 MW) for the 380 GeV stage, together with a reduced cost estimate at the level of 6 billion CHF. The detector concept has been refined using improved software tools. Significant progress has been made on detector technology developments for the tracking and calorimetry systems. A wide range of CLIC physics studies has been conducted, both through full detector simulations and parametric studies, together providing a broad overview of the CLIC physics potential. Each of the three energy stages adds cornerstones of the full CLIC physics programme, such as Higgs width and couplings, top-quark properties, Higgs self-coupling, direct searches, and many precision electroweak measurements. The interpretation of the combined results gives crucial and accurate insight into new physics, largely complementary to LHC and HL-LHC. The construction of the first CLIC energy stage could start by 2026. First beams would be available by 2035, marking the beginning of a broad CLIC physics programme spanning 25-30 years. △ Less

Submitted 6 May, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

Comments: 112 pages, 59 figures; published as CERN Yellow Report Monograph Vol. 2/2018; corresponding editors: Philip N. Burrows, Nuria Catalan Lasheras, Lucie Linssen, Marko Petrič, Aidan Robson, Daniel Schulte, Eva Sicking, Steinar Stapnes

Report number: CERN-2018-005-M

arXiv:1811.11813 [pdf, other]

The SWAG Algorithm; a Mathematical Approach that Outperforms Traditional Deep Learning. Theory and Implementation

Authors: Saeid Safaei, Vahid Safaei, Solmazi Safaei, Zerotti Woods, Hamid R. Arabnia, Juan B. Gutierrez

Abstract: The performance of artificial neural networks (ANNs) is influenced by weight initialization, the nature of activation functions, and their architecture. There is a wide range of activation functions that are traditionally used to train a neural network, e.g. sigmoid, tanh, and Rectified Linear Unit (ReLU). A widespread practice is to use the same type of activation function in all neurons in a giv… ▽ More The performance of artificial neural networks (ANNs) is influenced by weight initialization, the nature of activation functions, and their architecture. There is a wide range of activation functions that are traditionally used to train a neural network, e.g. sigmoid, tanh, and Rectified Linear Unit (ReLU). A widespread practice is to use the same type of activation function in all neurons in a given layer. In this manuscript, we present a type of neural network in which the activation functions in every layer form a polynomial basis; we name this method SWAG after the initials of the last names of the authors. We tested SWAG on three complex highly non-linear functions as well as the MNIST handwriting data set. SWAG outperforms and converges faster than the state of the art performance in fully connected neural networks. Given the low computational complexity of SWAG, and the fact that it was capable of solving problems current architectures cannot, it has the potential to change the way that we approach deep learning. △ Less

Submitted 28 November, 2018; originally announced November 2018.

Comments: 20 pages, 16 figures

MSC Class: 68T05; 68Q32 ACM Class: I.2.6

arXiv:1811.09587 [pdf]

Matemáticas, Espacios Públicos e Integración Vecinal. El caso de Cuernavaca (México)

Authors: Igor Barahona, Lucía López de Medrano, Barbara Martínez Moreno, Beatríz Limón Gutiérrez

Abstract: We investigate the impact of mathematics on improving neighbourhood integration and perception of security. The main square of Chamilpa colony in Cuernavaca, México is take as study case. This city is featured by its precarious recreational and leisurial infrastructure. Chamilpa is among the top of social outcast levels in the city. Data was collected through a questionnaire applied among attendee… ▽ More We investigate the impact of mathematics on improving neighbourhood integration and perception of security. The main square of Chamilpa colony in Cuernavaca, México is take as study case. This city is featured by its precarious recreational and leisurial infrastructure. Chamilpa is among the top of social outcast levels in the city. Data was collected through a questionnaire applied among attendees of the ARTEMAT festival. Results provide empirical evidence for supporting that performing mathematics activities on public spaces, increases the perception of security and improve the social cohesion. △ Less

Submitted 28 November, 2018; v1 submitted 23 November, 2018; originally announced November 2018.

Comments: in Spanish. Proyecto ARTEMAT. Matemáticas para la paz

arXiv:1707.02919 [pdf, other]

A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques

Authors: Mehdi Allahyari, Seyedamin Pouriyeh, Mehdi Assefi, Saied Safaei, Elizabeth D. Trippe, Juan B. Gutierrez, Krys Kochut

Abstract: The amount of text that is generated every day is increasing dramatically. This tremendous volume of mostly unstructured text cannot be simply processed and perceived by computers. Therefore, efficient and effective techniques and algorithms are required to discover useful patterns. Text mining is the task of extracting meaningful information from text, which has gained significant attentions in r… ▽ More The amount of text that is generated every day is increasing dramatically. This tremendous volume of mostly unstructured text cannot be simply processed and perceived by computers. Therefore, efficient and effective techniques and algorithms are required to discover useful patterns. Text mining is the task of extracting meaningful information from text, which has gained significant attentions in recent years. In this paper, we describe several of the most fundamental text mining tasks and techniques including text pre-processing, classification and clustering. Additionally, we briefly explain text mining in biomedical and health care domains. △ Less

Submitted 28 July, 2017; v1 submitted 10 July, 2017; originally announced July 2017.

Comments: some of References format have updated

arXiv:1707.02268 [pdf, ps, other]

Text Summarization Techniques: A Brief Survey

Authors: Mehdi Allahyari, Seyedamin Pouriyeh, Mehdi Assefi, Saeid Safaei, Elizabeth D. Trippe, Juan B. Gutierrez, Krys Kochut

Abstract: In recent years, there has been a explosion in the amount of text data from a variety of sources. This volume of text is an invaluable source of information and knowledge which needs to be effectively summarized to be useful. In this review, the main approaches to automatic text summarization are described. We review the different processes for summarization and describe the effectiveness and shor… ▽ More In recent years, there has been a explosion in the amount of text data from a variety of sources. This volume of text is an invaluable source of information and knowledge which needs to be effectively summarized to be useful. In this review, the main approaches to automatic text summarization are described. We review the different processes for summarization and describe the effectiveness and shortcomings of the different methods. △ Less

Submitted 28 July, 2017; v1 submitted 7 July, 2017; originally announced July 2017.

Comments: Some of references format have updated

arXiv:1706.08836 [pdf, other]

Correlates of severity of disease in Macaca mulatta infected with Plasmodium cynomolgi

Authors: Yi H. Yan, Diego M. Moncada, Elizabeth D. Trippe, Juan B. Gutierrez

Abstract: Characterization of host responses associated with severe malaria through an integrative approach is necessary to understand the dynamics of a \textit{Plasmodium cynomolgi} infection. In this study, we conducted temporal immune profiling, cytokine profiling and transcriptomic analysis of five \textit{Macaca mulatta} infected with \textit{P. cynomolgi}. This experiment resulted in two severe infect… ▽ More Characterization of host responses associated with severe malaria through an integrative approach is necessary to understand the dynamics of a \textit{Plasmodium cynomolgi} infection. In this study, we conducted temporal immune profiling, cytokine profiling and transcriptomic analysis of five \textit{Macaca mulatta} infected with \textit{P. cynomolgi}. This experiment resulted in two severe infections, and two mild infections. Our analysis reveals that differential transcriptional up-regulation of genes linked with response to pathogen-associated molecular pattern (PAMP) and pro-inflammatory cytokines is characteristic of hosts experiencing severe malaria. Furthermore, our analysis discovered associations of transcriptional differential regulation unique to severe hosts with specific cellular and cytokine responses. The combined data provide a molecular and cellular basis for the development of severe malaria during \textit{P. cynomolgi} infection. △ Less

Submitted 29 June, 2017; v1 submitted 25 June, 2017; originally announced June 2017.

Comments: 10 pages, 8 figures

MSC Class: 92C42

arXiv:1706.08151 [pdf, other]

The lunar cycle's influence on sex determination at conception in humans

Authors: Derek Onken, Eric Marty, Roberto Palomares, Rui Xie, Leyao Zhang, Jonathan Arnold, Juan B. Gutierrez

Abstract: The lunar cycle has long been suspected to influence biological phenomena. Folklore alludes to such a relationship, but previous scientific analyses have failed to find significant associations. It has been shown that lunar cycles indeed have effects on animals; significant associations between human circadian rhythms and lunar cycles have also been reported. We set out to determine whether a sign… ▽ More The lunar cycle has long been suspected to influence biological phenomena. Folklore alludes to such a relationship, but previous scientific analyses have failed to find significant associations. It has been shown that lunar cycles indeed have effects on animals; significant associations between human circadian rhythms and lunar cycles have also been reported. We set out to determine whether a significant statistical correlation exists between the lunar phase and sex determination during conception. We found that significant associations (\textit{p}-value $< 5 \times 10^{-5}$) exist between the average sex ratio (male:female) and the lunar month. The likelihood of conception of a male is at its highest point five days after the full moon, whereas the highest likelihood of female conception occurs nineteen days after the full moon. Furthermore, we found that the strength of this influence is correlated with the amount of solar radiation (which is proportional to moonlight). Our results suggest that sex determination may be influenced by the moon cycle, which suggests the possibility of lunar influence on other biological phenomena. We suggest for future research the exploration of similar effects in other phenomena involving humans and other species. △ Less

Submitted 25 June, 2017; originally announced June 2017.

Comments: 12 pages, 12 figures

arXiv:1706.08139 [pdf, other]

Quantification of Healthy Red Blood Cell Removal and Preferential Invasion of Reticulocytes in Macaca mulatta during Plasmodium cynomolgi Infection

Authors: Yi H. Yan, Jacob B. Aguilar, Elizabeth D. Trippe, Juan B. Gutierrez

Abstract: We derived an ordinary differential equation model to capture the disease dynamics during blood-stage malaria. The model was directly derived from an earlier age-structured partial differential equation model. The original model was simplified due to experimental constraints. Here we calibrated the simplified model with experimental data using a multiple objective genetic algorithm. Through the ca… ▽ More We derived an ordinary differential equation model to capture the disease dynamics during blood-stage malaria. The model was directly derived from an earlier age-structured partial differential equation model. The original model was simplified due to experimental constraints. Here we calibrated the simplified model with experimental data using a multiple objective genetic algorithm. Through the calibration process, we quantified the removal of healthy red blood cells and the the preferential infection of reticulocytes during \textit{Plamodium cynomolgi} infection of \textit{Macaca mulatta}. The calibration of our model also revealed the existence of host erythropoietic response prior to blood stage infection. △ Less

Submitted 30 June, 2017; v1 submitted 25 June, 2017; originally announced June 2017.

Comments: 17 pages, 14 figures

MSC Class: 92B05

arXiv:1706.08131 [pdf, other]

Introducing Data Primitives: Data Formats for the SKED Framework

Authors: Elizabeth D. Trippe, Jacob B. Aguilar, Yi H. Yan, Mustafa V. Nural, Jessica A. Brady, Juan B. Gutierrez

Abstract: Background: The past few years have seen a tremendous increase in the size and complexity of datasets. Scientific and clinical studies must to incorporate datasets that cross multiple spatial and temporal scales to describe a particular phenomenon. The storage and accessibility of these heterogeneous datasets in a way that is useful to researchers and yet extensible to new data types is a major ch… ▽ More Background: The past few years have seen a tremendous increase in the size and complexity of datasets. Scientific and clinical studies must to incorporate datasets that cross multiple spatial and temporal scales to describe a particular phenomenon. The storage and accessibility of these heterogeneous datasets in a way that is useful to researchers and yet extensible to new data types is a major challenge. Methods: In order to overcome these obstacles, we propose the use of data primitives as a common currency between analytical methods. The four data primitives we have identified are time series, text, annotated graph and triangulated mesh, with associated metadata. Using only data primitives to store data and as algorithm input, output, and intermediate results, promotes interoperability, scalability, and reproducibility in scientific studies. Results: Data primitives were used in a multi-omic, multi-scale systems biology study of malaria infection in non-human primates to perform many types of integrative analysis quickly and efficiently. Conclusions: Using data primitives as a common currency for both data storage and for cross talk between analytical methods enables the analysis of complex multi-omic, multi-scale datasets in a reproducible modular fashion. △ Less

Submitted 25 June, 2017; originally announced June 2017.

Comments: 10 pages, 3 figures

arXiv:1706.07992 [pdf, other]

A Vision for Health Informatics: Introducing the SKED Framework.An Extensible Architecture for Scientific Knowledge Extraction from Data

Authors: Elizabeth D. Trippe, Jacob B. Aguilar, Yi H. Yan, Mustafa V. Nural, Jessica A. Brady, Mehdi Assefi, Saeid Safaei, Mehdi Allahyari, Seyedamin Pouriyeh, Mary R. Galinski, Jessica C. Kissinger, Juan B. Gutierrez

Abstract: The goals of the Triple Aim of health care and the goals of P4 medicine outline objectives that require a significant health informatics component. However, the goals do not provide specifications about how all of the new individual patient data will be combined in meaningful ways and with data from other sources, like epidemiological data, to promote the health of individuals and society. We seem… ▽ More The goals of the Triple Aim of health care and the goals of P4 medicine outline objectives that require a significant health informatics component. However, the goals do not provide specifications about how all of the new individual patient data will be combined in meaningful ways and with data from other sources, like epidemiological data, to promote the health of individuals and society. We seem to have more data than ever before but few resources and means to use it efficiently. We need a general, extensible solution that integrates and homogenizes data of disparate origin, incompatible formats, and multiple spatial and temporal scales. To address this problem, we introduce the Scientific Knowledge Extraction from Data (SKED) architecture, as a technology-agnostic framework to minimize the overhead of data integration, permit reuse of analytical pipelines, and guarantee reproducible quantitative results. The SKED architecture consists of a Resource Allocation Service to locate resources, and the definition of data primitives to simplify and harmonize data. SKED allows automated knowledge discovery and provides a platform for the realization of the major goals of modern health care. △ Less

Submitted 24 June, 2017; originally announced June 2017.

Comments: 8 pages, 4 figures

MSC Class: 68M99; 92-08 ACM Class: H.5.2; H.2.5; H.3.3; I.2.6

arXiv:1705.08111 [pdf, other]

A Multi-Armed Bandit to Smartly Select a Training Set from Big Medical Data

Authors: Benjamín Gutiérrez, Loïc Peter, Tassilo Klein, Christian Wachinger

Abstract: With the availability of big medical image data, the selection of an adequate training set is becoming more important to address the heterogeneity of different datasets. Simply including all the data does not only incur high processing costs but can even harm the prediction. We formulate the smart and efficient selection of a training dataset from big medical image data as a multi-armed bandit pro… ▽ More With the availability of big medical image data, the selection of an adequate training set is becoming more important to address the heterogeneity of different datasets. Simply including all the data does not only incur high processing costs but can even harm the prediction. We formulate the smart and efficient selection of a training dataset from big medical image data as a multi-armed bandit problem, solved by Thompson sampling. Our method assumes that image features are not available at the time of the selection of the samples, and therefore relies only on meta information associated with the images. Our strategy simultaneously exploits data sources with high chances of yielding useful samples and explores new data regions. For our evaluation, we focus on the application of estimating the age from a brain MRI. Our results on 7,250 subjects from 10 datasets show that our approach leads to higher accuracy while only requiring a fraction of the training data. △ Less

Submitted 29 May, 2017; v1 submitted 23 May, 2017; originally announced May 2017.

Comments: MICCAI 2017 Proceedings

arXiv:1703.06010 [pdf, other]

Considerations on Interdisciplinary Instruction and Design Influenced by Adaptive Learning. A Case Study Involving Biology, Computer Science, Mathematics, and Statistics

Authors: Karen Aguar, Charles C. Sanchez, Diego Boada Beltran, Saeid Safaei, Mehdi Asefi, Jonathan Arnold, Pedro Portes, Hamid R. Arabnia, Juan B. Gutierrez

Abstract: ALICE (Adaptive Learning for Interdisciplinary Collaborative Environments) is an open-source web based adaptive learning system designed for interdisciplinary instruction. ALICE has the potential to transform education by empowering transdisciplinary knowledge acquisition. This is particularly important in fields that accept newcomers with diverse scholastic backgrounds, e.g. Systems Biology. With… ▽ More ALICE (Adaptive Learning for Interdisciplinary Collaborative Environments) is an open-source web based adaptive learning system designed for interdisciplinary instruction. ALICE has the potential to transform education by empowering transdisciplinary knowledge acquisition. This is particularly important in fields that accept newcomers with diverse scholastic backgrounds, e.g. Systems Biology. With traditional interdisciplinary instruction, the instructor must cover pre-requisite information from multiple disciplines to ensure all students begin at a common baseline - slowing the learning process. With ALICE, students follow a personalized syllabus based on their previous knowledge and work towards individual goals. Implementing an adaptive learning system in an interdisciplinary course requires careful considerations of the instructional design. Structuring material, formulating assessments, and other instructional design aspects must be carefully considered. These considerations are detailed through the exploration of a case study implementing ALICE in a graduate level Systems Biology course. △ Less

Submitted 4 May, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

Comments: 17 pages, 4 figures. Version 2 of this document had a minor change in the title

MSC Class: 97A99; 97P99 ACM Class: K.3.1

arXiv:1612.08759 [pdf, other]

A Method for Massively Parallel Analysis of Time Series

Authors: Yi H. Yan, Elizabeth D. Trippe, Juan B. Gutierrez

Abstract: Quantification of system-wide perturbations from time series -omic data (i.e. a large number of variables with multiple measures in time) provides the basis for many downstream hypothesis generating tools. Here we propose a method, Massively Parallel Analysis of Time Series (MPATS) that can be applied to quantify transcriptome-wide perturbations. The proposed method characterizes each individual t… ▽ More Quantification of system-wide perturbations from time series -omic data (i.e. a large number of variables with multiple measures in time) provides the basis for many downstream hypothesis generating tools. Here we propose a method, Massively Parallel Analysis of Time Series (MPATS) that can be applied to quantify transcriptome-wide perturbations. The proposed method characterizes each individual time series through its $\ell_1$ distance to every other time series. Application of MPATS to compare biological conditions produces a ranked list of time series based on their magnitude of differences in their $\ell_1$ representation, which then can be further interpreted through enrichment analysis. The performance of MPATS was validated through its application to a study of IFN$α$ dendritic cell responses to viral and bacterial infection. In conjunction with Gene Set Enrichment Analysis (GSEA), MPATS produced consistently identified signature gene sets of anti-bacterial and anti-viral response. Traditional methods such as EDGE and GSEA Time Series (GSEA-TS) failed to identify the relevant signature gene sets. Furthermore, the results of MPATS highlighted the crucial functional difference between STAT1/STAT2 during anti-viral and anti-bacterial response. In our simulation study, MPATS exhibited acceptable performance with small group size (n = 3), when the appropriate effect size is considered. This method can be easily adopted for other -omic data types. △ Less

Submitted 27 December, 2016; originally announced December 2016.

Comments: 18 pages, 8 figures

MSC Class: 62P10

arXiv:1611.04668 [pdf, ps, other]

An Epidemiological Model of Malaria Accounting for Asymptomatic Carriers

Authors: Jacob B. Aguilar, Juan B. Gutierrez

Abstract: Asymptomatic individuals in the context of malarial disease refers to subjects who carry a parasite load but do not show clinical symptoms. A correct understanding of the influence of asymptomatic individuals on transmission dynamics will provide a comprehensive description of the complex interplay between the definitive host (female \textit{Anopheles} mosquito), intermediate host (human) and agen… ▽ More Asymptomatic individuals in the context of malarial disease refers to subjects who carry a parasite load but do not show clinical symptoms. A correct understanding of the influence of asymptomatic individuals on transmission dynamics will provide a comprehensive description of the complex interplay between the definitive host (female \textit{Anopheles} mosquito), intermediate host (human) and agent (\textit{Plasmodium} parasite). The goal of this article is to conduct a rigorous mathematical analysis of a new compartmentalized malaria model accounting for asymptomatic human hosts for the purpose of calculating the basic reproductive number ($\mathcal{R}_0$), and determining the bifurcations that might occur at the onset of disease free equilibrium. A point of departure of this model from others appearing in literature is that the asymptomatic compartment is decomposed into two mutually disjoint sub-compartments by making use of the naturally acquired immunity (NAI) of the population under consideration. After deriving the model, a qualitative analysis is carried out to classify the stability of the equilibria of the system. Our results show that the dynamical system is locally asymptotically stable provided that $\mathcal{R}_0<1$. However this stability is not global, owning to the occurrence of a sub-critical bifurcation in which additional non-trivial sub-threshold equilibrium solutions appear in response to a specified parameter being perturbed. To ensure that the model does not undergo a backward bifurcation, we demand that an auxiliary parameter denoted $Λ<1$ in addition to the threshold constraint $\mathcal{R}_0<1$. The authors hope that this qualitative analysis will fill in the gaps of what is currently known about asymptomatic malaria and aid in designing strategies that assist the further development of malaria control and eradication efforts. △ Less

Submitted 16 February, 2017; v1 submitted 14 November, 2016; originally announced November 2016.

arXiv:1610.03521 [pdf, other]

A Review of Mathematical Models for Muscular Dystrophy: A Systems Biology Approach

Authors: Amanda N. Cameron, Matthew T. Houston, Juan B. Gutierrez

Abstract: Muscular dystrophy (MD) describes generalized progressive muscular weakness due to the wasting of muscle fibers. The progression of the disease is affected by known immunological and mechanical factors, and possibly other unknown mechanisms. These dynamics have begun to be elucidated in the last two decades. This article reviews mathematical models of MD that characterize molecular and cellular co… ▽ More Muscular dystrophy (MD) describes generalized progressive muscular weakness due to the wasting of muscle fibers. The progression of the disease is affected by known immunological and mechanical factors, and possibly other unknown mechanisms. These dynamics have begun to be elucidated in the last two decades. This article reviews mathematical models of MD that characterize molecular and cellular components implicated in MD progression. A biological background for these processes is also presented. Molecular effectors that contribute to MD include mitochondrial bioenergetics and genetic factors; both drive cellular metabolism, communication and signaling. These molecular events leave cells vulnerable to mechanical stress which can activate an immunological cascade that weakens cells and surrounding tissues. This review article lays the foundation for a systems biology approach to study MD progression. △ Less

Submitted 28 October, 2016; v1 submitted 11 October, 2016; originally announced October 2016.

Comments: 23 pages, 2 figures

MSC Class: 92C42

arXiv:1608.07537 [pdf, other]

doi 10.5170/CERN-2016-004

Updated baseline for a staged Compact Linear Collider

Authors: The CLIC, CLICdp collaborations, :, M. J. Boland, U. Felzmann, P. J. Giansiracusa, T. G. Lucas, R. P. Rassool, C. Balazs, T. K. Charles, K. Afanaciev, I. Emeliantchik, A. Ignatenko, V. Makarenko, N. Shumeiko, A. Patapenka, I. Zhuk, A. C. Abusleme Hoffman, M. A. Diaz Gutierrez, M. Vogel Gonzalez, Y. Chi, X. He, G. Pei, S. Pei, G. Shu , et al. (493 additional authors not shown)

Abstract: The Compact Linear Collider (CLIC) is a multi-TeV high-luminosity linear e+e- collider under development. For an optimal exploitation of its physics potential, CLIC is foreseen to be built and operated in a staged approach with three centre-of-mass energy stages ranging from a few hundred GeV up to 3 TeV. The first stage will focus on precision Standard Model physics, in particular Higgs and top-q… ▽ More The Compact Linear Collider (CLIC) is a multi-TeV high-luminosity linear e+e- collider under development. For an optimal exploitation of its physics potential, CLIC is foreseen to be built and operated in a staged approach with three centre-of-mass energy stages ranging from a few hundred GeV up to 3 TeV. The first stage will focus on precision Standard Model physics, in particular Higgs and top-quark measurements. Subsequent stages will focus on measurements of rare Higgs processes, as well as searches for new physics processes and precision measurements of new states, e.g. states previously discovered at LHC or at CLIC itself. In the 2012 CLIC Conceptual Design Report, a fully optimised 3 TeV collider was presented, while the proposed lower energy stages were not studied to the same level of detail. This report presents an updated baseline staging scenario for CLIC. The scenario is the result of a comprehensive study addressing the performance, cost and power of the CLIC accelerator complex as a function of centre-of-mass energy and it targets optimal physics output based on the current physics landscape. The optimised staging scenario foresees three main centre-of-mass energy stages at 380 GeV, 1.5 TeV and 3 TeV for a full CLIC programme spanning 22 years. For the first stage, an alternative to the CLIC drive beam scheme is presented in which the main linac power is produced using X-band klystrons. △ Less

Submitted 27 March, 2017; v1 submitted 26 August, 2016; originally announced August 2016.

Comments: 57 pages, 27 figures, 12 tables, published as CERN Yellow Report. Updated version: Minor layout changes for print version

Report number: CERN-2016-004

arXiv:1607.07667 [pdf, ps, other]

Topological complexity of collision-free multi-tasking motion planning on orientable surfaces

Authors: Jesús González, Bárbara Gutiérrez

Abstract: We compute the higher topological complexity of ordered configuration spaces of orientable surfaces, thus extending Cohen-Farber's description of the ordinary topological complexity of those spaces. We compute the higher topological complexity of ordered configuration spaces of orientable surfaces, thus extending Cohen-Farber's description of the ordinary topological complexity of those spaces. △ Less

Submitted 26 July, 2016; originally announced July 2016.

Comments: 18 pages

MSC Class: 55M30; 55R80; 55T99; 68T40; 70B15

arXiv:1601.02996 [pdf, ps, other]

Pairwise disjoint maximal cliques in random graphs and sequential motion planning on random right angled Artin groups

Authors: Jesús González, Bárbara Gutiérrez, Hugo Mas

Abstract: The clique number of a random graph in the Erdos-Renyi model G(n,p) yields a random variable which is known to be asymptotically (as n tends to infinity) almost surely within one of an explicit logarithmic (on n) function r(n,p). We extend this fact by showing that random graphs have, asymptotically almost surely, arbitrarily many pairwise disjoint complete subgraphs with as many vertices as r(n,p… ▽ More The clique number of a random graph in the Erdos-Renyi model G(n,p) yields a random variable which is known to be asymptotically (as n tends to infinity) almost surely within one of an explicit logarithmic (on n) function r(n,p). We extend this fact by showing that random graphs have, asymptotically almost surely, arbitrarily many pairwise disjoint complete subgraphs with as many vertices as r(n,p). The result is motivated by and applied to the sequential motion planning problem on random right angled Artin groups. Indeed, we give an asymptotical description of all the higher topological complexities of Eilenberg-MacLane spaces associated to random graph groups. △ Less

Submitted 12 January, 2016; originally announced January 2016.

Comments: 15 pages

MSC Class: 05C80; 60C05; 05C25; 55M30; 20F36; 52B70; 55U10; 68T40

arXiv:1509.02898 [pdf, ps, other]

Motion planning in real flag manifolds

Authors: Jesús González, Barbara Gutiérrez, Darwin Gutiérrez, Adriana Lara

Abstract: Starting from Borel's description of the mod-2 cohomology of real flag manifolds, we give a minimal presentation of the cohomology ring for semi complete flag manifolds $F_{k,m}:=F(1,\ldots,1,m)$ where $1$ is repeated $k$ times. The information is used in order to estimate Farber's topological complexity of these spaces when $m$ approaches (from below) a 2-power. In particular, we get almost sharp… ▽ More Starting from Borel's description of the mod-2 cohomology of real flag manifolds, we give a minimal presentation of the cohomology ring for semi complete flag manifolds $F_{k,m}:=F(1,\ldots,1,m)$ where $1$ is repeated $k$ times. The information is used in order to estimate Farber's topological complexity of these spaces when $m$ approaches (from below) a 2-power. In particular, we get almost sharp estimates for $F_{2,2^e-1}$ which resemble the known situation for the real projective spaces $F_{1,2^e}$. Our results indicate that the agreement between the topological complexity and the immersion dimension of real projective spaces no longer holds for other flag manifolds. More interestingly, we also get corresponding results for the $s$-th (higher) topological complexity of these spaces. Actually, we prove the surprising fact that, as $s$ increases, the estimates become stronger. Indeed, we get several full computations of the higher motion planning problem of these manifolds. This property is also shown to hold for surfaces: we get a complete computation of the higher topological complexity of all closed surfaces (orientable or not). A homotopy-obstruction explanation is included for the phenomenon of having a cohomologically accessible higher topological complexity even when the regular topological complexity is not so accessible. △ Less

Submitted 18 November, 2015; v1 submitted 9 September, 2015; originally announced September 2015.

Comments: This is a much expanded second version of the paper. The main results have been extended and sharpened by considering the $s$-th higher topological complexity of semi complete real flag manifolds. The methods and new results depend heavily on techniques from computational topology. 20 pages. Two authors have joined this version of the paper. Final version of this paper

MSC Class: 57T15; 55M30; 68T40; 70B15

arXiv:1501.07474 [pdf, ps, other]

The higher topological complexity of subcomplexes of products of spheres---and related polyhedral product spaces

Authors: Jesús González, Bárbara Gutiérrez, Sergey Yuzvinsky

Abstract: We construct "higher" motion planners for automated systems whose space of states are homotopy equivalent to a polyhedral product space $Z(K,\{(S^{k_i},\star)\})$, e.g. robot arms with restrictions on the possible combinations of simultaneously moving nodes. Our construction is shown to be optimal by explicit cohomology calculations. The higher topological complexity of other families of polyhedra… ▽ More We construct "higher" motion planners for automated systems whose space of states are homotopy equivalent to a polyhedral product space $Z(K,\{(S^{k_i},\star)\})$, e.g. robot arms with restrictions on the possible combinations of simultaneously moving nodes. Our construction is shown to be optimal by explicit cohomology calculations. The higher topological complexity of other families of polyhedral product spaces is also determined. △ Less

Submitted 25 March, 2015; v1 submitted 29 January, 2015; originally announced January 2015.

Comments: 30 pages. This second version of the paper extends the results of the first version to the case of polyhedral product spaces $Z(K,\{(S^{k_i},\star)\})$ where no restriction is assumed on the sphere dimensions $k_i$

MSC Class: 55M30; 20F36; 52B70; 52C35; 55U10; 68T40

arXiv:1404.4333 [pdf, ps, other]

On the Riemann Hypothesis and its generalizations

Authors: Daniel E. Borrajo Gutiérrez

Abstract: A proof for the original Riemann hypothesis is proposed based on the infinite Hadamard product representation for the Riemann zeta function and later generalized to Dirichlet L-functions. The extension of the hypothesis to other functions is also discussed. A proof for the original Riemann hypothesis is proposed based on the infinite Hadamard product representation for the Riemann zeta function and later generalized to Dirichlet L-functions. The extension of the hypothesis to other functions is also discussed. △ Less

Submitted 28 April, 2014; v1 submitted 15 April, 2014; originally announced April 2014.

Comments: 8 pages, 0 figures

MSC Class: 11M26

Showing 1–43 of 43 results for author: Gutiérrez, B