Showing 1–17 of 17 results for author: Rabadan, R

Searching in archive q-bio.
  1. arXiv:2106.07292  [pdf, other]

    q-bio.PE cs.CG q-bio.GN q-bio.QM

    Topological data analysis identifies emerging adaptive mutations in SARS-CoV-2

    Authors: Michael Bleher, Lukas Hahn, Maximilian Neumann, Juan Angel Patino-Galindo, Mathieu Carriere, Ulrich Bauer, Raul Rabadan, Andreas Ott

    Abstract: The COVID-19 pandemic has initiated an unprecedented worldwide effort to characterize its evolution through the mapping of mutations of the coronavirus SARS-CoV-2. The early identification of mutations that could confer adaptive advantages to the virus, such as higher infectivity or immune evasion, is of paramount importance. However, the large number of currently available genomes precludes the e…

    Submitted 25 August, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: Major revisions; new analyses added

    MSC Class: 62R40; 55N31; 68U05; 68T09; 92-08; 92C60; 92D15

  2. arXiv:2001.01666  [pdf, other]

    stat.ML cs.LG q-bio.GN

    MREC: a fast and versatile framework for aligning and matching point clouds with applications to single cell molecular data

    Authors: Andrew J. Blumberg, Mathieu Carriere, Michael A. Mandell, Raul Rabadan, Soledad Villar

    Abstract: Comparing and aligning large datasets is a pervasive problem occurring across many different knowledge domains. We introduce and study MREC, a recursive decomposition algorithm for computing matchings between data sets. The basic idea is to partition the data, match the partitions, and then recursively match the points within each pair of identified partitions. The matching itself is done using bl…

    Submitted 20 February, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

  3. arXiv:1812.01360  [pdf, other]

    cs.CG math.AT q-bio.GN

    Topological Data Analysis of Single-cell Hi-C Contact Maps

    Authors: Mathieu Carriere, Raul Rabadan

    Abstract: In this article, we show how the recent statistical techniques developed in Topological Data Analysis for the Mapper algorithm can be extended and leveraged to formally define and statistically quantify the presence of topological structures coming from biological phenomena in datasets of CCC contact maps.

    Submitted 4 December, 2018; originally announced December 2018.

  4. arXiv:1810.03602  [pdf]

    q-bio.QM cond-mat.stat-mech math.PR physics.app-ph physics.data-an

    Quasi-universality in single-cell sequencing data

    Authors: Luis Aparicio, Mykola Bordyuh, Andrew J. Blumberg, Raul Rabadan

    Abstract: The development of single-cell technologies provides the opportunity to identify new cellular states and reconstruct novel cell-to-cell relationships. Applications range from understanding the transcriptional and epigenetic processes involved in metazoan development to characterizing distinct cell types in heterogeneous populations like cancers or immune cells. However, analysis of the data is im…

    Submitted 5 October, 2018; originally announced October 2018.

    Comments: Main text has 18 pages and 5 figures. Supplementary material (methods) has 21 pages and 16 figures

  5. arXiv:1804.01398  [pdf, other]

    math.AT cs.CG q-bio.PE q-bio.QM

    Quantifying Genetic Innovation: Mathematical Foundations for the Topological Study of Reticulate Evolution

    Authors: Michael Lesnick, Raúl Rabadán, Daniel I. S. Rosenbloom

    Abstract: A topological approach to the study of genetic recombination, based on persistent homology, was introduced by Chan, Carlsson, and Rabadán in 2013. This associates a sequence of signatures called barcodes to genomic data sampled from an evolutionary history. In this paper, we develop theoretical foundations for this approach. First, we present a novel formulation of the underlying inference problem…

    Submitted 16 January, 2020; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: Expository improvements and minor corrections. To appear in the SIAM Journal on Applied Algebra and Geometry. 47 pages

  6. arXiv:1705.06823  [pdf, other]

    q-bio.QM

    Fast and Accurate Semi-Automatic Segmentation Tool for Brain Tumor MRIs

    Authors: Andrew X. Chen, Raúl Rabadán

    Abstract: Segmentation, the process of delineating tumor apart from healthy tissue, is a vital part of both the clinical assessment and the quantitative analysis of brain cancers. Here, we provide an open-source algorithm (MITKats), built on the Medical Imaging Interaction Toolkit, to provide user-friendly and expedient tools for semi-automatic segmentation. To evaluate its performance against competing alg…

    Submitted 18 May, 2017; originally announced May 2017.

  7. arXiv:1611.03890  [pdf, other]

    physics.soc-ph cs.SI physics.data-an q-bio.QM

    A Theory of Taxonomy

    Authors: Guido D'Amico, Raul Rabadan, Matthew Kleban

    Abstract: A taxonomy is a standardized framework to classify and organize items into categories. Hierarchical taxonomies are ubiquitous, ranging from the classification of organisms to the file system on a computer. Characterizing the typical distribution of items within taxonomic categories is an important question with applications in many disciplines. Ecologists have long sought to account for the patter…

    Submitted 4 November, 2016; originally announced November 2016.

    Comments: 7+13 pages, 5 figures. Comments welcome

  8. arXiv:1607.07503  [pdf, other]

    q-bio.GN q-bio.QM

    Genomic data analysis in tree spaces

    Authors: Sakellarios Zairis, Hossein Khiabanian, Andrew J. Blumberg, Raul Rabadan

    Abstract: Recently, an elegant approach in phylogenetics was introduced by Billera-Holmes-Vogtmann that allows a systematic comparison of different evolutionary histories using the metric geometry of tree spaces. In many problem settings one encounters heavily populated phylogenetic trees, where the large number of leaves encumbers visualization and analysis in the relevant evolutionary moduli spaces. To ad…

    Submitted 25 July, 2016; originally announced July 2016.

  9. arXiv:1511.01429  [pdf, other]

    q-bio.QM

    Quantifying Reticulation in Phylogenetic Complexes Using Homology

    Authors: Kevin Emmett, Raul Rabadan

    Abstract: Reticulate evolutionary processes result in phylogenetic histories that cannot be modeled using a tree topology. Here, we apply methods from topological data analysis to molecular sequence data with reticulations. Using a simple example, we demonstrate the correspondence between nontrivial higher homology and reticulate evolution. We discuss the sensitivity of the standard filtration and show case…

    Submitted 4 November, 2015; originally announced November 2015.

    Comments: 4 pages, 8 figures. Accepted for presentation at BICT 2015 Special Track on Topology-driven bio-inspired methods and models for complex systems (TOPDRIM4bio)

    ACM Class: J.3; G.2.2

  10. arXiv:1511.01426  [pdf, other]

    q-bio.GN q-bio.BM

    Multiscale Topology of Chromatin Folding

    Authors: Kevin Emmett, Benjamin Schweinhart, Raul Rabadan

    Abstract: The three dimensional structure of DNA in the nucleus (chromatin) plays an important role in many cellular processes. Recent experimental advances have led to high-throughput methods of capturing information about chromatin conformation on genome-wide scales. New models are needed to quantitatively interpret this data at a global scale. Here we introduce the use of tools from topological data anal…

    Submitted 4 November, 2015; originally announced November 2015.

    Comments: 4 pages, 7 figures. Accepted for presentation at BICT 2015 Special Track on Topology-driven bio-inspired methods and models for complex systems (TOPDRIM4bio)

    ACM Class: J.3

  11. arXiv:1505.05815  [pdf, other]

    q-bio.QM math.AT q-bio.PE

    Inference of Ancestral Recombination Graphs through Topological Data Analysis

    Authors: Pablo G. Camara, Arnold J. Levine, Raul Rabadan

    Abstract: The recent explosion of genomic data has underscored the need for interpretable and comprehensive analyses that can capture complex phylogenetic relationships within and across species. Recombination, reassortment and horizontal gene transfer constitute examples of pervasive biological phenomena that cannot be captured by tree-like representations. Starting from hundreds of genomes, we are interes…

    Submitted 26 July, 2016; v1 submitted 21 May, 2015; originally announced May 2015.

    Comments: 33 pages, 12 figures. The accompanying software, instructions and example files used in the manuscript can be obtained from https://github.com/RabadanLab/TARGet

  12. arXiv:1410.0980  [pdf, other]

    q-bio.QM q-bio.GN

    Moduli Spaces of Phylogenetic Trees Describing Tumor Evolutionary Patterns

    Authors: Sakellarios Zairis, Hossein Khiabanian, Andrew J. Blumberg, Raul Rabadan

    Abstract: Cancers follow a clonal Darwinian evolution, with fitter subclones replacing more quiescent cells, ultimately giving rise to macroscopic disease. High-throughput genomics provides the opportunity to investigate these processes and determine specific genetic alterations driving disease progression. Genomic sampling of a patient's cancer provides a molecular history, represented by a phylogenetic tr…

    Submitted 3 October, 2014; originally announced October 2014.

    Journal ref: Lecture Notes in Computer Science, 8609:528-539, 2014

  13. arXiv:1406.4582  [pdf, other]

    q-bio.QM

    Parametric Inference using Persistence Diagrams: A Case Study in Population Genetics

    Authors: Kevin Emmett, Daniel Rosenbloom, Pablo Camara, Raul Rabadan

    Abstract: Persistent homology computes topological invariants from point cloud data. Recent work has focused on developing statistical methods for data analysis in this framework. We show that, in certain models, parametric inference can be performed using statistics defined on the computed invariants. We develop this idea with a model from population genetics, the coalescent with recombination. We apply ou…

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: 5 pages, 4 figures. Prepared for the ICML 2014 Workshop on Topological Methods in Machine Learning

  14. arXiv:1406.1219  [pdf, other]

    q-bio.PE q-bio.GN

    Characterizing Scales of Genetic Recombination and Antibiotic Resistance in Pathogenic Bacteria Using Topological Data Analysis

    Authors: Kevin J. Emmett, Raul Rabadan

    Abstract: Pathogenic bacteria present a large disease burden on human health. Control of these pathogens is hampered by rampant lateral gene transfer, whereby pathogenic strains may acquire genes conferring resistance to common antibiotics. Here we introduce tools from topological data analysis to characterize the frequency and scale of lateral gene transfer in bacteria, focusing on a set of pathogens of si…

    Submitted 4 June, 2014; originally announced June 2014.

    Comments: 12 pages, 6 figures. To appear in AMT 2014 Special Session on Advanced Methods of Interactive Data Mining for Personalized Medicine

  15. Identifying Hosts of Families of Viruses: A Machine Learning Approach

    Authors: Anil Raj, Michael Dewar, Gustavo Palacios, Raul Rabadan, Chris H. Wiggins

    Abstract: Identifying viral pathogens and characterizing their transmission is essential to developing effective public health measures in response to a pandemic. Phylogenetics, though currently the most popular tool used to characterize the likely host of a virus, can be ambiguous when studying species very distant to known species and when there is very little reliable sequence information available in th…

    Submitted 29 May, 2011; originally announced May 2011.

    Comments: 11 pages, 7 figures, 1 table

  16. arXiv:1104.4568  [pdf]

    q-bio.PE q-bio.GN

    Understanding the Origins of a Pandemic Virus

    Authors: Carlos Xavier Hernandez, Joseph Chan, Hossein Khiabanian, Raul Rabadan

    Abstract: Understanding the origin of infectious diseases provides scientifically based rationales for implementing public health measures that may help to avoid or mitigate future epidemics. The recent ancestors of a pandemic virus provide invaluable information about the set of minimal genomic alterations that transformed a zoonotic agent into a full human pandemic. Since the first confirmed cases of the…

    Submitted 23 April, 2011; originally announced April 2011.

    Comments: 13 pages, 2 figures

    Report number: ac:129992

  17. arXiv:1010.4328  [pdf, other]

    q-bio.QM math.PR stat.AP

    Fractal-like Distributions over the Rational Numbers in High-throughput Biological and Clinical Data

    Authors: Vladimir Trifonov, Laura Pasqualucci, Riccardo Dalla-Favera, Raul Rabadan

    Abstract: Recent developments in extracting and processing biological and clinical data are allowing quantitative approaches to studying living systems. High-throughput sequencing, expression profiles, proteomics, and electronic health records are some examples of such technologies. Extracting meaningful information from those technologies requires careful analysis of the large volumes of data they produce.…

    Submitted 20 October, 2010; originally announced October 2010.

    Comments: 16 pages, 4 figures