-
Towards quantum computing for clinical trial design and optimization: A perspective on new opportunities and challenges
Authors:
Hakan Doga,
M. Emre Sahin,
Joao Bettencourt-Silva,
Anh Pham,
Eunyoung Kim,
Alan Andress,
Sudhir Saxena,
Aritra Bose,
Laxmi Parida,
Jan Lukas Robertus,
Hideaki Kawaguchi,
Radwa Soliman,
Daniel Blankenberg
Abstract:
Clinical trials are pivotal in the drug discovery process to determine the safety and efficacy of a drug candidate. The high failure rates of these trials are attributed to deficiencies in clinical model development and protocol design. Improvements in the clinical drug design process could therefore yield significant benefits for all stakeholders involved. This paper examines the current challeng…
▽ More
Clinical trials are pivotal in the drug discovery process to determine the safety and efficacy of a drug candidate. The high failure rates of these trials are attributed to deficiencies in clinical model development and protocol design. Improvements in the clinical drug design process could therefore yield significant benefits for all stakeholders involved. This paper examines the current challenges faced in clinical trial design and optimization, reviews established classical computational approaches, and introduces quantum algorithms aimed at enhancing these processes. Specifically, the focus is on three critical aspects: clinical trial simulations, site selection, and cohort identification. This study aims to provide a comprehensive framework that leverages quantum computing to innovate and refine the efficiency and effectiveness of clinical trials.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Quantum Analog of Shannon's Lower Bound Theorem
Authors:
Saugata Basu,
Laxmi Parida
Abstract:
Shannon proved that almost all Boolean functions require a circuit of size $Θ(2^n/n)$. We prove a quantum analog of this classical result. Unlike in the classical case the number of quantum circuits of any fixed size that we allow is uncountably infinite. Our main tool is a classical result in real algebraic geometry bounding the number of realizable sign conditions of any finite set of real polyn…
▽ More
Shannon proved that almost all Boolean functions require a circuit of size $Θ(2^n/n)$. We prove a quantum analog of this classical result. Unlike in the classical case the number of quantum circuits of any fixed size that we allow is uncountably infinite. Our main tool is a classical result in real algebraic geometry bounding the number of realizable sign conditions of any finite set of real polynomials in many variables.
△ Less
Submitted 24 August, 2023;
originally announced August 2023.
-
Sequents, barcodes, and homology
Authors:
Saugata Basu,
Negin Karisani,
Laxmi Parida
Abstract:
We consider the problem of generating hypothesis from data based on ideas from logic. We introduce a notion of barcodes, which we call sequent barcodes, that mirrors the barcodes in persistent homology theory in topological data analysis. We prove a theoretical result on the stability of these barcodes in analogy with similar results in persistent homology theory. Additionally we show that our new…
▽ More
We consider the problem of generating hypothesis from data based on ideas from logic. We introduce a notion of barcodes, which we call sequent barcodes, that mirrors the barcodes in persistent homology theory in topological data analysis. We prove a theoretical result on the stability of these barcodes in analogy with similar results in persistent homology theory. Additionally we show that our new notion of barcodes can be interpreted in terms of a persistent homology of a particular filtration of topological spaces induced by the data. Finally, we discuss a concrete application of the sequent barcodes in a discovery problem arising from the area of cancer genomics.
△ Less
Submitted 2 August, 2022;
originally announced August 2022.
-
Inferring COVID-19 Biological Pathways from Clinical Phenotypes via Topological Analysis
Authors:
Negin Karisani,
Daniel E. Platt,
Saugata Basu,
Laxmi Parida
Abstract:
COVID-19 has caused thousands of deaths around the world and also resulted in a large international economic disruption. Identifying the pathways associated with this illness can help medical researchers to better understand the properties of the condition. This process can be carried out by analyzing the medical records. It is crucial to develop tools and models that can aid researchers with this…
▽ More
COVID-19 has caused thousands of deaths around the world and also resulted in a large international economic disruption. Identifying the pathways associated with this illness can help medical researchers to better understand the properties of the condition. This process can be carried out by analyzing the medical records. It is crucial to develop tools and models that can aid researchers with this process in a timely manner. However, medical records are often unstructured clinical notes, and this poses significant challenges to develo** the automated systems. In this article, we propose a pipeline to aid practitioners in analyzing clinical notes and revealing the pathways associated with this disease. Our pipeline relies on topological properties and consists of three steps: 1) pre-processing the clinical notes to extract the salient concepts, 2) constructing a feature space of the patients to characterize the extracted concepts, and finally, 3) leveraging the topological properties to distill the available knowledge and visualize the result. Our experiments on a publicly available dataset of COVID-19 clinical notes testify that our pipeline can indeed extract meaningful pathways.
△ Less
Submitted 1 May, 2022; v1 submitted 18 January, 2021;
originally announced January 2021.
-
Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data
Authors:
Ling**g Jiang,
Niina Haiminen,
Anna-Paola Carrieri,
Shi Huang,
Yoshiki Vazquez-Baeza,
Laxmi Parida,
Ho-Cheol Kim,
Austin D. Swafford,
Rob Knight,
Loki Natarajan
Abstract:
Feature selection is indispensable in microbiome data analysis, but it can be particularly challenging as microbiome data sets are high-dimensional, underdetermined, sparse and compositional. Great efforts have recently been made on develo** new methods for feature selection that handle the above data characteristics, but almost all methods were evaluated based on performance of model prediction…
▽ More
Feature selection is indispensable in microbiome data analysis, but it can be particularly challenging as microbiome data sets are high-dimensional, underdetermined, sparse and compositional. Great efforts have recently been made on develo** new methods for feature selection that handle the above data characteristics, but almost all methods were evaluated based on performance of model predictions. However, little attention has been paid to address a fundamental question: how appropriate are those evaluation criteria? Most feature selection methods often control the model fit, but the ability to identify meaningful subsets of features cannot be evaluated simply based on the prediction accuracy. If tiny changes to the training data would lead to large changes in the chosen feature subset, then many of the biological features that an algorithm has found are likely to be a data artifact rather than real biological signal. This crucial need of identifying relevant and reproducible features motivated the reproducibility evaluation criterion such as Stability, which quantifies how robust a method is to perturbations in the data. In our paper, we compare the performance of popular model prediction metric MSE and proposed reproducibility criterion Stability in evaluating four widely used feature selection methods in both simulations and experimental microbiome applications. We conclude that Stability is a preferred feature selection criterion over MSE because it better quantifies the reproducibility of the feature selection method.
△ Less
Submitted 30 November, 2020;
originally announced December 2020.
-
IPED2: Inheritance Path based Pedigree Reconstruction Algorithm for Complicated Pedigrees
Authors:
Dan He,
Zhanyong Wang,
Laxmi Parida,
Eleazar Eskin
Abstract:
Reconstruction of family trees, or pedigree reconstruction, for a group of individuals is a fundamental problem in genetics. The problem is known to be NP-hard even for datasets known to only contain siblings. Some recent methods have been developed to accurately and efficiently reconstruct pedigrees. These methods, however, still consider relatively simple pedigrees, for example, they are not abl…
▽ More
Reconstruction of family trees, or pedigree reconstruction, for a group of individuals is a fundamental problem in genetics. The problem is known to be NP-hard even for datasets known to only contain siblings. Some recent methods have been developed to accurately and efficiently reconstruct pedigrees. These methods, however, still consider relatively simple pedigrees, for example, they are not able to handle half-sibling situations where a pair of individuals only share one parent. In this work, we propose an efficient method, IPED2, based on our previous work, which specifically targets reconstruction of complicated pedigrees that include half-siblings. We note that the presence of half-siblings makes the reconstruction problem significantly more challenging which is why previous methods exclude the possibility of half-siblings. We proposed a novel model as well as an efficient graph algorithm and experiments show that our algorithm achieves relatively accurate reconstruction. To our knowledge, this is the first method that is able to handle pedigree reconstruction based on genotype data only when half-sibling exists in any generation of the pedigree.
△ Less
Submitted 23 August, 2014;
originally announced August 2014.
-
MINT: Mutual Information based Transductive Feature Selection for Genetic Trait Prediction
Authors:
Dan He,
Irina Rish,
David Haws,
Simon Teyssedre,
Zivan Karaman,
Laxmi Parida
Abstract:
Whole genome prediction of complex phenotypic traits using high-density genoty** arrays has attracted a great deal of attention, as it is relevant to the fields of plant and animal breeding and genetic epidemiology. As the number of genotypes is generally much bigger than the number of samples, predictive models suffer from the curse-of-dimensionality. The curse-of-dimensionality problem not onl…
▽ More
Whole genome prediction of complex phenotypic traits using high-density genoty** arrays has attracted a great deal of attention, as it is relevant to the fields of plant and animal breeding and genetic epidemiology. As the number of genotypes is generally much bigger than the number of samples, predictive models suffer from the curse-of-dimensionality. The curse-of-dimensionality problem not only affects the computational efficiency of a particular genomic selection method, but can also lead to poor performance, mainly due to correlation among markers. In this work we proposed the first transductive feature selection method based on the MRMR (Max-Relevance and Min-Redundancy) criterion which we call MINT. We applied MINT on genetic trait prediction problems and showed that in general MINT is a better feature selection method than the state-of-the-art inductive method mRMR.
△ Less
Submitted 6 October, 2013;
originally announced October 2013.
-
Spectral Sequences, Exact Couples and Persistent Homology of filtrations
Authors:
Saugata Basu,
Laxmi Parida
Abstract:
In this paper we study the relationship between a very classical algebraic object associated to a filtration of spaces, namely a spectral sequence introduced by Leray in the 1940's, and a more recently invented object that has found many applications -- namely, its persistent homology groups. We show the existence of a long exact sequence of groups linking these two objects and using it derive for…
▽ More
In this paper we study the relationship between a very classical algebraic object associated to a filtration of spaces, namely a spectral sequence introduced by Leray in the 1940's, and a more recently invented object that has found many applications -- namely, its persistent homology groups. We show the existence of a long exact sequence of groups linking these two objects and using it derive formulas expressing the dimensions of each individual groups of one object in terms of the dimensions of the groups in the other object. The main tool used to mediate between these objects is the notion of exact couples first introduced by Massey in 1952.
△ Less
Submitted 17 December, 2018; v1 submitted 4 August, 2013;
originally announced August 2013.