-
Gate-based Quantum Computing for Protein Design
Authors:
Mohammad Hassan Khatami,
Udson C. Mendes,
Nathan Wiebe,
Philip M. Kim
Abstract:
Protein design is a technique to engineer proteins by modifying their sequence to obtain novel functionalities. In this method, amino acids in the sequence are permuted to find the low-energy states satisfying the configuration. However, exploring all possible combinations of amino acids is generally infeasible on conventional computers because the number of possibilities grows exponentially with the number of designable sites. Thus, sampling methods are currently the conventional approach to protein design problems. Recently, quantum computation methods have shown the potential to solve similar types of problems. In the present work, we use the general idea of Grover's algorithm, a pure quantum computation method, to design circuits at the gate-based level and address the protein design problem. In our quantum algorithms, we use custom pair-wise energy tables consisting of eight different amino acids. The distance reciprocals between designable sites are also included when calculating energies in the circuits. Due to the noisy state of current quantum computers, we mainly use quantum computer simulators for this study. However, a very simple version of our circuits is implemented on real quantum devices to examine their capability to run these algorithms. Our results show that using $\mathcal{O}(\sqrt N)$ iterations, the circuits find the correct results among all $N$ possibilities, providing the expected quadratic speedup of Grover's algorithm over classical methods.
Submitted 22 November, 2022; v1 submitted 28 January, 2022;
originally announced January 2022.
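The $\mathcal{O}(\sqrt N)$ iteration count quoted in the abstract can be illustrated with a small classical simulation of Grover's amplitude amplification. This is only a sketch and not the authors' circuits: the function name `grover_search`, the search-space size of 512 (eight amino acids at three hypothetical designable sites, 8^3), and the marked index 137 are assumptions made for illustration, and the oracle simply flips the sign of one known index instead of evaluating an energy table.

```python
import math

def grover_search(n_items, marked):
    """Simulate Grover's algorithm classically on a real-valued state vector.

    n_items and marked are illustrative inputs, not values from the paper.
    """
    # Start in the uniform superposition over all candidates.
    amp = [1.0 / math.sqrt(n_items)] * n_items
    # Standard iteration count for one marked item: floor(pi/4 * sqrt(N)).
    iterations = math.floor(math.pi / 4 * math.sqrt(n_items))
    for _ in range(iterations):
        # Oracle: flip the sign of the marked amplitude.
        amp[marked] = -amp[marked]
        # Diffusion: reflect every amplitude about the mean.
        mean = sum(amp) / n_items
        amp = [2 * mean - a for a in amp]
    probs = [a * a for a in amp]
    return iterations, probs

# Hypothetical search space: 8 amino acids at 3 sites gives 8**3 = 512 sequences.
iterations, probs = grover_search(512, marked=137)
best = max(range(len(probs)), key=probs.__getitem__)
print(iterations, best, round(probs[best], 4))
```

With 512 candidates the loop runs 17 times, compared with the roughly 256 evaluations an unstructured classical search needs on average, which is the quadratic speedup the abstract refers to.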
-
Copy Number Variants and Segmental Duplications Show Different Formation Signatures
Authors:
Philip M. Kim,
Jan O. Korbel,
Xueying Chen,
Mark B. Gerstein
Abstract:
In addition to variation in terms of single nucleotide polymorphisms (SNPs), whole regions ranging from several kilobases up to a megabase in length differ in copy number among individuals. These differences are referred to as Copy Number Variants (CNVs), and extensive mapping of these is underway. Recent studies have highlighted their great prevalence in the human genome. Segmental Duplications (SDs) are long (>1kb) stretches of duplicated DNA with high sequence identity. First, we analyze the co-localization of SDs and find that SDs are significantly co-localized with each other, resulting in a power-law distribution, which suggests a preferential attachment mechanism, i.e. existing SDs are likely to be involved in creating new ones nearby. Second, we look at the relationship of CNVs/SDs with various types of repeats. We find that the previously recognized association of SDs with Alu elements is significantly stronger for older SDs and decreases sharply for younger ones. While it might be expected that the patterns should be similar for SDs and CNVs, we find, surprisingly, no association of CNVs with Alu elements. This trend is consistent with the decreasing correlation between Alu elements and younger SDs: the activity of Alu elements has been decreasing, and by now they seem to be no longer active. Furthermore, we find a striking association of SDs with processed pseudogenes, suggesting that they may also have mediated SD formation. Moreover, we find a strong association with microsatellites for both SDs and CNVs, which suggests a role for satellites in the formation of both.
Submitted 26 September, 2007;
originally announced September 2007.
-
Comparing Classical Pathways and Modern Networks: Towards the Development of an Edge Ontology
Authors:
Long J. Lu,
Andrea Sboner,
Yuanpeng J. Huang,
Hao Xin Lu,
Tara A. Gianoulis,
Kevin Y. Yip,
Philip M. Kim,
Gaetano T. Montelione,
Mark B. Gerstein
Abstract:
Pathways are integral to systems biology. Their classical representation has proven useful but is inconsistent in the meaning assigned to each arrow (or edge) and inadvertently implies the isolation of one pathway from another. Conversely, modern high-throughput experiments give rise to standardized networks facilitating topological calculations. Combining these perspectives, we can embed classical pathways within large-scale networks and thus demonstrate the crosstalk between them. As more diverse types of high-throughput data become available, we can effectively merge both perspectives, embedding pathways simultaneously in multiple networks. However, the original problem remains: the current edge representation is inadequate to accurately convey all the information in pathways. Therefore, we suggest that a standardized, well-defined edge ontology is necessary and propose a prototype here as a starting point for reaching this goal.
Submitted 1 June, 2007;
originally announced June 2007.