Skip to main content

Showing 1–21 of 21 results for author: Yang, K

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2406.09841  [pdf, other

    cs.LG q-bio.BM

    Learning Multi-view Molecular Representations with Structured and Unstructured Knowledge

    Authors: Yizhen Luo, Kai Yang, Massimo Hong, Xing Yi Liu, Zikun Nie, Hao Zhou, Zaiqing Nie

    Abstract: Capturing molecular knowledge with representation learning approaches holds significant potential in vast scientific fields such as chemistry and life science. An effective and generalizable molecular representation is expected to capture the consensus and complementary molecular expertise from diverse views and perspectives. However, existing works fall short in learning multi-view molecular repr… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures

  2. arXiv:2312.17670  [pdf, other

    cs.CV cs.LG q-bio.QM q-bio.TO

    Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA

    Authors: Kaiyuan Yang, Fabio Musio, Yihui Ma, Norman Juchler, Johannes C. Paetzold, Rami Al-Maskari, Luciano Höher, Hongwei Bran Li, Ibrahim Ethem Hamamci, Anjany Sekuboyina, Suprosanna Shit, Hou**g Huang, Chinmay Prabhakar, Ezequiel de la Rosa, Diana Waldmannstetter, Florian Kofler, Fernando Navarro, Martin Menten, Ivan Ezhov, Daniel Rueckert, Iris Vos, Ynte Ruigrok, Birgitta Velthuis, Hugo Kuijf, Julien Hämmerli , et al. (59 additional authors not shown)

    Abstract: The Circle of Willis (CoW) is an important network of arteries connecting major circulations of the brain. Its vascular architecture is believed to affect the risk, severity, and clinical outcome of serious neuro-vascular diseases. However, characterizing the highly variable CoW anatomy is still a manual and time-consuming expert task. The CoW is usually imaged by two angiographic imaging modaliti… ▽ More

    Submitted 29 April, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 24 pages, 11 figures, 9 tables. Summary Paper for the MICCAI TopCoW 2023 Challenge

  3. arXiv:2307.09484  [pdf, other

    q-bio.BM cs.CE cs.LG physics.chem-ph

    MolFM: A Multimodal Molecular Foundation Model

    Authors: Yizhen Luo, Kai Yang, Massimo Hong, Xing Yi Liu, Zaiqing Nie

    Abstract: Molecular knowledge resides within three different modalities of information sources: molecular structures, biomedical documents, and knowledge bases. Effective incorporation of molecular knowledge from these modalities holds paramount significance in facilitating biomedical research. However, existing multimodal molecular foundation models exhibit limitations in capturing intricate connections be… ▽ More

    Submitted 21 July, 2023; v1 submitted 6 June, 2023; originally announced July 2023.

    Comments: 31 pages, 15 figures, and 15 tables

  4. arXiv:2305.16634  [pdf, other

    q-bio.BM

    Machine Learning for Protein Engineering

    Authors: Kadina E. Johnston, Clara Fannjiang, Bruce J. Wittmann, Brian L. Hie, Kevin K. Yang, Zachary Wu

    Abstract: Directed evolution of proteins has been the most effective method for protein engineering. However, a new paradigm is emerging, fusing the library generation and screening approaches of traditional directed evolution with computation through the training of machine learning models on protein sequence fitness data. This chapter highlights successful applications of machine learning to protein engin… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Initial book chapter submission on February 28, 2022, to be published by Springer Nature

  5. Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction

    Authors: Kaiyuan Yang, Hou**g Huang, Olafs Vandans, Adithya Murali, Fujia Tian, Roland H. C. Yap, Liang Dai

    Abstract: A central problem in computational biophysics is protein structure prediction, i.e., finding the optimal folding of a given amino acid sequence. This problem has been studied in a classical abstract model, the HP model, where the protein is modeled as a sequence of H (hydrophobic) and P (polar) amino acids on a lattice. The objective is to find conformations maximizing H-H contacts. It is known th… ▽ More

    Submitted 9 December, 2022; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Published at Physica A: Statistical Mechanics and its Applications, available online 7 December 2022. Extended abstract accepted by the Machine Learning and the Physical Sciences workshop, NeurIPS 2022

  6. arXiv:2209.15611  [pdf, other

    q-bio.BM cs.AI

    Protein structure generation via folding diffusion

    Authors: Kevin E. Wu, Kevin K. Yang, Rianne van den Berg, James Y. Zou, Alex X. Lu, Ava P. Amini

    Abstract: The ability to computationally generate novel yet physically foldable protein structures could lead to new biological discoveries and new treatments targeting yet incurable diseases. Despite recent advances in protein structure prediction, directly generating diverse, novel protein structures from neural networks remains difficult. In this work, we present a new diffusion-based generative model th… ▽ More

    Submitted 23 November, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    ACM Class: I.2.0; J.3

  7. arXiv:2208.00982  [pdf, other

    math.NA math.DS q-bio.PE

    Dominant Eigenvalue-Eigenvector Pair Estimation via Graph Infection

    Authors: Kaiyuan Yang, Li Xia, Y. C. Tay

    Abstract: We present a novel method to estimate the dominant eigenvalue and eigenvector pair of any non-negative real matrix via graph infection. The key idea in our technique lies in approximating the solution to the first-order matrix ordinary differential equation (ODE) with the Euler method. Graphs, which can be weighted, directed, and with loops, are first converted to its adjacency matrix A. Then by a… ▽ More

    Submitted 7 May, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Research paper accepted by Proc. 16th International Conference on Graph Transformation (ICGT 2023), Leicester, UK. Extended abstract accepted by the Graph Signal Processing (GSP) Workshop 2023, Oxford, UK. GitHub source code: https://github.com/FeynmanDNA/Dominant_EigenPair_Est_Graph_Infection

  8. arXiv:2206.06583  [pdf, other

    q-bio.QM cs.AI

    Exploring evolution-aware & -free protein language models as protein function predictors

    Authors: Mingyang Hu, Fajie Yuan, Kevin K. Yang, Fusong Ju, ** Su, Hui Wang, Fei Yang, Qiuyang Ding

    Abstract: Large-scale Protein Language Models (PLMs) have improved performance in protein prediction tasks, ranging from 3D structure prediction to various function predictions. In particular, AlphaFold, a ground-breaking AI system, could potentially reshape structural biology. However, the utility of the PLM module in AlphaFold, Evoformer, has not been explored beyond structure prediction. In this paper, w… ▽ More

    Submitted 16 October, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  9. Magnetoelectric Bio-Implants Powered and Programmed by a Single Transmitter for Coordinated Multisite Stimulation

    Authors: Zhanghao Yu, Joshua C. Chen, Yan He, Fatima T. Alrashdan, Benjamin W. Avants, Amanda Singer, Jacob T. Robinson, Kaiyuan Yang

    Abstract: This article presents a hardware platform including stimulating implants wirelessly powered and controlled by a shared transmitter (TX) for coordinated leadless multisite stimulation. The adopted novel single-TX, multiple-implant structure can flexibly deploy stimuli, improve system efficiency, easily scale stimulating channel quantity, and relieve efforts in device synchronization. In the propose… ▽ More

    Submitted 31 December, 2021; originally announced December 2021.

    Comments: This paper has been published in IEEE Journal of Solid-State Circuits, 2021

    Journal ref: IEEE Journal of Solid-State Circuits, 2021

  10. Machine learning modeling of family wide enzyme-substrate specificity screens

    Authors: Samuel Goldman, Ria Das, Kevin K. Yang, Connor W. Coley

    Abstract: Biocatalysis is a promising approach to sustainably synthesize pharmaceuticals, complex natural products, and commodity chemicals at scale. However, the adoption of biocatalysis is limited by our ability to select enzymes that will catalyze their natural chemical transformation on non-natural substrates. While machine learning and in silico directed evolution are well-posed for this predictive mod… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

  11. MagNI: A Magnetoelectrically Powered and Controlled Wireless Neurostimulating Implant

    Authors: Zhanghao Yu, Joshua C. Chen, Fatima T. Alrashdan, Benjamin W. Avants, Yan He, Amanda Singer, Jacob T. Robinson, Kaiyuan Yang

    Abstract: This paper presents the first wireless and programmable neural stimulator leveraging magnetoelectric (ME) effects for power and data transfer. Thanks to low tissue absorption, low misalignment sensitivity and high power transfer efficiency, the ME effect enables safe delivery of high power levels (a few milliwatts) at low resonant frequencies (~250 kHz) to mm-sized implants deep inside the body (3… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: This work has been accepted to 2020 IEEE Transactions on Biomedical Circuits and Systems (TBioCAS)

    Journal ref: IEEE Transactions on Biomedical Circuits and Systems (TBioCAS), Volume: 14, Issue: 6, Pages: 1241-1252, Dec. 2020

  12. arXiv:2106.05466  [pdf, other

    q-bio.QM cs.LG q-bio.BM

    Adaptive machine learning for protein engineering

    Authors: Brian L. Hie, Kevin K. Yang

    Abstract: Machine-learning models that learn from data to predict how protein sequence encodes function are emerging as a useful protein engineering tool. However, when using these models to suggest new protein designs, one must deal with the vast combinatorial complexity of protein sequences. Here, we review how to use a sequence-to-function machine-learning surrogate model to select sequences for experime… ▽ More

    Submitted 6 July, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 9 pages, 2 figures

  13. arXiv:2104.04457  [pdf, other

    q-bio.QM cs.LG q-bio.BM stat.ML

    Protein sequence design with deep generative models

    Authors: Zachary Wu, Kadina E. Johnston, Frances H. Arnold, Kevin K. Yang

    Abstract: Protein engineering seeks to identify protein sequences with optimized properties. When guided by machine learning, protein sequence generation methods can draw on prior knowledge and experimental efforts to improve this process. In this review, we highlight recent applications of machine learning to generate protein sequences, focusing on the emerging field of deep generative methods.

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: 11 pages, 2 figures

  14. arXiv:2007.00776  [pdf, other

    cs.ET q-bio.NC

    Emulation of Astrocyte Induced Neural Phase Synchrony in Spin-Orbit Torque Oscillator Neurons

    Authors: Umang Garg, Kezhou Yang, Abhronil Sengupta

    Abstract: Astrocytes play a central role in inducing concerted phase synchronized neural-wave patterns inside the brain. In this article, we demonstrate that injected radio-frequency signal in underlying heavy metal layer of spin-orbit torque oscillator neurons mimic the neuron phase synchronization effect realized by glial cells. Potential application of such phase coupling effects is illustrated in the co… ▽ More

    Submitted 16 September, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

  15. arXiv:2006.08532  [pdf, other

    q-bio.BM cs.CV cs.LG eess.IV q-bio.QM

    Improved Conditional Flow Models for Molecule to Image Synthesis

    Authors: Karren Yang, Samuel Goldman, Wengong **, Alex Lu, Regina Barzilay, Tommi Jaakkola, Caroline Uhler

    Abstract: In this paper, we aim to synthesize cell microscopy images under different molecular interventions, motivated by practical applications to drug development. Building on the recent success of graph neural networks for learning molecular embeddings and flow-based models for image generation, we propose Mol2Image: a flow-based generative model for molecule to cell image synthesis. To generate cell fe… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    MSC Class: 92-08

  16. Causal Network Models of SARS-CoV-2 Expression and Aging to Identify Candidates for Drug Repurposing

    Authors: Anastasiya Belyaeva, Louis Cammarata, Adityanarayanan Radhakrishnan, Chandler Squires, Karren Dai Yang, G. V. Shivashankar, Caroline Uhler

    Abstract: Given the severity of the SARS-CoV-2 pandemic, a major challenge is to rapidly repurpose existing approved drugs for clinical interventions. While a number of data-driven and experimental approaches have been suggested in the context of drug repurposing, a platform that systematically integrates available transcriptomic, proteomic and structural data is missing. More importantly, given that SARS-C… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

  17. arXiv:2005.10036  [pdf, other

    cs.LG q-bio.QM stat.ML

    Uncertainty Quantification Using Neural Networks for Molecular Property Prediction

    Authors: Lior Hirschfeld, Kyle Swanson, Kevin Yang, Regina Barzilay, Connor W. Coley

    Abstract: Uncertainty quantification (UQ) is an important component of molecular property prediction, particularly for drug discovery applications where model predictions direct experimental design and where unanticipated imprecision wastes valuable time and resources. The need for UQ is especially acute for neural models, which are becoming increasingly standard yet are challenging to interpret. While seve… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

  18. arXiv:2001.10530  [pdf

    q-bio.PE physics.soc-ph

    Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019-nCoV

    Authors: Tao Zhou, Quanhui Liu, Zimo Yang, **gyi Liao, Kexin Yang, Wei Bai, Xin Lü, Wei Zhang

    Abstract: Objectives.--To estimate the basic reproduction number of the Wuhan novel coronavirus (2019-nCoV). Methods.--Based on the susceptible-exposed-infected-removed (SEIR) compartment model and the assumption that the infectious cases with symptoms occurred before January 25, 2020 are resulted from free propagation without intervention, we estimate the basic reproduction number of 2019-nCoV according to… ▽ More

    Submitted 31 January, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: 8 pages, 1 table and 1 figure

    Journal ref: Journal of Evidence Based Medicine (2020) 1

  19. arXiv:1904.08102  [pdf, other

    cs.LG q-bio.QM stat.ML

    Batched Stochastic Bayesian Optimization via Combinatorial Constraints Design

    Authors: Kevin K. Yang, Yuxin Chen, Alycia Lee, Yisong Yue

    Abstract: In many high-throughput experimental design settings, such as those common in biochemical engineering, batched queries are more cost effective than one-by-one sequential queries. Furthermore, it is often not possible to directly choose items to query. Instead, the experimenter specifies a set of constraints that generates a library of possible items, which are then selected stochastically. Motivat… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

  20. arXiv:1811.10775  [pdf, other

    q-bio.BM

    Machine learning-guided directed evolution for protein engineering

    Authors: Kevin K. Yang, Zachary Wu, Frances H. Arnold

    Abstract: Machine learning (ML)-guided directed evolution is a new paradigm for biological design that enables optimization of complex functions. ML methods use data to predict how sequence maps to function without requiring a detailed model of the underlying physics or biological pathways. To demonstrate ML-guided directed evolution, we introduce the steps required to build ML sequence-function models and… ▽ More

    Submitted 19 April, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: Made significant revisions to focus on aspects most relevant to applying machine learning to speed up directed evolution

  21. arXiv:1403.7413  [pdf

    q-bio.TO q-bio.PE

    Niche inheritance: a cooperative pathway to enhance cancer cell fitness though ecosystem engineering

    Authors: Kimberline R. Yang, Steven Mooney, Jelani C. Zarif, Donald S. Coffey, Russell S. Taichman, Kenneth J. Pienta

    Abstract: Cancer cells can be described as an invasive species that is able to establish itself in a new environment. The concept of niche construction can be utilized to describe the process by which cancer cells terraform their environment, thereby engineering an ecosystem that promotes the genetic fitness of the species. Ecological dispersion theory can then be utilized to describe and model the steps an… ▽ More

    Submitted 28 March, 2014; originally announced March 2014.

    Comments: 8 pages, 1 Table, 4 Figures