Skip to main content

Showing 1–15 of 15 results for author: Morehead, A

.
  1. arXiv:2406.13864  [pdf, other

    cs.LG q-bio.BM

    Evaluating representation learning on the protein structure universe

    Authors: Arian R. Jamasb, Alex Morehead, Chaitanya K. Joshi, Zuobai Zhang, Kieran Didi, Simon V. Mathis, Charles Harris, Jian Tang, Jianlin Cheng, Pietro Lio, Tom L. Blundell

    Abstract: We introduce ProteinWorkshop, a comprehensive benchmark suite for representation learning on protein structures with Geometric Graph Neural Networks. We consider large-scale pre-training and downstream tasks on both experimental and predicted structures to enable the systematic evaluation of the quality of the learned structural representation and their usefulness in capturing functional relations… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ICLR 2024

  2. arXiv:2406.13839  [pdf, other

    q-bio.BM cs.LG q-bio.GN

    RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone Design

    Authors: Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian R. Jamasb, Charles Harris, Simon V. Mathis, Kieran Didi, Bryan Hooi, Pietro Liò

    Abstract: We introduce RNA-FrameFlow, the first generative model for 3D RNA backbone design. We build upon SE(3) flow matching for protein backbone generation and establish protocols for data preparation and evaluation to address unique challenges posed by RNA modeling. We formulate RNA structures as a set of rigid-body frames and associated loss functions which account for larger, more conformationally fle… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: To be presented as an Oral at ICML 2024 Structured Probabilistic Inference & Generative Modeling Workshop, and a Spotlight at ICML 2024 AI4Science Workshop

  3. arXiv:2405.14108  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM

    Deep Learning for Protein-Ligand Docking: Are We There Yet?

    Authors: Alex Morehead, Nabin Giri, Jian Liu, Jianlin Cheng

    Abstract: The effects of ligand binding on protein structures and their in vivo functions carry numerous implications for modern biomedical research and biotechnology development efforts such as drug discovery. Although several deep learning (DL) methods and benchmarks designed for protein-ligand docking have recently been introduced, to date no prior works have systematically studied the behavior of dockin… ▽ More

    Submitted 7 July, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: 31 pages, 2 tables, 27 figures. Under review. Code, data, tutorials, and benchmark results are available at https://github.com/BioinfoMachineLearning/PoseBench

    ACM Class: I.2.1; J.3

  4. arXiv:2401.06151  [pdf, other

    q-bio.BM cs.AI cs.LG q-bio.QM

    Towards Joint Sequence-Structure Generation of Nucleic Acid and Protein Complexes with SE(3)-Discrete Diffusion

    Authors: Alex Morehead, Jeffrey Ruffolo, Aadyot Bhatnagar, Ali Madani

    Abstract: Generative models of macromolecules carry abundant and impactful implications for industrial and biomedical efforts in protein engineering. However, existing methods are currently limited to modeling protein structures or sequences, independently or jointly, without regard to the interactions that commonly occur between proteins and other macromolecules. In this work, we introduce MMDiff, a genera… ▽ More

    Submitted 21 December, 2023; originally announced January 2024.

    Comments: 15 pages, 11 figures, presented at the NeurIPS 2023 Machine Learning in Structural Biology (MLSB) workshop. Code available at https://github.com/Profluent-Internships/MMDiff

    ACM Class: I.2.1; J.3

  5. arXiv:2305.14749  [pdf, other

    cs.LG q-bio.BM q-bio.QM

    gRNAde: Geometric Deep Learning for 3D RNA inverse design

    Authors: Chaitanya K. Joshi, Arian R. Jamasb, Ramon Viñas, Charles Harris, Simon V. Mathis, Alex Morehead, Rishabh Anand, Pietro Liò

    Abstract: Computational RNA design tasks are often posed as inverse problems, where sequences are designed based on adopting a single desired secondary structure without considering 3D geometry and conformational diversity. We introduce gRNAde, a geometric RNA design pipeline operating on 3D RNA backbones to design sequences that explicitly account for structure and dynamics. Under the hood, gRNAde is a mul… ▽ More

    Submitted 25 May, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Previously titled 'Multi-State RNA Design with Geometric Multi-Graph Neural Networks', presented at ICML 2023 Computational Biology Workshop

  6. arXiv:2302.04313  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM stat.ML

    Geometry-Complete Diffusion for 3D Molecule Generation and Optimization

    Authors: Alex Morehead, Jianlin Cheng

    Abstract: Denoising diffusion probabilistic models (DDPMs) have pioneered new state-of-the-art results in disciplines such as computer vision and computational biology for diverse tasks ranging from text-guided image generation to structure-guided protein design. Along this latter line of research, methods have recently been proposed for generating 3D molecules using equivariant graph neural networks (GNNs)… ▽ More

    Submitted 24 May, 2024; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: 33 pages, 6 figures, 5 tables. Under review. Also presented at ICLR 2023's MLDD workshop. Code available at https://github.com/BioinfoMachineLearning/Bio-Diffusion

    ACM Class: I.2.1; J.3

  7. arXiv:2211.02504  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM stat.ML

    Geometry-Complete Perceptron Networks for 3D Molecular Graphs

    Authors: Alex Morehead, Jianlin Cheng

    Abstract: The field of geometric deep learning has had a profound impact on the development of innovative and powerful graph neural network architectures. Disciplines such as computer vision and computational biology have benefited significantly from such methodological advances, which has led to breakthroughs in scientific domains such as protein structure prediction and design. In this work, we introduce… ▽ More

    Submitted 26 April, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: 36 pages, 3 figures, 12 tables. Under review. Also presented at DLG-AAAI 2023 and AI2ASE-AAAI 2023. Code available at https://github.com/BioinfoMachineLearning/GCPNet

    ACM Class: I.2.1; J.3

  8. arXiv:2205.13594  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM

    DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning

    Authors: Elham Soltanikazemi, Raj S. Roy, Farhan Quadir, Nabin Giri, Alex Morehead, Jianlin Cheng

    Abstract: Predicted inter-chain residue-residue contacts can be used to build the quaternary structure of protein complexes from scratch. However, only a small number of methods have been developed to reconstruct protein quaternary structures using predicted inter-chain contacts. Here, we present an agent-based self-learning method based on deep reinforcement learning (DRLComplex) to build protein complex s… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: 20 pages, 8 figures, 12 tables. Under review

    ACM Class: I.2.1; J.3

  9. arXiv:2205.10627  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM

    DProQ: A Gated-Graph Transformer for Protein Complex Structure Assessment

    Authors: Xiao Chen, Alex Morehead, Jian Liu, Jianlin Cheng

    Abstract: Proteins interact to form complexes to carry out essential biological functions. Computational methods have been developed to predict the structures of protein complexes. However, an important challenge in protein complex structure prediction is to estimate the quality of predicted protein complex structures without any knowledge of the corresponding native structures. Such estimations can then be… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

    Comments: 18 pages, 3 figures, 13 tables. Under review

    ACM Class: I.2.1; J.3

  10. arXiv:2205.10390  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM stat.ML

    EGR: Equivariant Graph Refinement and Assessment of 3D Protein Complex Structures

    Authors: Alex Morehead, Xiao Chen, Tianqi Wu, Jian Liu, Jianlin Cheng

    Abstract: Protein complexes are macromolecules essential to the functioning and well-being of all living organisms. As the structure of a protein complex, in particular its region of interaction between multiple protein subunits (i.e., chains), has a notable influence on the biological function of the complex, computational methods that can quickly and effectively be used to refine and assess the quality of… ▽ More

    Submitted 24 May, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: 18 pages, 3 figures, and 8 tables. Under review

    ACM Class: I.2.1; J.3

  11. arXiv:2204.08584  [pdf

    cs.CV

    A Region-Based Deep Learning Approach to Automated Retail Checkout

    Authors: Maged Shoman, Armstrong Aboah, Alex Morehead, Ye Duan, Abdulateef Daud, Yaw Adu-Gyamfi

    Abstract: Automating the product checkout process at conventional retail stores is a task poised to have large impacts on society generally speaking. Towards this end, reliable deep learning models that enable automated product counting for fast customer checkout can make this goal a reality. In this work, we propose a novel, region-based deep learning approach to automate product counting using a customize… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

  12. arXiv:2203.12522  [pdf, other

    cs.LG cs.AI

    Semi-Supervised Graph Learning Meets Dimensionality Reduction

    Authors: Alex Morehead, Watchanan Chantapakul, Jianlin Cheng

    Abstract: Semi-supervised learning (SSL) has recently received increased attention from machine learning researchers. By enabling effective propagation of known labels in graph-based deep learning (GDL) algorithms, SSL is poised to become an increasingly used technique in GDL in the coming years. However, there are currently few explorations in the graph-based SSL literature on exploiting classical dimensio… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: 8 pages, 8 figures, and 5 tables. Submitted to the 2022 International Joint Conference on Neural Networks (IJCNN 2022). Source code is available at https://github.com/amorehead/SSL-With-DR-And-GNNs

    ACM Class: I.2.1; J.3

  13. arXiv:2110.02423  [pdf, other

    cs.LG q-bio.BM q-bio.QM

    Geometric Transformers for Protein Interface Contact Prediction

    Authors: Alex Morehead, Chen Chen, Jianlin Cheng

    Abstract: Computational methods for predicting the interface contacts between proteins come highly sought after for drug discovery as they can significantly advance the accuracy of alternative approaches, such as protein-protein docking, protein function analysis tools, and other computational methods for protein bioinformatics. In this work, we present the Geometric Transformer, a novel geometry-evolving g… ▽ More

    Submitted 4 March, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: 18 pages, 5 figures, and 9 tables. Camera-ready version for ICLR 2022, with a minor update to Figure 2 in Section 4.1 (Methods - Problem Formulation)

    ACM Class: I.2.1; J.3

  14. arXiv:2106.04362  [pdf, other

    q-bio.QM cs.LG q-bio.BM

    DIPS-Plus: The Enhanced Database of Interacting Protein Structures for Interface Prediction

    Authors: Alex Morehead, Chen Chen, Ada Sedova, Jianlin Cheng

    Abstract: How and where proteins interface with one another can ultimately impact the proteins' functions along with a range of other biological processes. As such, precise computational methods for protein interface prediction (PIP) come highly sought after as they could yield significant advances in drug discovery and design as well as protein function analysis. However, the traditional benchmark dataset… ▽ More

    Submitted 6 October, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: 19 pages, 1 figure, and 4 tables. Updated URLs

    ACM Class: I.2.1; J.3

  15. KEPLER's First Rocky Planet: Kepler-10b

    Authors: Natalie M. Batalha, William J. Borucki, Stephen T. Bryson, Lars A. Buchhave, Douglas A. Caldwell, Jorgen Christensen-Dalsgaard, David Ciardi, Edward W. Dunham, Francois Fressin, Thomas N. Gautier III, Ronald L. Gilliland, Michael R. Haas, Steve B. Howell, Jon M. Jenkins, Hans Kjeldsen, David G. Koch, David W. Latham, Jack J. Lissauer, Geoffrey W. Marcy, Jason F. Rowe, Dimitar D. Sasselov, Sara Seager, Jason H. Steffen, Guillermo Torres, Gibor S. Basri , et al. (27 additional authors not shown)

    Abstract: NASA's Kepler Mission uses transit photometry to determine the frequency of earth-size planets in or near the habitable zone of Sun-like stars. The mission reached a milestone toward meeting that goal: the discovery of its first rocky planet, Kepler-10b. Two distinct sets of transit events were detected: 1) a 152 +/- 4 ppm dimming lasting 1.811 +/- 0.024 hours with ephemeris T[BJD]=2454964.57375+N… ▽ More

    Submitted 3 February, 2011; originally announced February 2011.

    Comments: Accepted, Astrophysical Journal, November 25, 2010; Eexpected publication date: February 20, 2011

    Journal ref: Astrophys.J.729:27,2011