Showing 1–9 of 9 results for author: Flam-Shepherd, D

  1. arXiv:2308.09482  [pdf, other]

    q-bio.BM cs.LG

    Atom-by-atom protein generation and beyond with language models

    Authors: Daniel Flam-Shepherd, Kevin Zhu, Alán Aspuru-Guzik

    Abstract: Protein language models learn powerful representations directly from sequences of amino acids. However, they are constrained to generate proteins with only the set of amino acids represented in their vocabulary. In contrast, chemical language models learn atom-level representations of smaller molecules that include every atom, bond, and ring. In this work, we show that chemical language models can…

    Submitted 16 August, 2023; originally announced August 2023.

  2. arXiv:2305.05708  [pdf, other]

    cs.LG q-bio.QM

    Language models can generate molecules, materials, and protein binding sites directly in three dimensions as XYZ, CIF, and PDB files

    Authors: Daniel Flam-Shepherd, Alán Aspuru-Guzik

    Abstract: Language models are powerful tools for molecular design. Currently, the dominant paradigm is to parse molecular graphs into linear string representations that can easily be trained on. This approach has been very successful; however, it is limited to chemical structures that can be completely represented by a graph -- like organic molecules -- while materials and biomolecular structures like prote…

    Submitted 9 May, 2023; originally announced May 2023.

  3. arXiv:2202.00658  [pdf, other]

    cs.LG cs.AI

    Scalable Fragment-Based 3D Molecular Design with Reinforcement Learning

    Authors: Daniel Flam-Shepherd, Alexander Zhigalin, Alán Aspuru-Guzik

    Abstract: Machine learning has the potential to automate molecular design and drastically accelerate the discovery of new functional compounds. Towards this goal, generative models and reinforcement learning (RL) using string and graph representations have been successfully used to search for novel molecules. However, these approaches are limited since their representations ignore the three-dimensional (3D)…

    Submitted 1 February, 2022; originally announced February 2022.

  4. arXiv:2112.03041  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Keeping it Simple: Language Models can learn Complex Molecular Distributions

    Authors: Daniel Flam-Shepherd, Kevin Zhu, Alán Aspuru-Guzik

    Abstract: Deep generative models of molecules have grown immensely in popularity; trained on relevant datasets, these models are used to search through chemical space. The downstream utility of generative models for the inverse design of novel functional compounds depends on their ability to learn a training distribution of molecules. The simplest example is a language model that takes the form of a recu…

    Submitted 6 December, 2021; originally announced December 2021.

    Journal ref: Nat Commun 13, 3293 (2022)

  5. Learning quantum dynamics with latent neural ODEs

    Authors: Matthew Choi, Daniel Flam-Shepherd, Thi Ha Kyaw, Alán Aspuru-Guzik

    Abstract: The core objective of machine-assisted scientific discovery is to learn physical laws from experimental data without prior knowledge of the systems in question. In the area of quantum physics, making progress towards these goals is significantly more challenging due to the curse of dimensionality as well as the counter-intuitive nature of quantum mechanics. Here, we present the QNODE, a latent neu…

    Submitted 4 February, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: 11 Pages. 8 Figures. This is a resubmission. We added more results and plots for more quantitative analysis

    Journal ref: Phys. Rev. A 105, 042403 (2022)

  6. Learning Interpretable Representations of Entanglement in Quantum Optics Experiments using Deep Generative Models

    Authors: Daniel Flam-Shepherd, Tony Wu, Xuemei Gu, Alba Cervera-Lierta, Mario Krenn, Alan Aspuru-Guzik

    Abstract: Quantum physics experiments produce interesting phenomena such as interference or entanglement, which are core properties of numerous future quantum technologies. The complex relationship between the setup structure of a quantum experiment and its entanglement properties is essential to fundamental research in quantum optics but is difficult to intuitively understand. We present a deep generative…

    Submitted 16 June, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: Published in Nature Machine Intelligence https://doi.org/10.1038/s42256-022-00493-5

    Journal ref: Nature Machine Intelligence 4, 544 (2022)

  7. arXiv:2011.02004  [pdf, other]

    cs.LG math.OC stat.ML

    Bayesian Variational Optimization for Combinatorial Spaces

    Authors: Tony C. Wu, Daniel Flam-Shepherd, Alán Aspuru-Guzik

    Abstract: This paper focuses on Bayesian Optimization in combinatorial spaces. In many applications in the natural sciences, including the study of molecules, proteins, DNA, device structures and quantum circuit designs, optimization over combinatorial categorical spaces is needed to find optimal or Pareto-optimal solutions. However, only a limited number of methods have been proposed t…

    Submitted 3 November, 2020; originally announced November 2020.

  8. arXiv:2002.10413  [pdf, other]

    cs.LG stat.ML

    Neural Message Passing on High Order Paths

    Authors: Daniel Flam-Shepherd, Tony Wu, Pascal Friederich, Alan Aspuru-Guzik

    Abstract: Graph neural networks have achieved impressive results in predicting molecular properties, but they do not directly account for local and hidden structures in the graph such as functional groups and molecular geometry. At each propagation step, GNNs aggregate only over first order neighbours, ignoring important information contained in subsequent neighbours as well as the relationships between thos…

    Submitted 24 February, 2020; originally announced February 2020.

  9. arXiv:2002.07087  [pdf, other]

    cs.LG stat.ML

    Graph Deconvolutional Generation

    Authors: Daniel Flam-Shepherd, Tony Wu, Alan Aspuru-Guzik

    Abstract: Graph generation is an extremely important task, as graphs are found throughout different areas of science and engineering. In this work, we focus on the modern equivalent of the Erdos-Renyi random graph model: the graph variational autoencoder (GVAE). This model assumes edges and nodes are independent in order to generate entire graphs at a time using a multi-layer perceptron decoder. As a result…

    Submitted 13 February, 2020; originally announced February 2020.