Showing 1–14 of 14 results for author: Frey, N C

  1. arXiv:2306.12360  [pdf, other]

    q-bio.BM cs.LG

    Protein Discovery with Discrete Walk-Jump Sampling

    Authors: Nathan C. Frey, Daniel Berenberg, Karina Zadorozhny, Joseph Kleinhenz, Julien Lafrance-Vanasse, Isidro Hotzel, Yan Wu, Stephen Ra, Richard Bonneau, Kyunghyun Cho, Andreas Loukas, Vladimir Gligorijevic, Saeed Saremi

    Abstract: We resolve difficulties in training and sampling from a discrete generative model by learning a smoothed energy function, sampling from the smoothed data manifold with Langevin Markov chain Monte Carlo (MCMC), and projecting back to the true data manifold with one-step denoising. Our Discrete Walk-Jump Sampling formalism combines the contrastive divergence training of an energy-based model and imp…

    Submitted 15 March, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: ICLR 2024 oral presentation, top 1.2% of submissions; {ICLR 2023 Physics for Machine Learning, NeurIPS 2023 GenBio, MLCB 2023} Spotlight

  2. arXiv:2305.20009  [pdf, other]

    cs.LG q-bio.BM

    Protein Design with Guided Discrete Diffusion

    Authors: Nate Gruver, Samuel Stanton, Nathan C. Frey, Tim G. J. Rudner, Isidro Hotzel, Julien Lafrance-Vanasse, Arvind Rajpal, Kyunghyun Cho, Andrew Gordon Wilson

    Abstract: A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling. The generative model samples plausible sequences while the discriminative model guides a search for sequences with high fitness. Given its broad success in conditional sampling, classifier-guided diffusion modeling is a promising foundation for protein design, leading many to…

    Submitted 12 December, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Advances in Neural Information Processing Systems 36, December 10-16, 2023

  3. arXiv:2302.07754  [pdf, other]

    cs.LG

    SupSiam: Non-contrastive Auxiliary Loss for Learning from Molecular Conformers

    Authors: Michael Maser, Ji Won Park, Joshua Yao-Yu Lin, Jae Hyeon Lee, Nathan C. Frey, Andrew Watkins

    Abstract: We investigate Siamese networks for learning related embeddings for augmented samples of molecular conformers. We find that a non-contrastive (positive-pair only) auxiliary task aids in supervised training of Euclidean neural networks (E3NNs) and increases manifold smoothness (MS) around point-cloud geometries. We demonstrate this property for multiple drug-activity prediction tasks while maintain…

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: Submitted to the MLDD workshop, ICLR 2023

  4. arXiv:2301.11581  [pdf, other]

    cs.AI cs.CY cs.DC cs.LG

    A Green(er) World for A.I

    Authors: Dan Zhao, Nathan C. Frey, Joseph McDonald, Matthew Hubbell, David Bestor, Michael Jones, Andrew Prout, Vijay Gadepally, Siddharth Samsi

    Abstract: As research and practice in artificial intelligence (A.I.) grow in leaps and bounds, the resources necessary to sustain and support their operations also grow at an increasing pace. While innovations and applications from A.I. have brought significant advances, from applications to vision and natural language to improvements to fields like medical imaging and materials engineering, their costs sho…

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: 8 pages, published in 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

    Journal ref: D. Zhao et al., "A Green(er) World for A.I.," 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Lyon, France, 2022, pp. 742-750

  5. arXiv:2211.13408  [pdf, other]

    cs.LG cond-mat.mtrl-sci

    Graph Contrastive Learning for Materials

    Authors: Teddy Koker, Keegan Quigley, Will Spaeth, Nathan C. Frey, Lin Li

    Abstract: Recent work has shown the potential of graph neural networks to efficiently predict material properties, enabling high-throughput screening of materials. Training these models, however, often requires large quantities of labelled data, obtained via costly methods such as ab initio calculations or experimental evaluation. By leveraging a series of material-specific transformations, we introduce Cry…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 7 pages, 3 figures, NeurIPS 2022 AI for Accelerated Materials Design Workshop

  6. arXiv:2210.10838  [pdf, other]

    cs.LG q-bio.QM

    A Pareto-optimal compositional energy-based model for sampling and optimization of protein sequences

    Authors: Nataša Tagasovska, Nathan C. Frey, Andreas Loukas, Isidro Hötzel, Julien Lafrance-Vanasse, Ryan Lewis Kelly, Yan Wu, Arvind Rajpal, Richard Bonneau, Kyunghyun Cho, Stephen Ra, Vladimir Gligorijević

    Abstract: Deep generative models have emerged as a popular machine learning-based approach for inverse design problems in the life sciences. However, these problems often require sampling new designs that satisfy multiple properties of interest in addition to learning the data distribution. This multi-objective optimization becomes more challenging when properties are independent or orthogonal to each other…

    Submitted 19 October, 2022; originally announced October 2022.

  7. arXiv:2204.00056  [pdf, other]

    physics.chem-ph cs.LG

    SELFIES and the future of molecular string representations

    Authors: Mario Krenn, Qianxiang Ai, Senja Barthel, Nessa Carson, Angelo Frei, Nathan C. Frey, Pascal Friederich, Théophile Gaudin, Alberto Alexander Gayle, Kevin Maik Jablonka, Rafael F. Lameiro, Dominik Lemm, Alston Lo, Seyed Mohamad Moosavi, José Manuel Nápoles-Duarte, AkshatKumar Nigam, Robert Pollice, Kohulan Rajan, Ulrich Schatzschneider, Philippe Schwaller, Marta Skreta, Berend Smit, Felix Strieth-Kalthoff, Chong Sun, Gary Tom, et al. (6 additional authors not shown)

    Abstract: Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language for each of these tasks. Strings are a common tool…

    Submitted 31 March, 2022; originally announced April 2022.

    Comments: 34 pages, 15 figures, comments and suggestions for additional references are welcome!

    Journal ref: Cell Patterns 3(10), 100588 (2022)

  8. arXiv:2201.12423  [pdf, other]

    cs.LG cs.DC

    Benchmarking Resource Usage for Efficient Distributed Deep Learning

    Authors: Nathan C. Frey, Baolin Li, Joseph McDonald, Dan Zhao, Michael Jones, David Bestor, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi

    Abstract: Deep learning (DL) workflows demand an ever-increasing budget of compute and energy in order to achieve outsized gains. Neural architecture searches, hyperparameter sweeps, and rapid prototyping consume immense resources that can prevent resource-constrained researchers from experimenting with large models and carry considerable environmental impact. As such, it becomes essential to understand how…

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: 14 pages, 17 figures

  9. arXiv:2201.12419  [pdf, other]

    physics.chem-ph cs.LG

    FastFlows: Flow-Based Models for Molecular Graph Generation

    Authors: Nathan C. Frey, Vijay Gadepally, Bharath Ramsundar

    Abstract: We propose a framework using normalizing-flow based models, SELF-Referencing Embedded Strings, and multi-objective optimization that efficiently generates small molecules. With an initial training set of only 100 small molecules, FastFlows generates thousands of chemically valid molecules in seconds. Because of the efficient sampling, substructure filters can be applied as desired to eliminate com…

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: 7 pages, 4 figures, ELLIS Machine Learning for Molecule Discovery Workshop 2021

  10. arXiv:2112.04977  [pdf, other]

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    Bringing Atomistic Deep Learning to Prime Time

    Authors: Nathan C. Frey, Siddharth Samsi, Bharath Ramsundar, Connor W. Coley, Vijay Gadepally

    Abstract: Artificial intelligence has not yet revolutionized the design of materials and molecules. In this perspective, we identify four barriers preventing the integration of atomistic deep learning, molecular science, and high-performance computing. We outline focused research efforts to address the opportunities presented by these challenges.

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: 6 pages, 1 figure, NeurIPS 2021 AI for Science workshop

  11. arXiv:2112.03364  [pdf, other]

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    Scalable Geometric Deep Learning on Molecular Graphs

    Authors: Nathan C. Frey, Siddharth Samsi, Joseph McDonald, Lin Li, Connor W. Coley, Vijay Gadepally

    Abstract: Deep learning in molecular and materials sciences is limited by the lack of integration between applied science, artificial intelligence, and high-performance computing. Bottlenecks with respect to the amount of training data, the size and complexity of model architectures, and the scale of the compute infrastructure are all key factors limiting the scaling of deep learning for molecules and mater…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 7 pages, 3 figures, NeurIPS 2021 AI for Science workshop

  12. arXiv:2111.07140  [pdf, ps, other]

    eess.SP cs.LG

    The Pseudo Projection Operator: Applications of Deep Learning to Projection Based Filtering in Non-Trivial Frequency Regimes

    Authors: Matthew L. Weiss, Nathan C. Frey, Siddharth Samsi, Randy C. Paffenroth, Vijay Gadepally

    Abstract: Traditional frequency based projection filters, or projection operators (PO), separate signal and noise through a series of transformations which remove frequencies where noise is present. However, this technique relies on a priori knowledge of what frequencies contain signal and noise and that these frequencies do not overlap, which is difficult to achieve in practice. To address these issues, we…

    Submitted 13 April, 2022; v1 submitted 13 November, 2021; originally announced November 2021.

  13. arXiv:2006.01075  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el physics.comp-ph

    High-throughput search for magnetic and topological order in transition metal oxides

    Authors: Nathan C. Frey, Matthew K. Horton, Jason M. Munro, Sinéad M. Griffin, Kristin A. Persson, Vivek B. Shenoy

    Abstract: The discovery of intrinsic magnetic topological order in $\rm MnBi_2Te_4$ has invigorated the search for materials with coexisting magnetic and topological phases. These multi-order quantum materials are expected to exhibit new topological phases that can be tuned with magnetic fields, but the search for such materials is stymied by difficulties in predicting magnetic structure and stability. Here…

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: 9 pages, 6 figures

    Journal ref: Science Advances 09 Dec 2020: Vol. 6, no. 50, eabd1076

  14. arXiv:1712.02003  [pdf, other]

    q-fin.ST physics.soc-ph

    Universal fluctuations in growth dynamics of economic systems

    Authors: Nathan C. Frey, Sakib Matin, H. Eugene Stanley, Michael Salinger

    Abstract: The growth of business firms is an example of a system of complex interacting units that resembles complex interacting systems in nature such as earthquakes. Remarkably, work in econophysics has provided evidence that the statistical properties of the growth of business firms follow the same sorts of power laws that characterize physical systems near their critical points. Given how economies chan…

    Submitted 21 May, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: 15 pages, 7 figures

    Journal ref: Scientific Reports 9, 713 (2019)