Skip to main content

Showing 1–21 of 21 results for author: Frey, N

.
  1. arXiv:2407.00236  [pdf, other

    cs.LG cs.NE

    Closed-Form Test Functions for Biophysical Sequence Optimization Algorithms

    Authors: Samuel Stanton, Robert Alberstein, Nathan Frey, Andrew Watkins, Kyunghyun Cho

    Abstract: There is a growing body of work seeking to replicate the success of machine learning (ML) on domains like computer vision (CV) and natural language processing (NLP) to applications involving biophysical data. One of the key ingredients of prior successes in CV and NLP was the broad acceptance of difficult benchmarks that distilled key subproblems into approachable tasks that any junior researcher… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2403.05224  [pdf, other

    physics.flu-dyn

    Investigating the shortcomings of the Flow Convergence Method for quantification of Mitral Regurgitation in a pulsatile in-vitro environment and with Computational Fluid Dynamics

    Authors: Robin Leister, Roger Karl, Lubov Stroh, Derliz Mereles, Matthias Eden, Luis Neff, Raffaele de Simone, Gabriele Romano, Jochen Kriegseis, Matthias Karck, Norbert Frey, Bettina Frohnapfel, Alexander Stroh, Sandy Engelhardt

    Abstract: The flow convergence method includes calculation of the proximal isovelocity surface area (PISA) and is widely used to classify mitral regurgitation (MR) with echocardiography. It constitutes a primary decision factor for determination of treatment and should therefore be a robust quantification method. However, it is known for its tendency to underestimate MR and its dependence on user expertise.… ▽ More

    Submitted 14 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2306.12360  [pdf, other

    q-bio.BM cs.LG

    Protein Discovery with Discrete Walk-Jump Sampling

    Authors: Nathan C. Frey, Daniel Berenberg, Karina Zadorozhny, Joseph Kleinhenz, Julien Lafrance-Vanasse, Isidro Hotzel, Yan Wu, Stephen Ra, Richard Bonneau, Kyunghyun Cho, Andreas Loukas, Vladimir Gligorijevic, Saeed Saremi

    Abstract: We resolve difficulties in training and sampling from a discrete generative model by learning a smoothed energy function, sampling from the smoothed data manifold with Langevin Markov chain Monte Carlo (MCMC), and projecting back to the true data manifold with one-step denoising. Our Discrete Walk-Jump Sampling formalism combines the contrastive divergence training of an energy-based model and imp… ▽ More

    Submitted 15 March, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: ICLR 2024 oral presentation, top 1.2% of submissions; {ICLR 2023 Physics for Machine Learning, NeurIPS 2023 GenBio, MLCB 2023} Spotlight

  4. arXiv:2305.20009  [pdf, other

    cs.LG q-bio.BM

    Protein Design with Guided Discrete Diffusion

    Authors: Nate Gruver, Samuel Stanton, Nathan C. Frey, Tim G. J. Rudner, Isidro Hotzel, Julien Lafrance-Vanasse, Arvind Rajpal, Kyunghyun Cho, Andrew Gordon Wilson

    Abstract: A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling. The generative model samples plausible sequences while the discriminative model guides a search for sequences with high fitness. Given its broad success in conditional sampling, classifier-guided diffusion modeling is a promising foundation for protein design, leading many to… ▽ More

    Submitted 12 December, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Advances in Neural Information Processing Systems 36, December 10-16, 2023

  5. arXiv:2302.07754  [pdf, other

    cs.LG

    SupSiam: Non-contrastive Auxiliary Loss for Learning from Molecular Conformers

    Authors: Michael Maser, Ji Won Park, Joshua Yao-Yu Lin, Jae Hyeon Lee, Nathan C. Frey, Andrew Watkins

    Abstract: We investigate Siamese networks for learning related embeddings for augmented samples of molecular conformers. We find that a non-contrastive (positive-pair only) auxiliary task aids in supervised training of Euclidean neural networks (E3NNs) and increases manifold smoothness (MS) around point-cloud geometries. We demonstrate this property for multiple drug-activity prediction tasks while maintain… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: Submitted to the MLDD workshop, ICLR 2023

  6. arXiv:2301.11581  [pdf, other

    cs.AI cs.CY cs.DC cs.LG

    A Green(er) World for A.I

    Authors: Dan Zhao, Nathan C. Frey, Joseph McDonald, Matthew Hubbell, David Bestor, Michael Jones, Andrew Prout, Vijay Gadepally, Siddharth Samsi

    Abstract: As research and practice in artificial intelligence (A.I.) grow in leaps and bounds, the resources necessary to sustain and support their operations also grow at an increasing pace. While innovations and applications from A.I. have brought significant advances, from applications to vision and natural language to improvements to fields like medical imaging and materials engineering, their costs sho… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: 8 pages, published in 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

    Journal ref: D. Zhao et al., "A Green(er) World for A.I.," 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Lyon, France, 2022, pp. 742-750

  7. arXiv:2211.13408  [pdf, other

    cs.LG cond-mat.mtrl-sci

    Graph Contrastive Learning for Materials

    Authors: Teddy Koker, Keegan Quigley, Will Spaeth, Nathan C. Frey, Lin Li

    Abstract: Recent work has shown the potential of graph neural networks to efficiently predict material properties, enabling high-throughput screening of materials. Training these models, however, often requires large quantities of labelled data, obtained via costly methods such as ab initio calculations or experimental evaluation. By leveraging a series of material-specific transformations, we introduce Cry… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 7 pages, 3 figures, NeurIPS 2022 AI for Accelerated Materials Design Workshop

  8. arXiv:2210.10838  [pdf, other

    cs.LG q-bio.QM

    A Pareto-optimal compositional energy-based model for sampling and optimization of protein sequences

    Authors: Nataša Tagasovska, Nathan C. Frey, Andreas Loukas, Isidro Hötzel, Julien Lafrance-Vanasse, Ryan Lewis Kelly, Yan Wu, Arvind Rajpal, Richard Bonneau, Kyunghyun Cho, Stephen Ra, Vladimir Gligorijević

    Abstract: Deep generative models have emerged as a popular machine learning-based approach for inverse design problems in the life sciences. However, these problems often require sampling new designs that satisfy multiple properties of interest in addition to learning the data distribution. This multi-objective optimization becomes more challenging when properties are independent or orthogonal to each other… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

  9. Roughness of molecular property landscapes and its impact on modellability

    Authors: Matteo Aldeghi, David E. Graff, Nathan Frey, Joseph A. Morrone, Edward O. Pyzer-Knapp, Kirk E. Jordan, Connor W. Coley

    Abstract: In molecular discovery and drug design, structure-property relationships and activity landscapes are often qualitatively or quantitatively analyzed to guide the navigation of chemical space. The roughness (or smoothness) of these molecular property landscapes is one of their most studied geometric attributes, as it can characterize the presence of activity cliffs, with rougher landscapes generally… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: 17 pages, 6 figures, 2 tables (SI with 17 pages, 16 figures)

    Journal ref: J. Chem. Inf. Model. 2022, 62, 19, 4660-4671

  10. Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models

    Authors: Joseph McDonald, Baolin Li, Nathan Frey, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi

    Abstract: The energy requirements of current natural language processing models continue to grow at a rapid, unsustainable pace. Recent works highlighting this problem conclude there is an urgent need for methods that reduce the energy needs of NLP and machine learning more broadly. In this article, we investigate techniques that can be used to reduce the energy consumption of common NLP applications. In pa… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Journal ref: Findings of the Association for Computational Linguistics: NAACL 2022

  11. The MIT Supercloud Workload Classification Challenge

    Authors: Benny J. Tang, Qiqi Chen, Matthew L. Weiss, Nathan Frey, Joseph McDonald, David Bestor, Charles Yee, William Arcand, Chansup Byun, Daniel Edelman, Matthew Hubbell, Michael Jones, Jeremy Kepner, Anna Klein, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Andrew Bowne, Lindsey McEvoy, Baolin Li, Devesh Tiwari , et al. (2 additional authors not shown)

    Abstract: High-Performance Computing (HPC) centers and cloud providers support an increasingly diverse set of applications on heterogenous hardware. As Artificial Intelligence (AI) and Machine Learning (ML) workloads have become an increasingly larger share of the compute workloads, new approaches to optimized resource usage, allocation, and deployment of new AI frameworks are needed. By identifying compute… ▽ More

    Submitted 13 April, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: Accepted at IPDPS ADOPT'22

  12. arXiv:2204.00056  [pdf, other

    physics.chem-ph cs.LG

    SELFIES and the future of molecular string representations

    Authors: Mario Krenn, Qianxiang Ai, Senja Barthel, Nessa Carson, Angelo Frei, Nathan C. Frey, Pascal Friederich, Théophile Gaudin, Alberto Alexander Gayle, Kevin Maik Jablonka, Rafael F. Lameiro, Dominik Lemm, Alston Lo, Seyed Mohamad Moosavi, José Manuel Nápoles-Duarte, AkshatKumar Nigam, Robert Pollice, Kohulan Rajan, Ulrich Schatzschneider, Philippe Schwaller, Marta Skreta, Berend Smit, Felix Strieth-Kalthoff, Chong Sun, Gary Tom , et al. (6 additional authors not shown)

    Abstract: Artificial intelligence (AI) and machine learning (ML) are expanding in popularity for broad applications to challenging tasks in chemistry and materials science. Examples include the prediction of properties, the discovery of new reaction pathways, or the design of new molecules. The machine needs to read and write fluently in a chemical language for each of these tasks. Strings are a common tool… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

    Comments: 34 pages, 15 figures, comments and suggestions for additional references are welcome!

    Journal ref: Cell Patterns 3(10), 100588(2022)

  13. arXiv:2201.12423  [pdf, other

    cs.LG cs.DC

    Benchmarking Resource Usage for Efficient Distributed Deep Learning

    Authors: Nathan C. Frey, Baolin Li, Joseph McDonald, Dan Zhao, Michael Jones, David Bestor, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi

    Abstract: Deep learning (DL) workflows demand an ever-increasing budget of compute and energy in order to achieve outsized gains. Neural architecture searches, hyperparameter sweeps, and rapid prototy** consume immense resources that can prevent resource-constrained researchers from experimenting with large models and carry considerable environmental impact. As such, it becomes essential to understand how… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: 14 pages, 17 figures

  14. arXiv:2201.12419  [pdf, other

    physics.chem-ph cs.LG

    FastFlows: Flow-Based Models for Molecular Graph Generation

    Authors: Nathan C. Frey, Vijay Gadepally, Bharath Ramsundar

    Abstract: We propose a framework using normalizing-flow based models, SELF-Referencing Embedded Strings, and multi-objective optimization that efficiently generates small molecules. With an initial training set of only 100 small molecules, FastFlows generates thousands of chemically valid molecules in seconds. Because of the efficient sampling, substructure filters can be applied as desired to eliminate com… ▽ More

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: 7 pages, 4 figures, ELLIS Machine Learning for Molecule Discovery Workshop 2021

  15. arXiv:2112.04977  [pdf, other

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    Bringing Atomistic Deep Learning to Prime Time

    Authors: Nathan C. Frey, Siddharth Samsi, Bharath Ramsundar, Connor W. Coley, Vijay Gadepally

    Abstract: Artificial intelligence has not yet revolutionized the design of materials and molecules. In this perspective, we identify four barriers preventing the integration of atomistic deep learning, molecular science, and high-performance computing. We outline focused research efforts to address the opportunities presented by these challenges.

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: 6 pages, 1 figure, NeurIPS 2021 AI for Science workshop

  16. arXiv:2112.03364  [pdf, other

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    Scalable Geometric Deep Learning on Molecular Graphs

    Authors: Nathan C. Frey, Siddharth Samsi, Joseph McDonald, Lin Li, Connor W. Coley, Vijay Gadepally

    Abstract: Deep learning in molecular and materials sciences is limited by the lack of integration between applied science, artificial intelligence, and high-performance computing. Bottlenecks with respect to the amount of training data, the size and complexity of model architectures, and the scale of the compute infrastructure are all key factors limiting the scaling of deep learning for molecules and mater… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 7 pages, 3 figures, NeurIPS 2021 AI for Science workshop

  17. arXiv:2111.07140  [pdf, ps, other

    eess.SP cs.LG

    The Pseudo Projection Operator: Applications of Deep Learning to Projection Based Filtering in Non-Trivial Frequency Regimes

    Authors: Matthew L. Weiss, Nathan C. Frey, Siddharth Samsi, Randy C. Paffenroth, Vijay Gadepally

    Abstract: Traditional frequency based projection filters, or projection operators (PO), separate signal and noise through a series of transformations which remove frequencies where noise is present. However, this technique relies on a priori knowledge of what frequencies contain signal and noise and that these frequencies do not overlap, which is difficult to achieve in practice. To address these issues, we… ▽ More

    Submitted 13 April, 2022; v1 submitted 13 November, 2021; originally announced November 2021.

  18. arXiv:2105.06988  [pdf, other

    cs.CV

    Automatic Non-Linear Video Editing Transfer

    Authors: Nathan Frey, Peggy Chi, Weilong Yang, Irfan Essa

    Abstract: We propose an automatic approach that extracts editing styles in a source video and applies the edits to matched footage for video creation. Our Computer Vision based techniques considers framing, content type, playback speed, and lighting of each input video segment. By applying a combination of these features, we demonstrate an effective method that automatically transfers the visual and tempora… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: Published to AI for Content Creation Workshop at CVPR 2021

    Journal ref: AI for Content Creation Workshop at CVPR 2021

  19. arXiv:2006.01075  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el physics.comp-ph

    High-throughput search for magnetic and topological order in transition metal oxides

    Authors: Nathan C. Frey, Matthew K. Horton, Jason M. Munro, Sinéad M. Griffin, Kristin A. Persson, Vivek B. Shenoy

    Abstract: The discovery of intrinsic magnetic topological order in $\rm MnBi_2Te_4$ has invigorated the search for materials with coexisting magnetic and topological phases. These multi-order quantum materials are expected to exhibit new topological phases that can be tuned with magnetic fields, but the search for such materials is stymied by difficulties in predicting magnetic structure and stability. Here… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: 9 pages, 6 figures

    Journal ref: Science Advances 09 Dec 2020: Vol. 6, no. 50, eabd1076

  20. arXiv:1712.02003  [pdf, other

    q-fin.ST physics.soc-ph

    Universal fluctuations in growth dynamics of economic systems

    Authors: Nathan C. Frey, Sakib Matin, H. Eugene Stanley, Michael Salinger

    Abstract: The growth of business firms is an example of a system of complex interacting units that resembles complex interacting systems in nature such as earthquakes. Remarkably, work in econophysics has provided evidence that the statistical properties of the growth of business firms follow the same sorts of power laws that characterize physical systems near their critical points. Given how economies chan… ▽ More

    Submitted 21 May, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: 15 pages, 7 figures

    Journal ref: Scientific Reports 9, 713 (2019)

  21. Angular Distribution of Gamma-ray Bursts and Weak Lensing

    Authors: Liliya L. R. Williams, Natalie Frey

    Abstract: We investigate whether Gamma-Ray Bursts (GRBs) from the Current BATSE Catalog have been affected by weak lensing by the nearby large scale structure. The redshift distribution of GRBs is believed to be broad, extending to z~5, so most events can be assumed to be at large redshifts, and hence subject to weak lensing, which would betray itself as projected (anti-)correlations between GRB events an… ▽ More

    Submitted 29 October, 2002; originally announced October 2002.

    Comments: 29 pages, incl 8 figs and 1 table; accepted to ApJ

    Journal ref: Astrophys.J.583:594-605,2003