-
AdsorbML: A Leap in Efficiency for Adsorption Energy Calculations using Generalizable Machine Learning Potentials
Authors:
Janice Lan,
Aini Palizhati,
Muhammed Shuaibi,
Brandon M. Wood,
Brook Wander,
Abhishek Das,
Matt Uyttendaele,
C. Lawrence Zitnick,
Zachary W. Ulissi
Abstract:
Computational catalysis is playing an increasingly significant role in the design of catalysts across a wide range of applications. A common task for many computational methods is the need to accurately compute the adsorption energy for an adsorbate and a catalyst surface of interest. Traditionally, the identification of low energy adsorbate-surface configurations relies on heuristic methods and r…
▽ More
Computational catalysis is playing an increasingly significant role in the design of catalysts across a wide range of applications. A common task for many computational methods is the need to accurately compute the adsorption energy for an adsorbate and a catalyst surface of interest. Traditionally, the identification of low energy adsorbate-surface configurations relies on heuristic methods and researcher intuition. As the desire to perform high-throughput screening increases, it becomes challenging to use heuristics and intuition alone. In this paper, we demonstrate machine learning potentials can be leveraged to identify low energy adsorbate-surface configurations more accurately and efficiently. Our algorithm provides a spectrum of trade-offs between accuracy and efficiency, with one balanced option finding the lowest energy configuration 87.36% of the time, while achieving a 2000x speedup in computation. To standardize benchmarking, we introduce the Open Catalyst Dense dataset containing nearly 1,000 diverse surfaces and 100,000 unique configurations.
△ Less
Submitted 15 September, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Open Challenges in Develo** Generalizable Large Scale Machine Learning Models for Catalyst Discovery
Authors:
Adeesh Kolluru,
Muhammed Shuaibi,
Aini Palizhati,
Nima Shoghi,
Abhishek Das,
Brandon Wood,
C. Lawrence Zitnick,
John R Kitchin,
Zachary W Ulissi
Abstract:
The development of machine learned potentials for catalyst discovery has predominantly been focused on very specific chemistries and material compositions. While effective in interpolating between available materials, these approaches struggle to generalize across chemical space. The recent curation of large-scale catalyst datasets has offered the opportunity to build a universal machine learning…
▽ More
The development of machine learned potentials for catalyst discovery has predominantly been focused on very specific chemistries and material compositions. While effective in interpolating between available materials, these approaches struggle to generalize across chemical space. The recent curation of large-scale catalyst datasets has offered the opportunity to build a universal machine learning potential, spanning chemical and composition space. If accomplished, said potential could accelerate the catalyst discovery process across a variety of applications (CO2 reduction, NH3 production, etc.) without additional specialized training efforts that are currently required. The release of the Open Catalyst 2020 (OC20) has begun just that, pushing the heterogeneous catalysis and machine learning communities towards building more accurate and robust models. In this perspective, we discuss some of the challenges and findings of recent developments on OC20. We examine the performance of current models across different materials and adsorbates to identify notably underperforming subsets. We then discuss some of the modeling efforts surrounding energy-conservation, approaches to finding and evaluating the local minima, and augmentation of off-equilibrium data. To complement the community's ongoing developments, we end with an outlook to some of the important challenges that have yet to be thoroughly explored for large-scale catalyst discovery.
△ Less
Submitted 13 June, 2022; v1 submitted 4 June, 2022;
originally announced June 2022.
-
The Open Catalyst 2020 (OC20) Dataset and Community Challenges
Authors:
Lowik Chanussot,
Abhishek Das,
Siddharth Goyal,
Thibaut Lavril,
Muhammed Shuaibi,
Morgane Riviere,
Kevin Tran,
Javier Heras-Domingo,
Caleb Ho,
Weihua Hu,
Aini Palizhati,
Anuroop Sriram,
Brandon Wood,
Junwoong Yoon,
Devi Parikh,
C. Lawrence Zitnick,
Zachary Ulissi
Abstract:
Catalyst discovery and optimization is key to solving many societal and energy challenges including solar fuels synthesis, long-term energy storage, and renewable fertilizer production. Despite considerable effort by the catalysis community to apply machine learning models to the computational catalyst discovery process, it remains an open challenge to build models that can generalize across both…
▽ More
Catalyst discovery and optimization is key to solving many societal and energy challenges including solar fuels synthesis, long-term energy storage, and renewable fertilizer production. Despite considerable effort by the catalysis community to apply machine learning models to the computational catalyst discovery process, it remains an open challenge to build models that can generalize across both elemental compositions of surfaces and adsorbate identity/configurations, perhaps because datasets have been smaller in catalysis than related fields. To address this we developed the OC20 dataset, consisting of 1,281,040 Density Functional Theory (DFT) relaxations (~264,890,000 single point evaluations) across a wide swath of materials, surfaces, and adsorbates (nitrogen, carbon, and oxygen chemistries). We supplemented this dataset with randomly perturbed structures, short timescale molecular dynamics, and electronic structure analyses. The dataset comprises three central tasks indicative of day-to-day catalyst modeling and comes with pre-defined train/validation/test splits to facilitate direct comparisons with future model development efforts. We applied three state-of-the-art graph neural network models (CGCNN, SchNet, Dimenet++) to each of these tasks as baseline demonstrations for the community to build on. In almost every task, no upper limit on model size was identified, suggesting that even larger models are likely to improve on initial results. The dataset and baseline models are both provided as open resources, as well as a public leader board to encourage community contributions to solve these important tasks.
△ Less
Submitted 24 September, 2021; v1 submitted 19 October, 2020;
originally announced October 2020.
-
An Introduction to Electrocatalyst Design using Machine Learning for Renewable Energy Storage
Authors:
C. Lawrence Zitnick,
Lowik Chanussot,
Abhishek Das,
Siddharth Goyal,
Javier Heras-Domingo,
Caleb Ho,
Weihua Hu,
Thibaut Lavril,
Aini Palizhati,
Morgane Riviere,
Muhammed Shuaibi,
Anuroop Sriram,
Kevin Tran,
Brandon Wood,
Junwoong Yoon,
Devi Parikh,
Zachary Ulissi
Abstract:
Scalable and cost-effective solutions to renewable energy storage are essential to addressing the world's rising energy needs while reducing climate change. As we increase our reliance on renewable energy sources such as wind and solar, which produce intermittent power, storage is needed to transfer power from times of peak generation to peak demand. This may require the storage of power for hours…
▽ More
Scalable and cost-effective solutions to renewable energy storage are essential to addressing the world's rising energy needs while reducing climate change. As we increase our reliance on renewable energy sources such as wind and solar, which produce intermittent power, storage is needed to transfer power from times of peak generation to peak demand. This may require the storage of power for hours, days, or months. One solution that offers the potential of scaling to nation-sized grids is the conversion of renewable energy to other fuels, such as hydrogen or methane. To be widely adopted, this process requires cost-effective solutions to running electrochemical reactions. An open challenge is finding low-cost electrocatalysts to drive these reactions at high rates. Through the use of quantum mechanical simulations (density functional theory), new catalyst structures can be tested and evaluated. Unfortunately, the high computational cost of these simulations limits the number of structures that may be tested. The use of machine learning may provide a method to efficiently approximate these calculations, leading to new approaches in finding effective electrocatalysts. In this paper, we provide an introduction to the challenges in finding suitable electrocatalysts, how machine learning may be applied to the problem, and the use of the Open Catalyst Project OC20 dataset for model training.
△ Less
Submitted 14 October, 2020;
originally announced October 2020.