ChemCrow: Augmenting large-language models with chemistry tools
Authors:
Andres M Bran,
Sam Cox,
Oliver Schilter,
Carlo Baldassari,
Andrew D White,
Philippe Schwaller
Abstract:
Over the last decades, excellent computational chemistry tools have been developed. Integrating them into a single, more accessible platform could help them reach their full potential by overcoming steep learning curves. Recently, large language models (LLMs) have shown strong performance in tasks across domains, but they struggle with chemistry-related problems. Moreover, these models lack access to external knowledge sources, limiting their usefulness in scientific applications. In this study, we introduce ChemCrow, an LLM chemistry agent designed to accomplish tasks across organic synthesis, drug discovery, and materials design. By integrating 18 expert-designed tools, ChemCrow augments the LLM's performance in chemistry, and new capabilities emerge. Our agent autonomously planned and executed the syntheses of an insect repellent and three organocatalysts, and guided the discovery of a novel chromophore. Our evaluation, including both LLM and expert assessments, demonstrates ChemCrow's effectiveness in automating a diverse set of chemical tasks. Surprisingly, we find that GPT-4 as an evaluator cannot distinguish between clearly wrong GPT-4 completions and ChemCrow's performance. Our work not only aids expert chemists and lowers barriers for non-experts, but also fosters scientific advancement by bridging the gap between experimental and computational chemistry.
Submitted 2 October, 2023; v1 submitted 11 April, 2023;
originally announced April 2023.
Accelerating Material Design with the Generative Toolkit for Scientific Discovery
Authors:
Matteo Manica,
Jannis Born,
Joris Cadow,
Dimitrios Christofidellis,
Ashish Dave,
Dean Clarke,
Yves Gaetan Nana Teukam,
Giorgio Giannone,
Samuel C. Hoffman,
Matthew Buchan,
Vijil Chenthamarakshan,
Timothy Donovan,
Hsiang Han Hsu,
Federico Zipoli,
Oliver Schilter,
Akihiro Kishimoto,
Lisa Hamada,
Inkit Padhi,
Karl Wehden,
Lauren McHugh,
Alexy Khrabrov,
Payel Das,
Seiji Takeda,
John R. Smith
Abstract:
With the growing availability of data within various scientific domains, generative models hold enormous potential to accelerate scientific discovery. They harness powerful representations learned from datasets to speed up the formulation of novel hypotheses with the potential to impact material discovery broadly. We present the Generative Toolkit for Scientific Discovery (GT4SD). This extensible open-source library enables scientists, developers, and researchers to train and use state-of-the-art generative models to accelerate scientific discovery focused on material design.
Submitted 31 January, 2023; v1 submitted 8 July, 2022;
originally announced July 2022.