Showing 1–10 of 10 results for author: Miret, S

Searching in archive cond-mat.
  1. arXiv:2406.17295  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    MatText: Do Language Models Need More than Text & Scale for Materials Modeling?

    Authors: Nawaf Alampara, Santiago Miret, Kevin Maik Jablonka

    Abstract: Effectively representing materials as text has the potential to leverage the vast advancements of large language models (LLMs) for discovering new materials. While LLMs have shown remarkable success in various domains, their application to materials science remains underexplored. A fundamental challenge is the lack of understanding of how to best utilize text-based representations for materials mo…

    Submitted 28 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2404.01475  [pdf, other]

    cs.LG cond-mat.mtrl-sci cs.AI physics.chem-ph

    Are large language models superhuman chemists?

    Authors: Adrian Mirza, Nawaf Alampara, Sreekanth Kunchapu, Benedict Emoekabu, Aswanth Krishnan, Mara Wilhelmi, Macjonathan Okereke, Juliane Eberhardt, Amir Mohammad Elahi, Maximilian Greiner, Caroline T. Holick, Tanya Gupta, Mehrdad Asgari, Christina Glaubitz, Lea C. Klepsch, Yannik Köster, Jakob Meyer, Santiago Miret, Tim Hoffmann, Fabian Alexander Kreth, Michael Ringleb, Nicole Roesner, Ulrich S. Schubert, Leanne M. Stafast, Dinga Wonanke, et al. (3 additional authors not shown)

    Abstract: Large language models (LLMs) have gained widespread interest due to their ability to process human language and perform tasks on which they have not been explicitly trained. This is relevant for the chemical sciences, which face the problem of small and diverse datasets that are frequently in the form of text. LLMs have shown promise in addressing these issues and are increasingly being harnessed…

    Submitted 1 April, 2024; originally announced April 2024.

  3. arXiv:2402.05200  [pdf, other]

    cond-mat.mtrl-sci cs.AI cs.CL cs.LG

    Are LLMs Ready for Real-World Materials Discovery?

    Authors: Santiago Miret, N M Anoop Krishnan

    Abstract: Large Language Models (LLMs) create exciting possibilities for powerful language processing tools to accelerate research in materials science. While LLMs have great potential to accelerate materials understanding and discovery, they currently fall short in being practical materials science tools. In this position paper, we show relevant failure cases of LLMs in materials science that reveal curren…

    Submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2310.08511  [pdf, other]

    cs.CL cond-mat.mtrl-sci cs.AI

    HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science

    Authors: Yu Song, Santiago Miret, Huan Zhang, Bang Liu

    Abstract: We propose an instruction-based process for trustworthy data curation in materials science (MatSci-Instruct), which we then apply to finetune a LLaMa-based language model targeted for materials science (HoneyBee). MatSci-Instruct helps alleviate the scarcity of relevant, high-quality materials science textual data available in the open literature, and HoneyBee is the first billion-parameter langua…

    Submitted 12 October, 2023; originally announced October 2023.

  5. arXiv:2310.07864  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    Towards Foundation Models for Materials Science: The Open MatSci ML Toolkit

    Authors: Kin Long Kelvin Lee, Carmelo Gonzales, Matthew Spellings, Mikhail Galkin, Santiago Miret, Nalini Kumar

    Abstract: Artificial intelligence and machine learning have shown great promise in their ability to accelerate novel materials discovery. As researchers and domain scientists seek to unify and consolidate chemical knowledge, the case for models with potential to generalize across different tasks within materials science - so-called "foundation models" - grows with ambitions. This manuscript reviews our rece…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 17 pages, 7 figures, 1 table. Accepted paper/presentation at the AI4Science workshop at Super Computing '23

  6. arXiv:2310.02902  [pdf, other]

    cs.LG cond-mat.mtrl-sci cs.AI

    Searching for High-Value Molecules Using Reinforcement Learning and Transformers

    Authors: Raj Ghugare, Santiago Miret, Adriana Hugessen, Mariano Phielipp, Glen Berseth

    Abstract: Reinforcement learning (RL) over text representations can be effective for finding high-value policies that can search over graphs. However, RL requires careful structuring of the search space and algorithm design to be effective in this challenge. Through extensive experiments, we explore how different design choices for text grammar and algorithmic choices for training can affect an RL policy's…

    Submitted 4 October, 2023; originally announced October 2023.

  7. arXiv:2310.02428  [pdf, other]

    cs.LG cond-mat.mtrl-sci

    EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations

    Authors: Vaibhav Bihani, Utkarsh Pratiush, Sajid Mannan, Tao Du, Zhimin Chen, Santiago Miret, Matthieu Micoulaut, Morten M Smedskjaer, Sayan Ranu, N M Anoop Krishnan

    Abstract: Equivariant graph neural networks force fields (EGraFFs) have shown great promise in modelling complex interactions in atomic systems by exploiting the graphs' inherent symmetries. Recent works have led to a surge in the development of novel architectures that incorporate equivariance-based inductive biases alongside architectural innovations like graph transformers and message passing to model at…

    Submitted 24 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  8. arXiv:2309.05934  [pdf, other]

    cond-mat.mtrl-sci cs.AI

    MatSciML: A Broad, Multi-Task Benchmark for Solid-State Materials Modeling

    Authors: Kin Long Kelvin Lee, Carmelo Gonzales, Marcel Nassar, Matthew Spellings, Mikhail Galkin, Santiago Miret

    Abstract: We propose MatSci ML, a novel benchmark for modeling MATerials SCIence using Machine Learning (MatSci ML) methods focused on solid-state materials with periodic crystal structures. Applying machine learning methods to solid-state materials is a nascent field with substantial fragmentation largely driven by the great variety of datasets used to develop machine learning models. This fragmentation ma…

    Submitted 11 September, 2023; originally announced September 2023.

  9. arXiv:2305.08264  [pdf, other]

    cs.CL cond-mat.mtrl-sci cs.AI

    MatSci-NLP: Evaluating Scientific Language Models on Materials Science Language Tasks Using Text-to-Schema Modeling

    Authors: Yu Song, Santiago Miret, Bang Liu

    Abstract: We present MatSci-NLP, a natural language benchmark for evaluating the performance of natural language processing (NLP) models on materials science text. We construct the benchmark from publicly available materials science text data to encompass seven different NLP tasks, including conventional NLP tasks like named entity recognition and relation classification, as well as NLP tasks specific to ma…

    Submitted 14 May, 2023; originally announced May 2023.

  10. arXiv:2210.17484  [pdf, other]

    cs.LG cond-mat.mtrl-sci cs.AI

    The Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science

    Authors: Santiago Miret, Kin Long Kelvin Lee, Carmelo Gonzales, Marcel Nassar, Matthew Spellings

    Abstract: We present the Open MatSci ML Toolkit: a flexible, self-contained, and scalable Python-based framework to apply deep learning models and methods on scientific data with a specific focus on materials science and the OpenCatalyst Dataset. Our toolkit provides: 1. A scalable machine learning workflow for materials science leveraging PyTorch Lightning, which enables seamless scaling across different c…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Paper accompanying Open-Source Software from https://github.com/IntelLabs/matsciml

    Report number: 2835-8856

    Journal ref: Transactions on Machine Learning Research (2023)