In-Context Learning of Physical Properties: Few-Shot Adaptation to Out-of-Distribution Molecular Graphs

Authors: Grzegorz Kaszuba, Amirhossein D. Naghdi, Dario Massa, Stefanos Papanikolaou, Andrzej Jaszkiewicz, Piotr Sankowski

Abstract: Large language models manifest the ability of few-shot adaptation to a sequence of provided examples. This behavior, known as in-context learning, allows for performing nontrivial machine learning tasks during inference only. In this work, we address the question: can we leverage in-context learning to predict out-of-distribution materials properties? However, this would not be possible for structure-property prediction tasks unless an effective method is found to pass atomic-level geometric features to the transformer model. To address this problem, we employ a compound model in which GPT-2 acts on the output of geometry-aware graph neural networks to adapt in-context information. To demonstrate our model's capabilities, we partition the QM9 dataset into sequences of molecules that share a common substructure and use them for in-context learning. This approach significantly improves the performance of the model on out-of-distribution examples, surpassing that of general graph neural network models.

Submitted 3 June, 2024; originally announced June 2024.

Comments: 12 pages, 4 figures

arXiv:2312.16073 [pdf, other]

Compositional Search of Stable Crystalline Structures in Multi-Component Alloys Using Generative Diffusion Models

Authors: Grzegorz Kaszuba, Amirhossein Naghdi Dorabati, Stefanos Papanikolaou, Andrzej Jaszkiewicz, Piotr Sankowski

Abstract: Exploring the vast composition space of multi-component alloys presents a challenging task for both ab initio (first principles) and experimental methods due to the time-consuming procedures involved. This ultimately impedes the discovery of novel, stable materials that may display exceptional properties. Here, the Crystal Diffusion Variational Autoencoder (CDVAE) model is adapted to characterize the stable compositions of a well-studied multi-component alloy, NiFeCr, with two distinct crystalline phases known to be stable across its compositional space. To this end, novel extensions to CDVAE were proposed, enhancing the model's ability to reconstruct configurations from their latent space within the test set by approximately 30%, which increases the model's probability of discovering new materials when dealing with various crystalline structures. Afterwards, the new model is applied to materials generation, demonstrating excellent agreement in identifying stable configurations within the ternary phase space when compared to first principles data. Finally, a computationally efficient framework for inverse design is proposed, employing Molecular Dynamics (MD) simulations of multi-component alloys with reliable interatomic potentials, enabling the optimization of material properties across the phase space.

Submitted 26 December, 2023; originally announced December 2023.

arXiv:2106.03275 [pdf, other]

What if we Increase the Number of Objectives? Theoretical and Empirical Implications for Many-objective Optimization

Authors: Richard Allmendinger, Andrzej Jaszkiewicz, Arnaud Liefooghe, Christiane Tammer

Abstract: The difficulty of solving a multi-objective optimization problem is impacted by the number of objectives to be optimized. The presence of many objectives typically introduces a number of challenges that affect the choice/design of optimization algorithms. This paper investigates the drivers of these challenges from two angles: (i) the influence of the number of objectives on problem characteristics and (ii) the practical behavior of commonly used procedures and algorithms for coping with many objectives. In addition to reviewing various drivers, the paper makes theoretical contributions by quantifying some drivers and/or verifying these drivers empirically by carrying out experiments on multi-objective NK landscapes and other typical benchmarks. We then make use of our theoretical and empirical findings to derive practical recommendations to support algorithm design. Finally, we discuss remaining theoretical gaps and opportunities for future research in the area of multi- and many-objective optimization.

Submitted 6 June, 2021; originally announced June 2021.

arXiv:2012.00673 [pdf, other]

Modified Dorfman procedure for pool tests with dilution -- COVID-19 case study

Authors: Andrzej Jaszkiewicz

Abstract: The outbreak of the global COVID-19 pandemic results in unprecedented demand for fast and efficient testing of large numbers of patients for the presence of SARS-CoV-2 coronavirus. Besides technical improvements of the cost and speed of individual tests, pool testing may be used to improve efficiency and throughput of a population test. The Dorfman pool testing procedure is one of the best known and studied methods of this kind. This procedure is, however, based on unrealistic assumptions that the pool test has perfect sensitivity and the only objective is to minimize the number of tests, and is not well adapted to the case of imperfect pool tests. We propose and analyze a simple modification of this procedure in which the test of a pool with a negative result is independently repeated up to several times. The proposed procedure is evaluated in a computational study using recent data about the dilution effect for SARS-CoV-2 PCR tests, showing that the proposed approach significantly reduces the number of false negatives with a relatively small increase of the number of tests, especially for small prevalence rates. For example, for prevalence rate 0.001 the number of tests could be reduced to 22.1% of individual tests, increasing the expected number of false negatives by no more than 1%, and to 16.8% of individual tests increasing the expected number of false negatives by no more than 10%. At the same time, a similar reduction of the expected number of tests in the standard Dorfman procedure would yield 675% and 821% increase of the expected number of false negatives, respectively. This makes the proposed procedure an interesting choice for screening tests in the case of diseases like COVID-19.

Submitted 15 December, 2020; v1 submitted 30 November, 2020; originally announced December 2020.
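
For orientation, the standard Dorfman procedure that the abstract above takes as its baseline can be sketched in a few lines. This is a minimal illustration only, assuming a perfect test and independent infections, which are exactly the idealizations the abstract criticizes. It does not model dilution, imperfect sensitivity, or the repeated negative pool tests the paper proposes. The function names are hypothetical.

```python
def dorfman_tests_per_person(prevalence: float, pool_size: int) -> float:
    """Expected tests per person in the classic two-stage Dorfman scheme.

    Assumes a perfect test and independent infections (illustrative only).
    """
    if pool_size < 2:
        return 1.0  # no pooling: one individual test per person
    p_pool_negative = (1.0 - prevalence) ** pool_size
    # One pooled test is shared by pool_size people; if the pool is
    # positive, every member is then tested individually.
    return 1.0 / pool_size + (1.0 - p_pool_negative)


def best_pool_size(prevalence: float, max_pool_size: int = 200) -> int:
    """Pool size (2..max_pool_size) minimizing expected tests per person."""
    return min(
        range(2, max_pool_size + 1),
        key=lambda n: dorfman_tests_per_person(prevalence, n),
    )


if __name__ == "__main__":
    n = best_pool_size(0.001)
    print(n, dorfman_tests_per_person(0.001, n))
```

Under these idealized assumptions the expected cost per person is 1/n + 1 - (1 - p)^n for pool size n and prevalence p. The percentages quoted in the abstract come from a model with imperfect pool tests and bounds on false negatives, so they are not directly comparable to this sketch.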

arXiv:1707.07899 [pdf, other]

Many-Objective Pareto Local Search

Authors: Andrzej Jaszkiewicz

Abstract: We propose a new Pareto Local Search algorithm for many-objective combinatorial optimization. Pareto Local Search proved to be a very effective tool in the case of bi-objective combinatorial optimization and it was used in a number of state-of-the-art algorithms for problems of this kind. On the other hand, the standard Pareto Local Search algorithm becomes very inefficient for problems with more than two objectives. We build an effective Many-Objective Pareto Local Search algorithm using three new mechanisms: the efficient update of large Pareto archives with the ND-Tree data structure, a new mechanism for the selection of promising solutions for neighborhood exploration, and a partial exploration of the neighborhoods. We apply the proposed algorithm to instances of two different problems, i.e., the traveling salesperson problem and the traveling salesperson problem with profits, with up to 5 objectives, showing high effectiveness of the proposed algorithm.

Submitted 14 December, 2017; v1 submitted 25 July, 2017; originally announced July 2017.

arXiv:1703.09469 [pdf, other]

Experimental Analysis of Design Elements of Scalarizing Functions-based Multiobjective Evolutionary Algorithms

Authors: Mansoureh Aghabeig, Andrzej Jaszkiewicz

Abstract: In this paper we systematically study the importance, i.e., the influence on performance, of the main design elements that differentiate scalarizing functions-based multiobjective evolutionary algorithms (MOEAs). This class of MOEAs includes Multiobjective Genetic Local Search (MOGLS) and Multiobjective Evolutionary Algorithm Based on Decomposition (MOEA/D) and proved to be very successful in multiple computational experiments and practical applications. The two algorithms share a common structure and differ only in two main aspects. Using three different multiobjective combinatorial optimization problems, i.e., the multiobjective symmetric traveling salesperson problem, the traveling salesperson problem with profits, and the multiobjective set covering problem, we show that the main differentiating design element is the mechanism for parent selection, while the selection of weight vectors, either random or uniformly distributed, is practically negligible if the number of uniform weight vectors is sufficiently large.

Submitted 28 March, 2017; originally announced March 2017.

arXiv:1612.03402 [pdf, other]

Improved Quick Hypervolume Algorithm

Authors: Andrzej Jaszkiewicz

Abstract: In this paper, we present a significant improvement of the Quick Hypervolume algorithm, one of the state-of-the-art algorithms for calculating the exact hypervolume of the space dominated by a set of d-dimensional points. This value is often used as a quality indicator in multiobjective evolutionary algorithms and other multiobjective metaheuristics, and the efficiency of calculating this indicator is of crucial importance, especially in the case of large sets or many-dimensional objective spaces. We use a divide and conquer scheme similar to that of the original Quick Hypervolume algorithm, but in our algorithm we split the problem into smaller sub-problems in a different way. Through both theoretical analysis and computational study we show that our approach improves the computational complexity of the algorithm and its practical running times.

Submitted 11 August, 2017; v1 submitted 11 December, 2016; originally announced December 2016.
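
To make the hypervolume indicator mentioned in the abstract above concrete, here is a minimal sketch for the two-objective case only. It is a plain sort-and-sweep, not the Quick Hypervolume algorithm or its improved variant, and it assumes minimization with a user-chosen reference point. The function name and the sample points are hypothetical.

```python
from typing import Iterable, Tuple

Point = Tuple[float, float]


def hypervolume_2d(points: Iterable[Point], ref: Point) -> float:
    """Area dominated by `points` and bounded by `ref` (minimization).

    Simple sweep over points sorted by the first objective; points that
    do not strictly improve on the reference point are ignored.
    """
    candidates = sorted(p for p in points if p[0] < ref[0] and p[1] < ref[1])
    area = 0.0
    best_y = ref[1]  # lowest second objective seen so far in the sweep
    for x, y in candidates:
        if y < best_y:
            # This point adds a new rectangular slab below the previous best.
            area += (ref[0] - x) * (best_y - y)
            best_y = y
    return area


if __name__ == "__main__":
    front = [(1.0, 3.0), (2.0, 2.0), (3.0, 1.0), (3.0, 3.0)]
    print(hypervolume_2d(front, ref=(4.0, 4.0)))
```

In two dimensions the sort dominates the cost. The difficulty the abstract addresses arises for larger d, where no such simple sweep exists.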

arXiv:1603.04798 [pdf, other]

ND-Tree-based update: a Fast Algorithm for the Dynamic Non-Dominance Problem

Authors: Andrzej Jaszkiewicz, Thibaut Lust

Abstract: In this paper we propose a new method called ND-Tree-based update (ND-Tree for short) for the dynamic non-dominance problem, i.e., the problem of online update of a Pareto archive composed of mutually non-dominated points. It uses a new ND-Tree data structure in which each node represents a subset of points contained in a hyperrectangle defined by its local approximate ideal and nadir points. By building subsets containing points located close together in the objective space and using basic properties of the local ideal and nadir points, we can efficiently avoid searching many branches in the tree. ND-Tree may be used in multiobjective evolutionary algorithms and other multiobjective metaheuristics to update an archive of potentially non-dominated points. We prove that the proposed algorithm has sub-linear time complexity under mild assumptions. We experimentally compare ND-Tree to the simple list, Quad-tree, and M-Front methods using artificial and realistic benchmarks with up to 10 objectives and show that with this new method a substantial reduction of the number of point comparisons and computational time can be obtained. Furthermore, we apply the method to the non-dominated sorting problem, showing that it is highly competitive with some recently proposed algorithms dedicated to this problem.

Submitted 7 November, 2017; v1 submitted 15 March, 2016; originally announced March 2016.

Comments: 15 pages, 21 figures, 3 tables
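
The dynamic non-dominance problem described in the abstract above can be illustrated with the simple list method it names as a baseline. This is a minimal sketch assuming minimization of all objectives. It is not the ND-Tree structure itself, and the function names are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if `a` is no worse than `b` everywhere and better somewhere."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def update_archive(archive: List[Point], candidate: Point) -> bool:
    """Try to add `candidate` to a list of mutually non-dominated points.

    Returns False (archive unchanged) if the candidate is dominated by,
    or equal to, an archived point. Otherwise removes every archived
    point the candidate dominates, appends it, and returns True.
    """
    for p in archive:
        if p == candidate or dominates(p, candidate):
            return False
    archive[:] = [p for p in archive if not dominates(candidate, p)]
    archive.append(candidate)
    return True


if __name__ == "__main__":
    archive: List[Point] = []
    for point in [(2.0, 2.0), (1.0, 3.0), (3.0, 3.0), (1.0, 1.0)]:
        print(point, update_archive(archive, point), archive)
```

Each update here compares the candidate against every archived point, which is linear in the archive size. The sub-linear bound claimed in the abstract is obtained by pruning whole branches using the local ideal and nadir points, which this sketch does not attempt.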

Showing 1–8 of 8 results for author: Jaszkiewicz, A