Search | arXiv e-print repository

Molecule-Edit Templates for Efficient and Accurate Retrosynthesis Prediction

Authors: Mikołaj Sacha, Michał Sadowski, Piotr Kozakowski, Ruard van Workum, Stanisław Jastrzębski

Abstract: Retrosynthesis involves determining a sequence of reactions to synthesize complex molecules from simpler precursors. As this poses a challenge in organic chemistry, machine learning has offered solutions, particularly for predicting possible reaction substrates for a given target molecule. These solutions mainly fall into template-based and template-free categories. The former is efficient but rel… ▽ More Retrosynthesis involves determining a sequence of reactions to synthesize complex molecules from simpler precursors. As this poses a challenge in organic chemistry, machine learning has offered solutions, particularly for predicting possible reaction substrates for a given target molecule. These solutions mainly fall into template-based and template-free categories. The former is efficient but relies on a vast set of predefined reaction patterns, while the latter, though more flexible, can be computationally intensive and less interpretable. To address these issues, we introduce METRO (Molecule-Edit Templates for RetrOsynthesis), a machine-learning model that predicts reactions using minimal templates - simplified reaction patterns capturing only essential molecular changes - reducing computational overhead and achieving state-of-the-art results on standard benchmarks. △ Less

Submitted 11 October, 2023; originally announced October 2023.

ACM Class: I.2.1; I.5.1

arXiv:2308.08162 [pdf, other]

Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

Authors: Mikołaj Sacha, Bartosz Jura, Dawid Rymarczyk, Łukasz Struski, Jacek Tabor, Bartosz Zieliński

Abstract: Prototypical parts-based networks are becoming increasingly popular due to their faithful self-explanations. However, their similarity maps are calculated in the penultimate network layer. Therefore, the receptive field of the prototype activation region often depends on parts of the image outside this region, which can lead to misleading interpretations. We name this undesired behavior a spatial… ▽ More Prototypical parts-based networks are becoming increasingly popular due to their faithful self-explanations. However, their similarity maps are calculated in the penultimate network layer. Therefore, the receptive field of the prototype activation region often depends on parts of the image outside this region, which can lead to misleading interpretations. We name this undesired behavior a spatial explanation misalignment and introduce an interpretability benchmark with a set of dedicated metrics for quantifying this phenomenon. In addition, we propose a method for misalignment compensation and apply it to existing state-of-the-art models. We show the expressiveness of our benchmark and the effectiveness of the proposed compensation methodology through extensive empirical studies. △ Less

Submitted 16 August, 2023; originally announced August 2023.

Comments: Under review. Code will be release upon acceptance

arXiv:2301.12276 [pdf, other]

ProtoSeg: Interpretable Semantic Segmentation with Prototypical Parts

Authors: Mikołaj Sacha, Dawid Rymarczyk, Łukasz Struski, Jacek Tabor, Bartosz Zieliński

Abstract: We introduce ProtoSeg, a novel model for interpretable semantic image segmentation, which constructs its predictions using similar patches from the training set. To achieve accuracy comparable to baseline methods, we adapt the mechanism of prototypical parts and introduce a diversity loss function that increases the variety of prototypes within each class. We show that ProtoSeg discovers semantic… ▽ More We introduce ProtoSeg, a novel model for interpretable semantic image segmentation, which constructs its predictions using similar patches from the training set. To achieve accuracy comparable to baseline methods, we adapt the mechanism of prototypical parts and introduce a diversity loss function that increases the variety of prototypes within each class. We show that ProtoSeg discovers semantic concepts, in contrast to standard segmentation models. Experiments conducted on Pascal VOC and Cityscapes datasets confirm the precision and transparency of the presented method. △ Less

Submitted 28 January, 2023; originally announced January 2023.

Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 1481-1492

arXiv:2006.15426 [pdf, other]

Molecule Edit Graph Attention Network: Modeling Chemical Reactions as Sequences of Graph Edits

Authors: Mikołaj Sacha, Mikołaj Błaż, Piotr Byrski, Paweł Dąbrowski-Tumański, Mikołaj Chromiński, Rafał Loska, Paweł Włodarczyk-Pruszyński, Stanisław Jastrzębski

Abstract: The central challenge in automated synthesis planning is to be able to generate and predict outcomes of a diverse set of chemical reactions. In particular, in many cases, the most likely synthesis pathway cannot be applied due to additional constraints, which requires proposing alternative chemical reactions. With this in mind, we present Molecule Edit Graph Attention Network (MEGAN), an end-to-en… ▽ More The central challenge in automated synthesis planning is to be able to generate and predict outcomes of a diverse set of chemical reactions. In particular, in many cases, the most likely synthesis pathway cannot be applied due to additional constraints, which requires proposing alternative chemical reactions. With this in mind, we present Molecule Edit Graph Attention Network (MEGAN), an end-to-end encoder-decoder neural model. MEGAN is inspired by models that express a chemical reaction as a sequence of graph edits, akin to the arrow pushing formalism. We extend this model to retrosynthesis prediction (predicting substrates given the product of a chemical reaction) and scale it up to large datasets. We argue that representing the reaction as a sequence of edits enables MEGAN to efficiently explore the space of plausible chemical reactions, maintaining the flexibility of modeling the reaction in an end-to-end fashion, and achieving state-of-the-art accuracy in standard benchmarks. Code and trained models are made available online at https://github.com/molecule-one/megan. △ Less

Submitted 25 May, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

arXiv:2004.09545 [pdf]

Influence of COVID-19 confinement in students performance in higher education

Authors: T. Gonzalez, M. A. de la Rubia, K. P. Hincz, M. Comas-Lopez, L. Subirats, S. Fort, G. M. Sacha

Abstract: This study explores the effects of COVID-19 confinement in the students performance in higher education. Using a field experiment of 458 students from three different subjects in Universidad Autonoma de Madrid (Spain), we study the differences in assessments by dividing students into two groups. The first group (control) corresponds to academic years 2017/2018 and 2018/2019. The second group (expe… ▽ More This study explores the effects of COVID-19 confinement in the students performance in higher education. Using a field experiment of 458 students from three different subjects in Universidad Autonoma de Madrid (Spain), we study the differences in assessments by dividing students into two groups. The first group (control) corresponds to academic years 2017/2018 and 2018/2019. The second group (experimental) corresponds to students from 2019/2020, which is the group of students that interrupted their face-to-face activities because of the confinement. The results show that there is a significant positive effect of the COVID-19 confinement on students performance. This effect is also significative in activities that did not change their format when performed after the confinement. We find that this effect is significative both in subjects that increased the number of assessment activities and subjects that did not change the workload of students. Additionally, an analysis of students learning strategies before confinement shows that students did not study in a continuous basis. Based on these results, we conclude that COVID-19 confinement changed students learning strategies to a more continuous habit, improving their efficiency. For these reasons, better scores in students assessment are expected due to COVID-19 confinement that can be explained by an improvement in their learning performance. △ Less

Submitted 20 April, 2020; originally announced April 2020.

arXiv:1404.5144 [pdf]

Influence of the learning method in the performance of feedforward neural networks when the activity of neurons is modified

Authors: M. Konomi, G. M. Sacha

Abstract: A method that allows us to give a different treatment to any neuron inside feedforward neural networks is presented. The algorithm has been implemented with two very different learning methods: a standard Back-propagation (BP) procedure and an evolutionary algorithm. First, we have demonstrated that the EA training method converges faster and gives more accurate results than BP. Then we have made… ▽ More A method that allows us to give a different treatment to any neuron inside feedforward neural networks is presented. The algorithm has been implemented with two very different learning methods: a standard Back-propagation (BP) procedure and an evolutionary algorithm. First, we have demonstrated that the EA training method converges faster and gives more accurate results than BP. Then we have made a full analysis of the effects of turning off different combinations of neurons after the training phase. We demonstrate that EA is much more robust than BP for all the cases under study. Even in the case when two hidden neurons are lost, EA training is still able to give good average results. This difference implies that we must be very careful when pruning or redundancy effects are being studied since the network performance when losing neurons strongly depends on the training method. Moreover, the influence of the individual inputs will also depend on the training algorithm. Since EA keeps a good classification performance when units are lost, this method could be a good way to simulate biological learning systems since they must be robust against deficient neuron performance. Although biological systems are much more complex than the simulations shown in this article, we propose that a smart training strategy such as the one shown here could be considered as a first protection against the losing of a certain number of neurons. △ Less

Submitted 21 April, 2014; originally announced April 2014.

arXiv:1403.1465 [pdf]

Adaptive Model for Computer-Assisted Assessment in Programming Skills

Authors: P. Molins-Ruano, C. González-Sacristán, F. Díez, P. Rodriguez, G. M. Sacha

Abstract: In this work, we show a methodology aimed to improve the quality of the assessment process for subjects related to basic programming. The method takes into account the relevance of the items and the students answers to follow different paths to improve the accuracy of the assessment process. We have developed numerical simulations and experiments with real students that demonstrate the advantages… ▽ More In this work, we show a methodology aimed to improve the quality of the assessment process for subjects related to basic programming. The method takes into account the relevance of the items and the students answers to follow different paths to improve the accuracy of the assessment process. We have developed numerical simulations and experiments with real students that demonstrate the advantages of this model when compared with traditional evaluation tools. This method improves the objectiveness and takes into account the relevance of the subject contents. We also demonstrate that the architecture of the algorithm is fully compatible with traditional multiple choice test formalisms. Our results can be directly used in computer-assisted tests for different subjects and disciplines, as well as used by the students as a self-evaluation tool with the objective of correcting their deficiencies in the learning process. △ Less

Submitted 6 March, 2014; originally announced March 2014.

Comments: 7 pages, 4 figures, 1 table

Showing 1–7 of 7 results for author: Sacha, M