Faster and more diverse de novo molecular optimization with double-loop reinforcement learning using augmented SMILES

Bjerrum, Esben Jannik; Margreitter, Christian; Blaschke, Thomas; López-Ríos de Castro, Raquel

doi:10.1007/s10822-023-00512-6

[Submitted on 22 Oct 2022 (v1), last revised 3 Mar 2023 (this version, v2)]

Authors:Esben Jannik Bjerrum, Christian Margreitter, Thomas Blaschke, Raquel López-Ríos de Castro

Abstract: Using generative deep learning models and reinforcement learning together can effectively generate new molecules with desired properties. By employing a multi-objective scoring function, thousands of high-scoring molecules can be generated, making this approach useful for drug discovery and materials science. However, the application of these methods can be hindered by computationally expensive or time-consuming scoring procedures, particularly when a large number of function calls are required as feedback in the reinforcement learning optimization. Here, we propose the use of double-loop reinforcement learning with simplified molecular-input line-entry system (SMILES) augmentation to improve the efficiency and speed of the optimization. By adding an inner loop that augments the generated SMILES strings to non-canonical SMILES for use in additional reinforcement learning rounds, we can both reuse the scoring calculations on the molecular level, thereby speeding up the learning process, and offer additional protection against mode collapse. We find that employing between 5 and 10 augmentation repetitions is optimal for the scoring functions tested and is further associated with an increased diversity in the generated compounds, improved reproducibility of the sampling runs and the generation of molecules of higher similarity to known ligands.
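
The control flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function `double_loop` and its parameters `sample_batch`, `score`, `augment`, and `update` are hypothetical names introduced here, and the sketch assumes that the originally sampled strings are also used for one update before the augmented rounds. The point it shows is only the bookkeeping: each molecule is scored once per outer iteration, and those scores are reused for every augmented SMILES in the inner loop.

```python
from typing import Callable, List, Tuple


def double_loop(
    sample_batch: Callable[[], List[str]],
    score: Callable[[str], float],
    augment: Callable[[str], str],
    update: Callable[[List[Tuple[str, float]]], None],
    n_outer: int,
    n_augment: int,
) -> int:
    """Run a double loop and return the number of scoring-function calls.

    All four callables are placeholders (assumptions of this sketch):
    a generative model sampler, an expensive scoring function, a SMILES
    augmenter, and a reinforcement learning update step.
    """
    score_calls = 0
    for _ in range(n_outer):
        smiles_batch = sample_batch()

        # Expensive step: each sampled molecule is scored exactly once.
        scores = []
        for smi in smiles_batch:
            scores.append(score(smi))
            score_calls += 1

        # Outer-loop update on the strings as sampled.
        update(list(zip(smiles_batch, scores)))

        # Inner loop: rewrite the same molecules as alternative SMILES
        # and reuse the cached scores, with no further scoring calls.
        for _ in range(n_augment):
            augmented = [augment(smi) for smi in smiles_batch]
            update(list(zip(augmented, scores)))
    return score_calls


# Toy stand-ins, chosen only so the sketch runs without a chemistry toolkit.
# Reversing the string yields a valid alternative SMILES only for unbranched,
# ring-free chains of single-letter atoms (e.g. "CCO" and "OCC" are both ethanol).
updates: List[List[Tuple[str, float]]] = []
calls = double_loop(
    sample_batch=lambda: ["CCO", "CC=O"],
    score=lambda smi: float(len(smi)),  # dummy score, not a real objective
    augment=lambda smi: smi[::-1],
    update=updates.append,
    n_outer=3,
    n_augment=5,
)
print(calls, len(updates))
```

With 3 outer iterations, a batch of 2 molecules, and 5 augmentation rounds, the toy run makes 6 scoring calls while performing 18 update steps. In practice the augmentation step would come from a cheminformatics toolkit rather than string reversal.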

Comments:	25 pages and 18 Figures. Supplementary material included
Subjects:	Chemical Physics (physics.chem-ph); Machine Learning (cs.LG)
MSC classes:	68T07
ACM classes:	I.2.1; J.3
Cite as:	arXiv:2210.12458 [physics.chem-ph]
	(or arXiv:2210.12458v2 [physics.chem-ph] for this version)
	https://doi.org/10.48550/arXiv.2210.12458
Related DOI:	https://doi.org/10.1007/s10822-023-00512-6

Submission history

From: Esben Jannik Bjerrum
[v1] Sat, 22 Oct 2022 14:36:38 UTC (2,533 KB)
[v2] Fri, 3 Mar 2023 07:38:26 UTC (2,513 KB)
