Search | arXiv e-print repository

Quantum Theory and Application of Contextual Optimal Transport

Authors: Nicola Mariella, Albert Akhriev, Francesco Tacchino, Christa Zoufal, Juan Carlos Gonzalez-Espitia, Benedek Harsanyi, Eugene Koskin, Ivano Tavernelli, Stefan Woerner, Marianna Rapsomaniki, Sergiy Zhuk, Jannis Born

Abstract: Optimal Transport (OT) has fueled machine learning (ML) across many domains. When paired data measurements $(\boldsymbolμ, \boldsymbolν)$ are coupled to covariates, a challenging conditional distribution learning setting arises. Existing approaches for learning a $\textit{global}$ transport map parameterized through a potentially unseen context utilize Neural OT and largely rely on Brenier's theor… ▽ More Optimal Transport (OT) has fueled machine learning (ML) across many domains. When paired data measurements $(\boldsymbolμ, \boldsymbolν)$ are coupled to covariates, a challenging conditional distribution learning setting arises. Existing approaches for learning a $\textit{global}$ transport map parameterized through a potentially unseen context utilize Neural OT and largely rely on Brenier's theorem. Here, we propose a first-of-its-kind quantum computing formulation for amortized optimization of contextualized transportation plans. We exploit a direct link between doubly stochastic matrices and unitary operators thus unravelling a natural connection between OT and quantum computation. We verify our method (QontOT) on synthetic and real data by predicting variations in cell type distributions conditioned on drug dosage. Importantly we conduct a 24-qubit hardware experiment on a task challenging for classical computers and report a performance that cannot be matched with our classical neural OT approach. In sum, this is a first step toward learning to predict contextualized transportation plans through quantum computing. △ Less

Submitted 3 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: ICML 2024

arXiv:2309.16235 [pdf, other]

Language models in molecular discovery

Authors: Nikita Janakarajan, Tim Erdmann, Sarath Swaminathan, Teodoro Laino, Jannis Born

Abstract: The success of language models, especially transformer-based architectures, has trickled into other domains giving rise to "scientific language models" that operate on small molecules, proteins or polymers. In chemistry, language models contribute to accelerating the molecule discovery cycle as evidenced by promising recent findings in early-stage drug discovery. Here, we review the role of langua… ▽ More The success of language models, especially transformer-based architectures, has trickled into other domains giving rise to "scientific language models" that operate on small molecules, proteins or polymers. In chemistry, language models contribute to accelerating the molecule discovery cycle as evidenced by promising recent findings in early-stage drug discovery. Here, we review the role of language models in molecular discovery, underlining their strength in de novo drug design, property prediction and reaction chemistry. We highlight valuable open-source software assets thus lowering the entry barrier to the field of scientific language modeling. Last, we sketch a vision for future molecular design that combines a chatbot interface with access to computational chemistry tools. Our contribution serves as a valuable resource for researchers, chemists, and AI enthusiasts interested in understanding how language models can and will be used to accelerate chemical discovery. △ Less

Submitted 28 September, 2023; originally announced September 2023.

Comments: Under review

arXiv:2307.05734 [pdf, other]

Towards quantum-enabled cell-centric therapeutics

Authors: Saugata Basu, Jannis Born, Aritra Bose, Sara Capponi, Dimitra Chalkia, Timothy A Chan, Hakan Doga, Frederik F. Flother, Gad Getz, Mark Goldsmith, Tanvi Gujarati, Aldo Guzman-Saenz, Dimitrios Iliopoulos, Gavin O. Jones, Stefan Knecht, Dhiraj Madan, Sabrina Maniscalco, Nicola Mariella, Joseph A. Morrone, Khadijeh Najafi, Pushpak Pati, Daniel Platt, Maria Anna Rapsomaniki, Anupama Ray, Kahn Rhrissorrakrai , et al. (8 additional authors not shown)

Abstract: In recent years, there has been tremendous progress in the development of quantum computing hardware, algorithms and services leading to the expectation that in the near future quantum computers will be capable of performing simulations for natural science applications, operations research, and machine learning at scales mostly inaccessible to classical computers. Whereas the impact of quantum com… ▽ More In recent years, there has been tremendous progress in the development of quantum computing hardware, algorithms and services leading to the expectation that in the near future quantum computers will be capable of performing simulations for natural science applications, operations research, and machine learning at scales mostly inaccessible to classical computers. Whereas the impact of quantum computing has already started to be recognized in fields such as cryptanalysis, natural science simulations, and optimization among others, very little is known about the full potential of quantum computing simulations and machine learning in the realm of healthcare and life science (HCLS). Herein, we discuss the transformational changes we expect from the use of quantum computation for HCLS research, more specifically in the field of cell-centric therapeutics. Moreover, we identify and elaborate open problems in cell engineering, tissue modeling, perturbation modeling, and bio-topology while discussing candidate quantum algorithms for research on these topics and their potential advantages over classical computational approaches. △ Less

Submitted 1 August, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

Comments: 6 figures

arXiv:2301.12586 [pdf, other]

Unifying Molecular and Textual Representations via Multi-task Language Modelling

Authors: Dimitrios Christofidellis, Giorgio Giannone, Jannis Born, Ole Winther, Teodoro Laino, Matteo Manica

Abstract: The recent advances in neural language models have also been successfully applied to the field of chemistry, offering generative solutions for classical problems in molecular design and synthesis planning. These new methods have the potential to fuel a new era of data-driven automation in scientific discovery. However, specialized models are still typically required for each task, leading to the n… ▽ More The recent advances in neural language models have also been successfully applied to the field of chemistry, offering generative solutions for classical problems in molecular design and synthesis planning. These new methods have the potential to fuel a new era of data-driven automation in scientific discovery. However, specialized models are still typically required for each task, leading to the need for problem-specific fine-tuning and neglecting task interrelations. The main obstacle in this field is the lack of a unified representation between natural language and chemical representations, complicating and limiting human-machine interaction. Here, we propose the first multi-domain, multi-task language model that can solve a wide range of tasks in both the chemical and natural language domains. Our model can handle chemical and natural language concurrently, without requiring expensive pre-training on single domains or task-specific models. Interestingly, sharing weights across domains remarkably improves our model when benchmarked against state-of-the-art baselines on single-domain and cross-domain tasks. In particular, sharing information across domains and tasks gives rise to large improvements in cross-domain tasks, the magnitude of which increase with scale, as measured by more than a dozen of relevant metrics. Our work suggests that such models can robustly and efficiently accelerate discovery in physical sciences by superseding problem-specific fine-tuning and enhancing human-model interactions. △ Less

Submitted 17 May, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

Comments: ICML 2023

arXiv:2301.08750 [pdf, other]

Domain-agnostic and Multi-level Evaluation of Generative Models

Authors: Girmaw Abebe Tadesse, Jannis Born, Celia Cintas, William Ogallo, Dmitry Zubarev, Matteo Manica, Komminist Weldemariam

Abstract: While the capabilities of generative models heavily improved in different domains (images, text, graphs, molecules, etc.), their evaluation metrics largely remain based on simplified quantities or manual inspection with limited practicality. To this end, we propose a framework for Multi-level Performance Evaluation of Generative mOdels (MPEGO), which could be employed across different domains. MPE… ▽ More While the capabilities of generative models heavily improved in different domains (images, text, graphs, molecules, etc.), their evaluation metrics largely remain based on simplified quantities or manual inspection with limited practicality. To this end, we propose a framework for Multi-level Performance Evaluation of Generative mOdels (MPEGO), which could be employed across different domains. MPEGO aims to quantify generation performance hierarchically, starting from a sub-feature-based low-level evaluation to a global features-based high-level evaluation. MPEGO offers great customizability as the employed features are entirely user-driven and can thus be highly domain/problem-specific while being arbitrarily complex (e.g., outcomes of experimental procedures). We validate MPEGO using multiple generative models across several datasets from the material discovery domain. An ablation study is conducted to study the plausibility of intermediate steps in MPEGO. Results demonstrate that MPEGO provides a flexible, user-driven, and multi-level evaluation framework, with practical insights on the generation quality. The framework, source code, and experiments will be available at https://github.com/GT4SD/mpego. △ Less

Submitted 20 January, 2023; originally announced January 2023.

arXiv:2207.03928 [pdf, other]

doi 10.1038/s41524-023-01028-1

Accelerating Material Design with the Generative Toolkit for Scientific Discovery

Authors: Matteo Manica, Jannis Born, Joris Cadow, Dimitrios Christofidellis, Ashish Dave, Dean Clarke, Yves Gaetan Nana Teukam, Giorgio Giannone, Samuel C. Hoffman, Matthew Buchan, Vijil Chenthamarakshan, Timothy Donovan, Hsiang Han Hsu, Federico Zipoli, Oliver Schilter, Akihiro Kishimoto, Lisa Hamada, Inkit Padhi, Karl Wehden, Lauren McHugh, Alexy Khrabrov, Payel Das, Seiji Takeda, John R. Smith

Abstract: With the growing availability of data within various scientific domains, generative models hold enormous potential to accelerate scientific discovery. They harness powerful representations learned from datasets to speed up the formulation of novel hypotheses with the potential to impact material discovery broadly. We present the Generative Toolkit for Scientific Discovery (GT4SD). This extensible… ▽ More With the growing availability of data within various scientific domains, generative models hold enormous potential to accelerate scientific discovery. They harness powerful representations learned from datasets to speed up the formulation of novel hypotheses with the potential to impact material discovery broadly. We present the Generative Toolkit for Scientific Discovery (GT4SD). This extensible open-source library enables scientists, developers, and researchers to train and use state-of-the-art generative models to accelerate scientific discovery focused on material design. △ Less

Submitted 31 January, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

Comments: 15 pages, 2 figures

Journal ref: Nature Partner Journals (npj) Computational Materials 9, 69 (2023)

arXiv:2207.02572 [pdf, other]

The MICADO Atmospheric Dispersion Corrector: Optomechanical design, expected performance and calibration techniques

Authors: J. A. van den Born, R. Romp, A. W. Janssen, R. Navarro, W. Jellema, E. Tolstoy, B. Jayawardhana, M. Hartl

Abstract: The differential refraction of light passing through the atmosphere can have a severe impact on image quality if no atmospheric dispersion corrector (ADC) is used. For the Extremely Large Telescope (ELT) this holds true well into the infrared. MICADO, the near-infrared imaging camera for the ELT, will employ a cryogenic ADC consisting of two counter-rotating Amici prisms with diameters of 125 mm.… ▽ More The differential refraction of light passing through the atmosphere can have a severe impact on image quality if no atmospheric dispersion corrector (ADC) is used. For the Extremely Large Telescope (ELT) this holds true well into the infrared. MICADO, the near-infrared imaging camera for the ELT, will employ a cryogenic ADC consisting of two counter-rotating Amici prisms with diameters of 125 mm. The mechanism will reduce the atmospheric dispersion to below 2.5 milli arcseconds (mas), with a set goal of 1 mas. In this report, we provide an overview of the current status of the ADC in development for MICADO. We summarise the optomechanical design and discuss how the cryogenic environment impacts the performance. We will also discuss our plan to use a diffraction mask in the cold pupil to calibrate and validate the performance once the instrument is fully integrated. △ Less

Submitted 6 July, 2022; originally announced July 2022.

Comments: 16 pages, 12 figures, submitted to Proceedings of SPIE Astronomical Telescopes & Instrumentation 2022

arXiv:2202.01338 [pdf, other]

doi 10.1038/s42256-023-00639-z

Regression Transformer: Concurrent sequence regression and generation for molecular language modeling

Authors: Jannis Born, Matteo Manica

Abstract: Despite significant progress of generative models in the natural sciences, their controllability remains challenging. One fundamentally missing aspect of molecular or protein generative models is an inductive bias that can reflect continuous properties of interest. To that end, we propose the Regression Transformer (RT), a novel method that abstracts regression as a conditional sequence modeling p… ▽ More Despite significant progress of generative models in the natural sciences, their controllability remains challenging. One fundamentally missing aspect of molecular or protein generative models is an inductive bias that can reflect continuous properties of interest. To that end, we propose the Regression Transformer (RT), a novel method that abstracts regression as a conditional sequence modeling problem. This introduces a new paradigm of multitask language models which seamlessly bridge sequence regression and conditional sequence generation. We thoroughly demonstrate that, despite using a nominal-scale training objective, the RT matches or surpasses the performance of conventional regression models in property prediction tasks of small molecules, proteins and chemical reactions. Critically, priming the same model with continuous properties yields a highly competitive conditional generative model that outperforms specialized approaches in a substructure-constrained, property-driven molecule generation benchmark. Our dichotomous approach is facilitated by a novel, alternating training scheme that enables the model to decorate seed sequences by desired properties, e.g., to optimize reaction yield. In sum, the RT is the first report of a multitask model that concurrently excels at predictive and generative tasks in biochemistry. This finds particular application in property-driven, local exploration of the chemical or protein space and could pave the road toward foundation models in material design. The code to reproduce all experiments of the paper is available at: https://github.com/IBM/regression-transformer △ Less

Submitted 11 November, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

Comments: Updated paper, under review; Preliminary version as spotlight talk at ICLR 2022 workshop on Machine Learning for Drug Discovery

Journal ref: Nature Machine Intelligence 5, 432-444 (2023)

arXiv:2112.01284 [pdf, other]

doi 10.1093/mnras/stac845

Demonstration of an imaging technique for the measurement of PSF elongation caused by Atmospheric Dispersion

Authors: J. A. van den Born, W. Jellema, E. Dijkstra

Abstract: Elongation of the point spread function due to atmospheric dispersion becomes a severe problem for high resolution imaging instruments, if an atmospheric dispersion corrector is not present. In this work we report on a novel technique to measure this elongation, corrected or uncorrected, from imaging data. By employing a simple diffraction mask it is possible to magnify the chromatic elongation ca… ▽ More Elongation of the point spread function due to atmospheric dispersion becomes a severe problem for high resolution imaging instruments, if an atmospheric dispersion corrector is not present. In this work we report on a novel technique to measure this elongation, corrected or uncorrected, from imaging data. By employing a simple diffraction mask it is possible to magnify the chromatic elongation caused by the atmosphere and thus make it easier to measure. We discuss the theory and design of such a mask and report on two proof of concept observations using the 40 cm Gratama telescope at the University of Groningen. We evaluate the acquired images using a geometric approach, a forward modelling approach and from a direct measurement of the length of the point spread function. For the first two methods we report measurements consistent with atmospheric dispersion models to within 0.5 arcsec. Direct measurements of the elongation do not prove suitable for the characterisation of atmospheric dispersion. We conclude that the addition of this type of diffraction mask can be valuable for measurements of PSF elongation. This can enable high precision correction of atmospheric dispersion on future instruments. △ Less

Submitted 23 March, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

Comments: Accepted for publication in the Monthly Notices of the Royal Astronomical Society. Contains 11 pages, 11 figures, 2 tables

arXiv:2105.03323 [pdf, other]

doi 10.1093/bioinformatics/btab294

TITAN: T Cell Receptor Specificity Prediction with Bimodal Attention Networks

Authors: Anna Weber, Jannis Born, María Rodríguez Martínez

Abstract: Motivation: The activity of the adaptive immune system is governed by T-cells and their specific T-cell receptors (TCR), which selectively recognize foreign antigens. Recent advances in experimental techniques have enabled sequencing of TCRs and their antigenic targets (epitopes), allowing to research the missing link between TCR sequence and epitope binding specificity. Scarcity of data and a lar… ▽ More Motivation: The activity of the adaptive immune system is governed by T-cells and their specific T-cell receptors (TCR), which selectively recognize foreign antigens. Recent advances in experimental techniques have enabled sequencing of TCRs and their antigenic targets (epitopes), allowing to research the missing link between TCR sequence and epitope binding specificity. Scarcity of data and a large sequence space make this task challenging, and to date only models limited to a small set of epitopes have achieved good performance. Here, we establish a k-nearest-neighbor (K-NN) classifier as a strong baseline and then propose TITAN (Tcr epITope bimodal Attention Networks), a bimodal neural network that explicitly encodes both TCR sequences and epitopes to enable the independent study of generalization capabilities to unseen TCRs and/or epitopes. Results: By encoding epitopes at the atomic level with SMILES sequences, we leverage transfer learning and data augmentation to enrich the input data space and boost performance. TITAN achieves high performance in the prediction of specificity of unseen TCRs (ROC-AUC 0.87 in 10-fold CV) and surpasses the results of the current state-of-the-art (ImRex) by a large margin. Notably, our Levenshtein-distance-based K-NN classifier also exhibits competitive performance on unseen TCRs. While the generalization to unseen epitopes remains challenging, we report two major breakthroughs. First, by dissecting the attention heatmaps, we demonstrate that the sparsity of available epitope data favors an implicit treatment of epitopes as classes. This may be a general problem that limits unseen epitope performance for sufficiently complex models. Second, we show that TITAN nevertheless exhibits significantly improved performance on unseen epitopes and is capable of focusing attention on chemically meaningful molecular structures. △ Less

Submitted 21 April, 2021; originally announced May 2021.

Comments: 9 pages, 5 figures, to be published in ISMB 2021 conference proceedings

Journal ref: Bioinformatics 37 (2021): i237-i244

arXiv:2011.09358 [pdf, other]

doi 10.1117/12.2560912

A Fourier optics approach to evaluate the astrometric performance of MICADO

Authors: J. A. van den Born, W. Jellema, R. Navarro, E. Tolstoy, B. Jayawardhana, A. W. Janssen

Abstract: We present our investigation into the impact of wavefront errors on high accuracy astrometry using Fourier Optics. MICADO, the upcoming near-IR imaging instrument for the Extremely Large Telescope, will offer capabilities for relative astrometry with an accuracy of 50 micro arcseconds (μas). Due to the large size of the point spread function (PSF) compared to the astrometric requirement, the detai… ▽ More We present our investigation into the impact of wavefront errors on high accuracy astrometry using Fourier Optics. MICADO, the upcoming near-IR imaging instrument for the Extremely Large Telescope, will offer capabilities for relative astrometry with an accuracy of 50 micro arcseconds (μas). Due to the large size of the point spread function (PSF) compared to the astrometric requirement, the detailed shape and position of the PSF on the detector must be well understood. Furthermore, because the atmospheric dispersion corrector of MICADO is a moving component within an otherwise mostly static instrument, it might not be sufficient to perform a simple pre-observation calibration. Therefore, we have built a Fourier Optics framework, allowing us to evaluate the small changes in the centroid position of the PSF as a function of wavefront error. For a complete evaluation, we model both the low order surface form errors, using Zernike polynomials, and the mid- and high-spatial frequencies, using Power Spectral Density analysis. The described work will then make it possible, performing full diffractive beam propagation, to assess the expected astrometric performance of MICADO. △ Less

Submitted 18 November, 2020; originally announced November 2020.

Comments: 13 pages, 13 figures, to be submitted to the SPIE Astronomical Telescopes & Instrumentation 2020 conference

arXiv:2009.06116 [pdf, other]

doi 10.3390/app11020672

Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image Analysis

Authors: Jannis Born, Nina Wiedemann, Gabriel Brändle, Charlotte Buhre, Bastian Rieck, Karsten Borgwardt

Abstract: Controlling the COVID-19 pandemic largely hinges upon the existence of fast, safe, and highly-available diagnostic tools. Ultrasound, in contrast to CT or X-Ray, has many practical advantages and can serve as a globally-applicable first-line examination technique. We provide the largest publicly available lung ultrasound (US) dataset for COVID-19 consisting of 106 videos from three classes (COVID-… ▽ More Controlling the COVID-19 pandemic largely hinges upon the existence of fast, safe, and highly-available diagnostic tools. Ultrasound, in contrast to CT or X-Ray, has many practical advantages and can serve as a globally-applicable first-line examination technique. We provide the largest publicly available lung ultrasound (US) dataset for COVID-19 consisting of 106 videos from three classes (COVID-19, bacterial pneumonia, and healthy controls); curated and approved by medical experts. On this dataset, we perform an in-depth study of the value of deep learning methods for differential diagnosis of COVID-19. We propose a frame-based convolutional neural network that correctly classifies COVID-19 US videos with a sensitivity of 0.98+-0.04 and a specificity of 0.91+-08 (frame-based sensitivity 0.93+-0.05, specificity 0.87+-0.07). We further employ class activation maps for the spatio-temporal localization of pulmonary biomarkers, which we subsequently validate for human-in-the-loop scenarios in a blindfolded study with medical experts. Aiming for scalability and robustness, we perform ablation studies comparing mobile-friendly, frame- and video-based architectures and show reliability of the best model by aleatoric and epistemic uncertainty estimates. We hope to pave the road for a community effort toward an accessible, efficient and interpretable screening method and we have started to work on a clinical validation of the proposed method. Data and code are publicly available. △ Less

Submitted 13 September, 2020; originally announced September 2020.

Comments: 8 pages, 4 figures

Journal ref: Applied Sciences 2021 (special issue on: "Fighting COVID-19: Emerging Techniques and Aid Systems for Prevention, Forecasting and Diagnosis")

arXiv:2006.12132 [pdf, other]

doi 10.1093/mnras/staa1870

Quantification of the expected residual dispersion of the MICADO Near-IR imaging instrument

Authors: J. A. van den Born, W. Jellema

Abstract: MICADO, a near-infrared imager for the Extremely Large Telescope, is being designed to deliver diffraction limited imaging and 50 micro arcsecond ($μ$as) astrometric accuracy. MICADO employs an atmospheric dispersion corrector (ADC) to keep the chromatic elongation of the point spread function (PSF) under control. We must understand the dispersion and residuals after correction to reach the optimu… ▽ More MICADO, a near-infrared imager for the Extremely Large Telescope, is being designed to deliver diffraction limited imaging and 50 micro arcsecond ($μ$as) astrometric accuracy. MICADO employs an atmospheric dispersion corrector (ADC) to keep the chromatic elongation of the point spread function (PSF) under control. We must understand the dispersion and residuals after correction to reach the optimum performance. Therefore, we identified several sources of chromatic dispersion that need to be considered for the MICADO ADC. First, we compared common models of atmospheric dispersion to investigate whether these models remain suitable for MICADO. We showed that the differential dispersion between common atmospheric models and integration over the full atmosphere is less than 10 $μ$as for most observations in H-band. We then performed an error propagation analysis to understand the uncertainty in the atmospheric dispersion as a function of atmospheric conditions. In addition, we investigated the impact of photometric color on the astrometric performance. While the differential refraction between stars within the same field of view can be significant, the inclusion of an ADC rendered this effect negligible. For MICADO specifically, we found that the current optomechanical design dominates the residual dispersion budget of 0.4 milli arcseconds (mas), with a contribution of 0.31 mas due to the positioning accuracy of the prisms and up to 0.15 mas due to a mismatch between the dispersive properties of the glass and the atmosphere. We found no showstoppers in the design of the MICADO ADC for achieving 50 $μ$as relative astrometric accuracy. △ Less

Submitted 22 June, 2020; originally announced June 2020.

Comments: Submitted to Monthly Notices of the Royal Astronomical Society. Contains 11 pages and 12 figures

arXiv:2005.13285 [pdf, other]

PaccMann$^{RL}$ on SARS-CoV-2: Designing antiviral candidates with conditional generative models

Authors: Jannis Born, Matteo Manica, Joris Cadow, Greta Markert, Nil Adell Mill, Modestas Filipavicius, María Rodríguez Martínez

Abstract: With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand--protein binding affin… ▽ More With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand--protein binding affinity model on predicting affinities of antiviral compounds to target proteins and couple this model with pharmacological toxicity predictors. Exploiting this multi-objective as a reward function of a conditional molecular generator (consisting of two VAEs), we showcase a framework that navigates the chemical space toward regions with more antiviral molecules. Specifically, we explore a challenging setting of generating ligands against unseen protein targets by performing a leave-one-out-cross-validation on 41 SARS-CoV-2-related target proteins. Using deep RL, it is demonstrated that in 35 out of 41 cases, the generation is biased towards sampling more binding ligands, with an average increase of 83% comparing to an unbiased VAE. We present a case-study on a potential Envelope-protein inhibitor and perform a synthetic accessibility assessment of the best generated molecules is performed that resembles a viable roadmap towards a rapid in-vitro evaluation of potential SARS-CoV-2 inhibitors. △ Less

Submitted 6 July, 2020; v1 submitted 27 May, 2020; originally announced May 2020.

Comments: 5 pages, 6 figures

Journal ref: ICML Workshop on Computational Biology 2020

arXiv:2004.12084 [pdf, other]

POCOVID-Net: Automatic Detection of COVID-19 From a New Lung Ultrasound Imaging Dataset (POCUS)

Authors: Jannis Born, Gabriel Brändle, Manuel Cossio, Marion Disdier, Julie Goulet, Jérémie Roulin, Nina Wiedemann

Abstract: With the rapid development of COVID-19 into a global pandemic, there is an ever more urgent need for cheap, fast and reliable tools that can assist physicians in diagnosing COVID-19. Medical imaging such as CT can take a key role in complementing conventional diagnostic tools from molecular biology, and, using deep learning techniques, several automatic systems were demonstrated promising performa… ▽ More With the rapid development of COVID-19 into a global pandemic, there is an ever more urgent need for cheap, fast and reliable tools that can assist physicians in diagnosing COVID-19. Medical imaging such as CT can take a key role in complementing conventional diagnostic tools from molecular biology, and, using deep learning techniques, several automatic systems were demonstrated promising performances using CT or X-ray data. Here, we advocate a more prominent role of point-of-care ultrasound imaging to guide COVID-19 detection. Ultrasound is non-invasive and ubiquitous in medical facilities around the globe. Our contribution is threefold. First, we gather a lung ultrasound (POCUS) dataset consisting of 1103 images (654 COVID-19, 277 bacterial pneumonia and 172 healthy controls), sampled from 64 videos. This dataset was assembled from various online sources, processed specifically for deep learning models and is intended to serve as a starting point for an open-access initiative. Second, we train a deep convolutional neural network (POCOVID-Net) on this 3-class dataset and achieve an accuracy of 89% and, by a majority vote, a video accuracy of 92% . For detecting COVID-19 in particular, the model performs with a sensitivity of 0.96, a specificity of 0.79 and F1-score of 0.92 in a 5-fold cross validation. Third, we provide an open-access web service (POCOVIDScreen) that is available at: https://pocovidscreen.org. The website deploys the predictive model, allowing to perform predictions on ultrasound lung images. In addition, it grants medical staff the option to (bulk) upload their own screenings in order to contribute to the growing public database of pathological lung ultrasound images. Dataset and code are available from: https://github.com/jannisborn/covid19_pocus_ultrasound. NOTE: This preprint is superseded by our paper in Applied Sciences: https://doi.org/10.3390/app11020672 △ Less

Submitted 24 January, 2021; v1 submitted 25 April, 2020; originally announced April 2020.

Comments: 7 pages, 4 figures

Journal ref: ISMB TransMed COSI 2020

arXiv:2004.01215 [pdf, other]

CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models

Authors: Vijil Chenthamarakshan, Payel Das, Samuel C. Hoffman, Hendrik Strobelt, Inkit Padhi, Kar Wai Lim, Benjamin Hoover, Matteo Manica, Jannis Born, Teodoro Laino, Aleksandra Mojsilovic

Abstract: The novel nature of SARS-CoV-2 calls for the development of efficient de novo drug design approaches. In this study, we propose an end-to-end framework, named CogMol (Controlled Generation of Molecules), for designing new drug-like small molecules targeting novel viral proteins with high affinity and off-target selectivity. CogMol combines adaptive pre-training of a molecular SMILES Variational Au… ▽ More The novel nature of SARS-CoV-2 calls for the development of efficient de novo drug design approaches. In this study, we propose an end-to-end framework, named CogMol (Controlled Generation of Molecules), for designing new drug-like small molecules targeting novel viral proteins with high affinity and off-target selectivity. CogMol combines adaptive pre-training of a molecular SMILES Variational Autoencoder (VAE) and an efficient multi-attribute controlled sampling scheme that uses guidance from attribute predictors trained on latent features. To generate novel and optimal drug-like molecules for unseen viral targets, CogMol leverages a protein-molecule binding affinity predictor that is trained using SMILES VAE embeddings and protein sequence embeddings learned unsupervised from a large corpus. CogMol framework is applied to three SARS-CoV-2 target proteins: main protease, receptor-binding domain of the spike protein, and non-structural protein 9 replicase. The generated candidates are novel at both molecular and chemical scaffold levels when compared to the training data. CogMol also includes insilico screening for assessing toxicity of parent molecules and their metabolites with a multi-task toxicity classifier, synthetic feasibility with a chemical retrosynthesis predictor, and target structure binding with docking simulations. Docking reveals favorable binding of generated molecules to the target protein structure, where 87-95 % of high affinity molecules showed docking free energy < -6 kcal/mol. When compared to approved drugs, the majority of designed compounds show low parent molecule and metabolite toxicity and high synthetic feasibility. In summary, CogMol handles multi-constraint design of synthesizable, low-toxic, drug-like molecules with high target specificity and selectivity, and does not need target-dependent fine-tuning of the framework or target structure information. △ Less

Submitted 23 June, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

arXiv:1909.05114 [pdf, other]

doi 10.1007/978-3-030-45257-5_18

PaccMann$^{RL}$: Designing anticancer drugs from transcriptomic data via reinforcement learning

Authors: Jannis Born, Matteo Manica, Ali Oskooei, Joris Cadow, Karsten Borgwardt, María Rodríguez Martínez

Abstract: With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capa… ▽ More With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capable of tailoring anticancer compounds for a specific biomolecular profile. Using a RL framework, the transcriptomic profiles of cancer cells are used as a context for the generation of candidate molecules. Our molecule generator combines two separately pretrained variational autoencoders (VAEs) - the first VAE encodes transcriptomic profiles into a smooth, latent space which in turn is used to condition a second VAE to generate novel molecular structures on the given transcriptomic profile. The generative process is optimized through PaccMann, a previously developed drug sensitivity prediction model to obtain effective anticancer compounds for the given context (i.e., transcriptomic profile). We demonstrate how the molecule generation can be biased towards compounds with high predicted inhibitory effect against individual cell lines or specific cancer sites. We verify our approach by investigating candidate drugs generated against specific cancer types and find the highest structural similarity to existing compounds with known efficacy against these cancer types. We envision our approach to transform in silico anticancer drug design by leveraging the biomolecular characteristics of the disease in order to increase success rates in lead compound discovery. △ Less

Submitted 16 April, 2020; v1 submitted 29 August, 2019; originally announced September 2019.

Comments: 18 pages total (12 pages main text, 4 pages references, 11 pages appendix) 8 figures

Journal ref: International Conference on Research in Computational Molecular Biology 2020

arXiv:1904.11223 [pdf, other]

doi 10.1021/acs.molpharmaceut.9b00520

Towards Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-based Convolutional Encoders

Authors: Matteo Manica, Ali Oskooei, Jannis Born, Vigneshwari Subramanian, Julio Sáez-Rodríguez, María Rodríguez Martínez

Abstract: In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors and prior kn… ▽ More In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors and prior knowledge on intracellular interactions from protein-protein interaction networks. We demonstrate that our multiscale convolutional attention-based (MCA) encoder significantly outperforms a baseline model trained on Morgan fingerprints, a selection of encoders based on SMILES as well as previously reported state of the art for multimodal drug sensitivity prediction (R2 = 0.86 and RMSE = 0.89). Moreover, the explainability of our approach is demonstrated by a thorough analysis of the attention weights. We show that the attended genes significantly enrich apoptotic processes and that the drug attention is strongly correlated with a standard chemical structure similarity index. Finally, we report a case study of two receptor tyrosine kinase (RTK) inhibitors acting on a leukemia cell line, showcasing the ability of the model to focus on informative genes and submolecular regions of the two compounds. The demonstrated generalizability and the interpretability of our model testify its potential for in-silico prediction of anticancer compound efficacy on unseen cancer cells, positioning it as a valid solution for the development of personalized therapies as well as for the evaluation of candidate compounds in de novo drug design. △ Less

Submitted 14 July, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

Comments: 11 pages, 5 figures, 1 table, Workshop on Computational Biology at the International Conference on Machine Learning (ICML), Long Beach, CA, 2019

Journal ref: Mol. Pharmaceutics 2019

arXiv:1811.06802 [pdf, other]

PaccMann: Prediction of anticancer compound sensitivity with multi-modal attention-based neural networks

Authors: Ali Oskooei, Jannis Born, Matteo Manica, Vigneshwari Subramanian, Julio Sáez-Rodríguez, María Rodríguez Martínez

Abstract: We present a novel approach for the prediction of anticancer compound sensitivity by means of multi-modal attention-based neural networks (PaccMann). In our approach, we integrate three key pillars of drug sensitivity, namely, the molecular structure of compounds, transcriptomic profiles of cancer cells as well as prior knowledge about interactions among proteins within cells. Our models ingest a… ▽ More We present a novel approach for the prediction of anticancer compound sensitivity by means of multi-modal attention-based neural networks (PaccMann). In our approach, we integrate three key pillars of drug sensitivity, namely, the molecular structure of compounds, transcriptomic profiles of cancer cells as well as prior knowledge about interactions among proteins within cells. Our models ingest a drug-cell pair consisting of SMILES encoding of a compound and the gene expression profile of a cancer cell and predicts an IC50 sensitivity value. Gene expression profiles are encoded using an attention-based encoding mechanism that assigns high weights to the most informative genes. We present and study three encoders for SMILES string of compounds: 1) bidirectional recurrent 2) convolutional 3) attention-based encoders. We compare our devised models against a baseline model that ingests engineered fingerprints to represent the molecular structure. We demonstrate that using our attention-based encoders, we can surpass the baseline model. The use of attention-based encoders enhance interpretability and enable us to identify genes, bonds and atoms that were used by the network to make a prediction. △ Less

Submitted 14 July, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

Comments: 10 pages, 5 figures, 2 tables. NIPS MLMM 2018

Journal ref: NeurIPS 2018 Workshop on Machine Learning for Molecules & Materials

arXiv:1605.07268 [pdf, other]

Classifying discourse in a CSCL platform to evaluate correlations with Teacher Participation and Progress

Authors: Eliana Scheihing, Matthieu Vernier, Javiera Born, Julio Guerra, Luis Carcamo

Abstract: In Computer-Supported learning, monitoring and engaging a group of learners is a complex task for teachers, especially when learners are working collaboratively: Are my students motivated? What kind of progress are they making? Should I intervene? Is my communication and the didactic design adapted to my students? Our hypothesis is that the analysis of natural language interactions between student… ▽ More In Computer-Supported learning, monitoring and engaging a group of learners is a complex task for teachers, especially when learners are working collaboratively: Are my students motivated? What kind of progress are they making? Should I intervene? Is my communication and the didactic design adapted to my students? Our hypothesis is that the analysis of natural language interactions between students, and between students and teachers, provide very valuable information and could be used to produce qualitative indicators to help teachers' decisions. We develop an automatic approach in three steps (1) to explore the discursive functions of messages in a CSCL platform, (2) to classify the messages automatically and (3) to evaluate correlations between discursive attitudes and other variables linked to the learning activity. Results tend to show that some types of discourse are correlated with a notion of Progress on the learning activities and the importance of emotive participation from the Teacher. △ Less

Submitted 23 May, 2016; originally announced May 2016.

arXiv:1407.7999 [pdf, ps, other]

doi 10.1111/j.1365-2869.2012.01039.x

Induction of slow oscillations by rhythmic acoustic stimulation

Authors: Hong-Viet V. Ngo, Jens Christian Claussen, Jan Born, Matthias Mölle

Abstract: Slow oscillations are electrical potential oscillations with a spectral peak frequency of $\sim$0.8 Hz, and hallmark the electroencephalogram during slow-wave sleep. Recent studies have indicated a causal contribution of slow oscillations to the consolidation of memories during slow-wave sleep, raising the question to what extent such oscillations can be induced by external stimulation. Here, we e… ▽ More Slow oscillations are electrical potential oscillations with a spectral peak frequency of $\sim$0.8 Hz, and hallmark the electroencephalogram during slow-wave sleep. Recent studies have indicated a causal contribution of slow oscillations to the consolidation of memories during slow-wave sleep, raising the question to what extent such oscillations can be induced by external stimulation. Here, we examined whether slow oscillations can be effectively induced by rhythmic acoustic stimulation. Human subjects were examined in three conditions: (i) with tones presented at a rate of 0.8 Hz (`0.8-Hz stimulation'); (ii) with tones presented at a random sequence (`random stimulation'); and (iii) with no tones presented in a control condition (`sham'). Stimulation started during wakefulness before sleep and continued for the first $\sim$90 min of sleep. Compared with the other two conditions, 0.8-Hz stimulation significantly delayed sleep onset. However, once sleep was established, 0.8-Hz stimulation significantly increased and entrained endogenous slow oscillation activity. Sleep after the 90-min period of stimulation did not differ between the conditions. Our data show that rhythmic acoustic stimulation can be used to effectively enhance slow oscillation activity. However, the effect depends on the brain state, requiring the presence of stable non-rapid eye movement sleep. △ Less

Submitted 30 July, 2014; originally announced July 2014.

Journal ref: J. Sleep Res. 22, 22-31 (2013)

Showing 1–21 of 21 results for author: Born, J