-
Quantum Theory and Application of Contextual Optimal Transport
Authors:
Nicola Mariella,
Albert Akhriev,
Francesco Tacchino,
Christa Zoufal,
Juan Carlos Gonzalez-Espitia,
Benedek Harsanyi,
Eugene Koskin,
Ivano Tavernelli,
Stefan Woerner,
Marianna Rapsomaniki,
Sergiy Zhuk,
Jannis Born
Abstract:
Optimal Transport (OT) has fueled machine learning (ML) across many domains. When paired data measurements $(\boldsymbolμ, \boldsymbolν)$ are coupled to covariates, a challenging conditional distribution learning setting arises. Existing approaches for learning a $\textit{global}$ transport map parameterized through a potentially unseen context utilize Neural OT and largely rely on Brenier's theor…
▽ More
Optimal Transport (OT) has fueled machine learning (ML) across many domains. When paired data measurements $(\boldsymbolμ, \boldsymbolν)$ are coupled to covariates, a challenging conditional distribution learning setting arises. Existing approaches for learning a $\textit{global}$ transport map parameterized through a potentially unseen context utilize Neural OT and largely rely on Brenier's theorem. Here, we propose a first-of-its-kind quantum computing formulation for amortized optimization of contextualized transportation plans. We exploit a direct link between doubly stochastic matrices and unitary operators thus unravelling a natural connection between OT and quantum computation. We verify our method (QontOT) on synthetic and real data by predicting variations in cell type distributions conditioned on drug dosage. Importantly we conduct a 24-qubit hardware experiment on a task challenging for classical computers and report a performance that cannot be matched with our classical neural OT approach. In sum, this is a first step toward learning to predict contextualized transportation plans through quantum computing.
△ Less
Submitted 3 June, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
Language models in molecular discovery
Authors:
Nikita Janakarajan,
Tim Erdmann,
Sarath Swaminathan,
Teodoro Laino,
Jannis Born
Abstract:
The success of language models, especially transformer-based architectures, has trickled into other domains giving rise to "scientific language models" that operate on small molecules, proteins or polymers. In chemistry, language models contribute to accelerating the molecule discovery cycle as evidenced by promising recent findings in early-stage drug discovery. Here, we review the role of langua…
▽ More
The success of language models, especially transformer-based architectures, has trickled into other domains giving rise to "scientific language models" that operate on small molecules, proteins or polymers. In chemistry, language models contribute to accelerating the molecule discovery cycle as evidenced by promising recent findings in early-stage drug discovery. Here, we review the role of language models in molecular discovery, underlining their strength in de novo drug design, property prediction and reaction chemistry. We highlight valuable open-source software assets thus lowering the entry barrier to the field of scientific language modeling. Last, we sketch a vision for future molecular design that combines a chatbot interface with access to computational chemistry tools. Our contribution serves as a valuable resource for researchers, chemists, and AI enthusiasts interested in understanding how language models can and will be used to accelerate chemical discovery.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Towards quantum-enabled cell-centric therapeutics
Authors:
Saugata Basu,
Jannis Born,
Aritra Bose,
Sara Capponi,
Dimitra Chalkia,
Timothy A Chan,
Hakan Doga,
Frederik F. Flother,
Gad Getz,
Mark Goldsmith,
Tanvi Gujarati,
Aldo Guzman-Saenz,
Dimitrios Iliopoulos,
Gavin O. Jones,
Stefan Knecht,
Dhiraj Madan,
Sabrina Maniscalco,
Nicola Mariella,
Joseph A. Morrone,
Khadijeh Najafi,
Pushpak Pati,
Daniel Platt,
Maria Anna Rapsomaniki,
Anupama Ray,
Kahn Rhrissorrakrai
, et al. (8 additional authors not shown)
Abstract:
In recent years, there has been tremendous progress in the development of quantum computing hardware, algorithms and services leading to the expectation that in the near future quantum computers will be capable of performing simulations for natural science applications, operations research, and machine learning at scales mostly inaccessible to classical computers. Whereas the impact of quantum com…
▽ More
In recent years, there has been tremendous progress in the development of quantum computing hardware, algorithms and services leading to the expectation that in the near future quantum computers will be capable of performing simulations for natural science applications, operations research, and machine learning at scales mostly inaccessible to classical computers. Whereas the impact of quantum computing has already started to be recognized in fields such as cryptanalysis, natural science simulations, and optimization among others, very little is known about the full potential of quantum computing simulations and machine learning in the realm of healthcare and life science (HCLS). Herein, we discuss the transformational changes we expect from the use of quantum computation for HCLS research, more specifically in the field of cell-centric therapeutics. Moreover, we identify and elaborate open problems in cell engineering, tissue modeling, perturbation modeling, and bio-topology while discussing candidate quantum algorithms for research on these topics and their potential advantages over classical computational approaches.
△ Less
Submitted 1 August, 2023; v1 submitted 11 July, 2023;
originally announced July 2023.
-
Unifying Molecular and Textual Representations via Multi-task Language Modelling
Authors:
Dimitrios Christofidellis,
Giorgio Giannone,
Jannis Born,
Ole Winther,
Teodoro Laino,
Matteo Manica
Abstract:
The recent advances in neural language models have also been successfully applied to the field of chemistry, offering generative solutions for classical problems in molecular design and synthesis planning. These new methods have the potential to fuel a new era of data-driven automation in scientific discovery. However, specialized models are still typically required for each task, leading to the n…
▽ More
The recent advances in neural language models have also been successfully applied to the field of chemistry, offering generative solutions for classical problems in molecular design and synthesis planning. These new methods have the potential to fuel a new era of data-driven automation in scientific discovery. However, specialized models are still typically required for each task, leading to the need for problem-specific fine-tuning and neglecting task interrelations. The main obstacle in this field is the lack of a unified representation between natural language and chemical representations, complicating and limiting human-machine interaction. Here, we propose the first multi-domain, multi-task language model that can solve a wide range of tasks in both the chemical and natural language domains. Our model can handle chemical and natural language concurrently, without requiring expensive pre-training on single domains or task-specific models. Interestingly, sharing weights across domains remarkably improves our model when benchmarked against state-of-the-art baselines on single-domain and cross-domain tasks. In particular, sharing information across domains and tasks gives rise to large improvements in cross-domain tasks, the magnitude of which increase with scale, as measured by more than a dozen of relevant metrics. Our work suggests that such models can robustly and efficiently accelerate discovery in physical sciences by superseding problem-specific fine-tuning and enhancing human-model interactions.
△ Less
Submitted 17 May, 2023; v1 submitted 29 January, 2023;
originally announced January 2023.
-
Domain-agnostic and Multi-level Evaluation of Generative Models
Authors:
Girmaw Abebe Tadesse,
Jannis Born,
Celia Cintas,
William Ogallo,
Dmitry Zubarev,
Matteo Manica,
Komminist Weldemariam
Abstract:
While the capabilities of generative models heavily improved in different domains (images, text, graphs, molecules, etc.), their evaluation metrics largely remain based on simplified quantities or manual inspection with limited practicality. To this end, we propose a framework for Multi-level Performance Evaluation of Generative mOdels (MPEGO), which could be employed across different domains. MPE…
▽ More
While the capabilities of generative models heavily improved in different domains (images, text, graphs, molecules, etc.), their evaluation metrics largely remain based on simplified quantities or manual inspection with limited practicality. To this end, we propose a framework for Multi-level Performance Evaluation of Generative mOdels (MPEGO), which could be employed across different domains. MPEGO aims to quantify generation performance hierarchically, starting from a sub-feature-based low-level evaluation to a global features-based high-level evaluation. MPEGO offers great customizability as the employed features are entirely user-driven and can thus be highly domain/problem-specific while being arbitrarily complex (e.g., outcomes of experimental procedures). We validate MPEGO using multiple generative models across several datasets from the material discovery domain. An ablation study is conducted to study the plausibility of intermediate steps in MPEGO. Results demonstrate that MPEGO provides a flexible, user-driven, and multi-level evaluation framework, with practical insights on the generation quality. The framework, source code, and experiments will be available at https://github.com/GT4SD/mpego.
△ Less
Submitted 20 January, 2023;
originally announced January 2023.
-
Accelerating Material Design with the Generative Toolkit for Scientific Discovery
Authors:
Matteo Manica,
Jannis Born,
Joris Cadow,
Dimitrios Christofidellis,
Ashish Dave,
Dean Clarke,
Yves Gaetan Nana Teukam,
Giorgio Giannone,
Samuel C. Hoffman,
Matthew Buchan,
Vijil Chenthamarakshan,
Timothy Donovan,
Hsiang Han Hsu,
Federico Zipoli,
Oliver Schilter,
Akihiro Kishimoto,
Lisa Hamada,
Inkit Padhi,
Karl Wehden,
Lauren McHugh,
Alexy Khrabrov,
Payel Das,
Seiji Takeda,
John R. Smith
Abstract:
With the growing availability of data within various scientific domains, generative models hold enormous potential to accelerate scientific discovery. They harness powerful representations learned from datasets to speed up the formulation of novel hypotheses with the potential to impact material discovery broadly. We present the Generative Toolkit for Scientific Discovery (GT4SD). This extensible…
▽ More
With the growing availability of data within various scientific domains, generative models hold enormous potential to accelerate scientific discovery. They harness powerful representations learned from datasets to speed up the formulation of novel hypotheses with the potential to impact material discovery broadly. We present the Generative Toolkit for Scientific Discovery (GT4SD). This extensible open-source library enables scientists, developers, and researchers to train and use state-of-the-art generative models to accelerate scientific discovery focused on material design.
△ Less
Submitted 31 January, 2023; v1 submitted 8 July, 2022;
originally announced July 2022.
-
The MICADO Atmospheric Dispersion Corrector: Optomechanical design, expected performance and calibration techniques
Authors:
J. A. van den Born,
R. Romp,
A. W. Janssen,
R. Navarro,
W. Jellema,
E. Tolstoy,
B. Jayawardhana,
M. Hartl
Abstract:
The differential refraction of light passing through the atmosphere can have a severe impact on image quality if no atmospheric dispersion corrector (ADC) is used. For the Extremely Large Telescope (ELT) this holds true well into the infrared. MICADO, the near-infrared imaging camera for the ELT, will employ a cryogenic ADC consisting of two counter-rotating Amici prisms with diameters of 125 mm.…
▽ More
The differential refraction of light passing through the atmosphere can have a severe impact on image quality if no atmospheric dispersion corrector (ADC) is used. For the Extremely Large Telescope (ELT) this holds true well into the infrared. MICADO, the near-infrared imaging camera for the ELT, will employ a cryogenic ADC consisting of two counter-rotating Amici prisms with diameters of 125 mm. The mechanism will reduce the atmospheric dispersion to below 2.5 milli arcseconds (mas), with a set goal of 1 mas. In this report, we provide an overview of the current status of the ADC in development for MICADO. We summarise the optomechanical design and discuss how the cryogenic environment impacts the performance. We will also discuss our plan to use a diffraction mask in the cold pupil to calibrate and validate the performance once the instrument is fully integrated.
△ Less
Submitted 6 July, 2022;
originally announced July 2022.
-
Regression Transformer: Concurrent sequence regression and generation for molecular language modeling
Authors:
Jannis Born,
Matteo Manica
Abstract:
Despite significant progress of generative models in the natural sciences, their controllability remains challenging. One fundamentally missing aspect of molecular or protein generative models is an inductive bias that can reflect continuous properties of interest. To that end, we propose the Regression Transformer (RT), a novel method that abstracts regression as a conditional sequence modeling p…
▽ More
Despite significant progress of generative models in the natural sciences, their controllability remains challenging. One fundamentally missing aspect of molecular or protein generative models is an inductive bias that can reflect continuous properties of interest. To that end, we propose the Regression Transformer (RT), a novel method that abstracts regression as a conditional sequence modeling problem. This introduces a new paradigm of multitask language models which seamlessly bridge sequence regression and conditional sequence generation.
We thoroughly demonstrate that, despite using a nominal-scale training objective, the RT matches or surpasses the performance of conventional regression models in property prediction tasks of small molecules, proteins and chemical reactions. Critically, priming the same model with continuous properties yields a highly competitive conditional generative model that outperforms specialized approaches in a substructure-constrained, property-driven molecule generation benchmark. Our dichotomous approach is facilitated by a novel, alternating training scheme that enables the model to decorate seed sequences by desired properties, e.g., to optimize reaction yield.
In sum, the RT is the first report of a multitask model that concurrently excels at predictive and generative tasks in biochemistry. This finds particular application in property-driven, local exploration of the chemical or protein space and could pave the road toward foundation models in material design.
The code to reproduce all experiments of the paper is available at: https://github.com/IBM/regression-transformer
△ Less
Submitted 11 November, 2022; v1 submitted 1 February, 2022;
originally announced February 2022.
-
Demonstration of an imaging technique for the measurement of PSF elongation caused by Atmospheric Dispersion
Authors:
J. A. van den Born,
W. Jellema,
E. Dijkstra
Abstract:
Elongation of the point spread function due to atmospheric dispersion becomes a severe problem for high resolution imaging instruments, if an atmospheric dispersion corrector is not present. In this work we report on a novel technique to measure this elongation, corrected or uncorrected, from imaging data. By employing a simple diffraction mask it is possible to magnify the chromatic elongation ca…
▽ More
Elongation of the point spread function due to atmospheric dispersion becomes a severe problem for high resolution imaging instruments, if an atmospheric dispersion corrector is not present. In this work we report on a novel technique to measure this elongation, corrected or uncorrected, from imaging data. By employing a simple diffraction mask it is possible to magnify the chromatic elongation caused by the atmosphere and thus make it easier to measure. We discuss the theory and design of such a mask and report on two proof of concept observations using the 40 cm Gratama telescope at the University of Groningen. We evaluate the acquired images using a geometric approach, a forward modelling approach and from a direct measurement of the length of the point spread function. For the first two methods we report measurements consistent with atmospheric dispersion models to within 0.5 arcsec. Direct measurements of the elongation do not prove suitable for the characterisation of atmospheric dispersion. We conclude that the addition of this type of diffraction mask can be valuable for measurements of PSF elongation. This can enable high precision correction of atmospheric dispersion on future instruments.
△ Less
Submitted 23 March, 2022; v1 submitted 2 December, 2021;
originally announced December 2021.
-
TITAN: T Cell Receptor Specificity Prediction with Bimodal Attention Networks
Authors:
Anna Weber,
Jannis Born,
María Rodríguez Martínez
Abstract:
Motivation: The activity of the adaptive immune system is governed by T-cells and their specific T-cell receptors (TCR), which selectively recognize foreign antigens. Recent advances in experimental techniques have enabled sequencing of TCRs and their antigenic targets (epitopes), allowing to research the missing link between TCR sequence and epitope binding specificity. Scarcity of data and a lar…
▽ More
Motivation: The activity of the adaptive immune system is governed by T-cells and their specific T-cell receptors (TCR), which selectively recognize foreign antigens. Recent advances in experimental techniques have enabled sequencing of TCRs and their antigenic targets (epitopes), allowing to research the missing link between TCR sequence and epitope binding specificity. Scarcity of data and a large sequence space make this task challenging, and to date only models limited to a small set of epitopes have achieved good performance. Here, we establish a k-nearest-neighbor (K-NN) classifier as a strong baseline and then propose TITAN (Tcr epITope bimodal Attention Networks), a bimodal neural network that explicitly encodes both TCR sequences and epitopes to enable the independent study of generalization capabilities to unseen TCRs and/or epitopes. Results: By encoding epitopes at the atomic level with SMILES sequences, we leverage transfer learning and data augmentation to enrich the input data space and boost performance. TITAN achieves high performance in the prediction of specificity of unseen TCRs (ROC-AUC 0.87 in 10-fold CV) and surpasses the results of the current state-of-the-art (ImRex) by a large margin. Notably, our Levenshtein-distance-based K-NN classifier also exhibits competitive performance on unseen TCRs. While the generalization to unseen epitopes remains challenging, we report two major breakthroughs. First, by dissecting the attention heatmaps, we demonstrate that the sparsity of available epitope data favors an implicit treatment of epitopes as classes. This may be a general problem that limits unseen epitope performance for sufficiently complex models. Second, we show that TITAN nevertheless exhibits significantly improved performance on unseen epitopes and is capable of focusing attention on chemically meaningful molecular structures.
△ Less
Submitted 21 April, 2021;
originally announced May 2021.
-
A Fourier optics approach to evaluate the astrometric performance of MICADO
Authors:
J. A. van den Born,
W. Jellema,
R. Navarro,
E. Tolstoy,
B. Jayawardhana,
A. W. Janssen
Abstract:
We present our investigation into the impact of wavefront errors on high accuracy astrometry using Fourier Optics. MICADO, the upcoming near-IR imaging instrument for the Extremely Large Telescope, will offer capabilities for relative astrometry with an accuracy of 50 micro arcseconds (μas). Due to the large size of the point spread function (PSF) compared to the astrometric requirement, the detai…
▽ More
We present our investigation into the impact of wavefront errors on high accuracy astrometry using Fourier Optics. MICADO, the upcoming near-IR imaging instrument for the Extremely Large Telescope, will offer capabilities for relative astrometry with an accuracy of 50 micro arcseconds (μas). Due to the large size of the point spread function (PSF) compared to the astrometric requirement, the detailed shape and position of the PSF on the detector must be well understood. Furthermore, because the atmospheric dispersion corrector of MICADO is a moving component within an otherwise mostly static instrument, it might not be sufficient to perform a simple pre-observation calibration. Therefore, we have built a Fourier Optics framework, allowing us to evaluate the small changes in the centroid position of the PSF as a function of wavefront error. For a complete evaluation, we model both the low order surface form errors, using Zernike polynomials, and the mid- and high-spatial frequencies, using Power Spectral Density analysis. The described work will then make it possible, performing full diffractive beam propagation, to assess the expected astrometric performance of MICADO.
△ Less
Submitted 18 November, 2020;
originally announced November 2020.
-
Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image Analysis
Authors:
Jannis Born,
Nina Wiedemann,
Gabriel Brändle,
Charlotte Buhre,
Bastian Rieck,
Karsten Borgwardt
Abstract:
Controlling the COVID-19 pandemic largely hinges upon the existence of fast, safe, and highly-available diagnostic tools. Ultrasound, in contrast to CT or X-Ray, has many practical advantages and can serve as a globally-applicable first-line examination technique. We provide the largest publicly available lung ultrasound (US) dataset for COVID-19 consisting of 106 videos from three classes (COVID-…
▽ More
Controlling the COVID-19 pandemic largely hinges upon the existence of fast, safe, and highly-available diagnostic tools. Ultrasound, in contrast to CT or X-Ray, has many practical advantages and can serve as a globally-applicable first-line examination technique. We provide the largest publicly available lung ultrasound (US) dataset for COVID-19 consisting of 106 videos from three classes (COVID-19, bacterial pneumonia, and healthy controls); curated and approved by medical experts. On this dataset, we perform an in-depth study of the value of deep learning methods for differential diagnosis of COVID-19. We propose a frame-based convolutional neural network that correctly classifies COVID-19 US videos with a sensitivity of 0.98+-0.04 and a specificity of 0.91+-08 (frame-based sensitivity 0.93+-0.05, specificity 0.87+-0.07). We further employ class activation maps for the spatio-temporal localization of pulmonary biomarkers, which we subsequently validate for human-in-the-loop scenarios in a blindfolded study with medical experts. Aiming for scalability and robustness, we perform ablation studies comparing mobile-friendly, frame- and video-based architectures and show reliability of the best model by aleatoric and epistemic uncertainty estimates. We hope to pave the road for a community effort toward an accessible, efficient and interpretable screening method and we have started to work on a clinical validation of the proposed method. Data and code are publicly available.
△ Less
Submitted 13 September, 2020;
originally announced September 2020.
-
Quantification of the expected residual dispersion of the MICADO Near-IR imaging instrument
Authors:
J. A. van den Born,
W. Jellema
Abstract:
MICADO, a near-infrared imager for the Extremely Large Telescope, is being designed to deliver diffraction limited imaging and 50 micro arcsecond ($μ$as) astrometric accuracy. MICADO employs an atmospheric dispersion corrector (ADC) to keep the chromatic elongation of the point spread function (PSF) under control. We must understand the dispersion and residuals after correction to reach the optimu…
▽ More
MICADO, a near-infrared imager for the Extremely Large Telescope, is being designed to deliver diffraction limited imaging and 50 micro arcsecond ($μ$as) astrometric accuracy. MICADO employs an atmospheric dispersion corrector (ADC) to keep the chromatic elongation of the point spread function (PSF) under control. We must understand the dispersion and residuals after correction to reach the optimum performance. Therefore, we identified several sources of chromatic dispersion that need to be considered for the MICADO ADC. First, we compared common models of atmospheric dispersion to investigate whether these models remain suitable for MICADO. We showed that the differential dispersion between common atmospheric models and integration over the full atmosphere is less than 10 $μ$as for most observations in H-band. We then performed an error propagation analysis to understand the uncertainty in the atmospheric dispersion as a function of atmospheric conditions. In addition, we investigated the impact of photometric color on the astrometric performance. While the differential refraction between stars within the same field of view can be significant, the inclusion of an ADC rendered this effect negligible. For MICADO specifically, we found that the current optomechanical design dominates the residual dispersion budget of 0.4 milli arcseconds (mas), with a contribution of 0.31 mas due to the positioning accuracy of the prisms and up to 0.15 mas due to a mismatch between the dispersive properties of the glass and the atmosphere. We found no showstoppers in the design of the MICADO ADC for achieving 50 $μ$as relative astrometric accuracy.
△ Less
Submitted 22 June, 2020;
originally announced June 2020.
-
PaccMann$^{RL}$ on SARS-CoV-2: Designing antiviral candidates with conditional generative models
Authors:
Jannis Born,
Matteo Manica,
Joris Cadow,
Greta Markert,
Nil Adell Mill,
Modestas Filipavicius,
María Rodríguez Martínez
Abstract:
With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand--protein binding affin…
▽ More
With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand--protein binding affinity model on predicting affinities of antiviral compounds to target proteins and couple this model with pharmacological toxicity predictors. Exploiting this multi-objective as a reward function of a conditional molecular generator (consisting of two VAEs), we showcase a framework that navigates the chemical space toward regions with more antiviral molecules. Specifically, we explore a challenging setting of generating ligands against unseen protein targets by performing a leave-one-out-cross-validation on 41 SARS-CoV-2-related target proteins. Using deep RL, it is demonstrated that in 35 out of 41 cases, the generation is biased towards sampling more binding ligands, with an average increase of 83% comparing to an unbiased VAE. We present a case-study on a potential Envelope-protein inhibitor and perform a synthetic accessibility assessment of the best generated molecules is performed that resembles a viable roadmap towards a rapid in-vitro evaluation of potential SARS-CoV-2 inhibitors.
△ Less
Submitted 6 July, 2020; v1 submitted 27 May, 2020;
originally announced May 2020.
-
POCOVID-Net: Automatic Detection of COVID-19 From a New Lung Ultrasound Imaging Dataset (POCUS)
Authors:
Jannis Born,
Gabriel Brändle,
Manuel Cossio,
Marion Disdier,
Julie Goulet,
Jérémie Roulin,
Nina Wiedemann
Abstract:
With the rapid development of COVID-19 into a global pandemic, there is an ever more urgent need for cheap, fast and reliable tools that can assist physicians in diagnosing COVID-19. Medical imaging such as CT can take a key role in complementing conventional diagnostic tools from molecular biology, and, using deep learning techniques, several automatic systems were demonstrated promising performa…
▽ More
With the rapid development of COVID-19 into a global pandemic, there is an ever more urgent need for cheap, fast and reliable tools that can assist physicians in diagnosing COVID-19. Medical imaging such as CT can take a key role in complementing conventional diagnostic tools from molecular biology, and, using deep learning techniques, several automatic systems were demonstrated promising performances using CT or X-ray data. Here, we advocate a more prominent role of point-of-care ultrasound imaging to guide COVID-19 detection. Ultrasound is non-invasive and ubiquitous in medical facilities around the globe. Our contribution is threefold. First, we gather a lung ultrasound (POCUS) dataset consisting of 1103 images (654 COVID-19, 277 bacterial pneumonia and 172 healthy controls), sampled from 64 videos. This dataset was assembled from various online sources, processed specifically for deep learning models and is intended to serve as a starting point for an open-access initiative. Second, we train a deep convolutional neural network (POCOVID-Net) on this 3-class dataset and achieve an accuracy of 89% and, by a majority vote, a video accuracy of 92% . For detecting COVID-19 in particular, the model performs with a sensitivity of 0.96, a specificity of 0.79 and F1-score of 0.92 in a 5-fold cross validation. Third, we provide an open-access web service (POCOVIDScreen) that is available at: https://pocovidscreen.org. The website deploys the predictive model, allowing to perform predictions on ultrasound lung images. In addition, it grants medical staff the option to (bulk) upload their own screenings in order to contribute to the growing public database of pathological lung ultrasound images.
Dataset and code are available from: https://github.com/jannisborn/covid19_pocus_ultrasound.
NOTE: This preprint is superseded by our paper in Applied Sciences: https://doi.org/10.3390/app11020672
△ Less
Submitted 24 January, 2021; v1 submitted 25 April, 2020;
originally announced April 2020.
-
CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models
Authors:
Vijil Chenthamarakshan,
Payel Das,
Samuel C. Hoffman,
Hendrik Strobelt,
Inkit Padhi,
Kar Wai Lim,
Benjamin Hoover,
Matteo Manica,
Jannis Born,
Teodoro Laino,
Aleksandra Mojsilovic
Abstract:
The novel nature of SARS-CoV-2 calls for the development of efficient de novo drug design approaches. In this study, we propose an end-to-end framework, named CogMol (Controlled Generation of Molecules), for designing new drug-like small molecules targeting novel viral proteins with high affinity and off-target selectivity. CogMol combines adaptive pre-training of a molecular SMILES Variational Au…
▽ More
The novel nature of SARS-CoV-2 calls for the development of efficient de novo drug design approaches. In this study, we propose an end-to-end framework, named CogMol (Controlled Generation of Molecules), for designing new drug-like small molecules targeting novel viral proteins with high affinity and off-target selectivity. CogMol combines adaptive pre-training of a molecular SMILES Variational Autoencoder (VAE) and an efficient multi-attribute controlled sampling scheme that uses guidance from attribute predictors trained on latent features. To generate novel and optimal drug-like molecules for unseen viral targets, CogMol leverages a protein-molecule binding affinity predictor that is trained using SMILES VAE embeddings and protein sequence embeddings learned unsupervised from a large corpus. CogMol framework is applied to three SARS-CoV-2 target proteins: main protease, receptor-binding domain of the spike protein, and non-structural protein 9 replicase. The generated candidates are novel at both molecular and chemical scaffold levels when compared to the training data. CogMol also includes insilico screening for assessing toxicity of parent molecules and their metabolites with a multi-task toxicity classifier, synthetic feasibility with a chemical retrosynthesis predictor, and target structure binding with docking simulations. Docking reveals favorable binding of generated molecules to the target protein structure, where 87-95 % of high affinity molecules showed docking free energy < -6 kcal/mol. When compared to approved drugs, the majority of designed compounds show low parent molecule and metabolite toxicity and high synthetic feasibility. In summary, CogMol handles multi-constraint design of synthesizable, low-toxic, drug-like molecules with high target specificity and selectivity, and does not need target-dependent fine-tuning of the framework or target structure information.
△ Less
Submitted 23 June, 2020; v1 submitted 2 April, 2020;
originally announced April 2020.
-
PaccMann$^{RL}$: Designing anticancer drugs from transcriptomic data via reinforcement learning
Authors:
Jannis Born,
Matteo Manica,
Ali Oskooei,
Joris Cadow,
Karsten Borgwardt,
María Rodríguez Martínez
Abstract:
With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capa…
▽ More
With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capable of tailoring anticancer compounds for a specific biomolecular profile. Using a RL framework, the transcriptomic profiles of cancer cells are used as a context for the generation of candidate molecules. Our molecule generator combines two separately pretrained variational autoencoders (VAEs) - the first VAE encodes transcriptomic profiles into a smooth, latent space which in turn is used to condition a second VAE to generate novel molecular structures on the given transcriptomic profile. The generative process is optimized through PaccMann, a previously developed drug sensitivity prediction model to obtain effective anticancer compounds for the given context (i.e., transcriptomic profile). We demonstrate how the molecule generation can be biased towards compounds with high predicted inhibitory effect against individual cell lines or specific cancer sites. We verify our approach by investigating candidate drugs generated against specific cancer types and find the highest structural similarity to existing compounds with known efficacy against these cancer types. We envision our approach to transform in silico anticancer drug design by leveraging the biomolecular characteristics of the disease in order to increase success rates in lead compound discovery.
△ Less
Submitted 16 April, 2020; v1 submitted 29 August, 2019;
originally announced September 2019.
-
Towards Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-based Convolutional Encoders
Authors:
Matteo Manica,
Ali Oskooei,
Jannis Born,
Vigneshwari Subramanian,
Julio Sáez-Rodríguez,
María Rodríguez Martínez
Abstract:
In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors and prior kn…
▽ More
In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors and prior knowledge on intracellular interactions from protein-protein interaction networks. We demonstrate that our multiscale convolutional attention-based (MCA) encoder significantly outperforms a baseline model trained on Morgan fingerprints, a selection of encoders based on SMILES as well as previously reported state of the art for multimodal drug sensitivity prediction (R2 = 0.86 and RMSE = 0.89). Moreover, the explainability of our approach is demonstrated by a thorough analysis of the attention weights. We show that the attended genes significantly enrich apoptotic processes and that the drug attention is strongly correlated with a standard chemical structure similarity index. Finally, we report a case study of two receptor tyrosine kinase (RTK) inhibitors acting on a leukemia cell line, showcasing the ability of the model to focus on informative genes and submolecular regions of the two compounds. The demonstrated generalizability and the interpretability of our model testify its potential for in-silico prediction of anticancer compound efficacy on unseen cancer cells, positioning it as a valid solution for the development of personalized therapies as well as for the evaluation of candidate compounds in de novo drug design.
△ Less
Submitted 14 July, 2019; v1 submitted 25 April, 2019;
originally announced April 2019.
-
PaccMann: Prediction of anticancer compound sensitivity with multi-modal attention-based neural networks
Authors:
Ali Oskooei,
Jannis Born,
Matteo Manica,
Vigneshwari Subramanian,
Julio Sáez-Rodríguez,
María Rodríguez Martínez
Abstract:
We present a novel approach for the prediction of anticancer compound sensitivity by means of multi-modal attention-based neural networks (PaccMann). In our approach, we integrate three key pillars of drug sensitivity, namely, the molecular structure of compounds, transcriptomic profiles of cancer cells as well as prior knowledge about interactions among proteins within cells. Our models ingest a…
▽ More
We present a novel approach for the prediction of anticancer compound sensitivity by means of multi-modal attention-based neural networks (PaccMann). In our approach, we integrate three key pillars of drug sensitivity, namely, the molecular structure of compounds, transcriptomic profiles of cancer cells as well as prior knowledge about interactions among proteins within cells. Our models ingest a drug-cell pair consisting of SMILES encoding of a compound and the gene expression profile of a cancer cell and predicts an IC50 sensitivity value. Gene expression profiles are encoded using an attention-based encoding mechanism that assigns high weights to the most informative genes. We present and study three encoders for SMILES string of compounds: 1) bidirectional recurrent 2) convolutional 3) attention-based encoders. We compare our devised models against a baseline model that ingests engineered fingerprints to represent the molecular structure. We demonstrate that using our attention-based encoders, we can surpass the baseline model. The use of attention-based encoders enhance interpretability and enable us to identify genes, bonds and atoms that were used by the network to make a prediction.
△ Less
Submitted 14 July, 2019; v1 submitted 16 November, 2018;
originally announced November 2018.
-
Classifying discourse in a CSCL platform to evaluate correlations with Teacher Participation and Progress
Authors:
Eliana Scheihing,
Matthieu Vernier,
Javiera Born,
Julio Guerra,
Luis Carcamo
Abstract:
In Computer-Supported learning, monitoring and engaging a group of learners is a complex task for teachers, especially when learners are working collaboratively: Are my students motivated? What kind of progress are they making? Should I intervene? Is my communication and the didactic design adapted to my students? Our hypothesis is that the analysis of natural language interactions between student…
▽ More
In Computer-Supported learning, monitoring and engaging a group of learners is a complex task for teachers, especially when learners are working collaboratively: Are my students motivated? What kind of progress are they making? Should I intervene? Is my communication and the didactic design adapted to my students? Our hypothesis is that the analysis of natural language interactions between students, and between students and teachers, provide very valuable information and could be used to produce qualitative indicators to help teachers' decisions. We develop an automatic approach in three steps (1) to explore the discursive functions of messages in a CSCL platform, (2) to classify the messages automatically and (3) to evaluate correlations between discursive attitudes and other variables linked to the learning activity. Results tend to show that some types of discourse are correlated with a notion of Progress on the learning activities and the importance of emotive participation from the Teacher.
△ Less
Submitted 23 May, 2016;
originally announced May 2016.
-
Induction of slow oscillations by rhythmic acoustic stimulation
Authors:
Hong-Viet V. Ngo,
Jens Christian Claussen,
Jan Born,
Matthias Mölle
Abstract:
Slow oscillations are electrical potential oscillations with a spectral peak frequency of $\sim$0.8 Hz, and hallmark the electroencephalogram during slow-wave sleep. Recent studies have indicated a causal contribution of slow oscillations to the consolidation of memories during slow-wave sleep, raising the question to what extent such oscillations can be induced by external stimulation. Here, we e…
▽ More
Slow oscillations are electrical potential oscillations with a spectral peak frequency of $\sim$0.8 Hz, and hallmark the electroencephalogram during slow-wave sleep. Recent studies have indicated a causal contribution of slow oscillations to the consolidation of memories during slow-wave sleep, raising the question to what extent such oscillations can be induced by external stimulation. Here, we examined whether slow oscillations can be effectively induced by rhythmic acoustic stimulation. Human subjects were examined in three conditions: (i) with tones presented at a rate of 0.8 Hz (`0.8-Hz stimulation'); (ii) with tones presented at a random sequence (`random stimulation'); and (iii) with no tones presented in a control condition (`sham'). Stimulation started during wakefulness before sleep and continued for the first $\sim$90 min of sleep. Compared with the other two conditions, 0.8-Hz stimulation significantly delayed sleep onset. However, once sleep was established, 0.8-Hz stimulation significantly increased and entrained endogenous slow oscillation activity. Sleep after the 90-min period of stimulation did not differ between the conditions. Our data show that rhythmic acoustic stimulation can be used to effectively enhance slow oscillation activity. However, the effect depends on the brain state, requiring the presence of stable non-rapid eye movement sleep.
△ Less
Submitted 30 July, 2014;
originally announced July 2014.