Showing 1–5 of 5 results for author: Cadow, J

  1. Accelerating Material Design with the Generative Toolkit for Scientific Discovery

    Authors: Matteo Manica, Jannis Born, Joris Cadow, Dimitrios Christofidellis, Ashish Dave, Dean Clarke, Yves Gaetan Nana Teukam, Giorgio Giannone, Samuel C. Hoffman, Matthew Buchan, Vijil Chenthamarakshan, Timothy Donovan, Hsiang Han Hsu, Federico Zipoli, Oliver Schilter, Akihiro Kishimoto, Lisa Hamada, Inkit Padhi, Karl Wehden, Lauren McHugh, Alexy Khrabrov, Payel Das, Seiji Takeda, John R. Smith

    Abstract: With the growing availability of data within various scientific domains, generative models hold enormous potential to accelerate scientific discovery. They harness powerful representations learned from datasets to speed up the formulation of novel hypotheses with the potential to impact material discovery broadly. We present the Generative Toolkit for Scientific Discovery (GT4SD). This extensible…

    Submitted 31 January, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: 15 pages, 2 figures

    Journal ref: Nature Partner Journals (npj) Computational Materials 9, 69 (2023)

  2. arXiv:2012.03084 [pdf, other]

    q-bio.BM cs.CL

    Pre-training Protein Language Models with Label-Agnostic Binding Pairs Enhances Performance in Downstream Tasks

    Authors: Modestas Filipavicius, Matteo Manica, Joris Cadow, Maria Rodriguez Martinez

    Abstract: Less than 1% of protein sequences are structurally and functionally annotated. Natural Language Processing (NLP) community has recently embraced self-supervised learning as a powerful approach to learn representations from unlabeled text, in large part due to the attention-based context-aware Transformer models. In this work we present a modification to the RoBERTa model by inputting during pre-tr…

    Submitted 5 December, 2020; originally announced December 2020.

    Comments: 20 pages, 12 figures, accepted to Machine Learning for Structural Biology (MLSB) workshop at the 34th Conference on Neural Information Processing Systems (NeurIPS)

  3. arXiv:2005.13285 [pdf, other]

    q-bio.QM cs.LG stat.ML

    PaccMann$^{RL}$ on SARS-CoV-2: Designing antiviral candidates with conditional generative models

    Authors: Jannis Born, Matteo Manica, Joris Cadow, Greta Markert, Nil Adell Mill, Modestas Filipavicius, María Rodríguez Martínez

    Abstract: With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand--protein binding affin…

    Submitted 6 July, 2020; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: 5 pages, 6 figures

    Journal ref: ICML Workshop on Computational Biology 2020

  4. arXiv:1909.05114 [pdf, other]

    q-bio.BM cs.LG stat.ML

    PaccMann$^{RL}$: Designing anticancer drugs from transcriptomic data via reinforcement learning

    Authors: Jannis Born, Matteo Manica, Ali Oskooei, Joris Cadow, Karsten Borgwardt, María Rodríguez Martínez

    Abstract: With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capa…

    Submitted 16 April, 2020; v1 submitted 29 August, 2019; originally announced September 2019.

    Comments: 18 pages total (12 pages main text, 4 pages references, 11 pages appendix), 8 figures

    Journal ref: International Conference on Research in Computational Molecular Biology 2020

  5. PIMKL: Pathway Induced Multiple Kernel Learning

    Authors: Matteo Manica, Joris Cadow, Roland Mathis, María Rodríguez Martínez

    Abstract: Reliable identification of molecular biomarkers is essential for accurate patient stratification. While state-of-the-art machine learning approaches for sample classification continue to push boundaries in terms of performance, most of these methods are not able to integrate different data types and lack generalization power, limiting their application in a clinical setting. Furthermore, many meth…

    Submitted 5 July, 2018; v1 submitted 29 March, 2018; originally announced March 2018.

    Journal ref: npj Systems Biology and Applications (2019)