-
Antibody Representation Learning for Drug Discovery
Authors:
Lin Li,
Esther Gupta,
John Spaeth,
Leslie Shing,
Tristan Bepler,
Rajmonda Sulo Caceres
Abstract:
Therapeutic antibody development has become an increasingly popular approach for drug development. To date, antibody therapeutics are largely developed using large scale experimental screens of antibody libraries containing hundreds of millions of antibody sequences. The high cost and difficulty of develo** therapeutic antibodies create a pressing need for computational methods to predict antibo…
▽ More
Therapeutic antibody development has become an increasingly popular approach for drug development. To date, antibody therapeutics are largely developed using large scale experimental screens of antibody libraries containing hundreds of millions of antibody sequences. The high cost and difficulty of develo** therapeutic antibodies create a pressing need for computational methods to predict antibody properties and create bespoke designs. However, the relationship between antibody sequence and activity is a complex physical process and traditional iterative design approaches rely on large scale assays and random mutagenesis. Deep learning methods have emerged as a promising way to learn antibody property predictors, but predicting antibody properties and target-specific activities depends critically on the choice of antibody representations and data linking sequences to properties is often limited. Existing works have not yet investigated the value, limitations and opportunities of these methods in application to antibody-based drug discovery. In this paper, we present results on a novel SARS-CoV-2 antibody binding dataset and an additional benchmark dataset. We compare three classes of models: conventional statistical sequence models, supervised learning on each dataset independently, and fine-tuning an antibody specific pre-trained language model. Experimental results suggest that self-supervised pretraining of feature representation consistently offers significant improvement in over previous approaches. We also investigate the impact of data size on the model performance, and discuss challenges and opportunities that the machine learning community can address to advance in silico engineering and design of therapeutic antibodies.
△ Less
Submitted 5 October, 2022;
originally announced October 2022.
-
Flimma: a federated and privacy-preserving tool for differential gene expression analysis
Authors:
Olga Zolotareva,
Reza Nasirigerdeh,
Julian Matschinske,
Reihaneh Torkzadehmahani,
Tobias Frisch,
Julian Späth,
David B. Blumenthal,
Amir Abbasinejad,
Paolo Tieri,
Nina K. Wenke,
Markus List,
Jan Baumbach
Abstract:
Aggregating transcriptomics data across hospitals can increase sensitivity and robustness of differential expression analyses, yielding deeper clinical insights. As data exchange is often restricted by privacy legislation, meta-analyses are frequently employed to pool local results. However, if class labels are inhomogeneously distributed between cohorts, their accuracy may drop. Flimma (https://e…
▽ More
Aggregating transcriptomics data across hospitals can increase sensitivity and robustness of differential expression analyses, yielding deeper clinical insights. As data exchange is often restricted by privacy legislation, meta-analyses are frequently employed to pool local results. However, if class labels are inhomogeneously distributed between cohorts, their accuracy may drop. Flimma (https://exbio.wzw.tum.de/flimma/) addresses this issue by implementing the state-of-the-art workflow limma voom in a privacy-preserving manner, i.e. patient data never leaves its source site. Flimma results are identical to those generated by limma voom on combined datasets even in imbalanced scenarios where meta-analysis approaches fail.
△ Less
Submitted 23 November, 2020; v1 submitted 30 October, 2020;
originally announced October 2020.
-
Exploring the SARS-CoV-2 virus-host-drug interactome for drug repurposing
Authors:
Sepideh Sadegh,
Julian Matschinske,
David B. Blumenthal,
Gihanna Galindez,
Tim Kacprowski,
Markus List,
Reza Nasirigerdeh,
Mhaned Oubounyt,
Andreas Pichlmair,
Tim Daniel Rose,
Marisol Salgado-Albarrán,
Julian Späth,
Alexey Stukalov,
Nina K. Wenke,
Kevin Yuan,
Josch K. Pauling,
Jan Baumbach
Abstract:
Coronavirus Disease-2019 (COVID-19) is an infectious disease caused by the SARS-CoV-2 virus. It was first identified in Wuhan, China, and has since spread causing a global pandemic. Various studies have been performed to understand the molecular mechanisms of viral infection for predicting drug repurposing candidates. However, such information is spread across many publications and it is very time…
▽ More
Coronavirus Disease-2019 (COVID-19) is an infectious disease caused by the SARS-CoV-2 virus. It was first identified in Wuhan, China, and has since spread causing a global pandemic. Various studies have been performed to understand the molecular mechanisms of viral infection for predicting drug repurposing candidates. However, such information is spread across many publications and it is very time-consuming to access, integrate, explore, and exploit. We developed CoVex, the first interactive online platform for SARS-CoV-2 and SARS-CoV-1 host interactome exploration and drug (target) identification. CoVex integrates 1) experimentally validated virus-human protein interactions, 2) human protein-protein interactions and 3) drug-target interactions. The web interface allows user-friendly visual exploration of the virus-host interactome and implements systems medicine algorithms for network-based prediction of drugs. Thus, CoVex is an important resource, not only to understand the molecular mechanisms involved in SARS-CoV-2 and SARS-CoV-1 pathogenicity, but also in clinical research for the identification and prioritization of candidate therapeutics. We apply CoVex to investigate recent hypotheses on a systems biology level and to systematically explore the molecular mechanisms driving the virus life cycle. Furthermore, we extract and discuss drug repurposing candidates involved in these mechanisms. CoVex renders COVID-19 drug research systems-medicine-ready by giving the scientific community direct access to network medicine algorithms integrating virus-host-drug interactions. It is available at https://exbio.wzw.tum.de/covex/.
△ Less
Submitted 26 April, 2020;
originally announced April 2020.