Skip to main content

Showing 1–21 of 21 results for author: Stevens, R

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2406.07025  [pdf, other

    cs.LG cs.AI q-bio.QM stat.ML

    Entropy-Reinforced Planning with Large Language Models for Drug Discovery

    Authors: Xuefeng Liu, Chih-chan Tien, Peng Ding, Songhao Jiang, Rick L. Stevens

    Abstract: The objective of drug discovery is to identify chemical compounds that possess specific pharmaceutical properties toward a binding target. Existing large language models (LLMS) can achieve high token matching scores in terms of likelihood for molecule generation. However, relying solely on LLM decoding often results in the generation of molecules that are either invalid due to a single misused tok… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Published in ICML2024

  2. arXiv:2310.11329  [pdf

    q-bio.QM

    Influencing factors on false positive rates when classifying tumor cell line response to drug treatment

    Authors: Priyanka Vasanthakumari, Thomas Brettin, Yitan Zhu, Hyunseung Yoo, Maulik Shukla, Alexander Partin, Fangfang Xia, Oleksandr Narykov, Rick L. Stevens

    Abstract: Informed selection of drug candidates for laboratory experimentation provides an efficient means of identifying suitable anti-cancer treatments. The advancement of artificial intelligence has led to the development of computational models to predict cancer cell line response to drug treatment. It is important to analyze the false positive rate (FPR) of the models, to increase the number of effecti… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  3. arXiv:2308.01921  [pdf, other

    q-bio.BM cs.AI cs.LG

    Transferable Graph Neural Fingerprint Models for Quick Response to Future Bio-Threats

    Authors: Wei Chen, Yihui Ren, Ai Kagawa, Matthew R. Carbone, Samuel Yen-Chi Chen, Xiaohui Qu, Shinjae Yoo, Austin Clyde, Arvind Ramanathan, Rick L. Stevens, Hubertus J. J. van Dam, Deyu Lu

    Abstract: Fast screening of drug molecules based on the ligand binding affinity is an important step in the drug discovery pipeline. Graph neural fingerprint is a promising method for develo** molecular docking surrogates with high throughput and great fidelity. In this study, we built a COVID-19 drug docking dataset of about 300,000 drug candidates on 23 coronavirus protein targets. With this dataset, we… ▽ More

    Submitted 14 September, 2023; v1 submitted 17 July, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures, 2 tables, accepted by ICLMA2023

    ACM Class: I.2.1

  4. arXiv:2304.03210  [pdf, other

    q-bio.MN cs.DC

    Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery

    Authors: Ashka Shah, Arvind Ramanathan, Valerie Hayot-Sasson, Rick Stevens

    Abstract: Causal discovery of genome-scale networks is important for identifying pathways from genes to observable traits - e.g. differences in cell function, disease, drug resistance and others. Causal learners based on graphical models rely on interventional samples to orient edges in the network. However, these models have not been shown to scale up the size of the genome, which are on the order of 1e3-1… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: To be published in Platform for Advanced Scientific Computing 2023 (PASC23) conference proceedings

  5. arXiv:2303.04630  [pdf

    cs.LG q-bio.QM stat.AP

    Mining the contribution of intensive care clinical course to outcome after traumatic brain injury

    Authors: Shubhayu Bhattacharyay, Pier Francesco Caruso, Cecilia Åkerlund, Lindsay Wilson, Robert D Stevens, David K Menon, Ewout W Steyerberg, David W Nelson, Ari Ercole, the CENTER-TBI investigators/participants

    Abstract: Existing methods to characterise the evolving condition of traumatic brain injury (TBI) patients in the intensive care unit (ICU) do not capture the context necessary for individualising treatment. Here, we integrate all heterogenous data stored in medical records (1,166 pre-ICU and ICU variables) to model the individualised contribution of clinical course to six-month functional outcome on the Gl… ▽ More

    Submitted 1 August, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Journal ref: npj Digit. Med. 6, 154 (2023)

  6. arXiv:2211.10442  [pdf, other

    q-bio.QM cs.LG

    Deep learning methods for drug response prediction in cancer: predominant and emerging trends

    Authors: Alexander Partin, Thomas S. Brettin, Yitan Zhu, Oleksandr Narykov, Austin Clyde, Jamie Overbeek, Rick L. Stevens

    Abstract: Cancer claims millions of lives yearly worldwide. While many therapies have been made available in recent years, by in large cancer remains unsolved. Exploiting computational predictive models to study and treat cancer holds great promise in improving drug development and personalized design of treatment plans, ultimately suppressing tumors, alleviating suffering, and prolonging lives of patients.… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  7. arXiv:2206.12179  [pdf

    stat.AP physics.ao-ph q-bio.QM

    How is model-related uncertainty quantified and reported in different disciplines?

    Authors: Emily G. Simmonds, Kwaku Peprah Adjei, Christoffer Wold Andersen, Janne Cathrin Hetle Aspheim, Claudia Battistin, Nicola Bulso, Hannah Christensen, Benjamin Cretois, Ryan Cubero, Ivan A. Davidovich, Lisa Dickel, Benjamin Dunn, Etienne Dunn-Sigouin, Karin Dyrstad, Sigurd Einum, Donata Giglio, Haakon Gjerlow, Amelie Godefroidt, Ricardo Gonzalez-Gil, Soledad Gonzalo Cogno, Fabian Grosse, Paul Halloran, Mari F. Jensen, John James Kennedy, Peter Egge Langsaether , et al. (18 additional authors not shown)

    Abstract: How do we know how much we know? Quantifying uncertainty associated with our modelling work is the only way we can answer how much we know about any phenomenon. With quantitative science now highly influential in the public sphere and the results from models translating into action, we must support our conclusions with sufficient rigour to produce useful, reproducible results. Incomplete considera… ▽ More

    Submitted 1 July, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: 40 Pages (including supporting information), 3 Figures, 2 Boxes, 1 Table

  8. arXiv:2204.11678  [pdf, other

    q-bio.QM

    Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images

    Authors: Alexander Partin, Thomas Brettin, Yitan Zhu, James M. Dolezal, Sara Kochanny, Alexander T. Pearson, Maulik Shukla, Yvonne A. Evrard, James H. Doroshow, Rick L. Stevens

    Abstract: Patient-derived xenografts (PDXs) are an appealing platform for preclinical drug studies because the in vivo environment of PDXs helps preserve tumor heterogeneity and usually better mimics drug response of patients with cancer compared to CCLs. We investigate multimodal neural network (MM-Net) and data augmentation for drug response prediction in PDXs. The MM-Net learns to predict response using… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  9. arXiv:2109.05012  [pdf, other

    q-bio.QM q-bio.BM

    Scaffold-Induced Molecular Graph (SIMG): Effective Graph Sampling Methods for High-Throughput Computational Drug Discovery

    Authors: Austin Clyde, Ashka Shah, Max Zvyagin, Arvind Ramanathan, Rick Stevens

    Abstract: Scaffold based drug discovery (SBDD) is a technique for drug discovery which pins chemical scaffolds as the framework of design. Scaffolds, or molecular frameworks, organize the design of compounds into local neighborhoods. We formalize scaffold based drug discovery into a network design. Utilizing docking data from SARS-CoV-2 virtual screening studies and JAK2 kinase assay data, we showcase how a… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

  10. arXiv:2106.07036  [pdf, other

    q-bio.BM cs.LG

    Protein-Ligand Docking Surrogate Models: A SARS-CoV-2 Benchmark for Deep Learning Accelerated Virtual Screening

    Authors: Austin Clyde, Thomas Brettin, Alexander Partin, Hyunseung Yoo, Yadu Babuji, Ben Blaiszik, Andre Merzky, Matteo Turilli, Shantenu Jha, Arvind Ramanathan, Rick Stevens

    Abstract: We propose a benchmark to study surrogate model accuracy for protein-ligand docking. We share a dataset consisting of 200 million 3D complex structures and 2D structure scores across a consistent set of 13 million "in-stock" molecules over 15 receptors, or binding sites, across the SARS-CoV-2 proteome. Our work shows surrogate docking models have six orders of magnitude more throughput than standa… ▽ More

    Submitted 30 June, 2021; v1 submitted 13 June, 2021; originally announced June 2021.

  11. arXiv:2106.02190  [pdf, other

    cs.LG cs.AI q-bio.BM

    Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery

    Authors: Yulun Wu, Mikaela Cashman, Nicholas Choma, Érica T. Prates, Verónica G. Melesse Vergara, Manesh Shah, Andrew Chen, Austin Clyde, Thomas S. Brettin, Wibe A. de Jong, Neeraj Kumar, Martha S. Head, Rick L. Stevens, Peter Nugent, Daniel A. Jacobson, James B. Brown

    Abstract: We developed Distilled Graph Attention Policy Network (DGAPN), a reinforcement learning model to generate novel graph-structured chemical representations that optimize user-defined objectives by efficiently navigating a physically constrained domain. The framework is examined on the task of generating molecules that are designed to bind, noncovalently, to functional sites of SARS-CoV-2 proteins. W… ▽ More

    Submitted 11 May, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

  12. arXiv:2104.08961  [pdf, other

    q-bio.QM

    A cross-study analysis of drug response prediction in cancer cell lines

    Authors: Fangfang Xia, Jonathan Allen, Prasanna Balaprakash, Thomas Brettin, Cristina Garcia-Cardona, Austin Clyde, Judith Cohn, James Doroshow, Xiaotian Duan, Veronika Dubinkina, Yvonne Evrard, Ya Ju Fan, Jason Gans, Stewart He, Pinyi Lu, Sergei Maslov, Alexander Partin, Maulik Shukla, Eric Stahlberg, Justin M. Wozniak, Hyunseung Yoo, George Zaki, Yitan Zhu, Rick Stevens

    Abstract: To enable personalized cancer treatment, machine learning models have been developed to predict drug response as a function of tumor and drug features. However, most algorithm development efforts have relied on cross validation within a single study to assess model accuracy. While an essential first step, cross validation within a biological data set typically provides an overly optimistic estimat… ▽ More

    Submitted 13 August, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: Accepted by Briefings in Bioinformatics

  13. arXiv:2103.02843  [pdf

    cs.DC cs.CE cs.LG physics.bio-ph q-bio.QM

    Pandemic Drugs at Pandemic Speed: Infrastructure for Accelerating COVID-19 Drug Discovery with Hybrid Machine Learning- and Physics-based Simulations on High Performance Computers

    Authors: Agastya P. Bhati, Shunzhou Wan, Dario Alfè, Austin R. Clyde, Mathis Bode, Li Tan, Mikhail Titov, Andre Merzky, Matteo Turilli, Shantenu Jha, Roger R. Highfield, Walter Rocchia, Nicola Scafuri, Sauro Succi, Dieter Kranzlmüller, Gerald Mathias, David Wifling, Yann Donon, Alberto Di Meglio, Sofia Vallecorsa, Heng Ma, Anda Trifan, Arvind Ramanathan, Tom Brettin, Alexander Partin , et al. (4 additional authors not shown)

    Abstract: The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods… ▽ More

    Submitted 4 September, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Journal ref: Interface Focus. 2021. 11 (6): 20210018

  14. arXiv:2011.12466  [pdf, other

    q-bio.QM cs.LG

    Learning Curves for Drug Response Prediction in Cancer Cell Lines

    Authors: Alexander Partin, Thomas Brettin, Yvonne A. Evrard, Yitan Zhu, Hyunseung Yoo, Fangfang Xia, Songhao Jiang, Austin Clyde, Maulik Shukla, Michael Fonstein, James H. Doroshow, Rick Stevens

    Abstract: Motivated by the size of cell line drug sensitivity data, researchers have been develo** machine learning (ML) models for predicting drug response to advance cancer treatment. As drug sensitivity studies continue generating data, a common question is whether the proposed predictors can further improve the generalization performance with more training data. We utilize empirical learning curves fo… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: 14 pages, 7 figures

  15. arXiv:2010.06574  [pdf, other

    cs.DC cs.CE q-bio.QM

    IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads

    Authors: Aymen Al Saadi, Dario Alfe, Yadu Babuji, Agastya Bhati, Ben Blaiszik, Thomas Brettin, Kyle Chard, Ryan Chard, Peter Coveney, Anda Trifan, Alex Brace, Austin Clyde, Ian Foster, Tom Gibbs, Shantenu Jha, Kristopher Keipert, Thorsten Kurth, Dieter Kranzlmüller, Hyungro Lee, Zhuozhao Li, Heng Ma, Andre Merzky, Gerald Mathias, Alexander Partin, Junqi Yin , et al. (11 additional authors not shown)

    Abstract: The drug discovery process currently employed in the pharmaceutical industry typically requires about 10 years and $2-3 billion to deliver one new drug. This is both too expensive and too slow, especially in emergencies like the COVID-19 pandemic. In silicomethodologies need to be improved to better select lead compounds that can proceed to later stages of the drug discovery protocol accelerating… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

  16. arXiv:2006.02431  [pdf, other

    q-bio.BM cs.LG q-bio.QM stat.ML

    Targeting SARS-CoV-2 with AI- and HPC-enabled Lead Generation: A First Data Release

    Authors: Yadu Babuji, Ben Blaiszik, Tom Brettin, Kyle Chard, Ryan Chard, Austin Clyde, Ian Foster, Zhi Hong, Shantenu Jha, Zhuozhao Li, Xuefeng Liu, Arvind Ramanathan, Yi Ren, Nicholaus Saint, Marcus Schwarting, Rick Stevens, Hubertus van Dam, Rick Wagner

    Abstract: Researchers across the globe are seeking to rapidly repurpose existing drugs or discover new drugs to counter the the novel coronavirus disease (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). One promising approach is to train machine learning (ML) and artificial intelligence (AI) tools to screen large numbers of small molecules. As a contribution to that effort,… ▽ More

    Submitted 27 May, 2020; originally announced June 2020.

    Comments: 11 pages, 5 figures

  17. arXiv:2006.01171  [pdf, other

    q-bio.QM cs.LG stat.ML

    Regression Enrichment Surfaces: a Simple Analysis Technique for Virtual Drug Screening Models

    Authors: Austin Clyde, Xiaotian Duan, Rick Stevens

    Abstract: We present a new method for understanding the performance of a model in virtual drug screening tasks. While most virtual screening problems present as a mix between ranking and classification, the models are typically trained as regression models presenting a problem requiring either a choice of a cutoff or ranking measure. Our method, regression enrichment surfaces (RES), is based on the goal of… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

  18. arXiv:2005.09572  [pdf

    q-bio.QM cs.LG

    Ensemble Transfer Learning for the Prediction of Anti-Cancer Drug Response

    Authors: Yitan Zhu, Thomas Brettin, Yvonne A. Evrard, Alexander Partin, Fangfang Xia, Maulik Shukla, Hyunseung Yoo, James H. Doroshow, Rick Stevens

    Abstract: Transfer learning has been shown to be effective in many applications in which training data for the target problem are limited but data for a related (source) problem are abundant. In this paper, we apply transfer learning to the prediction of anti-cancer drug response. Previous transfer learning studies for drug response prediction focused on building models that predict the response of tumor ce… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  19. arXiv:2005.00095  [pdf, other

    cs.LG q-bio.GN q-bio.QM

    A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning

    Authors: Austin Clyde, Tom Brettin, Alexander Partin, Maulik Shaulik, Hyunseung Yoo, Yvonne Evrard, Yitan Zhu, Fangfang Xia, Rick Stevens

    Abstract: By combining various cancer cell line (CCL) drug screening panels, the size of the data has grown significantly to begin understanding how advances in deep learning can advance drug response predictions. In this paper we train >35,000 neural network models, swee** over common featurization techniques. We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 f… ▽ More

    Submitted 4 May, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

  20. arXiv:2004.09673  [pdf, other

    q-bio.QM cs.LG eess.IV

    Neural Network Segmentation of Cell Ultrastructure Using Incomplete Annotation

    Authors: John Paul Francis, Hongzhi Wang, Kate White, Tanveer Syeda-Mahmood, Raymond Stevens

    Abstract: The Pancreatic beta cell is an important target in diabetes research. For scalable modeling of beta cell ultrastructure, we investigate automatic segmentation of whole cell imaging data acquired through soft X-ray tomography. During the course of the study, both complete and partial ultrastructure annotations were produced manually for different subsets of the data. To more effectively use existin… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

  21. arXiv:1607.01224  [pdf, other

    stat.ML q-bio.QM

    Machine Learning for Antimicrobial Resistance

    Authors: John W. Santerre, James J. Davis, Fangfang Xia, Rick Stevens

    Abstract: Biological datasets amenable to applied machine learning are more available today than ever before, yet they lack adequate representation in the Data-for-Good community. Here we present a work in progress case study performing analysis on antimicrobial resistance (AMR) using standard ensemble machine learning techniques and note the successes and pitfalls such work entails. Broadly, applied machin… ▽ More

    Submitted 5 July, 2016; originally announced July 2016.

    Comments: presented at 2016 ICML Workshop on #Data4Good: Machine Learning in Social Good Applications, New York, NY