Skip to main content

Showing 1–11 of 11 results for author: Quackenbush, J

.
  1. arXiv:2403.04805  [pdf, other

    cs.LG q-bio.QM stat.AP stat.ML

    Not all tickets are equal and we know it: Guiding pruning with domain-specific knowledge

    Authors: Intekhab Hossain, Jonas Fischer, Rebekka Burkholz, John Quackenbush

    Abstract: Neural structure learning is of paramount importance for scientific discovery and interpretability. Yet, contemporary pruning algorithms that focus on computational resource efficiency face algorithmic barriers to select a meaningful model that aligns with domain expertise. To mitigate this challenge, we propose DASH, which guides pruning by available domain-specific structural information. In the… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  2. arXiv:2107.02911  [pdf, other

    cs.LG stat.ML

    Scaling up Continuous-Time Markov Chains Helps Resolve Underspecification

    Authors: Alkis Gotovos, Rebekka Burkholz, John Quackenbush, Stefanie Jegelka

    Abstract: Modeling the time evolution of discrete sets of items (e.g., genetic mutations) is a fundamental problem in many biomedical applications. We approach this problem through the lens of continuous-time Markov chains, and show that the resulting learning task is generally underspecified in the usual setting of cross-sectional data. We explore a perhaps surprising remedy: including a number of addition… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  3. arXiv:2104.01690  [pdf, other

    q-bio.MN

    DRAGON: Determining Regulatory Associations using Graphical models on multi-Omic Networks

    Authors: Katherine H. Shutta, Deborah Weighill, Rebekka Burkholz, Marouen Ben Guebila, Dawn L. DeMeo, Helena U. Zacharias, John Quackenbush, Michael Altenbuchinger

    Abstract: The increasing quantity of multi-omics data, such as methylomic and transcriptomic profiles, collected on the same specimen, or even on the same cell, provide a unique opportunity to explore the complex interactions that define cell phenotype and govern cellular responses to perturbations. We propose a network approach based on Gaussian Graphical Models (GGMs) that facilitates the joint analysis o… ▽ More

    Submitted 21 September, 2022; v1 submitted 4 April, 2021; originally announced April 2021.

    Comments: 24 pages, 8 figures

  4. arXiv:2101.03985  [pdf

    q-bio.MN

    Gene targeting in disease networks

    Authors: Deborah Weighill, Marouen Ben Guebila, Kimberly Glass, John Platig, Jen Jen Yeh, John Quackenbush

    Abstract: Profiling of whole transcriptomes has become a cornerstone of molecular biology and an invaluable tool for the characterization of clinical phenotypes and the identification of disease subtypes. Analyses of these data are becoming ever more sophisticated as we move beyond simple comparisons to consider networks of higher-order interactions and associations. Gene regulatory networks model the regul… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

  5. The importance of transparency and reproducibility in artificial intelligence research

    Authors: Benjamin Haibe-Kains, George Alexandru Adam, Ahmed Hosny, Farnoosh Khodakarami, MAQC Society Board, Levi Waldron, Bo Wang, Chris McIntosh, Anshul Kundaje, Casey S. Greene, Michael M. Hoffman, Jeffrey T. Leek, Wolfgang Huber, Alvis Brazma, Joelle Pineau, Robert Tibshirani, Trevor Hastie, John P. A. Ioannidis, John Quackenbush, Hugo J. W. L. Aerts

    Abstract: In their study, McKinney et al. showed the high potential of artificial intelligence for breast cancer screening. However, the lack of detailed methods and computer code undermines its scientific value. We identify obstacles hindering transparent and reproducible AI research as faced by McKinney et al and provide solutions with implications for the broader field.

    Submitted 7 March, 2020; v1 submitted 28 February, 2020; originally announced March 2020.

    Journal ref: Nature 586 (2020) E14-E16

  6. arXiv:1909.05416  [pdf, other

    cs.AI cs.SI physics.soc-ph

    Cascade Size Distributions: Why They Matter and How to Compute Them Efficiently

    Authors: Rebekka Burkholz, John Quackenbush

    Abstract: Cascade models are central to understanding, predicting, and controlling epidemic spreading and information propagation. Related optimization, including influence maximization, model parameter inference, or the development of vaccination strategies, relies heavily on sampling from a model. This is either inefficient or inaccurate. As alternative, we present an efficient message passing algorithm t… ▽ More

    Submitted 16 December, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted at AAAI 2021

  7. arXiv:1703.01900  [pdf, other

    q-bio.QM cs.IR q-bio.GN q-bio.MN

    Network-based Distance Metric with Application to Discover Disease Subtypes in Cancer

    Authors: Jipeng Qiang, Wei Ding, John Quackenbush, ** Chen

    Abstract: While we once thought of cancer as single monolithic diseases affecting a specific organ site, we now understand that there are many subtypes of cancer defined by unique patterns of gene mutations. These gene mutational data, which can be more reliably obtained than gene expression data, help to determine how the subtypes develop, evolve, and respond to therapies. Different from dense continuous-v… ▽ More

    Submitted 28 February, 2017; originally announced March 2017.

  8. PyPanda: a Python Package for Gene Regulatory Network Reconstruction

    Authors: David G. P. van IJzendoorn, Kimberly Glass, John Quackenbush, Marieke L. Kuijjer

    Abstract: PANDA (Passing Attributes between Networks for Data Assimilation) is a gene regulatory network inference method that uses message-passing to integrate multiple sources of 'omics data. PANDA was originally coded in C++. In this application note we describe PyPanda, the Python version of PANDA. PyPanda runs considerably faster than the C++ version and includes additional features for network analysi… ▽ More

    Submitted 12 July, 2016; v1 submitted 22 April, 2016; originally announced April 2016.

    Comments: 3 pages, 1 figure; version after Bioinformatics proofs

  9. Bipartite Community Structure of eQTLs

    Authors: John Platig, Peter Castaldi, Dawn DeMeo, John Quackenbush

    Abstract: Genome Wide Association Studies (GWAS) and eQTL analyses have produced a large and growing number of genetic associations linked to a wide range of human phenotypes. As of 2013, there were more than 11,000 SNPs associated with a trait as reported in the NHGRI GWAS Catalog. However, interpreting the functional roles played by these SNPs remains a challenge. Here we describe an approach that uses th… ▽ More

    Submitted 9 September, 2015; originally announced September 2015.

  10. High Performance Computing of Gene Regulatory Networks using a Message-Passing Model

    Authors: Kimberly Glass, John Quackenbush, Jeremy Kepner

    Abstract: Gene regulatory network reconstruction is a fundamental problem in computational biology. We recently developed an algorithm, called PANDA (Passing Attributes Between Networks for Data Assimilation), that integrates multiple sources of 'omics data and estimates regulatory network models. This approach was initially implemented in the C++ programming language and has since been applied to a number… ▽ More

    Submitted 24 July, 2015; originally announced July 2015.

  11. arXiv:1505.06440  [pdf, other

    q-bio.MN

    Estimating sample-specific regulatory networks

    Authors: Marieke Lydia Kuijjer, Matthew Tung, GuoCheng Yuan, John Quackenbush, Kimberly Glass

    Abstract: Biological systems are driven by intricate interactions among the complex array of molecules that comprise the cell. Many methods have been developed to reconstruct network models of those interactions. These methods often draw on large numbers of samples with measured gene expression profiles to infer connections between genes (or gene products). The result is an aggregate network model represent… ▽ More

    Submitted 28 June, 2018; v1 submitted 24 May, 2015; originally announced May 2015.