
Showing 1–7 of 7 results for author: Tansey, W

  1. arXiv:2210.12122  [pdf, other]

    cs.LG stat.ML

    Targeted active learning for probabilistic models

    Authors: Christopher Tosh, Mauricio Tec, Wesley Tansey

    Abstract: A fundamental task in science is to design experiments that yield valuable insights about the system under study. Mathematically, these insights can be represented as a utility or risk function that shapes the value of conducting each experiment. We present PDBAL, a targeted active learning method that adaptively designs experiments to maximize scientific utility. PDBAL takes a user-specified risk…

    Submitted 21 October, 2022; originally announced October 2022.

  2. arXiv:2208.08579  [pdf, other]

    stat.ME cs.LG stat.ML

    DIET: Conditional independence testing with marginal dependence measures of residual information

    Authors: Mukund Sudarshan, Aahlad Manas Puli, Wesley Tansey, Rajesh Ranganath

    Abstract: Conditional randomization tests (CRTs) assess whether a variable $x$ is predictive of another variable $y$, having observed covariates $z$. CRTs require fitting a large number of predictive models, which is often computationally intractable. Existing solutions to reduce the cost of CRTs typically split the dataset into a train and test portion, or rely on heuristics for interactions, both of which…

    Submitted 11 April, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

  3. arXiv:2007.15835  [pdf, other]

    stat.ML cs.LG stat.ME

    Deep Direct Likelihood Knockoffs

    Authors: Mukund Sudarshan, Wesley Tansey, Rajesh Ranganath

    Abstract: Predictive modeling often uses black box machine learning methods, such as deep neural networks, to achieve state-of-the-art performance. In scientific domains, the scientist often wishes to discover which features are actually important for making the predictions. These discoveries may lead to costly follow-up experiments and as such it is important that the error rate on discoveries is not too h…

    Submitted 31 July, 2020; originally announced July 2020.

  4. arXiv:1906.04072  [pdf, other]

    stat.ML cs.LG stat.ME

    A Bayesian Model of Dose-Response for Cancer Drug Studies

    Authors: Wesley Tansey, Christopher Tosh, David M. Blei

    Abstract: Exploratory cancer drug studies test multiple tumor cell lines against multiple candidate drugs. The goal in each paired (cell line, drug) experiment is to map out the dose-response curve of the cell line as the dose level of the drug increases. We propose Bayesian Tensor Filtering (BTF), a hierarchical Bayesian model for dose-response modeling in multi-sample, multi-treatment cancer drug studies.…

    Submitted 22 March, 2021; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Extended to handle covariates; additional benchmarks comparing to related work

  5. Interpreting Black Box Models via Hypothesis Testing

    Authors: Collin Burns, Jesse Thomason, Wesley Tansey

    Abstract: In science and medicine, model interpretations may be reported as discoveries of natural phenomena or used to guide patient treatments. In such high-stakes tasks, false discoveries may lead investigators astray. These applications would therefore benefit from control over the finite-sample error rate of interpretations. We reframe black box model interpretability as a multiple hypothesis testing p…

    Submitted 17 August, 2020; v1 submitted 29 March, 2019; originally announced April 2019.

    Comments: FODS 2020

  6. arXiv:1806.03143  [pdf, other]

    stat.ML cs.LG

    Black Box FDR

    Authors: Wesley Tansey, Yixin Wang, David M. Blei, Raul Rabadan

    Abstract: Analyzing large-scale, multi-experiment studies requires scientists to test each experimental outcome for statistical significance and then assess the results as a whole. We present Black Box FDR (BB-FDR), an empirical-Bayes method for analyzing multi-experiment studies when many covariates are gathered per experiment. BB-FDR learns a series of black box predictive models to boost power and contro…

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: To appear at ICML'18; code available at https://github.com/tansey/bb-fdr

  7. arXiv:1612.00388  [pdf, other]

    stat.ML cs.LG stat.AP

    Diet2Vec: Multi-scale analysis of massive dietary data

    Authors: Wesley Tansey, Edward W. Lowe Jr., James G. Scott

    Abstract: Smart phone apps that enable users to easily track their diets have become widespread in the last decade. This has created an opportunity to discover new insights into obesity and weight loss by analyzing the eating habits of the users of such apps. In this paper, we present diet2vec: an approach to modeling latent structure in a massive database of electronic diet journals. Through an iterative c…

    Submitted 1 December, 2016; originally announced December 2016.

    Comments: Accepted to the NIPS 2016 Workshop on Machine Learning for Health