Skip to main content

Showing 1–11 of 11 results for author: Hill, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2311.07191  [pdf, other

    cs.AI cs.LG stat.AP

    Applying Large Language Models for Causal Structure Learning in Non Small Cell Lung Cancer

    Authors: Narmada Naik, Ayush Khandelwal, Mohit Joshi, Madhusudan Atre, Hollis Wright, Kavya Kannan, Scott Hill, Giridhar Mamidipudi, Ganapati Srinivasa, Carlo Bifulco, Brian Piening, Kevin Matlock

    Abstract: Causal discovery is becoming a key part in medical AI research. These methods can enhance healthcare by identifying causal links between biomarkers, demographics, treatments and outcomes. They can aid medical professionals in choosing more impactful treatments and strategies. In parallel, Large Language Models (LLMs) have shown great potential in identifying patterns and generating insights from t… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  2. arXiv:2306.01211  [pdf, other

    stat.ME stat.AP

    Priming bias versus post-treatment bias in experimental designs

    Authors: Matthew Blackwell, Jacob R. Brown, Sophie Hill, Kosuke Imai, Teppei Yamamoto

    Abstract: Conditioning on variables affected by treatment can induce post-treatment bias when estimating causal effects. Although this suggests that researchers should measure potential moderators before administering the treatment in an experiment, doing so may also bias causal effect estimation if the covariate measurement primes respondents to react differently to the treatment. This paper formally analy… ▽ More

    Submitted 28 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 32 pages (main text), 22 pages (supplementary materials), 5 figures

  3. arXiv:2204.00750  [pdf, other

    stat.ME

    Structural randomised selection

    Authors: Fan Wang, Sylvia Richardson, Steven M. Hill

    Abstract: An important problem in the analysis of high-dimensional omics data is to identify subsets of molecular variables that are associated with a phenotype of interest. This requires addressing the challenges of high dimensionality, strong multicollinearity and model uncertainty. We propose a new ensemble learning approach for improving the performance of sparse penalised regression methods, called STr… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

  4. arXiv:2008.00163  [pdf, other

    stat.ME stat.ML

    The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks

    Authors: Konstantinos Pantazis, Avanti Athreya, Jesús Arroyo, William N. Frost, Evan S. Hill, Vince Lyzinski

    Abstract: Spectral inference on multiple networks is a rapidly-develo** subfield of graph statistics. Recent work has demonstrated that joint, or simultaneous, spectral embedding of multiple independent networks can deliver more accurate estimation than individual spectral decompositions of those same networks. Such inference procedures typically rely heavily on independence assumptions across the multipl… ▽ More

    Submitted 17 June, 2021; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: 44 pages, 13 figures

    MSC Class: 62H12; 62E20; 05C80

  5. arXiv:2004.06098  [pdf

    stat.AP econ.GN

    The effect of stay-at-home orders on COVID-19 cases and fatalities in the United States

    Authors: James H. Fowler, Seth J. Hill, Remy Levin, Nick Obradovich

    Abstract: Governments issue "stay at home" orders to reduce the spread of contagious diseases, but the magnitude of such orders' effectiveness is uncertain. In the United States these orders were not coordinated at the national level during the coronavirus disease 2019 (COVID-19) pandemic, which creates an opportunity to use spatial and temporal variation to measure the policies' effect with greater accurac… ▽ More

    Submitted 7 May, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

  6. arXiv:2002.03419  [pdf, other

    q-bio.PE stat.AP

    The Alzheimer's Disease Prediction Of Longitudinal Evolution (TADPOLE) Challenge: Results after 1 Year Follow-up

    Authors: Razvan V. Marinescu, Neil P. Oxtoby, Alexandra L. Young, Esther E. Bron, Arthur W. Toga, Michael W. Weiner, Frederik Barkhof, Nick C. Fox, Arman Eshaghi, Tina Toni, Marcin Salaterski, Veronika Lunina, Manon Ansart, Stanley Durrleman, Pascal Lu, Samuel Iddi, Dan Li, Wesley K. Thompson, Michael C. Donohue, Aviv Nahon, Yarden Levy, Dan Halbersberg, Mariya Cohen, Huiling Liao, Tengfei Li , et al. (71 additional authors not shown)

    Abstract: We present the findings of "The Alzheimer's Disease Prediction Of Longitudinal Evolution" (TADPOLE) Challenge, which compared the performance of 92 algorithms from 33 international teams at predicting the future trajectory of 219 individuals at risk of Alzheimer's disease. Challenge participants were required to make a prediction, for each month of a 5-year future time period, of three key outcome… ▽ More

    Submitted 27 December, 2021; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: Presents final results of the TADPOLE competition. 60 pages, 7 tables, 14 figures

    Journal ref: Machine Learning for Biomedical Imaging (MELBA), Dec 2021

  7. High-dimensional regression in practice: an empirical study of finite-sample prediction, variable selection and ranking

    Authors: Fan Wang, Sach Mukherjee, Sylvia Richardson, Steven M. Hill

    Abstract: Penalized likelihood approaches are widely used for high-dimensional regression. Although many methods have been proposed and the associated theory is now well-developed, the relative efficacy of different approaches in finite-sample settings, as encountered in practice, remains incompletely understood. There is therefore a need for empirical investigations in this area that can offer practical in… ▽ More

    Submitted 28 January, 2020; v1 submitted 2 August, 2018; originally announced August 2018.

    Comments: This is a post-peer-review, pre-copyedit version of an article published in Statistics and Computing. The final authenticated version is available online (open access) at: http://dx.doi.org/10.1007/s11222-019-09914-9

    Journal ref: Statistics and Computing, 2019. Advance online publication

  8. arXiv:1612.05678  [pdf, other

    stat.ML

    Causal Learning via Manifold Regularization

    Authors: Steven M. Hill, Chris. J. Oates, Duncan A. Blythe, Sach Mukherjee

    Abstract: This paper frames causal structure estimation as a machine learning task. The idea is to treat indicators of causal relationships between variables as `labels' and to exploit available data on the variables of interest to provide features for the labelling task. Background scientific knowledge or any available interventional data provide labels on some causal relationships and the remainder are tr… ▽ More

    Submitted 29 August, 2019; v1 submitted 16 December, 2016; originally announced December 2016.

    Journal ref: Journal of Machine Learning Research 20(127):1-32, 2019

  9. Inferring network structure from interventional time-course experiments

    Authors: Simon E. F. Spencer, Steven M. Hill, Sach Mukherjee

    Abstract: Graphical models are widely used to study biological networks. Interventions on network nodes are an important feature of many experimental designs for the study of biological networks. In this paper we put forward a causal variant of dynamic Bayesian networks (DBNs) for the purpose of modeling time-course data with interventions. The models inherit the simplicity and computational efficiency of D… ▽ More

    Submitted 16 June, 2015; v1 submitted 29 April, 2015; originally announced April 2015.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS806 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS806

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 507-524

  10. arXiv:1301.2194  [pdf, other

    stat.ML cs.LG stat.ME

    Network-based clustering with mixtures of L1-penalized Gaussian graphical models: an empirical investigation

    Authors: Steven M. Hill, Sach Mukherjee

    Abstract: In many applications, multivariate samples may harbor previously unrecognized heterogeneity at the level of conditional independence or network structure. For example, in cancer biology, disease subtypes may differ with respect to subtype-specific interplay between molecular components. Then, both subtype discovery and estimation of subtype-specific networks present important and related challenge… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: A version of this work also appears in the first author's PhD Thesis (Sparse Graphical Models for Cancer Signalling, University of Warwick, 2012), which can be accessed at http://wrap.warwick.ac.uk/id/eprint/49626

  11. arXiv:1201.3380  [pdf, other

    stat.AP q-bio.QM

    On the relationship between ODEs and DBNs

    Authors: Chris. J. Oates, Steven. M. Hill, Sach Mukherjee

    Abstract: Recently, Li et al. (Bioinformatics 27(19), 2686-91, 2011) proposed a method, called Differential Equation-based Local Dynamic Bayesian Network (DELDBN), for reverse engineering gene regulatory networks from time-course data. We commend the authors for an interesting paper that draws attention to the close relationship between dynamic Bayesian networks (DBNs) and differential equations (DEs). Thei… ▽ More

    Submitted 2 March, 2012; v1 submitted 16 January, 2012; originally announced January 2012.