Skip to main content

Showing 1–11 of 11 results for author: Agniel, D

.
  1. arXiv:2405.13591  [pdf, other

    stat.ME

    Running in circles: is practical application feasible for data fission and data thinning in post-clustering differential analysis?

    Authors: Benjamin Hivert, Denis Agniel, Rodolphe ThiƩbaut, Boris P. Hejblum

    Abstract: The standard pipeline to analyse single-cell RNA sequencing (scRNA-seq) often involves two steps : clustering and Differential Expression Analysis (DEA) to annotate cell populations based on gene expression. However, using clustering results for data-driven hypothesis formulation compromises statistical properties, especially Type I error control. Data fission was introduced to split the informati… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2402.16969  [pdf, other

    stat.ME

    Robust Evaluation of Longitudinal Surrogate Markers with Censored Data

    Authors: Denis Agniel, Layla Parast

    Abstract: The development of statistical methods to evaluate surrogate markers is an active area of research. In many clinical settings, the surrogate marker is not simply a single measurement but is instead a longitudinal trajectory of measurements over time, e.g., fasting plasma glucose measured every 6 months for 3 years. In general, available methods developed for the single-surrogate setting cannot acc… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  3. arXiv:2402.13391  [pdf, other

    stat.ME

    De-Biasing the Bias: Methods for Improving Disparity Assessments with Noisy Group Measurements

    Authors: Solvejg Wastvedt, Joshua Snoke, Denis Agniel, Julie Lai, Marc N. Elliott, Steven C. Martino

    Abstract: Health care decisions are increasingly informed by clinical decision support algorithms, but these algorithms may perpetuate or increase racial and ethnic disparities in access to and quality of health care. Further complicating the problem, clinical data often have missing or poor quality racial and ethnic information, which can lead to misleading assessments of algorithmic bias. We present novel… ▽ More

    Submitted 26 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  4. arXiv:2312.05400  [pdf, other

    stat.ME

    Generalized difference-in-differences

    Authors: Denis Agniel, Max Rubinstein, Jessie Coe, Maria DeYoreo

    Abstract: We propose a new method for estimating causal effects in longitudinal/panel data settings that we call generalized difference-in-differences. Our approach unifies two alternative approaches in these settings: ignorability estimators (e.g., synthetic controls) and difference-in-differences (DiD) estimators. We propose a new identifying assumption -- a stable bias assumption -- which generalizes the… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  5. arXiv:2210.13172  [pdf, other

    stat.ME

    Post-clustering difference testing: valid inference and practical considerations

    Authors: Benjamin Hivert, Denis Agniel, Rodolphe ThiƩbaut, Boris P Hejblum

    Abstract: Clustering is part of unsupervised analysis methods that consist in grou** samples into homogeneous and separate subgroups of observations also called clusters. To interpret the clusters, statistical hypothesis testing is often used to infer the variables that significantly separate the estimated clusters from each other. However, data-driven hypotheses are considered for the inference process,… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  6. Doubly-robust evaluation of high-dimensional surrogate markers

    Authors: Denis Agniel, Layla Parast, Boris Hejblum

    Abstract: When evaluating the effectiveness of a treatment, policy, or intervention, the desired measure of effectiveness may be expensive to collect, not routinely available, or may take a long time to occur. In these cases, it is sometimes possible to identify a surrogate outcome that can more easily/quickly/cheaply capture the effect of interest. Theory and methods for evaluating the strength of surrogat… ▽ More

    Submitted 2 December, 2020; v1 submitted 2 December, 2020; originally announced December 2020.

    Journal ref: Biostatistics-2022

  7. arXiv:1909.05813  [pdf, other

    stat.ME

    Synthetic estimation for the complier average causal effect

    Authors: Denis Agniel, Bing Han, Matthew Cefalu

    Abstract: We propose an improved estimator of the complier average causal effect (CACE). Researchers typically choose a presumably-unbiased estimator for the CACE in studies with noncompliance, when many other lower-variance estimators may be available. We propose a synthetic estimator that combines information across all available estimators, leveraging the efficiency in lower-variance estimators while mai… ▽ More

    Submitted 12 September, 2019; originally announced September 2019.

  8. arXiv:1706.03156  [pdf, other

    stat.ME

    Functional principal variance component testing for a genetic association study of HIV progression

    Authors: Denis Agniel, Wen Xie, Myron Essex, Tianxi Cai

    Abstract: HIV-1C is the most prevalent subtype of HIV-1 and accounts for over half of HIV-1 infections worldwide. Host genetic influence of HIV infection has been previously studied in HIV-1B, but little attention has been paid to the more prevalent subtype C. To understand the role of host genetics in HIV-1C disease progression, we perform a study to assess the association between longitudinally collected… ▽ More

    Submitted 9 June, 2017; originally announced June 2017.

    Comments: 20 pages, 6 figures

  9. arXiv:1612.00424  [pdf, other

    stat.ME

    Doubly robust matching estimators for high dimensional confounding adjustment

    Authors: Joseph Antonelli, Matthew Cefalu, Nathan Palmer, Denis Agniel

    Abstract: Valid estimation of treatment effects from observational data requires proper control of confounding. If the number of covariates is large relative to the number of observations, then controlling for all available covariates is infeasible. In cases where a sparsity condition holds, variable selection or penalization can reduce the dimension of the covariate space in a manner that allows for valid… ▽ More

    Submitted 10 January, 2018; v1 submitted 1 December, 2016; originally announced December 2016.

  10. arXiv:1605.02351  [pdf, other

    stat.AP q-bio.GN stat.ME

    Variance component score test for time-course gene set analysis of longitudinal RNA-seq data

    Authors: Denis Agniel, Boris P Hejblum

    Abstract: As gene expression measurement technology is shifting from microarrays to sequencing, the statistical tools available for their analysis must be adapted since RNA-seq data are measured as counts. Recently, it has been proposed to tackle the count nature of these data by modeling log-count reads per million as continuous variables, using nonparametric regression to account for their inherent hetero… ▽ More

    Submitted 6 January, 2017; v1 submitted 8 May, 2016; originally announced May 2016.

    Comments: 23 pages, 6 figures, typo corrections & acceptance acknowledgement

    MSC Class: 62P10

    Journal ref: Biostatistics-2017

  11. arXiv:1511.08074  [pdf, other

    stat.ME

    Estimation and testing for multiple regulation of multivariate mixed outcomes

    Authors: Denis Agniel, Katherine P. Liao, Tianxi Cai

    Abstract: Considerable interest has recently been focused on studying multiple phenotypes simultaneously in both epidemiological and genomic studies, either to capture the multidimensionality of complex disorders or to understand shared etiology of related disorders. We seek to identify {\em multiple regulators} or predictors that are associated with multiple outcomes when these outcomes may be measured on… ▽ More

    Submitted 25 November, 2015; originally announced November 2015.

    Comments: 25 pages, 6 figures