Showing 1–23 of 23 results for author: Roy, J

Searching in archive stat.
  1. arXiv:2211.16393  [pdf, other]

    stat.ME cs.LG stat.AP stat.ML

    Bayesian Semiparametric Model for Sequential Treatment Decisions with Informative Timing

    Authors: Arman Oganisian, Kelly D. Getz, Todd A. Alonzo, Richard Aplenc, Jason A. Roy

    Abstract: We develop a Bayesian semi-parametric model for estimating the impact of dynamic treatment rules on survival among patients diagnosed with pediatric acute myeloid leukemia (AML). The data consist of a subset of patients enrolled in the phase III AAML1031 clinical trial in which patients move through a sequence of four treatment courses. At each course, they undergo treatment that may or may no…

    Submitted 29 November, 2022; originally announced November 2022.

  2. arXiv:2208.13382  [pdf, other]

    stat.ME stat.AP stat.OT

    A Bayesian nonparametric approach for causal inference with multiple mediators

    Authors: Samrat Roy, Michael J. Daniels, Brendan J. Kelly, Jason Roy

    Abstract: Mediation analysis with contemporaneously observed multiple mediators is an important area of causal inference. Recent approaches for multiple mediators are often based on parametric models and thus may suffer from model misspecification. Also, much of the existing literature either only allows estimation of the joint mediation effect or estimates the joint mediation effect as the sum of individua…

    Submitted 29 August, 2022; originally announced August 2022.

    ACM Class: G.3

  3. arXiv:2110.10266  [pdf, other]

    stat.ME

    Addressing Positivity Violations in Causal Effect Estimation using Gaussian Process Priors

    Authors: Yaqian Zhu, Nandita Mitra, Jason Roy

    Abstract: In observational studies, causal inference relies on several key identifying assumptions. One identifiability condition is the positivity assumption, which requires the probability of treatment be bounded away from 0 and 1. That is, for every covariate combination, it should be possible to observe both treated and control subjects, i.e., the covariate distributions should overlap between treatment…

    Submitted 17 February, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

  4. Hierarchical Bayesian Bootstrap for Heterogeneous Treatment Effect Estimation

    Authors: Arman Oganisian, Nandita Mitra, Jason Roy

    Abstract: A major focus of causal inference is the estimation of heterogeneous average treatment effects (HTE) - average treatment effects within strata of another variable of interest such as levels of a biomarker, education, or age strata. Inference involves estimating a stratum-specific regression and integrating it over the distribution of confounders in that stratum - which itself must be estimated. St…

    Submitted 4 January, 2023; v1 submitted 22 September, 2020; originally announced September 2020.

    Journal ref: The International Journal of Biostatistics, 2022

  5. arXiv:2006.13258  [pdf, other]

    cs.LG cs.AI stat.ML

    Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

    Authors: Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai

    Abstract: Adversarial Imitation Learning alternates between learning a discriminator -- which tells apart expert's demonstrations from generated ones -- and a generator's policy to produce trajectories that can fool this discriminator. This alternated optimization is known to be delicate in practice since it compounds unstable adversarial training with brittle and sample-inefficient reinforcement learning.…

    Submitted 16 April, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (2020)

  6. arXiv:2006.03465  [pdf, other]

    cs.LG stat.ML

    Visual Transfer for Reinforcement Learning via Wasserstein Domain Confusion

    Authors: Josh Roy, George Konidaris

    Abstract: We introduce Wasserstein Adversarial Proximal Policy Optimization (WAPPO), a novel algorithm for visual transfer in Reinforcement Learning that explicitly learns to align the distributions of extracted features between a source and target task. WAPPO approximates and minimizes the Wasserstein-1 distance between the distributions of features from source and target domains via a novel Wasserstein Co…

    Submitted 4 June, 2020; originally announced June 2020.

  7. arXiv:2004.07375  [pdf, other]

    stat.ME stat.ML

    A Practical Introduction to Bayesian Estimation of Causal Effects: Parametric and Nonparametric Approaches

    Authors: Arman Oganisian, Jason A. Roy

    Abstract: Substantial advances in Bayesian methods for causal inference have been developed in recent years. We provide an introduction to Bayesian inference for causal effects for practicing statisticians who have some familiarity with Bayesian models and would like an overview of what it can add to causal estimation in practical settings. In the paper, we demonstrate how priors can induce shrinkage and sp…

    Submitted 21 August, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: Currently under second-round revision. This version included edits from first round

  8. arXiv:2003.00029  [pdf, other]

    stat.AP

    Estimating the impact of treatment compliance over time on smoking cessation using data from ecological momentary assessments (EMA)

    Authors: Yaoyuan Vincent Tan, Donna Coffman, Megan Piper, Jason Roy

    Abstract: The Wisconsin Smoker's Health Study (WSHS2) was a longitudinal trial conducted to compare the effectiveness of two commonly used smoking cessation treatments, varenicline and combination nicotine replacement therapy (cNRT) with the less intense standard of care, nicotine patch. The main outcome of the WSHS2 study was that all three treatments had equivalent treatment effects. However, in-depth ana…

    Submitted 28 February, 2020; originally announced March 2020.

    Comments: 26 pages, 6 figures

  9. arXiv:2002.04706  [pdf, other]

    stat.ME stat.ML

    Bayesian Nonparametric Cost-Effectiveness Analyses: Causal Estimation and Adaptive Subgroup Discovery

    Authors: Arman Oganisian, Nandita Mitra, Jason Roy

    Abstract: Cost-effectiveness analyses (CEAs) are at the center of health economic decision making. While these analyses help policy analysts and economists determine coverage, inform policy, and guide resource allocation, they are statistically challenging for several reasons. Cost and effectiveness are correlated and follow complex joint distributions which are difficult to capture parametrically. Effectiv…

    Submitted 8 September, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  10. arXiv:1912.00039  [pdf, other]

    stat.ME

    Net benefit separation and the determination curve: a probabilistic framework for cost-effectiveness estimation

    Authors: Andrew J. Spieker, Nicholas Illenberger, Jason A. Roy, Nandita Mitra

    Abstract: Considerations regarding clinical effectiveness and cost are essential in comparing the overall value of two treatments. There has been growing interest in methodology to integrate cost and effectiveness measures in order to inform policy and promote adequate resource allocation. The net monetary benefit aggregates information on differences in mean cost and clinical outcomes; the cost-effectivene…

    Submitted 2 December, 2019; v1 submitted 29 November, 2019; originally announced December 2019.

    Comments: 10 pages; 5 figures; 3 tables

  11. arXiv:1908.02269  [pdf, other]

    cs.LG cs.MA stat.ML

    Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

    Authors: Julien Roy, Paul Barde, Félix G. Harvey, Derek Nowrouzezahrai, Christopher Pal

    Abstract: In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint action space that grows exponentially with the number of agents. While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies. We argue that coordinating the agents' policies can guide t…

    Submitted 9 November, 2020; v1 submitted 6 August, 2019; originally announced August 2019.

    Comments: 23 pages, 16 figures. This revised version contains additional results and minor edits

  12. arXiv:1901.07504  [pdf, other]

    stat.AP

    Bayesian additive regression trees and the General BART model

    Authors: Yaoyuan Vincent Tan, Jason Roy

    Abstract: Bayesian additive regression trees (BART) is a flexible prediction model/machine learning approach that has gained widespread popularity in recent years. As BART becomes more mainstream, there is an increased need for a paper that walks readers through the details of BART, from what it is to why it works. This tutorial is aimed at providing such a resource. In addition to explaining the different…

    Submitted 22 January, 2019; originally announced January 2019.

  13. arXiv:1901.00908  [pdf, other]

    stat.ME

    Bayesian Longitudinal Causal Inference in the Analysis of the Public Health Impact of Pollutant Emissions

    Authors: Chanmin Kim, Corwin M Zigler, Michael J Daniels, Christine Choirat, Jason A Roy

    Abstract: Pollutant emissions from coal-burning power plants have been deemed to adversely impact ambient air quality and public health conditions. Despite the noticeable reduction in emissions and the improvement of air quality since the Clean Air Act (CAA) became the law, the public-health benefits from changes in emissions have not been widely evaluated yet. In terms of the chain of accountability (HEI A…

    Submitted 3 January, 2019; originally announced January 2019.

  14. A Bayesian Nonparametric Model for Zero-Inflated Outcomes: Prediction, Clustering, and Causal Estimation

    Authors: Arman Oganisian, Nandita Mitra, Jason Roy

    Abstract: Researchers are often interested in predicting outcomes, conducting clustering analysis to detect distinct subgroups of their data, or computing causal treatment effects. Pathological data distributions that exhibit skewness and zero-inflation complicate these tasks - requiring highly flexible, data-adaptive modeling. In this paper, we present a fully nonparametric Bayesian generative model for co…

    Submitted 9 March, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

  15. arXiv:1806.04200  [pdf, ps, other]

    stat.AP

    A semiparametric modeling approach using Bayesian Additive Regression Trees with an application to evaluate heterogeneous treatment effects

    Authors: Bret Zeldow, Vincent Lo Re III, Jason Roy

    Abstract: Bayesian Additive Regression Trees (BART) is a flexible machine learning algorithm capable of capturing nonlinearities between an outcome and covariates and interaction among covariates. We extend BART to a semiparametric regression framework in which the conditional expectation of an outcome is a function of treatment, its effect modifiers, and confounders. The confounders, not of scientific inte…

    Submitted 11 June, 2018; originally announced June 2018.

  16. arXiv:1806.02411  [pdf, other]

    stat.ME

    Outcome identification in electronic health records using predictions from an enriched Dirichlet process mixture

    Authors: Bret Zeldow, James Flory, Alisa Stephens-Shields, Marsha Raebel, Jason Roy

    Abstract: We propose a novel semiparametric model for the joint distribution of a continuous longitudinal outcome and the baseline covariates using an enriched Dirichlet process (EDP) prior. This joint model decomposes into a linear mixed model for the outcome given the covariates and marginals for the covariates. The nonparametric EDP prior is placed on the regression and spline coefficients, the error var…

    Submitted 6 June, 2018; originally announced June 2018.

  17. arXiv:1705.08742  [pdf, other]

    stat.ME

    A causal approach to analysis of censored medical costs in the presence of time-varying treatment

    Authors: Andrew J. Spieker, Arman Oganisian, Emily M. Ko, Jason A. Roy, Nandita Mitra

    Abstract: There has recently been a growing interest in the development of statistical methods to compare medical costs between treatment groups. When cumulative cost is the outcome of interest, right-censoring poses the challenge of informative missingness due to heterogeneity in the rates of cost accumulation across subjects. Existing approaches seeking to address the challenge of informative cost traject…

    Submitted 24 May, 2017; originally announced May 2017.

  18. arXiv:1702.08496  [pdf, other]

    stat.ME

    Bayesian nonparametric generative models for causal inference with missing at random covariates

    Authors: Jason Roy, Kirsten J Lum, Michael J. Daniels, Bret Zeldow, Jordan Dworkin, Vincent Lo Re III

    Abstract: We propose a general Bayesian nonparametric (BNP) approach to causal inference in the point treatment setting. The joint distribution of the observed data (outcome, treatment, and confounders) is modeled using an enriched Dirichlet process. The combination of the observed data model and causal assumptions allows us to identify any type of causal effect - differences, ratios, or quantile effects, e…

    Submitted 27 February, 2017; originally announced February 2017.

  19. arXiv:1701.04858  [pdf]

    stat.AP stat.CO

    Mixed Effects Models are Sometimes Terrible

    Authors: Christopher Eager, Joseph Roy

    Abstract: Mixed-effects models have emerged as the gold standard of statistical analysis in different sub-fields of linguistics (Baayen, Davidson & Bates, 2008; Johnson, 2009; Barr, et al, 2013; Gries, 2015). One problematic feature of these models is their failure to converge under maximal (or even near-maximal) random effects structures. The lack of convergence is relatively unaddressed in linguistics and…

    Submitted 5 January, 2017; originally announced January 2017.

    Comments: Write up for poster presented at Linguistic Society of America 2017: Eager, Christopher and Joseph Roy. Mixed Effects are Sometimes Terrible. Linguistic Society of America, Poster (January 5-8, 2017)

  20. Confronting Quasi-Separation in Logistic Mixed Effects for Linguistic Data: A Bayesian Approach

    Authors: Amelia Kimball, Kailen Shantz, Christopher Eager, Joseph Roy

    Abstract: Mixed effects regression models are widely used by language researchers. However, these regressions are implemented with an algorithm which may not converge on a solution. While convergence issues in linear mixed effects models can often be addressed with careful experiment design and model building, logistic mixed effects models introduce the possibility of separation or quasi-separation, which c…

    Submitted 7 September, 2018; v1 submitted 31 October, 2016; originally announced November 2016.

    Comments: Draft version of JQL accepted paper

  21. arXiv:1503.08329  [pdf, other]

    stat.ML cs.LG

    Risk Bounds for the Majority Vote: From a PAC-Bayesian Analysis to a Learning Algorithm

    Authors: Pascal Germain, Alexandre Lacasse, François Laviolette, Mario Marchand, Jean-Francis Roy

    Abstract: We propose an extensive analysis of the behavior of majority votes in binary classification. In particular, we introduce a risk bound for majority votes, called the C-bound, that takes into account the average quality of the voters and their average disagreement. We also propose an extensive PAC-Bayesian analysis that shows how the C-bound can be estimated from various observations contained in th…

    Submitted 28 July, 2015; v1 submitted 28 March, 2015; originally announced March 2015.

    Comments: Published in JMLR http://jmlr.org/papers/v16/germain15a.html

    Journal ref: Journal of Machine Learning Research 2015, vol. 16, p. 787-860

  22. arXiv:1501.03001  [pdf, other]

    stat.ML cs.LG

    On Generalizing the C-Bound to the Multiclass and Multi-label Settings

    Authors: Francois Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy

    Abstract: The C-bound, introduced in Lacasse et al., gives a tight upper bound on the risk of a binary majority vote classifier. In this work, we present a first step towards extending this work to more complex outputs, by providing generalizations of the C-bound to the multiclass and multi-label settings.

    Submitted 13 January, 2015; originally announced January 2015.

    Comments: NIPS 2014 Workshop on Representation and Learning Methods for Complex Outputs, Dec 2014, Montréal, Canada

  23. arXiv:1408.1336  [pdf, other]

    stat.ML

    On the Generalization of the C-Bound to Structured Output Ensemble Methods

    Authors: François Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy

    Abstract: This paper generalizes an important result from the PAC-Bayesian literature for binary classification to the case of ensemble methods for structured outputs. We prove a generic version of the C-bound, an upper bound over the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. This bound may advantageously $(i)$ be app…

    Submitted 15 June, 2015; v1 submitted 6 August, 2014; originally announced August 2014.