Showing 1–41 of 41 results for author: McCormick, T H

Searching in archive stat.
  1. arXiv:2406.19597  [pdf, other]

    stat.ME

    What's the Weight? Estimating Controlled Outcome Differences in Complex Surveys for Health Disparities Research

    Authors: Stephen Salerno, Emily K. Roberts, Belinda L. Needham, Tyler H. McCormick, Bhramar Mukherjee, Xu Shi

    Abstract: A basic descriptive question in statistics often asks whether there are differences in mean outcomes between groups based on levels of a discrete covariate (e.g., racial disparities in health outcomes). However, when this categorical covariate of interest is correlated with other factors related to the outcome, direct comparisons may lead to biased estimates and invalid inferential conclusions wit…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.11940  [pdf, other]

    stat.ME cs.SI econ.EM stat.ML stat.OT

    Model-Based Inference and Experimental Design for Interference Using Partial Network Data

    Authors: Steven Wilkins Reeves, Shane Lubold, Arun G. Chandrasekhar, Tyler H. McCormick

    Abstract: The stable unit treatment value assumption states that the outcome of an individual is not affected by the treatment statuses of others, however in many real world applications, treatments can have an effect on many others beyond the immediately treated. Interference can generically be thought of as mediated through some network structure. In many empirically relevant situations however, complete…

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2404.02438  [pdf, other]

    cs.CL cs.LG stat.ML

    From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives

    Authors: Shuxian Fan, Adam Visokay, Kentaro Hoffman, Stephen Salerno, Li Liu, Jeffrey T. Leek, Tyler H. McCormick

    Abstract: In settings where most deaths occur outside the healthcare system, verbal autopsies (VAs) are a common tool to monitor trends in causes of death (COD). VAs are interviews with a surviving caregiver or relative that are used to predict the decedent's COD. Turning VAs into actionable insights for researchers and policymakers requires two steps (i) predicting likely COD using the VA interview and (ii…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 7 figures

  4. arXiv:2404.02141  [pdf, other]

    stat.ME cs.LG econ.EM stat.CO stat.ML

    Robustly estimating heterogeneity in factorial data using Rashomon Partitions

    Authors: Aparajithan Venkateswaran, Anirudh Sankar, Arun G. Chandrasekhar, Tyler H. McCormick

    Abstract: Many statistical analyses, in both observational data and randomized control trials, ask: how does the outcome of interest vary with combinations of observable covariates? How do various drug combinations affect health outcomes, or how does technology adoption depend on incentives and demographics? Our goal is to partition this factorial space into "pools" of covariate combinations where the outco…

    Submitted 25 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  5. arXiv:2403.12288  [pdf, ps, other]

    stat.AP

    Bayesian analysis of verbal autopsy data using factor models with age- and sex-dependent associations between symptoms

    Authors: Tsuyoshi Kunihama, Zehang Richard Li, Samuel J. Clark, Tyler H. McCormick

    Abstract: Verbal autopsies (VAs) are extensively used to investigate the population-level distributions of deaths by cause in low-resource settings without well-organized vital statistics systems. Computer-based methods are often adopted to assign causes of death to deceased individuals based on the interview responses of their family members or caregivers. In this article, we develop a new Bayesian approac…

    Submitted 18 March, 2024; originally announced March 2024.

  6. arXiv:2403.05704  [pdf, other]

    econ.EM cs.SI stat.AP stat.ME

    Non-robustness of diffusion estimates on networks with measurement error

    Authors: Arun G. Chandrasekhar, Paul Goldsmith-Pinkham, Tyler H. McCormick, Samuel Thau, Jerry Wei

    Abstract: Network diffusion models are used to study things like disease transmission, information spread, and technology adoption. However, small amounts of mismeasurement are extremely likely in the networks constructed to operationalize these models. We show that estimates of diffusions are highly non-robust to this measurement error. First, we show that even when measurement error is vanishingly small,…

    Submitted 11 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  7. arXiv:2401.08702  [pdf, other]

    stat.ME cs.LG

    Do We Really Even Need Data?

    Authors: Kentaro Hoffman, Stephen Salerno, Awan Afiaz, Jeffrey T. Leek, Tyler H. McCormick

    Abstract: As artificial intelligence and machine learning tools become more accessible, and scientists face new obstacles to data collection (e.g. rising costs, declining survey response rates), researchers increasingly use predictions from pre-trained algorithms as outcome variables. Though appealing for financial and logistical reasons, using standard tools for inference can misrepresent the association b…

    Submitted 2 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  8. arXiv:2312.05718  [pdf, other]

    stat.ME econ.GN q-bio.PE stat.AP

    Feasible contact tracing

    Authors: Aparajithan Venkateswaran, Jishnu Das, Tyler H. McCormick

    Abstract: Contact tracing is one of the most important tools for preventing the spread of infectious diseases, but as the experience of COVID-19 showed, it is also next-to-impossible to implement when the disease is spreading rapidly. We show how to substantially improve the efficiency of contact tracing by combining standard microeconomic tools that measure heterogeneity in how infectious a sick person is…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: main paper 37 pages, 4 figures; supplementary 32 pages, 13 figures

  9. arXiv:2309.16160  [pdf, other]

    stat.AP

    Respondent-Driven Sampling: An Overview in the Context of Human Trafficking

    Authors: Jessica P. Kunke, Adam Visokay, Tyler H. McCormick

    Abstract: Respondent-driven sampling (RDS) is both a sampling strategy and an estimation method. It is commonly used to study individuals that are difficult to access with standard sampling techniques. As with any sampling strategy, RDS has advantages and challenges. This article examines recent work using RDS in the context of human trafficking. We begin with an overview of the RDS process and methodology,…

    Submitted 28 September, 2023; originally announced September 2023.

  10. arXiv:2308.12506  [pdf, other]

    stat.ME math.ST

    General Covariance-Based Conditions for Central Limit Theorems with Dependent Triangular Arrays

    Authors: Arun G. Chandrasekhar, Matthew O. Jackson, Tyler H. McCormick, Vydhourie Thiyageswaran

    Abstract: We present a general central limit theorem with simple, easy-to-check covariance-based sufficient conditions for triangular arrays of random vectors when all variables could be interdependent. The result is constructed from Stein's method, but the conditions are distinct from related work. We show that these covariance conditions nest standard assumptions studied in the literature such as $M$-depe…

    Submitted 14 December, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

  11. arXiv:2305.04381  [pdf, other]

    stat.AP

    Estimating and Correcting Degree Ratio Bias in the Network Scale-up Method

    Authors: Ian Laga, Jessica P. Kunke, Tyler H. McCormick, Xiaoyue Niu

    Abstract: The Network Scale-up Method (NSUM) uses social networks and answers to "How many X's do you know?" questions to estimate sizes of groups excluded by standard surveys. This paper addresses the bias caused by varying average social network sizes across populations, commonly referred to as the degree ratio bias. This bias is especially important for marginalized populations like sex workers and drug…

    Submitted 25 March, 2024; v1 submitted 7 May, 2023; originally announced May 2023.

  12. arXiv:2303.07490  [pdf, other]

    stat.ME stat.AP

    Comparing the Robustness of Simple Network Scale-Up Method (NSUM) Estimators

    Authors: Jessica P. Kunke, Ian Laga, Xiaoyue Niu, Tyler H. McCormick

    Abstract: The network scale-up method (NSUM) is a cost-effective approach to estimating the size or prevalence of a group of people that is hard to reach through a standard survey. The basic NSUM involves two steps: estimating respondents' degrees by one of various methods (in this paper we focus on the probe group method which uses the number of people a respondent knows in various groups of known size), a…

    Submitted 17 January, 2024; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Main paper 29 pages, 3 figures, 2 tables; supplement 14 pages, 5 figures

  13. arXiv:2302.11058  [pdf, other]

    stat.AP stat.ME

    Bayesian Age Category Reconciliation for Age- and Cause-specific Under-five Mortality Estimates

    Authors: Shuxian Fan, Li Liu, Jamie Perin, Tyler H. McCormick

    Abstract: Age-disaggregated health data is crucial for effective public health planning and monitoring. Monitoring under-five mortality, for example, requires highly detailed age data since the distribution of potential causes of death varies substantially within the first few years of life. Comparative researchers often have to rely on multiple data sources yet, these sources often have ages aggregated at…

    Submitted 21 February, 2023; originally announced February 2023.

  14. arXiv:2210.15081  [pdf, other]

    stat.ME cs.LG stat.CO stat.ML

    Bayesian Hyperbolic Multidimensional Scaling

    Authors: Bolun Liu, Shane Lubold, Adrian E. Raftery, Tyler H. McCormick

    Abstract: Multidimensional scaling (MDS) is a widely used approach to representing high-dimensional, dependent data. MDS works by assigning each observation a location on a low-dimensional geometric manifold, with distance on the manifold representing similarity. We propose a Bayesian approach to multidimensional scaling when the low-dimensional manifold is hyperbolic. Using hyperbolic space facilitates rep…

    Submitted 15 August, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

  15. arXiv:2109.08244  [pdf, other]

    stat.AP

    The openVA Toolkit for Verbal Autopsies

    Authors: Zehang Richard Li, Jason Thomas, Eungang Choi, Tyler H. McCormick, Samuel J. Clark

    Abstract: Verbal autopsy (VA) is a survey-based tool widely used to infer cause of death (COD) in regions without complete-coverage civil registration and vital statistics systems. In such settings, many deaths happen outside of medical facilities and are not officially documented by a medical professional. VA surveys, consisting of signs and symptoms reported by a person close to the decedent, are used to…

    Submitted 1 October, 2022; v1 submitted 16 September, 2021; originally announced September 2021.

  16. arXiv:2106.09702  [pdf, other]

    stat.ME cs.LG cs.SI stat.AP stat.ML

    Spectral goodness-of-fit tests for complete and partial network data

    Authors: Shane Lubold, Bolun Liu, Tyler H. McCormick

    Abstract: Networks describe the, often complex, relationships between individual actors. In this work, we address the question of how to determine whether a parametric model, such as a stochastic block model or latent space model, fits a dataset well and will extrapolate to similar data. We use recent results in random matrix theory to derive a general goodness-of-fit test for dyadic data. We show that our…

    Submitted 17 June, 2021; originally announced June 2021.

  17. arXiv:2106.04271  [pdf, other]

    stat.ME cs.LG cs.SI stat.AP stat.ML

    Inference for Network Regression Models with Community Structure

    Authors: Mengjie Pan, Tyler H. McCormick, Bailey K. Fosdick

    Abstract: Network regression models, where the outcome comprises the valued edge in a network and the predictors are actor or dyad-level covariates, are used extensively in the social and biological sciences. Valid inference relies on accurately modeling the residual dependencies among the relations. Frequently homogeneity assumptions are placed on the errors which are commonly incorrect and ignore critical…

    Submitted 8 June, 2021; originally announced June 2021.

  18. arXiv:2012.10559  [pdf, other]

    stat.ME cs.SI math.GT stat.AP stat.ML

    Identifying the latent space geometry of network models through analysis of curvature

    Authors: Shane Lubold, Arun G. Chandrasekhar, Tyler H. McCormick

    Abstract: A common approach to modeling networks assigns each node to a position on a low-dimensional manifold where distance is inversely proportional to connection likelihood. More positive manifold curvature encourages more and tighter communities; negative curvature induces repulsion. We consistently estimate manifold type, dimension, and curvature from simply connected, complete Riemannian manifolds of…

    Submitted 30 December, 2022; v1 submitted 18 December, 2020; originally announced December 2020.

  19. arXiv:2003.00401  [pdf, other]

    stat.AP

    A flexible Bayesian framework to estimate age- and cause-specific child mortality over time from sample registration data

    Authors: Austin E Schumacher, Tyler H McCormick, Jon Wakefield, Yue Chu, Jamie Perin, Francisco Villavicencio, Noah Simon, Li Liu

    Abstract: In order to implement disease-specific interventions in young age groups, policy makers in low- and middle-income countries require timely and accurate estimates of age- and cause-specific child mortality. High quality data is not available in settings where these interventions are most needed, but there is a push to create sample registration systems that collect detailed mortality information. C…

    Submitted 18 May, 2021; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: 16 pages, 4 figures, submitted to The Annals of Applied Statistics

    MSC Class: 62P99

  20. arXiv:1911.05522  [pdf, other]

    stat.ME cs.CR cs.SI stat.AP stat.ML

    Anomaly Detection in Large Scale Networks with Latent Space Models

    Authors: Wesley Lee, Tyler H. McCormick, Joshua Neil, Cole Sodja, Yanran Cui

    Abstract: We develop a real-time anomaly detection algorithm for directed activity on large, sparse networks. We model the propensity for future activity using a dynamic logistic model with interaction terms for sender- and receiver-specific latent factors in addition to sender- and receiver-specific popularity scores; deviations from this underlying model constitute potential anomalies. Latent nodal attrib…

    Submitted 29 January, 2021; v1 submitted 13 November, 2019; originally announced November 2019.

  21. arXiv:1908.09881  [pdf, other]

    stat.ME cs.SI stat.AP

    Consistently estimating network statistics using Aggregated Relational Data

    Authors: Emily Breza, Arun G. Chandrasekhar, Shane Lubold, Tyler H. McCormick, Mengjie Pan

    Abstract: Collecting complete network data is expensive, time-consuming, and often infeasible. Aggregated Relational Data (ARD), which capture information about a social network by asking a respondent questions of the form ``How many people with trait X do you know?'' provide a low-cost option when collecting complete network data is not possible. Rather than asking about connections between each pair of in…

    Submitted 21 October, 2022; v1 submitted 26 August, 2019; originally announced August 2019.

  22. arXiv:1904.00136  [pdf, other]

    stat.ME

    Estimating spillovers using imprecisely measured networks

    Authors: Morgan Hardy, Rachel M. Heath, Wesley Lee, Tyler H. McCormick

    Abstract: In many experimental contexts, whether and how network interactions impact the outcome of interest for both treated and untreated individuals are key concerns. Networks data is often assumed to perfectly represent these possible interactions. This paper considers the problem of estimating treatment effects when measured connections are, instead, a noisy representation of the true spillover pathway…

    Submitted 8 March, 2024; v1 submitted 29 March, 2019; originally announced April 2019.

  23. arXiv:1807.06063  [pdf, other]

    stat.AP

    Modeling the social media relationships of Irish politicians using a generalized latent space stochastic blockmodel

    Authors: Tin Lok James Ng, Thomas Brendan Murphy, Ted Westling, Tyler H. McCormick, Bailey K. Fosdick

    Abstract: Dáil Éireann is the principal chamber of the Irish parliament. The 31st Dáil was in session from March 11th, 2011 to February 6th, 2016. Many of the members of the Dáil were active on social media and many were Twitter users who followed other members of the Dáil. The pattern of following amongst these politicians provides ins…

    Submitted 13 December, 2020; v1 submitted 16 July, 2018; originally announced July 2018.

    Comments: 31 pages, 9 figures

    MSC Class: 62Pxx

  24. arXiv:1805.07051  [pdf, other]

    stat.ML cs.LG

    Bayesian Joint Spike-and-Slab Graphical Lasso

    Authors: Zehang Richard Li, Tyler H. McCormick, Samuel J. Clark

    Abstract: In this article, we propose a new class of priors for Bayesian inference with multiple Gaussian graphical models. We introduce fully Bayesian treatments of two popular procedures, the group graphical lasso and the fused graphical lasso, and extend them to a continuous spike-and-slab framework to allow self-adaptive shrinkage and model selection simultaneously. We develop an EM algorithm that perfo…

    Submitted 9 May, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

  25. arXiv:1803.07141  [pdf, other]

    stat.AP stat.OT

    Quantifying the Contributions of Training Data and Algorithm Logic to the Performance of Automated Cause-assignment Algorithms for Verbal Autopsy

    Authors: Samuel J. Clark, Zehang Li, Tyler H. McCormick

    Abstract: A verbal autopsy (VA) consists of a survey with a relative or close contact of a person who has recently died. VA surveys are commonly used to infer likely causes of death for individuals when deaths happen outside of hospitals or healthcare facilities. Several statistical and algorithmic methods are available to assign cause of death using VA surveys. Each of these methods require as inputs some…

    Submitted 15 November, 2018; v1 submitted 6 March, 2018; originally announced March 2018.

    Comments: This version implements Tariff with an additional normalization step that was previously ignored in the package

  26. arXiv:1803.01327  [pdf, ps, other]

    stat.AP

    Bayesian factor models for probabilistic cause of death assessment with verbal autopsies

    Authors: Tsuyoshi Kunihama, Zehang Richard Li, Samuel J. Clark, Tyler H. McCormick

    Abstract: The distribution of deaths by cause provides crucial information for public health planning, response, and evaluation. About 60% of deaths globally are not registered or given a cause, limiting our ability to understand disease epidemiology. Verbal autopsy (VA) surveys are increasingly used in such settings to collect information on the signs, symptoms, and medical history of people who have recen…

    Submitted 26 November, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

  27. arXiv:1711.00877  [pdf, other]

    stat.AP

    Using Bayesian latent Gaussian graphical models to infer symptom associations in verbal autopsies

    Authors: Zehang Richard Li, Tyler H. McCormick, Samuel J. Clark

    Abstract: Learning dependence relationships among variables of mixed types provides insights in a variety of scientific settings and is a well-studied problem in statistics. Existing methods, however, typically rely on copious, high quality data to accurately learn associations. In this paper, we develop a method for scientific settings where learning dependence structure is essential, but data are sparse a…

    Submitted 24 July, 2019; v1 submitted 2 November, 2017; originally announced November 2017.

  28. arXiv:1709.06970  [pdf, other]

    stat.ML

    An Expectation Conditional Maximization approach for Gaussian graphical models

    Authors: Zehang Richard Li, Tyler H. McCormick

    Abstract: Bayesian graphical models are a useful tool for understanding dependence relationships among many variables, particularly in situations with external prior information. In high-dimensional settings, the space of possible graphs becomes enormous, rendering even state-of-the-art Bayesian stochastic search computationally infeasible. We propose a deterministic alternative to estimate Gaussian and Gau…

    Submitted 6 February, 2019; v1 submitted 20 September, 2017; originally announced September 2017.

  29. arXiv:1703.04157  [pdf, other]

    stat.ME

    Using Aggregated Relational Data to feasibly identify network structure without network data

    Authors: Emily Breza, Arun G. Chandrasekhar, Tyler H. McCormick, Mengjie Pan

    Abstract: Social network data is often prohibitively expensive to collect, limiting empirical network research. Typical economic network mapping requires (1) enumerating a census, (2) eliciting the names of all network links for each individual, (3) matching the list of social connections to the census, and (4) repeating (1)-(3) across many networks. In settings requiring field surveys, steps (2)-(3) can be…

    Submitted 2 August, 2018; v1 submitted 12 March, 2017; originally announced March 2017.

  30. arXiv:1701.05530  [pdf, other]

    stat.ME cs.SI stat.AP

    Regression of exchangeable relational arrays

    Authors: Frank W. Marrs, Bailey K. Fosdick, Tyler H. McCormick

    Abstract: Relational arrays represent measures of association between pairs of actors, often in varied contexts or over time. Trade flows between countries, financial transactions between individuals, contact frequencies between school children in classrooms, and dynamic protein-protein interactions are all examples of relational arrays. Elements of a relational array are often modeled as a linear function…

    Submitted 22 June, 2022; v1 submitted 19 January, 2017; originally announced January 2017.

    Comments: Accepted at Biometrika

  31. arXiv:1609.02629  [pdf, other]

    stat.ME stat.AP

    Inferring social structure from continuous-time interaction data

    Authors: Wesley Lee, Bailey K. Fosdick, Tyler H. McCormick

    Abstract: Relational event data, which consist of events involving pairs of actors over time, are now commonly available at the finest of temporal resolutions. Existing continuous-time methods for modeling such data are based on point processes and directly model interaction "contagion," whereby one interaction increases the propensity of future interactions among actors, often as dictated by some latent va…

    Submitted 15 January, 2018; v1 submitted 8 September, 2016; originally announced September 2016.

    Journal ref: Applied Stochastic Models in Business and Industry 2018, Vol. 34, No. 2, 87-104

  32. arXiv:1608.07618  [pdf, other]

    stat.ME stat.AP

    Multiresolution network models

    Authors: Bailey K. Fosdick, Tyler H. McCormick, Thomas Brendan Murphy, Tin Lok James Ng, Ted Westling

    Abstract: Many existing statistical and machine learning tools for social network analysis focus on a single level of analysis. Methods designed for clustering optimize a global partition of the graph, whereas projection based approaches (e.g. the latent space model in the statistics literature) represent in rich detail the roles of individuals. Many pertinent questions in sociology and economics, however,…

    Submitted 5 July, 2018; v1 submitted 26 August, 2016; originally announced August 2016.

  33. arXiv:1511.01644  [pdf, ps, other]

    stat.AP cs.LG stat.ML

    Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model

    Authors: Benjamin Letham, Cynthia Rudin, Tyler H. McCormick, David Madigan

    Abstract: We aim to produce predictive models that are not only accurate, but are also interpretable to human experts. Our models are decision lists, which consist of a series of if...then... statements (e.g., if high blood pressure, then stroke) that discretize a high-dimensional, multivariate feature space into a series of simple, readily interpretable decision statements. We introduce a generative model…

    Submitted 5 November, 2015; originally announced November 2015.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS848 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS848

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 3, 1350-1371

  34. arXiv:1510.08151  [pdf, other]

    stat.ME

    Beyond prediction: A framework for inference with variational approximations in mixture models

    Authors: Ted Westling, Tyler H. McCormick

    Abstract: Variational inference is a popular method for estimating model parameters and conditional distributions in hierarchical and mixed models, which arise frequently in many settings in the health, social, and biological sciences. Variational inference in a frequentist context works by approximating intractable conditional distributions with a tractable family and optimizing the resulting lower bound o…

    Submitted 9 January, 2019; v1 submitted 27 October, 2015; originally announced October 2015.

  35. Reactive point processes: A new approach to predicting power failures in underground electrical systems

    Authors: Şeyda Ertekin, Cynthia Rudin, Tyler H. McCormick

    Abstract: Reactive point processes (RPPs) are a new statistical model designed for predicting discrete events in time based on past history. RPPs were developed to handle an important problem within the domain of electrical grid reliability: short-term prediction of electrical grid failures ("manhole events"), including outages, fires, explosions and smoking manholes, which can cause threats to public safet…

    Submitted 28 May, 2015; originally announced May 2015.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS789 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS789

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 122-144

  36. arXiv:1504.06964  [pdf, other]

    stat.ME stat.AP stat.ML

    Modeling Recovery Curves With Application to Prostatectomy

    Authors: Fulton Wang, Tyler H. McCormick, Cynthia Rudin, John Gore

    Abstract: We propose a Bayesian model that predicts recovery curves based on information available before the disruptive event. A recovery curve of interest is the quantified sexual function of prostate cancer patients after prostatectomy surgery. We illustrate the utility of our model as a pre-treatment medical decision aid, producing personalized predictions that are both interpretable and accurate. We un…

    Submitted 4 March, 2018; v1 submitted 27 April, 2015; originally announced April 2015.

    Comments: Accepted to Biostatistics, 2018. Includes supplementary material and high resolution images of predictions for patients

  37. arXiv:1411.3042  [pdf, other]

    stat.AP

    Probabilistic Cause-of-death Assignment using Verbal Autopsies

    Authors: Tyler H. McCormick, Zehang Li, Clara Calvert, Amelia C. Crampin, Kathleen Kahn, Samuel J. Clark

    Abstract: In regions without complete-coverage civil registration and vital statistics systems there is uncertainty about even the most basic demographic indicators. In such areas the majority of deaths occur outside hospitals and are not recorded. Worldwide, fewer than one-third of deaths are assigned a cause, with the least information available from the most impoverished nations. In populations like this…

    Submitted 21 September, 2015; v1 submitted 11 November, 2014; originally announced November 2014.

  38. arXiv:1401.5343  [pdf, ps, other]

    stat.AP stat.ME

    Clustering South African households based on their asset status using latent variable models

    Authors: Damien McParland, Isobel Claire Gormley, Tyler H. McCormick, Samuel J. Clark, Chodziwadziwa Whiteson Kabudula, Mark A. Collinson

    Abstract: The Agincourt Health and Demographic Surveillance System has since 2001 conducted a biannual household asset survey in order to quantify household socio-economic status (SES) in a rural population living in northeast South Africa. The survey contains binary, ordinal and nominal items. In the absence of income or expenditure data, the SES landscape in the study population is explored and described…

    Submitted 31 July, 2014; v1 submitted 21 January, 2014; originally announced January 2014.

    Comments: Published in at http://dx.doi.org/10.1214/14-AOAS726 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS726

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 2, 747-776

  39. Estimating population size using the network scale up method

    Authors: Rachael Maltiel, Adrian E. Raftery, Tyler H. McCormick, Aaron J. Baraff

    Abstract: We develop methods for estimating the size of hard-to-reach populations from data collected using network-based questions on standard surveys. Such data arise by asking respondents how many people they know in a specific group (e.g., people named Michael, intravenous drug users). The Network Scale up Method (NSUM) is a tool for producing population size estimates using these indirect measures of r…

    Submitted 5 November, 2015; v1 submitted 4 June, 2013; originally announced June 2013.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS827 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS827

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 3, 1247-1277

  40. Latent demographic profile estimation in hard-to-reach groups

    Authors: Tyler H. McCormick, Tian Zheng

    Abstract: The sampling frame in most social science surveys excludes members of certain groups, known as hard-to-reach groups. These groups, or subpopulations, may be difficult to access (the homeless, e.g.), camouflaged by stigma (individuals with HIV/AIDS), or both (commercial sex workers). Even basic demographic information about these groups is typically unknown, especially in many developing nations. W…

    Submitted 11 January, 2013; originally announced January 2013.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOAS569 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS569

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 4, 1795-1813

  41. Bayesian hierarchical rule modeling for predicting medical conditions

    Authors: Tyler H. McCormick, Cynthia Rudin, David Madigan

    Abstract: We propose a statistical modeling technique, called the Hierarchical Association Rule Model (HARM), that predicts a patient's possible future medical conditions given the patient's current and past history of reported conditions. The core of our technique is a Bayesian hierarchical model for selecting predictive association rules (such as "condition 1 and condition 2 $\rightarrow$ condition 3") fr…

    Submitted 28 June, 2012; originally announced June 2012.

    Comments: Published in at http://dx.doi.org/10.1214/11-AOAS522 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS522

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 2, 652-668