Showing 1–31 of 31 results for author: Crawford, F W

  1. arXiv:2402.15433  [pdf, other]

    stat.AP

    Mutually Exciting Point Processes for Crowdfunding Platform Dynamics

    Authors: Alexandra Djorno, Forrest W. Crawford

    Abstract: Crowdfunding is a powerful tool for individuals or organizations seeking financial support from a vast audience. Despite widespread adoption, managers often lack information about dynamics of their platforms. Hawkes processes have been used to represent self-exciting behavior in a wide variety of empirical fields, but have not been applied to crowdfunding platforms in a way that could help manager…

    Submitted 23 February, 2024; originally announced February 2024.

  2. arXiv:2306.08840  [pdf, other]

    stat.ME stat.OT

    The role of discretization scales in causal inference with continuous-time treatment

    Authors: Jinghao Sun, Forrest W. Crawford

    Abstract: There are well-established methods for identifying the causal effect of a time-varying treatment applied at discrete time points. However, in the real world, many treatments are continuous or have a finer time scale than the one used for measurement or analysis. While researchers have investigated the discrepancies between estimates under varying discretization scales using simulations and empiric…

    Submitted 15 June, 2023; originally announced June 2023.

  3. arXiv:2211.15934  [pdf, other]

    math.ST stat.ME

    Causal identification for continuous-time stochastic processes

    Authors: Jinghao Sun, Forrest W. Crawford

    Abstract: Many real-world processes are trajectories that may be regarded as continuous-time "functional data". Examples include patients' biomarker concentrations, environmental pollutant levels, and prices of stocks. Corresponding advances in data collection have yielded near continuous-time measurements, from e.g. physiological monitors, wearable digital devices, and environmental sensors. Statistical me…

    Submitted 29 November, 2022; originally announced November 2022.

  4. arXiv:2208.01208  [pdf, other]

    stat.AP cs.SI

    Communication network dynamics in a large organizational hierarchy

    Authors: Nathaniel Josephs, Sida Peng, Forrest W. Crawford

    Abstract: Most businesses impose a supervisory hierarchy on employees to facilitate management, decision-making, and collaboration, yet routine inter-employee communication patterns within workplaces tend to emerge more naturally as a consequence of both supervisory relationships and the needs of the organization. What then is the relationship between a formal organizational structure and the emergent commu…

    Submitted 11 March, 2024; v1 submitted 1 August, 2022; originally announced August 2022.

  5. arXiv:2111.09684  [pdf, other]

    stat.ME

    A sample size heuristic for network scale-up studies

    Authors: Nathaniel Josephs, Dennis M. Feehan, Forrest W. Crawford

    Abstract: The network scale-up method (NSUM) is a survey-based method for estimating the number of individuals in a hidden or hard-to-reach subgroup of a general population. In NSUM surveys, sampled individuals report how many others they know in the subpopulation of interest (e.g. "How many sex workers do you know?") and how many others they know in subpopulations of the general population (e.g. "How many…

    Submitted 18 November, 2021; originally announced November 2021.

  6. arXiv:2105.03493  [pdf, other]

    stat.ME stat.AP

    Causal identification of infectious disease intervention effects in a clustered population

    Authors: Xiaoxuan Cai, Eben Kenah, Forrest W. Crawford

    Abstract: Causal identification of treatment effects for infectious disease outcomes in interconnected populations is challenging because infection outcomes may be transmissible to others, and treatment given to one individual may affect others' outcomes. Contagion, or transmissibility of outcomes, complicates standard conceptions of treatment interference in which an intervention delivered to one individua…

    Submitted 7 May, 2021; originally announced May 2021.

  7. arXiv:2008.00127  [pdf, other]

    stat.ME stat.AP

    Dependence-robust confidence intervals for capture-recapture surveys

    Authors: Jinghao Sun, Luk Van Baelen, Els Plettinckx, Forrest W. Crawford

    Abstract: Capture-recapture (CRC) surveys are used to estimate the size of a population whose members cannot be enumerated directly. CRC surveys have been used to estimate the number of Covid-19 infections, people who use drugs, sex workers, conflict casualties, and trafficking victims. When $k$ capture samples are obtained, counts of unit captures in subsets of samples are represented naturally by a $2^k$…

    Submitted 14 October, 2022; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: To appear in the Journal of Survey Statistics and Methodology

  8. arXiv:1912.04151  [pdf, other]

    stat.AP math.ST q-bio.PE

    Identification of causal intervention effects under contagion

    Authors: Xiaoxuan Cai, Wen Wei Loh, Forrest W. Crawford

    Abstract: Defining and identifying causal intervention effects for transmissible infectious disease outcomes is challenging because a treatment -- such as a vaccine -- given to one individual may affect the infection outcomes of others. Epidemiologists have proposed causal estimands to quantify effects of interventions under contagion using a two-person partnership model. These simple conceptual models have…

    Submitted 10 December, 2019; v1 submitted 9 December, 2019; originally announced December 2019.

  9. arXiv:1905.03657  [pdf, other]

    stat.ME math.ST stat.AP

    Efficient and minimal length parametric conformal prediction regions

    Authors: Daniel J. Eck, Forrest W. Crawford

    Abstract: Conformal prediction methods construct prediction regions for iid data that are valid in finite samples. We provide two parametric conformal prediction regions that are applicable for a wide class of continuous statistical models. This class of statistical models includes generalized linear models (GLMs) with continuous outcomes. Our parametric conformal prediction regions possess finite sample…

    Submitted 25 October, 2019; v1 submitted 9 May, 2019; originally announced May 2019.

  10. arXiv:1902.01377  [pdf, other]

    stat.AP stat.ME

    Interpretation of the individual effect under treatment spillover

    Authors: Forrest W. Crawford, Olga Morozova, Ashley L. Buchanan, Donna Spiegelman

    Abstract: Some interventions may include important spillover or dissemination effects between study participants. For example, vaccines, cash transfers, and education programs may exert a causal effect on participants beyond those to whom individual treatment is assigned. In a recent paper, Buchanan et al. provide a causal definition of the "individual effect" of an intervention in networks of people who in…

    Submitted 4 February, 2019; originally announced February 2019.

  11. arXiv:1808.05593  [pdf, other]

    stat.AP math.ST q-bio.PE

    Randomization for the susceptibility effect of an infectious disease intervention

    Authors: Daniel J. Eck, Olga Morozova, Forrest W. Crawford

    Abstract: Randomized trials of infectious disease interventions, such as vaccines, often focus on groups of connected or potentially interacting individuals. When the pathogen of interest is transmissible between study subjects, interference may occur: individual infection outcomes may depend on treatments received by others. Epidemiologists have defined the primary causal effect of interest -- called the "…

    Submitted 9 December, 2019; v1 submitted 16 August, 2018; originally announced August 2018.

  12. arXiv:1808.04753  [pdf, other]

    math.ST stat.AP stat.ME

    Estimating the size of a hidden finite set: large-sample behavior of estimators

    Authors: Si Cheng, Daniel J. Eck, Forrest W. Crawford

    Abstract: A finite set is "hidden" if its elements are not directly enumerable or if its size cannot be ascertained via a deterministic query. In public health, epidemiology, demography, ecology and intelligence analysis, researchers have developed a wide variety of indirect statistical approaches, under different models for sampling and observation, for estimating the size of a hidden set. Some methods mak…

    Submitted 15 October, 2019; v1 submitted 14 August, 2018; originally announced August 2018.

  13. arXiv:1707.05884  [pdf, other]

    stat.ME

    Risk ratios for contagious outcomes

    Authors: Olga Morozova, Ted Cohen, Forrest W. Crawford

    Abstract: The risk ratio is a popular tool for summarizing the relationship between a binary covariate and outcome, even when outcomes may be dependent. Investigations of infectious disease outcomes in cohort studies of individuals embedded within clusters -- households, villages, or small groups -- often report risk ratios. Epidemiologists have warned that risk ratios may be misleading when outcomes are co…

    Submitted 18 July, 2017; originally announced July 2017.

  14. arXiv:1610.08473  [pdf, other]

    stat.ML cs.SI physics.soc-ph

    Estimating the Size of a Large Network and its Communities from a Random Sample

    Authors: Lin Chen, Amin Karbasi, Forrest W. Crawford

    Abstract: Most real-world networks are too large to be measured or studied directly and there is substantial interest in estimating global network properties from smaller sub-samples. One of the most important global properties is the number of vertices/nodes in the network. Estimating the number of vertices in a large network is a major challenge in computer science, epidemiology, demography, and intellige…

    Submitted 26 October, 2016; originally announced October 2016.

    Comments: Accepted by NIPS 2016

  15. arXiv:1608.06769  [pdf, other]

    stat.CO q-bio.PE

    Direct likelihood-based inference for discretely observed stochastic compartmental models of infectious disease

    Authors: Lam Si Tung Ho, Forrest W. Crawford, Marc A. Suchard

    Abstract: Stochastic compartmental models are important tools for understanding the course of infectious disease epidemics in populations and in prospective evaluation of intervention policies. However, calculating the likelihood for discretely observed data from even simple models -- such as the ubiquitous susceptible-infectious-removed (SIR) model -- has been considered computationally intractable, since…

    Submitted 25 July, 2018; v1 submitted 24 August, 2016; originally announced August 2016.

  16. arXiv:1603.08616  [pdf, other]

    cs.LG cs.DS cs.SI stat.ML

    Submodular Variational Inference for Network Reconstruction

    Authors: Lin Chen, Forrest W Crawford, Amin Karbasi

    Abstract: In real-world and online social networks, individuals receive and transmit information in real time. Cascading information transmissions (e.g. phone calls, text messages, social media posts) may be understood as a realization of a diffusion process operating on the network, and its branching path can be represented by a directed tree. The process only traverses and thus reveals a limited portion o…

    Submitted 10 July, 2017; v1 submitted 28 March, 2016; originally announced March 2016.

    Comments: Accepted for UAI 2017

  17. arXiv:1603.03819  [pdf, other]

    stat.CO

    Birth/birth-death processes and their computable transition probabilities with biological applications

    Authors: Lam Si Tung Ho, Jason Xu, Forrest W. Crawford, Vladimir N. Minin, Marc A. Suchard

    Abstract: Birth-death processes track the size of a univariate population, but many biological systems involve interaction between populations, necessitating models for two or more populations simultaneously. A lack of efficient methods for evaluating finite-time transition probabilities of bivariate processes, however, has restricted statistical inference in these models. Researchers rely on computationall…

    Submitted 7 August, 2017; v1 submitted 11 March, 2016; originally announced March 2016.

  18. arXiv:1602.00359  [pdf, ps, other]

    math.ST

    Confidence intervals for means under constrained dependence

    Authors: Peter M. Aronow, Forrest W. Crawford, José R. Zubizarreta

    Abstract: We develop a general framework for conducting inference on the mean of dependent random variables given constraints on their dependency graph. We establish the consistency of an oracle variance estimator of the mean when the dependency graph is known, along with an associated central limit theorem. We derive an integer linear program for finding an upper bound for the estimated variance when the g…

    Submitted 31 January, 2016; originally announced February 2016.

  19. arXiv:1511.05397  [pdf, other]

    stat.AP stat.ME

    Identification of homophily and preferential recruitment in respondent-driven sampling

    Authors: Forrest W. Crawford, Peter M. Aronow, Li Zeng, Jianghong Li

    Abstract: Respondent-driven sampling (RDS) is a link-tracing procedure for surveying hidden or hard-to-reach populations in which subjects recruit other subjects via their social network. There is significant research interest in detecting clustering or dependence of epidemiological traits in networks, but researchers disagree about whether data from RDS studies can reveal it. Two distinct mechanisms accoun…

    Submitted 17 November, 2015; originally announced November 2015.

  20. arXiv:1511.04137  [pdf, other]

    cs.SI cs.AI cs.LG

    Seeing the Unseen Network: Inferring Hidden Social Ties from Respondent-Driven Sampling

    Authors: Lin Chen, Forrest W. Crawford, Amin Karbasi

    Abstract: Learning about the social structure of hidden and hard-to-reach populations --- such as drug users and sex workers --- is a major goal of epidemiological and public health research on risk behaviors and disease prevention. Respondent-driven sampling (RDS) is a peer-referral process widely used by many health organizations, where research subjects recruit other subjects from their social network. I…

    Submitted 1 December, 2015; v1 submitted 12 November, 2015; originally announced November 2015.

    Comments: A full version with technical proofs. Accepted by AAAI-16

  21. arXiv:1504.08349  [pdf, other]

    stat.ME

    Hidden population size estimation from respondent-driven sampling: a network approach

    Authors: Forrest W. Crawford, Jiacheng Wu, Robert Heimer

    Abstract: Estimating the size of stigmatized, hidden, or hard-to-reach populations is a major problem in epidemiology, demography, and public health research. Capture-recapture and multiplier methods have become standard tools for inference of hidden population sizes, but they require independent random sampling of target population members, which is rarely possible. Respondent-driven sampling (RDS) is a su…

    Submitted 30 April, 2015; originally announced April 2015.

  22. arXiv:1504.03574  [pdf, ps, other]

    stat.ME

    Nonparametric Identification for Respondent-Driven Sampling

    Authors: Peter M. Aronow, Forrest W. Crawford

    Abstract: Respondent-driven sampling is a survey method for hidden or hard-to-reach populations in which sampled individuals recruit others in the study population via their social links. The most popular estimator for the population mean assumes that individual sampling probabilities are proportional to each subject's reported degree in a social network connecting members of the hidden population. Howe…

    Submitted 14 April, 2015; originally announced April 2015.

  23. arXiv:1406.0721  [pdf, other]

    stat.ME

    The graphical structure of respondent-driven sampling

    Authors: Forrest W. Crawford

    Abstract: Respondent-driven sampling (RDS) is a chain-referral method for sampling members of a hidden or hard-to-reach population such as sex workers, homeless people, or drug users via their social network. Most methodological work on RDS has focused on inference of population means under the assumption that subjects' network degree determines their probability of being sampled. Criticism of existing esti…

    Submitted 31 July, 2015; v1 submitted 3 June, 2014; originally announced June 2014.

  24. Sex, lies and self-reported counts: Bayesian mixture models for heaping in longitudinal count data via birth-death processes

    Authors: Forrest W. Crawford, Robert E. Weiss, Marc A. Suchard

    Abstract: Surveys often ask respondents to report nonnegative counts, but respondents may misremember or round to a nearby multiple of 5 or 10. This phenomenon is called heaping, and the error inherent in heaped self-reported numbers can bias estimation. Heaped data may be collected cross-sectionally or longitudinally and there may be covariates that complicate the inferential task. Heaping is a well-known…

    Submitted 14 September, 2015; v1 submitted 16 May, 2014; originally announced May 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS809 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS809

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 2, 572-596

  25. arXiv:1403.4223  [pdf, other]

    q-bio.PE

    On the distribution of interspecies correlation for Markov models of character evolution on Yule trees

    Authors: Willem H. Mulder, Forrest W. Crawford

    Abstract: Efforts to reconstruct phylogenetic trees and understand evolutionary processes depend fundamentally on stochastic models of speciation and mutation. The simplest continuous-time model for speciation in phylogenetic trees is the Yule process, in which new species are "born" from existing lineages at a constant rate. Recent work has illuminated some of the structural properties of Yule trees, but i…

    Submitted 15 August, 2014; v1 submitted 17 March, 2014; originally announced March 2014.

  26. arXiv:1312.1268  [pdf, other]

    stat.AP stat.ME

    Combining List Experiment and Direct Question Estimates of Sensitive Behavior Prevalence

    Authors: Peter M. Aronow, Alexander Coppock, Forrest W. Crawford, Donald P. Green

    Abstract: Survey respondents may give untruthful answers to sensitive questions when asked directly. In recent years, researchers have turned to the list experiment (also known as the item count technique) to overcome this difficulty. While list experiments may be less prone to bias than direct questioning, list experiments are also more susceptible to sampling variability. We show that researchers do not h…

    Submitted 1 June, 2014; v1 submitted 4 December, 2013; originally announced December 2013.

  27. arXiv:1305.1656  [pdf, other]

    stat.ME math.ST

    Markov counting models for correlated binary responses

    Authors: Forrest W. Crawford, Daniel Zelterman

    Abstract: We propose a class of continuous-time Markov counting processes for analyzing correlated binary data and establish a correspondence between these models and sums of exchangeable Bernoulli random variables. Our approach generalizes many previous models for correlated outcomes, admits easily interpretable parameterizations, allows different cluster sizes, and incorporates ascertainment bias in a nat…

    Submitted 26 August, 2014; v1 submitted 7 May, 2013; originally announced May 2013.

  28. arXiv:1301.1305  [pdf, other]

    stat.ME q-bio.PE

    Birth-death processes

    Authors: Forrest W. Crawford, Marc A. Suchard

    Abstract: Many important stochastic counting models can be written as general birth-death processes (BDPs). BDPs are continuous-time Markov chains on the non-negative integers and can be used to easily parameterize a rich variety of probability distributions. Although the theoretical properties of general BDPs are well understood, traditionally statistical work on BDPs has been limited to the simple linear…

    Submitted 25 July, 2014; v1 submitted 7 January, 2013; originally announced January 2013.

    Comments: This review replaces an earlier version that focused exclusively on integrals of birth-death processes

  29. arXiv:1207.5032  [pdf, other]

    q-bio.PE

    Diversity, disparity, and evolutionary rate estimation for unresolved Yule trees

    Authors: Forrest W. Crawford, Marc A. Suchard

    Abstract: The branching structure of biological evolution confers statistical dependencies on phenotypic trait values in related organisms. For this reason, comparative macroevolutionary studies usually begin with an inferred phylogeny that describes the evolutionary relationships of the organisms of interest. The probability of the observed trait data can be computed by assuming a model for trait evolution…

    Submitted 20 July, 2012; originally announced July 2012.

  30. arXiv:1111.6644  [pdf, other]

    q-bio.PE q-bio.QM

    Transition probabilities for general birth-death processes with applications in ecology, genetics, and evolution

    Authors: Forrest W. Crawford, Marc A. Suchard

    Abstract: A birth-death process is a continuous-time Markov chain that counts the number of particles in a system over time. In the general process with $n$ current particles, a new particle is born with instantaneous rate $λ_n$ and a particle dies with instantaneous rate $μ_n$. Currently no robust and efficient method exists to evaluate the finite-time transition probabilities in a general birth-death proc…

    Submitted 28 November, 2011; originally announced November 2011.

    Journal ref: J Math Biol, 65:553-580, 2012

  31. arXiv:1111.4954  [pdf, other]

    stat.ME q-bio.PE stat.CO

    Estimation for general birth-death processes

    Authors: Forrest W. Crawford, Vladimir N. Minin, Marc A. Suchard

    Abstract: Birth-death processes (BDPs) are continuous-time Markov chains that track the number of "particles" in a system over time. While widely used in population biology, genetics and ecology, statistical inference of the instantaneous particle birth and death rates remains largely limited to restrictive linear BDPs in which per-particle birth and death rates are constant. Researchers often observe the n…

    Submitted 21 November, 2011; originally announced November 2011.