Showing 1–6 of 6 results for author: Baker, J

  1. Testing unit root non-stationarity in the presence of missing data in univariate time series of mobile health studies

    Authors: Charlotte Fowler, Xiaoxuan Cai, Justin T. Baker, Jukka-Pekka Onnela, Linda Valeri

    Abstract: The use of digital devices to collect data in mobile health (mHealth) studies introduces a novel application of time series methods, with the constraint of potential data missing at random (MAR) or missing not at random (MNAR). In time series analysis, testing for stationarity is an important preliminary step to inform appropriate later analyses. The augmented Dickey-Fuller (ADF) test was develope…

    Submitted 10 October, 2022; originally announced October 2022.

  2. arXiv:2206.14343

    stat.ME

    State space model multiple imputation for missing data in non-stationary multivariate time series with application in digital Psychiatry

    Authors: Xiaoxuan Cai, Xinru Wang, Li Zeng, Habiballah Rahimi Eichi, Dost Ongur, Lisa Dixon, Justin T. Baker, Jukka-Pekka Onnela, Linda Valeri

    Abstract: Mobile technology enables unprecedented continuous monitoring of an individual's behavior, social interactions, symptoms, and other health conditions, presenting an enormous opportunity for therapeutic advancements and scientific discoveries regarding the etiology of psychiatric illness. Continuous collection of mobile data results in the generation of a new type of data: entangled multivariate ti…

    Submitted 12 April, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

  3. arXiv:1806.07137

    stat.CO cs.LG stat.ML

    Large-Scale Stochastic Sampling from the Probability Simplex

    Authors: Jack Baker, Paul Fearnhead, Emily B Fox, Christopher Nemeth

    Abstract: Stochastic gradient Markov chain Monte Carlo (SGMCMC) has become a popular method for scalable Bayesian inference. These methods are based on sampling a discrete-time approximation to a continuous time process, such as the Langevin diffusion. When applied to distributions defined on a constrained space the time-discretization error can dominate when we are near the boundary of the space. We demons…

    Submitted 26 October, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted to Advances in Neural Information Processing Systems (2018)

  4. arXiv:1710.00578

    stat.CO stat.AP stat.ML

    sgmcmc: An R Package for Stochastic Gradient Markov Chain Monte Carlo

    Authors: Jack Baker, Paul Fearnhead, Emily B. Fox, Christopher Nemeth

    Abstract: This paper introduces the R package sgmcmc, which can be used for Bayesian inference on problems with large datasets using stochastic gradient Markov chain Monte Carlo (SGMCMC). Traditional Markov chain Monte Carlo (MCMC) methods, such as Metropolis-Hastings, are known to run prohibitively slowly as the dataset size increases. SGMCMC solves this issue by only using a subset of data at each iterati…

    Submitted 13 April, 2018; v1 submitted 2 October, 2017; originally announced October 2017.

  5. arXiv:1706.05439

    stat.CO cs.LG stat.ML

    Control Variates for Stochastic Gradient MCMC

    Authors: Jack Baker, Paul Fearnhead, Emily B. Fox, Christopher Nemeth

    Abstract: It is well known that Markov chain Monte Carlo (MCMC) methods scale poorly with dataset size. A popular class of methods for solving this issue is stochastic gradient MCMC. These methods use a noisy estimate of the gradient of the log posterior, which reduces the per iteration computational cost of the algorithm. Despite this, there are a number of results suggesting that stochastic gradient Lange…

    Submitted 14 December, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

  6. arXiv:1509.01228

    astro-ph.HE physics.data-an stat.ML

    Machine Learning Model of the Swift/BAT Trigger Algorithm for Long GRB Population Studies

    Authors: Philip B Graff, Amy Y Lien, John G Baker, Takanori Sakamoto

    Abstract: To draw inferences about gamma-ray burst (GRB) source populations based on Swift observations, it is essential to understand the detection efficiency of the Swift burst alert telescope (BAT). This study considers the problem of modeling the Swift/BAT triggering algorithm for long GRBs, a computationally expensive procedure, and models it using machine learning algorithms. A large sample of simulat…

    Submitted 8 February, 2016; v1 submitted 3 September, 2015; originally announced September 2015.

    Comments: 16 pages, 18 figures, 5 tables, published by ApJ

    Journal ref: ApJ, 818, 55 (2016)