-
COVID Future Panel Survey: A Unique Public Dataset Documenting How U.S. Residents' Travel Related Choices Changed During the COVID-19 Pandemic
Authors:
Rishabh Singh Chauhan,
Matthew Wigginton Bhagat-Conway,
Tassio Magassy,
Nicole Corcoran,
Ehsan Rahimi,
Abbie Dirks,
Ram Pendyala,
Abolfazl Mohammadian,
Sybil Derrible,
Deborah Salon
Abstract:
The COVID-19 pandemic is an unprecedented global crisis that has impacted virtually everyone. We conducted a nationwide online longitudinal survey in the United States to collect information about the shifts in travel-related behavior and attitudes before, during, and after the pandemic. The survey asked questions about commuting, long distance travel, working from home, online learning, online shopping, pandemic experiences, attitudes, and demographic information. The survey has been deployed to the same respondents thrice to observe how the responses to the pandemic have evolved over time. The first wave of the survey was conducted from April 2020 to June 2021, the second wave from November 2020 to August 2021, and the third wave from October 2021 to November 2021. In total, 9,265 responses were collected in the first wave; of these, 2,877 respondents returned for the second wave and 2,728 for the third wave. Survey data are publicly available. This unique dataset can aid policy makers in making decisions in areas including transport, workforce development, and more. This article demonstrates the framework for conducting this online longitudinal survey. It details the step-by-step procedure involved in conducting the survey and in curating the data to make it representative of the national trends.
Submitted 10 August, 2022;
originally announced August 2022.
-
A Random Point Initialization Approach to Image Segmentation with Variational Level-sets
Authors:
J. N. Mueller,
J. N. Corcoran
Abstract:
Image segmentation is an essential component in many image processing and computer vision tasks. The primary goal of image segmentation is to simplify an image for easier analysis, and there are two broad approaches for achieving this: edge based methods, which extract the boundaries of specific known objects, and region based methods, which partition the image into regions that are statistically homogeneous. One of the more prominent edge finding methods, known as the level set method, evolves a zero-level contour in the image plane with gradient descent until the contour has converged to the object boundaries. While the classical level set method and its variants have proved successful in segmenting real images, they are susceptible to becoming stuck in noisy regions of the image plane without a priori knowledge of the image and they are unable to provide details beyond object outer boundary locations. We propose a modification to the variational level set image segmentation method that can quickly detect object boundaries by making use of random point initialization. We demonstrate the efficacy of our approach by comparing the performance of our method on real images to that of the prominent Canny Method.
Submitted 22 December, 2021;
originally announced December 2021.
-
Rare Events via Cross-Entropy Population Monte Carlo
Authors:
Caleb Miller,
Jem N. Corcoran,
Michael D. Schneider
Abstract:
We present a Cross-Entropy based population Monte Carlo algorithm. This method stands apart from previous work in that we are not optimizing a mixture distribution. Instead, we leverage deterministic mixture weights and optimize the distributions individually through a reinterpretation of the typical derivation of the cross-entropy method. Demonstrations on numerical examples show that the algorithm can outperform existing resampling population Monte Carlo methods, especially for higher-dimensional problems.
Submitted 11 October, 2021;
originally announced October 2021.
-
Coupling from the Past for the Stochastic Simulation of Chemical Reaction Networks
Authors:
J. N. Mueller,
J. N. Corcoran
Abstract:
Chemical reaction networks (CRNs) are fundamental computational models used to study the behavior of chemical reactions in well-mixed solutions. They have been used extensively to model a broad range of biological systems, and are primarily used when the more traditional model of deterministic continuous mass action kinetics is invalid due to small molecular counts. We present a perfect sampling algorithm to draw error-free samples from the stationary distributions of stochastic models for coupled, linear chemical reaction networks. The state spaces of such networks are given by all permissible combinations of molecular counts for each chemical species, and thereby grow exponentially with the numbers of species in the network. To avoid simulations involving large numbers of states, we propose a subset of chemical species such that coupling of paths started from these states guarantees coupling of paths started from all states in the state space, and we show for the well-known Reversible Michaelis-Menten model that the subset does in fact guarantee perfect draws from the stationary distribution of interest. We compare solutions computed in two ways with this algorithm to those found analytically using the chemical master equation, and we compare the distribution of coupling times for the two simulation approaches.
Submitted 11 May, 2021;
originally announced May 2021.
-
Controlled Accuracy Gibbs Sampling of Order Constrained Non-IID Ordered Random Variates
Authors:
Jem N. Corcoran,
Caleb Miller
Abstract:
Order statistics arising from $m$ independent but not identically distributed random variables are typically constructed by arranging some $X_{1}, X_{2}, \ldots, X_{m}$, with $X_{i}$ having distribution function $F_{i}(x)$, in increasing order denoted as $X_{(1)} \leq X_{(2)} \leq \ldots \leq X_{(m)}$. In this case, $X_{(i)}$ is not necessarily associated with $F_{i}(x)$. Assuming one can simulate values from each distribution, one can generate such "non-iid" order statistics by simulating $X_{i}$ from $F_{i}$, for $i=1,2,\ldots, m$, and arranging them in order. In this paper, we consider the problem of simulating ordered values $X_{(1)}, X_{(2)}, \ldots, X_{(m)}$ such that the marginal distribution of $X_{(i)}$ is $F_{i}(x)$. This problem arises in Bayesian principal components analysis (BPCA) where the $X_{i}$ are ordered eigenvalues that are a posteriori independent but not identically distributed. We propose a novel coupling-from-the-past algorithm to "perfectly" (up to computable order of accuracy) simulate such {\emph{order-constrained non-iid}} order statistics. We demonstrate the effectiveness of our approach for several examples, including the BPCA problem.
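The standard construction mentioned in the abstract (simulate each $X_{i}$ from $F_{i}$, then arrange the draws in order) can be sketched in a few lines. This is only the naive baseline, not the coupling-from-the-past algorithm the paper proposes: after sorting, $X_{(i)}$ no longer has marginal distribution $F_{i}$ in general. The function name and the three example distributions are assumptions chosen for illustration.

```python
import random


def naive_non_iid_order_statistics(samplers, rng):
    """Draw one value from each distribution, then sort the draws.

    `samplers` is a list of callables, one per F_i (hypothetical helpers);
    each takes a random.Random instance and returns a single draw.
    """
    draws = [sample(rng) for sample in samplers]
    return sorted(draws)


rng = random.Random(42)
# Three illustrative, non-identical distributions (assumed for this sketch).
samplers = [
    lambda r: r.gauss(0.0, 1.0),     # F_1: standard normal
    lambda r: r.expovariate(1.0),    # F_2: exponential with rate 1
    lambda r: r.uniform(-1.0, 3.0),  # F_3: uniform on [-1, 3]
]
ordered = naive_non_iid_order_statistics(samplers, rng)
print(ordered)  # X_(1) <= X_(2) <= X_(3)
```

The smallest value may come from any of the three distributions, which is why $X_{(i)}$ is "not necessarily associated with" $F_{i}$ under this construction.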
Submitted 12 November, 2021; v1 submitted 31 December, 2020;
originally announced December 2020.
-
Bayesian Fusion of Data Partitioned Particle Estimates
Authors:
Caleb Miller,
Michael D. Schneider,
Jem N. Corcoran,
Jason Bernstein
Abstract:
We present a Bayesian data fusion method to approximate a posterior distribution from an ensemble of particle estimates that only have access to subsets of the data. Our approach relies on approximate probabilistic inference of model parameters through Monte Carlo methods, followed by an update and resample scheme related to multiple importance sampling to combine information from the initial estimates. We show the method is convergent in the particle limit and directly suited to application on multi-sensor data fusion problems by demonstrating efficacy on a multi-sensor Keplerian orbit determination problem and a bearings-only tracking problem.
Submitted 26 October, 2020;
originally announced October 2020.
-
A Birth and Death Process for Bayesian Network Structure Inference
Authors:
D. Jennings,
J. N. Corcoran
Abstract:
Bayesian networks (BNs) are graphical models that are useful for representing high-dimensional probability distributions. There has been a great deal of interest in recent years in the NP-hard problem of learning the structure of a BN from observed data. Typically, one assigns a score to various structures and the search becomes an optimization problem that can be approached with either deterministic or stochastic methods. In this paper, we walk through the space of graphs by modeling the appearance and disappearance of edges as a birth and death process and compare our novel approach to the popular Metropolis-Hastings search strategy. We give empirical evidence that the birth and death process has superior mixing properties.
Submitted 1 October, 2016;
originally announced October 2016.
-
Particle Filtering and Smoothing Using Windowed Rejection Sampling
Authors:
J. N. Corcoran,
D. Jennings
Abstract:
"Particle methods" are sequential Monte Carlo algorithms, typically involving importance sampling, that are used to estimate and sample from joint and marginal densities from a collection of a presumably increasing number of random variables. In particular, a particle filter aims to estimate the current state $X_{n}$ of a stochastic system that is not directly observable by estimating a posterior distribution $\pi(x_{n}|y_{1},y_{2}, \ldots, y_{n})$ where the $\{Y_{n}\}$ are observations related to the $\{X_{n}\}$ through some measurement model $\pi(y_{n}|x_{n})$. A particle smoother aims to estimate a marginal distribution $\pi(x_{i}|y_{1},y_{2}, \ldots, y_{n})$ for $1 \leq i < n$. Particle methods are used extensively for hidden Markov models where $\{X_{n}\}$ is a Markov chain as well as for more general state space models.
Existing particle filtering algorithms are extremely fast and easy to implement. Although they suffer from issues of degeneracy and "sample impoverishment", steps can be taken to minimize these problems and overall they are excellent tools for inference. However, if one wishes to sample from a posterior distribution of interest, a particle filter is only able to produce dependent draws. Particle smoothing algorithms are complicated and far less robust, often requiring cumbersome post-processing, "forward-backward" recursions, and multiple passes through subroutines. In this paper we introduce an alternative algorithm for both filtering and smoothing that is based on rejection sampling "in windows". We compare both speed and accuracy of the traditional particle filter and this "windowed rejection sampler" (WRS) for several examples and show that good estimates for smoothing distributions are obtained at no extra cost.
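For orientation, the traditional particle filter that the abstract uses as its point of comparison can be sketched as a generic bootstrap filter. This is not the windowed rejection sampler from the paper. The toy linear Gaussian model, its coefficients, and the function name are all assumptions made for illustration.

```python
import math
import random


def bootstrap_particle_filter(observations, n_particles=500, seed=0):
    """Estimate the filtering means E[X_n | y_1, ..., y_n].

    Assumed toy model (not from the paper):
        X_n = 0.9 * X_{n-1} + N(0, 1),   Y_n = X_n + N(0, 1).
    """
    rng = random.Random(seed)
    particles = [rng.gauss(0.0, 1.0) for _ in range(n_particles)]
    means = []
    for y in observations:
        # Propagate each particle through the state transition.
        particles = [0.9 * x + rng.gauss(0.0, 1.0) for x in particles]
        # Importance weights from the measurement density, up to a constant.
        weights = [math.exp(-0.5 * (y - x) ** 2) for x in particles]
        total = sum(weights)
        # The weighted average approximates the filtering mean at this step.
        means.append(sum(w * x for w, x in zip(weights, particles)) / total)
        # Multinomial resampling to counter weight degeneracy.
        particles = rng.choices(particles, weights=weights, k=n_particles)
    return means


means = bootstrap_particle_filter([0.0, 1.0, 2.0])
print(means)
```

The resampling step is what makes the surviving particles dependent draws, the limitation the abstract points to when motivating a rejection-sampling alternative.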
Submitted 16 July, 2014;
originally announced July 2014.