-
Spatial particle processes with coagulation: Gibbs-measure approach, gelation and Smoluchowski equation
Authors:
Luisa Andreis,
Wolfgang König,
Heide Langhammer,
Robert I. A. Patterson
Abstract:
We study a spatial Markovian particle system with pairwise coagulation, a spatial version of the Marcus--Lushnikov process: according to a coagulation kernel $K$, particle pairs merge into a single particle, and their masses are united. We introduce a statistical-mechanics approach to the study of this process. We derive an explicit formula for the empirical process of the particle configuration a…
▽ More
We study a spatial Markovian particle system with pairwise coagulation, a spatial version of the Marcus--Lushnikov process: according to a coagulation kernel $K$, particle pairs merge into a single particle, and their masses are united. We introduce a statistical-mechanics approach to the study of this process. We derive an explicit formula for the empirical process of the particle configuration at a given fixed time $T$ in terms of a reference Poisson point process, whose points are trajectories that coagulate into one particle by time $T$. The non-coagulation between any two of them induces an exponential pair-interaction, which turns the description into a many-body system with a Gibbsian pair-interaction.
Based on this, we first give a large-deviation principle for the joint distribution of the particle histories (conditioning on an upper bound for particle sizes), in the limit as the number $N$ of initial atoms diverges and the kernel scales as $\frac 1N K$. We characterise the minimiser(s) of the rate function, we give criteria for its uniqueness and prove a law of large numbers (unconditioned). Furthermore, we use the unique minimiser to construct a solution of the Smoluchowski equation and give a criterion for the occurrence of a gelation phase transition.
△ Less
Submitted 12 January, 2024;
originally announced January 2024.
-
Simulating Pedestrian Avoidance: The Humans vs Zombies Scenario
Authors:
Juan P. Oriana,
German A. Patterson,
Daniel R. Parisi
Abstract:
This study introduces a unique active matter system as an application of the pedestrian collision avoidance paradigm, that proposes dynamically adjusting the desired velocity. We present a fictitious human-zombie scenario set within a closed geometry, combining prey-predator behavior with a one-way contagion process that can transform prey into predators. The system demonstrates varied responses,…
▽ More
This study introduces a unique active matter system as an application of the pedestrian collision avoidance paradigm, that proposes dynamically adjusting the desired velocity. We present a fictitious human-zombie scenario set within a closed geometry, combining prey-predator behavior with a one-way contagion process that can transform prey into predators. The system demonstrates varied responses, in cases where agents have the same maximum speeds, a single zombie always catches a human, whereas two zombies never catch a single human. As the number of human agents increases, observables, such as the final fraction of zombie agents and total conversion times, exhibit a significant change in the system's behavior at intermediate density values. Most notably, there is evidence of a first-order phase transition when the mean population speed is analyzed as an order parameter.
△ Less
Submitted 24 December, 2023;
originally announced December 2023.
-
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
Authors:
Vincent Liu,
Prabhat Nagarajan,
Andrew Patterson,
Martha White
Abstract:
Offline reinforcement learning algorithms often require careful hyperparameter tuning. Consequently, before deployment, we need to select amongst a set of candidate policies. As yet, however, there is little understanding about the fundamental limits of this offline policy selection (OPS) problem. In this work we aim to provide clarity on when sample efficient OPS is possible, primarily by connect…
▽ More
Offline reinforcement learning algorithms often require careful hyperparameter tuning. Consequently, before deployment, we need to select amongst a set of candidate policies. As yet, however, there is little understanding about the fundamental limits of this offline policy selection (OPS) problem. In this work we aim to provide clarity on when sample efficient OPS is possible, primarily by connecting OPS to off-policy policy evaluation (OPE) and Bellman error (BE) estimation. We first show a hardness result, that in the worst case, OPS is just as hard as OPE, by proving a reduction of OPE to OPS. As a result, no OPS method can be more sample efficient than OPE in the worst case. We then propose a BE method for OPS, called Identifiable BE Selection (IBES), that has a straightforward method for selecting its own hyperparameters. We highlight that using IBES for OPS generally has more requirements than OPE methods, but if satisfied, can be more sample efficient. We conclude with an empirical study comparing OPE and IBES, and by showing the difficulty of OPS on an offline Atari benchmark dataset.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Experimental and numerical study of a second-order transition in the behavior of confined self-propelled particles
Authors:
E. Barone,
G. A. Patterson
Abstract:
In this study, we conduct experimental investigations on the behavior of confined self-propelled particles within a circular arena, employing small commercial robots capable of locomotion, communication, and information processing. These robots execute circular trajectories, which can be clockwise or counterclockwise, based on two internal states. Using a majority-based stochastic decision algorit…
▽ More
In this study, we conduct experimental investigations on the behavior of confined self-propelled particles within a circular arena, employing small commercial robots capable of locomotion, communication, and information processing. These robots execute circular trajectories, which can be clockwise or counterclockwise, based on two internal states. Using a majority-based stochastic decision algorithm, each robot can reverse its direction based on the states of two neighboring robots. By manipulating a control parameter governing the interaction, the system exhibits a transition-from a state where all robots rotate randomly to one where they rotate uniformly in the same direction. Moreover, this transition significantly impacts the trajectories of the robots. To extend our findings to larger systems, we introduce a mathematical model enabling characterization of the order transition type and the resulting trajectories. Our results reveal a second-order transition from active Brownian to chiral motion. Lastly, we analyze the particle density within the arena, examining how it varies concerning system size and the control parameter.
△ Less
Submitted 24 November, 2023;
originally announced November 2023.
-
Fundamental diagram of vibration-driven vehicles
Authors:
German A. Patterson,
Daniel R. Parisi
Abstract:
In this study, we conducted experimental investigations into the fundamental diagram of vibration-driven vehicles (VDV) in a one-dimensional array. As these mechanical agents interact solely through collisions, their mean speed remains nearly constant at low and medium densities. However, there is a reduction of between 25% and 40% when the lineal density approaches the inverse of the contact dist…
▽ More
In this study, we conducted experimental investigations into the fundamental diagram of vibration-driven vehicles (VDV) in a one-dimensional array. As these mechanical agents interact solely through collisions, their mean speed remains nearly constant at low and medium densities. However, there is a reduction of between 25% and 40% when the lineal density approaches the inverse of the contact distance. Remarkably, in this one-dimensional system, the outcome is significantly influenced by the order in which agents, sorted by their free speeds, are gradually introduced into the experiment. While a significant speed difference is observed at low and medium densities based on this ordering, both curves eventually converge to the same speed at maximum density. Moreover, the attained speed in saturated systems is slower than the speed of the slowest agent.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies
Authors:
James Paul Mason,
Alexandra Werth,
Colin G. West,
Allison A. Youngblood,
Donald L. Woodraska,
Courtney Peck,
Kevin Lacjak,
Florian G. Frick,
Moutamen Gabir,
Reema A. Alsinan,
Thomas Jacobsen,
Mohammad Alrubaie,
Kayla M. Chizmar,
Benjamin P. Lau,
Lizbeth Montoya Dominguez,
David Price,
Dylan R. Butler,
Connor J. Biron,
Nikita Feoktistov,
Kai Dewey,
N. E. Loomis,
Michal Bodzianowski,
Connor Kuybus,
Henry Dietrick,
Aubrey M. Wolfe
, et al. (977 additional authors not shown)
Abstract:
Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th…
▽ More
Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms that could explain it: nanoflares or Alfvén waves. To date, neither can be directly observed. Nanoflares are, by definition, extremely small, but their aggregate energy release could represent a substantial heating mechanism, presuming they are sufficiently abundant. One way to test this presumption is via the flare frequency distribution, which describes how often flares of various energies occur. If the slope of the power law fitting the flare frequency distribution is above a critical threshold, $α=2$ as established in prior literature, then there should be a sufficient abundance of nanoflares to explain coronal heating. We performed $>$600 case studies of solar flares, made possible by an unprecedented number of data analysts via three semesters of an undergraduate physics laboratory course. This allowed us to include two crucial, but nontrivial, analysis methods: pre-flare baseline subtraction and computation of the flare energy, which requires determining flare start and stop times. We aggregated the results of these analyses into a statistical study to determine that $α= 1.63 \pm 0.03$. This is below the critical threshold, suggesting that Alfvén waves are an important driver of coronal heating.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
Empirical Design in Reinforcement Learning
Authors:
Andrew Patterson,
Samuel Neumann,
Martha White,
Adam White
Abstract:
Empirical design in reinforcement learning is no small task. Running good experiments requires attention to detail and at times significant computational resources. While compute resources available per dollar have continued to grow rapidly, so have the scale of typical experiments in reinforcement learning. It is now common to benchmark agents with millions of parameters against dozens of tasks,…
▽ More
Empirical design in reinforcement learning is no small task. Running good experiments requires attention to detail and at times significant computational resources. While compute resources available per dollar have continued to grow rapidly, so have the scale of typical experiments in reinforcement learning. It is now common to benchmark agents with millions of parameters against dozens of tasks, each using the equivalent of 30 days of experience. The scale of these experiments often conflict with the need for proper statistical evidence, especially when comparing algorithms. Recent studies have highlighted how popular algorithms are sensitive to hyper-parameter settings and implementation details, and that common empirical practice leads to weak statistical evidence (Machado et al., 2018; Henderson et al., 2018). Here we take this one step further.
This manuscript represents both a call to action, and a comprehensive resource for how to do good experiments in reinforcement learning. In particular, we cover: the statistical assumptions underlying common performance measures, how to properly characterize performance variation and stability, hypothesis testing, special considerations for comparing multiple agents, baseline and illustrative example construction, and how to deal with hyper-parameters and experimenter bias. Throughout we highlight common mistakes found in the literature and the statistical consequences of those in example experiments. The objective of this document is to provide answers on how we can use our unprecedented compute to do good science in reinforcement learning, as well as stay alert to potential pitfalls in our empirical design.
△ Less
Submitted 3 April, 2023;
originally announced April 2023.
-
Development and Demonstration of an Efficient Readout Error Mitigation Technique for use in NISQ Algorithms
Authors:
Andrew Arrasmith,
Andrew Patterson,
Alice Boughton,
Marco Paini
Abstract:
The approximate state estimation and the closely related classical shadows methods allow for the estimation of complicated observables with relatively few shots. As these methods make use of random measurements that can symmetrise the effect of readout errors, they have been shown to permit simplified approaches to readout error mitigation which require only a number of samples that scales as…
▽ More
The approximate state estimation and the closely related classical shadows methods allow for the estimation of complicated observables with relatively few shots. As these methods make use of random measurements that can symmetrise the effect of readout errors, they have been shown to permit simplified approaches to readout error mitigation which require only a number of samples that scales as $\mathcal{O}(1)$ with increasing numbers of qubits. However, these techniques require executing a different circuit at each shot, adding a typically prohibitive amount of latency that prohibits their practical application. In this manuscript we consider the approximate state estimation of readout-mitigated expectation values, and how to best implement that procedure on the Rigetti quantum computing hardware. We discuss the theoretical aspects involved, providing an explicit computation of the effect of readout error on the estimated expectation values and how to mitigate that effect. Leveraging improvements to the Rigetti control systems, we then demonstrate an efficient implementation of this approach. Not only do we find that we can suppress the effect of correlated errors and accurately mitigate the readout errors, we find that we can do so quickly, collecting and processing $10^6$ samples in less than $1.5$ minutes. This development opens the way for practical uses of methods with this type of randomisation.
△ Less
Submitted 20 April, 2023; v1 submitted 30 March, 2023;
originally announced March 2023.
-
Robust Losses for Learning Value Functions
Authors:
Andrew Patterson,
Victor Liao,
Martha White
Abstract:
Most value function learning algorithms in reinforcement learning are based on the mean squared (projected) Bellman error. However, squared errors are known to be sensitive to outliers, both skewing the solution of the objective and resulting in high-magnitude and high-variance gradients. To control these high-magnitude updates, typical strategies in RL involve clip** gradients, clip** rewards…
▽ More
Most value function learning algorithms in reinforcement learning are based on the mean squared (projected) Bellman error. However, squared errors are known to be sensitive to outliers, both skewing the solution of the objective and resulting in high-magnitude and high-variance gradients. To control these high-magnitude updates, typical strategies in RL involve clip** gradients, clip** rewards, rescaling rewards, or clip** errors. While these strategies appear to be related to robust losses -- like the Huber loss -- they are built on semi-gradient update rules which do not minimize a known loss. In this work, we build on recent insights reformulating squared Bellman errors as a saddlepoint optimization problem and propose a saddlepoint reformulation for a Huber Bellman error and Absolute Bellman error. We start from a formalization of robust losses, then derive sound gradient-based approaches to minimize these losses in both the online off-policy prediction and control settings. We characterize the solutions of the robust losses, providing insight into the problem settings where the robust losses define notably better solutions than the mean squared Bellman error. Finally, we show that the resulting gradient-based algorithms are more stable, for both prediction and control, with less sensitivity to meta-parameters.
△ Less
Submitted 17 April, 2023; v1 submitted 17 May, 2022;
originally announced May 2022.
-
A Temporal-Difference Approach to Policy Gradient Estimation
Authors:
Samuele Tosatto,
Andrew Patterson,
Martha White,
A. Rupam Mahmood
Abstract:
The policy gradient theorem (Sutton et al., 2000) prescribes the usage of a cumulative discounted state distribution under the target policy to approximate the gradient. Most algorithms based on this theorem, in practice, break this assumption, introducing a distribution shift that can cause the convergence to poor solutions. In this paper, we propose a new approach of reconstructing the policy gr…
▽ More
The policy gradient theorem (Sutton et al., 2000) prescribes the usage of a cumulative discounted state distribution under the target policy to approximate the gradient. Most algorithms based on this theorem, in practice, break this assumption, introducing a distribution shift that can cause the convergence to poor solutions. In this paper, we propose a new approach of reconstructing the policy gradient from the start state without requiring a particular sampling strategy. The policy gradient calculation in this form can be simplified in terms of a gradient critic, which can be recursively estimated due to a new Bellman equation of gradients. By using temporal-difference updates of the gradient critic from an off-policy data stream, we develop the first estimator that sidesteps the distribution shift issue in a model-free way. We prove that, under certain realizability conditions, our estimator is unbiased regardless of the sampling strategy. We empirically show that our technique achieves a superior bias-variance trade-off and performance in presence of off-policy samples.
△ Less
Submitted 7 July, 2022; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Spontaneous trail formation in populations of auto-chemotactic walkers
Authors:
Zahra Mokhtari,
Robert I. A. Patterson,
Felix Höfling
Abstract:
We study the formation of trails in populations of self-propelled agents that make oriented deposits of pheromones and also sense such deposits to which they then respond with gradual changes of their direction of motion. Based on extensive off-lattice computer simulations aiming at the scale of insects, e.g., ants, we identify a number of emerging stationary patterns and obtain qualitatively the…
▽ More
We study the formation of trails in populations of self-propelled agents that make oriented deposits of pheromones and also sense such deposits to which they then respond with gradual changes of their direction of motion. Based on extensive off-lattice computer simulations aiming at the scale of insects, e.g., ants, we identify a number of emerging stationary patterns and obtain qualitatively the non-equilibrium state diagram of the model, spanned by the strength of the agent--pheromone interaction and the number density of the population. In particular, we demonstrate the spontaneous formation of persistent, macroscopic trails, and highlight some behaviour that is consistent with a dynamic phase transition. This includes a characterisation of the mass of system-spanning trails as a potential order parameter. We also propose a dynamic model for a few macroscopic observables, including the sub-population size of trail-following agents, which captures the early phase of trail formation.
△ Less
Submitted 8 December, 2021;
originally announced December 2021.
-
A large-deviations principle for all the components in a sparse inhomogeneous random graph
Authors:
Luisa Andreis,
Wolfgang König,
Heide Langhammer,
Robert I. A. Patterson
Abstract:
We study an inhomogeneous sparse random graph on [N] = {1, . . . , N } as introduced in a seminal paper by Bollobas, Janson and Riordan (2007): vertices have a type (here in a compact metric space S), and edges between different vertices occur randomly and independently over all vertex pairs, with a probability depending on the two vertex types. In the limit N to infinity, we consider the sparse r…
▽ More
We study an inhomogeneous sparse random graph on [N] = {1, . . . , N } as introduced in a seminal paper by Bollobas, Janson and Riordan (2007): vertices have a type (here in a compact metric space S), and edges between different vertices occur randomly and independently over all vertex pairs, with a probability depending on the two vertex types. In the limit N to infinity, we consider the sparse regime, where the average degree is O(1). We prove a large-deviations principle with explicit rate function for the statistics of the collection of all the connected components, registered according to their vertex type sets, and distinguished according to being microscopic (of finite size) or macroscopic (of size proportional to N). In doing so, we derive explicit logarithmic asymptotics for the probability that GN is connected. We present a full analysis of the rate function including its minimizers. From this analysis we deduce a number of limit laws, conditional and unconditional, which provide comprehensive information about all the microscopic and macroscopic components of the graph. In particular, we recover the criterion for the existence of the phase transition given in [5].
△ Less
Submitted 17 August, 2023; v1 submitted 25 November, 2021;
originally announced November 2021.
-
A local normal form for Hamiltonian actions of compact semisimple Poisson-Lie groups
Authors:
Megumi Harada,
Jeremy Lane,
Aidan Patterson
Abstract:
The main contribution of this manuscript is a local normal form for Hamiltonian actions of Poisson-Lie groups $K$ on a symplectic manifold equipped with an $AN$-valued moment map, where $AN$ is the dual Poisson-Lie group of $K$. Our proof uses the delinearization theorem of Alekseev which relates a classical Hamiltonian action of $K$ with $\mathfrak{k}^*$-valued moment map to a Hamiltonian action…
▽ More
The main contribution of this manuscript is a local normal form for Hamiltonian actions of Poisson-Lie groups $K$ on a symplectic manifold equipped with an $AN$-valued moment map, where $AN$ is the dual Poisson-Lie group of $K$. Our proof uses the delinearization theorem of Alekseev which relates a classical Hamiltonian action of $K$ with $\mathfrak{k}^*$-valued moment map to a Hamiltonian action with an $AN$-valued moment map, via a deformation of symplectic structures. We obtain our main result by proving a ``delinearization commutes with symplectic quotients'' theorem which is also of independent interest, and then putting this together with the local normal form theorem for classical Hamiltonian actions wtih $\mathfrak{k}^*$-valued moment maps. A key ingredient for our main result is the delinearization $\mathcal{D}(ω_{can})$ of the canonical symplectic structure on $T^*K$, so we additionally take some steps toward explicit computations of $\mathcal{D}(ω_{can})$. In particular, in the case $K=SU(2)$, we obtain explicit formulas for the matrix coefficients of $\mathcal{D}(ω_{can})$ with respect to a natural choice of coordinates on $T^*SU(2)$.
△ Less
Submitted 5 June, 2021;
originally announced June 2021.
-
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
Authors:
Andrew Patterson,
Adam White,
Martha White
Abstract:
Many reinforcement learning algorithms rely on value estimation, however, the most widely used algorithms -- namely temporal difference algorithms -- can diverge under both off-policy sampling and nonlinear function approximation. Many algorithms have been developed for off-policy value estimation based on the linear mean squared projected Bellman error (MSPBE) and are sound under linear function…
▽ More
Many reinforcement learning algorithms rely on value estimation, however, the most widely used algorithms -- namely temporal difference algorithms -- can diverge under both off-policy sampling and nonlinear function approximation. Many algorithms have been developed for off-policy value estimation based on the linear mean squared projected Bellman error (MSPBE) and are sound under linear function approximation. Extending these methods to the nonlinear case has been largely unsuccessful. Recently, several methods have been introduced that approximate a different objective -- the mean-squared Bellman error (MSBE) -- which naturally facilitate nonlinear approximation. In this work, we build on these insights and introduce a new generalized MSPBE that extends the linear MSPBE to the nonlinear setting. We show how this generalized objective unifies previous work and obtain new bounds for the value error of the solutions of the generalized objective. We derive an easy-to-use, but sound, algorithm to minimize the generalized objective, and show that it is more stable across runs, is less sensitive to hyperparameters, and performs favorably across four control domains with neural network function approximation.
△ Less
Submitted 28 March, 2022; v1 submitted 28 April, 2021;
originally announced April 2021.
-
Variational structures beyond gradient flows: a macroscopic fluctuation-theory perspective
Authors:
Robert I. A. Patterson,
D. R. Michiel Renger,
Upanshu Sharma
Abstract:
Macroscopic equations arising out of stochastic particle systems in detailed balance (called dissipative systems or gradient flows) have a natural variational structure, which can be derived from the large-deviation rate functional for the density of the particle system. While large deviations can be studied in considerable generality, these variational structures are often restricted to systems i…
▽ More
Macroscopic equations arising out of stochastic particle systems in detailed balance (called dissipative systems or gradient flows) have a natural variational structure, which can be derived from the large-deviation rate functional for the density of the particle system. While large deviations can be studied in considerable generality, these variational structures are often restricted to systems in detailed balance. Using insights from macroscopic fluctuation theory, in this work we aim to generalise this variational connection beyond dissipative systems by augmenting densities with fluxes, which encode non-dissipative effects. Our main contribution is an abstract framework, which for a given flux-density cost and a quasipotential, provides a decomposition into dissipative and non-dissipative components and a generalised orthogonality relation between them. We then apply this abstract theory to various stochastic particle systems -- independent copies of jump processes, zero-range processes, chemical-reaction networks in complex balance and lattice-gas models.
△ Less
Submitted 3 October, 2023; v1 submitted 26 March, 2021;
originally announced March 2021.
-
Large deviations for Markov jump processes with uniformly diminishing rates
Authors:
Andrea Agazzi,
Luisa Andreis,
Robert I. A. Patterson,
D. R. Michiel Renger
Abstract:
We prove a large-deviation principle (LDP) for the sample paths of jump Markov processes in the small noise limit when, possibly, all the jump rates vanish uniformly, but slowly enough, in a region of the state space. We further discuss the optimality of our assumptions on the decay of the jump rates. As a direct application of this work we relax the assumptions needed for the application of LDPs…
▽ More
We prove a large-deviation principle (LDP) for the sample paths of jump Markov processes in the small noise limit when, possibly, all the jump rates vanish uniformly, but slowly enough, in a region of the state space. We further discuss the optimality of our assumptions on the decay of the jump rates. As a direct application of this work we relax the assumptions needed for the application of LDPs to, e.g., Chemical Reaction Network dynamics, where vanishing reaction rates arise naturally particularly the context of mass action kinetics.
△ Less
Submitted 25 February, 2021;
originally announced February 2021.
-
Social Distance Characterization by means of Pedestrian Simulation
Authors:
Daniel R. Parisi,
Germán A. Patterson,
Lucio Pagni,
Agustina Osimani,
Tomas Bacigalupo,
Juan Godfrid,
Federico M. Bergagna,
Manuel Rodriguez Brizi,
Pedro Momesso,
Fermin L. Gomez,
Jimena Lozano,
Juan Martin Baader,
Ignacio Ribas,
Facundo P. Astiz Meyer,
Miguel Di Luca,
Nicolás E. Barrera,
Ezequiel M. Keimel Álvarez,
Maite M. Herran Oyhanarte,
Pedro R. **arilho,
Ximena Zuberbuhler,
Felipe Gorostiaga
Abstract:
In the present work, we study how the number of simulated clients (occupancy) affects the social distance in an ideal supermarket. For this, we account for realistic typical dimensions and process time (picking products and checkout). From the simulated trajectories, we measure events of social distance less than 2 m and its duration. Between other observables, we define a social distance coeffici…
▽ More
In the present work, we study how the number of simulated clients (occupancy) affects the social distance in an ideal supermarket. For this, we account for realistic typical dimensions and process time (picking products and checkout). From the simulated trajectories, we measure events of social distance less than 2 m and its duration. Between other observables, we define a social distance coefficient that informs how many events (of a given duration) suffer each agent in the system. These kinds of outputs could be useful for building procedures and protocols in the context of a pandemic allowing to keep low health risks while setting a maximum operating capacity.
△ Less
Submitted 8 September, 2020;
originally announced September 2020.
-
Contraction $\mathcal{L}_1$-Adaptive Control using Gaussian Processes
Authors:
Aditya Gahlawat,
Arun Lakshmanan,
Lin Song,
Andrew Patterson,
Zhuohuan Wu,
Naira Hovakimyan,
Evangelos Theodorou
Abstract:
We present $\mathcal{CL}_1$-$\mathcal{GP}$, a control framework that enables safe simultaneous learning and control for systems subject to uncertainties. The two main constituents are contraction theory-based $\mathcal{L}_1$ ($\mathcal{CL}_1$) control and Bayesian learning in the form of Gaussian process (GP) regression. The $\mathcal{CL}_1$ controller ensures that control objectives are met while…
▽ More
We present $\mathcal{CL}_1$-$\mathcal{GP}$, a control framework that enables safe simultaneous learning and control for systems subject to uncertainties. The two main constituents are contraction theory-based $\mathcal{L}_1$ ($\mathcal{CL}_1$) control and Bayesian learning in the form of Gaussian process (GP) regression. The $\mathcal{CL}_1$ controller ensures that control objectives are met while providing safety certificates. Furthermore, $\mathcal{CL}_1$-$\mathcal{GP}$ incorporates any available data into a GP model of uncertainties, which improves performance and enables the motion planner to achieve optimality safely. This way, the safe operation of the system is always guaranteed, even during the learning transients. We provide a few illustrative examples for the safe learning and control of planar quadrotor systems in a variety of environments.
△ Less
Submitted 30 November, 2021; v1 submitted 8 September, 2020;
originally announced September 2020.
-
Optical Hemodynamic Imaging of Jugular Venous Dynamics During Altered Central Venous Pressure
Authors:
Robert Amelard,
Andrew D Robertson,
Courtney A Patterson,
Hannah Heigold,
Essi Saarikoski,
Richard L Hughson
Abstract:
An optical imaging system is proposed for quantitatively assessing jugular venous response to altered central venous pressure. The proposed system assesses sub-surface optical absorption changes from jugular venous waveforms with a spatial calibration procedure to normalize incident tissue illumination. Widefield frames of the right lateral neck were captured and calibrated using a novel flexible…
▽ More
An optical imaging system is proposed for quantitatively assessing jugular venous response to altered central venous pressure. The proposed system assesses sub-surface optical absorption changes from jugular venous waveforms with a spatial calibration procedure to normalize incident tissue illumination. Widefield frames of the right lateral neck were captured and calibrated using a novel flexible surface calibration method. A hemodynamic optical model was derived to quantify jugular venous optical attenuation (JVA) signals, and generate a spatial jugular venous pulsatility map. JVA was assessed in three cardiovascular protocols that altered central venous pressure: acute central hypovolemia (lower body negative pressure), venous congestion (head-down tilt), and impaired cardiac filling (Valsalva maneuver). JVA waveforms exhibited biphasic wave properties consistent with jugular venous pulse dynamics when time-aligned with an electrocardiogram. JVA correlated strongly (median, interquartile range) with invasive central venous pressure during graded central hypovolemia (r=0.85, [0.72, 0.95]), graded venous congestion (r=0.94, [0.84, 0.99]), and impaired cardiac filling (r=0.94, [0.85, 0.99]). Reduced JVA during graded acute hypovolemia was strongly correlated with reductions in stroke volume (SV) (r=0.85, [0.76, 0.92]) from baseline (SV: 79$\pm$15 mL, JVA: 0.56$\pm$0.10 a.u.) to -40 mmHg suction (SV: 59$\pm$18 mL, JVA: 0.47$\pm$0.05 a.u.; p$<$0.01). The proposed non-contact optical imaging system demonstrated jugular venous dynamics consistent with invasive central venous monitoring during three protocols that altered central venous pressure. This system provides non-invasive monitoring of pressure-induced jugular venous dynamics in clinically relevant conditions where catheterization is traditionally required, enabling monitoring in non-surgical environments.
△ Less
Submitted 24 March, 2021; v1 submitted 22 July, 2020;
originally announced July 2020.
-
Gradient Temporal-Difference Learning with Regularized Corrections
Authors:
Sina Ghiassian,
Andrew Patterson,
Shivam Garg,
Dhawal Gupta,
Adam White,
Martha White
Abstract:
It is still common to use Q-learning and temporal difference (TD) learning-even though they have divergence issues and sound Gradient TD alternatives exist-because divergence seems rare and they typically perform well. However, recent work with large neural network learning systems reveals that instability is more common than previously thought. Practitioners face a difficult dilemma: choose an ea…
▽ More
It is still common to use Q-learning and temporal difference (TD) learning-even though they have divergence issues and sound Gradient TD alternatives exist-because divergence seems rare and they typically perform well. However, recent work with large neural network learning systems reveals that instability is more common than previously thought. Practitioners face a difficult dilemma: choose an easy to use and performant TD method, or a more complex algorithm that is more sound but harder to tune and all but unexplored with non-linear function approximation or control. In this paper, we introduce a new method called TD with Regularized Corrections (TDRC), that attempts to balance ease of use, soundness, and performance. It behaves as well as TD, when TD performs well, but is sound in cases where TD diverges. We empirically investigate TDRC across a range of problems, for both prediction and control, and for both linear and non-linear function approximation, and show, potentially for the first time, that gradient TD methods could be a better alternative to TD and Q-learning.
△ Less
Submitted 17 September, 2020; v1 submitted 1 July, 2020;
originally announced July 2020.
-
A continuous-time state-space model for rapid quality-control of Argos locations from animal-borne tags
Authors:
Ian D. Jonsen,
Toby A. Patterson,
Daniel P. Costa,
Philip D. Doherty,
Brendan J. Godley,
W. James Grecian,
Christophe Guinet,
Xavier Hoenner,
Sarah S. Kienle,
Patrick W. Robison,
Stephen C. Votier,
Matthew J. Witt,
Mark A. Hindell,
Robert G. Harcourt,
Clive R. McMahon
Abstract:
State-space models are important tools for quality control of error-prone animal movement data. The near real-time (within 24 h) capability of the Argos satellite system aids dynamic ocean management of human activities by informing when animals enter intensive use zones. This capability also facilitates use of ocean observations from animal-borne sensors in operational ocean forecasting models. S…
▽ More
State-space models are important tools for quality control of error-prone animal movement data. The near real-time (within 24 h) capability of the Argos satellite system aids dynamic ocean management of human activities by informing when animals enter intensive use zones. This capability also facilitates use of ocean observations from animal-borne sensors in operational ocean forecasting models. Such near real-time data provision requires rapid, reliable quality control to deal with error-prone Argos locations. We formulate a continuous-time state-space model for the three types of Argos location data (Least-Squares, Kalman filter, and Kalman smoother), accounting for irregular timing of observations. Our model is deliberately simple to ensure speed and reliability for automated, near real-time quality control of Argos data. We validate the model by fitting to Argos data collected from 61 individuals across 7 marine vertebrates and compare model-estimated locations to GPS locations. Estimation accuracy varied among species with median Root Mean Squared Errors usually < 5 km and decreased with increasing data sampling rate and precision of Argos locations. Including a model parameter to inflate Argos error ellipse sizes resulted in more accurate location estimates. In some cases, the model appreciably improved the accuracy of the Argos Kalman smoother locations, which should not be possible if the smoother uses all available information. Our model provides quality-controlled locations from Argos Least-Squares or Kalman filter data with slightly better accuracy than Argos Kalman smoother data that are only available via reprocessing. Simplicity and ease of use make the model suitable both for automated quality control of near real-time Argos data and for manual use by researchers working with historical Argos data.
△ Less
Submitted 1 May, 2020;
originally announced May 2020.
-
$\mathcal{L}_1$-$\mathcal{GP}$: $\mathcal{L}_1$ Adaptive Control with Bayesian Learning
Authors:
Aditya Gahlawat,
Pan Zhao,
Andrew Patterson,
Naira Hovakimyan,
Evangelos A. Theodorou
Abstract:
We present $\mathcal{L}_1$-$\mathcal{GP}$, an architecture based on $\mathcal{L}_1$ adaptive control and Gaussian Process Regression (GPR) for safe simultaneous control and learning. On one hand, the $\mathcal{L}_1$ adaptive control provides stability and transient performance guarantees, which allows for GPR to efficiently and safely learn the uncertain dynamics. On the other hand, the learned dy…
▽ More
We present $\mathcal{L}_1$-$\mathcal{GP}$, an architecture based on $\mathcal{L}_1$ adaptive control and Gaussian Process Regression (GPR) for safe simultaneous control and learning. On one hand, the $\mathcal{L}_1$ adaptive control provides stability and transient performance guarantees, which allows for GPR to efficiently and safely learn the uncertain dynamics. On the other hand, the learned dynamics can be conveniently incorporated into the $\mathcal{L}_1$ control architecture without sacrificing robustness and tracking performance. Subsequently, the learned dynamics can lead to less conservative designs for performance/robustness tradeoff. We illustrate the efficacy of the proposed architecture via numerical simulations.
△ Less
Submitted 30 April, 2020;
originally announced April 2020.
-
Uncovering ecological state dynamics with hidden Markov models
Authors:
Brett T. McClintock,
Roland Langrock,
Olivier Gimenez,
Emmanuelle Cam,
David L. Borchers,
Richard Glennie,
Toby A. Patterson
Abstract:
Ecological systems can often be characterised by changes among a finite set of underlying states pertaining to individuals, populations, communities, or entire ecosystems through time. Owing to the inherent difficulty of empirical field studies, ecological state dynamics operating at any level of this hierarchy can often be unobservable or "hidden". Ecologists must therefore often contend with inc…
▽ More
Ecological systems can often be characterised by changes among a finite set of underlying states pertaining to individuals, populations, communities, or entire ecosystems through time. Owing to the inherent difficulty of empirical field studies, ecological state dynamics operating at any level of this hierarchy can often be unobservable or "hidden". Ecologists must therefore often contend with incomplete or indirect observations that are somehow related to these underlying processes. By formally disentangling state and observation processes based on simple yet powerful mathematical properties that can be used to describe many ecological phenomena, hidden Markov models (HMMs) can facilitate inferences about complex system state dynamics that might otherwise be intractable. However, while HMMs are routinely applied in other disciplines, they have only recently begun to gain traction within the broader ecological community. We provide a gentle introduction to HMMs, establish some common terminology, and review the immense scope of HMMs for applied ecological research. We also provide a supplemental tutorial on some of the more technical aspects of HMM implementation and interpretation. By illustrating how practitioners can use a simple conceptual template to customise HMMs for their specific systems of interest, revealing methodological links between existing applications, and highlighting some practical considerations and limitations of these approaches, our goal is to help establish HMMs as a fundamental inferential tool for ecologists.
△ Less
Submitted 14 July, 2020; v1 submitted 24 February, 2020;
originally announced February 2020.
-
Learning Probabilistic Intersection Traffic Models for Trajectory Prediction
Authors:
Andrew Patterson,
Aditya Gahlawat,
Naira Hovakimyan
Abstract:
Autonomous agents must be able to safely interact with other vehicles to integrate into urban environments. The safety of these agents is dependent on their ability to predict collisions with other vehicles' future trajectories for replanning and collision avoidance. The information needed to predict collisions can be learned from previously observed vehicle trajectories in a specific environment,…
▽ More
Autonomous agents must be able to safely interact with other vehicles to integrate into urban environments. The safety of these agents is dependent on their ability to predict collisions with other vehicles' future trajectories for replanning and collision avoidance. The information needed to predict collisions can be learned from previously observed vehicle trajectories in a specific environment, generating a traffic model. The learned traffic model can then be incorporated as prior knowledge into any trajectory estimation method being used in this environment. This work presents a Gaussian process based probabilistic traffic model that is used to quantify vehicle behaviors in an intersection. The Gaussian process model provides estimates for the average vehicle trajectory, while also capturing the variance between the different paths a vehicle may take in the intersection. The method is demonstrated on a set of time-series position trajectories. These trajectories are reconstructed by removing object recognition errors and missed frames that may occur due to data source processing. To create the intersection traffic model, the reconstructed trajectories are clustered based on their source and destination lanes. For each cluster, a Gaussian process model is created to capture the average behavior and the variance of the cluster. To show the applicability of the Gaussian model, the test trajectories are classified with only partial observations. Performance is quantified by the number of observations required to correctly classify the vehicle trajectory. Both the intersection traffic modeling computations and the classification procedure are timed. These times are presented as results and demonstrate that the model can be constructed in a reasonable amount of time and the classification procedure can be used for online applications.
△ Less
Submitted 5 February, 2020;
originally announced February 2020.
-
Quantum State Discrimination Using Noisy Quantum Neural Networks
Authors:
Andrew Patterson,
Hongxiang Chen,
Leonard Wossnig,
Simone Severini,
Dan Browne,
Ivan Rungger
Abstract:
Near-term quantum computers are noisy, and therefore must run algorithms with a low circuit depth and qubit count. Here we investigate how noise affects a quantum neural network (QNN) for state discrimination, applicable on near-term quantum devices as it fulfils the above criteria. We find that when simulating gradient calculation on a noisy device, a large number of parameters is disadvantageous…
▽ More
Near-term quantum computers are noisy, and therefore must run algorithms with a low circuit depth and qubit count. Here we investigate how noise affects a quantum neural network (QNN) for state discrimination, applicable on near-term quantum devices as it fulfils the above criteria. We find that when simulating gradient calculation on a noisy device, a large number of parameters is disadvantageous. By introducing a new smaller circuit ansatz we overcome this limitation, and find that the QNN performs well at noise levels of current quantum hardware. We also show that networks trained at higher noise levels can still converge to useful parameters. Our findings show that noisy quantum computers can be used in applications for state discrimination and for classifiers of the output of quantum generative adversarial networks.
△ Less
Submitted 15 June, 2020; v1 submitted 1 November, 2019;
originally announced November 2019.
-
Dynamical mean field theory algorithm and experiment on quantum computers
Authors:
I. Rungger,
N. Fitzpatrick,
H. Chen,
C. H. Alderete,
H. Apel,
A. Cowtan,
A. Patterson,
D. Munoz Ramo,
Y. Zhu,
N. H. Nguyen,
E. Grant,
S. Chretien,
L. Wossnig,
N. M. Linke,
R. Duncan
Abstract:
The developments of quantum computing algorithms and experiments for atomic scale simulations have largely focused on quantum chemistry for molecules, while their application in condensed matter systems is scarcely explored. Here we present a quantum algorithm to perform dynamical mean field theory (DMFT) calculations for condensed matter systems on currently available quantum computers, and demon…
▽ More
The developments of quantum computing algorithms and experiments for atomic scale simulations have largely focused on quantum chemistry for molecules, while their application in condensed matter systems is scarcely explored. Here we present a quantum algorithm to perform dynamical mean field theory (DMFT) calculations for condensed matter systems on currently available quantum computers, and demonstrate it on two quantum hardware platforms. DMFT is required to properly describe the large class of materials with strongly correlated electrons. The computationally challenging part arises from solving the effective problem of an interacting impurity coupled to a bath, which scales exponentially with system size on conventional computers. An exponential speedup is expected on quantum computers, but the algorithms proposed so far are based on real time evolution of the wavefunction, which requires high-depth circuits and hence very low noise levels in the quantum hardware. Here we propose an alternative approach, which uses the variational quantum eigensolver (VQE) method for ground and excited states to obtain the needed quantities as part of an exact diagonalization impurity solver. We present the algorithm for a two site DMFT system, which we benchmark using simulations on conventional computers as well as experiments on superconducting and trapped ion qubits, demonstrating that this method is suitable for running DMFT calculations on currently available quantum hardware.
△ Less
Submitted 8 January, 2020; v1 submitted 10 October, 2019;
originally announced October 2019.
-
Critical slowing down in the bistable regime of circuit quantum electrodynamics
Authors:
P. Brookes,
G. Tancredi,
A. D. Patterson,
J. Rahamim,
M. Esposito,
P. J. Leek,
E. Ginossar,
M. H. Szymanska
Abstract:
We investigate the dynamics of the bistable regime of the generalized Jaynes-Cummings Hamiltonian (GJC), realised by a circuit quantum electrodynamics (cQED) system consisting of a transmon qubit coupled to a microwave cavity. In this regime we observe critical slowing down in the approach to the steady state. By measuring the response of the cavity to a step function drive pulse we characterize t…
▽ More
We investigate the dynamics of the bistable regime of the generalized Jaynes-Cummings Hamiltonian (GJC), realised by a circuit quantum electrodynamics (cQED) system consisting of a transmon qubit coupled to a microwave cavity. In this regime we observe critical slowing down in the approach to the steady state. By measuring the response of the cavity to a step function drive pulse we characterize this slowing down as a function of driving frequency and power. We find that the critical slowing down saturates as the driving power is increased. We compare these results with the predictions of analytical and numerical calculations both with and without the Duffing approximation. We find that the Duffing approximation incorrectly predicts that the critical slowing down timescale increases exponentially with the drive, whereas the GJC model accurately predicts the saturation seen in our data, suggesting a different process of quantum activation.
△ Less
Submitted 31 July, 2019;
originally announced July 2019.
-
System-Level Development of a User-Integrated Semi-Autonomous Lawn Mowing System: Problem Overview, Basic Requirements, and Proposed Architecture
Authors:
Albert E. Patterson,
Yang Yuan,
William R. Norris
Abstract:
This concept paper outlines some recent efforts toward the design and development of user-integrated semi-autonomous home-sized lawn mowing systems from a systems engineering perspective. This is an important and emerging field of study within the robotics and systems engineering communities. The work presented includes a review of current progress on this problem, a discussion of the problem from…
▽ More
This concept paper outlines some recent efforts toward the design and development of user-integrated semi-autonomous home-sized lawn mowing systems from a systems engineering perspective. This is an important and emerging field of study within the robotics and systems engineering communities. The work presented includes a review of current progress on this problem, a discussion of the problem from a systems engineering perspective, a general system architecture developed by the authors, and a preliminary set of design requirements. This work is meant to provide a baseline and motivation for the further development and refinement of these systems within the systems engineering and robotics communities and is relevant to both academic and commercial research.
△ Less
Submitted 12 July, 2019;
originally announced July 2019.
-
Calibration of the cross-resonance two-qubit gate between directly-coupled transmons
Authors:
A. D. Patterson,
J. Rahamim,
T. Tsunoda,
P. Spring,
S. Jebari,
K. Ratter,
M. Mergenthaler,
G. Tancredi,
B. Vlastakis,
M. Esposito,
P. J. Leek
Abstract:
Quantum computation requires the precise control of the evolution of a quantum system, typically through application of discrete quantum logic gates on a set of qubits. Here, we use the cross-resonance interaction to implement a gate between two superconducting transmon qubits with a direct static dispersive coupling. We demonstrate a practical calibration procedure for the optimization of the gat…
▽ More
Quantum computation requires the precise control of the evolution of a quantum system, typically through application of discrete quantum logic gates on a set of qubits. Here, we use the cross-resonance interaction to implement a gate between two superconducting transmon qubits with a direct static dispersive coupling. We demonstrate a practical calibration procedure for the optimization of the gate, combining continuous and repeated-gate Hamiltonian tomography with step-wise reduction of dominant two-qubit coherent errors through map** to microwave control parameters. We show experimentally that this procedure can enable a $\hat{ZX}_{-π/2}$ gate with a fidelity $F=97.0(7)\%$, measured with interleaved randomized benchmarking. We show this in a architecture with out-of-plane control and readout that is readily extensible to larger scale quantum circuits.
△ Less
Submitted 14 May, 2019;
originally announced May 2019.
-
Realization of a Carbon-Nanotube-Based Superconducting Qubit
Authors:
Matthias Mergenthaler,
Ani Nersisyan,
Andrew Patterson,
Martina Esposito,
Andreas Baumgartner,
Christian Schönenberger,
G. Andrew D. Briggs,
Edward A. Laird,
Peter J. Leek
Abstract:
Hybrid circuit quantum electrodynamics (QED) involves the study of coherent quantum physics in solid state systems via their interactions with superconducting microwave circuits. Here we present an implementation of a hybrid superconducting qubit that employs a carbon nanotube as a Josephson junction. We realize the junction by contacting a carbon nanotube with a superconducting Pd/Al bi-layer, an…
▽ More
Hybrid circuit quantum electrodynamics (QED) involves the study of coherent quantum physics in solid state systems via their interactions with superconducting microwave circuits. Here we present an implementation of a hybrid superconducting qubit that employs a carbon nanotube as a Josephson junction. We realize the junction by contacting a carbon nanotube with a superconducting Pd/Al bi-layer, and implement voltage tunability of the qubit frequency using a local electrostatic gate. We demonstrate strong dispersive coupling to a coplanar waveguide resonator via observation of a resonator frequency shift dependent on applied gate voltage. We extract qubit parameters from spectroscopy using dispersive readout and find qubit relaxation and coherence times in the range of $10-200~\rm{ns}$.
△ Less
Submitted 22 April, 2019;
originally announced April 2019.
-
Intent-Aware Probabilistic Trajectory Estimation for Collision Prediction with Uncertainty Quantification
Authors:
Andrew Patterson,
Arun Lakshmanan,
Naira Hovakimyan
Abstract:
Collision prediction in a dynamic and unknown environment relies on knowledge of how the environment is changing. Many collision prediction methods rely on deterministic knowledge of how obstacles are moving in the environment. However, complete deterministic knowledge of the obstacles' motion is often unavailable. This work proposes a Gaussian process based prediction method that replaces the ass…
▽ More
Collision prediction in a dynamic and unknown environment relies on knowledge of how the environment is changing. Many collision prediction methods rely on deterministic knowledge of how obstacles are moving in the environment. However, complete deterministic knowledge of the obstacles' motion is often unavailable. This work proposes a Gaussian process based prediction method that replaces the assumption of deterministic knowledge of each obstacle's future behavior with probabilistic knowledge, to allow a larger class of obstacles to be considered. The method solely relies on position and velocity measurements to predict collisions with dynamic obstacles. We show that the uncertainty region for obstacle positions can be expressed in terms of a combination of polynomials generated with Gaussian process regression. To control the growth of uncertainty over arbitrary time horizons, a probabilistic obstacle intention is assumed as a distribution over obstacle positions and velocities, which can be naturally included in the Gaussian process framework. Our approach is demonstrated in two case studies in which (i), an obstacle overtakes the agent and (ii), an obstacle crosses the agent's path perpendicularly. In these simulations we show that the collision can be predicted despite having limited knowledge of the obstacle's behavior.
△ Less
Submitted 4 April, 2019;
originally announced April 2019.
-
Bilinear Coagulation Equations
Authors:
Daniel Heydecker,
Robert I. A. Patterson
Abstract:
We consider coagulation equations of Smoluchowski or Flory type where the total merge rate has a bilinear form $π(y)\cdot Aπ(x)$ for a vector of conserved quantities $π$, generalising the multiplicative kernel. For these kernels, a gelation transition occurs at a finite time $t_\mathrm{g}\in (0,\infty)$, which can be given exactly in terms of an eigenvalue problem in finite dimensions. We prove a…
▽ More
We consider coagulation equations of Smoluchowski or Flory type where the total merge rate has a bilinear form $π(y)\cdot Aπ(x)$ for a vector of conserved quantities $π$, generalising the multiplicative kernel. For these kernels, a gelation transition occurs at a finite time $t_\mathrm{g}\in (0,\infty)$, which can be given exactly in terms of an eigenvalue problem in finite dimensions. We prove a hydrodynamic limit for a stochastic coagulant, including a corresponding phase transition for the largest particle, and exploit a coupling to random graphs to extend analysis of the limiting process beyond the gelation time.
△ Less
Submitted 14 October, 2019; v1 submitted 20 February, 2019;
originally announced February 2019.
-
Proximity Queries for Absolutely Continuous Parametric Curves
Authors:
Arun Lakshmanan,
Andrew Patterson,
Venanzio Cichella,
Naira Hovakimyan
Abstract:
In motion planning problems for autonomous robots, such as self-driving cars, the robot must ensure that its planned path is not in close proximity to obstacles in the environment. However, the problem of evaluating the proximity is generally non-convex and serves as a significant computational bottleneck for motion planning algorithms. In this paper, we present methods for a general class of abso…
▽ More
In motion planning problems for autonomous robots, such as self-driving cars, the robot must ensure that its planned path is not in close proximity to obstacles in the environment. However, the problem of evaluating the proximity is generally non-convex and serves as a significant computational bottleneck for motion planning algorithms. In this paper, we present methods for a general class of absolutely continuous parametric curves to compute: (i) the minimum separating distance, (ii) tolerance verification, and (iii) collision detection. Our methods efficiently compute bounds on obstacle proximity by bounding the curve in a convex region. This bound is based on an upper bound on the curve arc length that can be expressed in closed form for a useful class of parametric curves including curves with trigonometric or polynomial bases. We demonstrate the computational efficiency and accuracy of our approach through numerical simulations of several proximity problems.
△ Less
Submitted 19 June, 2019; v1 submitted 13 February, 2019;
originally announced February 2019.
-
Cloud Programming Simplified: A Berkeley View on Serverless Computing
Authors:
Eric Jonas,
Johann Schleier-Smith,
Vikram Sreekanti,
Chia-Che Tsai,
Anurag Khandelwal,
Qifan Pu,
Vaishaal Shankar,
Joao Carreira,
Karl Krauth,
Neeraja Yadwadkar,
Joseph E. Gonzalez,
Raluca Ada Popa,
Ion Stoica,
David A. Patterson
Abstract:
Serverless cloud computing handles virtually all the system administration operations needed to make it easier for programmers to use the cloud. It provides an interface that greatly simplifies cloud programming, and represents an evolution that parallels the transition from assembly language to high-level programming languages. This paper gives a quick history of cloud computing, including an acc…
▽ More
Serverless cloud computing handles virtually all the system administration operations needed to make it easier for programmers to use the cloud. It provides an interface that greatly simplifies cloud programming, and represents an evolution that parallels the transition from assembly language to high-level programming languages. This paper gives a quick history of cloud computing, including an accounting of the predictions of the 2009 Berkeley View of Cloud Computing paper, explains the motivation for serverless computing, describes applications that stretch the current limits of serverless, and then lists obstacles and research opportunities required for serverless computing to fulfill its full potential. Just as the 2009 paper identified challenges for the cloud and predicted they would be addressed and that cloud use would accelerate, we predict these issues are solvable and that serverless computing will grow to dominate the future of cloud computing.
△ Less
Submitted 9 February, 2019;
originally announced February 2019.
-
A large-deviations principle for all the cluster sizes of a sparse Erdős-Rényi graph
Authors:
Luisa Andreis,
Wolfgang König,
Robert I. A. Patterson
Abstract:
Let $\mathcal{G}(N,\frac 1Nt_N)$ be the Erdős-Rényi graph with connection probability $\frac 1Nt_N\sim t/N$ as $N\to\infty$ for a fixed $t\in(0,\infty)$. We derive a large-deviations principle for the empirical measure of the sizes of all the connected components of $\mathcal{G}(N,\frac 1Nt_N)$, registered according to microscopic sizes (i.e., of finite order), macroscopic ones (i.e., of order…
▽ More
Let $\mathcal{G}(N,\frac 1Nt_N)$ be the Erdős-Rényi graph with connection probability $\frac 1Nt_N\sim t/N$ as $N\to\infty$ for a fixed $t\in(0,\infty)$. We derive a large-deviations principle for the empirical measure of the sizes of all the connected components of $\mathcal{G}(N,\frac 1Nt_N)$, registered according to microscopic sizes (i.e., of finite order), macroscopic ones (i.e., of order $N$), and mesoscopic ones (everything in between). The rate function explicitly describes the microscopic and macroscopic components and the fraction of vertices in components of mesoscopic sizes. Moreover, it clearly captures the well known phase transition at $t=1$ as part of a comprehensive picture. The proofs rely on elementary combinatorics and on known estimates and asymptotics for the probability that subgraphs are connected. We also draw conclusions for the strongly related model of the multiplicative coalescent, the Marcus--Lushnikov coagulation model with monodisperse initial condition, and its gelation phase transition.
△ Less
Submitted 21 January, 2021; v1 submitted 7 January, 2019;
originally announced January 2019.
-
Online Off-policy Prediction
Authors:
Sina Ghiassian,
Andrew Patterson,
Martha White,
Richard S. Sutton,
Adam White
Abstract:
This paper investigates the problem of online prediction learning, where learning proceeds continuously as the agent interacts with an environment. The predictions made by the agent are contingent on a particular way of behaving, represented as a value function. However, the behavior used to select actions and generate the behavior data might be different from the one used to define the prediction…
▽ More
This paper investigates the problem of online prediction learning, where learning proceeds continuously as the agent interacts with an environment. The predictions made by the agent are contingent on a particular way of behaving, represented as a value function. However, the behavior used to select actions and generate the behavior data might be different from the one used to define the predictions, and thus the samples are generated off-policy. The ability to learn behavior-contingent predictions online and off-policy has long been advocated as a key capability of predictive-knowledge learning systems but remained an open algorithmic challenge for decades. The issue lies with the temporal difference (TD) learning update at the heart of most prediction algorithms: combining bootstrap**, off-policy sampling and function approximation may cause the value estimate to diverge. A breakthrough came with the development of a new objective function that admitted stochastic gradient descent variants of TD. Since then, many sound online off-policy prediction algorithms have been developed, but there has been limited empirical work investigating the relative merits of all the variants. This paper aims to fill these empirical gaps and provide clarity on the key ideas behind each method. We summarize the large body of literature on off-policy learning, focusing on 1- methods that use computation linear in the number of features and are convergent under off-policy sampling, and 2- other methods which have proven useful with non-fixed, nonlinear function approximation. We provide an empirical study of off-policy prediction methods in two challenging microworlds. We report each method's parameter sensitivity, empirical convergence rate, and final performance, providing new insights that should enable practitioners to successfully extend these new methods to large-scale applications.[Abridged abstract]
△ Less
Submitted 6 November, 2018;
originally announced November 2018.
-
General Value Function Networks
Authors:
Matthew Schlegel,
Andrew Jacobsen,
Zaheer Abbas,
Andrew Patterson,
Adam White,
Martha White
Abstract:
State construction is important for learning in partially observable environments. A general purpose strategy for state construction is to learn the state update using a Recurrent Neural Network (RNN), which updates the internal state using the current internal state and the most recent observation. This internal state provides a summary of the observed sequence, to facilitate accurate predictions…
▽ More
State construction is important for learning in partially observable environments. A general purpose strategy for state construction is to learn the state update using a Recurrent Neural Network (RNN), which updates the internal state using the current internal state and the most recent observation. This internal state provides a summary of the observed sequence, to facilitate accurate predictions and decision-making. At the same time, specifying and training RNNs is notoriously tricky, particularly as the common strategy to approximate gradients back in time, called truncated Back-prop Through Time (BPTT), can be sensitive to the truncation window. Further, domain-expertise--which can usually help constrain the function class and so improve trainability--can be difficult to incorporate into complex recurrent units used within RNNs. In this work, we explore how to use multi-step predictions to constrain the RNN and incorporate prior knowledge. In particular, we revisit the idea of using predictions to construct state and ask: does constraining (parts of) the state to consist of predictions about the future improve RNN trainability? We formulate a novel RNN architecture, called a General Value Function Network (GVFN), where each internal state component corresponds to a prediction about the future represented as a value function. We first provide an objective for optimizing GVFNs, and derive several algorithms to optimize this objective. We then show that GVFNs are more robust to the truncation level, in many cases only requiring one-step gradient updates.
△ Less
Submitted 2 February, 2021; v1 submitted 17 July, 2018;
originally announced July 2018.
-
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains
Authors:
Yangchen Pan,
Muhammad Zaheer,
Adam White,
Andrew Patterson,
Martha White
Abstract:
Model-based strategies for control are critical to obtain sample efficient learning. Dyna is a planning paradigm that naturally interleaves learning and planning, by simulating one-step experience to update the action-value function. This elegant planning strategy has been mostly explored in the tabular setting. The aim of this paper is to revisit sample-based planning, in stochastic and continuou…
▽ More
Model-based strategies for control are critical to obtain sample efficient learning. Dyna is a planning paradigm that naturally interleaves learning and planning, by simulating one-step experience to update the action-value function. This elegant planning strategy has been mostly explored in the tabular setting. The aim of this paper is to revisit sample-based planning, in stochastic and continuous domains with learned models. We first highlight the flexibility afforded by a model over Experience Replay (ER). Replay-based methods can be seen as stochastic planning methods that repeatedly sample from a buffer of recent agent-environment interactions and perform updates to improve data efficiency. We show that a model, as opposed to a replay buffer, is particularly useful for specifying which states to sample from during planning, such as predecessor states that propagate information in reverse from a state more quickly. We introduce a semi-parametric model learning approach, called Reweighted Experience Models (REMs), that makes it simple to sample next states or predecessors. We demonstrate that REM-Dyna exhibits similar advantages over replay-based methods in learning in continuous state problems, and that the performance gap grows when moving to stochastic domains, of increasing size.
△ Less
Submitted 12 June, 2018;
originally announced June 2018.
-
Percolation for D2D Networks on Street Systems
Authors:
Elie Cali,
Nila Novita Gafur,
Christian Hirsch,
Benedikt Jahnel,
Taoufik En-Najjary,
Robert I. A. Patterson
Abstract:
We study fundamental characteristics for the connectivity of multi-hop D2D networks. Devices are randomly distributed on street systems and are able to communicate with each other whenever their separation is smaller than some connectivity threshold. We model the street systems as Poisson-Voronoi or Poisson-Delaunay tessellations with varying street lengths. We interpret the existence of adequate…
▽ More
We study fundamental characteristics for the connectivity of multi-hop D2D networks. Devices are randomly distributed on street systems and are able to communicate with each other whenever their separation is smaller than some connectivity threshold. We model the street systems as Poisson-Voronoi or Poisson-Delaunay tessellations with varying street lengths. We interpret the existence of adequate D2D connectivity as percolation of the underlying random graph. We derive and compare approximations for the critical device-intensity for percolation, the percolation probability and the graph distance. Our results show that for urban areas, the Poisson Boolean Model gives a very good approximation, while for rural areas, the percolation probability stays far from 1 even far above the percolation threshold.
△ Less
Submitted 31 January, 2018;
originally announced January 2018.
-
Double-sided coaxial circuit QED with out-of-plane wiring
Authors:
J. Rahamim,
T. Behrle,
M. J. Peterer,
A. Patterson,
P. Spring,
T. Tsunoda,
R. Manenti,
G. Tancredi,
P. J. Leek
Abstract:
Superconducting circuits are well established as a strong candidate platform for the development of quantum computing. In order to advance to a practically useful level, architectures are needed which combine arrays of many qubits with selective qubit control and readout, without compromising on coherence. Here we present a coaxial circuit QED architecture in which qubit and resonator are fabricat…
▽ More
Superconducting circuits are well established as a strong candidate platform for the development of quantum computing. In order to advance to a practically useful level, architectures are needed which combine arrays of many qubits with selective qubit control and readout, without compromising on coherence. Here we present a coaxial circuit QED architecture in which qubit and resonator are fabricated on opposing sides of a single chip, and control and readout wiring are provided by coaxial wiring running perpendicular to the chip plane. We present characterisation measurements of a fabricated device in good agreement with simulated parameters and demonstrating energy relaxation and dephasing times of $T_1 = 4.1\,μ$s and $T_2 = 5.7\,μ$s respectively. The architecture allows for scaling to large arrays of selectively controlled and measured qubits with the advantage of all wiring being out of the plane.
△ Less
Submitted 1 June, 2017; v1 submitted 16 March, 2017;
originally announced March 2017.
-
Circuit quantum acoustodynamics with surface acoustic waves
Authors:
R. Manenti,
A. F. Kockum,
A. Patterson,
T. Behrle,
J. Rahamim,
G. Tancredi,
F. Nori,
P. J. Leek
Abstract:
The experimental investigation of quantum devices incorporating mechanical resonators has opened up new frontiers in the study of quantum mechanics at a macroscopic level$^{1,2}$. Superconducting microwave circuits have proven to be a powerful platform for the realisation of such quantum devices, both in cavity optomechanics$^{3,4}$, and circuit quantum electro-dynamics (QED)$^{5,6}$. While most e…
▽ More
The experimental investigation of quantum devices incorporating mechanical resonators has opened up new frontiers in the study of quantum mechanics at a macroscopic level$^{1,2}$. Superconducting microwave circuits have proven to be a powerful platform for the realisation of such quantum devices, both in cavity optomechanics$^{3,4}$, and circuit quantum electro-dynamics (QED)$^{5,6}$. While most experiments to date have involved localised nanomechanical resonators, it has recently been shown that propagating surface acoustic waves (SAWs) can be piezoelectrically coupled to superconducting qubits$^{7,8}$, and confined in high-quality Fabry-Perot cavities up to microwave frequencies in the quantum regime$^{9}$, indicating the possibility of realising coherent exchange of quantum information between the two systems. Here we present measurements of a device in which a superconducting qubit is embedded in, and interacts with, the acoustic field of a Fabry-Perot SAW cavity on quartz, realising a surface acoustic version of cavity quantum electrodynamics. This quantum acoustodynamics (QAD) architecture may be used to develop new quantum acoustic devices in which quantum information is stored in trapped on-chip surface acoustic wavepackets, and manipulated in ways that are impossible with purely electromagnetic signals, due to the $10^{5}$ times slower speed of travel of the mechanical waves.
△ Less
Submitted 13 March, 2017;
originally announced March 2017.
-
Simultaneous bistability of qubit and resonator in circuit quantum electrodynamics
Authors:
Th. K. Mavrogordatos,
G. Tancredi,
M. Elliott,
M. J. Peterer,
A. Patterson,
J. Rahamim,
P. J. Leek,
E. Ginossar,
M. H. Szymańska
Abstract:
We explore the joint activated dynamics exhibited by two quantum degrees of freedom: a cavity mode oscillator which is strongly coupled to a superconducting qubit in the strongly coherently driven dispersive regime. Dynamical simulations and complementary measurements show a range of parameters where both the cavity and the qubit exhibit sudden simultaneous switching between two metastable states.…
▽ More
We explore the joint activated dynamics exhibited by two quantum degrees of freedom: a cavity mode oscillator which is strongly coupled to a superconducting qubit in the strongly coherently driven dispersive regime. Dynamical simulations and complementary measurements show a range of parameters where both the cavity and the qubit exhibit sudden simultaneous switching between two metastable states. This manifests in ensemble averaged amplitudes of both the cavity and qubit exhibiting a partial coherent cancellation. Transmission measurements of driven microwave cavities coupled to transmon qubits show detailed features which agree with the theory in the regime of simultaneous switching.
△ Less
Submitted 2 January, 2017; v1 submitted 30 November, 2016;
originally announced November 2016.
-
Estimation and simulation of foraging trips in land-based marine predators
Authors:
Théo Michelot,
Roland Langrock,
Sophie Bestley,
Ian D. Jonsen,
Theoni Photopoulou,
Toby A. Patterson
Abstract:
The behaviour of colony-based marine predators is the focus of much research globally. Large telemetry and tracking data sets have been collected for this group of animals, and are accompanied by many theoretical studies of optimal foraging strategies. However, relatively few studies have detailed statistical methods for inferring behaviours in central place foraging trips. In this paper we descri…
▽ More
The behaviour of colony-based marine predators is the focus of much research globally. Large telemetry and tracking data sets have been collected for this group of animals, and are accompanied by many theoretical studies of optimal foraging strategies. However, relatively few studies have detailed statistical methods for inferring behaviours in central place foraging trips. In this paper we describe an approach based on hidden Markov models, which splits foraging trips into segments labelled as "outbound", "search", "forage", and "inbound". By structuring the hidden Markov model transition matrix appropriately, the model naturally handles the sequence of behaviours within a foraging trip. Additionally, by structuring the model in this way, we are able to develop realistic simulations from the fitted model. We demonstrate our approach on data from southern elephant seals (Mirounga leonina) tagged on Kerguelen Island in the Southern Ocean. We discuss the differences between our 4-state model and the widely used 2-state model, and the advantages and disadvantages of employing a more complex model.
△ Less
Submitted 25 April, 2017; v1 submitted 20 October, 2016;
originally announced October 2016.
-
The Renewed Case for the Reduced Instruction Set Computer: Avoiding ISA Bloat with Macro-Op Fusion for RISC-V
Authors:
Christopher Celio,
Palmer Dabbelt,
David A. Patterson,
Krste Asanović
Abstract:
This report makes the case that a well-designed Reduced Instruction Set Computer (RISC) can match, and even exceed, the performance and code density of existing commercial Complex Instruction Set Computers (CISC) while maintaining the simplicity and cost-effectiveness that underpins the original RISC goals.
We begin by comparing the dynamic instruction counts and dynamic instruction bytes fetche…
▽ More
This report makes the case that a well-designed Reduced Instruction Set Computer (RISC) can match, and even exceed, the performance and code density of existing commercial Complex Instruction Set Computers (CISC) while maintaining the simplicity and cost-effectiveness that underpins the original RISC goals.
We begin by comparing the dynamic instruction counts and dynamic instruction bytes fetched for the popular proprietary ARMv7, ARMv8, IA-32, and x86-64 Instruction Set Architectures (ISAs) against the free and open RISC-V RV64G and RV64GC ISAs when running the SPEC CINT2006 benchmark suite. RISC-V was designed as a very small ISA to support a wide range of implementations, and has a less mature compiler toolchain. However, we observe that on SPEC CINT2006 RV64G executes on average 16% more instructions than x86-64, 3% more instructions than IA-32, 9% more instructions than ARMv8, but 4% fewer instructions than ARMv7.
CISC x86 implementations break up complex instructions into smaller internal RISC-like micro-ops, and the RV64G instruction count is within 2% of the x86-64 retired micro-op count. RV64GC, the compressed variant of RV64G, is the densest ISA studied, fetching 8% fewer dynamic instruction bytes than x86-64. We observed that much of the increased RISC-V instruction count is due to a small set of common multi-instruction idioms.
Exploiting this fact, the RV64G and RV64GC effective instruction count can be reduced by 5.4% on average by leveraging macro-op fusion. Combining the compressed RISC-V ISA extension with macro-op fusion provides both the densest ISA and the fewest dynamic operations retired per program, reducing the motivation to add more instructions to the ISA. This approach retains a single simple ISA suitable for both low-end and high-end implementations, where high-end implementations can boost performance through microarchitectural techniques.
△ Less
Submitted 8 July, 2016;
originally announced July 2016.
-
Statistical modelling of individual animal movement: an overview of key methods and a discussion of practical challenges
Authors:
Toby A Patterson,
Alison Parton,
Roland Langrock,
Paul G Blackwell,
Len Thomas,
Ruth King
Abstract:
With the influx of complex and detailed tracking data gathered from electronic tracking devices, the analysis of animal movement data has recently emerged as a cottage industry amongst biostatisticians. New approaches of ever greater complexity are continue to be added to the literature. In this paper, we review what we believe to be some of the most popular and most useful classes of statistical…
▽ More
With the influx of complex and detailed tracking data gathered from electronic tracking devices, the analysis of animal movement data has recently emerged as a cottage industry amongst biostatisticians. New approaches of ever greater complexity are continue to be added to the literature. In this paper, we review what we believe to be some of the most popular and most useful classes of statistical models used to analyze individual animal movement data. Specifically we consider discrete-time hidden Markov models, more general state-space models and diffusion processes. We argue that these models should be core components in the toolbox for quantitative researchers working on stochastic modelling of individual animal movement. The paper concludes by offering some general observations on the direction of statistical analysis of animal movement. There is a trend in movement ecology toward what are arguably overly-complex modelling approaches which are inaccessible to ecologists, unwieldy with large data sets or not based in mainstream statistical practice. Additionally, some analysis methods developed within the ecological community ignore fundamental properties of movement data, potentially leading to misleading conclusions about animal movement. Corresponding approaches, e.g. based on Lévy walk-type models, continue to be popular despite having been largely discredited. We contend that there is a need for an appropriate balance between the extremes of either being overly complex or being overly simplistic, whereby the discipline relies on models of intermediate complexity that are usable by general ecologists, but grounded in well-developed statistical practice and efficient to fit to large data sets.
△ Less
Submitted 30 January, 2017; v1 submitted 24 March, 2016;
originally announced March 2016.
-
Analysis of animal accelerometer data using hidden Markov models
Authors:
Vianey Leos-Barajas,
Theoni Photopoulou,
Roland Langrock,
Toby A. Patterson,
Yuuki Watanabe,
Megan Murgatroyd,
Yannis P. Papastamatiou
Abstract:
Use of accelerometers is now widespread within animal biotelemetry as they provide a means of measuring an animal's activity in a meaningful and quantitative way where direct observation is not possible. In sequential acceleration data there is a natural dependence between observations of movement or behaviour, a fact that has been largely ignored in most analyses. Analyses of acceleration data wh…
▽ More
Use of accelerometers is now widespread within animal biotelemetry as they provide a means of measuring an animal's activity in a meaningful and quantitative way where direct observation is not possible. In sequential acceleration data there is a natural dependence between observations of movement or behaviour, a fact that has been largely ignored in most analyses. Analyses of acceleration data where serial dependence has been explicitly modelled have largely relied on hidden Markov models (HMMs). Depending on the aim of an analysis, either a supervised or an unsupervised learning approach can be applied. Under a supervised context, an HMM is trained to classify unlabelled acceleration data into a finite set of pre-specified categories, whereas we will demonstrate how an unsupervised learning approach can be used to infer new aspects of animal behaviour. We will provide the details necessary to implement and assess an HMM in both the supervised and unsupervised context, and discuss the data requirements of each case. We outline two applications to marine and aerial systems (sharks and eagles) taking the unsupervised approach, which is more readily applicable to animal activity measured in the field. HMMs were used to infer the effects of temporal, atmospheric and tidal inputs on animal behaviour. Animal accelerometer data allow ecologists to identify important correlates and drivers of animal activity (and hence behaviour). The HMM framework is well suited to deal with the main features commonly observed in accelerometer data. The ability to combine direct observations of animals activity and combine it with statistical models which account for the features of accelerometer data offer a new way to quantify animal behaviour, energetic expenditure and deepen our insights into individual behaviour as a constituent of populations and ecosystems.
△ Less
Submitted 20 February, 2016;
originally announced February 2016.
-
Traffic flow densities in large transport networks
Authors:
Christian Hirsch,
Benedikt Jahnel,
Paul Keeler,
Robert I. A. Patterson
Abstract:
We consider transport networks with nodes scattered at random in a large domain. At certain local rates, the nodes generate traffic flowing according to some navigation scheme in a given direction. In the thermodynamic limit of a growing domain, we present an asymptotic formula expressing the local traffic flow density at any given location in the domain in terms of three fundamental characteristi…
▽ More
We consider transport networks with nodes scattered at random in a large domain. At certain local rates, the nodes generate traffic flowing according to some navigation scheme in a given direction. In the thermodynamic limit of a growing domain, we present an asymptotic formula expressing the local traffic flow density at any given location in the domain in terms of three fundamental characteristics of the underlying network: the spatial intensity of the nodes together with their traffic generation rates, and of the links induced by the navigation. This formula holds for a general class of navigations satisfying a link-density and a sub-ballisticity condition. As a specific example, we verify these conditions for navigations arising from a directed spanning tree on a Poisson point process with inhomogeneous intensity function.
△ Less
Submitted 2 February, 2016;
originally announced February 2016.
-
Kinetic Theory of Cluster Dynamics
Authors:
Robert I. A. Patterson,
Sergio Simonella,
Wolfgang Wagner
Abstract:
In a Newtonian system with localized interactions the whole set of particles is naturally decomposed into dynamical clusters, defined as finite groups of particles having an influence on each other's trajectory during a given interval of time. For an ideal gas with short-range intermolecular force, we provide a description of the cluster size distribution in terms of the reduced Boltzmann density.…
▽ More
In a Newtonian system with localized interactions the whole set of particles is naturally decomposed into dynamical clusters, defined as finite groups of particles having an influence on each other's trajectory during a given interval of time. For an ideal gas with short-range intermolecular force, we provide a description of the cluster size distribution in terms of the reduced Boltzmann density. In the simplified context of Maxwell molecules, we show that a macroscopic fraction of the gas forms a giant component in finite kinetic time. The critical index of this phase transition is in agreement with previous numerical results on the elastic billiard.
△ Less
Submitted 24 June, 2016; v1 submitted 21 January, 2016;
originally announced January 2016.
-
Modelling latent individual heterogeneity in mark-recapture data with Dirichlet process priors
Authors:
Jessica H Ford,
Toby A Patterson,
Mark V Bravington
Abstract:
The natural subgroups often seen in mark-recapture studies and the complexity of real mark-recapture data means that parametric and discrete style models can be insufficient. Non-parametric models avoid these often restrictive assumptions. We consider the non-parametric Dirichlet process for modelling latent individual heterogeneity in probability of observation and the probability of remaining in…
▽ More
The natural subgroups often seen in mark-recapture studies and the complexity of real mark-recapture data means that parametric and discrete style models can be insufficient. Non-parametric models avoid these often restrictive assumptions. We consider the non-parametric Dirichlet process for modelling latent individual heterogeneity in probability of observation and the probability of remaining in or out of a marine sanctuary. Simulation studies demonstrated accurate estimation of multiple groups of latent individual heterogeneity. Simulations were also used to identify the limits of the Dirichlet process. The ability of the Dirichlet process to pick up unimodal heterogeneity was explored in order to avoid potential spurious multimodality. In application to a subset of the data from the North Atlantic humpback whales we were able to estimate annual population-level variation in usage of the marine sanctuary and three measures of individual-level variation. With the Dirichlet process prior we were able to detect multimodality in each parameter.
△ Less
Submitted 22 November, 2015;
originally announced November 2015.
-
Efficient MCMC implementation of multi-state mark-recapture models
Authors:
Jessica H Ford,
Toby A Patterson,
Mark V Bravington
Abstract:
Inherent differences in behaviour of individual animal movement can introduce bias into estimates of population parameters derived from mark-recapture data. Additionally, quantifying individual heterogeneity is of considerable interest in it's own right as numerous studies have shown how heterogeneity can drive population dynamics. In this paper we incorporate multiple measures of individual heter…
▽ More
Inherent differences in behaviour of individual animal movement can introduce bias into estimates of population parameters derived from mark-recapture data. Additionally, quantifying individual heterogeneity is of considerable interest in it's own right as numerous studies have shown how heterogeneity can drive population dynamics. In this paper we incorporate multiple measures of individual heterogeneity into a multi-state mark-recapture model, using a Beta-Binomial Gibbs sampler using MCMC estimation. We also present a novel Independent Metropolis-Hastings sampler which allows for efficient updating of the hyper-parameters which cannot be updated using Gibbs sampling. We tested the model using simulation studies and applied the model to mark-resight data of North Atlantic humpback whales observed in the Stellwagen Bank National Marine Sanctuary where heterogeneity is present in both sighting probability and site preference. Simulation studies show asymptotic convergence of the posterior distribution for each of the hyper-parameters to true parameter values. In application to humpback whales individual heterogeneity is evident in sighting probability and propensity to use the marine sanctuary.
△ Less
Submitted 22 November, 2015;
originally announced November 2015.