Search | arXiv e-print repository

Spatial particle processes with coagulation: Gibbs-measure approach, gelation and Smoluchowski equation

Authors: Luisa Andreis, Wolfgang König, Heide Langhammer, Robert I. A. Patterson

Abstract: We study a spatial Markovian particle system with pairwise coagulation, a spatial version of the Marcus--Lushnikov process: according to a coagulation kernel $K$, particle pairs merge into a single particle, and their masses are united. We introduce a statistical-mechanics approach to the study of this process. We derive an explicit formula for the empirical process of the particle configuration a… ▽ More We study a spatial Markovian particle system with pairwise coagulation, a spatial version of the Marcus--Lushnikov process: according to a coagulation kernel $K$, particle pairs merge into a single particle, and their masses are united. We introduce a statistical-mechanics approach to the study of this process. We derive an explicit formula for the empirical process of the particle configuration at a given fixed time $T$ in terms of a reference Poisson point process, whose points are trajectories that coagulate into one particle by time $T$. The non-coagulation between any two of them induces an exponential pair-interaction, which turns the description into a many-body system with a Gibbsian pair-interaction. Based on this, we first give a large-deviation principle for the joint distribution of the particle histories (conditioning on an upper bound for particle sizes), in the limit as the number $N$ of initial atoms diverges and the kernel scales as $\frac 1N K$. We characterise the minimiser(s) of the rate function, we give criteria for its uniqueness and prove a law of large numbers (unconditioned). Furthermore, we use the unique minimiser to construct a solution of the Smoluchowski equation and give a criterion for the occurrence of a gelation phase transition. △ Less

Submitted 12 January, 2024; originally announced January 2024.

Comments: 60 pages, 1 figure

MSC Class: 82C22; 60J25; 60F10; 60G55; 60K35; 35Q70

arXiv:2312.16225 [pdf, other]

Simulating Pedestrian Avoidance: The Humans vs Zombies Scenario

Authors: Juan P. Oriana, German A. Patterson, Daniel R. Parisi

Abstract: This study introduces a unique active matter system as an application of the pedestrian collision avoidance paradigm, that proposes dynamically adjusting the desired velocity. We present a fictitious human-zombie scenario set within a closed geometry, combining prey-predator behavior with a one-way contagion process that can transform prey into predators. The system demonstrates varied responses,… ▽ More This study introduces a unique active matter system as an application of the pedestrian collision avoidance paradigm, that proposes dynamically adjusting the desired velocity. We present a fictitious human-zombie scenario set within a closed geometry, combining prey-predator behavior with a one-way contagion process that can transform prey into predators. The system demonstrates varied responses, in cases where agents have the same maximum speeds, a single zombie always catches a human, whereas two zombies never catch a single human. As the number of human agents increases, observables, such as the final fraction of zombie agents and total conversion times, exhibit a significant change in the system's behavior at intermediate density values. Most notably, there is evidence of a first-order phase transition when the mean population speed is analyzed as an order parameter. △ Less

Submitted 24 December, 2023; originally announced December 2023.

arXiv:2312.02355 [pdf, other]

When is Offline Policy Selection Sample Efficient for Reinforcement Learning?

Authors: Vincent Liu, Prabhat Nagarajan, Andrew Patterson, Martha White

Abstract: Offline reinforcement learning algorithms often require careful hyperparameter tuning. Consequently, before deployment, we need to select amongst a set of candidate policies. As yet, however, there is little understanding about the fundamental limits of this offline policy selection (OPS) problem. In this work we aim to provide clarity on when sample efficient OPS is possible, primarily by connect… ▽ More Offline reinforcement learning algorithms often require careful hyperparameter tuning. Consequently, before deployment, we need to select amongst a set of candidate policies. As yet, however, there is little understanding about the fundamental limits of this offline policy selection (OPS) problem. In this work we aim to provide clarity on when sample efficient OPS is possible, primarily by connecting OPS to off-policy policy evaluation (OPE) and Bellman error (BE) estimation. We first show a hardness result, that in the worst case, OPS is just as hard as OPE, by proving a reduction of OPE to OPS. As a result, no OPS method can be more sample efficient than OPE in the worst case. We then propose a BE method for OPS, called Identifiable BE Selection (IBES), that has a straightforward method for selecting its own hyperparameters. We highlight that using IBES for OPS generally has more requirements than OPE methods, but if satisfied, can be more sample efficient. We conclude with an empirical study comparing OPE and IBES, and by showing the difficulty of OPS on an offline Atari benchmark dataset. △ Less

Submitted 4 December, 2023; originally announced December 2023.

arXiv:2311.14847 [pdf, other]

Experimental and numerical study of a second-order transition in the behavior of confined self-propelled particles

Authors: E. Barone, G. A. Patterson

Abstract: In this study, we conduct experimental investigations on the behavior of confined self-propelled particles within a circular arena, employing small commercial robots capable of locomotion, communication, and information processing. These robots execute circular trajectories, which can be clockwise or counterclockwise, based on two internal states. Using a majority-based stochastic decision algorit… ▽ More In this study, we conduct experimental investigations on the behavior of confined self-propelled particles within a circular arena, employing small commercial robots capable of locomotion, communication, and information processing. These robots execute circular trajectories, which can be clockwise or counterclockwise, based on two internal states. Using a majority-based stochastic decision algorithm, each robot can reverse its direction based on the states of two neighboring robots. By manipulating a control parameter governing the interaction, the system exhibits a transition-from a state where all robots rotate randomly to one where they rotate uniformly in the same direction. Moreover, this transition significantly impacts the trajectories of the robots. To extend our findings to larger systems, we introduce a mathematical model enabling characterization of the order transition type and the resulting trajectories. Our results reveal a second-order transition from active Brownian to chiral motion. Lastly, we analyze the particle density within the arena, examining how it varies concerning system size and the control parameter. △ Less

Submitted 24 November, 2023; originally announced November 2023.

Comments: 16 pages, 7 figures

arXiv:2311.01211 [pdf, other]

Fundamental diagram of vibration-driven vehicles

Authors: German A. Patterson, Daniel R. Parisi

Abstract: In this study, we conducted experimental investigations into the fundamental diagram of vibration-driven vehicles (VDV) in a one-dimensional array. As these mechanical agents interact solely through collisions, their mean speed remains nearly constant at low and medium densities. However, there is a reduction of between 25% and 40% when the lineal density approaches the inverse of the contact dist… ▽ More In this study, we conducted experimental investigations into the fundamental diagram of vibration-driven vehicles (VDV) in a one-dimensional array. As these mechanical agents interact solely through collisions, their mean speed remains nearly constant at low and medium densities. However, there is a reduction of between 25% and 40% when the lineal density approaches the inverse of the contact distance. Remarkably, in this one-dimensional system, the outcome is significantly influenced by the order in which agents, sorted by their free speeds, are gradually introduced into the experiment. While a significant speed difference is observed at low and medium densities based on this ordering, both curves eventually converge to the same speed at maximum density. Moreover, the attained speed in saturated systems is slower than the speed of the slowest agent. △ Less

Submitted 2 November, 2023; originally announced November 2023.

Comments: 8 pages, 5 figures

arXiv:2305.05687 [pdf, other]

doi 10.3847/1538-4357/accc89

Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies

Authors: James Paul Mason, Alexandra Werth, Colin G. West, Allison A. Youngblood, Donald L. Woodraska, Courtney Peck, Kevin Lacjak, Florian G. Frick, Moutamen Gabir, Reema A. Alsinan, Thomas Jacobsen, Mohammad Alrubaie, Kayla M. Chizmar, Benjamin P. Lau, Lizbeth Montoya Dominguez, David Price, Dylan R. Butler, Connor J. Biron, Nikita Feoktistov, Kai Dewey, N. E. Loomis, Michal Bodzianowski, Connor Kuybus, Henry Dietrick, Aubrey M. Wolfe , et al. (977 additional authors not shown)

Abstract: Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th… ▽ More Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms that could explain it: nanoflares or Alfvén waves. To date, neither can be directly observed. Nanoflares are, by definition, extremely small, but their aggregate energy release could represent a substantial heating mechanism, presuming they are sufficiently abundant. One way to test this presumption is via the flare frequency distribution, which describes how often flares of various energies occur. If the slope of the power law fitting the flare frequency distribution is above a critical threshold, $α=2$ as established in prior literature, then there should be a sufficient abundance of nanoflares to explain coronal heating. We performed $>$600 case studies of solar flares, made possible by an unprecedented number of data analysts via three semesters of an undergraduate physics laboratory course. This allowed us to include two crucial, but nontrivial, analysis methods: pre-flare baseline subtraction and computation of the flare energy, which requires determining flare start and stop times. We aggregated the results of these analyses into a statistical study to determine that $α= 1.63 \pm 0.03$. This is below the critical threshold, suggesting that Alfvén waves are an important driver of coronal heating. △ Less

Submitted 9 May, 2023; originally announced May 2023.

Comments: 1,002 authors, 14 pages, 4 figures, 3 tables, published by The Astrophysical Journal on 2023-05-09, volume 948, page 71

arXiv:2304.01315 [pdf, other]

Empirical Design in Reinforcement Learning

Authors: Andrew Patterson, Samuel Neumann, Martha White, Adam White

Abstract: Empirical design in reinforcement learning is no small task. Running good experiments requires attention to detail and at times significant computational resources. While compute resources available per dollar have continued to grow rapidly, so have the scale of typical experiments in reinforcement learning. It is now common to benchmark agents with millions of parameters against dozens of tasks,… ▽ More Empirical design in reinforcement learning is no small task. Running good experiments requires attention to detail and at times significant computational resources. While compute resources available per dollar have continued to grow rapidly, so have the scale of typical experiments in reinforcement learning. It is now common to benchmark agents with millions of parameters against dozens of tasks, each using the equivalent of 30 days of experience. The scale of these experiments often conflict with the need for proper statistical evidence, especially when comparing algorithms. Recent studies have highlighted how popular algorithms are sensitive to hyper-parameter settings and implementation details, and that common empirical practice leads to weak statistical evidence (Machado et al., 2018; Henderson et al., 2018). Here we take this one step further. This manuscript represents both a call to action, and a comprehensive resource for how to do good experiments in reinforcement learning. In particular, we cover: the statistical assumptions underlying common performance measures, how to properly characterize performance variation and stability, hypothesis testing, special considerations for comparing multiple agents, baseline and illustrative example construction, and how to deal with hyper-parameters and experimenter bias. Throughout we highlight common mistakes found in the literature and the statistical consequences of those in example experiments. The objective of this document is to provide answers on how we can use our unprecedented compute to do good science in reinforcement learning, as well as stay alert to potential pitfalls in our empirical design. △ Less

Submitted 3 April, 2023; originally announced April 2023.

Comments: In submission to JMLR

arXiv:2303.17741 [pdf, other]

Development and Demonstration of an Efficient Readout Error Mitigation Technique for use in NISQ Algorithms

Authors: Andrew Arrasmith, Andrew Patterson, Alice Boughton, Marco Paini

Abstract: The approximate state estimation and the closely related classical shadows methods allow for the estimation of complicated observables with relatively few shots. As these methods make use of random measurements that can symmetrise the effect of readout errors, they have been shown to permit simplified approaches to readout error mitigation which require only a number of samples that scales as… ▽ More The approximate state estimation and the closely related classical shadows methods allow for the estimation of complicated observables with relatively few shots. As these methods make use of random measurements that can symmetrise the effect of readout errors, they have been shown to permit simplified approaches to readout error mitigation which require only a number of samples that scales as $\mathcal{O}(1)$ with increasing numbers of qubits. However, these techniques require executing a different circuit at each shot, adding a typically prohibitive amount of latency that prohibits their practical application. In this manuscript we consider the approximate state estimation of readout-mitigated expectation values, and how to best implement that procedure on the Rigetti quantum computing hardware. We discuss the theoretical aspects involved, providing an explicit computation of the effect of readout error on the estimated expectation values and how to mitigate that effect. Leveraging improvements to the Rigetti control systems, we then demonstrate an efficient implementation of this approach. Not only do we find that we can suppress the effect of correlated errors and accurately mitigate the readout errors, we find that we can do so quickly, collecting and processing $10^6$ samples in less than $1.5$ minutes. This development opens the way for practical uses of methods with this type of randomisation. △ Less

Submitted 20 April, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

Comments: 19 pages, 3 figures, v2 has minor typo corrections

arXiv:2205.08464 [pdf, other]

Robust Losses for Learning Value Functions

Authors: Andrew Patterson, Victor Liao, Martha White

Abstract: Most value function learning algorithms in reinforcement learning are based on the mean squared (projected) Bellman error. However, squared errors are known to be sensitive to outliers, both skewing the solution of the objective and resulting in high-magnitude and high-variance gradients. To control these high-magnitude updates, typical strategies in RL involve clip** gradients, clip** rewards… ▽ More Most value function learning algorithms in reinforcement learning are based on the mean squared (projected) Bellman error. However, squared errors are known to be sensitive to outliers, both skewing the solution of the objective and resulting in high-magnitude and high-variance gradients. To control these high-magnitude updates, typical strategies in RL involve clip** gradients, clip** rewards, rescaling rewards, or clip** errors. While these strategies appear to be related to robust losses -- like the Huber loss -- they are built on semi-gradient update rules which do not minimize a known loss. In this work, we build on recent insights reformulating squared Bellman errors as a saddlepoint optimization problem and propose a saddlepoint reformulation for a Huber Bellman error and Absolute Bellman error. We start from a formalization of robust losses, then derive sound gradient-based approaches to minimize these losses in both the online off-policy prediction and control settings. We characterize the solutions of the robust losses, providing insight into the problem settings where the robust losses define notably better solutions than the mean squared Bellman error. Finally, we show that the resulting gradient-based algorithms are more stable, for both prediction and control, with less sensitivity to meta-parameters. △ Less

Submitted 17 April, 2023; v1 submitted 17 May, 2022; originally announced May 2022.

Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (2022)

arXiv:2202.02396 [pdf, other]

A Temporal-Difference Approach to Policy Gradient Estimation

Authors: Samuele Tosatto, Andrew Patterson, Martha White, A. Rupam Mahmood

Abstract: The policy gradient theorem (Sutton et al., 2000) prescribes the usage of a cumulative discounted state distribution under the target policy to approximate the gradient. Most algorithms based on this theorem, in practice, break this assumption, introducing a distribution shift that can cause the convergence to poor solutions. In this paper, we propose a new approach of reconstructing the policy gr… ▽ More The policy gradient theorem (Sutton et al., 2000) prescribes the usage of a cumulative discounted state distribution under the target policy to approximate the gradient. Most algorithms based on this theorem, in practice, break this assumption, introducing a distribution shift that can cause the convergence to poor solutions. In this paper, we propose a new approach of reconstructing the policy gradient from the start state without requiring a particular sampling strategy. The policy gradient calculation in this form can be simplified in terms of a gradient critic, which can be recursively estimated due to a new Bellman equation of gradients. By using temporal-difference updates of the gradient critic from an off-policy data stream, we develop the first estimator that sidesteps the distribution shift issue in a model-free way. We prove that, under certain realizability conditions, our estimator is unbiased regardless of the sampling strategy. We empirically show that our technique achieves a superior bias-variance trade-off and performance in presence of off-policy samples. △ Less

Submitted 7 July, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

arXiv:2112.11907 [pdf, other]

doi 10.1088/1367-2630/ac43ec

Spontaneous trail formation in populations of auto-chemotactic walkers

Authors: Zahra Mokhtari, Robert I. A. Patterson, Felix Höfling

Abstract: We study the formation of trails in populations of self-propelled agents that make oriented deposits of pheromones and also sense such deposits to which they then respond with gradual changes of their direction of motion. Based on extensive off-lattice computer simulations aiming at the scale of insects, e.g., ants, we identify a number of emerging stationary patterns and obtain qualitatively the… ▽ More We study the formation of trails in populations of self-propelled agents that make oriented deposits of pheromones and also sense such deposits to which they then respond with gradual changes of their direction of motion. Based on extensive off-lattice computer simulations aiming at the scale of insects, e.g., ants, we identify a number of emerging stationary patterns and obtain qualitatively the non-equilibrium state diagram of the model, spanned by the strength of the agent--pheromone interaction and the number density of the population. In particular, we demonstrate the spontaneous formation of persistent, macroscopic trails, and highlight some behaviour that is consistent with a dynamic phase transition. This includes a characterisation of the mass of system-spanning trails as a potential order parameter. We also propose a dynamic model for a few macroscopic observables, including the sub-population size of trail-following agents, which captures the early phase of trail formation. △ Less

Submitted 8 December, 2021; originally announced December 2021.

Journal ref: New J. Phys. 24, 013012 (2022)

arXiv:2111.13200 [pdf]

doi 10.1007/s00440-022-01180-7

A large-deviations principle for all the components in a sparse inhomogeneous random graph

Authors: Luisa Andreis, Wolfgang König, Heide Langhammer, Robert I. A. Patterson

Abstract: We study an inhomogeneous sparse random graph on [N] = {1, . . . , N } as introduced in a seminal paper by Bollobas, Janson and Riordan (2007): vertices have a type (here in a compact metric space S), and edges between different vertices occur randomly and independently over all vertex pairs, with a probability depending on the two vertex types. In the limit N to infinity, we consider the sparse r… ▽ More We study an inhomogeneous sparse random graph on [N] = {1, . . . , N } as introduced in a seminal paper by Bollobas, Janson and Riordan (2007): vertices have a type (here in a compact metric space S), and edges between different vertices occur randomly and independently over all vertex pairs, with a probability depending on the two vertex types. In the limit N to infinity, we consider the sparse regime, where the average degree is O(1). We prove a large-deviations principle with explicit rate function for the statistics of the collection of all the connected components, registered according to their vertex type sets, and distinguished according to being microscopic (of finite size) or macroscopic (of size proportional to N). In doing so, we derive explicit logarithmic asymptotics for the probability that GN is connected. We present a full analysis of the rate function including its minimizers. From this analysis we deduce a number of limit laws, conditional and unconditional, which provide comprehensive information about all the microscopic and macroscopic components of the graph. In particular, we recover the criterion for the existence of the phase transition given in [5]. △ Less

Submitted 17 August, 2023; v1 submitted 25 November, 2021; originally announced November 2021.

Journal ref: Probability Theory and Related Fields (2023) 186:521-620

arXiv:2106.02957 [pdf, ps, other]

doi 10.2140/involve.2022.15.775

A local normal form for Hamiltonian actions of compact semisimple Poisson-Lie groups

Authors: Megumi Harada, Jeremy Lane, Aidan Patterson

Abstract: The main contribution of this manuscript is a local normal form for Hamiltonian actions of Poisson-Lie groups $K$ on a symplectic manifold equipped with an $AN$-valued moment map, where $AN$ is the dual Poisson-Lie group of $K$. Our proof uses the delinearization theorem of Alekseev which relates a classical Hamiltonian action of $K$ with $\mathfrak{k}^*$-valued moment map to a Hamiltonian action… ▽ More The main contribution of this manuscript is a local normal form for Hamiltonian actions of Poisson-Lie groups $K$ on a symplectic manifold equipped with an $AN$-valued moment map, where $AN$ is the dual Poisson-Lie group of $K$. Our proof uses the delinearization theorem of Alekseev which relates a classical Hamiltonian action of $K$ with $\mathfrak{k}^*$-valued moment map to a Hamiltonian action with an $AN$-valued moment map, via a deformation of symplectic structures. We obtain our main result by proving a ``delinearization commutes with symplectic quotients'' theorem which is also of independent interest, and then putting this together with the local normal form theorem for classical Hamiltonian actions wtih $\mathfrak{k}^*$-valued moment maps. A key ingredient for our main result is the delinearization $\mathcal{D}(ω_{can})$ of the canonical symplectic structure on $T^*K$, so we additionally take some steps toward explicit computations of $\mathcal{D}(ω_{can})$. In particular, in the case $K=SU(2)$, we obtain explicit formulas for the matrix coefficients of $\mathcal{D}(ω_{can})$ with respect to a natural choice of coordinates on $T^*SU(2)$. △ Less

Submitted 5 June, 2021; originally announced June 2021.

Comments: 23 pages

MSC Class: Primary: 53D20; Secondary: 53D17

Journal ref: Involve 15 (2022) 775-812

arXiv:2104.13844 [pdf, other]

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning

Authors: Andrew Patterson, Adam White, Martha White

Abstract: Many reinforcement learning algorithms rely on value estimation, however, the most widely used algorithms -- namely temporal difference algorithms -- can diverge under both off-policy sampling and nonlinear function approximation. Many algorithms have been developed for off-policy value estimation based on the linear mean squared projected Bellman error (MSPBE) and are sound under linear function… ▽ More Many reinforcement learning algorithms rely on value estimation, however, the most widely used algorithms -- namely temporal difference algorithms -- can diverge under both off-policy sampling and nonlinear function approximation. Many algorithms have been developed for off-policy value estimation based on the linear mean squared projected Bellman error (MSPBE) and are sound under linear function approximation. Extending these methods to the nonlinear case has been largely unsuccessful. Recently, several methods have been introduced that approximate a different objective -- the mean-squared Bellman error (MSBE) -- which naturally facilitate nonlinear approximation. In this work, we build on these insights and introduce a new generalized MSPBE that extends the linear MSPBE to the nonlinear setting. We show how this generalized objective unifies previous work and obtain new bounds for the value error of the solutions of the generalized objective. We derive an easy-to-use, but sound, algorithm to minimize the generalized objective, and show that it is more stable across runs, is less sensitive to hyperparameters, and performs favorably across four control domains with neural network function approximation. △ Less

Submitted 28 March, 2022; v1 submitted 28 April, 2021; originally announced April 2021.

Comments: Accepted for publication in JMLR 2022

arXiv:2103.14384 [pdf, ps, other]

Variational structures beyond gradient flows: a macroscopic fluctuation-theory perspective

Authors: Robert I. A. Patterson, D. R. Michiel Renger, Upanshu Sharma

Abstract: Macroscopic equations arising out of stochastic particle systems in detailed balance (called dissipative systems or gradient flows) have a natural variational structure, which can be derived from the large-deviation rate functional for the density of the particle system. While large deviations can be studied in considerable generality, these variational structures are often restricted to systems i… ▽ More Macroscopic equations arising out of stochastic particle systems in detailed balance (called dissipative systems or gradient flows) have a natural variational structure, which can be derived from the large-deviation rate functional for the density of the particle system. While large deviations can be studied in considerable generality, these variational structures are often restricted to systems in detailed balance. Using insights from macroscopic fluctuation theory, in this work we aim to generalise this variational connection beyond dissipative systems by augmenting densities with fluxes, which encode non-dissipative effects. Our main contribution is an abstract framework, which for a given flux-density cost and a quasipotential, provides a decomposition into dissipative and non-dissipative components and a generalised orthogonality relation between them. We then apply this abstract theory to various stochastic particle systems -- independent copies of jump processes, zero-range processes, chemical-reaction networks in complex balance and lattice-gas models. △ Less

Submitted 3 October, 2023; v1 submitted 26 March, 2021; originally announced March 2021.

MSC Class: 35Q82; 35Q84; 49S05; 49J40; 60F10; 82C22; 82C35

arXiv:2102.13040 [pdf, other]

Large deviations for Markov jump processes with uniformly diminishing rates

Authors: Andrea Agazzi, Luisa Andreis, Robert I. A. Patterson, D. R. Michiel Renger

Abstract: We prove a large-deviation principle (LDP) for the sample paths of jump Markov processes in the small noise limit when, possibly, all the jump rates vanish uniformly, but slowly enough, in a region of the state space. We further discuss the optimality of our assumptions on the decay of the jump rates. As a direct application of this work we relax the assumptions needed for the application of LDPs… ▽ More We prove a large-deviation principle (LDP) for the sample paths of jump Markov processes in the small noise limit when, possibly, all the jump rates vanish uniformly, but slowly enough, in a region of the state space. We further discuss the optimality of our assumptions on the decay of the jump rates. As a direct application of this work we relax the assumptions needed for the application of LDPs to, e.g., Chemical Reaction Network dynamics, where vanishing reaction rates arise naturally particularly the context of mass action kinetics. △ Less

Submitted 25 February, 2021; originally announced February 2021.

Comments: 19 pages, 2 figures

MSC Class: 60F10; 60J75; 80A30

arXiv:2009.04019 [pdf, other]

Social Distance Characterization by means of Pedestrian Simulation

Authors: Daniel R. Parisi, Germán A. Patterson, Lucio Pagni, Agustina Osimani, Tomas Bacigalupo, Juan Godfrid, Federico M. Bergagna, Manuel Rodriguez Brizi, Pedro Momesso, Fermin L. Gomez, Jimena Lozano, Juan Martin Baader, Ignacio Ribas, Facundo P. Astiz Meyer, Miguel Di Luca, Nicolás E. Barrera, Ezequiel M. Keimel Álvarez, Maite M. Herran Oyhanarte, Pedro R. **arilho, Ximena Zuberbuhler, Felipe Gorostiaga

Abstract: In the present work, we study how the number of simulated clients (occupancy) affects the social distance in an ideal supermarket. For this, we account for realistic typical dimensions and process time (picking products and checkout). From the simulated trajectories, we measure events of social distance less than 2 m and its duration. Between other observables, we define a social distance coeffici… ▽ More In the present work, we study how the number of simulated clients (occupancy) affects the social distance in an ideal supermarket. For this, we account for realistic typical dimensions and process time (picking products and checkout). From the simulated trajectories, we measure events of social distance less than 2 m and its duration. Between other observables, we define a social distance coefficient that informs how many events (of a given duration) suffer each agent in the system. These kinds of outputs could be useful for building procedures and protocols in the context of a pandemic allowing to keep low health risks while setting a maximum operating capacity. △ Less

Submitted 8 September, 2020; originally announced September 2020.

Comments: 12 pages, 9 figures, to be submitted to Scientific Reports

arXiv:2009.03864 [pdf, other]

Contraction $\mathcal{L}_1$-Adaptive Control using Gaussian Processes

Authors: Aditya Gahlawat, Arun Lakshmanan, Lin Song, Andrew Patterson, Zhuohuan Wu, Naira Hovakimyan, Evangelos Theodorou

Abstract: We present $\mathcal{CL}_1$-$\mathcal{GP}$, a control framework that enables safe simultaneous learning and control for systems subject to uncertainties. The two main constituents are contraction theory-based $\mathcal{L}_1$ ($\mathcal{CL}_1$) control and Bayesian learning in the form of Gaussian process (GP) regression. The $\mathcal{CL}_1$ controller ensures that control objectives are met while… ▽ More We present $\mathcal{CL}_1$-$\mathcal{GP}$, a control framework that enables safe simultaneous learning and control for systems subject to uncertainties. The two main constituents are contraction theory-based $\mathcal{L}_1$ ($\mathcal{CL}_1$) control and Bayesian learning in the form of Gaussian process (GP) regression. The $\mathcal{CL}_1$ controller ensures that control objectives are met while providing safety certificates. Furthermore, $\mathcal{CL}_1$-$\mathcal{GP}$ incorporates any available data into a GP model of uncertainties, which improves performance and enables the motion planner to achieve optimality safely. This way, the safe operation of the system is always guaranteed, even during the learning transients. We provide a few illustrative examples for the safe learning and control of planar quadrotor systems in a variety of environments. △ Less

Submitted 30 November, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

Comments: Submitted to Learning for Dynamics and Control (L4DC) Conference, 2021

arXiv:2007.11527 [pdf, other]

Optical Hemodynamic Imaging of Jugular Venous Dynamics During Altered Central Venous Pressure

Authors: Robert Amelard, Andrew D Robertson, Courtney A Patterson, Hannah Heigold, Essi Saarikoski, Richard L Hughson

Abstract: An optical imaging system is proposed for quantitatively assessing jugular venous response to altered central venous pressure. The proposed system assesses sub-surface optical absorption changes from jugular venous waveforms with a spatial calibration procedure to normalize incident tissue illumination. Widefield frames of the right lateral neck were captured and calibrated using a novel flexible… ▽ More An optical imaging system is proposed for quantitatively assessing jugular venous response to altered central venous pressure. The proposed system assesses sub-surface optical absorption changes from jugular venous waveforms with a spatial calibration procedure to normalize incident tissue illumination. Widefield frames of the right lateral neck were captured and calibrated using a novel flexible surface calibration method. A hemodynamic optical model was derived to quantify jugular venous optical attenuation (JVA) signals, and generate a spatial jugular venous pulsatility map. JVA was assessed in three cardiovascular protocols that altered central venous pressure: acute central hypovolemia (lower body negative pressure), venous congestion (head-down tilt), and impaired cardiac filling (Valsalva maneuver). JVA waveforms exhibited biphasic wave properties consistent with jugular venous pulse dynamics when time-aligned with an electrocardiogram. JVA correlated strongly (median, interquartile range) with invasive central venous pressure during graded central hypovolemia (r=0.85, [0.72, 0.95]), graded venous congestion (r=0.94, [0.84, 0.99]), and impaired cardiac filling (r=0.94, [0.85, 0.99]). Reduced JVA during graded acute hypovolemia was strongly correlated with reductions in stroke volume (SV) (r=0.85, [0.76, 0.92]) from baseline (SV: 79$\pm$15 mL, JVA: 0.56$\pm$0.10 a.u.) to -40 mmHg suction (SV: 59$\pm$18 mL, JVA: 0.47$\pm$0.05 a.u.; p$<$0.01). The proposed non-contact optical imaging system demonstrated jugular venous dynamics consistent with invasive central venous monitoring during three protocols that altered central venous pressure. This system provides non-invasive monitoring of pressure-induced jugular venous dynamics in clinically relevant conditions where catheterization is traditionally required, enabling monitoring in non-surgical environments. △ Less

Submitted 24 March, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

arXiv:2007.00611 [pdf, other]

Gradient Temporal-Difference Learning with Regularized Corrections

Authors: Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, Martha White

Abstract: It is still common to use Q-learning and temporal difference (TD) learning-even though they have divergence issues and sound Gradient TD alternatives exist-because divergence seems rare and they typically perform well. However, recent work with large neural network learning systems reveals that instability is more common than previously thought. Practitioners face a difficult dilemma: choose an ea… ▽ More It is still common to use Q-learning and temporal difference (TD) learning-even though they have divergence issues and sound Gradient TD alternatives exist-because divergence seems rare and they typically perform well. However, recent work with large neural network learning systems reveals that instability is more common than previously thought. Practitioners face a difficult dilemma: choose an easy to use and performant TD method, or a more complex algorithm that is more sound but harder to tune and all but unexplored with non-linear function approximation or control. In this paper, we introduce a new method called TD with Regularized Corrections (TDRC), that attempts to balance ease of use, soundness, and performance. It behaves as well as TD, when TD performs well, but is sound in cases where TD diverges. We empirically investigate TDRC across a range of problems, for both prediction and control, and for both linear and non-linear function approximation, and show, potentially for the first time, that gradient TD methods could be a better alternative to TD and Q-learning. △ Less

Submitted 17 September, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

Comments: Appeared in Proceedings of the 37th International Conference on Machine Learning (ICML2020)

arXiv:2005.00401 [pdf, other]

doi 10.1186/s40462-020-00217-7

A continuous-time state-space model for rapid quality-control of Argos locations from animal-borne tags

Authors: Ian D. Jonsen, Toby A. Patterson, Daniel P. Costa, Philip D. Doherty, Brendan J. Godley, W. James Grecian, Christophe Guinet, Xavier Hoenner, Sarah S. Kienle, Patrick W. Robison, Stephen C. Votier, Matthew J. Witt, Mark A. Hindell, Robert G. Harcourt, Clive R. McMahon

Abstract: State-space models are important tools for quality control of error-prone animal movement data. The near real-time (within 24 h) capability of the Argos satellite system aids dynamic ocean management of human activities by informing when animals enter intensive use zones. This capability also facilitates use of ocean observations from animal-borne sensors in operational ocean forecasting models. S… ▽ More State-space models are important tools for quality control of error-prone animal movement data. The near real-time (within 24 h) capability of the Argos satellite system aids dynamic ocean management of human activities by informing when animals enter intensive use zones. This capability also facilitates use of ocean observations from animal-borne sensors in operational ocean forecasting models. Such near real-time data provision requires rapid, reliable quality control to deal with error-prone Argos locations. We formulate a continuous-time state-space model for the three types of Argos location data (Least-Squares, Kalman filter, and Kalman smoother), accounting for irregular timing of observations. Our model is deliberately simple to ensure speed and reliability for automated, near real-time quality control of Argos data. We validate the model by fitting to Argos data collected from 61 individuals across 7 marine vertebrates and compare model-estimated locations to GPS locations. Estimation accuracy varied among species with median Root Mean Squared Errors usually < 5 km and decreased with increasing data sampling rate and precision of Argos locations. Including a model parameter to inflate Argos error ellipse sizes resulted in more accurate location estimates. In some cases, the model appreciably improved the accuracy of the Argos Kalman smoother locations, which should not be possible if the smoother uses all available information. Our model provides quality-controlled locations from Argos Least-Squares or Kalman filter data with slightly better accuracy than Argos Kalman smoother data that are only available via reprocessing. Simplicity and ease of use make the model suitable both for automated quality control of near real-time Argos data and for manual use by researchers working with historical Argos data. △ Less

Submitted 1 May, 2020; originally announced May 2020.

Comments: 25 pages, 10 figures

Journal ref: Mov Ecol 8, 31 (2020)

arXiv:2004.14594 [pdf, ps, other]

$\mathcal{L}_1$-$\mathcal{GP}$: $\mathcal{L}_1$ Adaptive Control with Bayesian Learning

Authors: Aditya Gahlawat, Pan Zhao, Andrew Patterson, Naira Hovakimyan, Evangelos A. Theodorou

Abstract: We present $\mathcal{L}_1$-$\mathcal{GP}$, an architecture based on $\mathcal{L}_1$ adaptive control and Gaussian Process Regression (GPR) for safe simultaneous control and learning. On one hand, the $\mathcal{L}_1$ adaptive control provides stability and transient performance guarantees, which allows for GPR to efficiently and safely learn the uncertain dynamics. On the other hand, the learned dy… ▽ More We present $\mathcal{L}_1$-$\mathcal{GP}$, an architecture based on $\mathcal{L}_1$ adaptive control and Gaussian Process Regression (GPR) for safe simultaneous control and learning. On one hand, the $\mathcal{L}_1$ adaptive control provides stability and transient performance guarantees, which allows for GPR to efficiently and safely learn the uncertain dynamics. On the other hand, the learned dynamics can be conveniently incorporated into the $\mathcal{L}_1$ control architecture without sacrificing robustness and tracking performance. Subsequently, the learned dynamics can lead to less conservative designs for performance/robustness tradeoff. We illustrate the efficacy of the proposed architecture via numerical simulations. △ Less

Submitted 30 April, 2020; originally announced April 2020.

arXiv:2002.10497 [pdf, other]

doi 10.1111/ele.13610

Uncovering ecological state dynamics with hidden Markov models

Authors: Brett T. McClintock, Roland Langrock, Olivier Gimenez, Emmanuelle Cam, David L. Borchers, Richard Glennie, Toby A. Patterson

Abstract: Ecological systems can often be characterised by changes among a finite set of underlying states pertaining to individuals, populations, communities, or entire ecosystems through time. Owing to the inherent difficulty of empirical field studies, ecological state dynamics operating at any level of this hierarchy can often be unobservable or "hidden". Ecologists must therefore often contend with inc… ▽ More Ecological systems can often be characterised by changes among a finite set of underlying states pertaining to individuals, populations, communities, or entire ecosystems through time. Owing to the inherent difficulty of empirical field studies, ecological state dynamics operating at any level of this hierarchy can often be unobservable or "hidden". Ecologists must therefore often contend with incomplete or indirect observations that are somehow related to these underlying processes. By formally disentangling state and observation processes based on simple yet powerful mathematical properties that can be used to describe many ecological phenomena, hidden Markov models (HMMs) can facilitate inferences about complex system state dynamics that might otherwise be intractable. However, while HMMs are routinely applied in other disciplines, they have only recently begun to gain traction within the broader ecological community. We provide a gentle introduction to HMMs, establish some common terminology, and review the immense scope of HMMs for applied ecological research. We also provide a supplemental tutorial on some of the more technical aspects of HMM implementation and interpretation. By illustrating how practitioners can use a simple conceptual template to customise HMMs for their specific systems of interest, revealing methodological links between existing applications, and highlighting some practical considerations and limitations of these approaches, our goal is to help establish HMMs as a fundamental inferential tool for ecologists. △ Less

Submitted 14 July, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

arXiv:2002.01965 [pdf, other]

Learning Probabilistic Intersection Traffic Models for Trajectory Prediction

Authors: Andrew Patterson, Aditya Gahlawat, Naira Hovakimyan

Abstract: Autonomous agents must be able to safely interact with other vehicles to integrate into urban environments. The safety of these agents is dependent on their ability to predict collisions with other vehicles' future trajectories for replanning and collision avoidance. The information needed to predict collisions can be learned from previously observed vehicle trajectories in a specific environment,… ▽ More Autonomous agents must be able to safely interact with other vehicles to integrate into urban environments. The safety of these agents is dependent on their ability to predict collisions with other vehicles' future trajectories for replanning and collision avoidance. The information needed to predict collisions can be learned from previously observed vehicle trajectories in a specific environment, generating a traffic model. The learned traffic model can then be incorporated as prior knowledge into any trajectory estimation method being used in this environment. This work presents a Gaussian process based probabilistic traffic model that is used to quantify vehicle behaviors in an intersection. The Gaussian process model provides estimates for the average vehicle trajectory, while also capturing the variance between the different paths a vehicle may take in the intersection. The method is demonstrated on a set of time-series position trajectories. These trajectories are reconstructed by removing object recognition errors and missed frames that may occur due to data source processing. To create the intersection traffic model, the reconstructed trajectories are clustered based on their source and destination lanes. For each cluster, a Gaussian process model is created to capture the average behavior and the variance of the cluster. To show the applicability of the Gaussian model, the test trajectories are classified with only partial observations. Performance is quantified by the number of observations required to correctly classify the vehicle trajectory. Both the intersection traffic modeling computations and the classification procedure are timed. These times are presented as results and demonstrate that the model can be constructed in a reasonable amount of time and the classification procedure can be used for online applications. △ Less

Submitted 5 February, 2020; originally announced February 2020.

arXiv:1911.00352 [pdf, other]

doi 10.1103/PhysRevResearch.3.013063

Quantum State Discrimination Using Noisy Quantum Neural Networks

Authors: Andrew Patterson, Hongxiang Chen, Leonard Wossnig, Simone Severini, Dan Browne, Ivan Rungger

Abstract: Near-term quantum computers are noisy, and therefore must run algorithms with a low circuit depth and qubit count. Here we investigate how noise affects a quantum neural network (QNN) for state discrimination, applicable on near-term quantum devices as it fulfils the above criteria. We find that when simulating gradient calculation on a noisy device, a large number of parameters is disadvantageous… ▽ More Near-term quantum computers are noisy, and therefore must run algorithms with a low circuit depth and qubit count. Here we investigate how noise affects a quantum neural network (QNN) for state discrimination, applicable on near-term quantum devices as it fulfils the above criteria. We find that when simulating gradient calculation on a noisy device, a large number of parameters is disadvantageous. By introducing a new smaller circuit ansatz we overcome this limitation, and find that the QNN performs well at noise levels of current quantum hardware. We also show that networks trained at higher noise levels can still converge to useful parameters. Our findings show that noisy quantum computers can be used in applications for state discrimination and for classifiers of the output of quantum generative adversarial networks. △ Less

Submitted 15 June, 2020; v1 submitted 1 November, 2019; originally announced November 2019.

Comments: 8 pages, 9 figures

Journal ref: Phys. Rev. Research 3, 013063 (2021)

arXiv:1910.04735 [pdf, other]

Dynamical mean field theory algorithm and experiment on quantum computers

Authors: I. Rungger, N. Fitzpatrick, H. Chen, C. H. Alderete, H. Apel, A. Cowtan, A. Patterson, D. Munoz Ramo, Y. Zhu, N. H. Nguyen, E. Grant, S. Chretien, L. Wossnig, N. M. Linke, R. Duncan

Abstract: The developments of quantum computing algorithms and experiments for atomic scale simulations have largely focused on quantum chemistry for molecules, while their application in condensed matter systems is scarcely explored. Here we present a quantum algorithm to perform dynamical mean field theory (DMFT) calculations for condensed matter systems on currently available quantum computers, and demon… ▽ More The developments of quantum computing algorithms and experiments for atomic scale simulations have largely focused on quantum chemistry for molecules, while their application in condensed matter systems is scarcely explored. Here we present a quantum algorithm to perform dynamical mean field theory (DMFT) calculations for condensed matter systems on currently available quantum computers, and demonstrate it on two quantum hardware platforms. DMFT is required to properly describe the large class of materials with strongly correlated electrons. The computationally challenging part arises from solving the effective problem of an interacting impurity coupled to a bath, which scales exponentially with system size on conventional computers. An exponential speedup is expected on quantum computers, but the algorithms proposed so far are based on real time evolution of the wavefunction, which requires high-depth circuits and hence very low noise levels in the quantum hardware. Here we propose an alternative approach, which uses the variational quantum eigensolver (VQE) method for ground and excited states to obtain the needed quantities as part of an exact diagonalization impurity solver. We present the algorithm for a two site DMFT system, which we benchmark using simulations on conventional computers as well as experiments on superconducting and trapped ion qubits, demonstrating that this method is suitable for running DMFT calculations on currently available quantum hardware. △ Less

Submitted 8 January, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

arXiv:1907.13592 [pdf, other]

doi 10.1126/sciadv.abe9492

Critical slowing down in the bistable regime of circuit quantum electrodynamics

Authors: P. Brookes, G. Tancredi, A. D. Patterson, J. Rahamim, M. Esposito, P. J. Leek, E. Ginossar, M. H. Szymanska

Abstract: We investigate the dynamics of the bistable regime of the generalized Jaynes-Cummings Hamiltonian (GJC), realised by a circuit quantum electrodynamics (cQED) system consisting of a transmon qubit coupled to a microwave cavity. In this regime we observe critical slowing down in the approach to the steady state. By measuring the response of the cavity to a step function drive pulse we characterize t… ▽ More We investigate the dynamics of the bistable regime of the generalized Jaynes-Cummings Hamiltonian (GJC), realised by a circuit quantum electrodynamics (cQED) system consisting of a transmon qubit coupled to a microwave cavity. In this regime we observe critical slowing down in the approach to the steady state. By measuring the response of the cavity to a step function drive pulse we characterize this slowing down as a function of driving frequency and power. We find that the critical slowing down saturates as the driving power is increased. We compare these results with the predictions of analytical and numerical calculations both with and without the Duffing approximation. We find that the Duffing approximation incorrectly predicts that the critical slowing down timescale increases exponentially with the drive, whereas the GJC model accurately predicts the saturation seen in our data, suggesting a different process of quantum activation. △ Less

Submitted 31 July, 2019; originally announced July 2019.

Journal ref: Sci. Adv.7,eabe9492 (2021)

arXiv:1907.09558 [pdf, other]

System-Level Development of a User-Integrated Semi-Autonomous Lawn Mowing System: Problem Overview, Basic Requirements, and Proposed Architecture

Authors: Albert E. Patterson, Yang Yuan, William R. Norris

Abstract: This concept paper outlines some recent efforts toward the design and development of user-integrated semi-autonomous home-sized lawn mowing systems from a systems engineering perspective. This is an important and emerging field of study within the robotics and systems engineering communities. The work presented includes a review of current progress on this problem, a discussion of the problem from… ▽ More This concept paper outlines some recent efforts toward the design and development of user-integrated semi-autonomous home-sized lawn mowing systems from a systems engineering perspective. This is an important and emerging field of study within the robotics and systems engineering communities. The work presented includes a review of current progress on this problem, a discussion of the problem from a systems engineering perspective, a general system architecture developed by the authors, and a preliminary set of design requirements. This work is meant to provide a baseline and motivation for the further development and refinement of these systems within the systems engineering and robotics communities and is relevant to both academic and commercial research. △ Less

Submitted 12 July, 2019; originally announced July 2019.

Comments: 11 pages, 8 figures, and 32 references

arXiv:1905.05670 [pdf, other]

doi 10.1103/PhysRevApplied.12.064013

Calibration of the cross-resonance two-qubit gate between directly-coupled transmons

Authors: A. D. Patterson, J. Rahamim, T. Tsunoda, P. Spring, S. Jebari, K. Ratter, M. Mergenthaler, G. Tancredi, B. Vlastakis, M. Esposito, P. J. Leek

Abstract: Quantum computation requires the precise control of the evolution of a quantum system, typically through application of discrete quantum logic gates on a set of qubits. Here, we use the cross-resonance interaction to implement a gate between two superconducting transmon qubits with a direct static dispersive coupling. We demonstrate a practical calibration procedure for the optimization of the gat… ▽ More Quantum computation requires the precise control of the evolution of a quantum system, typically through application of discrete quantum logic gates on a set of qubits. Here, we use the cross-resonance interaction to implement a gate between two superconducting transmon qubits with a direct static dispersive coupling. We demonstrate a practical calibration procedure for the optimization of the gate, combining continuous and repeated-gate Hamiltonian tomography with step-wise reduction of dominant two-qubit coherent errors through map** to microwave control parameters. We show experimentally that this procedure can enable a $\hat{ZX}_{-π/2}$ gate with a fidelity $F=97.0(7)\%$, measured with interleaved randomized benchmarking. We show this in a architecture with out-of-plane control and readout that is readily extensible to larger scale quantum circuits. △ Less

Submitted 14 May, 2019; originally announced May 2019.

Comments: 8 pages, 6 figures, 1 table

Journal ref: Phys. Rev. Applied 12, 064013 (2019)

arXiv:1904.10132 [pdf, other]

doi 10.1103/PhysRevApplied.15.064050

Realization of a Carbon-Nanotube-Based Superconducting Qubit

Authors: Matthias Mergenthaler, Ani Nersisyan, Andrew Patterson, Martina Esposito, Andreas Baumgartner, Christian Schönenberger, G. Andrew D. Briggs, Edward A. Laird, Peter J. Leek

Abstract: Hybrid circuit quantum electrodynamics (QED) involves the study of coherent quantum physics in solid state systems via their interactions with superconducting microwave circuits. Here we present an implementation of a hybrid superconducting qubit that employs a carbon nanotube as a Josephson junction. We realize the junction by contacting a carbon nanotube with a superconducting Pd/Al bi-layer, an… ▽ More Hybrid circuit quantum electrodynamics (QED) involves the study of coherent quantum physics in solid state systems via their interactions with superconducting microwave circuits. Here we present an implementation of a hybrid superconducting qubit that employs a carbon nanotube as a Josephson junction. We realize the junction by contacting a carbon nanotube with a superconducting Pd/Al bi-layer, and implement voltage tunability of the qubit frequency using a local electrostatic gate. We demonstrate strong dispersive coupling to a coplanar waveguide resonator via observation of a resonator frequency shift dependent on applied gate voltage. We extract qubit parameters from spectroscopy using dispersive readout and find qubit relaxation and coherence times in the range of $10-200~\rm{ns}$. △ Less

Submitted 22 April, 2019; originally announced April 2019.

Journal ref: Phys. Rev. Applied 15, 064050 (2021)

arXiv:1904.02765 [pdf, other]

Intent-Aware Probabilistic Trajectory Estimation for Collision Prediction with Uncertainty Quantification

Authors: Andrew Patterson, Arun Lakshmanan, Naira Hovakimyan

Abstract: Collision prediction in a dynamic and unknown environment relies on knowledge of how the environment is changing. Many collision prediction methods rely on deterministic knowledge of how obstacles are moving in the environment. However, complete deterministic knowledge of the obstacles' motion is often unavailable. This work proposes a Gaussian process based prediction method that replaces the ass… ▽ More Collision prediction in a dynamic and unknown environment relies on knowledge of how the environment is changing. Many collision prediction methods rely on deterministic knowledge of how obstacles are moving in the environment. However, complete deterministic knowledge of the obstacles' motion is often unavailable. This work proposes a Gaussian process based prediction method that replaces the assumption of deterministic knowledge of each obstacle's future behavior with probabilistic knowledge, to allow a larger class of obstacles to be considered. The method solely relies on position and velocity measurements to predict collisions with dynamic obstacles. We show that the uncertainty region for obstacle positions can be expressed in terms of a combination of polynomials generated with Gaussian process regression. To control the growth of uncertainty over arbitrary time horizons, a probabilistic obstacle intention is assumed as a distribution over obstacle positions and velocities, which can be naturally included in the Gaussian process framework. Our approach is demonstrated in two case studies in which (i), an obstacle overtakes the agent and (ii), an obstacle crosses the agent's path perpendicularly. In these simulations we show that the collision can be predicted despite having limited knowledge of the obstacle's behavior. △ Less

Submitted 4 April, 2019; originally announced April 2019.

arXiv:1902.07686 [pdf, ps, other]

Bilinear Coagulation Equations

Authors: Daniel Heydecker, Robert I. A. Patterson

Abstract: We consider coagulation equations of Smoluchowski or Flory type where the total merge rate has a bilinear form $π(y)\cdot Aπ(x)$ for a vector of conserved quantities $π$, generalising the multiplicative kernel. For these kernels, a gelation transition occurs at a finite time $t_\mathrm{g}\in (0,\infty)$, which can be given exactly in terms of an eigenvalue problem in finite dimensions. We prove a… ▽ More We consider coagulation equations of Smoluchowski or Flory type where the total merge rate has a bilinear form $π(y)\cdot Aπ(x)$ for a vector of conserved quantities $π$, generalising the multiplicative kernel. For these kernels, a gelation transition occurs at a finite time $t_\mathrm{g}\in (0,\infty)$, which can be given exactly in terms of an eigenvalue problem in finite dimensions. We prove a hydrodynamic limit for a stochastic coagulant, including a corresponding phase transition for the largest particle, and exploit a coupling to random graphs to extend analysis of the limiting process beyond the gelation time. △ Less

Submitted 14 October, 2019; v1 submitted 20 February, 2019; originally announced February 2019.

Comments: Generalises the previous version to focus on general coagulation processes of bilinear type, without restricting to the single example of the previous version. The previous results are mentioned as motivation, and all results of the previous version can be obtained from this more general version

arXiv:1902.05027 [pdf, other]

doi 10.15607/RSS.2019.XV.042

Proximity Queries for Absolutely Continuous Parametric Curves

Authors: Arun Lakshmanan, Andrew Patterson, Venanzio Cichella, Naira Hovakimyan

Abstract: In motion planning problems for autonomous robots, such as self-driving cars, the robot must ensure that its planned path is not in close proximity to obstacles in the environment. However, the problem of evaluating the proximity is generally non-convex and serves as a significant computational bottleneck for motion planning algorithms. In this paper, we present methods for a general class of abso… ▽ More In motion planning problems for autonomous robots, such as self-driving cars, the robot must ensure that its planned path is not in close proximity to obstacles in the environment. However, the problem of evaluating the proximity is generally non-convex and serves as a significant computational bottleneck for motion planning algorithms. In this paper, we present methods for a general class of absolutely continuous parametric curves to compute: (i) the minimum separating distance, (ii) tolerance verification, and (iii) collision detection. Our methods efficiently compute bounds on obstacle proximity by bounding the curve in a convex region. This bound is based on an upper bound on the curve arc length that can be expressed in closed form for a useful class of parametric curves including curves with trigonometric or polynomial bases. We demonstrate the computational efficiency and accuracy of our approach through numerical simulations of several proximity problems. △ Less

Submitted 19 June, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

Comments: Proceedings of Robotics: Science and Systems

arXiv:1902.03383 [pdf, ps, other]

Cloud Programming Simplified: A Berkeley View on Serverless Computing

Authors: Eric Jonas, Johann Schleier-Smith, Vikram Sreekanti, Chia-Che Tsai, Anurag Khandelwal, Qifan Pu, Vaishaal Shankar, Joao Carreira, Karl Krauth, Neeraja Yadwadkar, Joseph E. Gonzalez, Raluca Ada Popa, Ion Stoica, David A. Patterson

Abstract: Serverless cloud computing handles virtually all the system administration operations needed to make it easier for programmers to use the cloud. It provides an interface that greatly simplifies cloud programming, and represents an evolution that parallels the transition from assembly language to high-level programming languages. This paper gives a quick history of cloud computing, including an acc… ▽ More Serverless cloud computing handles virtually all the system administration operations needed to make it easier for programmers to use the cloud. It provides an interface that greatly simplifies cloud programming, and represents an evolution that parallels the transition from assembly language to high-level programming languages. This paper gives a quick history of cloud computing, including an accounting of the predictions of the 2009 Berkeley View of Cloud Computing paper, explains the motivation for serverless computing, describes applications that stretch the current limits of serverless, and then lists obstacles and research opportunities required for serverless computing to fulfill its full potential. Just as the 2009 paper identified challenges for the cloud and predicted they would be addressed and that cloud use would accelerate, we predict these issues are solvable and that serverless computing will grow to dominate the future of cloud computing. △ Less

Submitted 9 February, 2019; originally announced February 2019.

arXiv:1901.01876 [pdf, ps, other]

doi 10.1002/rsa.21007

A large-deviations principle for all the cluster sizes of a sparse Erdős-Rényi graph

Authors: Luisa Andreis, Wolfgang König, Robert I. A. Patterson

Abstract: Let $\mathcal{G}(N,\frac 1Nt_N)$ be the Erdős-Rényi graph with connection probability $\frac 1Nt_N\sim t/N$ as $N\to\infty$ for a fixed $t\in(0,\infty)$. We derive a large-deviations principle for the empirical measure of the sizes of all the connected components of $\mathcal{G}(N,\frac 1Nt_N)$, registered according to microscopic sizes (i.e., of finite order), macroscopic ones (i.e., of order… ▽ More Let $\mathcal{G}(N,\frac 1Nt_N)$ be the Erdős-Rényi graph with connection probability $\frac 1Nt_N\sim t/N$ as $N\to\infty$ for a fixed $t\in(0,\infty)$. We derive a large-deviations principle for the empirical measure of the sizes of all the connected components of $\mathcal{G}(N,\frac 1Nt_N)$, registered according to microscopic sizes (i.e., of finite order), macroscopic ones (i.e., of order $N$), and mesoscopic ones (everything in between). The rate function explicitly describes the microscopic and macroscopic components and the fraction of vertices in components of mesoscopic sizes. Moreover, it clearly captures the well known phase transition at $t=1$ as part of a comprehensive picture. The proofs rely on elementary combinatorics and on known estimates and asymptotics for the probability that subgraphs are connected. We also draw conclusions for the strongly related model of the multiplicative coalescent, the Marcus--Lushnikov coagulation model with monodisperse initial condition, and its gelation phase transition. △ Less

Submitted 21 January, 2021; v1 submitted 7 January, 2019; originally announced January 2019.

arXiv:1811.02597 [pdf, other]

Online Off-policy Prediction

Authors: Sina Ghiassian, Andrew Patterson, Martha White, Richard S. Sutton, Adam White

Abstract: This paper investigates the problem of online prediction learning, where learning proceeds continuously as the agent interacts with an environment. The predictions made by the agent are contingent on a particular way of behaving, represented as a value function. However, the behavior used to select actions and generate the behavior data might be different from the one used to define the prediction… ▽ More This paper investigates the problem of online prediction learning, where learning proceeds continuously as the agent interacts with an environment. The predictions made by the agent are contingent on a particular way of behaving, represented as a value function. However, the behavior used to select actions and generate the behavior data might be different from the one used to define the predictions, and thus the samples are generated off-policy. The ability to learn behavior-contingent predictions online and off-policy has long been advocated as a key capability of predictive-knowledge learning systems but remained an open algorithmic challenge for decades. The issue lies with the temporal difference (TD) learning update at the heart of most prediction algorithms: combining bootstrap**, off-policy sampling and function approximation may cause the value estimate to diverge. A breakthrough came with the development of a new objective function that admitted stochastic gradient descent variants of TD. Since then, many sound online off-policy prediction algorithms have been developed, but there has been limited empirical work investigating the relative merits of all the variants. This paper aims to fill these empirical gaps and provide clarity on the key ideas behind each method. We summarize the large body of literature on off-policy learning, focusing on 1- methods that use computation linear in the number of features and are convergent under off-policy sampling, and 2- other methods which have proven useful with non-fixed, nonlinear function approximation. We provide an empirical study of off-policy prediction methods in two challenging microworlds. We report each method's parameter sensitivity, empirical convergence rate, and final performance, providing new insights that should enable practitioners to successfully extend these new methods to large-scale applications.[Abridged abstract] △ Less

Submitted 6 November, 2018; originally announced November 2018.

Comments: 68 pages

arXiv:1807.06763 [pdf, other]

doi 10.1613/jair.1.12105

General Value Function Networks

Authors: Matthew Schlegel, Andrew Jacobsen, Zaheer Abbas, Andrew Patterson, Adam White, Martha White

Abstract: State construction is important for learning in partially observable environments. A general purpose strategy for state construction is to learn the state update using a Recurrent Neural Network (RNN), which updates the internal state using the current internal state and the most recent observation. This internal state provides a summary of the observed sequence, to facilitate accurate predictions… ▽ More State construction is important for learning in partially observable environments. A general purpose strategy for state construction is to learn the state update using a Recurrent Neural Network (RNN), which updates the internal state using the current internal state and the most recent observation. This internal state provides a summary of the observed sequence, to facilitate accurate predictions and decision-making. At the same time, specifying and training RNNs is notoriously tricky, particularly as the common strategy to approximate gradients back in time, called truncated Back-prop Through Time (BPTT), can be sensitive to the truncation window. Further, domain-expertise--which can usually help constrain the function class and so improve trainability--can be difficult to incorporate into complex recurrent units used within RNNs. In this work, we explore how to use multi-step predictions to constrain the RNN and incorporate prior knowledge. In particular, we revisit the idea of using predictions to construct state and ask: does constraining (parts of) the state to consist of predictions about the future improve RNN trainability? We formulate a novel RNN architecture, called a General Value Function Network (GVFN), where each internal state component corresponds to a prediction about the future represented as a value function. We first provide an objective for optimizing GVFNs, and derive several algorithms to optimize this objective. We then show that GVFNs are more robust to the truncation level, in many cases only requiring one-step gradient updates. △ Less

Submitted 2 February, 2021; v1 submitted 17 July, 2018; originally announced July 2018.

Comments: Published in the Journal of Artificial Intelligence Research

Journal ref: Journal of Artificial Intelligence Research, 70, 497-543 (2021)

arXiv:1806.04624 [pdf, other]

Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains

Authors: Yangchen Pan, Muhammad Zaheer, Adam White, Andrew Patterson, Martha White

Abstract: Model-based strategies for control are critical to obtain sample efficient learning. Dyna is a planning paradigm that naturally interleaves learning and planning, by simulating one-step experience to update the action-value function. This elegant planning strategy has been mostly explored in the tabular setting. The aim of this paper is to revisit sample-based planning, in stochastic and continuou… ▽ More Model-based strategies for control are critical to obtain sample efficient learning. Dyna is a planning paradigm that naturally interleaves learning and planning, by simulating one-step experience to update the action-value function. This elegant planning strategy has been mostly explored in the tabular setting. The aim of this paper is to revisit sample-based planning, in stochastic and continuous domains with learned models. We first highlight the flexibility afforded by a model over Experience Replay (ER). Replay-based methods can be seen as stochastic planning methods that repeatedly sample from a buffer of recent agent-environment interactions and perform updates to improve data efficiency. We show that a model, as opposed to a replay buffer, is particularly useful for specifying which states to sample from during planning, such as predecessor states that propagate information in reverse from a state more quickly. We introduce a semi-parametric model learning approach, called Reweighted Experience Models (REMs), that makes it simple to sample next states or predecessors. We demonstrate that REM-Dyna exhibits similar advantages over replay-based methods in learning in continuous state problems, and that the performance gap grows when moving to stochastic domains, of increasing size. △ Less

Submitted 12 June, 2018; originally announced June 2018.

Comments: IJCAI 2018

arXiv:1801.10588 [pdf, other]

Percolation for D2D Networks on Street Systems

Authors: Elie Cali, Nila Novita Gafur, Christian Hirsch, Benedikt Jahnel, Taoufik En-Najjary, Robert I. A. Patterson

Abstract: We study fundamental characteristics for the connectivity of multi-hop D2D networks. Devices are randomly distributed on street systems and are able to communicate with each other whenever their separation is smaller than some connectivity threshold. We model the street systems as Poisson-Voronoi or Poisson-Delaunay tessellations with varying street lengths. We interpret the existence of adequate… ▽ More We study fundamental characteristics for the connectivity of multi-hop D2D networks. Devices are randomly distributed on street systems and are able to communicate with each other whenever their separation is smaller than some connectivity threshold. We model the street systems as Poisson-Voronoi or Poisson-Delaunay tessellations with varying street lengths. We interpret the existence of adequate D2D connectivity as percolation of the underlying random graph. We derive and compare approximations for the critical device-intensity for percolation, the percolation probability and the graph distance. Our results show that for urban areas, the Poisson Boolean Model gives a very good approximation, while for rural areas, the percolation probability stays far from 1 even far above the percolation threshold. △ Less

Submitted 31 January, 2018; originally announced January 2018.

Comments: 6 pages, 7 figures, 1 table

arXiv:1703.05828 [pdf, ps, other]

doi 10.1063/1.4984299

Double-sided coaxial circuit QED with out-of-plane wiring

Authors: J. Rahamim, T. Behrle, M. J. Peterer, A. Patterson, P. Spring, T. Tsunoda, R. Manenti, G. Tancredi, P. J. Leek

Abstract: Superconducting circuits are well established as a strong candidate platform for the development of quantum computing. In order to advance to a practically useful level, architectures are needed which combine arrays of many qubits with selective qubit control and readout, without compromising on coherence. Here we present a coaxial circuit QED architecture in which qubit and resonator are fabricat… ▽ More Superconducting circuits are well established as a strong candidate platform for the development of quantum computing. In order to advance to a practically useful level, architectures are needed which combine arrays of many qubits with selective qubit control and readout, without compromising on coherence. Here we present a coaxial circuit QED architecture in which qubit and resonator are fabricated on opposing sides of a single chip, and control and readout wiring are provided by coaxial wiring running perpendicular to the chip plane. We present characterisation measurements of a fabricated device in good agreement with simulated parameters and demonstrating energy relaxation and dephasing times of $T_1 = 4.1\,μ$s and $T_2 = 5.7\,μ$s respectively. The architecture allows for scaling to large arrays of selectively controlled and measured qubits with the advantage of all wiring being out of the plane. △ Less

Submitted 1 June, 2017; v1 submitted 16 March, 2017; originally announced March 2017.

Comments: 4 pages, 3 figures, 1 table

Journal ref: Appl. Phys. Lett. 110, 222602 (2017)

arXiv:1703.04495 [pdf, ps, other]

doi 10.1038/s41467-017-01063-9

Circuit quantum acoustodynamics with surface acoustic waves

Authors: R. Manenti, A. F. Kockum, A. Patterson, T. Behrle, J. Rahamim, G. Tancredi, F. Nori, P. J. Leek

Abstract: The experimental investigation of quantum devices incorporating mechanical resonators has opened up new frontiers in the study of quantum mechanics at a macroscopic level$^{1,2}$. Superconducting microwave circuits have proven to be a powerful platform for the realisation of such quantum devices, both in cavity optomechanics$^{3,4}$, and circuit quantum electro-dynamics (QED)$^{5,6}$. While most e… ▽ More The experimental investigation of quantum devices incorporating mechanical resonators has opened up new frontiers in the study of quantum mechanics at a macroscopic level$^{1,2}$. Superconducting microwave circuits have proven to be a powerful platform for the realisation of such quantum devices, both in cavity optomechanics$^{3,4}$, and circuit quantum electro-dynamics (QED)$^{5,6}$. While most experiments to date have involved localised nanomechanical resonators, it has recently been shown that propagating surface acoustic waves (SAWs) can be piezoelectrically coupled to superconducting qubits$^{7,8}$, and confined in high-quality Fabry-Perot cavities up to microwave frequencies in the quantum regime$^{9}$, indicating the possibility of realising coherent exchange of quantum information between the two systems. Here we present measurements of a device in which a superconducting qubit is embedded in, and interacts with, the acoustic field of a Fabry-Perot SAW cavity on quartz, realising a surface acoustic version of cavity quantum electrodynamics. This quantum acoustodynamics (QAD) architecture may be used to develop new quantum acoustic devices in which quantum information is stored in trapped on-chip surface acoustic wavepackets, and manipulated in ways that are impossible with purely electromagnetic signals, due to the $10^{5}$ times slower speed of travel of the mechanical waves. △ Less

Submitted 13 March, 2017; originally announced March 2017.

Comments: 12 pages, 9 figures, 1 table

Journal ref: Nature Communications 8, 975 (2017)

arXiv:1611.10354 [pdf, other]

doi 10.1103/PhysRevLett.118.040402

Simultaneous bistability of qubit and resonator in circuit quantum electrodynamics

Authors: Th. K. Mavrogordatos, G. Tancredi, M. Elliott, M. J. Peterer, A. Patterson, J. Rahamim, P. J. Leek, E. Ginossar, M. H. Szymańska

Abstract: We explore the joint activated dynamics exhibited by two quantum degrees of freedom: a cavity mode oscillator which is strongly coupled to a superconducting qubit in the strongly coherently driven dispersive regime. Dynamical simulations and complementary measurements show a range of parameters where both the cavity and the qubit exhibit sudden simultaneous switching between two metastable states.… ▽ More We explore the joint activated dynamics exhibited by two quantum degrees of freedom: a cavity mode oscillator which is strongly coupled to a superconducting qubit in the strongly coherently driven dispersive regime. Dynamical simulations and complementary measurements show a range of parameters where both the cavity and the qubit exhibit sudden simultaneous switching between two metastable states. This manifests in ensemble averaged amplitudes of both the cavity and qubit exhibiting a partial coherent cancellation. Transmission measurements of driven microwave cavities coupled to transmon qubits show detailed features which agree with the theory in the regime of simultaneous switching. △ Less

Submitted 2 January, 2017; v1 submitted 30 November, 2016; originally announced November 2016.

Journal ref: Phys. Rev. Lett. 118, 040402 (2017)

arXiv:1610.06953 [pdf, other]

Estimation and simulation of foraging trips in land-based marine predators

Authors: Théo Michelot, Roland Langrock, Sophie Bestley, Ian D. Jonsen, Theoni Photopoulou, Toby A. Patterson

Abstract: The behaviour of colony-based marine predators is the focus of much research globally. Large telemetry and tracking data sets have been collected for this group of animals, and are accompanied by many theoretical studies of optimal foraging strategies. However, relatively few studies have detailed statistical methods for inferring behaviours in central place foraging trips. In this paper we descri… ▽ More The behaviour of colony-based marine predators is the focus of much research globally. Large telemetry and tracking data sets have been collected for this group of animals, and are accompanied by many theoretical studies of optimal foraging strategies. However, relatively few studies have detailed statistical methods for inferring behaviours in central place foraging trips. In this paper we describe an approach based on hidden Markov models, which splits foraging trips into segments labelled as "outbound", "search", "forage", and "inbound". By structuring the hidden Markov model transition matrix appropriately, the model naturally handles the sequence of behaviours within a foraging trip. Additionally, by structuring the model in this way, we are able to develop realistic simulations from the fitted model. We demonstrate our approach on data from southern elephant seals (Mirounga leonina) tagged on Kerguelen Island in the Southern Ocean. We discuss the differences between our 4-state model and the widely used 2-state model, and the advantages and disadvantages of employing a more complex model. △ Less

Submitted 25 April, 2017; v1 submitted 20 October, 2016; originally announced October 2016.

arXiv:1607.02318 [pdf, other]

The Renewed Case for the Reduced Instruction Set Computer: Avoiding ISA Bloat with Macro-Op Fusion for RISC-V

Authors: Christopher Celio, Palmer Dabbelt, David A. Patterson, Krste Asanović

Abstract: This report makes the case that a well-designed Reduced Instruction Set Computer (RISC) can match, and even exceed, the performance and code density of existing commercial Complex Instruction Set Computers (CISC) while maintaining the simplicity and cost-effectiveness that underpins the original RISC goals. We begin by comparing the dynamic instruction counts and dynamic instruction bytes fetche… ▽ More This report makes the case that a well-designed Reduced Instruction Set Computer (RISC) can match, and even exceed, the performance and code density of existing commercial Complex Instruction Set Computers (CISC) while maintaining the simplicity and cost-effectiveness that underpins the original RISC goals. We begin by comparing the dynamic instruction counts and dynamic instruction bytes fetched for the popular proprietary ARMv7, ARMv8, IA-32, and x86-64 Instruction Set Architectures (ISAs) against the free and open RISC-V RV64G and RV64GC ISAs when running the SPEC CINT2006 benchmark suite. RISC-V was designed as a very small ISA to support a wide range of implementations, and has a less mature compiler toolchain. However, we observe that on SPEC CINT2006 RV64G executes on average 16% more instructions than x86-64, 3% more instructions than IA-32, 9% more instructions than ARMv8, but 4% fewer instructions than ARMv7. CISC x86 implementations break up complex instructions into smaller internal RISC-like micro-ops, and the RV64G instruction count is within 2% of the x86-64 retired micro-op count. RV64GC, the compressed variant of RV64G, is the densest ISA studied, fetching 8% fewer dynamic instruction bytes than x86-64. We observed that much of the increased RISC-V instruction count is due to a small set of common multi-instruction idioms. Exploiting this fact, the RV64G and RV64GC effective instruction count can be reduced by 5.4% on average by leveraging macro-op fusion. Combining the compressed RISC-V ISA extension with macro-op fusion provides both the densest ISA and the fewest dynamic operations retired per program, reducing the motivation to add more instructions to the ISA. This approach retains a single simple ISA suitable for both low-end and high-end implementations, where high-end implementations can boost performance through microarchitectural techniques. △ Less

Submitted 8 July, 2016; originally announced July 2016.

Report number: UCB/EECS-2016-130

arXiv:1603.07511 [pdf, other]

Statistical modelling of individual animal movement: an overview of key methods and a discussion of practical challenges

Authors: Toby A Patterson, Alison Parton, Roland Langrock, Paul G Blackwell, Len Thomas, Ruth King

Abstract: With the influx of complex and detailed tracking data gathered from electronic tracking devices, the analysis of animal movement data has recently emerged as a cottage industry amongst biostatisticians. New approaches of ever greater complexity are continue to be added to the literature. In this paper, we review what we believe to be some of the most popular and most useful classes of statistical… ▽ More With the influx of complex and detailed tracking data gathered from electronic tracking devices, the analysis of animal movement data has recently emerged as a cottage industry amongst biostatisticians. New approaches of ever greater complexity are continue to be added to the literature. In this paper, we review what we believe to be some of the most popular and most useful classes of statistical models used to analyze individual animal movement data. Specifically we consider discrete-time hidden Markov models, more general state-space models and diffusion processes. We argue that these models should be core components in the toolbox for quantitative researchers working on stochastic modelling of individual animal movement. The paper concludes by offering some general observations on the direction of statistical analysis of animal movement. There is a trend in movement ecology toward what are arguably overly-complex modelling approaches which are inaccessible to ecologists, unwieldy with large data sets or not based in mainstream statistical practice. Additionally, some analysis methods developed within the ecological community ignore fundamental properties of movement data, potentially leading to misleading conclusions about animal movement. Corresponding approaches, e.g. based on Lévy walk-type models, continue to be popular despite having been largely discredited. We contend that there is a need for an appropriate balance between the extremes of either being overly complex or being overly simplistic, whereby the discipline relies on models of intermediate complexity that are usable by general ecologists, but grounded in well-developed statistical practice and efficient to fit to large data sets. △ Less

Submitted 30 January, 2017; v1 submitted 24 March, 2016; originally announced March 2016.

arXiv:1602.06466 [pdf, other]

Analysis of animal accelerometer data using hidden Markov models

Authors: Vianey Leos-Barajas, Theoni Photopoulou, Roland Langrock, Toby A. Patterson, Yuuki Watanabe, Megan Murgatroyd, Yannis P. Papastamatiou

Abstract: Use of accelerometers is now widespread within animal biotelemetry as they provide a means of measuring an animal's activity in a meaningful and quantitative way where direct observation is not possible. In sequential acceleration data there is a natural dependence between observations of movement or behaviour, a fact that has been largely ignored in most analyses. Analyses of acceleration data wh… ▽ More Use of accelerometers is now widespread within animal biotelemetry as they provide a means of measuring an animal's activity in a meaningful and quantitative way where direct observation is not possible. In sequential acceleration data there is a natural dependence between observations of movement or behaviour, a fact that has been largely ignored in most analyses. Analyses of acceleration data where serial dependence has been explicitly modelled have largely relied on hidden Markov models (HMMs). Depending on the aim of an analysis, either a supervised or an unsupervised learning approach can be applied. Under a supervised context, an HMM is trained to classify unlabelled acceleration data into a finite set of pre-specified categories, whereas we will demonstrate how an unsupervised learning approach can be used to infer new aspects of animal behaviour. We will provide the details necessary to implement and assess an HMM in both the supervised and unsupervised context, and discuss the data requirements of each case. We outline two applications to marine and aerial systems (sharks and eagles) taking the unsupervised approach, which is more readily applicable to animal activity measured in the field. HMMs were used to infer the effects of temporal, atmospheric and tidal inputs on animal behaviour. Animal accelerometer data allow ecologists to identify important correlates and drivers of animal activity (and hence behaviour). The HMM framework is well suited to deal with the main features commonly observed in accelerometer data. The ability to combine direct observations of animals activity and combine it with statistical models which account for the features of accelerometer data offer a new way to quantify animal behaviour, energetic expenditure and deepen our insights into individual behaviour as a constituent of populations and ecosystems. △ Less

Submitted 20 February, 2016; originally announced February 2016.

arXiv:1602.01009 [pdf, ps, other]

Traffic flow densities in large transport networks

Authors: Christian Hirsch, Benedikt Jahnel, Paul Keeler, Robert I. A. Patterson

Abstract: We consider transport networks with nodes scattered at random in a large domain. At certain local rates, the nodes generate traffic flowing according to some navigation scheme in a given direction. In the thermodynamic limit of a growing domain, we present an asymptotic formula expressing the local traffic flow density at any given location in the domain in terms of three fundamental characteristi… ▽ More We consider transport networks with nodes scattered at random in a large domain. At certain local rates, the nodes generate traffic flowing according to some navigation scheme in a given direction. In the thermodynamic limit of a growing domain, we present an asymptotic formula expressing the local traffic flow density at any given location in the domain in terms of three fundamental characteristics of the underlying network: the spatial intensity of the nodes together with their traffic generation rates, and of the links induced by the navigation. This formula holds for a general class of navigations satisfying a link-density and a sub-ballisticity condition. As a specific example, we verify these conditions for navigations arising from a directed spanning tree on a Poisson point process with inhomogeneous intensity function. △ Less

Submitted 2 February, 2016; originally announced February 2016.

Comments: 20 pages, 7 figures

MSC Class: 60K30; 60F15; 90B20

arXiv:1601.05838 [pdf, other]

doi 10.1016/j.physd.2016.06.007

Kinetic Theory of Cluster Dynamics

Authors: Robert I. A. Patterson, Sergio Simonella, Wolfgang Wagner

Abstract: In a Newtonian system with localized interactions the whole set of particles is naturally decomposed into dynamical clusters, defined as finite groups of particles having an influence on each other's trajectory during a given interval of time. For an ideal gas with short-range intermolecular force, we provide a description of the cluster size distribution in terms of the reduced Boltzmann density.… ▽ More In a Newtonian system with localized interactions the whole set of particles is naturally decomposed into dynamical clusters, defined as finite groups of particles having an influence on each other's trajectory during a given interval of time. For an ideal gas with short-range intermolecular force, we provide a description of the cluster size distribution in terms of the reduced Boltzmann density. In the simplified context of Maxwell molecules, we show that a macroscopic fraction of the gas forms a giant component in finite kinetic time. The critical index of this phase transition is in agreement with previous numerical results on the elastic billiard. △ Less

Submitted 24 June, 2016; v1 submitted 21 January, 2016; originally announced January 2016.

Journal ref: Physica D: Nonlinear Phenomena 335, 26-32 (2016)

arXiv:1511.07103 [pdf, other]

Modelling latent individual heterogeneity in mark-recapture data with Dirichlet process priors

Authors: Jessica H Ford, Toby A Patterson, Mark V Bravington

Abstract: The natural subgroups often seen in mark-recapture studies and the complexity of real mark-recapture data means that parametric and discrete style models can be insufficient. Non-parametric models avoid these often restrictive assumptions. We consider the non-parametric Dirichlet process for modelling latent individual heterogeneity in probability of observation and the probability of remaining in… ▽ More The natural subgroups often seen in mark-recapture studies and the complexity of real mark-recapture data means that parametric and discrete style models can be insufficient. Non-parametric models avoid these often restrictive assumptions. We consider the non-parametric Dirichlet process for modelling latent individual heterogeneity in probability of observation and the probability of remaining in or out of a marine sanctuary. Simulation studies demonstrated accurate estimation of multiple groups of latent individual heterogeneity. Simulations were also used to identify the limits of the Dirichlet process. The ability of the Dirichlet process to pick up unimodal heterogeneity was explored in order to avoid potential spurious multimodality. In application to a subset of the data from the North Atlantic humpback whales we were able to estimate annual population-level variation in usage of the marine sanctuary and three measures of individual-level variation. With the Dirichlet process prior we were able to detect multimodality in each parameter. △ Less

Submitted 22 November, 2015; originally announced November 2015.

Comments: 18 pages, 8 figures

arXiv:1511.07102 [pdf, other]

Efficient MCMC implementation of multi-state mark-recapture models

Authors: Jessica H Ford, Toby A Patterson, Mark V Bravington

Abstract: Inherent differences in behaviour of individual animal movement can introduce bias into estimates of population parameters derived from mark-recapture data. Additionally, quantifying individual heterogeneity is of considerable interest in it's own right as numerous studies have shown how heterogeneity can drive population dynamics. In this paper we incorporate multiple measures of individual heter… ▽ More Inherent differences in behaviour of individual animal movement can introduce bias into estimates of population parameters derived from mark-recapture data. Additionally, quantifying individual heterogeneity is of considerable interest in it's own right as numerous studies have shown how heterogeneity can drive population dynamics. In this paper we incorporate multiple measures of individual heterogeneity into a multi-state mark-recapture model, using a Beta-Binomial Gibbs sampler using MCMC estimation. We also present a novel Independent Metropolis-Hastings sampler which allows for efficient updating of the hyper-parameters which cannot be updated using Gibbs sampling. We tested the model using simulation studies and applied the model to mark-resight data of North Atlantic humpback whales observed in the Stellwagen Bank National Marine Sanctuary where heterogeneity is present in both sighting probability and site preference. Simulation studies show asymptotic convergence of the posterior distribution for each of the hyper-parameters to true parameter values. In application to humpback whales individual heterogeneity is evident in sighting probability and propensity to use the marine sanctuary. △ Less

Submitted 22 November, 2015; originally announced November 2015.

Comments: 23 pages, 9 figures

Showing 1–50 of 69 results for author: Patterson, A