-
Existence and Uniqueness for the SQG Vortex-Wave System when the Vorticity is Constant near the Point-Vortex
Authors:
Dimitri Cobb,
Martin Donati,
Ludovic Godard-Cadillac
Abstract:
This article studies the vortex-wave system for the Surface Quasi-Geostrophic equation with parameter 0 < s < 1. We obtain local existence of classical solutions in H^4 under the standard "plateau hypothesis", H^2-stability of the solutions, and a blow-up criterion. In the sub-critical case s > 1/2, we establish global existence of weak solutions. For the critical case s = 1/2, we introduce a weaker notion of solution (V-weak solutions) to give a meaning to the equation and prove global existence.
Submitted 5 January, 2024;
originally announced January 2024.
-
Weak Solutions for a non-Newtonian Stokes-Transport System
Authors:
Dimitri Cobb,
Geoffrey Lacour
Abstract:
In this article, we study a non-Newtonian Stokes-Transport system. This set of PDEs was introduced as a model for the behavior of a cloud of particles in suspension in a Stokes fluid. It is a nonlinear coupling between a hyperbolic equation (Transport) and a nonlinear elliptic equation (non-Newtonian Stokes), and as such can be considered an active scalar equation. We prove the existence of global weak solutions with initial data in critical Lebesgue spaces. To overcome the difficulties introduced by the highly nonlinear aspect of this problem, we resort to a combination of the DiPerna-Lions theory of transport equations and Minty's trick for elliptic equations.
Submitted 4 January, 2024;
originally announced January 2024.
-
High-contrast JWST-MIRI spectroscopy of planet-forming disks for the JDISC Survey
Authors:
Klaus M. Pontoppidan,
Colette Salyk,
Andrea Banzatti,
Ke Zhang,
Ilaria Pascucci,
Karin I. Oberg,
Feng Long,
Carlos Munoz-Romero,
John Carr,
Joan Najita,
Geoffrey A. Blake,
Nicole Arulanantham,
Sean Andrews,
Nicholas P. Ballering,
Edwin Bergin,
Jenny Calahan,
Douglas Cobb,
Maria Jose Colmenares,
Annie Dickson-Vandervelde,
Anna Dignan,
Joel Green,
Phoebe Heretz,
Greg Herczeg,
Anusha Kalyaan,
Sebastian Krijt,
et al. (4 additional authors not shown)
Abstract:
The JWST Disk Infrared Spectral Chemistry Survey (JDISCS) aims to understand the evolution of the chemistry of inner protoplanetary disks using the Mid-InfraRed Instrument (MIRI) on the James Webb Space Telescope (JWST). With a growing sample of >30 disks, the survey implements a custom method to calibrate the MIRI Medium Resolution Spectrometer (MRS) to contrasts of better than 1:300 across its 4.9-28 micron spectral range. This is achieved using observations of Themis-family asteroids as precise empirical reference sources. High spectral contrast enables precise retrievals of physical parameters, searches for rare molecular species and isotopologues, and constraints on the inventories of carbon- and nitrogen-bearing species. JDISCS also offers significant improvements to the MRS wavelength and resolving power calibration. We describe the JDISCS calibrated data and demonstrate its quality using observations of the disk around the solar-mass young star FZ Tau. The FZ Tau MIRI spectrum is dominated by strong emission from warm water vapor. We show that the water and CO line emission originates from the disk surface and traces a range of gas temperatures of ~500-1500 K. We retrieve parameters for the observed CO and H2O lines, and show that they are consistent with a radial distribution represented by two temperature components. A high water abundance of n(H2O)~10^-4 fills the disk surface at least out to the 350 K isotherm at 1.5 au. We search the FZ Tau environs for extended emission, detect a large (radius of ~300 au) ring of emission from H2 gas surrounding FZ Tau, and discuss its origin.
Submitted 16 January, 2024; v1 submitted 28 November, 2023;
originally announced November 2023.
-
Direct Amortized Likelihood Ratio Estimation
Authors:
Adam D. Cobb,
Brian Matejek,
Daniel Elenius,
Anirban Roy,
Susmit Jha
Abstract:
We introduce a new amortized likelihood ratio estimator for likelihood-free simulation-based inference (SBI). Our estimator is simple to train and estimates the likelihood ratio using a single forward pass of the neural estimator. Our approach directly computes the likelihood ratio between two competing parameter sets, which differs from the previous approach of comparing two neural network output values. We refer to our model as the direct neural ratio estimator (DNRE). As part of introducing the DNRE, we derive a corresponding Monte Carlo estimate of the posterior. We benchmark our new ratio estimator and compare it to previous ratio estimators in the literature. We show that our new ratio estimator often outperforms these previous approaches. As a further contribution, we introduce a new derivative estimator for likelihood ratio estimators that enables us to compare likelihood-free Hamiltonian Monte Carlo (HMC) with random-walk Metropolis-Hastings (MH). We show that HMC is equally competitive, which has not been previously shown. Finally, we include a novel real-world application of SBI by using our neural ratio estimator to design a quadcopter. Code is available at https://github.com/SRI-CSL/dnre.
Submitted 17 November, 2023;
originally announced November 2023.
-
AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs
Authors:
Adam D. Cobb,
Anirban Roy,
Daniel Elenius,
F. Michael Heim,
Brian Swenson,
Sydney Whittington,
James D. Walker,
Theodore Bapty,
Joseph Hite,
Karthik Ramani,
Christopher McComb,
Susmit Jha
Abstract:
We present AircraftVerse, a publicly available aerial vehicle design dataset. Aircraft design encompasses different physics domains and, hence, multiple modalities of representation. The evaluation of these cyber-physical system (CPS) designs requires the use of scientific analytical and simulation models, including computer-aided design tools for structural and manufacturing analysis, computational fluid dynamics tools for drag and lift computation, battery models for energy estimation, and simulation models for flight control and dynamics. AircraftVerse contains 27,714 diverse air vehicle designs - the largest corpus of engineering designs with this level of complexity. Each design comprises the following artifacts: a symbolic design tree describing topology, propulsion subsystem, battery subsystem, and other design details; a STandard for the Exchange of Product (STEP) model data; a 3D CAD design using a stereolithography (STL) file format; a 3D point cloud for the shape of the design; and evaluation results from high fidelity state-of-the-art physics models that characterize performance metrics such as maximum flight distance and hover-time. We also present baseline surrogate models that use different modalities of design representation to predict design performance metrics, which we provide as part of our dataset release. Finally, we discuss the potential impact of this dataset on the use of learning in aircraft design and, more generally, in CPS. AircraftVerse is accompanied by a data card, and it is released under a Creative Commons Attribution-ShareAlike (CC BY-SA) license. The dataset is hosted at https://zenodo.org/record/6525446, baseline models and code at https://github.com/SRI-CSL/AircraftVerse, and the dataset description at https://aircraftverse.onrender.com/.
Submitted 8 June, 2023;
originally announced June 2023.
-
Comparing Machines and Children: Using Developmental Psychology Experiments to Assess the Strengths and Weaknesses of LaMDA Responses
Authors:
Eliza Kosoy,
Emily Rose Reagan,
Leslie Lai,
Alison Gopnik,
Danielle Krettek Cobb
Abstract:
Developmental psychologists have spent decades devising experiments to test the intelligence and knowledge of infants and children, tracing the origin of crucial concepts and capacities. Moreover, experimental techniques in developmental psychology have been carefully designed to discriminate the cognitive capacities that underlie particular behaviors. We propose that using classical experiments from child development is a particularly effective way to probe the computational abilities of AI models, in general, and LLMs in particular. First, the methodological techniques of developmental psychology, such as the use of novel stimuli to control for past experience or control conditions to determine whether children are using simple associations, can be equally helpful for assessing the capacities of LLMs. In parallel, testing LLMs in this way can tell us whether the information that is encoded in text is sufficient to enable particular responses, or whether those responses depend on other kinds of information, such as information from exploration of the physical world. In this work we adapt classical developmental experiments to evaluate the capabilities of LaMDA, a large language model from Google. We propose a novel LLM Response Score (LRS) metric which can be used to evaluate other language models, such as GPT. We find that LaMDA generates appropriate responses that are similar to those of children in experiments involving social understanding, perhaps providing evidence that knowledge of these domains is discovered through language. On the other hand, LaMDA's responses in early object and action understanding, theory of mind, and especially causal reasoning tasks are very different from those of young children, perhaps showing that these domains require more real-world, self-initiated exploration and cannot simply be learned from patterns in language input.
Submitted 7 November, 2023; v1 submitted 18 May, 2023;
originally announced May 2023.
-
On the Well-Posedness of a Fractional Stokes-Transport System
Authors:
Dimitri Cobb
Abstract:
The purpose of this paper is to study the existence, uniqueness and lifespan of solutions for a fractional Stokes-Transport system. This problem should be understood as a model for sedimentation in a fluid where the viscosity law is given by a fractional Laplace operator $(-\Delta)^{\alpha/2}$, with $\alpha = 2$ corresponding to the case of a normal viscous fluid, and $\alpha = 0$ reducing the problem to the Inviscid Incompressible Porous Media equation. For each value of $\alpha \in [0, d]$, we prove various results related to well-posedness in critical function spaces, such as the existence of global weak solutions (for $\alpha > 0$), local existence and uniqueness (for $\alpha \geq 0$), and global existence and uniqueness (for $\alpha \geq 1$), and we study the lifespan of local solutions (for $0 \leq \alpha < 1$). In particular, we show that gravity stratification leads to a directional blow-up criterion for local solutions (for $\alpha \in [0, 1[$) and find a lower bound for the lifespan of solutions which depends on the value of the dissipation parameter $\alpha \in [0, 1[$.
Submitted 25 January, 2023;
originally announced January 2023.
-
Design of Unmanned Air Vehicles Using Transformer Surrogate Models
Authors:
Adam D. Cobb,
Anirban Roy,
Daniel Elenius,
Susmit Jha
Abstract:
Computer-aided design (CAD) is a promising new area for the application of artificial intelligence (AI) and machine learning (ML). The current practice of design of cyber-physical systems uses the digital twin methodology, wherein the actual physical design is preceded by building detailed models that can be evaluated by physics simulation models. These physics models are often slow, and the manual design process often relies on exploring nearby variations of existing designs. AI holds the promise of breaking these design silos and increasing the diversity and performance of designs by accelerating the exploration of the design space. In this paper, we focus on the design of electrical unmanned aerial vehicles (UAVs). High-density batteries and purely electrical propulsion systems have disrupted the space of UAV design, making this domain an ideal target for AI-based design. We develop an AI Designer that synthesizes novel UAV designs. Our approach uses a deep transformer model with a novel domain-specific encoding such that we can evaluate the performance of newly proposed designs without running expensive flight dynamics models and CAD tools. We demonstrate that our approach significantly reduces the overall compute requirements for the design process and accelerates the design space exploration. Finally, we identify future research directions to achieve full-scale deployment of AI-assisted CAD for UAVs.
Submitted 11 November, 2022;
originally announced November 2022.
-
Remarks on Chemin's space of homogeneous distributions
Authors:
Dimitri Cobb
Abstract:
This article focuses on Chemin's space $\mathcal{S}'_h$ of homogeneous distributions, which was introduced to serve as a basis for realizations of subcritical homogeneous Besov spaces. We will discuss how this construction fails in multiple ways for supercritical spaces. In particular, we study its intersection $X_h := \mathcal{S}'_h \cap X$ with various Banach spaces $X$, namely supercritical homogeneous Besov spaces and the Lebesgue space $L^\infty$. For each $X$, we find out if the intersection $X_h$ is dense in $X$. If it is not, then we study its closure $C = {\rm clos}(X_h)$ and prove that the quotient $X/C$ is not separable and that $C$ is not complemented in $X$.
Submitted 15 July, 2022;
originally announced July 2022.
-
Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning
Authors:
Meet P. Vadera,
Adam D. Cobb,
Brian Jalaian,
Benjamin M. Marlin
Abstract:
Bayesian methods hold significant promise for improving the uncertainty quantification ability and robustness of deep neural network models. Recent research has seen the investigation of a number of approximate Bayesian inference methods for deep neural networks, building on both the variational Bayesian and Markov chain Monte Carlo (MCMC) frameworks. A fundamental issue with MCMC methods is that the improvements they enable are obtained at the expense of increased computation time and model storage costs. In this paper, we investigate the potential of sparse network structures to flexibly trade off model storage costs and inference run time against predictive performance and uncertainty quantification ability. We use stochastic gradient MCMC methods as the core Bayesian inference method and consider a variety of approaches for selecting sparse network structures. Surprisingly, our results show that certain classes of randomly selected substructures can perform as well as substructures derived from state-of-the-art iterative pruning methods while drastically reducing model training times.
Submitted 8 February, 2022;
originally announced February 2022.
-
HumBugDB: A Large-scale Acoustic Mosquito Dataset
Authors:
Ivan Kiskin,
Marianne Sinka,
Adam D. Cobb,
Waqas Rafique,
Lawrence Wang,
Davide Zilli,
Benjamin Gutteridge,
Rinita Dam,
Theodoros Marinos,
Yunpeng Li,
Dickson Msaky,
Emmanuel Kaindoa,
Gerard Killeen,
Eva Herreros-Moya,
Kathy J. Willis,
Stephen J. Roberts
Abstract:
This paper presents the first large-scale multi-species dataset of acoustic recordings of mosquitoes tracked continuously in free flight. We present 20 hours of audio recordings that we have expertly labelled and tagged precisely in time. Significantly, 18 hours of recordings contain annotations from 36 different species. Mosquitoes are well-known carriers of diseases such as malaria, dengue and yellow fever. Collecting this dataset is motivated by the need to assist applications which utilise mosquito acoustics to conduct surveys to help predict outbreaks and inform intervention policy. The task of detecting mosquitoes from the sound of their wingbeats is challenging due to the difficulty in collecting recordings from realistic scenarios. To address this, as part of the HumBug project, we conducted global experiments to record mosquitoes ranging from those bred in culture cages to mosquitoes captured in the wild. Consequently, the audio recordings vary in signal-to-noise ratio and contain a broad range of indoor and outdoor background environments from Tanzania, Thailand, Kenya, the USA and the UK. In this paper we describe in detail how we collected, labelled and curated the data. The data is provided from a PostgreSQL database, which contains important metadata such as the capture method, age, feeding status and gender of the mosquitoes. Additionally, we provide code to extract features and train Bayesian convolutional neural networks for two key tasks: the identification of mosquitoes from their corresponding background environments, and the classification of detected mosquitoes into species. Our extensive dataset is both challenging to machine learning researchers focusing on acoustic identification, and critical to entomologists, geo-spatial modellers and other domain experts to understand mosquito behaviour, model their distribution, and manage the threat they pose to humans.
Submitted 14 October, 2021;
originally announced October 2021.
-
Bounded Solutions in Incompressible Hydrodynamics
Authors:
Dimitri Cobb
Abstract:
In this article, we study bounded solutions of Euler-type equations on $\mathbb{R}^d$ which have no integrability at $|x| \rightarrow +\infty$. As has been previously noted, such solutions fail to achieve uniqueness in an initial value problem, even under strong smoothness conditions. This contrasts with well-posedness results that have been obtained by using the Leray projection operator in these equations. This apparent paradox is solved by noting that using the Leray projector requires an extra condition the solutions must fulfill at $|x| \rightarrow + \infty$. Our goal is to find one such condition which is sharp. We then apply the methods we develop to prove a full uniqueness result for Besov-Lipschitz solutions, as well as to the theory of Serfati solutions. In the last section, we see how these techniques also apply to the Elsässer variables used in ideal MHD.
Submitted 23 January, 2023; v1 submitted 7 May, 2021;
originally announced May 2021.
-
Symmetry breaking in ideal magnetohydrodynamics: the role of the velocity
Authors:
Dimitri Cobb,
Francesco Fanelli
Abstract:
The ideal magnetohydrodynamic equations are, roughly speaking, a quasi-linear symmetric hyperbolic system of PDEs, but not all the unknowns play the same role in this system. Indeed, in the regime of small magnetic fields, the equations are close to the incompressible Euler equations. In the present paper, we adopt this point of view to study questions linked with the lifespan of strong solutions to the ideal magnetohydrodynamic equations. First of all, we prove a continuation criterion in terms of the velocity field only. Secondly, we refine the explicit lower bound for the lifespan of $2$-D flows found in [11], by relaxing the regularity assumptions on the initial magnetic field.
Submitted 26 February, 2021;
originally announced February 2021.
-
Better call Surrogates: A hybrid Evolutionary Algorithm for Hyperparameter optimization
Authors:
Subhodip Biswas,
Adam D Cobb,
Andreea Sistrunk,
Naren Ramakrishnan,
Brian Jalaian
Abstract:
In this paper, we propose a surrogate-assisted evolutionary algorithm (EA) for hyperparameter optimization of machine learning (ML) models. The proposed STEADE model initially estimates the objective function landscape using Radial Basis Function interpolation, and then transfers the knowledge to an EA technique called Differential Evolution that is used to evolve new solutions guided by a Bayesian optimization framework. We empirically evaluate our model on the hyperparameter optimization problems as a part of the black box optimization challenge at NeurIPS 2020 and demonstrate the improvement brought about by STEADE over the vanilla EA.
Submitted 11 December, 2020;
originally announced December 2020.
-
Scaling Hamiltonian Monte Carlo Inference for Bayesian Neural Networks with Symmetric Splitting
Authors:
Adam D. Cobb,
Brian Jalaian
Abstract:
Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo (MCMC) approach that exhibits favourable exploration properties in high-dimensional models such as neural networks. Unfortunately, HMC has limited use in large-data regimes and little work has explored suitable approaches that aim to preserve the entire Hamiltonian. In our work, we introduce a new symmetric integration scheme for split HMC that does not rely on stochastic gradients. We show that our new formulation is more efficient than previous approaches and is easy to implement with a single GPU. As a result, we are able to perform full HMC over common deep learning architectures using entire data sets. In addition, when we compare with stochastic gradient MCMC, we show that our method achieves better performance in both accuracy and uncertainty quantification. Our approach demonstrates HMC as a feasible option when considering inference schemes for large-scale machine learning problems.
Submitted 13 October, 2020;
originally announced October 2020.
-
Elsässer formulation of the ideal MHD and improved lifespan in two space dimensions
Authors:
Dimitri Cobb,
Francesco Fanelli
Abstract:
In the present paper, we show an improved lower bound for the lifespan of the solutions to the ideal MHD equations in the case of space dimension $d=2$. In particular, for small initial magnetic fields $b_0$ of size (say) $\varepsilon>0$, the lifespan $T_\varepsilon>0$ of the corresponding solution goes to $+\infty$ in the limit $\varepsilon\rightarrow0^+$.
Such a result does not follow from standard quasi-linear hyperbolic theory. The proof relies on three crucial ingredients: first, working in endpoint Besov spaces $B^s_{\infty,r}$, under the condition $s>1$ and $r\in[1,+\infty]$ or $s=r=1$; second, using the Elsässer formulation of the ideal MHD, recast in its vorticity formulation; finally, taking advantage of the special structure of the non-linear terms.
We also rigorously establish the equivalence between the original formulation of the ideal MHD and its Elsässer formulation for a large class of weak solutions. The construction of explicit counterexamples shows the sharpness of our assumptions. Related non-uniqueness issues are discussed as well.
Submitted 23 September, 2020;
originally announced September 2020.
-
Rigorous derivation and well-posedness of a quasi-homogeneous ideal MHD system
Authors:
Dimitri Cobb,
Francesco Fanelli
Abstract:
The goal of this paper is twofold. On the one hand, we introduce a quasi-homogeneous version of the classical ideal MHD system and study its well-posedness in critical Besov spaces $B^s_{p,r}(\mathbb{R}^d)$, $d\geq2$, with $1<p<+\infty$ and under the Lipschitz condition $s>1+d/p$ and $r\in[1,+\infty]$, or $s=1+d/p$ and $r=1$. A key ingredient is the reformulation of the system via the so-called Elsässer variables. On the other hand, we give a rigorous justification of quasi-homogeneous MHD models, both in the ideal and in the dissipative cases: when $d=2$, we will derive them from a non-homogeneous incompressible MHD system with Coriolis force, in the regime of low Rossby number and for small density variations around a constant state. Our method of proof relies on a relative entropy inequality for the primitive system, and yields precise rates of convergence, depending on the size of the initial data, on the order of the Rossby number and on the regularity of the viscosity and resistivity coefficients.
Submitted 29 July, 2020;
originally announced July 2020.
-
URSABench: Comprehensive Benchmarking of Approximate Bayesian Inference Methods for Deep Neural Networks
Authors:
Meet P. Vadera,
Adam D. Cobb,
Brian Jalaian,
Benjamin M. Marlin
Abstract:
While deep learning methods continue to improve in predictive accuracy on a wide range of application domains, significant issues remain with other aspects of their performance, including their ability to quantify uncertainty and their robustness. Recent advances in approximate Bayesian inference hold significant promise for addressing these concerns, but the computational scalability of these methods can be problematic when applied to large-scale models. In this paper, we describe initial work on the development of URSABench (the Uncertainty, Robustness, Scalability, and Accuracy Benchmark), an open-source suite of benchmarking tools for comprehensive assessment of approximate Bayesian inference methods with a focus on deep learning-based classification tasks.
Submitted 8 July, 2020;
originally announced July 2020.
-
Accurate Machine Learning Atmospheric Retrieval via a Neural Network Surrogate Model for Radiative Transfer
Authors:
Michael D. Himes,
Joseph Harrington,
Adam D. Cobb,
Atilim Gunes Baydin,
Frank Soboczenski,
Molly D. O'Beirne,
Simone Zorzan,
David C. Wright,
Zacchaeus Scheffer,
Shawn D. Domagal-Goldman,
Giada N. Arney
Abstract:
Atmospheric retrieval determines the properties of an atmosphere based on its measured spectrum. The low signal-to-noise ratio of exoplanet observations requires a Bayesian approach to determine posterior probability distributions of each model parameter, given observed spectra. This inference is computationally expensive, as it requires many executions of a costly radiative transfer (RT) simulation for each set of sampled model parameters. Machine learning (ML) has recently been shown to provide a significant reduction in runtime for retrievals, mainly by training inverse ML models that predict parameter distributions, given observed spectra, albeit with reduced posterior accuracy. Here we present a novel approach to retrieval by training a forward ML surrogate model that predicts spectra given model parameters, providing a fast approximate RT simulation that can be used in a conventional Bayesian retrieval framework without significant loss of accuracy. We demonstrate our method on the emission spectrum of HD 189733 b and find good agreement with a traditional retrieval from the Bayesian Atmospheric Radiative Transfer (BART) code (Bhattacharyya coefficients of 0.9843--0.9972, with a mean of 0.9925, between 1D marginalized posteriors). This accuracy comes while still offering significant speed enhancements over traditional RT, albeit not as much as ML methods with lower posterior accuracy. Our method is ~9x faster per parallel chain than BART when run on an AMD EPYC 7402P central processing unit (CPU). Neural-network computation using an NVIDIA Titan Xp graphics processing unit is 90--180x faster per chain than BART on that CPU.
Submitted 3 May, 2022; v1 submitted 4 March, 2020;
originally announced March 2020.
-
HumBug Zooniverse: a crowd-sourced acoustic mosquito dataset
Authors:
Ivan Kiskin,
Adam D. Cobb,
Lawrence Wang,
Stephen Roberts
Abstract:
Mosquitoes are the only known vector of malaria, which leads to hundreds of thousands of deaths each year. Understanding the number and location of potential mosquito vectors is of paramount importance to aid the reduction of malaria transmission cases. In recent years, deep learning has become widely used for bioacoustic classification tasks. In order to enable further research applications in this field, we release a new dataset of mosquito audio recordings. With over a thousand contributors, we obtained 195,434 labels of two second duration, of which approximately 10 percent signify mosquito events. We present an example use of the dataset, in which we train a convolutional neural network on log-Mel features, showcasing the information content of the labels. We hope this will become a vital resource for those researching all aspects of malaria, and add to the existing audio datasets for bioacoustic detection and signal processing.
Submitted 14 February, 2020; v1 submitted 14 January, 2020;
originally announced January 2020.
-
On the fast rotation asymptotics of a non-homogeneous incompressible MHD system
Authors:
Dimitri Cobb,
Francesco Fanelli
Abstract:
This paper is devoted to the analysis of a singular perturbation problem for a $2$-D incompressible MHD system with density variations and Coriolis force, in the limit of small Rossby numbers. Two regimes are considered. The first one is the quasi-homogeneous regime, where the densities are small perturbations around a constant state. The limit dynamics is identified as an incompressible homogeneous MHD system, coupled with an additional transport equation for the limit of the density variations. The second case is the fully non-homogeneous regime, where the densities vary around a general non-constant profile. In this case, in the limit, the equation for the magnetic field combines with an underdetermined linear equation, which links the limit density variation function with the limit velocity field. The proof is based on a compensated compactness argument, which enables us to consider general ill-prepared initial data. An application of DiPerna-Lions theory for transport equations allows us to treat the case of density-dependent viscosity and resistivity coefficients.
Submitted 9 December, 2019;
originally announced December 2019.
-
Introducing an Explicit Symplectic Integration Scheme for Riemannian Manifold Hamiltonian Monte Carlo
Authors:
Adam D. Cobb,
Atılım Güneş Baydin,
Andrew Markham,
Stephen J. Roberts
Abstract:
We introduce a recent symplectic integration scheme derived for solving physically motivated systems with non-separable Hamiltonians. We show its relevance to Riemannian manifold Hamiltonian Monte Carlo (RMHMC) and provide an alternative to the currently used generalised leapfrog symplectic integrator, which relies on solving multiple fixed point iterations to convergence. Via this approach, we are able to reduce the number of higher-order derivative calculations per leapfrog step. We explore the implications of this integrator and demonstrate its efficacy in reducing the computational burden of RMHMC. Our code is provided in a new open-source Python package, hamiltorch.
Submitted 14 October, 2019;
originally announced October 2019.
-
An Ensemble of Bayesian Neural Networks for Exoplanetary Atmospheric Retrieval
Authors:
Adam D. Cobb,
Michael D. Himes,
Frank Soboczenski,
Simone Zorzan,
Molly D. O'Beirne,
Atılım Güneş Baydin,
Yarin Gal,
Shawn D. Domagal-Goldman,
Giada N. Arney,
Daniel Angerhausen
Abstract:
Machine learning is now used in many areas of astrophysics, from detecting exoplanets in Kepler transit signals to removing telescope systematics. Recent work demonstrated the potential of using machine learning algorithms for atmospheric retrieval by implementing a random forest to perform retrievals in seconds that are consistent with the traditional, computationally-expensive nested-sampling retrieval method. We expand upon their approach by presenting a new machine learning model, \texttt{plan-net}, based on an ensemble of Bayesian neural networks that yields more accurate inferences than the random forest for the same data set of synthetic transmission spectra. We demonstrate that an ensemble provides greater accuracy and more robust uncertainties than a single model. In addition to being the first to use Bayesian neural networks for atmospheric retrieval, we also introduce a new loss function for Bayesian neural networks that learns correlations between the model outputs. Importantly, we show that designing machine learning models to explicitly incorporate domain-specific knowledge both improves performance and provides additional insight by inferring the covariance of the retrieved atmospheric parameters. We apply \texttt{plan-net} to the Hubble Space Telescope Wide Field Camera 3 transmission spectrum for WASP-12b and retrieve an isothermal temperature and water abundance consistent with the literature. We highlight that our method is flexible and can be expanded to higher-resolution spectra and a larger number of atmospheric parameters.
Submitted 25 May, 2019;
originally announced May 2019.
-
Bayesian deep neural networks for low-cost neurophysiological markers of Alzheimer's disease severity
Authors:
Wolfgang Fruehwirt,
Adam D. Cobb,
Martin Mairhofer,
Leonard Weydemann,
Heinrich Garn,
Reinhold Schmidt,
Thomas Benke,
Peter Dal-Bianco,
Gerhard Ransmayr,
Markus Waser,
Dieter Grossegger,
Pengfei Zhang,
Georg Dorffner,
Stephen Roberts
Abstract:
As societies around the world are ageing, the number of Alzheimer's disease (AD) patients is rapidly increasing. To date, no low-cost, non-invasive biomarkers have been established to advance the objectivization of AD diagnosis and progression assessment. Here, we utilize Bayesian neural networks to develop a multivariate predictor for AD severity using a wide range of quantitative EEG (QEEG) markers. The Bayesian treatment of neural networks both automatically controls model complexity and provides a predictive distribution over the target function, giving uncertainty bounds for our regression task. It is therefore well suited to clinical neuroscience, where data sets are typically sparse and practitioners require a precise assessment of the predictive uncertainty. We use data of one of the largest prospective AD EEG trials ever conducted to demonstrate the potential of Bayesian deep learning in this domain, while comparing two distinct Bayesian neural network approaches, i.e., Monte Carlo dropout and Hamiltonian Monte Carlo.
Submitted 13 December, 2018; v1 submitted 12 December, 2018;
originally announced December 2018.
-
Bayesian Deep Learning for Exoplanet Atmospheric Retrieval
Authors:
Frank Soboczenski,
Michael D. Himes,
Molly D. O'Beirne,
Simone Zorzan,
Atilim Gunes Baydin,
Adam D. Cobb,
Yarin Gal,
Daniel Angerhausen,
Massimo Mascaro,
Giada N. Arney,
Shawn D. Domagal-Goldman
Abstract:
Over the past decade, the study of extrasolar planets has evolved rapidly from plain detection and identification to comprehensive categorization and characterization of exoplanet systems and their atmospheres. Atmospheric retrieval, the inverse modeling technique used to determine an exoplanetary atmosphere's temperature structure and composition from an observed spectrum, is both time-consuming and compute-intensive, requiring complex algorithms that compare thousands to millions of atmospheric models to the observational data to find the most probable values and associated uncertainties for each model parameter. For rocky, terrestrial planets, the retrieved atmospheric composition can give insight into the surface fluxes of gaseous species necessary to maintain the stability of that atmosphere, which may in turn provide insight into the geological and/or biological processes active on the planet. These atmospheres contain many molecules, some of them biosignatures, spectral fingerprints indicative of biological activity, which will become observable with the next generation of telescopes. Runtimes of traditional retrieval models scale with the number of model parameters, so as more molecular species are considered, runtimes can become prohibitively long. Recent advances in machine learning (ML) and computer vision offer new ways to reduce the time to perform a retrieval by orders of magnitude, given a sufficient data set to train with. Here we present an ML-based retrieval framework called Intelligent exoplaNet Atmospheric RetrievAl (INARA) that consists of a Bayesian deep learning model for retrieval and a data set of 3,000,000 synthetic rocky exoplanetary spectra generated using the NASA Planetary Spectrum Generator. Our work represents the first ML retrieval model for rocky, terrestrial exoplanets and the first synthetic data set of terrestrial spectra generated at this scale.
Submitted 2 December, 2018; v1 submitted 8 November, 2018;
originally announced November 2018.
-
Loss-Calibrated Approximate Inference in Bayesian Neural Networks
Authors:
Adam D. Cobb,
Stephen J. Roberts,
Yarin Gal
Abstract:
Current approaches in approximate inference for Bayesian neural networks minimise the Kullback-Leibler divergence to approximate the true posterior over the weights. However, this approximation is without knowledge of the final application, and therefore cannot guarantee optimal predictions for a given task. To make more suitable task-specific approximations, we introduce a new loss-calibrated evidence lower bound for Bayesian neural networks in the context of supervised learning, informed by Bayesian decision theory. By introducing a lower bound that depends on a utility function, we ensure that our approximation achieves higher utility than traditional methods for applications that have asymmetric utility functions. Furthermore, in using dropout inference, we highlight that our new objective is identical to that of standard dropout neural networks, with an additional utility-dependent penalty term. We demonstrate our new loss-calibrated model with an illustrative medical example and a restricted model capacity experiment, and highlight failure modes of the comparable weighted cross entropy approach. Lastly, we demonstrate the scalability of our method to real world applications with per-pixel semantic segmentation on an autonomous driving data set.
Submitted 10 May, 2018;
originally announced May 2018.
-
Identifying Sources and Sinks in the Presence of Multiple Agents with Gaussian Process Vector Calculus
Authors:
Adam D. Cobb,
Richard Everett,
Andrew Markham,
Stephen J. Roberts
Abstract:
In systems of multiple agents, identifying the cause of observed agent dynamics is challenging. Often, these agents operate in diverse, non-stationary environments, where models rely on hand-crafted environment-specific features to infer influential regions in the system's surroundings. To overcome the limitations of these inflexible models, we present GP-LAPLACE, a technique for locating sources and sinks from trajectories in time-varying fields. Using Gaussian processes, we jointly infer a spatio-temporal vector field, as well as canonical vector calculus operations on that field. Notably, we do this from only agent trajectories without requiring knowledge of the environment, and also obtain a metric for denoting the significance of inferred causal features in the environment by exploiting our probabilistic method. To evaluate our approach, we apply it to both synthetic and real-world GPS data, demonstrating the applicability of our technique in the presence of multiple agents, as well as its superiority over existing methods.
Submitted 12 November, 2018; v1 submitted 22 February, 2018;
originally announced February 2018.
-
Learning from lions: inferring the utility of agents from their trajectories
Authors:
Adam D. Cobb,
Andrew Markham,
Stephen J. Roberts
Abstract:
We build a model using Gaussian processes to infer a spatio-temporal vector field from observed agent trajectories. Significant landmarks or influence points in agent surroundings are jointly derived through vector calculus operations that indicate presence of sources and sinks. We evaluate these influence points by using the Kullback-Leibler divergence between the posterior and prior Laplacian of the inferred spatio-temporal vector field. Through locating significant features that influence trajectories, our model aims to give greater insight into underlying causal utility functions that determine agent decision-making. A key feature of our model is that it infers a joint Gaussian process over the observed trajectories, the time-varying vector field of utility and canonical vector calculus operators. We apply our model to both synthetic data and lion GPS data collected at the Bubye Valley Conservancy in southern Zimbabwe.
Submitted 7 September, 2017;
originally announced September 2017.