Search | arXiv e-print repository

Existence and Uniqueness for the SQG Vortex-Wave System when the Vorticity is Constant near the Point-Vortex

Authors: Dimitri Cobb, Martin Donati, Ludovic Godard-Cadillac

Abstract: This article studies the vortex-wave system for the Surface Quasi-Geostrophic equation with parameter 0 < s < 1. We obtained local existence of classical solutions in H^4 under the standard "plateau hypothesis", H^2-stability of the solutions, and a blow-up criterion. In the sub-critical case s > 1/2 we established global existence of weak solutions. For the critical case s = 1/2, we introduced a… ▽ More This article studies the vortex-wave system for the Surface Quasi-Geostrophic equation with parameter 0 < s < 1. We obtained local existence of classical solutions in H^4 under the standard "plateau hypothesis", H^2-stability of the solutions, and a blow-up criterion. In the sub-critical case s > 1/2 we established global existence of weak solutions. For the critical case s = 1/2, we introduced a weaker notion of solution (V-weak solutions) to give a meaning to the equation and prove global existence. △ Less

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2401.02599 [pdf, ps, other]

Weak Solutions for a non-Newtonian Stokes-Transport System

Authors: Dimitri Cobb, Geoffrey Lacour

Abstract: In this article, we study a non-Newtonian Stokes-Transport system. This set of PDEs was introduced as a model for describing the behavior of a cloud of particles in suspension in a Stokes fluid, and is a nonlinear coupling between a hyperbolic equation (Transport) and a nonlinear elliptic equation (non-Newtonian Stokes), and as such can be considered as an active scalar equation. We prove the exis… ▽ More In this article, we study a non-Newtonian Stokes-Transport system. This set of PDEs was introduced as a model for describing the behavior of a cloud of particles in suspension in a Stokes fluid, and is a nonlinear coupling between a hyperbolic equation (Transport) and a nonlinear elliptic equation (non-Newtonian Stokes), and as such can be considered as an active scalar equation. We prove the existence of global weak solutions with initial data in critical Lebesgue spaces. In order to overcome the difficulties introduced by the highly nonlinear aspect of this problem, we resort to a combination of DiPerna-Lions theory of transport equations and Minty's trick for elliptic equations. △ Less

Submitted 4 January, 2024; originally announced January 2024.

MSC Class: 35M31; 35Q35; 76A05; 76T20; 35J92

arXiv:2311.17020 [pdf, other]

High-contrast JWST-MIRI spectroscopy of planet-forming disks for the JDISC Survey

Authors: Klaus M. Pontoppidan, Colette Salyk, Andrea Banzatti, Ke Zhang, Ilaria Pascucci, Karin I. Oberg, Feng Long, Carlos Munoz-Romero, John Carr, Joan Najita, Geoffrey A. Blake, Nicole Arulanantham, Sean Andrews, Nicholas P. Ballering, Edwin Bergin, Jenny Calahan, Douglas Cobb, Maria Jose Colmenares, Annie Dickson-Vandervelde, Anna Dignan, Joel Green, Phoebe Heretz, Greg Herczeg, Anusha Kalyaan, Sebastian Krijt , et al. (4 additional authors not shown)

Abstract: The JWST Disk Infrared Spectral Chemistry Survey (JDISCS) aims to understand the evolution of the chemistry of inner protoplanetary disks using the Mid-InfraRed Instrument (MIRI) on the James Webb Space Telescope (JWST). With a growing sample of >30 disks, the survey implements a custom method to calibrate the MIRI Medium Resolution Spectrometer (MRS) to contrasts of better than 1:300 across its 4… ▽ More The JWST Disk Infrared Spectral Chemistry Survey (JDISCS) aims to understand the evolution of the chemistry of inner protoplanetary disks using the Mid-InfraRed Instrument (MIRI) on the James Webb Space Telescope (JWST). With a growing sample of >30 disks, the survey implements a custom method to calibrate the MIRI Medium Resolution Spectrometer (MRS) to contrasts of better than 1:300 across its 4.9-28 micron spectral range. This is achieved using observations of Themis-family asteroids as precise empirical reference sources. High spectral contrast enables precise retrievals of physical parameters, searches for rare molecular species and isotopologues, and constraints on the inventories of carbon- and nitrogen-bearing species. JDISCS also offers significant improvements to the MRS wavelength and resolving power calibration. We describe the JDISCS calibrated data and demonstrate its quality using observations of the disk around the solar-mass young star FZ Tau. The FZ Tau MIRI spectrum is dominated by strong emission from warm water vapor. We show that the water and CO line emission originates from the disk surface and traces a range of gas temperatures of ~500-1500 K. We retrieve parameters for the observed CO and H2O lines, and show that they are consistent with a radial distribution represented by two temperature components. A high water abundance of n(H2O)~10^-4 fills the disk surface at least out to the 350 K isotherm at 1.5 au. We search the FZ Tau environs for extended emission detecting a large (radius of ~300 au) ring of emission from H2 gas surrounding FZ Tau, and discuss its origin. △ Less

Submitted 16 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

Comments: Accepted for publication in the Astrophysical Journal

arXiv:2311.10571 [pdf, other]

Direct Amortized Likelihood Ratio Estimation

Authors: Adam D. Cobb, Brian Matejek, Daniel Elenius, Anirban Roy, Susmit Jha

Abstract: We introduce a new amortized likelihood ratio estimator for likelihood-free simulation-based inference (SBI). Our estimator is simple to train and estimates the likelihood ratio using a single forward pass of the neural estimator. Our approach directly computes the likelihood ratio between two competing parameter sets which is different from the previous approach of comparing two neural network ou… ▽ More We introduce a new amortized likelihood ratio estimator for likelihood-free simulation-based inference (SBI). Our estimator is simple to train and estimates the likelihood ratio using a single forward pass of the neural estimator. Our approach directly computes the likelihood ratio between two competing parameter sets which is different from the previous approach of comparing two neural network output values. We refer to our model as the direct neural ratio estimator (DNRE). As part of introducing the DNRE, we derive a corresponding Monte Carlo estimate of the posterior. We benchmark our new ratio estimator and compare to previous ratio estimators in the literature. We show that our new ratio estimator often outperforms these previous approaches. As a further contribution, we introduce a new derivative estimator for likelihood ratio estimators that enables us to compare likelihood-free Hamiltonian Monte Carlo (HMC) with random-walk Metropolis-Hastings (MH). We show that HMC is equally competitive, which has not been previously shown. Finally, we include a novel real-world application of SBI by using our neural ratio estimator to design a quadcopter. Code is available at https://github.com/SRI-CSL/dnre. △ Less

Submitted 17 November, 2023; originally announced November 2023.

Comments: 12 Pages, 10 Figures, GitHub: https://github.com/SRI-CSL/dnre

arXiv:2306.05562 [pdf, other]

AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs

Authors: Adam D. Cobb, Anirban Roy, Daniel Elenius, F. Michael Heim, Brian Swenson, Sydney Whittington, James D. Walker, Theodore Bapty, Joseph Hite, Karthik Ramani, Christopher McComb, Susmit Jha

Abstract: We present AircraftVerse, a publicly available aerial vehicle design dataset. Aircraft design encompasses different physics domains and, hence, multiple modalities of representation. The evaluation of these cyber-physical system (CPS) designs requires the use of scientific analytical and simulation models ranging from computer-aided design tools for structural and manufacturing analysis, computati… ▽ More We present AircraftVerse, a publicly available aerial vehicle design dataset. Aircraft design encompasses different physics domains and, hence, multiple modalities of representation. The evaluation of these cyber-physical system (CPS) designs requires the use of scientific analytical and simulation models ranging from computer-aided design tools for structural and manufacturing analysis, computational fluid dynamics tools for drag and lift computation, battery models for energy estimation, and simulation models for flight control and dynamics. AircraftVerse contains 27,714 diverse air vehicle designs - the largest corpus of engineering designs with this level of complexity. Each design comprises the following artifacts: a symbolic design tree describing topology, propulsion subsystem, battery subsystem, and other design details; a STandard for the Exchange of Product (STEP) model data; a 3D CAD design using a stereolithography (STL) file format; a 3D point cloud for the shape of the design; and evaluation results from high fidelity state-of-the-art physics models that characterize performance metrics such as maximum flight distance and hover-time. We also present baseline surrogate models that use different modalities of design representation to predict design performance metrics, which we provide as part of our dataset release. Finally, we discuss the potential impact of this dataset on the use of learning in aircraft design and, more generally, in CPS. AircraftVerse is accompanied by a data card, and it is released under Creative Commons Attribution-ShareAlike (CC BY-SA) license. The dataset is hosted at https://zenodo.org/record/6525446, baseline models and code at https://github.com/SRI-CSL/AircraftVerse, and the dataset description at https://aircraftverse.onrender.com/. △ Less

Submitted 8 June, 2023; originally announced June 2023.

Comments: The dataset is hosted at https://zenodo.org/record/6525446, baseline models and code at https://github.com/SRI-CSL/AircraftVerse, and the dataset description at https://aircraftverse.onrender.com/

arXiv:2305.11243 [pdf]

Comparing Machines and Children: Using Developmental Psychology Experiments to Assess the Strengths and Weaknesses of LaMDA Responses

Authors: Eliza Kosoy, Emily Rose Reagan, Leslie Lai, Alison Gopnik, Danielle Krettek Cobb

Abstract: Developmental psychologists have spent decades devising experiments to test the intelligence and knowledge of infants and children, tracing the origin of crucial concepts and capacities. Moreover, experimental techniques in developmental psychology have been carefully designed to discriminate the cognitive capacities that underlie particular behaviors. We propose that using classical experiments f… ▽ More Developmental psychologists have spent decades devising experiments to test the intelligence and knowledge of infants and children, tracing the origin of crucial concepts and capacities. Moreover, experimental techniques in developmental psychology have been carefully designed to discriminate the cognitive capacities that underlie particular behaviors. We propose that using classical experiments from child development is a particularly effective way to probe the computational abilities of AI models, in general, and LLMs in particular. First, the methodological techniques of developmental psychology, such as the use of novel stimuli to control for past experience or control conditions to determine whether children are using simple associations, can be equally helpful for assessing the capacities of LLMs. In parallel, testing LLMs in this way can tell us whether the information that is encoded in text is sufficient to enable particular responses, or whether those responses depend on other kinds of information, such as information from exploration of the physical world. In this work we adapt classical developmental experiments to evaluate the capabilities of LaMDA, a large language model from Google. We propose a novel LLM Response Score (LRS) metric which can be used to evaluate other language models, such as GPT. We find that LaMDA generates appropriate responses that are similar to those of children in experiments involving social understanding, perhaps providing evidence that knowledge of these domains is discovered through language. On the other hand, LaMDA's responses in early object and action understanding, theory of mind, and especially causal reasoning tasks are very different from those of young children, perhaps showing that these domains require more real-world, self-initiated exploration and cannot simply be learned from patterns in language input. △ Less

Submitted 7 November, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

Comments: 9 pages, 7 figures

arXiv:2301.10511 [pdf, ps, other]

On the Well-Posedness of a Fractional Stokes-Transport System

Authors: Dimitri Cobb

Abstract: The purpose of this paper is to study the existence, uniqueness and lifespan of solutions for a fractional Stokes-Transport system. This problem should be understood as a model for sedimentation in a fluid where the viscosity law is given by a fractional Lapalce operator $(- Δ)^{α/2}$, with $α= 2$ corresponding to the case of a normal viscous fluid, and $α= 0$ reducing the problem to the Inviscid… ▽ More The purpose of this paper is to study the existence, uniqueness and lifespan of solutions for a fractional Stokes-Transport system. This problem should be understood as a model for sedimentation in a fluid where the viscosity law is given by a fractional Lapalce operator $(- Δ)^{α/2}$, with $α= 2$ corresponding to the case of a normal viscous fluid, and $α= 0$ reducing the problem to the Inviscid Incompressible Porous Media equation. For each value of $α\in [0, d]$, we prove various results related to well-posedness in critical function spaces, such as the existence of global weak solutions (for $α> 0$), local existence and uniqueness (for $α\geq 0$), global existence and uniqueness (for $α\geq 1$), as well as study the lifespan of local solutions (for $0 \leq α< 1$). In particular, we show that gravity stratification leads to a directional blow-up criterion for local solutions (for $α\in [0, 1[$) and find a lower bound for the lifespan of solutions which depends on the value of the dissipation parameter $α\in [0, 1[$. △ Less

Submitted 25 January, 2023; originally announced January 2023.

Comments: 48 pages, submitted

MSC Class: 35Q35 (primary); 76D03; 35Q49; 35S10; 76D50 (secondary)

arXiv:2211.08138 [pdf, other]

Design of Unmanned Air Vehicles Using Transformer Surrogate Models

Authors: Adam D. Cobb, Anirban Roy, Daniel Elenius, Susmit Jha

Abstract: Computer-aided design (CAD) is a promising new area for the application of artificial intelligence (AI) and machine learning (ML). The current practice of design of cyber-physical systems uses the digital twin methodology, wherein the actual physical design is preceded by building detailed models that can be evaluated by physics simulation models. These physics models are often slow and the manual… ▽ More Computer-aided design (CAD) is a promising new area for the application of artificial intelligence (AI) and machine learning (ML). The current practice of design of cyber-physical systems uses the digital twin methodology, wherein the actual physical design is preceded by building detailed models that can be evaluated by physics simulation models. These physics models are often slow and the manual design process often relies on exploring near-by variations of existing designs. AI holds the promise of breaking these design silos and increasing the diversity and performance of designs by accelerating the exploration of the design space. In this paper, we focus on the design of electrical unmanned aerial vehicles (UAVs). The high-density batteries and purely electrical propulsion systems have disrupted the space of UAV design, making this domain an ideal target for AI-based design. In this paper, we develop an AI Designer that synthesizes novel UAV designs. Our approach uses a deep transformer model with a novel domain-specific encoding such that we can evaluate the performance of new proposed designs without running expensive flight dynamics models and CAD tools. We demonstrate that our approach significantly reduces the overall compute requirements for the design process and accelerates the design space exploration. Finally, we identify future research directions to achieve full-scale deployment of AI-assisted CAD for UAVs. △ Less

Submitted 11 November, 2022; originally announced November 2022.

Comments: 8 pages, 8 figures

arXiv:2207.07415 [pdf, ps, other]

Remarks on Chemin's space of homogeneous distributions

Authors: Dimitri Cobb

Abstract: This article focuses on Chemin's space $\mathcal{S}'_h$ of homogeneous distributions, which was introduced to serve as a basis for realizations of subcritical homogeneous Besov spaces. We will discuss how this construction fails in multiple ways for supercritical spaces. In particular, we study its intersection $X_h := \mathcal{S}'_h \cap X$ with various Banach spaces $X$, namely supercritical hom… ▽ More This article focuses on Chemin's space $\mathcal{S}'_h$ of homogeneous distributions, which was introduced to serve as a basis for realizations of subcritical homogeneous Besov spaces. We will discuss how this construction fails in multiple ways for supercritical spaces. In particular, we study its intersection $X_h := \mathcal{S}'_h \cap X$ with various Banach spaces $X$, namely supercritical homogeneous Besov spaces and the Lebesgue space $L^\infty$. For each $X$, we find out if the intersection $X_h$ is dense in $X$. If it is not, then we study its closure $C = {\rm clos}(X_h)$ and prove that the quotient $X/C$ is not separable and that $C$ is not complemented in $X$. △ Less

Submitted 15 July, 2022; originally announced July 2022.

Comments: Submitted. The material in this article appears in the authors PhD dissertation "Etude mathématique de fluides en interaction avec un champ magnétique" (to appear on arXiv)

MSC Class: 42B35 (Primary); 46E30; 46E35; 42B37 (Secondary)

arXiv:2202.03770 [pdf, other]

Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning

Authors: Meet P. Vadera, Adam D. Cobb, Brian Jalaian, Benjamin M. Marlin

Abstract: Bayesian methods hold significant promise for improving the uncertainty quantification ability and robustness of deep neural network models. Recent research has seen the investigation of a number of approximate Bayesian inference methods for deep neural networks, building on both the variational Bayesian and Markov chain Monte Carlo (MCMC) frameworks. A fundamental issue with MCMC methods is that… ▽ More Bayesian methods hold significant promise for improving the uncertainty quantification ability and robustness of deep neural network models. Recent research has seen the investigation of a number of approximate Bayesian inference methods for deep neural networks, building on both the variational Bayesian and Markov chain Monte Carlo (MCMC) frameworks. A fundamental issue with MCMC methods is that the improvements they enable are obtained at the expense of increased computation time and model storage costs. In this paper, we investigate the potential of sparse network structures to flexibly trade-off model storage costs and inference run time against predictive performance and uncertainty quantification ability. We use stochastic gradient MCMC methods as the core Bayesian inference method and consider a variety of approaches for selecting sparse network structures. Surprisingly, our results show that certain classes of randomly selected substructures can perform as well as substructures derived from state-of-the-art iterative pruning methods while drastically reducing model training times. △ Less

Submitted 8 February, 2022; originally announced February 2022.

Comments: Preprint. Work in progress

arXiv:2110.07607 [pdf, other]

HumBugDB: A Large-scale Acoustic Mosquito Dataset

Authors: Ivan Kiskin, Marianne Sinka, Adam D. Cobb, Waqas Rafique, Lawrence Wang, Davide Zilli, Benjamin Gutteridge, Rinita Dam, Theodoros Marinos, Yunpeng Li, Dickson Msaky, Emmanuel Kaindoa, Gerard Killeen, Eva Herreros-Moya, Kathy J. Willis, Stephen J. Roberts

Abstract: This paper presents the first large-scale multi-species dataset of acoustic recordings of mosquitoes tracked continuously in free flight. We present 20 hours of audio recordings that we have expertly labelled and tagged precisely in time. Significantly, 18 hours of recordings contain annotations from 36 different species. Mosquitoes are well-known carriers of diseases such as malaria, dengue and y… ▽ More This paper presents the first large-scale multi-species dataset of acoustic recordings of mosquitoes tracked continuously in free flight. We present 20 hours of audio recordings that we have expertly labelled and tagged precisely in time. Significantly, 18 hours of recordings contain annotations from 36 different species. Mosquitoes are well-known carriers of diseases such as malaria, dengue and yellow fever. Collecting this dataset is motivated by the need to assist applications which utilise mosquito acoustics to conduct surveys to help predict outbreaks and inform intervention policy. The task of detecting mosquitoes from the sound of their wingbeats is challenging due to the difficulty in collecting recordings from realistic scenarios. To address this, as part of the HumBug project, we conducted global experiments to record mosquitoes ranging from those bred in culture cages to mosquitoes captured in the wild. Consequently, the audio recordings vary in signal-to-noise ratio and contain a broad range of indoor and outdoor background environments from Tanzania, Thailand, Kenya, the USA and the UK. In this paper we describe in detail how we collected, labelled and curated the data. The data is provided from a PostgreSQL database, which contains important metadata such as the capture method, age, feeding status and gender of the mosquitoes. Additionally, we provide code to extract features and train Bayesian convolutional neural networks for two key tasks: the identification of mosquitoes from their corresponding background environments, and the classification of detected mosquitoes into species. Our extensive dataset is both challenging to machine learning researchers focusing on acoustic identification, and critical to entomologists, geo-spatial modellers and other domain experts to understand mosquito behaviour, model their distribution, and manage the threat they pose to humans. △ Less

Submitted 14 October, 2021; originally announced October 2021.

Comments: Accepted at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks. 10 pages main, 39 pages including appendix. This paper accompanies the dataset found at https://zenodo.org/record/4904800 with corresponding code at https://github.com/HumBug-Mosquito/HumBugDB

ACM Class: E.0; I.2.1; J.3

arXiv:2105.03257 [pdf, ps, other]

Bounded Solutions in Incompressible Hydrodynamics

Authors: Dimitri Cobb

Abstract: In this article, we study bounded solutions of Euler-type equations on $\mathbb{R}^d$ which have no integrability at $|x| \rightarrow +\infty$. As has been previously noted, such solutions fail to achieve uniqueness in an initial value problem, even under strong smoothness conditions. This contrasts with well-posedness results that have been obtained by using the Leray projection operator in these… ▽ More In this article, we study bounded solutions of Euler-type equations on $\mathbb{R}^d$ which have no integrability at $|x| \rightarrow +\infty$. As has been previously noted, such solutions fail to achieve uniqueness in an initial value problem, even under strong smoothness conditions. This contrasts with well-posedness results that have been obtained by using the Leray projection operator in these equations. This apparent paradox is solved by noting that using the Leray projector requires an extra condition the solutions must fulfill at $|x| \rightarrow + \infty$. Our goal is to find one such condition which is sharp. We then apply the methods we develop to prove a full uniqueness result for Besov-Lipschitz solutions, as to the theory of Serfati solutions. In the last Section, we see how these techniques also apply to the Elsässer variables used in ideal MHD. △ Less

Submitted 23 January, 2023; v1 submitted 7 May, 2021; originally announced May 2021.

Comments: Second version. 34 pages, submitted. Significant changes: important references added, main theorem expanded, added detailed comparision with previous results, added application to Serfati solutions

MSC Class: 35Q35 (primary); 35A02; 35B30; 35Q31; 35S30

arXiv:2102.13586 [pdf, ps, other]

Symmetry breaking in ideal magnetohydrodynamics: the role of the velocity

Authors: Dimitri Cobb, Francesco Fanelli

Abstract: The ideal magnetohydrodynamic equations are, roughly speaking, a quasi-linear symmetric hyperbolic system of PDEs, but not all the unknowns play the same role in this system. Indeed, in the regime of small magnetic fields, the equations are close to the incompressible Euler equations. In the present paper, we adopt this point of view to study questions linked with the lifespan of strong solutions… ▽ More The ideal magnetohydrodynamic equations are, roughly speaking, a quasi-linear symmetric hyperbolic system of PDEs, but not all the unknowns play the same role in this system. Indeed, in the regime of small magnetic fields, the equations are close to the incompressible Euler equations. In the present paper, we adopt this point of view to study questions linked with the lifespan of strong solutions to the ideal magnetohydrodynamic equations. First of all, we prove a continuation criterion in terms of the velocity field only. Secondly, we refine the explicit lower bound for the lifespan of $2$-D flows found in [11], by relaxing the regularity assumptions on the initial magnetic field. △ Less

Submitted 26 February, 2021; originally announced February 2021.

Comments: Submitted

arXiv:2012.06453 [pdf, other]

Better call Surrogates: A hybrid Evolutionary Algorithm for Hyperparameter optimization

Authors: Subhodip Biswas, Adam D Cobb, Andreea Sistrunk, Naren Ramakrishnan, Brian Jalaian

Abstract: In this paper, we propose a surrogate-assisted evolutionary algorithm (EA) for hyperparameter optimization of machine learning (ML) models. The proposed STEADE model initially estimates the objective function landscape using RadialBasis Function interpolation, and then transfers the knowledge to an EA technique called Differential Evolution that is used to evolve new solutions guided by a Bayesian… ▽ More In this paper, we propose a surrogate-assisted evolutionary algorithm (EA) for hyperparameter optimization of machine learning (ML) models. The proposed STEADE model initially estimates the objective function landscape using RadialBasis Function interpolation, and then transfers the knowledge to an EA technique called Differential Evolution that is used to evolve new solutions guided by a Bayesian optimization framework. We empirically evaluate our model on the hyperparameter optimization problems as a part of the black box optimization challenge at NeurIPS 2020 and demonstrate the improvement brought about by STEADE over the vanilla EA. △ Less

Submitted 11 December, 2020; originally announced December 2020.

Comments: Accepted at the black box optimization challenge at NeurIPS 2020

arXiv:2010.06772 [pdf, other]

Scaling Hamiltonian Monte Carlo Inference for Bayesian Neural Networks with Symmetric Splitting

Authors: Adam D. Cobb, Brian Jalaian

Abstract: Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo (MCMC) approach that exhibits favourable exploration properties in high-dimensional models such as neural networks. Unfortunately, HMC has limited use in large-data regimes and little work has explored suitable approaches that aim to preserve the entire Hamiltonian. In our work, we introduce a new symmetric integration scheme for split HM… ▽ More Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo (MCMC) approach that exhibits favourable exploration properties in high-dimensional models such as neural networks. Unfortunately, HMC has limited use in large-data regimes and little work has explored suitable approaches that aim to preserve the entire Hamiltonian. In our work, we introduce a new symmetric integration scheme for split HMC that does not rely on stochastic gradients. We show that our new formulation is more efficient than previous approaches and is easy to implement with a single GPU. As a result, we are able to perform full HMC over common deep learning architectures using entire data sets. In addition, when we compare with stochastic gradient MCMC, we show that our method achieves better performance in both accuracy and uncertainty quantification. Our approach demonstrates HMC as a feasible option when considering inference schemes for large-scale machine learning problems. △ Less

Submitted 13 October, 2020; originally announced October 2020.

Comments: 11 pages, 13 figures

arXiv:2009.11230 [pdf, ps, other]

Elsässer formulation of the ideal MHD and improved lifespan in two space dimensions

Authors: Dimitri Cobb, Francesco Fanelli

Abstract: In the present paper, we show an improved lower bound for the lifespan of the solutions to the ideal MHD equations in the case of space dimension $d=2$. In particular, for small initial magnetic fields $b_0$ of size (say) $\varepsilon>0$, the lifespan $T_\varepsilon>0$ of the corresponding solution goes to $+\infty$ in the limit $\varepsilon\rightarrow0^+$. Such a result does not follow from sta… ▽ More In the present paper, we show an improved lower bound for the lifespan of the solutions to the ideal MHD equations in the case of space dimension $d=2$. In particular, for small initial magnetic fields $b_0$ of size (say) $\varepsilon>0$, the lifespan $T_\varepsilon>0$ of the corresponding solution goes to $+\infty$ in the limit $\varepsilon\rightarrow0^+$. Such a result does not follow from standard quasi-linear hyperbolic theory. For proving it, three are the crucial ingredients: first of all, to work in endpoint Besov spaces $B^s_{\infty,r}$, under the condition $s>1$ and $r\in[1,+\infty]$ or $s=r=1$; moreover, to use the Elsässer formulation of the ideal MHD, recasted in its vorticity formulation; finally, to take advantage of the special structure of the non-linear terms. We also rigorously establish the equivalence between the original formulation of the ideal MHD and its Elsässer formulation for a large class of weak solutions. The construction of explicit counterexamples shows the sharpness of our assumptions. Related non-uniqueness issues are discussed as well. △ Less

Submitted 23 September, 2020; originally announced September 2020.

Comments: Submitted

arXiv:2007.15094 [pdf, ps, other]

Rigorous derivation and well-posedness of a quasi-homogeneous ideal MHD system

Authors: Dimitri Cobb, Francesco Fanelli

Abstract: The goal of this paper is twofold. On the one hand, we introduce a quasi-homogeneous version of the classical ideal MHD system and study its well-posedness in critical Besov spaces $B^s_{p,r}(\mathbb{R}^d)$, $d\geq2$, with $1<p<+\infty$ and under the Lipschitz condition $s>1+d/p$ and $r\in[1,+\infty]$, or $s=1+d/p$ and $r=1$. A key ingredient is the reformulation of the system \textsl{via} the so-… ▽ More The goal of this paper is twofold. On the one hand, we introduce a quasi-homogeneous version of the classical ideal MHD system and study its well-posedness in critical Besov spaces $B^s_{p,r}(\mathbb{R}^d)$, $d\geq2$, with $1<p<+\infty$ and under the Lipschitz condition $s>1+d/p$ and $r\in[1,+\infty]$, or $s=1+d/p$ and $r=1$. A key ingredient is the reformulation of the system \textsl{via} the so-called Elsässer variables. On the other hand, we give a rigorous justification of quasi-homogeneous MHD models, both in the ideal and in the dissipative cases: when $d=2$, we will derive them from a non-homogeneous incompressible MHD system with Coriolis force, in the regime of low Rossby number and for small density variations around a constant state. Our method of proof relies on a relative entropy inequality for the primitive system, and yields precise rates of convergence, depending on the size of the initial data, on the order of the Rossby number and on the regularity of the viscosity and resistivity coefficients. △ Less

Submitted 29 July, 2020; originally announced July 2020.

Comments: Submitted

arXiv:2007.04466 [pdf, ps, other]

URSABench: Comprehensive Benchmarking of Approximate Bayesian Inference Methods for Deep Neural Networks

Authors: Meet P. Vadera, Adam D. Cobb, Brian Jalaian, Benjamin M. Marlin

Abstract: While deep learning methods continue to improve in predictive accuracy on a wide range of application domains, significant issues remain with other aspects of their performance including their ability to quantify uncertainty and their robustness. Recent advances in approximate Bayesian inference hold significant promise for addressing these concerns, but the computational scalability of these meth… ▽ More While deep learning methods continue to improve in predictive accuracy on a wide range of application domains, significant issues remain with other aspects of their performance including their ability to quantify uncertainty and their robustness. Recent advances in approximate Bayesian inference hold significant promise for addressing these concerns, but the computational scalability of these methods can be problematic when applied to large-scale models. In this paper, we describe initial work on the development ofURSABench(the Uncertainty, Robustness, Scalability, and Accu-racy Benchmark), an open-source suite of bench-marking tools for comprehensive assessment of approximate Bayesian inference methods with a focus on deep learning-based classification tasks △ Less

Submitted 8 July, 2020; originally announced July 2020.

Comments: Presented at the ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

arXiv:2003.02430 [pdf, other]

doi 10.3847/PSJ/abe3fd

Accurate Machine Learning Atmospheric Retrieval via a Neural Network Surrogate Model for Radiative Transfer

Authors: Michael D. Himes, Joseph Harrington, Adam D. Cobb, Atilim Gunes Baydin, Frank Soboczenski, Molly D. O'Beirne, Simone Zorzan, David C. Wright, Zacchaeus Scheffer, Shawn D. Domagal-Goldman, Giada N. Arney

Abstract: Atmospheric retrieval determines the properties of an atmosphere based on its measured spectrum. The low signal-to-noise ratio of exoplanet observations require a Bayesian approach to determine posterior probability distributions of each model parameter, given observed spectra. This inference is computationally expensive, as it requires many executions of a costly radiative transfer (RT) simulatio… ▽ More Atmospheric retrieval determines the properties of an atmosphere based on its measured spectrum. The low signal-to-noise ratio of exoplanet observations require a Bayesian approach to determine posterior probability distributions of each model parameter, given observed spectra. This inference is computationally expensive, as it requires many executions of a costly radiative transfer (RT) simulation for each set of sampled model parameters. Machine learning (ML) has recently been shown to provide a significant reduction in runtime for retrievals, mainly by training inverse ML models that predict parameter distributions, given observed spectra, albeit with reduced posterior accuracy. Here we present a novel approach to retrieval by training a forward ML surrogate model that predicts spectra given model parameters, providing a fast approximate RT simulation that can be used in a conventional Bayesian retrieval framework without significant loss of accuracy. We demonstrate our method on the emission spectrum of HD 189733 b and find good agreement with a traditional retrieval from the Bayesian Atmospheric Radiative Transfer (BART) code (Bhattacharyya coefficients of 0.9843--0.9972, with a mean of 0.9925, between 1D marginalized posteriors). This accuracy comes while still offering significant speed enhancements over traditional RT, albeit not as much as ML methods with lower posterior accuracy. Our method is ~9x faster per parallel chain than BART when run on an AMD EPYC 7402P central processing unit (CPU). Neural-network computation using an NVIDIA Titan Xp graphics processing unit is 90--180x faster per chain than BART on that CPU. △ Less

Submitted 3 May, 2022; v1 submitted 4 March, 2020; originally announced March 2020.

Comments: 16 pages, 4 figures, submitted to PSJ 3/4/2020, revised 1/22/2021, accepted 2/4/2021, published 4/25/2022. Updated to match the published manuscript. Himes et al. 2022, PSJ, 3, 91

Journal ref: Planet. Sci. J. 3 (2022) 91

arXiv:2001.04733 [pdf, other]

HumBug Zooniverse: a crowd-sourced acoustic mosquito dataset

Authors: Ivan Kiskin, Adam D. Cobb, Lawrence Wang, Stephen Roberts

Abstract: Mosquitoes are the only known vector of malaria, which leads to hundreds of thousands of deaths each year. Understanding the number and location of potential mosquito vectors is of paramount importance to aid the reduction of malaria transmission cases. In recent years, deep learning has become widely used for bioacoustic classification tasks. In order to enable further research applications in th… ▽ More Mosquitoes are the only known vector of malaria, which leads to hundreds of thousands of deaths each year. Understanding the number and location of potential mosquito vectors is of paramount importance to aid the reduction of malaria transmission cases. In recent years, deep learning has become widely used for bioacoustic classification tasks. In order to enable further research applications in this field, we release a new dataset of mosquito audio recordings. With over a thousand contributors, we obtained 195,434 labels of two second duration, of which approximately 10 percent signify mosquito events. We present an example use of the dataset, in which we train a convolutional neural network on log-Mel features, showcasing the information content of the labels. We hope this will become a vital resource for those researching all aspects of malaria, and add to the existing audio datasets for bioacoustic detection and signal processing. △ Less

Submitted 14 February, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

Comments: Awarded Best Paper at the 2019 NeurIPS ML4D workshop. Accepted at ICASSP 2020

arXiv:1912.04077 [pdf, ps, other]

On the fast rotation asymptotics of a non-homogeneous incompressible MHD system

Authors: Dimitri Cobb, Francesco Fanelli

Abstract: This paper is devoted to the analysis of a singular perturbation problem for a $2$-D incompressible MHD system with density variations and Coriolis force, in the limit of small Rossby numbers. Two regimes are considered. The first one is the quasi-homogeneous regime, where the densities are small perturbations around a constant state. The limit dynamics is identified as an incompressible homogeneo… ▽ More This paper is devoted to the analysis of a singular perturbation problem for a $2$-D incompressible MHD system with density variations and Coriolis force, in the limit of small Rossby numbers. Two regimes are considered. The first one is the quasi-homogeneous regime, where the densities are small perturbations around a constant state. The limit dynamics is identified as an incompressible homogeneous MHD system, coupled with an additional transport equation for the limit of the density variations. The second case is the fully non-homogeneous regime, where the densities vary around a general non-constant profile. In this case, in the limit, the equation for the magnetic field combines with an underdetermined linear equation, which links the limit density variation function with the limit velocity field. The proof is based on a compensated compactness argument, which enables us to consider general ill-prepared initial data. An application of Di Perna-Lions theory for transport equations allows to treat the case of density-dependent viscosity and resistivity coefficients. △ Less

Submitted 9 December, 2019; originally announced December 2019.

Comments: Submitted

arXiv:1910.06243 [pdf, other]

Introducing an Explicit Symplectic Integration Scheme for Riemannian Manifold Hamiltonian Monte Carlo

Authors: Adam D. Cobb, Atılım Güneş Baydin, Andrew Markham, Stephen J. Roberts

Abstract: We introduce a recent symplectic integration scheme derived for solving physically motivated systems with non-separable Hamiltonians. We show its relevance to Riemannian manifold Hamiltonian Monte Carlo (RMHMC) and provide an alternative to the currently used generalised leapfrog symplectic integrator, which relies on solving multiple fixed point iterations to convergence. Via this approach, we ar… ▽ More We introduce a recent symplectic integration scheme derived for solving physically motivated systems with non-separable Hamiltonians. We show its relevance to Riemannian manifold Hamiltonian Monte Carlo (RMHMC) and provide an alternative to the currently used generalised leapfrog symplectic integrator, which relies on solving multiple fixed point iterations to convergence. Via this approach, we are able to reduce the number of higher-order derivative calculations per leapfrog step. We explore the implications of this integrator and demonstrate its efficacy in reducing the computational burden of RMHMC. Our code is provided in a new open-source Python package, hamiltorch. △ Less

Submitted 14 October, 2019; originally announced October 2019.

arXiv:1905.10659 [pdf, other]

doi 10.3847/1538-3881/ab2390

An Ensemble of Bayesian Neural Networks for Exoplanetary Atmospheric Retrieval

Authors: Adam D. Cobb, Michael D. Himes, Frank Soboczenski, Simone Zorzan, Molly D. O'Beirne, Atılım Güneş Baydin, Yarin Gal, Shawn D. Domagal-Goldman, Giada N. Arney, Daniel Angerhausen

Abstract: Machine learning is now used in many areas of astrophysics, from detecting exoplanets in Kepler transit signals to removing telescope systematics. Recent work demonstrated the potential of using machine learning algorithms for atmospheric retrieval by implementing a random forest to perform retrievals in seconds that are consistent with the traditional, computationally-expensive nested-sampling re… ▽ More Machine learning is now used in many areas of astrophysics, from detecting exoplanets in Kepler transit signals to removing telescope systematics. Recent work demonstrated the potential of using machine learning algorithms for atmospheric retrieval by implementing a random forest to perform retrievals in seconds that are consistent with the traditional, computationally-expensive nested-sampling retrieval method. We expand upon their approach by presenting a new machine learning model, \texttt{plan-net}, based on an ensemble of Bayesian neural networks that yields more accurate inferences than the random forest for the same data set of synthetic transmission spectra. We demonstrate that an ensemble provides greater accuracy and more robust uncertainties than a single model. In addition to being the first to use Bayesian neural networks for atmospheric retrieval, we also introduce a new loss function for Bayesian neural networks that learns correlations between the model outputs. Importantly, we show that designing machine learning models to explicitly incorporate domain-specific knowledge both improves performance and provides additional insight by inferring the covariance of the retrieved atmospheric parameters. We apply \texttt{plan-net} to the Hubble Space Telescope Wide Field Camera 3 transmission spectrum for WASP-12b and retrieve an isothermal temperature and water abundance consistent with the literature. We highlight that our method is flexible and can be expanded to higher-resolution spectra and a larger number of atmospheric parameters. △ Less

Submitted 25 May, 2019; originally announced May 2019.

arXiv:1812.04994 [pdf, ps, other]

Bayesian deep neural networks for low-cost neurophysiological markers of Alzheimer's disease severity

Authors: Wolfgang Fruehwirt, Adam D. Cobb, Martin Mairhofer, Leonard Weydemann, Heinrich Garn, Reinhold Schmidt, Thomas Benke, Peter Dal-Bianco, Gerhard Ransmayr, Markus Waser, Dieter Grossegger, Pengfei Zhang, Georg Dorffner, Stephen Roberts

Abstract: As societies around the world are ageing, the number of Alzheimer's disease (AD) patients is rapidly increasing. To date, no low-cost, non-invasive biomarkers have been established to advance the objectivization of AD diagnosis and progression assessment. Here, we utilize Bayesian neural networks to develop a multivariate predictor for AD severity using a wide range of quantitative EEG (QEEG) mark… ▽ More As societies around the world are ageing, the number of Alzheimer's disease (AD) patients is rapidly increasing. To date, no low-cost, non-invasive biomarkers have been established to advance the objectivization of AD diagnosis and progression assessment. Here, we utilize Bayesian neural networks to develop a multivariate predictor for AD severity using a wide range of quantitative EEG (QEEG) markers. The Bayesian treatment of neural networks both automatically controls model complexity and provides a predictive distribution over the target function, giving uncertainty bounds for our regression task. It is therefore well suited to clinical neuroscience, where data sets are typically sparse and practitioners require a precise assessment of the predictive uncertainty. We use data of one of the largest prospective AD EEG trials ever conducted to demonstrate the potential of Bayesian deep learning in this domain, while comparing two distinct Bayesian neural network approaches, i.e., Monte Carlo dropout and Hamiltonian Monte Carlo. △ Less

Submitted 13 December, 2018; v1 submitted 12 December, 2018; originally announced December 2018.

Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

arXiv:1811.03390 [pdf, other]

Bayesian Deep Learning for Exoplanet Atmospheric Retrieval

Authors: Frank Soboczenski, Michael D. Himes, Molly D. O'Beirne, Simone Zorzan, Atilim Gunes Baydin, Adam D. Cobb, Yarin Gal, Daniel Angerhausen, Massimo Mascaro, Giada N. Arney, Shawn D. Domagal-Goldman

Abstract: Over the past decade, the study of extrasolar planets has evolved rapidly from plain detection and identification to comprehensive categorization and characterization of exoplanet systems and their atmospheres. Atmospheric retrieval, the inverse modeling technique used to determine an exoplanetary atmosphere's temperature structure and composition from an observed spectrum, is both time-consuming… ▽ More Over the past decade, the study of extrasolar planets has evolved rapidly from plain detection and identification to comprehensive categorization and characterization of exoplanet systems and their atmospheres. Atmospheric retrieval, the inverse modeling technique used to determine an exoplanetary atmosphere's temperature structure and composition from an observed spectrum, is both time-consuming and compute-intensive, requiring complex algorithms that compare thousands to millions of atmospheric models to the observational data to find the most probable values and associated uncertainties for each model parameter. For rocky, terrestrial planets, the retrieved atmospheric composition can give insight into the surface fluxes of gaseous species necessary to maintain the stability of that atmosphere, which may in turn provide insight into the geological and/or biological processes active on the planet. These atmospheres contain many molecules, some of them biosignatures, spectral fingerprints indicative of biological activity, which will become observable with the next generation of telescopes. Runtimes of traditional retrieval models scale with the number of model parameters, so as more molecular species are considered, runtimes can become prohibitively long. Recent advances in machine learning (ML) and computer vision offer new ways to reduce the time to perform a retrieval by orders of magnitude, given a sufficient data set to train with. Here we present an ML-based retrieval framework called Intelligent exoplaNet Atmospheric RetrievAl (INARA) that consists of a Bayesian deep learning model for retrieval and a data set of 3,000,000 synthetic rocky exoplanetary spectra generated using the NASA Planetary Spectrum Generator. Our work represents the first ML retrieval model for rocky, terrestrial exoplanets and the first synthetic data set of terrestrial spectra generated at this scale. △ Less

Submitted 2 December, 2018; v1 submitted 8 November, 2018; originally announced November 2018.

Comments: Third workshop on Bayesian Deep Learning (NeurIPS 2018), Montreal, Canada

MSC Class: 85A20; 68T05 ACM Class: J.2; I.2.6

arXiv:1805.03901 [pdf, other]

Loss-Calibrated Approximate Inference in Bayesian Neural Networks

Authors: Adam D. Cobb, Stephen J. Roberts, Yarin Gal

Abstract: Current approaches in approximate inference for Bayesian neural networks minimise the Kullback-Leibler divergence to approximate the true posterior over the weights. However, this approximation is without knowledge of the final application, and therefore cannot guarantee optimal predictions for a given task. To make more suitable task-specific approximations, we introduce a new loss-calibrated evi… ▽ More Current approaches in approximate inference for Bayesian neural networks minimise the Kullback-Leibler divergence to approximate the true posterior over the weights. However, this approximation is without knowledge of the final application, and therefore cannot guarantee optimal predictions for a given task. To make more suitable task-specific approximations, we introduce a new loss-calibrated evidence lower bound for Bayesian neural networks in the context of supervised learning, informed by Bayesian decision theory. By introducing a lower bound that depends on a utility function, we ensure that our approximation achieves higher utility than traditional methods for applications that have asymmetric utility functions. Furthermore, in using dropout inference, we highlight that our new objective is identical to that of standard dropout neural networks, with an additional utility-dependent penalty term. We demonstrate our new loss-calibrated model with an illustrative medical example and a restricted model capacity experiment, and highlight failure modes of the comparable weighted cross entropy approach. Lastly, we demonstrate the scalability of our method to real world applications with per-pixel semantic segmentation on an autonomous driving data set. △ Less

Submitted 10 May, 2018; originally announced May 2018.

Comments: 12 pages, 12 figures

arXiv:1802.10446 [pdf, other]

doi 10.1145/3219819.3220065

Identifying Sources and Sinks in the Presence of Multiple Agents with Gaussian Process Vector Calculus

Authors: Adam D. Cobb, Richard Everett, Andrew Markham, Stephen J. Roberts

Abstract: In systems of multiple agents, identifying the cause of observed agent dynamics is challenging. Often, these agents operate in diverse, non-stationary environments, where models rely on hand-crafted environment-specific features to infer influential regions in the system's surroundings. To overcome the limitations of these inflexible models, we present GP-LAPLACE, a technique for locating sources… ▽ More In systems of multiple agents, identifying the cause of observed agent dynamics is challenging. Often, these agents operate in diverse, non-stationary environments, where models rely on hand-crafted environment-specific features to infer influential regions in the system's surroundings. To overcome the limitations of these inflexible models, we present GP-LAPLACE, a technique for locating sources and sinks from trajectories in time-varying fields. Using Gaussian processes, we jointly infer a spatio-temporal vector field, as well as canonical vector calculus operations on that field. Notably, we do this from only agent trajectories without requiring knowledge of the environment, and also obtain a metric for denoting the significance of inferred causal features in the environment by exploiting our probabilistic method. To evaluate our approach, we apply it to both synthetic and real-world GPS data, demonstrating the applicability of our technique in the presence of multiple agents, as well as its superiority over existing methods. △ Less

Submitted 12 November, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

Comments: KDD '18 Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Pages 1254-1262, 9 pages, 5 figures, conference submission, University of Oxford. arXiv admin note: text overlap with arXiv:1709.02357

arXiv:1709.02357 [pdf, other]

Learning from lions: inferring the utility of agents from their trajectories

Authors: Adam D. Cobb, Andrew Markham, Stephen J. Roberts

Abstract: We build a model using Gaussian processes to infer a spatio-temporal vector field from observed agent trajectories. Significant landmarks or influence points in agent surroundings are jointly derived through vector calculus operations that indicate presence of sources and sinks. We evaluate these influence points by using the Kullback-Leibler divergence between the posterior and prior Laplacian of… ▽ More We build a model using Gaussian processes to infer a spatio-temporal vector field from observed agent trajectories. Significant landmarks or influence points in agent surroundings are jointly derived through vector calculus operations that indicate presence of sources and sinks. We evaluate these influence points by using the Kullback-Leibler divergence between the posterior and prior Laplacian of the inferred spatio-temporal vector field. Through locating significant features that influence trajectories, our model aims to give greater insight into underlying causal utility functions that determine agent decision-making. A key feature of our model is that it infers a joint Gaussian process over the observed trajectories, the time-varying vector field of utility and canonical vector calculus operators. We apply our model to both synthetic data and lion GPS data collected at the Bubye Valley Conservancy in southern Zimbabwe. △ Less

Submitted 7 September, 2017; originally announced September 2017.

Comments: 9 pages, 4 figures

Showing 1–28 of 28 results for author: Cobb, D