-
Topographically-generated near-internal waves as a response to winds over the ocean surface
Authors:
Ashley J. Barnes,
Callum J. Shakespeare,
Andy McC. Hogg,
Navid C. Constantinou
Abstract:
Internal waves propagate on the ocean stratification and carry energy and momentum through the ocean interior. The two most significant sources of these waves in the ocean are surface winds and oscillatory tidal flow across topography. We propose a hybrid of these two mechanisms, in which wind-induced oscillations of sea surface and isopycnal heights are rapidly communicated to the seafloor via hydrostatic pressure. In the presence of topography, the resulting oscillatory bottom velocity may then generate internal waves in a similar manner to the barotropic tide. We investigate this mechanism in an idealised numerical isopycnal model of a storm passing over a mid-ocean ridge, and perform several perturbation experiments in which ocean and wind properties are varied. Bottom-generated internal waves are identified propagating away from the ridge in the wake of the storm. Estimates of the total wave energy suggest that in the right circumstances these waves could be a significant source of internal wave energy, with a local wind work to wave energy conversion rate of up to 50% of the corresponding conversion to surface-generated near-inertial waves in our domain. Our results suggest a need for further investigation in less idealised scenarios to more precisely quantify this novel mechanism of deep ocean wave generation, and how it may affect abyssal mixing.
Submitted 3 November, 2023;
originally announced November 2023.
-
Early Career Perspectives For the NASA SMD Bridge Program
Authors:
Jenna M. Cann,
Arturo O. Martinez,
Amethyst Barnes,
Sara Doan,
Feyi Ilesanmi,
Margaret Lazzarini,
Teresa Monsue,
Carlos Pinedo,
Nicole Cabrera Salazar,
Amy Steele
Abstract:
In line with the Astro2020 Decadal Report State of the Profession findings and the NASA core value of Inclusion, the NASA Science Mission Directorate (SMD) Bridge Program was created to provide financial and programmatic support to efforts that work to increase the representation and inclusion of students from under-represented minorities in the STEM fields. To ensure an effective program, particularly for those who are often left out of these conversations, the NASA SMD Bridge Program Workshop was developed as a way to gather feedback from a diverse group of people about their unique needs and interests. The Early Career Perspectives Working Group was tasked with examining the current state of bridge programs, academia in general, and its effect on students and early career professionals. The working group, comprised of 10 early career and student members, analyzed the discussions and responses from workshop breakout sessions and two surveys, as well as their own experiences, to develop specific recommendations and metrics for implementing a successful and supportive bridge program. In this white paper, we will discuss the key themes that arose through our work, and highlight select recommendations for the NASA SMD Bridge Program to best support students and early career professionals.
Submitted 23 October, 2023;
originally announced October 2023.
-
Using Neural Networks to Learn the Jet Stream Forced Response from Natural Variability
Authors:
Charlotte Connolly,
Elizabeth A. Barnes,
Pedram Hassanzadeh,
Mike Pritchard
Abstract:
Two distinct features of anthropogenic climate change, warming in the tropical upper troposphere and warming at the Arctic surface, have competing effects on the mid-latitude jet stream's latitudinal position, often referred to as a "tug-of-war". Studies that investigate the jet's response to these thermal forcings show that it is sensitive to model type, season, initial atmospheric conditions, and the shape and magnitude of the forcing. Much of this past work focuses on studying a simulation's response to external manipulation. In contrast, we explore the potential to train a convolutional neural network (CNN) on internal variability alone and then use it to examine possible nonlinear responses of the jet to tropospheric thermal forcing that more closely resemble anthropogenic climate change. Our approach leverages the idea behind the fluctuation-dissipation theorem, which relates the internal variability of a system to its forced response but so far has been only used to quantify linear responses. We train a CNN on data from a long control run of the CESM dry dynamical core and show that it is able to skillfully predict the nonlinear response of the jet to sustained external forcing. The trained CNN provides a quick method for exploring the jet stream sensitivity to a wide range of tropospheric temperature tendencies and, considering that this method can likely be applied to any model with a long control run, could lend itself useful for early stage experiment design.
Submitted 1 January, 2023;
originally announced January 2023.
-
Carefully choose the baseline: Lessons learned from applying XAI attribution methods for regression tasks in geoscience
Authors:
Antonios Mamalakis,
Elizabeth A. Barnes,
Imme Ebert-Uphoff
Abstract:
Methods of eXplainable Artificial Intelligence (XAI) are used in geoscientific applications to gain insights into the decision-making strategy of Neural Networks (NNs) highlighting which features in the input contribute the most to a NN prediction. Here, we discuss our lesson learned that the task of attributing a prediction to the input does not have a single solution. Instead, the attribution results and their interpretation depend greatly on the considered baseline (sometimes referred to as reference point) that the XAI method utilizes; a fact that has been overlooked so far in the literature. This baseline can be chosen by the user or it is set by construction in the method's algorithm, often without the user being aware of that choice. We highlight that different baselines can lead to different insights for different science questions and, thus, should be chosen accordingly. To illustrate the impact of the baseline, we use a large ensemble of historical and future climate simulations forced with the SSP3-7.0 scenario and train a fully connected NN to predict the ensemble- and global-mean temperature (i.e., the forced global warming signal) given an annual temperature map from an individual ensemble member. We then use various XAI methods and different baselines to attribute the network predictions to the input. We show that attributions differ substantially when considering different baselines, as they correspond to answering different science questions. We conclude by discussing some important implications and considerations about the use of baselines in XAI research.
Submitted 19 August, 2022;
originally announced August 2022.
-
Investigating the fidelity of explainable artificial intelligence methods for applications of convolutional neural networks in geoscience
Authors:
Antonios Mamalakis,
Elizabeth A. Barnes,
Imme Ebert-Uphoff
Abstract:
Convolutional neural networks (CNNs) have recently attracted great attention in geoscience due to their ability to capture non-linear system behavior and extract predictive spatiotemporal patterns. Given their black-box nature however, and the importance of prediction explainability, methods of explainable artificial intelligence (XAI) are gaining popularity as a means to explain the CNN decision-making strategy. Here, we establish an intercomparison of some of the most popular XAI methods and investigate their fidelity in explaining CNN decisions for geoscientific applications. Our goal is to raise awareness of the theoretical limitations of these methods and gain insight into the relative strengths and weaknesses to help guide best practices. The considered XAI methods are first applied to an idealized attribution benchmark, where the ground truth of explanation of the network is known a priori, to help objectively assess their performance. Secondly, we apply XAI to a climate-related prediction setting, namely to explain a CNN that is trained to predict the number of atmospheric rivers in daily snapshots of climate simulations. Our results highlight several important issues of XAI methods (e.g., gradient shattering, inability to distinguish the sign of attribution, ignorance to zero input) that have previously been overlooked in our field and, if not considered cautiously, may lead to a distorted picture of the CNN decision-making strategy. We envision that our analysis will motivate further investigation into XAI fidelity and will help towards a cautious implementation of XAI in geoscience, which can lead to further exploitation of CNNs and deep learning for prediction problems.
Submitted 5 September, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
The Fine-Tuning of the Universe for Life
Authors:
Luke A. Barnes
Abstract:
When a physicist says that a theory is fine-tuned, they mean that it must make a suspiciously precise assumption in order to explain a certain observation. This is evidence that the theory is deficient or incomplete. One particular case of fine-tuning is particularly striking. The data in question are not the precise measurements of cosmology or particle physics, but a more general feature of our universe: it supports the existence of life. This chapter reviews this Fine-Tuning of the Universe for Life.
Submitted 14 October, 2021;
originally announced October 2021.
-
Adding Uncertainty to Neural Network Regression Tasks in the Geosciences
Authors:
Elizabeth A. Barnes,
Randal J. Barnes,
Nicolas Gordillo
Abstract:
A simple method for adding uncertainty to neural network regression tasks via estimation of a general probability distribution is described. The methodology supports estimation of heteroscedastic, asymmetric uncertainties by a simple modification of the network output and loss function. Method performance is demonstrated with a simple one dimensional data set and then applied to a more complex regression task using synthetic climate data.
Submitted 15 September, 2021;
originally announced September 2021.
-
Relaxation Based Modeling of GMD Induced Cascading Failures in PowerModelsGMD.jl
Authors:
Adam Mate,
Arthur K. Barnes,
Steven K. Morley,
Jacob A. Friz-Trillo,
Eduardo Cotilla-Sanchez,
Sean P. Blake
Abstract:
A major risk of geomagnetic disturbances (GMDs) is cascading failure of electrical grids. The modeling of GMD events and cascading outages in power systems is difficult, both independently and jointly, because of the many different mechanisms and physics involved. This paper introduces a relaxation based modeling of GMD-induced cascading failures: the dc approximation-based DCSIMSEP solver was adapted to simulate cascading as a result of GMDs, the full set of ac power flow equations were relaxed to guarantee optimality, and the reactive power losses were modeled while keeping the problem convex. The developed algorithm was implemented in PowerModelsGMD.jl - an open-source software specifically designed to model and analyze geomagnetic hazards - and demonstrated to work on the RTS-GMLC-GIC-EAST synthetic test network.
Submitted 30 December, 2021; v1 submitted 14 August, 2021;
originally announced August 2021.
-
Controlled abstention neural networks for identifying skillful predictions for classification problems
Authors:
Elizabeth A. Barnes,
Randal J. Barnes
Abstract:
The earth system is exceedingly complex and often chaotic in nature, making prediction incredibly challenging: we cannot expect to make perfect predictions all of the time. Instead, we look for specific states of the system that lead to more predictable behavior than others, often termed "forecasts of opportunity." When these opportunities are not present, scientists need prediction systems that are capable of saying "I don't know." We introduce a novel loss function, termed the "NotWrong loss", that allows neural networks to identify forecasts of opportunity for classification problems. The NotWrong loss introduces an abstention class that allows the network to identify the more confident samples and abstain (say "I don't know") on the less confident samples. The abstention loss is designed to abstain on a user-defined fraction of the samples via a PID controller. Unlike many machine learning methods used to reject samples post-training, the NotWrong loss is applied during training to preferentially learn from the more confident samples. We show that the NotWrong loss outperforms other existing loss functions for multiple climate use cases. The implementation of the proposed loss function is straightforward in most network architectures designed for classification as it only requires the addition of an abstention class to the output layer and modification of the loss function.
Submitted 16 April, 2021;
originally announced April 2021.
-
Controlled abstention neural networks for identifying skillful predictions for regression problems
Authors:
Elizabeth A. Barnes,
Randal J. Barnes
Abstract:
The earth system is exceedingly complex and often chaotic in nature, making prediction incredibly challenging: we cannot expect to make perfect predictions all of the time. Instead, we look for specific states of the system that lead to more predictable behavior than others, often termed "forecasts of opportunity". When these opportunities are not present, scientists need prediction systems that are capable of saying "I don't know." We introduce a novel loss function, termed "abstention loss", that allows neural networks to identify forecasts of opportunity for regression problems. The abstention loss works by incorporating uncertainty in the network's prediction to identify the more confident samples and abstain (say "I don't know") on the less confident samples. The abstention loss is designed to determine the optimal abstention fraction, or abstain on a user-defined fraction via a PID controller. Unlike many methods for attaching uncertainty to neural network predictions post-training, the abstention loss is applied during training to preferentially learn from the more confident samples. The abstention loss is built upon a standard computer science method. While the standard approach is itself a simple yet powerful tool for incorporating uncertainty in regression problems, we demonstrate that the abstention loss outperforms this more standard method for the synthetic climate use cases explored here. The implementation of the proposed loss function is straightforward in most network architectures designed for regression, as it only requires modification of the output layer and loss function.
Submitted 16 April, 2021;
originally announced April 2021.
-
The Trouble with "Puddle Thinking": A User's Guide to the Anthropic Principle
Authors:
Geraint F. Lewis,
Luke A. Barnes
Abstract:
Are some cosmologists trying to return human beings to the centre of the cosmos? In the view of some critics, the so-called "anthropic principle" is a desperate attempt to salvage a scrap of dignity for our species after a few centuries of demotion at the hands of science. It is all things archaic and backwards - teleology, theology, religion, anthropocentrism - trying to sneak back in scientific camouflage. We argue that this is a mistake. The anthropic principle is not mere human arrogance, nor is it religion in disguise. It is a necessary part of the science of the universe.
Submitted 7 April, 2021;
originally announced April 2021.
-
Neural Network Attribution Methods for Problems in Geoscience: A Novel Synthetic Benchmark Dataset
Authors:
Antonios Mamalakis,
Imme Ebert-Uphoff,
Elizabeth A. Barnes
Abstract:
Despite the increasingly successful application of neural networks to many problems in the geosciences, their complex and nonlinear structure makes the interpretation of their predictions difficult, which limits model trust and does not allow scientists to gain physical insights about the problem at hand. Many different methods have been introduced in the emerging field of eXplainable Artificial Intelligence (XAI), which aim at attributing the network's prediction to specific features in the input domain. XAI methods are usually assessed by using benchmark datasets (like MNIST or ImageNet for image classification). However, an objective, theoretically derived ground truth for the attribution is lacking for most of these datasets, making the assessment of XAI in many cases subjective. Also, benchmark datasets specifically designed for problems in geosciences are rare. Here, we provide a framework, based on the use of additively separable functions, to generate attribution benchmark datasets for regression problems for which the ground truth of the attribution is known a priori. We generate a large benchmark dataset and train a fully connected network to learn the underlying function that was used for simulation. We then compare estimated heatmaps from different XAI methods to the ground truth in order to identify examples where specific XAI methods perform well or poorly. We believe that attribution benchmarks as the ones introduced herein are of great importance for further application of neural networks in the geosciences, and for more objective assessment and accurate implementation of XAI methods, which will increase model trust and assist in discovering new science.
Submitted 10 June, 2022; v1 submitted 17 March, 2021;
originally announced March 2021.
-
A gravity-independent powder-based additive manufacturing process tailored for space applications
Authors:
Olfa D'Angelo,
Felix Kuthe,
Szu-Jia Liu,
Raphael Wiedey,
Joe M. Bennett,
Martina Meisnar,
Andrew Barnes,
W. Till Kranz,
Thomas Voigtmann,
Andreas Meyer
Abstract:
The future of space exploration missions will rely on technologies increasing their endurance and self-sufficiency, including for manufacturing objects on-demand. We propose a process for handling and additively manufacturing powders that functions independently of the gravitational environment and with no restriction on feedstock powder flowability. Based on a specific sequence of boundary loads applied to the granular packing, powder is transported to the printing zone, homogenized and put under compression to increase the density of the final part. The powder deposition process is validated by simulations that show the homogeneity and density of deposition to be insensitive to gravity and cohesion forces within the DEM model. We further provide an experimental proof of concept of the process by successfully 3D printing parts on-ground and in weightlessness, on parabolic flight. Powders exhibiting high and low flowability are used as model feedstock material to demonstrate the versatility of the process, opening the way for additive manufacturing of recycled material.
Submitted 8 October, 2021; v1 submitted 19 February, 2021;
originally announced February 2021.
-
Will Artificial Intelligence supersede Earth System and Climate Models?
Authors:
Christopher Irrgang,
Niklas Boers,
Maike Sonnewald,
Elizabeth A. Barnes,
Christopher Kadow,
Joanna Staneva,
Jan Saynisch-Wagner
Abstract:
We outline a perspective of an entirely new research branch in Earth and climate sciences, where deep neural networks and Earth system models are dismantled as individual methodological approaches and reassembled as learning, self-validating, and interpretable Earth system model-network hybrids. Following this path, we coin the term "Neural Earth System Modelling" (NESYM) and highlight the necessity of a transdisciplinary discussion platform, bringing together Earth and climate scientists, big data analysts, and AI experts. We examine the concurrent potential and pitfalls of Neural Earth System Modelling and discuss the open question whether artificial intelligence will not only infuse Earth system modelling, but ultimately render them obsolete.
Submitted 22 January, 2021;
originally announced January 2021.
-
Identifying Opportunities for Skillful Weather Prediction with Interpretable Neural Networks
Authors:
Elizabeth A. Barnes,
Kirsten Mayer,
Benjamin Toms,
Zane Martin,
Emily Gordon
Abstract:
The atmosphere is chaotic. This fundamental property of the climate system makes forecasting weather incredibly challenging: it's impossible to expect weather models to ever provide perfect predictions of the Earth system beyond timescales of approximately 2 weeks. Instead, atmospheric scientists look for specific states of the climate system that lead to more predictable behaviour than others. Here, we demonstrate how neural networks can be used, not only to leverage these states to make skillful predictions, but moreover to identify the climatic conditions that lead to enhanced predictability. Furthermore, we employ a neural network interpretability method called "layer-wise relevance propagation" to create heatmaps of the regions in the input most relevant for a network's output. For Earth scientists, these relevant regions for the neural network's prediction are by far the most important product of our study: they provide scientific insight into the physical mechanisms that lead to enhanced weather predictability. While we demonstrate our approach for the atmospheric science domain, this methodology is applicable to a large range of geoscientific problems.
Submitted 14 December, 2020;
originally announced December 2020.
-
The GlueX Beamline and Detector
Authors:
S. Adhikari,
C. S. Akondi,
H. Al Ghoul,
A. Ali,
M. Amaryan,
E. G. Anassontzis,
A. Austregesilo,
F. Barbosa,
J. Barlow,
A. Barnes,
E. Barriga,
R. Barsotti,
T. D. Beattie,
J. Benesch,
V. V. Berdnikov,
G. Biallas,
T. Black,
W. Boeglin,
P. Brindza,
W. J. Briscoe,
T. Britton,
J. Brock,
W. K. Brooks,
B. E. Cannon,
C. Carlin, et al. (165 additional authors not shown)
Abstract:
The GlueX experiment at Jefferson Lab has been designed to study photoproduction reactions with a 9-GeV linearly polarized photon beam. The energy and arrival time of beam photons are tagged using a scintillator hodoscope and a scintillating fiber array. The photon flux is determined using a pair spectrometer, while the linear polarization of the photon beam is determined using a polarimeter based on triplet photoproduction. Charged-particle tracks from interactions in the central target are analyzed in a solenoidal field using a central straw-tube drift chamber and six packages of planar chambers with cathode strips and drift wires. Electromagnetic showers are reconstructed in a cylindrical scintillating fiber calorimeter inside the magnet and a lead-glass array downstream. Charged particle identification is achieved by measuring energy loss in the wire chambers and using the flight time of particles between the target and detectors outside the magnet. The signals from all detectors are recorded with flash ADCs and/or pipeline TDCs into memories allowing trigger decisions with a latency of 3.3 μs. The detector operates routinely at trigger rates of 40 kHz and data rates of 600 megabytes per second. We describe the photon beam, the GlueX detector components, electronics, data-acquisition and monitoring systems, and the performance of the experiment during the first three years of operation.
Submitted 26 October, 2020; v1 submitted 28 May, 2020;
originally announced May 2020.
-
Indicator patterns of forced change learned by an artificial neural network
Authors:
Elizabeth A. Barnes,
Benjamin Toms,
James W. Hurrell,
Imme Ebert-Uphoff,
Chuck Anderson,
David Anderson
Abstract:
Many problems in climate science require the identification of signals obscured by both the "noise" of internal climate variability and differences across models. Following previous work, we train an artificial neural network (ANN) to identify the year of input maps of temperature and precipitation from forced climate model simulations. This prediction task requires the ANN to learn forced patterns of change amidst a background of climate noise and model differences. We then apply a neural network visualization technique (layerwise relevance propagation) to visualize the spatial patterns that lead the ANN to successfully predict the year. These spatial patterns thus serve as "reliable indicators" of the forced change. The architecture of the ANN is chosen such that these indicators vary in time, thus capturing the evolving nature of regional signals of change. Results are compared to those of more standard approaches like signal-to-noise ratios and multi-linear regression in order to gain intuition about the reliable indicators identified by the ANN. We then apply an additional visualization tool (backward optimization) to highlight where disagreements in simulated and observed patterns of change are most important for the prediction of the year. This work demonstrates that ANNs and their visualization tools make a powerful pair for extracting climate patterns of forced change.
Submitted 25 May, 2020;
originally announced May 2020.
-
Physically Interpretable Neural Networks for the Geosciences: Applications to Earth System Variability
Authors:
Benjamin A. Toms,
Elizabeth A. Barnes,
Imme Ebert-Uphoff
Abstract:
Neural networks have become increasingly prevalent within the geosciences, although a common limitation of their usage has been a lack of methods to interpret what the networks learn and how they make decisions. As such, neural networks have often been used within the geosciences to most accurately identify a desired output given a set of inputs, with the interpretation of what the network learns used as a secondary metric to ensure the network is making the right decision for the right reason. Neural network interpretation techniques have become more advanced in recent years, however, and we therefore propose that the ultimate objective of using a neural network can also be the interpretation of what the network has learned rather than the output itself.
We show that the interpretation of neural networks can enable the discovery of scientifically meaningful connections within geoscientific data. In particular, we use two methods for neural network interpretation called backwards optimization and layerwise relevance propagation, both of which project the decision pathways of a network back onto the original input dimensions. To the best of our knowledge, LRP has not yet been applied to geoscientific research, and we believe it has great potential in this area. We show how these interpretation techniques can be used to reliably infer scientifically meaningful information from neural networks by applying them to common climate patterns. These results suggest that combining interpretable neural networks with novel scientific hypotheses will open the door to many new avenues in neural network-related geoscience research.
Submitted 27 May, 2020; v1 submitted 3 December, 2019;
originally announced December 2019.
-
Constraints on ion vs. electron heating by plasma turbulence at low beta
Authors:
A. A. Schekochihin,
Y. Kawazura,
M. A. Barnes
Abstract:
It is shown that in low-beta, weakly collisional plasmas, such as the solar corona, some instances of the solar wind, the aurora, inner regions of accretion discs, their coronae, and some laboratory plasmas, Alfvénic fluctuations produce no ion heating within the gyrokinetic approximation, i.e., as long as their amplitudes (at the Larmor scale) are small and their frequencies stay below the ion Larmor frequency (even as their spatial scales can be above or below the ion Larmor scale). Thus, all low-frequency ion heating in such plasmas is due to compressive fluctuations ("slow modes"). Because these fluctuations energetically decouple from the Alfvénic ones already in the inertial range, the above conclusion means that the energy partition between ions and electrons in low-beta plasmas is decided at the outer scale, where turbulence is launched, and can be determined from magnetohydrodynamic (MHD) models of the relevant astrophysical systems. Any additional ion heating must come from non-gyrokinetic mechanisms such as cyclotron heating or the stochastic heating owing to distortions of ions' Larmor orbits. An exception to these conclusions occurs in the Hall limit, i.e., when the ratio of the ion to electron temperatures is as low as the ion beta (equivalently, the electron beta is order unity). In this regime, slow modes couple to Alfvénic ones well above the Larmor scale (viz., at the ion inertial or ion sound scale), so the Alfvénic and compressive cascades join and then separate again into two cascades of fluctuations that linearly resemble kinetic Alfvén and ion cyclotron waves, with the former heating electrons and the latter ions. The two cascades are shown to decouple, scalings for them are derived, and it is argued physically that the two species will be heated by them at approximately equal rates.
Submitted 10 April, 2019; v1 submitted 23 December, 2018;
originally announced December 2018.
-
Bell's Spaceships: The Views from Bow and Stern
Authors:
Geraint F. Lewis,
Luke A. Barnes,
Martin J. Sticka
Abstract:
Unravelling apparent paradoxes has proven to be a powerful tool for understanding the complexities of special relativity. In this paper, we focus upon one such paradox, namely Bell's spaceship paradox, examining the relative motion of two uniformly accelerating spaceships. We consider the view from either spaceship, with the exchange of photons between the two. This recovers the well known result that the leading spaceship loses sight of the trailing spaceship as it is redshifted and disappears behind what is known as the 'Rindler horizon'. An immediate impact of this is that if either spaceship tries to measure the separation through 'radar ranging', bouncing photons off one another, they would both eventually fail to receive any of the photon 'pings' that they emit. We find that the view from this trailing spaceship is, however, starkly different, initially seeing the leading spaceship with an increasing blueshift, followed by a decreasing blueshift. We conclude that, while the leading spaceship loses sight of the trailing spaceship, for the trailing spaceship the view of the separation between the two spaceships, and the apparent angular size of the leading spaceship, approach asymptotic values. Intriguingly, for particular parametrization of the journey of the two spaceships, these asymptotic values are identical to those properties seen before the spaceships began accelerating, and the view from the trailing spaceship becomes identical to when the two spaceships were initially at rest.
Submitted 12 December, 2017;
originally announced December 2017.
-
Fine-Tuning in the Context of Bayesian Theory Testing
Authors:
Luke A. Barnes
Abstract:
Fine-tuning in physics and cosmology is often used as evidence that a theory is incomplete. For example, the parameters of the standard model of particle physics are "unnaturally" small (in various technical senses), which has driven much of the search for physics beyond the standard model. Of particular interest is the fine-tuning of the universe for life, which suggests that our universe's ability to create physical life forms is improbable and in need of explanation, perhaps by a multiverse. This claim has been challenged on the grounds that the relevant probability measure cannot be justified because it cannot be normalized, and so small probabilities cannot be inferred. We show how fine-tuning can be formulated within the context of Bayesian theory testing (or model selection) in the physical sciences. The normalizability problem is seen to be a general problem for testing any theory with free parameters, and not a unique problem for fine-tuning. Physical theories in fact avoid such problems in one of two ways. Dimensional parameters are bounded by the Planck scale, avoiding troublesome infinities, and we are not compelled to assume that dimensionless parameters are distributed uniformly, which avoids non-normalizability.
Submitted 12 July, 2017;
originally announced July 2017.
-
Testing the Multiverse: Bayes, Fine-Tuning and Typicality
Authors:
Luke A. Barnes
Abstract:
Theory testing in the physical sciences has been revolutionized in recent decades by Bayesian approaches to probability theory. Here, I will consider Bayesian approaches to theory extensions, that is, theories like inflation which aim to provide a deeper explanation for some aspect of our models (in this case, the standard model of cosmology) that seem unnatural or fine-tuned. In particular, I will consider how cosmologists can test the multiverse using observations of this universe.
Submitted 5 April, 2017;
originally announced April 2017.
-
Collisionality scaling of the electron heat flux in ETG turbulence
Authors:
G J Colyer,
A A Schekochihin,
F I Parra,
C M Roach,
M A Barnes,
Y-c Ghim,
W Dorland
Abstract:
In electrostatic simulations of MAST plasma at electron-gyroradius scales, using the local flux-tube gyrokinetic code GS2 with adiabatic ions, we find that the long-time saturated electron heat flux (the level most relevant to energy transport) decreases as the electron collisionality decreases. At early simulation times, the heat flux "quasi-saturates" without any strong dependence on collisionality, and with the turbulence dominated by streamer-like radially elongated structures. However, the zonal fluctuation component continues to grow slowly until much later times, eventually leading to a new saturated state dominated by zonal modes and with the heat flux proportional to the collision rate, in approximate agreement with the experimentally observed collisionality scaling of the energy confinement in MAST. We outline an explanation of this effect based on a model of ETG turbulence dominated by zonal-nonzonal interactions and on an analytically derived scaling of the zonal-mode damping rate with the electron-ion collisionality. Improved energy confinement with decreasing collisionality is favourable towards the performance of future, hotter devices.
Submitted 14 January, 2017; v1 submitted 22 July, 2016;
originally announced July 2016.
-
First Results from The GlueX Experiment
Authors:
The GlueX Collaboration,
H. Al Ghoul,
E. G. Anassontzis,
F. Barbosa,
A. Barnes,
T. D. Beattie,
D. W. Bennett,
V. V. Berdnikov,
T. Black,
W. Boeglin,
W. K. Brooks,
B. Cannon,
O. Chernyshov,
E. Chudakov,
V. Crede,
M. M. Dalton,
A. Deur,
S. Dobbs,
A. Dolgolenko,
M. Dugger,
H. Egiyan,
P. Eugenio,
A. M. Foda,
J. Frye,
S. Furletov,
et al. (86 additional authors not shown)
Abstract:
The GlueX experiment at Jefferson Lab ran with its first commissioning beam in late 2014 and the spring of 2015. Data were collected on both plastic and liquid hydrogen targets, and much of the detector has been commissioned. All of the detector systems are now performing at or near design specifications and events are being fully reconstructed, including exclusive production of $π^{0}$, $η$ and $ω$ mesons. Linearly-polarized photons were successfully produced through coherent bremsstrahlung and polarization transfer to the $ρ$ has been observed.
Submitted 14 January, 2016; v1 submitted 11 December, 2015;
originally announced December 2015.
-
A study of decays to strange final states with GlueX in Hall D using components of the BaBar DIRC
Authors:
The GlueX Collaboration,
M. Dugger,
B. Ritchie,
I. Senderovich,
E. Anassontzis,
P. Ioannou,
C. Kourkoumeli,
G. Vasileiadis,
G. Voulgaris,
N. Jarvis,
W. Levine,
P. Mattione,
W. McGinley,
C. A. Meyer,
R. Schumacher,
M. Staib,
F. Klein,
D. Sober,
N. Sparks,
N. Walford,
D. Doughty,
A. Barnes,
R. Jones,
J. McIntyre,
F. Mokaya,
et al. (82 additional authors not shown)
Abstract:
We propose to enhance the kaon identification capabilities of the GlueX detector by constructing an FDIRC (Focusing Detection of Internally Reflected Cherenkov) detector utilizing the decommissioned BaBar DIRC components. The GlueX FDIRC would significantly enhance the GlueX physics program by allowing one to search for and study hybrid mesons decaying into kaon final states. Such systematic studies of kaon final states are essential for inferring the quark flavor content of hybrid and conventional mesons. The GlueX FDIRC would reuse one-third of the synthetic fused silica bars that were utilized in the BaBar DIRC. A new focussing photon camera, read out with large area photodetectors, would be developed. We propose operating the enhanced GlueX detector in Hall D for a total of 220 days at an average intensity of $5\times10^{7}$ γ/s, a program that was conditionally approved by PAC39.
Submitted 1 August, 2014;
originally announced August 2014.
-
Density functional theory embedding for correlated wavefunctions: Improved methods for open-shell systems and transition metal complexes
Authors:
Jason D. Goodpaster,
Taylor A. Barnes,
Frederick R. Manby,
Thomas F. Miller III
Abstract:
Density functional theory (DFT) embedding provides a formally exact framework for interfacing correlated wave-function theory (WFT) methods with lower-level descriptions of electronic structure. Here, we report techniques to improve the accuracy and stability of WFT-in-DFT embedding calculations. In particular, we develop spin-dependent embedding potentials in both restricted and unrestricted orbital formulations to enable WFT-in-DFT embedding for open-shell systems, and we develop an orbital-occupation-freezing technique to improve the convergence of optimized effective potential (OEP) calculations that arise in the evaluation of the embedding potential. The new techniques are demonstrated in applications to the van-der-Waals-bound ethylene-propylene dimer and to the hexaaquairon(II) transition-metal cation. Calculation of the dissociation curve for the ethylene-propylene dimer reveals that WFT-in-DFT embedding reproduces full CCSD(T) energies to within 0.1 kcal/mol at all distances, eliminating errors in the dispersion interactions due to conventional exchange-correlation (XC) functionals while simultaneously avoiding errors due to subsystem partitioning across covalent bonds. Application of WFT-in-DFT embedding to the calculation of the low-spin/high-spin splitting energy in the hexaaquairon(II) cation reveals that the majority of the dependence on the DFT XC functional can be eliminated by treating only the single transition-metal atom at the WFT level; furthermore, these calculations demonstrate the substantial effects of open-shell contributions to the embedding potential, and they suggest that restricted open-shell WFT-in-DFT embedding provides better accuracy than unrestricted open-shell WFT-in-DFT embedding due to the removal of spin contamination.
Submitted 17 December, 2012; v1 submitted 26 November, 2012;
originally announced November 2012.
-
The Fine-Tuning of the Universe for Intelligent Life
Authors:
Luke A. Barnes
Abstract:
The fine-tuning of the universe for intelligent life has received a great deal of attention in recent years, both in the philosophical and scientific literature. The claim is that in the space of possible physical laws, parameters and initial conditions, the set that permits the evolution of intelligent life is very small. I present here a review of the scientific literature, outlining cases of fine-tuning in the classic works of Carter, Carr and Rees, and Barrow and Tipler, as well as more recent work. To sharpen the discussion, the role of the antagonist will be played by Victor Stenger's recent book The Fallacy of Fine-Tuning: Why the Universe is Not Designed for Us. Stenger claims that all known fine-tuning cases can be explained without the need for a multiverse. Many of Stenger's claims will be found to be highly problematic. We will touch on such issues as the logical necessity of the laws of nature; objectivity, invariance and symmetry; theoretical physics and possible universes; entropy in cosmology; cosmic inflation and initial conditions; galaxy formation; the cosmological constant; stars and their formation; the properties of elementary particles and their effect on chemistry and the macroscopic world; the origin of mass; grand unified theories; and the dimensionality of space and time. I also provide an assessment of the multiverse, noting the significant challenges that it must face. I do not attempt to defend any conclusion based on the fine-tuning of the universe for intelligent life. This paper can be viewed as a critique of Stenger's book, or read independently.
Submitted 7 June, 2012; v1 submitted 20 December, 2011;
originally announced December 2011.
-
Embedded density functional theory for covalently bonded and strongly interacting subsystems
Authors:
Jason D. Goodpaster,
Taylor A. Barnes,
Thomas F. Miller III
Abstract:
Embedded density functional theory (e-DFT) is used to describe the electronic structure of strongly interacting molecular subsystems. We present a general implementation of the Exact Embedding (EE) method [J. Chem. Phys. 133, 084103 (2010)] to calculate the large contributions of the non-additive kinetic potential (NAKP) in such applications. Potential energy curves are computed for the dissociation of Li+-Be, CH3-CF3, and hydrogen-bonded water clusters, and e-DFT results obtained using the EE method are compared with those obtained using approximate kinetic energy functionals. In all cases, the EE method preserves excellent agreement with reference Kohn-Sham calculations, whereas the approximate functionals lead to qualitative failures in the calculated energies and equilibrium structures. We also demonstrate an accurate pairwise approximation to the NAKP that allows for efficient parallelization of the EE method in large systems; benchmark calculations on molecular crystals reveal ideal, size-independent scaling of wall-clock time with increasing system size.
Submitted 19 February, 2011;
originally announced February 2011.
-
Expanding Space: the Root of all Evil?
Authors:
Matthew J. Francis,
Luke A. Barnes,
J. Berian James,
Geraint F. Lewis
Abstract:
While it remains the staple of virtually all cosmological teaching, the concept of expanding space in explaining the increasing separation of galaxies has recently come under fire as a dangerous idea whose application leads to the development of confusion and the establishment of misconceptions. In this paper, we develop a notion of expanding space that is completely valid as a framework for the description of the evolution of the universe and whose application allows an intuitive understanding of the influence of universal expansion. We also demonstrate how arguments against the concept in general have failed thus far, as they imbue expanding space with physical properties not consistent with the expectations of general relativity.
Submitted 3 July, 2007;
originally announced July 2007.
-
Energy Loss, Electron Screening, and the Astrophysical 3He(d,p)4He cross section
Authors:
K. Langanke,
T. D. Shoppa,
C. A. Barnes,
C. Rolfs
Abstract:
We reanalyze the low-energy 3He(d,p)4He cross section measurements of Engstler et al. using recently measured energy loss data for proton and deuteron beams in a helium gas. Although the new 3He(d,p)4He S-factors are significantly lower than those reported by Engstler et al., they clearly show the presence of electron screening effects. From the new S-factors we find an electron screening energy in agreement with the adiabatic limit.
Submitted 2 January, 1996; v1 submitted 11 December, 1995;
originally announced December 1995.