-
Partial Label Learning with Focal Loss for Sea Ice Classification Based on Ice Charts
Authors:
Behzad Vahedi,
Benjamin Lucas,
Farnoush Banaei-Kashani,
Andrew P. Barrett,
Walter N. Meier,
Siri Jodha Khalsa,
Morteza Karimzadeh
Abstract:
Sea ice, crucial to the Arctic and Earth's climate, requires consistent monitoring and high-resolution mapping. Manual sea ice mapping, however, is time-consuming and subjective, prompting the need for automated deep learning-based classification approaches. However, training these algorithms is challenging because expert-generated ice charts, commonly used as training data, do not map single ice types but instead map polygons with multiple ice types. Moreover, the distribution of various ice types in these charts is frequently imbalanced, resulting in a performance bias towards the dominant class. In this paper, we present a novel GeoAI approach to training sea ice classification by formalizing it as a partial label learning task with explicit confidence scores to address multiple labels and class imbalance. We treat the polygon-level labels as candidate partial labels, assign the corresponding ice concentrations as confidence scores to each candidate label, and integrate them with focal loss to train a Convolutional Neural Network (CNN). Our proposed approach leads to enhanced performance for sea ice classification in Sentinel-1 dual-polarized SAR images, improving classification accuracy (from 87% to 92%) and weighted average F-1 score (from 90% to 93%) compared to the conventional training approach of using one-hot encoded labels and Categorical Cross-Entropy loss. It also improves the F-1 score in 4 out of the 6 sea ice classes.
Submitted 9 June, 2024; v1 submitted 5 June, 2024;
originally announced June 2024.
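The abstract above describes combining per-class confidence scores (derived from ice concentrations) with focal loss. The abstract does not give the exact formulation, so the following is only a minimal NumPy sketch under assumptions: the function name `partial_label_focal_loss`, the choice to weight each candidate class's focal term by its confidence, and the default `gamma=2.0` are all illustrative and are not taken from the paper.

```python
import numpy as np


def partial_label_focal_loss(logits, confidences, gamma=2.0, eps=1e-12):
    """Confidence-weighted focal loss, averaged over a batch (illustrative).

    logits:      (n_samples, n_classes) raw network outputs.
    confidences: (n_samples, n_classes) non-negative weights over the
                 candidate labels (e.g. ice concentrations normalized to
                 sum to 1 per sample); zero for non-candidate classes.
    gamma:       focusing parameter; gamma = 0 reduces the loss to a
                 soft-label cross-entropy.
    """
    logits = np.asarray(logits, dtype=float)
    confidences = np.asarray(confidences, dtype=float)

    # Numerically stable softmax over the class axis.
    shifted = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(shifted)
    probs /= probs.sum(axis=1, keepdims=True)

    # Per-class focal term: down-weights classes that are already
    # predicted with high probability.
    focal = -((1.0 - probs) ** gamma) * np.log(probs + eps)

    # Weight each candidate class by its confidence, sum over classes,
    # and average over the batch.
    return float((confidences * focal).sum(axis=1).mean())


# Hypothetical example: one polygon whose chart lists two ice types
# with concentrations 70% and 30% among four classes.
example_logits = [[2.0, 0.5, -1.0, 0.0]]
example_confidences = [[0.7, 0.3, 0.0, 0.0]]
example_loss = partial_label_focal_loss(example_logits, example_confidences)
```

With one-hot confidences and `gamma=0`, this sketch reduces to ordinary categorical cross-entropy, which is the baseline the abstract compares against.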
-
Benchmark Early and Red Team Often: A Framework for Assessing and Managing Dual-Use Hazards of AI Foundation Models
Authors:
Anthony M. Barrett,
Krystal Jackson,
Evan R. Murphy,
Nada Madkour,
Jessica Newman
Abstract:
A concern about cutting-edge or "frontier" AI foundation models is that an adversary may use the models for preparing chemical, biological, radiological, nuclear (CBRN), cyber, or other attacks. At least two methods can identify foundation models with potential dual-use capability; each has advantages and disadvantages: A. Open benchmarks (based on openly available questions and answers), which are low-cost but accuracy-limited by the need to omit security-sensitive details; and B. Closed red team evaluations (based on private evaluation by CBRN and cyber experts), which are higher-cost but can achieve higher accuracy by incorporating sensitive details. We propose a research and risk-management approach using a combination of methods including both open benchmarks and closed red team evaluations, in a way that leverages advantages of both methods. We recommend that one or more groups of researchers with sufficient resources and access to a range of near-frontier and frontier foundation models run a set of foundation models through dual-use capability evaluation benchmarks and red team evaluations, then analyze the resulting sets of models' scores on benchmark and red team evaluations to see how correlated those are. If, as we expect, there is substantial correlation between the dual-use potential benchmark scores and the red team evaluation scores, then implications include the following: The open benchmarks should be used frequently during foundation model development as a quick, low-cost measure of a model's dual-use potential; and if a particular model gets a high score on the dual-use potential benchmark, then more in-depth red team assessments of that model's dual-use capability should be performed. We also discuss limitations and mitigations for our approach, e.g., if model developers try to game benchmarks by including a version of benchmark test data in a model's training data.
Submitted 15 May, 2024;
originally announced May 2024.
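The abstract above proposes checking how correlated open-benchmark scores are with red team evaluation scores across a set of models. The abstract does not name a correlation statistic, so the sketch below assumes Spearman rank correlation as one reasonable choice. The function name and all scores are hypothetical and purely illustrative; no real model results are implied.

```python
import numpy as np


def spearman_rank_correlation(x, y):
    """Spearman rank correlation between two equal-length score lists.

    Simplification: ranks come from a double argsort, so tied values are
    not given averaged ranks. Use this sketch only on tie-free data.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim != 1 or x.shape != y.shape or x.size < 2:
        raise ValueError("x and y must be 1-D, equal length, with >= 2 items")

    # Convert raw scores to ranks 0..n-1.
    rank_x = np.argsort(np.argsort(x)).astype(float)
    rank_y = np.argsort(np.argsort(y)).astype(float)

    # Spearman's rho is the Pearson correlation of the ranks.
    return float(np.corrcoef(rank_x, rank_y)[0, 1])


# Hypothetical scores for five models (one entry per model).
benchmark_scores = [0.12, 0.35, 0.41, 0.58, 0.77]
red_team_scores = [1.0, 2.5, 2.0, 3.5, 4.0]

rho = spearman_rank_correlation(benchmark_scores, red_team_scores)
print(f"Spearman rho between benchmark and red team scores: {rho:.2f}")
```

A rank-based statistic is used here because benchmark and red team scores need not share a scale; only the ordering of models is compared.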
-
A Red Teaming Framework for Securing AI in Maritime Autonomous Systems
Authors:
Mathew J. Walter,
Aaron Barrett,
Kimberly Tam
Abstract:
Artificial intelligence (AI) is being ubiquitously adopted to automate processes in science and industry. However, due to its often intricate and opaque nature, AI has been shown to possess inherent vulnerabilities which can be maliciously exploited with adversarial AI, potentially putting AI users and developers at both cyber and physical risk. In addition, there is insufficient comprehension of the real-world effects of adversarial AI and an inadequacy of AI security examinations; therefore, the growing threat landscape is unknown for many AI solutions. To mitigate this issue, we propose one of the first red team frameworks for evaluating the AI security of maritime autonomous systems. The framework provides operators with a proactive (secure by design) and reactive (post-deployment evaluation) response to securing AI technology today and in the future. This framework is a multi-part checklist, which can be tailored to different systems and requirements. We demonstrate this framework to be highly effective for a red team to use to uncover numerous vulnerabilities within a real-world maritime autonomous systems AI, ranging from poisoning to adversarial patch attacks. The lessons learned from systematic AI red teaming can help prevent MAS-related catastrophic events in a world with increasing uptake and reliance on mission-critical AI.
Submitted 8 December, 2023;
originally announced December 2023.
-
Enhancing sea ice segmentation in Sentinel-1 images with atrous convolutions
Authors:
Rafael Pires de Lima,
Behzad Vahedi,
Nick Hughes,
Andrew P. Barrett,
Walter Meier,
Morteza Karimzadeh
Abstract:
Due to the growing volume of remote sensing data and the low latency required for safe marine navigation, machine learning (ML) algorithms are being developed to accelerate sea ice chart generation, currently a manual interpretation task. However, the low signal-to-noise ratio of the freely available Sentinel-1 Synthetic Aperture Radar (SAR) imagery, the ambiguity of backscatter signals for ice types, and the scarcity of open-source high-resolution labelled data make automating sea ice mapping challenging. We use Extreme Earth version 2, a high-resolution benchmark dataset generated for ML training and evaluation, to investigate the effectiveness of ML for automated sea ice mapping. Our customized pipeline combines ResNets and Atrous Spatial Pyramid Pooling for SAR image segmentation. We investigate the performance of our model for: i) binary classification of sea ice and open water in a segmentation framework; and ii) a multiclass segmentation of five sea ice types. For binary ice-water classification, models trained with our largest training set have weighted F1 scores all greater than 0.95 for January and July test scenes. Specifically, the median weighted F1 score was 0.98, indicating high performance for both months. By comparison, a competitive baseline U-Net has a weighted average F1 score ranging from 0.92 to 0.94 (median 0.93) for July, and 0.97 to 0.98 (median 0.97) for January. Multiclass ice type classification is more challenging, and even though our models achieve 2% improvement in weighted F1 average compared to the baseline U-Net, test weighted F1 is generally between 0.6 and 0.80. Our approach can efficiently segment full SAR scenes in one run, is faster than the baseline U-Net, retains spatial resolution and dimension, and is more robust against noise compared to approaches that rely on patch classification.
Submitted 25 October, 2023;
originally announced October 2023.
-
Flagellum Pumping Efficacy in Shear-Thinning Viscoelastic Fluids
Authors:
Aaron Barrett,
Aaron L. Fogelson,
M. Gregory Forest,
Cole Gruninger,
Sookkyung Lim,
Boyce E. Griffith
Abstract:
Microorganism motility often takes place within complex, viscoelastic fluid environments, e.g., sperm in cervicovaginal mucus and bacteria in biofilms. In such complex fluids, strains and stresses generated by the microorganism are stored and relax across a spectrum of length and time scales and the complex fluid can be driven out of its linear response regime. Phenomena not possible in viscous media thereby arise from feedback between the "swimmer" and the complex fluid, making swimming efficiency co-dependent on the propulsion mechanism and fluid properties. Here we parameterize a flagellar motor and filament properties together with elastic relaxation and nonlinear shear-thinning properties of the fluid in a computational immersed boundary model. We then explore swimming efficiency over this parameter space. One exemplary insight is that motor efficiency (measured by the volumetric flow rate) can be boosted vs. degraded by moderate vs. strong shear-thinning of the viscoelastic environment.
Submitted 12 October, 2023;
originally announced October 2023.
-
Benchmarking the Immersed Boundary Method for Viscoelastic Flows
Authors:
Cole Gruninger,
Aaron Barrett,
Fuhui Fang,
M. Gregory Forest,
Boyce E. Griffith
Abstract:
We present and analyze a series of benchmark tests regarding the application of the immersed boundary (IB) method to viscoelastic flows through and around non-trivial, stationary geometries. The IB method is widely used for the simulation of biological fluid dynamics and other modeling scenarios where a structure is immersed in a fluid. Although the IB method has been most commonly used to model systems with viscous incompressible fluids, it also can be applied to viscoelastic fluids, and has enabled the study of a wide variety of dynamical problems including the settling of vesicles and the swimming of elastic filaments in fluids modeled by the Oldroyd-B constitutive equation. However, to date, relatively little work has explored the accuracy or convergence properties of the numerical scheme. Herein, we present benchmarking results for an IB solver applied to viscoelastic flows in and around non-trivial geometries using the idealized Oldroyd-B and more realistic, polymer-entanglement-based Rolie-Poly constitutive equations. We use two-dimensional numerical test cases along with results from rheology experiments to benchmark the IB method and compare it to more complex finite element and finite volume viscoelastic flow solvers. Additionally, we analyze different choices of regularized delta function and relative Lagrangian grid spacings which allow us to identify and recommend the key choices of these numerical parameters depending on the present flow regime.
Submitted 14 January, 2024; v1 submitted 1 September, 2023;
originally announced September 2023.
-
Algorithmic Randomness and Probabilistic Laws
Authors:
Jeffrey A. Barrett,
Eddy Keming Chen
Abstract:
We consider two ways one might use algorithmic randomness to characterize a probabilistic law. The first is a generative chance* law. Such laws involve a nonstandard notion of chance. The second is a probabilistic* constraining law. Such laws impose relative frequency and randomness constraints that every physically possible world must satisfy. While each notion has virtues, we argue that the latter has advantages over the former. It supports a unified governing account of non-Humean laws and provides independently motivated solutions to issues in the Humean best-system account. On both notions, we have a much tighter connection between probabilistic laws and their corresponding sets of possible worlds. Certain histories permitted by traditional probabilistic laws are ruled out as physically impossible. As a result, such laws avoid one variety of empirical underdetermination, but the approach reveals other varieties of underdetermination that are typically overlooked.
Submitted 2 March, 2023;
originally announced March 2023.
-
Actionable Guidance for High-Consequence AI Risk Management: Towards Standards Addressing AI Catastrophic Risks
Authors:
Anthony M. Barrett,
Dan Hendrycks,
Jessica Newman,
Brandie Nonnecke
Abstract:
Artificial intelligence (AI) systems can provide many beneficial capabilities but also risks of adverse events. Some AI systems could present risks of events with very high or catastrophic consequences at societal scale. The US National Institute of Standards and Technology (NIST) has been developing the NIST Artificial Intelligence Risk Management Framework (AI RMF) as voluntary guidance on AI risk assessment and management for AI developers and others. For addressing risks of events with catastrophic consequences, NIST indicated a need to translate from high level principles to actionable risk management guidance.
In this document, we provide detailed actionable-guidance recommendations focused on identifying and managing risks of events with very high or catastrophic consequences, intended as a risk management practices resource for NIST for AI RMF version 1.0 (released in January 2023), or for AI RMF users, or for other AI risk management guidance and standards as appropriate. We also provide our methodology for our recommendations.
We provide actionable-guidance recommendations for AI RMF 1.0 on: identifying risks from potential unintended uses and misuses of AI systems; including catastrophic-risk factors within the scope of risk assessments and impact assessments; identifying and mitigating human rights harms; and reporting information on AI risk factors including catastrophic-risk factors.
In addition, we provide recommendations on additional issues for a roadmap for later versions of the AI RMF or supplementary publications. These include: providing an AI RMF Profile with supplementary guidance for cutting-edge increasingly multi-purpose or general-purpose AI.
We aim for this work to be a concrete risk-management practices contribution, and to stimulate constructive dialogue on how to address catastrophic risks and associated issues in AI standards.
Submitted 23 February, 2023; v1 submitted 17 June, 2022;
originally announced June 2022.
-
A Model of Fluid-Structure and Biochemical Interactions for Applications to Subclinical Leaflet Thrombosis
Authors:
Aaron Barrett,
Jordan A. Brown,
Margaret Anne Smith,
Andrew Woodward,
John P. Vavalle,
Arash Kheradvar,
Boyce E. Griffith,
Aaron L. Fogelson
Abstract:
Subclinical leaflet thrombosis (SLT) is a potentially serious complication of aortic valve replacement with a bioprosthetic valve in which blood clots form on the replacement valve. SLT is associated with increased risk of transient ischemic attacks and strokes and can progress to clinical leaflet thrombosis. SLT following aortic valve replacement also may be related to subsequent structural valve deterioration, which can impair the durability of the valve replacement. Because of the difficulty in clinical imaging of SLT, models are needed to determine the mechanisms of SLT and could eventually predict which patients will develop SLT. To this end, we develop methods to simulate leaflet thrombosis that combine fluid-structure interaction and a simplified thrombosis model that allows for deposition along the moving leaflets. Additionally, this model can be adapted to model deposition or absorption along other moving boundaries. We present convergence results and quantify the model's ability to realize changes in valve opening and pressures. These new approaches are an important advancement in our tools for modeling thrombosis in that they incorporate both adhesion to the surface of the moving leaflets and feedback to the fluid-structure interaction.
Submitted 7 February, 2023; v1 submitted 3 May, 2022;
originally announced May 2022.
-
Operations for Autonomous Spacecraft
Authors:
Rebecca Castano,
Tiago Vaquero,
Federico Rossi,
Vandi Verma,
Ellen Van Wyk,
Dan Allard,
Bennett Huffmann,
Erin M. Murphy,
Nihal Dhamani,
Robert A. Hewitt,
Scott Davidoff,
Rashied Amini,
Anthony Barrett,
Julie Castillo-Rogez,
Steve A. Chien,
Mathieu Choukroun,
Alain Dadaian,
Raymond Francis,
Benjamin Gorr,
Mark Hofstadter,
Mitch Ingham,
Cristina Sorice,
Iain Tierney
Abstract:
Onboard autonomy technologies such as planning and scheduling, identification of scientific targets, and content-based data summarization, will lead to exciting new space science missions. However, the challenge of operating missions with such onboard autonomous capabilities has not been studied to a level of detail sufficient for consideration in mission concepts. These autonomy capabilities will require changes to current operations processes, practices, and tools. We have developed a case study to assess the changes needed to enable operators and scientists to operate an autonomous spacecraft by facilitating a common model between the ground personnel and the onboard algorithms. We assess the new operations tools and workflows necessary to enable operators and scientists to convey their desired intent to the spacecraft, and to be able to reconstruct and explain the decisions made onboard and the state of the spacecraft. Mock-ups of these tools were used in a user study to understand the effectiveness of the processes and tools in enabling a shared framework of understanding, and in the ability of the operators and scientists to effectively achieve mission science objectives.
Submitted 21 November, 2021;
originally announced November 2021.
-
Greater than the parts: A review of the information decomposition approach to causal emergence
Authors:
Pedro A. M. Mediano,
Fernando E. Rosas,
Andrea I. Luppi,
Henrik J. Jensen,
Anil K. Seth,
Adam B. Barrett,
Robin L. Carhart-Harris,
Daniel Bor
Abstract:
Emergence is a profound subject that straddles many scientific disciplines, including the formation of galaxies and how consciousness arises from the collective activity of neurons. Despite the broad interest that exists on this concept, the study of emergence has suffered from a lack of formalisms that could be used to guide discussions and advance theories. Here we summarise, elaborate on, and extend a recent formal theory of causal emergence based on information decomposition, which is quantifiable and amenable to empirical testing. This theory relates emergence with information about a system's temporal evolution that cannot be obtained from the parts of the system separately. This article provides an accessible but rigorous introduction to the framework, discussing the merits of the approach in various scenarios of interest. We also discuss several interpretation issues and potential misunderstandings, while highlighting the distinctive benefits of this formalism.
Submitted 11 November, 2021;
originally announced November 2021.
-
Towards an extended taxonomy of information dynamics via Integrated Information Decomposition
Authors:
Pedro A. M. Mediano,
Fernando E. Rosas,
Andrea I Luppi,
Robin L. Carhart-Harris,
Daniel Bor,
Anil K. Seth,
Adam B. Barrett
Abstract:
Complex systems, from the human brain to the global economy, are made of multiple elements that interact in such ways that the behaviour of the `whole' often seems to be more than what is readily explainable in terms of the `sum of the parts.' Our ability to understand and control these systems remains limited, one reason being that we still don't know how best to describe -- and quantify -- the higher-order dynamical interactions that characterise their complexity. To address this limitation, we combine principles from the theories of Information Decomposition and Integrated Information into what we call Integrated Information Decomposition, or $Φ$ID. $Φ$ID provides a comprehensive framework to reason about, evaluate, and understand the information dynamics of complex multivariate systems. $Φ$ID reveals the existence of previously unreported modes of collective information flow, providing tools to express well-known measures of information transfer and dynamical complexity as aggregates of these modes. Via computational and empirical examples, we demonstrate that $Φ$ID extends our explanatory power beyond traditional causal discovery methods -- with profound implications for the study of complex systems across disciplines.
Submitted 27 September, 2021;
originally announced September 2021.
-
Immersed boundary simulations of cell-cell interactions in whole blood
Authors:
Andrew Kassen,
Aaron Barrett,
Varun Shankar,
Aaron L. Fogelson
Abstract:
We present a new method for the geometric reconstruction of elastic surfaces simulated by the immersed boundary method with the goal of simulating the motion and interactions of cells in whole blood. Our method uses parameter-free radial basis functions for high-order meshless parametric reconstruction of point clouds and the elastic force computations required by the immersed boundary method. This numerical framework allows us to consider the effect of endothelial geometry and red blood cell motion on the motion of platelets. We find red blood cells to be crucial for understanding the motion of platelets, to the point that the geometry of the vessel wall has a negligible effect in the presence of RBCs. We describe certain interactions that force the platelets to remain near the endothelium for extended periods, including a novel platelet motion that can be seen only in 3-dimensional simulations that we term "unicycling." We also observe red blood cell-mediated interactions between platelets and the endothelium for which the platelet has reduced speed. We suggest that these behaviors serve as mechanisms that allow platelets to better maintain vascular integrity.
Submitted 20 August, 2021;
originally announced August 2021.
-
Integrated information as a common signature of dynamical and information-processing complexity
Authors:
Pedro A. M. Mediano,
Fernando E. Rosas,
Juan Carlos Farah,
Murray Shanahan,
Daniel Bor,
Adam B. Barrett
Abstract:
The apparent dichotomy between information-processing and dynamical approaches to complexity science forces researchers to choose between two diverging sets of tools and explanations, creating conflict and often hindering scientific progress. Nonetheless, given the shared theoretical goals between both approaches, it is reasonable to conjecture the existence of underlying common signatures that capture interesting behaviour in both dynamical and information-processing systems. Here we argue that a pragmatic use of Integrated Information Theory (IIT), originally conceived in theoretical neuroscience, can provide a potential unifying framework to study complexity in general multivariate systems. Furthermore, by leveraging metrics put forward by the integrated information decomposition ($Φ$ID) framework, our results reveal that integrated information can effectively capture surprisingly heterogeneous signatures of complexity -- including metastability and criticality in networks of coupled oscillators as well as distributed computation and emergent stable particles in cellular automata -- without relying on idiosyncratic, ad-hoc criteria. These results show how an agnostic use of IIT can provide important steps towards bridging the gap between informational and dynamical approaches to complex systems.
Submitted 18 June, 2021;
originally announced June 2021.
-
Component Based Solutions Under Architecture
Authors:
T. A. Barrett,
H. A. Proper
Abstract:
Many of today's applications have an, almost tangible, monolithic nature. They are built as 'islands', purporting to be self contained, offering little or nothing in the way of integration with other applications. In the past, being large and self-contained may have eliminated the need to interact with other solutions to some extent. However, in the business environments of today the interaction with other applications becomes paramount. As a result of this, many ad-hoc point-to-point integration solutions have been built between different applications. This has already led to an 'application spaghetti' at many of our customer sites. Many of today's applications are poorly structured, which makes their responsiveness to business change sluggish. The application spaghetti with its plethora of point-to-point interfaces further inhibits the responsiveness to change.
Submitted 18 May, 2021;
originally announced May 2021.
-
A Hybrid Semi-Lagrangian Cut Cell Method for Advection-Diffusion Problems with Robin Boundary Conditions in Moving Domains
Authors:
Aaron Barrett,
Aaron L. Fogelson,
Boyce E. Griffith
Abstract:
We present a new discretization for advection-diffusion problems with Robin boundary conditions on complex time-dependent domains. The method is based on second order cut cell finite volume methods introduced by Bochkov et al. to discretize the Laplace operator and Robin boundary condition. To overcome the small cell problem, we use a splitting scheme that uses a semi-Lagrangian method to treat advection. We demonstrate second order accuracy in the $L^1$, $L^2$, and $L^\infty$ norms for both analytic test problems and numerical convergence studies. We also demonstrate the ability of the scheme to handle conversion of one concentration field to another across a moving boundary.
Submitted 25 October, 2021; v1 submitted 16 February, 2021;
originally announced February 2021.
-
Pump efficacy in a fluid-structure interaction model of a chain of contracting lymphangions
Authors:
Hallie Elich,
Aaron Barrett,
Varun Shankar,
Aaron L. Fogelson
Abstract:
The transport of lymph through the lymphatic vasculature is the mechanism for returning excess interstitial fluid to the circulatory system, and it is essential for fluid homeostasis. Collecting lymphatic vessels comprise a significant portion of the lymphatic vasculature and are divided by valves into contractile segments known as lymphangions. Despite its importance, lymphatic transport in collecting vessels is not well understood. We present a computational model to study lymph flow through chains of valved, contracting lymphangions. We used the Navier-Stokes equations to model the fluid flow and the immersed boundary method to handle the two-way, fluid-structure interaction in 2D, non-axisymmetric simulations. We used our model to evaluate the effects of chain length, contraction style, and adverse axial pressure difference (AAPD) on cycle-mean flow rates (CMFRs). In the model, longer lymphangion chains generally yield larger CMFRs, and they fail to generate positive CMFRs at higher AAPDs than shorter chains. Simultaneously contracting pumps generate the largest CMFRs at nearly every AAPD and for every chain length. Due to the contraction timing and valve dynamics, non-simultaneous pumps generate lower CMFRs than the simultaneous pumps; the discrepancy diminishes as the AAPD increases. Valve dynamics vary with the contraction style and exhibit hysteretic opening and closing behaviors. Our model provides insight into how contraction propagation affects flow rates and transport through a lymphangion chain.
Submitted 4 July, 2021; v1 submitted 9 December, 2020;
originally announced December 2020.
-
Decomposing spectral and phasic differences in non-linear features between datasets
Authors:
Pedro A. M. Mediano,
Fernando E. Rosas,
Adam B. Barrett,
Daniel Bor
Abstract:
When employing non-linear methods to characterise complex systems, it is important to determine to what extent they are capturing genuine non-linear phenomena that could not be assessed by simpler spectral methods. Specifically, we are concerned with the problem of quantifying spectral and phasic effects on an observed difference in a non-linear feature between two systems (or two states of the same system). Here we derive, from a sequence of null models, a decomposition of the difference in an observable into spectral, phasic, and spectrum-phase interaction components. Our approach makes no assumptions about the structure of the data and adds nuance to a wide range of time series analyses.
Submitted 21 September, 2020;
originally announced September 2020.
-
Reconciling emergences: An information-theoretic approach to identify causal emergence in multivariate data
Authors:
Fernando E. Rosas,
Pedro A. M. Mediano,
Henrik J. Jensen,
Anil K. Seth,
Adam B. Barrett,
Robin L. Carhart-Harris,
Daniel Bor
Abstract:
The broad concept of emergence is instrumental in various of the most challenging open scientific questions -- yet, few quantitative theories of what constitutes emergent phenomena have been proposed. This article introduces a formal theory of causal emergence in multivariate systems, which studies the relationship between the dynamics of parts of a system and macroscopic features of interest. Our theory provides a quantitative definition of downward causation, and introduces a complementary modality of emergent behaviour -- which we refer to as causal decoupling. Moreover, the theory allows practical criteria that can be efficiently calculated in large systems, making our framework applicable in a range of scenarios of practical interest. We illustrate our findings in a number of case studies, including Conway's Game of Life, Reynolds' flocking model, and neural activity as measured by electrocorticography.
Submitted 17 April, 2020;
originally announced April 2020.
-
An operational information decomposition via synergistic disclosure
Authors:
Fernando Rosas,
Pedro Mediano,
Borzoo Rassouli,
Adam Barrett
Abstract:
Multivariate information decompositions hold promise to yield insight into complex systems, and stand out for their ability to identify synergistic phenomena. However, the adoption of these approaches has been hindered by there being multiple possible decompositions, and no precise guidance for preferring one over the others. At the heart of this disagreement lies the absence of a clear operational interpretation of what synergistic information is. Here we fill this gap by proposing a new information decomposition based on a novel operationalisation of informational synergy, which leverages recent developments in the literature of data privacy. Our decomposition is defined for any number of information sources, and its atoms can be calculated using elementary optimisation techniques. The decomposition provides a natural coarse-graining that scales gracefully with the system's size, and is applicable in a wide range of scenarios of practical interest.
Submitted 13 March, 2020; v1 submitted 28 January, 2020;
originally announced January 2020.
-
An empirical, Bayesian approach to modelling the impact of weather on crop yield: maize in the US
Authors:
Raphael Shirley,
Edward Pope,
Myles Bartlett,
Seb Oliver,
Novi Quadrianto,
Peter Hurley,
Steven Duivenvoorden,
Phil Rooney,
Adam B. Barrett,
Chris Kent,
James Bacon
Abstract:
We apply an empirical, data-driven approach for describing crop yield as a function of monthly temperature and precipitation by employing generative probabilistic models with parameters determined through Bayesian inference. Our approach is applied to state-scale maize yield and meteorological data for the US Corn Belt from 1981 to 2014 as an exemplar, but would be readily transferable to other crops, locations and spatial scales. Experimentation with a number of models shows that maize growth rates can be characterised by a two-dimensional Gaussian function of temperature and precipitation with monthly contributions accumulated over the growing period. This approach accounts for non-linear growth responses to the individual meteorological variables, and allows for interactions between them. Our models correctly identify that temperature and precipitation have the largest impact on yield in the six months prior to the harvest, in agreement with the typical growing season for US maize (April to September). Maximal growth rates occur for monthly mean temperature 18-19$^\circ$C, corresponding to a daily maximum temperature of 24-25$^\circ$C (in broad agreement with previous work) and monthly total precipitation 115 mm. Our approach also provides a self-consistent way of investigating climate change impacts on current US maize varieties in the absence of adaptation measures. Keeping precipitation and growing area fixed, a temperature increase of $2^\circ$C, relative to 1981-2014, results in the mean yield decreasing by 8%, while the yield variance increases by a factor of around 3. We thus provide a flexible, data-driven framework for exploring the impacts of natural climate variability and climate change on globally significant crops based on their observed behaviour. In concert with other approaches, this can help inform the development of adaptation strategies that will ensure food security under a changing climate.
Submitted 8 January, 2020;
originally announced January 2020.
-
Typical Worlds
Authors:
Jeffrey A. Barrett
Abstract:
Hugh Everett III presented pure wave mechanics, sometimes referred to as the many-worlds interpretation, as a solution to the quantum measurement problem. While pure wave mechanics is an objectively deterministic physical theory with no probabilities, Everett sought to show how the theory might be understood as making the standard quantum statistical predictions as appearances to observers who were themselves described by the theory. We will consider his argument and how it depends on a particular notion of branch typicality. We will also consider responses to Everett and the relationship between typicality and probability. The suggestion will be that pure wave mechanics requires a number of significant auxiliary assumptions in order to make anything like the standard quantum predictions.
Submitted 7 December, 2019;
originally announced December 2019.
-
Forecasting vegetation condition for drought early warning systems in pastoral communities in Kenya
Authors:
Adam B. Barrett,
Steven Duivenvoorden,
Edward E. Salakpi,
James M. Muthoka,
John Mwangi,
Seb Oliver,
Pedram Rowhani
Abstract:
Droughts are a recurring hazard in sub-Saharan Africa that can wreak huge socioeconomic costs. Acting early based on alerts provided by early warning systems (EWS) can potentially provide substantial mitigation, reducing the financial and human cost. However, existing EWS tend only to monitor current, rather than forecast future, environmental and socioeconomic indicators of drought, and hence are not always sufficiently timely to be effective in practice. Here we present a novel method for forecasting satellite-based indicators of vegetation condition. Specifically, we focused on the 3-month Vegetation Condition Index (VCI3M) over pastoral livelihood zones in Kenya, which is the indicator used by the Kenyan National Drought Management Authority (NDMA). Using data from MODIS and Landsat, we apply linear autoregression and Gaussian process modeling methods and demonstrate high forecasting skill several weeks ahead. As a benchmark we predicted the drought alert marker used by NDMA (VCI3M<35). Both of our models were able to predict this alert marker four weeks ahead with a hit rate of around 89% and a false alarm rate of around 4%, or 81% and 6% respectively six weeks ahead. The methods developed here can thus identify a deteriorating vegetation condition well and sufficiently in advance to help disaster risk managers act early to support vulnerable communities and limit the impact of a drought hazard.
Submitted 11 May, 2020; v1 submitted 23 November, 2019;
originally announced November 2019.
-
Beyond integrated information: A taxonomy of information dynamics phenomena
Authors:
Pedro A. M. Mediano,
Fernando Rosas,
Robin L. Carhart-Harris,
Anil K. Seth,
Adam B. Barrett
Abstract:
Most information dynamics and statistical causal analysis frameworks rely on the common intuition that causal interactions are intrinsically pairwise -- every 'cause' variable has an associated 'effect' variable, so that a 'causal arrow' can be drawn between them. However, analyses that depict interdependencies as directed graphs fail to discriminate the rich variety of modes of information flow that can coexist within a system. This, in turn, creates problems with attempts to operationalise the concepts of 'dynamical complexity' or 'integrated information'. To address this shortcoming, we combine concepts of partial information decomposition and integrated information, and obtain what we call Integrated Information Decomposition, or $Φ$ID. We show how $Φ$ID paves the way for more detailed analyses of interdependencies in multivariate time series, and sheds light on collective modes of information dynamics that have not been reported before. Additionally, $Φ$ID reveals that what is typically referred to as 'integration' is actually an aggregate of several heterogeneous phenomena. Furthermore, $Φ$ID can be used to formulate new, tailored measures of integrated information, as well as to understand and alleviate the limitations of existing measures.
Submitted 5 September, 2019;
originally announced September 2019.
-
A fully 3D multi-path convolutional neural network with feature fusion and feature weighting for automatic lesion identification in brain MRI images
Authors:
Yunzhe Xue,
Meiyan Xie,
Fadi G. Farhat,
Olga Boukrina,
A. M. Barrett,
Jeffrey R. Binder,
Usman W. Roshan,
William W. Graves
Abstract:
We propose a fully 3D multi-path convolutional network to predict stroke lesions from 3D brain MRI images. Our multi-path model has independent encoders for different modalities containing residual convolutional blocks, weighted multi-path feature fusion from different modalities, and weighted fusion modules to combine encoder and decoder features. Compared to existing 3D CNNs like DeepMedic, 3D U-Net, and AnatomyNet, our network achieves the highest statistically significant cross-validation accuracy of 60.5% on the large ATLAS benchmark of 220 patients. We also test our model on multi-modal images from the Kessler Foundation and Medical College of Wisconsin and achieve a statistically significant cross-validation accuracy of 65%, significantly outperforming the multi-modal 3D U-Net and DeepMedic. Overall our model offers a principled, extensible multi-path approach that outperforms multi-channel alternatives and achieves high Dice accuracies on existing benchmarks.
Submitted 16 November, 2019; v1 submitted 17 July, 2019;
originally announced July 2019.
-
Gravito-optics and intensity correlations for binary inspiral signal detections
Authors:
Preston Jones,
Alexander Barrett,
Justin Carpenter,
Andri Gretarsson,
Ellie Gretarsson,
Brennan Hughey,
Douglas Singleton,
Darrel Smith,
Michele Zanolin
Abstract:
We examine the correlation functions associated with intensity interferometry and gravito-optics of gravitational wave signals from compact binary coalescences. Previous theoretical studies of the gravito-optics of gravitational waves have concentrated on the characterization of both the classical and the non-classical properties of signals from cosmological sources in the early Universe. These previous works assume a periodic signal similar to the signals studied widely in optics and quantum optics and do not apply to transient signals. We develop the gravito-optics of intensity correlations for descriptions of the detection of transient signals from compact binary coalescences and apply these methods to calculate the two-point intensity correlations for the gravitational wave discovery. We also discuss the necessary theoretical work required for the description of the quantum gravito-optics of intensity correlations in the detection of signals from binary inspirals.
Submitted 30 July, 2023; v1 submitted 28 June, 2019;
originally announced July 2019.
-
A multi-path 2.5 dimensional convolutional neural network system for segmenting stroke lesions in brain MRI images
Authors:
Yunzhe Xue,
Fadi G. Farhat,
Olga Boukrina,
A. M. Barrett,
Jeffrey R. Binder,
Usman W. Roshan,
William W. Graves
Abstract:
Automatic identification of brain lesions from magnetic resonance imaging (MRI) scans of stroke survivors would be a useful aid in patient diagnosis and treatment planning. We propose a multi-modal multi-path convolutional neural network system for automating stroke lesion segmentation. Our system has nine end-to-end UNets that take as input 2-dimensional (2D) slices and examine all three planes with three different normalizations. Outputs from these nine total paths are concatenated into a 3D volume that is then passed to a 3D convolutional neural network to output a final lesion mask. We trained and tested our method on datasets from three sources: Medical College of Wisconsin (MCW), Kessler Foundation (KF), and the publicly available Anatomical Tracings of Lesions After Stroke (ATLAS) dataset. Cross-study validation results (with independent training and validation datasets) were obtained to compare with previous methods based on naive Bayes, random forests, and three recently published convolutional neural networks. Model performance was quantified in terms of the Dice coefficient. Training on the KF and MCW images and testing on the ATLAS images yielded a mean Dice coefficient of 0.54. This was reliably better than the next best previous model, UNet, at 0.47. Reversing the train and test datasets yields a mean Dice of 0.47 on KF and MCW images, whereas the next best UNet reaches 0.45. With all three datasets combined, the current system compared to previous methods also attained a reliably higher cross-validation accuracy. It also achieved high Dice values for many smaller lesions that existing methods have difficulty identifying. Overall, our system is a clear improvement over previous methods for automating stroke lesion segmentation, bringing us an important step closer to the inter-rater accuracy level of human experts.
Submitted 26 May, 2019;
originally announced May 2019.
-
Enhancing Prediction Models for One-Year Mortality in Patients with Acute Myocardial Infarction and Post Myocardial Infarction Syndrome
Authors:
Seyedeh Neelufar Payrovnaziri,
Laura A. Barrett,
Daniel Bis,
Jiang Bian,
Zhe He
Abstract:
Predicting the risk of mortality for patients with acute myocardial infarction (AMI) using electronic health records (EHRs) data can help identify risky patients who might need more tailored care. In our previous work, we built computational models to predict one-year mortality of patients admitted to an intensive care unit (ICU) with AMI or post myocardial infarction syndrome. Our prior work only used the structured clinical data from MIMIC-III, a publicly available ICU clinical database. In this study, we enhanced our work by adding the word embedding features from free-text discharge summaries. Using a richer set of features resulted in significant improvement in the performance of our deep learning models. The average accuracy of our deep learning models was 92.89% and the average F-measure was 0.928. We further reported the impact of different combinations of features extracted from structured and/or unstructured data on the performance of the deep learning models.
Submitted 28 April, 2019;
originally announced April 2019.
-
The Phi measure of integrated information is not well-defined for general physical systems
Authors:
Adam B. Barrett,
Pedro A. M. Mediano
Abstract:
According to the Integrated Information Theory of Consciousness, consciousness is a fundamental observer-independent property of physical systems, and the measure Phi of integrated information is identical to the quantity or level of consciousness. For this to be plausible, there should be no alternative formulae for Phi consistent with the axioms of IIT, and there should not be cases of Phi being ill-defined. This article presents three ways in which Phi, in its current formulation, fails to meet these standards, and discusses how this problem might be addressed.
Submitted 12 February, 2019;
originally announced February 2019.
-
Building Computational Models to Predict One-Year Mortality in ICU Patients with Acute Myocardial Infarction and Post Myocardial Infarction Syndrome
Authors:
Laura A. Barrett,
Seyedeh Neelufar Payrovnaziri,
Jiang Bian,
Zhe He
Abstract:
Heart disease remains the leading cause of death in the United States. Compared with risk assessment guidelines that require manual calculation of scores, machine learning-based prediction for disease outcomes such as mortality can be utilized to save time and improve prediction accuracy. This study built and evaluated various machine learning models to predict one-year mortality in patients diagnosed with acute myocardial infarction or post myocardial infarction syndrome in the MIMIC-III database. The results of the best performing shallow prediction models were compared to a deep feedforward neural network (Deep FNN) with back propagation. We included a cohort of 5436 admissions. Six datasets were developed and compared. The models applying Logistic Model Trees (LMT) and Simple Logistic algorithms to the combined dataset resulted in the highest prediction accuracy at 85.12% and the highest AUC at .901. In addition, other factors were observed to have an impact on outcomes as well.
Submitted 12 December, 2018;
originally announced December 2018.
-
Measuring Integrated Information: Comparison of Candidate Measures in Theory and Simulation
Authors:
Pedro A. M. Mediano,
Anil K. Seth,
Adam B. Barrett
Abstract:
Integrated Information Theory (IIT) is a prominent theory of consciousness that has at its centre measures that quantify the extent to which a system generates more information than the sum of its parts. While several candidate measures of integrated information (`$Φ$') now exist, little is known about how they compare, especially in terms of their behaviour on non-trivial network models. In this article we provide clear and intuitive descriptions of six distinct candidate measures. We then explore the properties of each of these measures in simulation on networks consisting of eight interacting nodes, animated with Gaussian linear autoregressive dynamics. We find a striking diversity in the behaviour of these measures -- no two measures show consistent agreement across all analyses. Further, only a subset of the measures appear to genuinely reflect some form of dynamical complexity, in the sense of simultaneous segregation and integration between system components. Our results help guide the operationalisation of IIT and advance the development of measures of integrated information that may have more general applicability.
Submitted 25 June, 2018;
originally announced June 2018.
-
Solved problems and remaining challenges for Granger causality analysis in neuroscience: A response to Stokes and Purdon (2017)
Authors:
Lionel Barnett,
Adam B. Barrett,
Anil K. Seth
Abstract:
Granger-Geweke causality (GGC) is a powerful and popular method for identifying directed functional (`causal') connectivity in neuroscience. In a recent paper, Stokes and Purdon [1] raise several concerns about its use. They make two primary claims: (1) that GGC estimates may be severely biased or of high variance, and (2) that GGC fails to reveal the full structural/causal mechanisms of a system. However, these claims rest, respectively, on an incomplete evaluation of the literature, and a misconception about what GGC can be said to measure. Here we explain how existing approaches (as implemented, for example, in our popular MVGC software [2,3]) resolve the first issue, and discuss the frequently-misunderstood distinction between functional and effective neural connectivity which underlies Stokes and Purdon's second claim.
[1] Patrick A. Stokes and Patrick L. Purdon (2017), A study of problems encountered in Granger causality analysis from a neuroscience perspective, Proc. Natl. Acad. Sci. USA 114(34):7063-7072.
[2] Lionel Barnett and Anil K. Seth (2012), The MVGC Multivariate Granger Causality Matlab toolbox, http://users.sussex.ac.uk/~lionelb/MVGC/
[3] Lionel Barnett and Anil K. Seth (2014), The MVGC multivariate Granger causality toolbox: A new approach to Granger-causal inference, J. Neurosci. Methods 223:50-68
Submitted 6 February, 2018; v1 submitted 26 August, 2017;
originally announced August 2017.
-
IB2d Reloaded: a more powerful Python and MATLAB implementation of the immersed boundary method
Authors:
Nicholas Battista,
Christopher Strickland,
Aaron Barrett,
Laura Miller
Abstract:
The immersed boundary method (IB) is an elegant way to fully couple the motion of a fluid and deformations of an immersed elastic structure. In that vein, the IB2d software allows for expedited explorations of fluid-structure interaction for beginners and veterans to the field of computational fluid dynamics (CFD). While most open source CFD codes are written in low level programming environments, IB2d was specifically written in high-level programming environments to make its accessibility extend beyond scientists with vast programming experience. Although introduced previously by Battista et al. 2015, many improvements and additions have been made to the software to allow for even more robust models of material properties for the elastic structures, including a data analysis package for both the fluid and immersed structure data, an improved time-stepping scheme for higher accuracy solutions, and functionality for modeling slight fluid density variations as given by the Boussinesq approximation.
Submitted 21 July, 2017;
originally announced July 2017.
-
Stability of zero-growth economics analysed with a Minskyan model
Authors:
Adam B. Barrett
Abstract:
As humanity is becoming increasingly confronted by Earth's finite biophysical limits, there is increasing interest in questions about the stability and equitability of a zero-growth capitalist economy, most notably: if one maintains a positive interest rate for loans, can a zero-growth economy be stable? This question has been explored on a few different macroeconomic models, and both `yes' and `no' answers have been obtained. However, economies can become unstable whether or not there is ongoing underlying growth in productivity with which to sustain growth in output. Here we attempt, for the first time, to assess via a model the relative stability of growth versus no-growth scenarios. The model employed draws from Keen's model of the Minsky financial instability hypothesis. The analysis focuses on dynamics as opposed to equilibrium, and scenarios of growth and no-growth of output (GDP) are obtained by tweaking a productivity growth input parameter. We confirm that, with or without growth, there can be both stable and unstable scenarios. To maintain stability, firms must not change their debt levels or target debt levels too quickly. Further, according to the model, the wages share is higher for zero-growth scenarios, although there are more frequent substantial drops in employment.
Submitted 7 November, 2017; v1 submitted 26 April, 2017;
originally announced April 2017.
-
A Model of Pathways to Artificial Superintelligence Catastrophe for Risk and Decision Analysis
Authors:
Anthony M. Barrett,
Seth D. Baum
Abstract:
An artificial superintelligence (ASI) is artificial intelligence that is significantly more intelligent than humans in all respects. While ASI does not currently exist, some scholars propose that it could be created sometime in the future, and furthermore that its creation could cause a severe global catastrophe, possibly even resulting in human extinction. Given the high stakes, it is important to analyze ASI risk and factor the risk into decisions related to ASI research and development. This paper presents a graphical model of major pathways to ASI catastrophe, focusing on ASI created via recursive self-improvement. The model uses the established risk and decision analysis modeling paradigms of fault trees and influence diagrams in order to depict combinations of events and conditions that could lead to AI catastrophe, as well as intervention options that could decrease risks. The events and conditions include select aspects of the ASI itself as well as the human process of ASI research, development, and management. Model structure is derived from published literature on ASI risk. The model offers a foundation for rigorous quantitative evaluation and decision making on the long-term risk of ASI catastrophe.
Submitted 25 July, 2016;
originally announced July 2016.
-
Exploration of synergistic and redundant information sharing in static and dynamical Gaussian systems
Authors:
Adam B. Barrett
Abstract:
To fully characterize the information that two `source' variables carry about a third `target' variable, one must decompose the total information into redundant, unique and synergistic components, i.e. obtain a partial information decomposition (PID). However Shannon's theory of information does not provide formulae to fully determine these quantities. Several recent studies have begun addressing this. Some possible definitions for PID quantities have been proposed, and some analyses have been carried out on systems composed of discrete variables. Here we present the first in-depth analysis of PIDs on Gaussian systems, both static and dynamical. We show that, for a broad class of Gaussian systems, previously proposed PID formulae imply that: (i) redundancy reduces to the minimum information provided by either source variable, and hence is independent of correlation between sources; (ii) synergy is the extra information contributed by the weaker source when the stronger source is known, and can either increase or decrease with correlation between sources. We find that Gaussian systems frequently exhibit net synergy, i.e. the information carried jointly by both sources is greater than the sum of informations carried by each source individually. Drawing from several explicit examples, we discuss the implications of these findings for measures of information transfer and information-based measures of complexity, both generally and within a neuroscience setting. Importantly, by providing independent formulae for synergy and redundancy applicable to continuous time-series data, we open up a new approach to characterizing and quantifying information sharing amongst complex system variables.
Submitted 30 April, 2015; v1 submitted 11 November, 2014;
originally announced November 2014.
-
An Integration of Integrated Information Theory with Fundamental Physics
Authors:
Adam B. Barrett
Abstract:
To truly eliminate Cartesian ghosts from the science of consciousness, we must describe consciousness as an aspect of the physical. Integrated Information Theory states that consciousness arises from intrinsic information generated by dynamical systems; however existing formulations of this theory are not applicable to standard models of fundamental physical entities. Modern physics has shown that fields are fundamental entities, and in particular that the electromagnetic field is fundamental. Here I hypothesize that consciousness arises from information intrinsic to fundamental fields. This hypothesis unites fundamental physics with what we know empirically about the neuroscience underlying consciousness, and it bypasses the need to consider quantum effects.
Submitted 3 July, 2014;
originally announced July 2014.
-
Multivariate Granger Causality and Generalized Variance
Authors:
Adam B. Barrett,
Lionel Barnett,
Anil K. Seth
Abstract:
Granger causality analysis is a popular method for inference on directed interactions in complex systems of many variables. A shortcoming of the standard framework for Granger causality is that it only allows for examination of interactions between single (univariate) variables within a system, perhaps conditioned on other variables. However, interactions do not necessarily take place between single variables, but may occur among groups, or "ensembles", of variables. In this study we establish a principled framework for Granger causality in the context of causal interactions among two or more multivariate sets of variables. Building on Geweke's seminal 1982 work, we offer new justifications for one particular form of multivariate Granger causality based on the generalized variances of residual errors. Taken together, our results support a comprehensive and theoretically consistent extension of Granger causality to the multivariate case. Treated individually, they highlight several specific advantages of the generalized variance measure, which we illustrate using applications in neuroscience as an example. We further show how the measure can be used to define "partial" Granger causality in the multivariate context and we also motivate reformulations of "causal density" and "Granger autonomy". Our results are directly applicable to experimental data and promise to reveal new types of functional relations in complex systems, neural and otherwise.
Submitted 13 April, 2010; v1 submitted 1 February, 2010;
originally announced February 2010.
-
Granger causality and transfer entropy are equivalent for Gaussian variables
Authors:
Lionel Barnett,
Adam B Barrett,
Anil K. Seth
Abstract:
Granger causality is a statistical notion of causal influence based on prediction via vector autoregression. Developed originally in the field of econometrics, it has since found application in a broader arena, particularly in neuroscience. More recently transfer entropy, an information-theoretic measure of time-directed information transfer between jointly dependent processes, has gained traction in a similarly wide field. While it has been recognized that the two concepts must be related, the exact relationship has until now not been formally described. Here we show that for Gaussian variables, Granger causality and transfer entropy are entirely equivalent, thus bridging autoregressive and information-theoretic approaches to data-driven causal inference.
Submitted 10 November, 2009; v1 submitted 23 October, 2009;
originally announced October 2009.
-
Shannon Information Capacity of Discrete Synapses
Authors:
Adam B. Barrett,
M. C. W. van Rossum
Abstract:
There is evidence that biological synapses have only a fixed number of discrete weight states. Memory storage with such synapses behaves quite differently from synapses with unbounded, continuous weights as old memories are automatically overwritten by new memories. We calculate the storage capacity of discrete, bounded synapses in terms of Shannon information. For optimal learning rules, we investigate how information storage depends on the number of synapses, the number of synaptic states and the coding sparseness.
Submitted 13 March, 2008;
originally announced March 2008.
-
Dynamics and robustness of familiarity memory
Authors:
J. M. Cortes,
A. Greve,
A. B. Barrett,
M. C. W. van Rossum
Abstract:
When one is presented with an item or a face, one can sometimes have a sense of recognition without being able to recall where or when one has encountered it before. This sense of recognition is known as familiarity. Following previous computational models of familiarity memory we investigate the dynamical properties of familiarity discrimination, and contrast two different familiarity discriminators: one based on the energy of the neural network, and the other based on the time derivative of the energy. We show how the familiarity signal decays after a stimulus is presented, and examine the robustness of the familiarity discriminator in the presence of random fluctuations in neural activity. For both discriminators we establish, via a combined method of signal-to-noise ratio and mean field analysis, how the maximum number of successfully discriminated stimuli depends on the noise level.
Submitted 6 October, 2007;
originally announced October 2007.
-
M-Theory on Manifolds with G_2 Holonomy
Authors:
Adam B. Barrett
Abstract:
We study M-theory on G_2 holonomy spaces that are constructed by dividing a seven-torus by some discrete symmetry group. We classify possible group elements that may be used in this construction and use them to find a set of possible orbifold groups that lead to co-dimension four singularities. We describe how to blow up such singularities, and then derive the moduli Kaehler potential for M-theory on the resulting class of G_2 manifolds. To consider the singular limit it is necessary to derive the supergravity action for M-theory on the orbifold C^2/Z_N. We do this by coupling 11-dimensional supergravity to a seven-dimensional Yang-Mills theory located on the orbifold fixed plane. We show that the resulting action is supersymmetric to leading non-trivial order in the 11-dimensional Newton constant. Obtaining this action enables us to then reduce M-theory on a toroidal G_2 orbifold with co-dimension four singularities, taking explicitly into account the additional gauge fields at the singularities. The four-dimensional effective theory has N=1 supersymmetry with non-Abelian N=4 gauge theory sub-sectors. We present explicit formulae for the Kaehler potential, gauge-kinetic function and superpotential. In the four-dimensional theory, blowing-up of the orbifold is described by continuation along D-flat directions. Using this interpretation, we demonstrate consistency of our results for singular G_2 spaces with corresponding ones obtained for smooth G_2 spaces. In addition, we consider the effects of switching on flux and Wilson lines on singular loci of the G_2 space, and we discuss the relation to N=4 SYM theory.
Submitted 11 December, 2006;
originally announced December 2006.
-
Four-dimensional Effective M-theory on a Singular G_2 Manifold
Authors:
Lara B Anderson,
Adam B Barrett,
Andre Lukas,
Masahiro Yamaguchi
Abstract:
We reduce M-theory on a G_2 orbifold with co-dimension four singularities, taking explicitly into account the additional gauge fields at the singularities. As a starting point, we use 11-dimensional supergravity coupled to seven-dimensional super-Yang-Mills theory, as derived in a previous paper. The resulting four-dimensional theory has N=1 supersymmetry with non-Abelian N=4 gauge theory sub-sectors. We present explicit formulae for the Kahler potential, gauge-kinetic function and superpotential. In the four-dimensional theory, blowing-up of the orbifold is described by a Higgs effect induced by continuation along D-flat directions. Using this interpretation, we show that our results are consistent with the corresponding ones obtained for smooth G_2 spaces. In addition, we consider the effects of switching on flux and Wilson lines on singular loci of the G_2 space, and we discuss the relation to N=4 SYM theory.
Submitted 20 September, 2006; v1 submitted 30 June, 2006;
originally announced June 2006.
-
M-Theory on the Orbifold C^2/Z_N
Authors:
Lara B Anderson,
Adam B Barrett,
Andre Lukas
Abstract:
We construct M-theory on the orbifold C^2/Z_N by coupling 11-dimensional supergravity to a seven-dimensional Yang-Mills theory located on the orbifold fixed plane. It is shown that the resulting action is supersymmetric to leading non-trivial order in the 11-dimensional Newton constant. This action provides the starting point for a reduction of M-theory on G_2 spaces with co-dimension four singularities.
Submitted 6 February, 2006;
originally announced February 2006.
-
Stability and Paradox in Algorithmic Logic
Authors:
Wayne Aitken,
Jeffrey A. Barrett
Abstract:
Type-free systems of logic are designed to consistently handle significant instances of self-reference. Some consistent type-free systems also have the feature of allowing the sort of general abstraction or comprehension principle that infamously leads to paradox in naive set theory. Because type-free systems possess these features, and avoid the hierarchy of types that is felt to be unnatural in some contexts, they have the potential to play an important role in the foundations of mathematics, the theory of classes (producing a richer notion of class than that currently used in set theory and category theory), property theory, natural language semantics, the theory of truth, and theoretical computer science. Clearly, type-free systems must depart from classical logic in some way, but there is little agreement on what kind of type-free system to use, and which departures from classical logic should be allowed.
Our approach to type-free logic is to study a naturally occurring type-free system that we believe is in some sense prototypical of systems that will ultimately prove useful. The logic studied in this paper, called algorithmic logic, concerns certain basic statements involving algorithms and the algorithmic rules of inference between such statements. This paper studies the propositional properties of algorithmic logic. A future paper will show that algorithmic logic possesses a general abstraction principle.
Submitted 28 December, 2005;
originally announced December 2005.
-
Classification and Moduli Kahler Potentials of G_2 Manifolds
Authors:
Adam B Barrett,
Andre Lukas
Abstract:
Compact manifolds of G_2 holonomy may be constructed by dividing a seven-torus by some discrete symmetry group and then blowing up the singularities of the resulting orbifold. We classify possible group elements that may be used in this construction and use this classification to find a set of possible orbifold groups. We then derive the moduli Kahler potential for M-theory on the resulting class of G_2 manifolds with blown up co-dimension four singularities.
Submitted 19 January, 2005; v1 submitted 5 November, 2004;
originally announced November 2004.
-
Kinematic Evidence of Minor Mergers in Normal Sa Galaxies: NGC3626, NGC3900, NGC4772 and NGC5854
Authors:
Martha P. Haynes,
Katherine P. Jore,
Elizabeth A. Barrett,
Adrick H. Broeils,
Brian M. Murray
Abstract:
BVRI and H-alpha imaging and long-slit optical spectroscopic data are presented for four morphologically normal and relatively isolated Sa galaxies, NGC3626, NGC3900, NGC4772 and NGC5854. VLA HI synthesis imaging is presented for the first 3 objects. In all 4 galaxies, evidence of kinematic decoupling of ionized gas components is found; the degree and circumstances of the distinct kinematics vary from complete counterrotation of all of the gas from all of the stars (NGC3626) to nuclear gas disks decoupled from the stars (NGC5854) to anomalous velocity central gas components (NGC3900 and NGC4772). In the 3 objects mapped in HI, the neutral gas extends far beyond the optical radius, R_HI/R_25 > 2. In general, the HI surface density is very low and the outer HI is patchy and asymmetric or found in a distinct ring, exterior to the optical edge. While the overall HI velocity fields are dominated by circular motions, strong warps are suggested in the outer regions. Optical imaging is also presented for NGC 4138 previously reported by Jore et al. (1996) to show counterrotating stellar components. The multiwavelength evidence is interpreted in terms of the kinematic "memory" of past minor mergers in objects that otherwise exhibit no morphological signs of interaction.
Submitted 25 April, 2000;
originally announced April 2000.
-
The Persistence of Memory: Surreal Trajectories in Bohm's Theory
Authors:
Jeffrey A. Barrett
Abstract:
In this paper I describe the history of the surreal trajectories problem and argue that in fact it is not a problem for Bohm's theory. More specifically, I argue that one can take the particle trajectories predicted by Bohm's theory to be the actual trajectories that particles follow and that there is no reason to suppose that good particle detectors are somehow fooled in the context of the surreal trajectories experiments. Rather than showing that Bohm's theory predicts the wrong particle trajectories or that it somehow prevents one from making reliable measurements, such experiments ultimately reveal the special role played by position and the fundamental incompatibility between Bohm's theory and relativity.
Submitted 16 February, 2000;
originally announced February 2000.