-
Investigating potential causes of Sepsis with Bayesian network structure learning
Authors:
Bruno Petrungaro,
Neville K. Kitson,
Anthony C. Constantinou
Abstract:
Sepsis is a life-threatening and serious global health issue. This study combines knowledge with available hospital data to investigate the potential causes of Sepsis that can be affected by policy decisions. We investigate the underlying causal structure of this problem by combining clinical expertise with score-based, constraint-based, and hybrid structure learning algorithms. A novel approach t…
▽ More
Sepsis is a life-threatening and serious global health issue. This study combines knowledge with available hospital data to investigate the potential causes of Sepsis that can be affected by policy decisions. We investigate the underlying causal structure of this problem by combining clinical expertise with score-based, constraint-based, and hybrid structure learning algorithms. A novel approach to model averaging and knowledge-based constraints was implemented to arrive at a consensus structure for causal inference. The structure learning process highlighted the importance of exploring data-driven approaches alongside clinical expertise. This includes discovering unexpected, although reasonable, relationships from a clinical perspective. Hypothetical interventions on Chronic Obstructive Pulmonary Disease, Alcohol dependence, and Diabetes suggest that the presence of any of these risk factors in patients increases the likelihood of Sepsis. This finding, alongside measuring the effect of these risk factors on Sepsis, has potential policy implications. Recognising the importance of prediction in improving Sepsis related health outcomes, the model built is also assessed in its ability to predict Sepsis. The predictions generated by the consensus model were assessed for their accuracy, sensitivity, and specificity. These three indicators all had results around 70%, and the AUC was 80%, which means the causal structure of the model is reasonably accurate given that the models were trained on data available for commissioning purposes only.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Investigating the validity of structure learning algorithms in identifying risk factors for intervention in patients with diabetes
Authors:
Sheresh Zahoor,
Anthony C. Constantinou,
Tim M Curtis,
Mohammed Hasanuzzaman
Abstract:
Diabetes, a pervasive and enduring health challenge, imposes significant global implications on health, financial healthcare systems, and societal well-being. This study undertakes a comprehensive exploration of various structural learning algorithms to discern causal pathways amongst potential risk factors influencing diabetes progression. The methodology involves the application of these algorit…
▽ More
Diabetes, a pervasive and enduring health challenge, imposes significant global implications on health, financial healthcare systems, and societal well-being. This study undertakes a comprehensive exploration of various structural learning algorithms to discern causal pathways amongst potential risk factors influencing diabetes progression. The methodology involves the application of these algorithms to relevant diabetes data, followed by the conversion of their output graphs into Causal Bayesian Networks (CBNs), enabling predictive analysis and the evaluation of discrepancies in the effect of hypothetical interventions within our context-specific case study.
This study highlights the substantial impact of algorithm selection on intervention outcomes. To consolidate insights from diverse algorithms, we employ a model-averaging technique that helps us obtain a unique causal model for diabetes derived from a varied set of structural learning algorithms. We also investigate how each of those individual graphs, as well as the average graph, compare to the structures elicited by a domain expert who categorised graph edges into high confidence, moderate, and low confidence types, leading into three individual graphs corresponding to the three levels of confidence.
The resulting causal model and data are made available online, and serve as a valuable resource and a guide for informed decision-making by healthcare practitioners, offering a comprehensive understanding of the interactions between relevant risk factors and the effect of hypothetical interventions. Therefore, this research not only contributes to the academic discussion on diabetes, but also provides practical guidance for healthcare professionals in develo** efficient intervention and risk management strategies.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
Spatially resolved random telegraph fluctuations of a single trap at the Si/SiO2 interface
Authors:
Megan Cowie,
Procopios C. Constantinou,
Neil J. Curson,
Taylor J. Z. Stock,
Peter Grutter
Abstract:
We use electrostatic force microscopy to spatially resolve random telegraph noise at the Si/SiO$_2$ interface. Our measurements demonstrate that two-state fluctuations are localized at interfacial traps, with bias-dependent rates and amplitudes. These two-level systems lead to correlated carrier number and mobility fluctuations with a range of characteristic timescales; taken together as an ensemb…
▽ More
We use electrostatic force microscopy to spatially resolve random telegraph noise at the Si/SiO$_2$ interface. Our measurements demonstrate that two-state fluctuations are localized at interfacial traps, with bias-dependent rates and amplitudes. These two-level systems lead to correlated carrier number and mobility fluctuations with a range of characteristic timescales; taken together as an ensemble, they give rise to a $1/f$ power spectral trend. Such individual defect fluctuations at the Si/SiO$_2$ interface impair the performance and reliability of nanoscale semiconductor devices, and will be a significant source of noise in semiconductor-based quantum sensors and computers. The fluctuations measured here are associated with a four-fold competition of rates, including slow two-state switching on the order of seconds and, in one state, fast switching on the order of nanoseconds which is associated with energy loss.
△ Less
Submitted 14 March, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
Unraveling how winds and surface heat fluxes control the Atlantic Ocean's meridional heat transport
Authors:
Dhruv Bhagtani,
Andrew McC. Hogg,
Ryan M. Holmes,
Navid C. Constantinou
Abstract:
The North Atlantic Ocean circulation, fueled by winds and surface buoyancy fluxes, carries 1.25$\,$PettaWatts of heat poleward in the subtropics, and plays an important role in regulating global weather and climate patterns. Using a series of simulations with perturbed surface forcing, we study how winds and surface heat flux gradients affect the Atlantic meridional heat transport. We decompose th…
▽ More
The North Atlantic Ocean circulation, fueled by winds and surface buoyancy fluxes, carries 1.25$\,$PettaWatts of heat poleward in the subtropics, and plays an important role in regulating global weather and climate patterns. Using a series of simulations with perturbed surface forcing, we study how winds and surface heat flux gradients affect the Atlantic meridional heat transport. We decompose the Atlantic meridional heat transport into contributions from circulation cells at warm and cold temperatures (resembling a subtropical gyre and the dense overturning circulation respectively), and a mixed circulation that contains water masses traversing both these cells. Variations in wind stress initially alter the amount of heat carried by the warm and mixed cells, but on long time scales ($>$10 years), changes in the temperature distribution restore the heat transport to equilibrium. Changes in surface buoyancy forcing control the cold cell's circulation, and its associated meridional heat transport, through high-latitude processes.
△ Less
Submitted 8 May, 2024; v1 submitted 25 January, 2024;
originally announced January 2024.
-
Single-Atom Control of Arsenic Incorporation in Silicon for High-Yield Artificial Lattice Fabrication
Authors:
Taylor J. Z. Stock,
Oliver Warschkow,
Procopios C. Constantinou,
David R. Bowler,
Steven R. Schofield,
Neil J. Curson
Abstract:
Artificial lattices constructed from individual dopant atoms within a semiconductor crystal hold promise to provide novel materials with tailored electronic, magnetic, and optical properties. These custom engineered lattices are anticipated to enable new, fundamental discoveries in condensed matter physics and lead to the creation of new semiconductor technologies including analog quantum simulato…
▽ More
Artificial lattices constructed from individual dopant atoms within a semiconductor crystal hold promise to provide novel materials with tailored electronic, magnetic, and optical properties. These custom engineered lattices are anticipated to enable new, fundamental discoveries in condensed matter physics and lead to the creation of new semiconductor technologies including analog quantum simulators and universal solid-state quantum computers. In this work, we report precise and repeatable, substitutional incorporation of single arsenic atoms into a silicon lattice. We employ a combination of scanning tunnelling microscopy hydrogen resist lithography and a detailed statistical exploration of the chemistry of arsine on the hydrogen terminated silicon (001) surface, to show that single arsenic dopants can be deterministically placed within four silicon lattice sites and incorporated with 97$\pm$2% yield. These findings bring us closer to the ultimate frontier in semiconductor technology: the deterministic assembly of atomically precise dopant and qubit arrays at arbitrarily large scales.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Topographically-generated near-internal waves as a response to winds over the ocean surface
Authors:
Ashley J. Barnes,
Callum J. Shakespeare,
Andy McC. Hogg,
Navid C. Constantinou
Abstract:
Internal waves propagate on the ocean stratification and carry energy and momentum through the ocean interior. The two most significant sources of these waves in the ocean are surface winds and oscillatory tidal flow across topography. We propose a hybrid of these two mechanisms, in which wind induced oscillations of sea surface and isopycnal heights are rapidly communicated to the seafloor via hy…
▽ More
Internal waves propagate on the ocean stratification and carry energy and momentum through the ocean interior. The two most significant sources of these waves in the ocean are surface winds and oscillatory tidal flow across topography. We propose a hybrid of these two mechanisms, in which wind induced oscillations of sea surface and isopycnal heights are rapidly communicated to the seafloor via hydrostatic pressure. In the presence of topography, the resulting oscillatory bottom velocity may then generate internal waves in a similar manner to the barotropic tide. We investigate this mechanism in an idealised numerical isopycnal model of a storm passing over a mid ocean ridge, and perform several perturbation experiments in which ocean and wind properties are varied. Bottom-generated internal waves are identified propagating away from the ridge in the wake of the storm. Estimates of the total wave energy suggest that in the right circumstances these waves could be a significant source of internal wave energy, with a local wind work to wave energy conversion rate of up to 50% of the corresponding conversion to surface generated near-inertial waves in our domain. Our results suggest a need for further investigation in less idealised scenarios to more precisely quantity this novel mechanism of deep ocean wave generation, and how it may affect abyssal mixing.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
Causal discovery using dynamically requested knowledge
Authors:
Neville K Kitson,
Anthony C Constantinou
Abstract:
Causal Bayesian Networks (CBNs) are an important tool for reasoning under uncertainty in complex real-world systems. Determining the graphical structure of a CBN remains a key challenge and is undertaken either by eliciting it from humans, using machine learning to learn it from data, or using a combination of these two approaches. In the latter case, human knowledge is generally provided to the a…
▽ More
Causal Bayesian Networks (CBNs) are an important tool for reasoning under uncertainty in complex real-world systems. Determining the graphical structure of a CBN remains a key challenge and is undertaken either by eliciting it from humans, using machine learning to learn it from data, or using a combination of these two approaches. In the latter case, human knowledge is generally provided to the algorithm before it starts, but here we investigate a novel approach where the structure learning algorithm itself dynamically identifies and requests knowledge for relationships that the algorithm identifies as uncertain during structure learning. We integrate this approach into the Tabu structure learning algorithm and show that it offers considerable gains in structural accuracy, which are generally larger than those offered by existing approaches for integrating knowledge. We suggest that a variant which requests only arc orientation information may be particularly useful where the practitioner has little preexisting knowledge of the causal relationships. As well as offering improved accuracy, the approach can use human expertise more effectively and contributes to making the structure learning process more transparent.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Resistless EUV lithography: photon-induced oxide patterning on silicon
Authors:
Li-Ting Tseng,
Prajith Karadan,
Dimitrios Kazazis,
Procopios C. Constantinou,
Taylor J. Z. Stock,
Neil J. Curson,
Steven R. Schofield,
Matthias Muntwiler,
Gabriel Aeppli,
Yasin Ekinci
Abstract:
In this work, we show the feasibility of extreme ultraviolet (EUV) patterning on an HF-treated Si(100) surface in the absence of a photoresist. EUV lithography is the leading lithography technique in semiconductor manufacturing due to its high resolution and throughput, but future progress in resolution can be hampered because of the inherent limitations of the resists. We show that EUV photons ca…
▽ More
In this work, we show the feasibility of extreme ultraviolet (EUV) patterning on an HF-treated Si(100) surface in the absence of a photoresist. EUV lithography is the leading lithography technique in semiconductor manufacturing due to its high resolution and throughput, but future progress in resolution can be hampered because of the inherent limitations of the resists. We show that EUV photons can induce surface reactions on a partially H-terminated Si surface and assist the growth of an oxide layer, which serves as an etch mask. This mechanism is different from the H-desorption in scanning tunneling microscopy-based lithography. We achieve SiO2/Si gratings with 75 nm half-pitch and 31 nm height, demonstrating the efficacy of the method and the feasibility of patterning with EUV lithography without the use of a photoresist. Further development of the resistless EUV lithography method can be a viable approach to nm-scale lithography by overcoming the inherent resolution and roughness limitations of photoresist materials.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Altermagnetic lifting of Kramers spin degeneracy
Authors:
J. Krempaský,
L. Šmejkal,
S. W. D'Souza,
M. Hajlaoui,
G. Springholz,
K. Uhlířová,
F. Alarab,
P. C. Constantinou,
V. Strokov,
D. Usanov,
W. R. Pudelko,
R. González-Hernández,
A. Birk Hellenes,
Z. Jansa,
H. Reichlová,
Z. Šobáň,
R. D. Gonzalez Betancourt,
P. Wadley,
J. Sinova,
D. Kriegner,
J. Minár,
J. H. Dil,
T. Jungwirth
Abstract:
Lifted Kramers spin-degeneracy has been among the central topics of condensed-matter physics since the dawn of the band theory of solids. It underpins established practical applications as well as current frontier research, ranging from magnetic-memory technology to topological quantum matter. Traditionally, lifted Kramers spin-degeneracy has been considered to originate from two possible internal…
▽ More
Lifted Kramers spin-degeneracy has been among the central topics of condensed-matter physics since the dawn of the band theory of solids. It underpins established practical applications as well as current frontier research, ranging from magnetic-memory technology to topological quantum matter. Traditionally, lifted Kramers spin-degeneracy has been considered to originate from two possible internal symmetry-breaking mechanisms. The first one refers to time-reversal symmetry breaking by magnetization of ferromagnets, and tends to be strong due to the non-relativistic exchange-coupling origin. The second mechanism applies to crystals with broken inversion symmetry, and tends to be comparatively weaker as it originates from the relativistic spin-orbit coupling. A recent theory work based on spin-symmetry classification has identified an unconventional magnetic phase, dubbed altermagnetic, that allows for lifting the Kramers spin degeneracy without net magnetization and inversion-symmetry breaking. Here we provide the confirmation using photoemission spectroscopy and ab initio calculations. We identify two distinct unconventional mechanisms of lifted Kramers spin degeneracy generated by the altermagnetic phase of centrosymmetric MnTe with vanishing net magnetization. Our observation of the altermagnetic lifting of the Kramers spin degeneracy can have broad consequences in magnetism. It motivates exploration and exploitation of the unconventional nature of this magnetic phase in an extended family of materials, ranging from insulators and semiconductors to metals and superconductors, that have been either identified recently or perceived for many decades as conventional antiferromagnets.
△ Less
Submitted 21 August, 2023;
originally announced August 2023.
-
Tuning structure learning algorithms with out-of-sample and resampling strategies
Authors:
Kiattikun Chobtham,
Anthony C. Constantinou
Abstract:
One of the challenges practitioners face when applying structure learning algorithms to their data involves determining a set of hyperparameters; otherwise, a set of hyperparameter defaults is assumed. The optimal hyperparameter configuration often depends on multiple factors, including the size and density of the usually unknown underlying true graph, the sample size of the input data, and the st…
▽ More
One of the challenges practitioners face when applying structure learning algorithms to their data involves determining a set of hyperparameters; otherwise, a set of hyperparameter defaults is assumed. The optimal hyperparameter configuration often depends on multiple factors, including the size and density of the usually unknown underlying true graph, the sample size of the input data, and the structure learning algorithm. We propose a novel hyperparameter tuning method, called the Out-of-sample Tuning for Structure Learning (OTSL), that employs out-of-sample and resampling strategies to estimate the optimal hyperparameter configuration for structure learning, given the input data set and structure learning algorithm. Synthetic experiments show that employing OTSL as a means to tune the hyperparameters of hybrid and score-based structure learning algorithms leads to improvements in graphical accuracy compared to the state-of-the-art. We also illustrate the applicability of this approach to real datasets from different disciplines.
△ Less
Submitted 24 June, 2023;
originally announced June 2023.
-
Spatially resolved dielectric loss at the Si/SiO$_2$ interface
Authors:
Megan Cowie,
Taylor J. Z. Stock,
Procopios C. Constantinou,
Neil Curson,
Peter Grütter
Abstract:
The Si/SiO$_2$ interface is populated by isolated trap states which modify its electronic properties. These traps are of critical interest for the development of semiconductor-based quantum sensors and computers, as well as nanoelectronic devices. Here, we study the electric susceptibility of the Si/SiO$_2$ interface with nm spatial resolution using frequency-modulated atomic force microscopy to m…
▽ More
The Si/SiO$_2$ interface is populated by isolated trap states which modify its electronic properties. These traps are of critical interest for the development of semiconductor-based quantum sensors and computers, as well as nanoelectronic devices. Here, we study the electric susceptibility of the Si/SiO$_2$ interface with nm spatial resolution using frequency-modulated atomic force microscopy to measure a patterned dopant delta-layer buried 2 nm beneath the silicon native oxide interface. We show that surface charge organization timescales, which range from 1-150 ns, increase significantly around interfacial states. We conclude that dielectric loss under time-varying gate biases at MHz and sub-MHz frequencies in metal-insulator-semiconductor capacitor device architectures is highly spatially heterogeneous over nm length scales.
Supplemental GIFs can be found at https://doi.org/10.6084/m9.figshare.25546687
△ Less
Submitted 4 April, 2024; v1 submitted 23 June, 2023;
originally announced June 2023.
-
Formulation and calibration of CATKE, a one-equation parameterization for microscale ocean mixing
Authors:
Gregory LeClaire Wagner,
Adeline Hillier,
Navid C. Constantinou,
Simone Silvestri,
Andre Souza,
Keaton Burns,
Chris Hill,
Jean-Michel Campin,
John Marshall,
Raffaele Ferrari
Abstract:
We describe CATKE, a parameterization for fluxes associated with small-scale or "microscale" ocean turbulent mixing on scales between 1 and 100 meters. CATKE uses a downgradient formulation that depends on a prognostic turbulent kinetic energy (TKE) variable and a diagnostic mixing length scale that includes a dynamic convective adjustment (CA) component. With its dynamic convective mixing length,…
▽ More
We describe CATKE, a parameterization for fluxes associated with small-scale or "microscale" ocean turbulent mixing on scales between 1 and 100 meters. CATKE uses a downgradient formulation that depends on a prognostic turbulent kinetic energy (TKE) variable and a diagnostic mixing length scale that includes a dynamic convective adjustment (CA) component. With its dynamic convective mixing length, CATKE predicts not just the depth spanned by convective plumes but also the characteristic convective mixing timescale, an important aspect of turbulent convection not captured by simpler static convective adjustment schemes. As a result, CATKE can describe the competition between convection and other processes such as shear-driven mixing and baroclinic restratification. To calibrate CATKE, we use Ensemble Kalman Inversion to minimize the error between 21 large eddy simulations (LES) and predictions of the LES data by CATKE-parameterized single column simulations at three different vertical resolutions. We find that CATKE makes accurate predictions of both idealized and realistic LES compared to microscale turbulence parameterizations commonly used in climate models.
△ Less
Submitted 22 June, 2024; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Intrinsically episodic Antarctic shelf intrusions of circumpolar deep water via canyons
Authors:
Ellie Q. Y. Ong,
Edward Doddridge,
Navid C. Constantinou,
Andrew McC. Hogg,
Matthew H. England
Abstract:
The structure of the Antarctic Slope Current at the continental shelf is crucial in governing the poleward transport of warm water. Canyons on the continental slope may provide a pathway for warm water to cross the slope current and intrude onto the continental shelf underneath ice shelves, which can increase rates of ice shelf melting, leading to reduced buttressing of ice shelves, accelerating g…
▽ More
The structure of the Antarctic Slope Current at the continental shelf is crucial in governing the poleward transport of warm water. Canyons on the continental slope may provide a pathway for warm water to cross the slope current and intrude onto the continental shelf underneath ice shelves, which can increase rates of ice shelf melting, leading to reduced buttressing of ice shelves, accelerating glacial flow and hence increased sea level rise. Observations and modelling studies of the Antarctic Slope Current and cross-shelf warm water intrusions are limited, particularly in the East Antarctica region. To explore this topic, an idealised configuration of the Antarctic Slope Current is developed, using an eddy-resolving isopycnal model that emulates the dynamics and topography of the East Antarctic sector. Warm water intrusions via canyons are found to occur in discrete episodes of large onshore flow induced by eddies, even in the absence of any temporal variability in external forcings, demonstrating the intrinsic nature of these intrusions to the slope current system. Canyon width is found to play a key role in modulating cross-shelf exchanges; warm water transport through narrower canyons is more irregular than transport through wider canyons. The intrinsically episodic cross-shelf transport is found to be driven by feedbacks between wind energy input and eddy generation in the Antarctic Slope Current. Improved understanding of the intrinsic variability of warm water intrusions can help guide future observational and modelling studies in the analysis of eddy impacts on Antarctic shelf circulation.
△ Less
Submitted 7 March, 2024; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Framework for phase transitions between the Maxwell and Gibbs constructions
Authors:
Constantinos Constantinou,
Tianqi Zhao,
Sophia Han,
Madappa Prakash
Abstract:
By taking the nucleon-to-quark phase transition within a neutron star as an example, we present a thermodynamically consistent method to calculate the equation of state of ambient matter so that transitions that are intermediate to those of the familiar Maxwell and Gibbs constructions can be described. This method does not address the poorly known surface tension between the two phases microscopic…
▽ More
By taking the nucleon-to-quark phase transition within a neutron star as an example, we present a thermodynamically consistent method to calculate the equation of state of ambient matter so that transitions that are intermediate to those of the familiar Maxwell and Gibbs constructions can be described. This method does not address the poorly known surface tension between the two phases microscopically (as, for example, in the calculation of the core pasta phases via the Wigner-Seitz approximation) but instead combines the local and global charge neutrality conditions characteristic of the Maxwell and Gibbs constructions, respectively. Overall charge neutrality is achieved by dividing the leptons to those that obey local charge neutrality (Maxwell) and those that maintain global charge neutrality (Gibbs). The equation of state is obtained by using equilibrium constraints derived from minimizing the total energy density. The results of this minimization are then used to calculate neutron star mass-radius curves, tidal deformabilities, equilibrium and adiabatic sound speeds, and nonradial $g$-mode oscillation frequencies for several intermediate constructions. Various quantities of interest transform smoothly from their Gibbs structures to those of Maxwell as the local-to-total electron ratio $η$, introduced to mimic the hadron-to-quark interface tension from $0$ (Gibbs) to $\infty$ (Maxwell), is raised from $0$ to $1$. A notable exception is the $g$-mode frequency for the specific case of $η=1$ for which a gap appears between the quark and hadronic branches.
△ Less
Submitted 16 April, 2023; v1 submitted 8 February, 2023;
originally announced February 2023.
-
Surface heating steers planetary-scale ocean circulation
Authors:
Dhruv Bhagtani,
Andrew McColl Hogg,
Ryan Mahony Holmes,
Navid C. Constantinou
Abstract:
Gyres are central features of large-scale ocean circulation and are involved in transporting tracers such as heat, nutrients, and carbon-dioxide within and across ocean basins. Traditionally, the gyre circulation is thought to be driven by surface winds and quantified via Sverdrup balance, but it has been proposed that surface buoyancy fluxes may also contribute to gyre forcing. Through a series o…
▽ More
Gyres are central features of large-scale ocean circulation and are involved in transporting tracers such as heat, nutrients, and carbon-dioxide within and across ocean basins. Traditionally, the gyre circulation is thought to be driven by surface winds and quantified via Sverdrup balance, but it has been proposed that surface buoyancy fluxes may also contribute to gyre forcing. Through a series of eddy-permitting global ocean model simulations with perturbed surface forcing, the relative contribution of wind stress and surface heat flux forcing to the large-scale ocean circulation is investigated, focusing on the subtropical gyres. In addition to gyre strength being linearly proportional to wind stress, it is shown that the gyre circulation is strongly impacted by variations in the surface heat flux (specifically, its meridional gradient) through a rearrangement of the ocean's buoyancy structure. On shorter timescales ($\sim$ decade), the gyre circulation anomalies are proportional to the magnitude of the surface heat flux gradient perturbation, with up to $\sim 0.15\,\mathrm{Sv}$ anomaly induced per $\mathrm{W}\,\mathrm{m}^{-2}$ change in the surface heat flux. On timescales longer than a decade, the gyre response to surface buoyancy flux gradient perturbations becomes non-linear as ocean circulation anomalies feed back onto the buoyancy structure induced by the surface buoyancy fluxes. These interactions complicate the development of a buoyancy-driven theory for the gyres to complement the Sverdrup relation. The flux-forced simulations underscore the importance of surface buoyancy forcing in steering the large-scale ocean circulation.
△ Less
Submitted 30 May, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
Nusselt number scaling in horizontal convection
Authors:
Navid C. Constantinou,
Cesar B. Rocha,
Stefan G. Llewellyn Smith,
William R. Young
Abstract:
We report a numerical study of horizontal convection (HC) at Prandtl number $Pr = 1$, with both no-slip and free-slip boundary conditions. We obtain 2D and 3D solutions and determine the relation between the Rayleigh number $Ra$ and the Nusselt number $Nu$. In 2D we vary $Ra$ between $0$ and $10^{14}$. In the range $10^6 \le Ra \le 10^{10}$ the $Nu$-$Ra$ relation is $Nu \sim Ra^{1/5}$. With $Ra$ g…
▽ More
We report a numerical study of horizontal convection (HC) at Prandtl number $Pr = 1$, with both no-slip and free-slip boundary conditions. We obtain 2D and 3D solutions and determine the relation between the Rayleigh number $Ra$ and the Nusselt number $Nu$. In 2D we vary $Ra$ between $0$ and $10^{14}$. In the range $10^6 \le Ra \le 10^{10}$ the $Nu$-$Ra$ relation is $Nu \sim Ra^{1/5}$. With $Ra$ greater than about $10^{11}$ we find a 2D regime with $Nu \sim Ra^{1/4}$ over three decades, up to the highest 2D $Ra$. In 3D, with maximum $Ra = 10^{11.5}$, we find only $Nu \sim Ra^{1/5}$. These results apply to both free slip and no slip boundary conditions. The $Nu \sim Ra^{1/4}$ regime has a double boundary layer (BL): there is a thin BL with thickness $\sim Ra^{-1/4}$ nested inside a thicker BL with thickness $\sim Ra^{-1/5}$. The $Ra^{-1/4}$ BL thickness, which determines $Nu$, coincides with the Kolmogorov and Batchelor scales of HC.
Numerical and theoretical results indicate that 3D HC is qualitatively and quantitatively similar to 2D HC. At the same $Ra$, the 3D $Nu$ exceeds the 2D $Nu$ by less than $20$%, i.e., there is very little 3D enhancement of heat transport. Boundary conditions are more important than dimensionality: the 2D free-slip solutions have larger $Nu$ than 3D no-slip solutions. Using the mechanical energy power integral of HC we show that the mean square vorticity of 3D HC is nearly equal to that of 2D HC at the same $Ra$. Thus vorticity amplification by strain-mediated vortex stretching does not operate in 3D HC.
△ Less
Submitted 7 June, 2023; v1 submitted 8 January, 2023;
originally announced January 2023.
-
Eigenvalue initialisation and regularisation for Koopman autoencoders
Authors:
Jack W. Miller,
Charles O'Neill,
Navid C. Constantinou,
Omri Azencot
Abstract:
Regularising the parameter matrices of neural networks is ubiquitous in training deep models. Typical regularisation approaches suggest initialising weights using small random values, and to penalise weights to promote sparsity. However, these widely used techniques may be less effective in certain scenarios. Here, we study the Koopman autoencoder model which includes an encoder, a Koopman operato…
▽ More
Regularising the parameter matrices of neural networks is ubiquitous in training deep models. Typical regularisation approaches suggest initialising weights using small random values, and to penalise weights to promote sparsity. However, these widely used techniques may be less effective in certain scenarios. Here, we study the Koopman autoencoder model which includes an encoder, a Koopman operator layer, and a decoder. These models have been designed and dedicated to tackle physics-related problems with interpretable dynamics and an ability to incorporate physics-related constraints. However, the majority of existing work employs standard regularisation practices. In our work, we take a step toward augmenting Koopman autoencoders with initialisation and penalty schemes tailored for physics-related settings. Specifically, we propose the "eigeninit" initialisation scheme that samples initial Koopman operators from specific eigenvalue distributions. In addition, we suggest the "eigenloss" penalty scheme that penalises the eigenvalues of the Koopman operator during training. We demonstrate the utility of these schemes on two synthetic data sets: a driven pendulum and flow past a cylinder; and two real-world problems: ocean surface temperatures and cyclone wind fields. We find on these datasets that eigenloss and eigeninit improves the convergence rate by up to a factor of 5, and that they reduce the cumulative long-term prediction error by up to a factor of 3. Such a finding points to the utility of incorporating similar schemes as an inductive bias in other physics-related deep learning approaches.
△ Less
Submitted 25 December, 2022; v1 submitted 22 December, 2022;
originally announced December 2022.
-
Stokes drift should not be added to ocean general circulation model velocities
Authors:
Gregory LeClaire Wagner,
Navid C. Constantinou,
Brandon G. Reichl
Abstract:
Studies of ocean surface transport often invoke the "Eulerian-mean hypothesis": that wave-agnostic general circulation models neglecting explicit surface waves effects simulate the Eulerian-mean ocean velocity time-averaged over surface wave oscillations. Acceptance of the Eulerian-mean hypothesis motivates reconstructing the total, Lagrangian-mean surface velocity by adding Stokes drift to model…
▽ More
Studies of ocean surface transport often invoke the "Eulerian-mean hypothesis": that wave-agnostic general circulation models neglecting explicit surface waves effects simulate the Eulerian-mean ocean velocity time-averaged over surface wave oscillations. Acceptance of the Eulerian-mean hypothesis motivates reconstructing the total, Lagrangian-mean surface velocity by adding Stokes drift to model output. Here, we show that the Eulerian-mean hypothesis is inconsistent, because wave-agnostic models cannot accurately simulate the Eulerian-mean velocity if Stokes drift is significant compared to the Eulerian-mean or Lagrangian-mean velocity. We conclude that Stokes drift should not be added to ocean general circulation model velocities. We additionally show the viability of the alternative "Lagrangian-mean hypothesis" using a theoretical argument and by comparing a wave-agnostic global ocean simulation with an explicitly wave-averaged simulation. We find that our wave-agnostic model accurately simulates the Lagrangian-mean velocity even though the Stokes drift is significant.
△ Less
Submitted 28 April, 2023; v1 submitted 16 October, 2022;
originally announced October 2022.
-
Non-destructive X-ray imaging of patterned delta-layer devices in silicon
Authors:
Nicolò D'Anna,
Dario Ferreira Sanchez,
Guy Matmon,
Jamie Bragg,
Procopios C. Constantinou,
Taylor J. Z. Stock,
Sarah Fearn,
Steven R. Schofield,
Neil J. Curson,
Marek Bartkowiak,
Y. Soh,
Daniel Grolimund,
Simon Gerber,
Gabriel Aeppli
Abstract:
The progress of miniaturisation in integrated electronics has led to atomic and nanometre-sized dopant devices in silicon. Such structures can be fabricated routinely by hydrogen resist lithography, using various dopants such as phosphorous and arsenic. However, the ability to non-destructively obtain atomic-species-specific images of the final structure, which would be an indispensable tool for b…
▽ More
The progress of miniaturisation in integrated electronics has led to atomic and nanometre-sized dopant devices in silicon. Such structures can be fabricated routinely by hydrogen resist lithography, using various dopants such as phosphorous and arsenic. However, the ability to non-destructively obtain atomic-species-specific images of the final structure, which would be an indispensable tool for building more complex nano-scale devices, such as quantum co-processors, remains an unresolved challenge. Here we exploit X-ray fluorescence to create an element-specific image of As dopants in silicon, with dopant densities in absolute units and a resolution limited by the beam focal size (here $\sim1~μ$m), without affecting the device's low temperature electronic properties. The As densities provided by the X-ray data are compared to those derived from Hall effect measurements as well as the standard non-repeatable, scanning tunnelling microscopy and secondary ion mass spectroscopy, techniques. Before and after the X-ray experiments, we also measured the magneto-conductance, dominated by weak localisation, a quantum interference effect extremely sensitive to sample dimensions and disorder. Notwithstanding the $1.5\times10^{10}$ Sv ($1.5\times10^{16}$ Rad/cm$^{-2}$) exposure of the device to X-rays, all transport data were unchanged to within experimental errors, corresponding to upper bounds of 0.2 Angstroms for the radiation-induced motion of the typical As atom and 3$\%$ for the loss of activated, carrier-contributing dopants. With next generation synchrotron radiation sources and more advanced optics, we foresee that it will be possible to obtain X-ray images of single dopant atoms within resolved radii of 5 nm.
△ Less
Submitted 14 April, 2023; v1 submitted 19 August, 2022;
originally announced August 2022.
-
The Impact of Variable Ordering on Bayesian Network Structure Learning
Authors:
Neville K Kitson,
Anthony C Constantinou
Abstract:
Causal Bayesian Networks provide an important tool for reasoning under uncertainty with potential application to many complex causal systems. Structure learning algorithms that can tell us something about the causal structure of these systems are becoming increasingly important. In the literature, the validity of these algorithms is often tested for sensitivity over varying sample sizes, hyper-par…
▽ More
Causal Bayesian Networks provide an important tool for reasoning under uncertainty with potential application to many complex causal systems. Structure learning algorithms that can tell us something about the causal structure of these systems are becoming increasingly important. In the literature, the validity of these algorithms is often tested for sensitivity over varying sample sizes, hyper-parameters, and occasionally objective functions. In this paper, we show that the order in which the variables are read from data can have much greater impact on the accuracy of the algorithm than these factors. Because the variable ordering is arbitrary, any significant effect it has on learnt graph accuracy is concerning, and this raises questions about the validity of the results produced by algorithms that are sensitive to, but have not been assessed against, different variable orderings.
△ Less
Submitted 12 April, 2024; v1 submitted 17 June, 2022;
originally announced June 2022.
-
Discovery and density estimation of latent confounders in Bayesian networks with evidence lower bound
Authors:
Kiattikun Chobtham,
Anthony C. Constantinou
Abstract:
Discovering and parameterising latent confounders represent important and challenging problems in causal structure learning and density estimation respectively. In this paper, we focus on both discovering and learning the distribution of latent confounders. This task requires solutions that come from different areas of statistics and machine learning. We combine elements of variational Bayesian me…
▽ More
Discovering and parameterising latent confounders represent important and challenging problems in causal structure learning and density estimation respectively. In this paper, we focus on both discovering and learning the distribution of latent confounders. This task requires solutions that come from different areas of statistics and machine learning. We combine elements of variational Bayesian methods, expectation-maximisation, hill-climbing search, and structure learning under the assumption of causal insufficiency. We propose two learning strategies; one that maximises model selection accuracy, and another that improves computational efficiency in exchange for minor reductions in accuracy. The former strategy is suitable for small networks and the latter for moderate size networks. Both learning strategies perform well relative to existing solutions.
△ Less
Submitted 22 August, 2022; v1 submitted 11 June, 2022;
originally announced June 2022.
-
Parallel Sampling for Efficient High-dimensional Bayesian Network Structure Learning
Authors:
Zhigao Guo,
Anthony C. Constantinou
Abstract:
Score-based algorithms that learn the structure of Bayesian networks can be used for both exact and approximate solutions. While approximate learning scales better with the number of variables, it can be computationally expensive in the presence of high dimensional data. This paper describes an approximate algorithm that performs parallel sampling on Candidate Parent Sets (CPSs), and can be viewed…
▽ More
Score-based algorithms that learn the structure of Bayesian networks can be used for both exact and approximate solutions. While approximate learning scales better with the number of variables, it can be computationally expensive in the presence of high dimensional data. This paper describes an approximate algorithm that performs parallel sampling on Candidate Parent Sets (CPSs), and can be viewed as an extension of MINOBS which is a state-of-the-art algorithm for structure learning from high dimensional data. The modified algorithm, which we call Parallel Sampling MINOBS (PS-MINOBS), constructs the graph by sampling CPSs for each variable. Sampling is performed in parallel under the assumption the distribution of CPSs is half-normal when ordered by Bayesian score for each variable. Sampling from a half-normal distribution ensures that the CPSs sampled are likely to be those which produce the higher scores. Empirical results show that, in most cases, the proposed algorithm discovers higher score structures than MINOBS when both algorithms are restricted to the same runtime limit.
△ Less
Submitted 19 February, 2022;
originally announced February 2022.
-
Quasi-normal g-modes of neutron stars with quarks
Authors:
Tianqi Zhao,
Constantinos Constantinou,
Prashanth Jaikumar,
Madappa Prakash
Abstract:
Quasi-normal oscillation modes of neutron stars provide a means to probe their interior composition using gravitational wave astronomy. We compute the frequencies and dam** times of composition-dependent core g-modes of neutron stars containing quark matter employing linearized perturbative equations of general relativity. We find that ignoring background metric perturbations due to the oscillat…
▽ More
Quasi-normal oscillation modes of neutron stars provide a means to probe their interior composition using gravitational wave astronomy. We compute the frequencies and dam** times of composition-dependent core g-modes of neutron stars containing quark matter employing linearized perturbative equations of general relativity. We find that ignoring background metric perturbations due to the oscillating fluid, as in the Cowling approximation, underestimates the g-mode frequency by up to 10% for higher mass stars, depending on the parameters of the nuclear equation of state and how the mixed phase is constructed. The g-mode frequencies are well-described by a linear scaling with the central lepton (or combined lepton and quark) fraction for nucleonic (hybrid) stars. Our findings suggest that neutron stars with and without quarks are manifestly different with regards to their quasi-normal g-mode spectrum, and may thus be distinguished from one another in future observations of gravitational waves from merging neutron stars.
△ Less
Submitted 2 February, 2022;
originally announced February 2022.
-
How winds and ocean currents influence the drift of floating objects
Authors:
Till J. W. Wagner,
Ian Eisenman,
Amanda M. Ceroli,
Navid C. Constantinou
Abstract:
Arctic icebergs, unconstrained sea ice floes, oil slicks, mangrove drifters, lost cargo containers, and other flotsam are known to move at 2-4% of the prevailing wind velocity relative to the water, despite vast differences in the material properties, shapes, and sizes of objects. Here, we revisit the roles of density, aspect ratio, and skin and form drag in determining how an object is driven by…
▽ More
Arctic icebergs, unconstrained sea ice floes, oil slicks, mangrove drifters, lost cargo containers, and other flotsam are known to move at 2-4% of the prevailing wind velocity relative to the water, despite vast differences in the material properties, shapes, and sizes of objects. Here, we revisit the roles of density, aspect ratio, and skin and form drag in determining how an object is driven by winds and water currents. Idealized theoretical considerations show that although substantial differences exist for end members of the parameter space (either very thin or thick and very light or dense objects), most realistic cases of floating objects drift at $\approx$3% of the free-stream wind velocity (measured outside an object's surface boundary layer). This relationship, known as a long-standing rule of thumb for the drift of various types of floating objects, arises from the square root of the ratio of the density of air to that of water. We support our theoretical findings with flume experiments using floating objects with a range of densities and shapes.
△ Less
Submitted 29 January, 2022;
originally announced January 2022.
-
Hybrid Bayesian network discovery with latent variables by scoring multiple interventions
Authors:
Kiattikun Chobtham,
Anthony C. Constantinou,
Neville K. Kitson
Abstract:
In Bayesian Networks (BNs), the direction of edges is crucial for causal reasoning and inference. However, Markov equivalence class considerations mean it is not always possible to establish edge orientations, which is why many BN structure learning algorithms cannot orientate all edges from purely observational data. Moreover, latent confounders can lead to false positive edges. Relatively few me…
▽ More
In Bayesian Networks (BNs), the direction of edges is crucial for causal reasoning and inference. However, Markov equivalence class considerations mean it is not always possible to establish edge orientations, which is why many BN structure learning algorithms cannot orientate all edges from purely observational data. Moreover, latent confounders can lead to false positive edges. Relatively few methods have been proposed to address these issues. In this work, we present the hybrid mFGS-BS (majority rule and Fast Greedy equivalence Search with Bayesian Scoring) algorithm for structure learning from discrete data that involves an observational data set and one or more interventional data sets. The algorithm assumes causal insufficiency in the presence of latent variables and produces a Partial Ancestral Graph (PAG). Structure learning relies on a hybrid approach and a novel Bayesian scoring paradigm that calculates the posterior probability of each directed edge being added to the learnt graph. Experimental results based on well-known networks of up to 109 variables and 10k sample size show that mFGS-BS improves structure learning accuracy relative to the state-of-the-art and it is computationally efficient.
△ Less
Submitted 17 October, 2022; v1 submitted 20 December, 2021;
originally announced December 2021.
-
Natural orbitals for the ab initio no-core configuration interaction approach
Authors:
Patrick J. Fasano,
Chrysovalantis Constantinou,
Mark A. Caprio,
Pieter Maris,
James P. Vary
Abstract:
Ab initio no-core configuration interaction (NCCI) calculations for the nuclear many-body problem have traditionally relied upon an antisymmetrized product (Slater determinant) basis built from harmonic oscillator orbitals. The accuracy of such calculations is limited by the finite dimensions which are computationally feasible for the truncated many-body space. We therefore seek to improve the acc…
▽ More
Ab initio no-core configuration interaction (NCCI) calculations for the nuclear many-body problem have traditionally relied upon an antisymmetrized product (Slater determinant) basis built from harmonic oscillator orbitals. The accuracy of such calculations is limited by the finite dimensions which are computationally feasible for the truncated many-body space. We therefore seek to improve the accuracy obtained for a given basis size by optimizing the choice of single-particle orbitals. Natural orbitals, which diagonalize the one-body density matrix, provide a basis which maximizes the occupation of low-lying orbitals, thus accelerating convergence in a configuration-interaction basis, while also possibly providing physical insight into the single-particle structure of the many-body wave function. We describe the implementation of natural orbitals in the NCCI framework, and examine the nature of the natural orbitals thus obtained, the properties of the resulting many-body wave functions, and the convergence of observables. After taking $^3\mathrm{He}$ as an illustrative testbed, we explore aspects of NCCI calculations with natural orbitals for the ground state of the $p$-shell neutron halo nucleus $^6\mathrm{He}$.
△ Less
Submitted 5 May, 2022; v1 submitted 7 December, 2021;
originally announced December 2021.
-
Effective and efficient structure learning with pruning and model averaging strategies
Authors:
Anthony C. Constantinou,
Yang Liu,
Neville K. Kitson,
Kiattikun Chobtham,
Zhigao Guo
Abstract:
Learning the structure of a Bayesian Network (BN) with score-based solutions involves exploring the search space of possible graphs and moving towards the graph that maximises a given objective function. Some algorithms offer exact solutions that guarantee to return the graph with the highest objective score, while others offer approximate solutions in exchange for reduced computational complexity…
▽ More
Learning the structure of a Bayesian Network (BN) with score-based solutions involves exploring the search space of possible graphs and moving towards the graph that maximises a given objective function. Some algorithms offer exact solutions that guarantee to return the graph with the highest objective score, while others offer approximate solutions in exchange for reduced computational complexity. This paper describes an approximate BN structure learning algorithm, which we call Model Averaging Hill-Climbing (MAHC), that combines two novel strategies with hill-climbing search. The algorithm starts by pruning the search space of graphs, where the pruning strategy can be viewed as an aggressive version of the pruning strategies that are typically applied to combinatorial optimisation structure learning problems. It then performs model averaging in the hill-climbing search process and moves to the neighbouring graph that maximises the objective function, on average, for that neighbouring graph and over all its valid neighbouring graphs. Comparisons with other algorithms spanning different classes of learning suggest that the combination of aggressive pruning with model averaging is both effective and efficient, particularly in the presence of data noise.
△ Less
Submitted 30 April, 2022; v1 submitted 1 December, 2021;
originally announced December 2021.
-
The effect of charge, isospin, and strangeness in the QCD phase diagram critical end point
Authors:
Krishna Aryal,
Constantinos Constantinou,
Ricardo L. S. Farias,
Veronica Dexheimer
Abstract:
In this work, we discuss the deconfinement phase transition to quark matter in hot/dense matter. We {examine} the effect that different charge fractions, isospin fractions, net strangeness, and chemical equilibrium with respect to leptons have on the position of the coexistence line between different phases. In particular, we investigate how different sets of conditions that describe matter in neu…
▽ More
In this work, we discuss the deconfinement phase transition to quark matter in hot/dense matter. We {examine} the effect that different charge fractions, isospin fractions, net strangeness, and chemical equilibrium with respect to leptons have on the position of the coexistence line between different phases. In particular, we investigate how different sets of conditions that describe matter in neutron stars and their mergers, or matter created in heavy-ion collisions affect the position of the critical end point, namely where the first-order phase transition becomes a crossover. We also present an introduction to the topic of critical points, including a review of recent {advances} concerning QCD critical points.
△ Less
Submitted 19 November, 2021; v1 submitted 29 September, 2021;
originally announced September 2021.
-
$g$-modes of neutron stars with hadron-to-quark crossover transitions
Authors:
Constantinos Constantinou,
Sophia Han,
Prashanth Jaikumar,
Madappa Prakash
Abstract:
We perform the first study of the principal core $g$-mode oscillation in hybrid stars containing quark matter, utilizing a crossover model for the hadron-to-quark transition inspired by lattice QCD. The ensuing results are compared with our recent findings of $g$-mode frequencies in hybrid stars with a first-order phase transition using Gibbs constructions. We find that models using Gibbs construc…
▽ More
We perform the first study of the principal core $g$-mode oscillation in hybrid stars containing quark matter, utilizing a crossover model for the hadron-to-quark transition inspired by lattice QCD. The ensuing results are compared with our recent findings of $g$-mode frequencies in hybrid stars with a first-order phase transition using Gibbs constructions. We find that models using Gibbs construction yield $g$-mode amplitudes and the associated gravitational energy radiated that dominate over those of the chosen crossover model owing to the distinct behaviors of the equilibrium and adiabatic sound speeds in the various models. Based on our results, we conclude that were $g$-modes to be detected in upgraded LIGO and Virgo detectors it would indicate a first-order phase transition akin to a Gibbs construction.
△ Less
Submitted 20 November, 2021; v1 submitted 28 September, 2021;
originally announced September 2021.
-
A survey of Bayesian Network structure learning
Authors:
Neville K. Kitson,
Anthony C. Constantinou,
Zhigao Guo,
Yang Liu,
Kiattikun Chobtham
Abstract:
Bayesian Networks (BNs) have become increasingly popular over the last few decades as a tool for reasoning under uncertainty in fields as diverse as medicine, biology, epidemiology, economics and the social sciences. This is especially true in real-world areas where we seek to answer complex questions based on hypothetical evidence to determine actions for intervention. However, determining the gr…
▽ More
Bayesian Networks (BNs) have become increasingly popular over the last few decades as a tool for reasoning under uncertainty in fields as diverse as medicine, biology, epidemiology, economics and the social sciences. This is especially true in real-world areas where we seek to answer complex questions based on hypothetical evidence to determine actions for intervention. However, determining the graphical structure of a BN remains a major challenge, especially when modelling a problem under causal assumptions. Solutions to this problem include the automated discovery of BN graphs from data, constructing them based on expert knowledge, or a combination of the two. This paper provides a comprehensive review of combinatoric algorithms proposed for learning BN structure from data, describing 74 algorithms including prototypical, well-established and state-of-the-art approaches. The basic approach of each algorithm is described in consistent terms, and the similarities and differences between them highlighted. Methods of evaluating algorithms and their comparative performance are discussed including the consistency of claims made in the literature. Approaches for dealing with data noise in real-world datasets and incorporating expert knowledge into the learning process are also covered.
△ Less
Submitted 25 October, 2022; v1 submitted 23 September, 2021;
originally announced September 2021.
-
Greedy structure learning from data that contain systematic missing values
Authors:
Yang Liu,
Anthony C. Constantinou
Abstract:
Learning from data that contain missing values represents a common phenomenon in many domains. Relatively few Bayesian Network structure learning algorithms account for missing data, and those that do tend to rely on standard approaches that assume missing data are missing at random, such as the Expectation-Maximisation algorithm. Because missing data are often systematic, there is a need for more…
▽ More
Learning from data that contain missing values represents a common phenomenon in many domains. Relatively few Bayesian Network structure learning algorithms account for missing data, and those that do tend to rely on standard approaches that assume missing data are missing at random, such as the Expectation-Maximisation algorithm. Because missing data are often systematic, there is a need for more pragmatic methods that can effectively deal with data sets containing missing values not missing at random. The absence of approaches that deal with systematic missing data impedes the application of BN structure learning methods to real-world problems where missingness are not random. This paper describes three variants of greedy search structure learning that utilise pairwise deletion and inverse probability weighting to maximally leverage the observed data and to limit potential bias caused by missing values. The first two of the variants can be viewed as sub-versions of the third and best performing variant, but are important in their own in illustrating the successive improvements in learning accuracy. The empirical investigations show that the proposed approach outperforms the commonly used and state-of-the-art Structural EM algorithm, both in terms of learning accuracy and efficiency, as well as both when data are missing at random and not at random.
△ Less
Submitted 20 May, 2022; v1 submitted 8 July, 2021;
originally announced July 2021.
-
The impact of prior knowledge on causal structure learning
Authors:
Anthony C. Constantinou,
Zhigao Guo,
Neville K. Kitson
Abstract:
Causal Bayesian networks have become a powerful technology for reasoning under uncertainty in areas that require transparency and explainability, by relying on causal assumptions that enable us to simulate hypothetical interventions. The graphical structure of such models can be estimated by structure learning algorithms, domain knowledge, or a combination of both. Various knowledge approaches hav…
▽ More
Causal Bayesian networks have become a powerful technology for reasoning under uncertainty in areas that require transparency and explainability, by relying on causal assumptions that enable us to simulate hypothetical interventions. The graphical structure of such models can be estimated by structure learning algorithms, domain knowledge, or a combination of both. Various knowledge approaches have been proposed in the literature that enable us to specify prior knowledge that constrains or guides these algorithms. This paper introduces some novel, and also describes some existing, knowledge-based approaches that enable us to combine structure learning with knowledge obtained from heterogeneous sources. We investigate the impact of these approaches on structure learning across different algorithms, case studies and settings that we might encounter in practice. Each approach is assessed in terms of effectiveness and efficiency, including graphical accuracy, model fitting, complexity, and runtime; making this the first paper that provides a comparative evaluation of a wide range of knowledge approaches for structure learning. Because the value of knowledge depends on what data are available, we illustrate the results both with limited and big data. While the overall results show that knowledge becomes less important with big data due to higher learning accuracy rendering knowledge less important, some of the knowledge approaches are found to be more important with big data. Amongst the main conclusions is the observation that reduced search space obtained from knowledge does not always imply reduced computational complexity, perhaps because the relationships implied by the data and knowledge are in tension.
△ Less
Submitted 12 March, 2023; v1 submitted 31 January, 2021;
originally announced February 2021.
-
How do some Bayesian Network machine learned graphs compare to causal knowledge?
Authors:
Anthony C. Constantinou,
Norman Fenton,
Martin Neil
Abstract:
The graph of a Bayesian Network (BN) can be machine learned, determined by causal knowledge, or a combination of both. In disciplines like bioinformatics, applying BN structure learning algorithms can reveal new insights that would otherwise remain unknown. However, these algorithms are less effective when the input data are limited in terms of sample size, which is often the case when working wit…
▽ More
The graph of a Bayesian Network (BN) can be machine learned, determined by causal knowledge, or a combination of both. In disciplines like bioinformatics, applying BN structure learning algorithms can reveal new insights that would otherwise remain unknown. However, these algorithms are less effective when the input data are limited in terms of sample size, which is often the case when working with real data. This paper focuses on purely machine learned and purely knowledge-based BNs and investigates their differences in terms of graphical structure and how well the implied statistical models explain the data. The tests are based on four previous case studies whose BN structure was determined by domain knowledge. Using various metrics, we compare the knowledge-based graphs to the machine learned graphs generated from various algorithms implemented in TETRAD spanning all three classes of learning. The results show that, while the algorithms produce graphs with much higher model selection score, the knowledge-based graphs are more accurate predictors of variables of interest. Maximising score fitting is ineffective in the presence of limited sample size because the fitting becomes increasingly distorted with limited data, guiding algorithms towards graphical patterns that share higher fitting scores and yet deviate considerably from the true graph. This highlights the value of causal knowledge in these cases, as well as the need for more appropriate fitting scores suitable for limited data. Lastly, the experiments also provide new evidence that support the notion that results from simulated data tell us little about actual real-world performance.
△ Less
Submitted 2 February, 2021; v1 submitted 25 January, 2021;
originally announced January 2021.
-
g-mode Oscillations in Hybrid Stars: A Tale of Two Sounds
Authors:
Prashanth Jaikumar,
Alexandra Semposki,
Madappa Prakash,
Constantinos Constantinou
Abstract:
We study the principal core g-mode oscillation in hybrid stars containing quark matter and find that they have an unusually large frequency range ($\approx$ 200 - 600 Hz) compared to ordinary neutron stars or self-bound quark stars of the same mass. Theoretical arguments and numerical calculations that trace this effect to the difference in the behaviour of the equilibrium and adiabatic sound spee…
▽ More
We study the principal core g-mode oscillation in hybrid stars containing quark matter and find that they have an unusually large frequency range ($\approx$ 200 - 600 Hz) compared to ordinary neutron stars or self-bound quark stars of the same mass. Theoretical arguments and numerical calculations that trace this effect to the difference in the behaviour of the equilibrium and adiabatic sound speeds in the mixed phase of quarks and nucleons are provided. We propose that the sensitivity of core g-mode oscillations to non-nucleonic matter in neutron stars could be due to the presence of a mixed quark-nucleon phase. Based on our analysis, we conclude that for binary mergers where one or both components may be a hybrid star, the fraction of tidal energy pumped into resonant g-modes in hybrid stars can exceed that of a normal neutron star by a factor of 2-3, although resonance occurs during the last stages of inspiral. A self-bound star, on the other hand, has a much weaker tidal overlap with the g-mode. The cumulative tidal phase error in hybrid stars, $Δφ\cong$ 0.5 rad, is comparable to that from tides in ordinary neutron stars, presenting a challenge in distinguishing between the two cases. However, should the principal g-mode be excited to sufficient amplitude for detection in a post-merger remnant with quark matter in its interior, its frequency would be a possible indication for the existence of non-nucleonic matter in neutron stars.
△ Less
Submitted 7 June, 2021; v1 submitted 15 January, 2021;
originally announced January 2021.
-
Intrinsic oceanic decadal variability of upper-ocean heat content
Authors:
Navid C. Constantinou,
Andrew McC. Hogg
Abstract:
Atmosphere and ocean are coupled via air-sea interactions. The atmospheric conditions fuel the ocean circulation and its variability, but the extent to which ocean processes can affect the atmosphere at decadal time scales remains unclear. In particular, such low-frequency variability is difficult to extract from the short observational record, meaning that climate models are the primary tools dep…
▽ More
Atmosphere and ocean are coupled via air-sea interactions. The atmospheric conditions fuel the ocean circulation and its variability, but the extent to which ocean processes can affect the atmosphere at decadal time scales remains unclear. In particular, such low-frequency variability is difficult to extract from the short observational record, meaning that climate models are the primary tools deployed to resolve this question. Here, we assess how the ocean's intrinsic variability leads to patterns of upper-ocean heat content that vary at decadal time scales. These patterns have the potential to feed back on the atmosphere and thereby affect climate modes of variability, such as El Niño or the Interdecadal Pacific Oscillation. We use the output from a global ocean-sea ice circulation model at three different horizontal resolutions, each driven by the same atmospheric reanalysis. To disentangle the variability of the ocean's direct response to atmospheric forcing from the variability due to intrinsic ocean dynamics, we compare model runs driven with inter-annually varying forcing (1958-2018) and model runs driven with repeat-year forcing. Models with coarse resolution that rely on eddy parameterizations, show (i) significantly reduced variance of the upper-ocean heat content at decadal time scales and (ii) differences in the spatial patterns of low-frequency variability compared with higher resolution models. Climate projections are typically done with general circulation models with coarse-resolution ocean components. Therefore, these biases affect our ability to predict decadal climate modes of variability and, in turn, hinder climate projections. Our results suggest that for improving climate projections, the community should move towards coupled climate models with higher oceanic resolution.
△ Less
Submitted 13 June, 2021; v1 submitted 14 December, 2020;
originally announced December 2020.
-
Deconfinement Phase Transition under Chemical Equilibrium
Authors:
Veronica Dexheimer,
Krishna Aryal,
Madison Wolf,
Constantinos Constantinou,
Ricardo L. S. Farias
Abstract:
In this work, we investigate how the assumption of chemical equilibrium with leptons affects the deconfinement phase transition to quark matter. This is done within the framework of the Chiral Mean Field model (CMF) allowing for non-zero net strangeness, corresponding to the conditions found in astrophysical scenarios. We build 3-dimensional QCD phase diagrams with temperature, baryon chemical pot…
▽ More
In this work, we investigate how the assumption of chemical equilibrium with leptons affects the deconfinement phase transition to quark matter. This is done within the framework of the Chiral Mean Field model (CMF) allowing for non-zero net strangeness, corresponding to the conditions found in astrophysical scenarios. We build 3-dimensional QCD phase diagrams with temperature, baryon chemical potential, and either charge or isospin fraction or chemical potential to show how the deconfinement region collapses to a line in the special case of chemical equilibrium, such as the one established the interior of cold catalyzed neutron stars.
△ Less
Submitted 23 November, 2020;
originally announced November 2020.
-
Improving Bayesian Network Structure Learning in the Presence of Measurement Error
Authors:
Yang Liu,
Anthony C. Constantinou,
ZhiGao Guo
Abstract:
Structure learning algorithms that learn the graph of a Bayesian network from observational data often do so by assuming the data correctly reflect the true distribution of the variables. However, this assumption does not hold in the presence of measurement error, which can lead to spurious edges. This is one of the reasons why the synthetic performance of these algorithms often overestimates real…
▽ More
Structure learning algorithms that learn the graph of a Bayesian network from observational data often do so by assuming the data correctly reflect the true distribution of the variables. However, this assumption does not hold in the presence of measurement error, which can lead to spurious edges. This is one of the reasons why the synthetic performance of these algorithms often overestimates real-world performance. This paper describes an algorithm that can be added as an additional learning phase at the end of any structure learning algorithm, and serves as a correction learning phase that removes potential false positive edges. The results show that the proposed correction algorithm successfully improves the graphical score of four well-established structure learning algorithms spanning different classes of learning in the presence of measurement error.
△ Less
Submitted 19 November, 2020;
originally announced November 2020.
-
3-Dimensional QCD Phase Diagrams for Strange Matter
Authors:
V. Dexheimer,
K. Aryal,
C. Constantinou,
J. Peterson
Abstract:
In this work, we examine in detail the difference between constraining the electric charge fraction and isospin fraction when calculating the deconfinement phase transition in the presence of net strangeness. We present relations among charge and isospin fractions and the corresponding chemical potentials and draw 3-dimensional QCD phase diagrams for matter out of weak equilibrium. Finally, we bri…
▽ More
In this work, we examine in detail the difference between constraining the electric charge fraction and isospin fraction when calculating the deconfinement phase transition in the presence of net strangeness. We present relations among charge and isospin fractions and the corresponding chemical potentials and draw 3-dimensional QCD phase diagrams for matter out of weak equilibrium. Finally, we briefly discuss how our results can be applied to comparisons of matter created in heavy ion collisions and binary neutron star mergers.
△ Less
Submitted 1 October, 2020;
originally announced October 2020.
-
Approximate learning of high dimensional Bayesian network structures via pruning of Candidate Parent Sets
Authors:
Zhigao Guo,
Anthony C. Constantinou
Abstract:
Score-based algorithms that learn Bayesian Network (BN) structures provide solutions ranging from different levels of approximate learning to exact learning. Approximate solutions exist because exact learning is generally not applicable to networks of moderate or higher complexity. In general, approximate solutions tend to sacrifice accuracy for speed, where the aim is to minimise the loss in accu…
▽ More
Score-based algorithms that learn Bayesian Network (BN) structures provide solutions ranging from different levels of approximate learning to exact learning. Approximate solutions exist because exact learning is generally not applicable to networks of moderate or higher complexity. In general, approximate solutions tend to sacrifice accuracy for speed, where the aim is to minimise the loss in accuracy and maximise the gain in speed. While some approximate algorithms are optimised to handle thousands of variables, these algorithms may still be unable to learn such high dimensional structures. Some of the most efficient score-based algorithms cast the structure learning problem as a combinatorial optimisation of candidate parent sets. This paper explores a strategy towards pruning the size of candidate parent sets, aimed at high dimensionality problems. The results illustrate how different levels of pruning affect the learning speed relative to the loss in accuracy in terms of model fitting, and show that aggressive pruning may be required to produce approximate solutions for high complexity problems.
△ Less
Submitted 10 September, 2020; v1 submitted 8 June, 2020;
originally announced June 2020.
-
Bayesian network structure learning with causal effects in the presence of latent variables
Authors:
Kiattikun Chobtham,
Anthony C. Constantinou
Abstract:
Latent variables may lead to spurious relationships that can be misinterpreted as causal relationships. In Bayesian Networks (BNs), this challenge is known as learning under causal insufficiency. Structure learning algorithms that assume causal insufficiency tend to reconstruct the ancestral graph of a BN, where bi-directed edges represent confounding and directed edges represent direct or ancestr…
▽ More
Latent variables may lead to spurious relationships that can be misinterpreted as causal relationships. In Bayesian Networks (BNs), this challenge is known as learning under causal insufficiency. Structure learning algorithms that assume causal insufficiency tend to reconstruct the ancestral graph of a BN, where bi-directed edges represent confounding and directed edges represent direct or ancestral relationships. This paper describes a hybrid structure learning algorithm, called CCHM, which combines the constraint-based part of cFCI with hill-climbing score-based learning. The score-based process incorporates Pearl s do-calculus to measure causal effects and orientate edges that would otherwise remain undirected, under the assumption the BN is a linear Structure Equation Model where data follow a multivariate Gaussian distribution. Experiments based on both randomised and well-known networks show that CCHM improves the state-of-the-art in terms of reconstructing the true ancestral graph.
△ Less
Submitted 18 August, 2020; v1 submitted 29 May, 2020;
originally announced May 2020.
-
Large-scale empirical validation of Bayesian Network structure learning algorithms with noisy data
Authors:
Anthony C. Constantinou,
Yang Liu,
Kiattikun Chobtham,
Zhigao Guo,
Neville K. Kitson
Abstract:
Numerous Bayesian Network (BN) structure learning algorithms have been proposed in the literature over the past few decades. Each publication makes an empirical or theoretical case for the algorithm proposed in that publication and results across studies are often inconsistent in their claims about which algorithm is 'best'. This is partly because there is no agreed evaluation approach to determin…
▽ More
Numerous Bayesian Network (BN) structure learning algorithms have been proposed in the literature over the past few decades. Each publication makes an empirical or theoretical case for the algorithm proposed in that publication and results across studies are often inconsistent in their claims about which algorithm is 'best'. This is partly because there is no agreed evaluation approach to determine their effectiveness. Moreover, each algorithm is based on a set of assumptions, such as complete data and causal sufficiency, and tend to be evaluated with data that conforms to these assumptions, however unrealistic these assumptions may be in the real world. As a result, it is widely accepted that synthetic performance overestimates real performance, although to what degree this may happen remains unknown. This paper investigates the performance of 15 structure learning algorithms. We propose a methodology that applies the algorithms to data that incorporates synthetic noise, in an effort to better understand the performance of structure learning algorithms when applied to real data. Each algorithm is tested over multiple case studies, sample sizes, types of noise, and assessed with multiple evaluation criteria. This work involved approximately 10,000 graphs with a total structure learning runtime of seven months. It provides the first large-scale empirical validation of BN structure learning algorithms under different assumptions of data noise. The results suggest that traditional synthetic performance may overestimate real-world performance by anywhere between 10% and more than 50%. They also show that while score-based learning is generally superior to constraint-based learning, a higher fitting score does not necessarily imply a more accurate causal graph. To facilitate comparisons with future studies, we have made all data, raw results, graphs and BN models freely available online.
△ Less
Submitted 11 September, 2020; v1 submitted 18 May, 2020;
originally announced May 2020.
-
Cause-and-effect of linear mechanisms sustaining wall turbulence
Authors:
Adrián Lozano-Durán,
Navid C. Constantinou,
Marios-Andreas Nikolaidis,
Michael Karp
Abstract:
Despite the nonlinear nature of turbulence, there is evidence that part of the energy-transfer mechanisms sustaining wall turbulence can be ascribed to linear processes. The different scenarios stem from linear stability theory and comprise exponential instabilities, neutral modes, transient growth from non-normal operators, and parametric instabilities from temporal mean-flow variations, among ot…
▽ More
Despite the nonlinear nature of turbulence, there is evidence that part of the energy-transfer mechanisms sustaining wall turbulence can be ascribed to linear processes. The different scenarios stem from linear stability theory and comprise exponential instabilities, neutral modes, transient growth from non-normal operators, and parametric instabilities from temporal mean-flow variations, among others. These mechanisms, each potentially capable of leading to the observed turbulence structure, are rooted in theoretical and conceptual arguments. Whether the flow follows any or a combination of them remains elusive. Here, we evaluate the linear mechanisms responsible for the energy transfer from the streamwise-averaged mean-flow ($\bf U$) to the fluctuating velocities ($\bf u'$). We use cause-and-effect analysis based on interventions. This is achieved by direct numerical simulation of turbulent channel flows at low Reynolds number, in which the energy transfer from $\bf U$ to $\bf u'$ is constrained to preclude a targeted linear mechanism. We show that transient growth is sufficient for sustaining realistic wall turbulence. Self-sustaining turbulence persists when exponential instabilities, neutral modes, and parametric instabilities of the mean flow are suppressed. We further show that a key component of transient growth is the Orr/push-over mechanism induced by spanwise variations of the base flow. Finally, we demonstrate that an ensemble of simulations with various frozen-in-time $\bf U$ arranged so that only transient growth is active, can faithfully represent the energy transfer from $\bf U$ to $\bf u'$ as in realistic turbulence. Our approach provides direct cause-and-effect evaluation of the linear energy-injection mechanisms from $\bf U$ to $\bf u'$ in the fully nonlinear system and simplifies the conceptual model of self-sustaining wall turbulence.
△ Less
Submitted 7 October, 2020; v1 submitted 9 May, 2020;
originally announced May 2020.
-
High-Energy Phase Diagrams with Charge and Isospin Axes under Heavy-Ion Collision and Stellar Conditions
Authors:
K. Aryal,
C. Constantinou,
R. L. S. Farias,
V. Dexheimer
Abstract:
We investigate the phase transition from hadron to quark matter in the general case without the assumption of chemical equilibrium. The effects of net strangeness on charge and isospin fractions, chemical potentials, and temperature are studied in the context of the Chiral Mean Field (CMF) model that incorporates chiral symmetry restoration and deconfinement. The extent to which these quantities a…
▽ More
We investigate the phase transition from hadron to quark matter in the general case without the assumption of chemical equilibrium. The effects of net strangeness on charge and isospin fractions, chemical potentials, and temperature are studied in the context of the Chiral Mean Field (CMF) model that incorporates chiral symmetry restoration and deconfinement. The extent to which these quantities are probed during deconfinement in conditions expected to exist in protoneutron stars, binary neutron-star mergers, and heavy-ion collisions is analyzed via the construction of 3-dimensional phase diagrams.
△ Less
Submitted 11 October, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Alternative physics to understand wall turbulence: Navier-Stokes equations with modified linear dynamics
Authors:
Adrán Lozano-Durán,
Marios-Andreas Nikolaidis,
Navid C. Constantinou,
Michael Karp
Abstract:
Despite the nonlinear nature of wall turbulence, there is evidence that the energy-injection mechanisms sustaining wall turbulence can be ascribed to linear processes. The different scenarios stem from linear stability theory and comprise exponential instabilities from mean-flow inflection points, transient growth from non-normal operators, and parametric instabilities from temporal mean-flow vari…
▽ More
Despite the nonlinear nature of wall turbulence, there is evidence that the energy-injection mechanisms sustaining wall turbulence can be ascribed to linear processes. The different scenarios stem from linear stability theory and comprise exponential instabilities from mean-flow inflection points, transient growth from non-normal operators, and parametric instabilities from temporal mean-flow variations, among others. These mechanisms, each potentially capable of leading to the observed turbulence structure, are rooted in simplified theories and conceptual arguments. Whether the flow follows any or a combination of them remains unclear. In the present study, we devise a collection of numerical experiments in which the Navier-Stokes equations are sensibly modified to quantify the role of the different linear mechanisms. This is achieved by direct numerical simulation of turbulent channel flows with constrained energy extraction from the streamwise-averaged mean-flow. We demonstrate that (i) transient growth alone is not sufficient to sustain wall turbulence and (ii) the flow remains turbulent when the exponential instabilities are suppressed. On the other hand, we show that (iii) transient growth combined with the parametric instability of the time-varying mean-flow is able to sustain turbulence.
△ Less
Submitted 13 December, 2019;
originally announced December 2019.
-
The Nusselt numbers of horizontal convection
Authors:
Cesar. B. Rocha,
Navid C. Constantinou,
Stefan G. Llewellyn Smith,
William R. Young
Abstract:
We consider the problem of horizontal convection in which non-uniform buoyancy, $b_{\rm s}(x,y)$, is imposed on the top surface of a container and all other surfaces are insulating. Horizontal convection produces a net horizontal flux of buoyancy, $\mathbf{J}$, defined by vertically and temporally averaging the interior horizontal flux of buoyancy. We show that…
▽ More
We consider the problem of horizontal convection in which non-uniform buoyancy, $b_{\rm s}(x,y)$, is imposed on the top surface of a container and all other surfaces are insulating. Horizontal convection produces a net horizontal flux of buoyancy, $\mathbf{J}$, defined by vertically and temporally averaging the interior horizontal flux of buoyancy. We show that $\overline{\mathbf{J}\cdot\mathbf{\nabla}b_{\rm s}}=-κ\langle|\boldsymbol{\nabla}b|^2\rangle$; overbar denotes a space-time average over the top surface, angle brackets denote a volume-time average and $κ$ is the molecular diffusivity of buoyancy $b$. This connection between $\mathbf{J}$ and $κ\langle|\boldsymbol{\nabla}b|^2\rangle$ justifies the definition of the horizontal-convective Nusselt number, $Nu$, as the ratio of $κ\langle|\boldsymbol{\nabla}b|^2\rangle$ to the corresponding quantity produced by molecular diffusion alone. We discuss the advantages of this definition of $Nu$ over other definitions of horizontal-convective Nusselt number currently in use. We investigate transient effects and show that $κ\langle|\boldsymbol{\nabla}b|^2\rangle$ equilibrates more rapidly than other global averages, such as the domain averaged kinetic energy and bottom buoyancy. We show that $κ\langle|\boldsymbol{\nabla} b|^2\rangle$ is essentially the volume-averaged rate of Boussinesq entropy production within the enclosure. In statistical steady state, the interior entropy production is balanced by a flux of entropy through the top surface. This leads to an equivalent "surface Nusselt number", defined as the surface average of vertical buoyancy flux through the top surface times the imposed surface buoyancy $b_{\rm s}(x,y)$. In experiments it is likely easier to evaluate the surface entropy flux, rather than the volume integral of $|\mathbf{\nabla}b|^2$ demanded by $κ\langle|\mathbf{\nabla}b|^2\rangle$.
△ Less
Submitted 24 March, 2020; v1 submitted 11 December, 2019;
originally announced December 2019.
-
Learning Bayesian networks from demographic and health survey data
Authors:
Neville Kenneth Kitson,
Anthony C. Constantinou
Abstract:
Child mortality from preventable diseases such as pneumonia and diarrhoea in low and middle-income countries remains a serious global challenge. We combine knowledge with available Demographic and Health Survey (DHS) data from India, to construct Causal Bayesian Networks (CBNs) and investigate the factors associated with childhood diarrhoea. We make use of freeware tools to learn the graphical str…
▽ More
Child mortality from preventable diseases such as pneumonia and diarrhoea in low and middle-income countries remains a serious global challenge. We combine knowledge with available Demographic and Health Survey (DHS) data from India, to construct Causal Bayesian Networks (CBNs) and investigate the factors associated with childhood diarrhoea. We make use of freeware tools to learn the graphical structure of the DHS data with score-based, constraint-based, and hybrid structure learning algorithms. We investigate the effect of missing values, sample size, and knowledge-based constraints on each of the structure learning algorithms and assess their accuracy with multiple scoring functions. Weaknesses in the survey methodology and data available, as well as the variability in the CBNs generated by the different algorithms, mean that it is not possible to learn a definitive CBN from data. However, knowledge-based constraints are found to be useful in reducing the variation in the graphs produced by the different algorithms, and produce graphs which are more reflective of the likely influential relationships in the data. Furthermore, valuable insights are gained into the performance and characteristics of the structure learning algorithms. Two score-based algorithms in particular, TABU and FGES, demonstrate many desirable qualities; a) with sufficient data, they produce a graph which is similar to the reference graph, b) they are relatively insensitive to missing values, and c) behave well with knowledge-based constraints. The results provide a basis for further investigation of the DHS data and for a deeper understanding of the behaviour of the structure learning algorithms when applied to real-world settings.
△ Less
Submitted 29 April, 2020; v1 submitted 2 December, 2019;
originally announced December 2019.
-
Atomic-Scale Patterning of Arsenic in Silicon by Scanning Tunneling Microscopy
Authors:
Taylor J. Z. Stock,
Oliver Warschkow,
Procopios C. Constantinou,
Juerong Li,
Sarah Fearn,
Eleanor Crane,
Emily V. S. Hofmann,
Alexander Kölker,
David R. McKenzie,
Steven R. Schofield,
Neil J. Curson
Abstract:
Over the last two decades, prototype devices for future classical and quantum computing technologies have been fabricated, by using scanning tunneling microscopy and hydrogen resist lithography to position phosphorus atoms in silicon with atomic-scale precision. Despite these successes, phosphine remains the only donor precursor molecule to have been demonstrated as compatible with the hydrogen re…
▽ More
Over the last two decades, prototype devices for future classical and quantum computing technologies have been fabricated, by using scanning tunneling microscopy and hydrogen resist lithography to position phosphorus atoms in silicon with atomic-scale precision. Despite these successes, phosphine remains the only donor precursor molecule to have been demonstrated as compatible with the hydrogen resist lithography technique. The potential benefits of atomic-scale placement of alternative dopant species have, until now, remained unexplored. In this work, we demonstrate successful fabrication of atomic-scale structures of arsenic-in-silicon. Using a scanning tunneling microscope tip, we pattern a monolayer hydrogen mask to selectively place arsenic atoms on the Si(001) surface using arsine as the precursor molecule. We fully elucidate the surface chemistry and reaction pathways of arsine on Si(001), revealing significant differences to phosphine. We explain how these differences result in enhanced surface immobilization and in-plane confinement of arsenic compared to phosphorus, and a dose-rate independent arsenic saturation density of $0.24{\pm}0.04$ ML. We demonstrate the successful encapsulation of arsenic delta-layers using silicon molecular beam epitaxy, and find electrical characteristics that are competitive with equivalent structures fabricated with phosphorus. Arsenic delta-layers are also found to offer improvement in out-of-plane confinement compared to similarly prepared phosphorus layers, while still retaining >80% carrier activation and sheet resistances of $<2 kΩ/{\square}$. These excellent characteristics of arsenic represent opportunities to enhance existing capabilities of atomic-scale fabrication of dopant structures in silicon, and are particularly important for three-dimensional devices, where vertical control of the position of device components is critical.
△ Less
Submitted 15 October, 2019;
originally announced October 2019.
-
Wall turbulence without modal instability of the streaks
Authors:
Adrián Lozano-Durán,
Marios-Andreas Nikolaidis,
Navid C. Constantinou,
Michael Karp
Abstract:
Despite the nonlinear nature of wall turbulence, there is evidence that the mechanism underlying the energy transfer from the mean flow to the turbulent fluctuations can be ascribed to linear processes. One of the most acclaimed linear instabilities for this energy transfer is the modal growth of perturbations with respect to the streamwise-averaged flow (or streaks). Here, we devise a numerical e…
▽ More
Despite the nonlinear nature of wall turbulence, there is evidence that the mechanism underlying the energy transfer from the mean flow to the turbulent fluctuations can be ascribed to linear processes. One of the most acclaimed linear instabilities for this energy transfer is the modal growth of perturbations with respect to the streamwise-averaged flow (or streaks). Here, we devise a numerical experiment in which the Navier--Stokes equations are sensibly modified to suppress these modal instabilities. Our results demonstrate that wall turbulence is sustained with realistic mean and fluctuating velocities despite the absence of streak instabilities.
△ Less
Submitted 12 September, 2019;
originally announced September 2019.
-
Eddy saturation of the Southern Ocean: a baroclinic versus barotropic perspective
Authors:
Navid C. Constantinou,
Andrew McC. Hogg
Abstract:
"Eddy saturation" is the regime in which the total time-mean volume transport of an oceanic current is relatively insensitive to the wind stress forcing and is often invoked as a dynamical description of Southern Ocean circulation. We revisit the problem of eddy saturation using a primitive-equations model in an idealized channel setup with bathymetry. We apply only mechanical wind stress forcing;…
▽ More
"Eddy saturation" is the regime in which the total time-mean volume transport of an oceanic current is relatively insensitive to the wind stress forcing and is often invoked as a dynamical description of Southern Ocean circulation. We revisit the problem of eddy saturation using a primitive-equations model in an idealized channel setup with bathymetry. We apply only mechanical wind stress forcing; there is no diapycnal mixing or surface buoyancy forcing. Our main aim is to assess the relative importance of two mechanisms for producing eddy saturated states: (i) the commonly invoked baroclinic mechanism that involves the competition of slo** isopycnals and restratification by production of baroclinic eddies, and (ii) the barotropic mechanism, that involves production of eddies through lateral shear instabilities or through the interaction of the barotropic current with bathymetric features. Our results suggest that the barotropic flow-component plays a crucial role in determining the total volume transport.
△ Less
Submitted 9 September, 2019; v1 submitted 20 June, 2019;
originally announced June 2019.
-
Treating quarks within neutron stars
Authors:
Sophia Han,
M. A. A. Mamun,
S. Lalit,
C. Constantinou,
M. Prakash
Abstract:
Neutron star interiors provide the opportunity to probe properties of cold dense matter in the QCD phase diagram. Utilizing models of dense matter in accord with nuclear systematics at nuclear densities, we investigate the compatibility of deconfined quark cores with current observational constraints on the maximum mass and tidal deformability of neutron stars. We explore various methods of implem…
▽ More
Neutron star interiors provide the opportunity to probe properties of cold dense matter in the QCD phase diagram. Utilizing models of dense matter in accord with nuclear systematics at nuclear densities, we investigate the compatibility of deconfined quark cores with current observational constraints on the maximum mass and tidal deformability of neutron stars. We explore various methods of implementing the hadron-to-quark phase transition, specifically, first-order transitions with sharp (Maxwell construction) and soft (Gibbs construction) interfaces, and smooth crossover transitions. We find that within the models we apply, hadronic matter has to be stiff for a first-order phase transition and soft for a crossover transition. In both scenarios and for the equations of state we employed, quarks appear at the center of pre-merger neutron stars in the mass range $\approx 1.0-1.6\,{\rm M}_{\odot}$, with a squared speed of sound $c^2_{\rm QM}\gtrsim 0.4$ characteristic of strong repulsive interactions required to support the recently discovered neutron star masses $\geq 2\,{\rm M}_{\odot}$. We also identify equations of state and phase transition scenarios that are consistent with the bounds placed on tidal deformations of neutron stars in the recent binary merger event GW170817. We emphasize that distinguishing hybrid stars with quark cores from normal hadronic stars is very difficult from the knowledge of masses and radii alone, unless drastic sharp transitions induce distinctive disconnected hybrid branches in the mass-radius relation.
△ Less
Submitted 31 October, 2019; v1 submitted 10 June, 2019;
originally announced June 2019.