-
Online Transfer Learning for RSV Case Detection
Authors:
Yiming Sun,
Yuhe Gao,
Runxue Bao,
Gregory F. Cooper,
Jessi Espino,
Harry Hochheiser,
Marian G. Michaels,
John M. Aronis,
Chenxi Song,
Ye Ye
Abstract:
Transfer learning has become a pivotal technique in machine learning and has proven to be effective in various real-world applications. However, utilizing this technique for classification tasks with sequential data often faces challenges, primarily attributed to the scarcity of class labels. To address this challenge, we introduce Multi-Source Adaptive Weighting (MSAW), an online multi-source transfer learning method. MSAW integrates a dynamic weighting mechanism into an ensemble framework, enabling automatic adjustment of weights based on the relevance and contribution of each source (representing historical knowledge) and target model (learning from newly acquired data). We demonstrate the effectiveness of MSAW by applying it to detect Respiratory Syncytial Virus cases within Emergency Department visits, utilizing multiple years of electronic health records from the University of Pittsburgh Medical Center. Our method demonstrates performance improvements over many baselines, including refining pre-trained models with online learning as well as three static weighting approaches, showing MSAW's capacity to integrate historical knowledge with progressively accumulated new data. This study indicates the potential of online transfer learning in healthcare, particularly for developing machine learning models that dynamically adapt to evolving situations where new data is incrementally accumulated.
Submitted 7 April, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
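Illustrative sketch (not from the paper): the abstract describes an ensemble whose weights over source and target models are adjusted automatically as new labelled data arrive. The exact MSAW update rule is not given here, so the code below swaps in a generic multiplicative-weights (Hedge-style) update driven by log loss. The function names `update_weights` and `ensemble_predict` and the learning rate `eta` are assumptions made for illustration only.

```python
import math


def update_weights(weights, probs, label, eta=1.0):
    """Hedge-style update: down-weight each model by its log loss on one example.

    weights: current ensemble weights (non-negative, summing to 1)
    probs:   each model's predicted probability of the positive class
    label:   observed class label, 0 or 1
    eta:     learning rate (hypothetical parameter, not from the paper)
    """
    eps = 1e-12  # guards against log(0)
    updated = []
    for w, p in zip(weights, probs):
        p_true = p if label == 1 else 1.0 - p
        loss = -math.log(max(p_true, eps))
        updated.append(w * math.exp(-eta * loss))
    total = sum(updated)
    return [w / total for w in updated]  # renormalise so weights sum to 1


def ensemble_predict(weights, probs):
    """Weighted average of the models' predicted probabilities."""
    return sum(w * p for w, p in zip(weights, probs))


# Two source models and one target model, starting with equal weights.
weights = [1 / 3, 1 / 3, 1 / 3]
stream = [([0.9, 0.5, 0.2], 1), ([0.1, 0.5, 0.7], 0), ([0.8, 0.5, 0.3], 1)]
for probs, label in stream:
    weights = update_weights(weights, probs, label)
```

In this toy stream the first model is consistently closest to the observed labels, so its weight grows at the expense of the other two. A static weighting scheme, by contrast, would keep the initial weights fixed.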
-
Moduli Spaces of Hyperplanar Admissible Flags in Projective Space
Authors:
George Cooper
Abstract:
We prove the existence of quasi-projective coarse moduli spaces parametrising certain complete flags of subschemes of a fixed projective space $\mathbb{P}(V)$ up to projective automorphisms. The flags of subschemes being parametrised are obtained by intersecting non-degenerate subvarieties of $\mathbb{P}(V)$ of dimension $n$ by flags of linear subspaces of $\mathbb{P}(V)$ of length $n$, with each positive dimension component of the flags being required to be non-singular and non-degenerate, and with the dimension $0$ components being required to satisfy a Chow stability condition. These moduli spaces are constructed using non-reductive Geometric Invariant Theory for actions of groups whose unipotent radical is graded, making use of a non-reductive analogue of quotienting-in-stages developed by Hoskins and Jackson.
Submitted 9 April, 2023; v1 submitted 5 April, 2023;
originally announced April 2023.
-
Determining Molecular Complexity using Assembly Theory and Spectroscopy
Authors:
Michael Jirasek,
Abhishek Sharma,
Jessica R. Bame,
S. Hessam M. Mehr,
Nicola Bell,
Stuart M. Marshall,
Cole Mathis,
Alasdair Macleod,
Geoffrey J. T. Cooper,
Marcel Swart,
Rosa Mollfulleda,
Leroy Cronin
Abstract:
Determining the complexity of molecules has important applications from molecular design to understanding the history of the process that led to the formation of the molecule. Currently, it is not possible to experimentally determine, without full structure elucidation, how complex a molecule is. Assembly Theory has been developed to quantify the complexity of a molecule by finding the shortest path to construct the molecule from building blocks, revealing its molecular assembly index (MA). In this study, we present an approach to rapidly and exhaustively calculate the MA of molecules from the spectroscopic measurements. We demonstrate that molecular complexity (MA) can be experimentally estimated using three independent techniques: nuclear magnetic resonance (NMR), tandem mass spectrometry (MS/MS), and infrared spectroscopy (IR), and these give consistent results with good correlations with the theoretically determined values from assembly theory. By identifying and analysing the number of absorbances in IR spectra, carbon resonances in NMR, or molecular fragments in tandem MS, the molecular assembly of an unknown molecule can be reliably estimated from experimental data. This represents the first experimentally quantifiable approach to defining molecular assembly, a reliable metric for complexity, as an intrinsic property of molecules and can also be performed on complex mixtures. This paves the way to use spectroscopic and spectrometric techniques to unambiguously detect alien life in the solar system, and beyond on exoplanets.
Submitted 7 November, 2023; v1 submitted 24 February, 2023;
originally announced February 2023.
-
GIT Constructions of Compactified Universal Jacobians over Stacks of Stable Maps
Authors:
George Cooper
Abstract:
We prove that any compactified universal Jacobian over any stack of stable maps, defined using torsion-free sheaves which are Gieseker semistable with respect to a relatively ample invertible sheaf over the universal curve, admits a projective good moduli space which can be constructed using GIT, and that the same is true for analogues parametrising semistable sheaves of higher rank. We also prove that for different choices of invertible sheaves, the corresponding good moduli spaces are related by a finite number of "Thaddeus flips". As a special case of our methods, we provide a new GIT construction of the universal Picard variety of Caporaso and Pandharipande.
Submitted 16 April, 2023; v1 submitted 20 October, 2022;
originally announced October 2022.
-
The m-connecting imset and factorization for ADMG models
Authors:
Bryan Andrews,
Gregory F. Cooper,
Thomas S. Richardson,
Peter Spirtes
Abstract:
Directed acyclic graph (DAG) models have become widely studied and applied in statistics and machine learning -- indeed, their simplicity facilitates efficient procedures for learning and inference. Unfortunately, these models are not closed under marginalization, making them poorly equipped to handle systems with latent confounding. Acyclic directed mixed graph (ADMG) models characterize margins of DAG models, making them far better suited to handle such systems. However, ADMG models have not seen wide-spread use due to their complexity and a shortage of statistical tools for their analysis. In this paper, we introduce the m-connecting imset which provides an alternative representation for the independence models induced by ADMGs. Furthermore, we define the m-connecting factorization criterion for ADMG models, characterized by a single equation, and prove its equivalence to the global Markov property. The m-connecting imset and factorization criterion provide two new statistical tools for learning and inference with ADMG models. We demonstrate the usefulness of these tools by formulating and evaluating a consistent scoring criterion with a closed form solution.
Submitted 18 July, 2022;
originally announced July 2022.
-
DMINR: A Tool to Support Journalists Information Verification and Exploration
Authors:
Andrew MacFarlane,
Marisela Gutierrez-Lopez,
Stephann Makri,
Tim Atwell,
Sondess Missaoui,
Colin Porlezza,
Glenda Cooper
Abstract:
Journalists are key information workers who have specific requirements from information systems to support the verification and exploration of information. We overview the DMINR tool that has been designed and developed to meet the needs of journalists through the examination of journalists' information behaviour in a newsroom. We outline our co-design process as well as the design, implementation and deployment of the tool. We report a usability test on the tool and conclude with details of how to develop the tool further.
Submitted 28 April, 2022;
originally announced April 2022.
-
Direct measurement of optical properties of glacier ice using a photon-counting diffuse LiDAR
Authors:
Markus Allgaier,
Matthew G. Cooper,
Anders E. Carlson,
Sarah W. Cooley,
Jonathan C. Ryan,
Brian J. Smith
Abstract:
The production of meltwater from glacier ice, which is exposed at the margins of land ice during the summer, is responsible for a large proportion of glacier mass loss. The rate of meltwater production from glacier ice is especially sensitive to its physical structure and chemical composition which combine to determine the albedo of glacier ice. However, the optical properties of near-surface glacier ice are not well known since most prior work has focused on ice made in the laboratory or from deep cores. Here, we demonstrate a measurement technique based on diffuse propagation of nanosecond-duration laser pulses in near-surface glacier ice that enables the independent measurement of the scattering and absorption coefficients, allowing for a complete description of the processes governing radiative transfer. We employ a photon-counting detector to overcome the high losses associated with diffuse optics. The instrument is highly portable and rugged, making it optimally suited for deployment in remote regions. A set of measurements taken on Collier Glacier, Oregon, serves as a demonstration of the technique. These measurements provide insight into both physical structure and composition of near-surface glacier ice and open new avenues for the analysis of light-absorbing impurities and remote sensing of the cryosphere.
Submitted 24 February, 2022; v1 submitted 19 January, 2022;
originally announced January 2022.
-
Causal Markov Boundaries
Authors:
Sofia Triantafillou,
Fattaneh Jabbari,
Greg Cooper
Abstract:
Feature selection is an important problem in machine learning, which aims to select variables that lead to an optimal predictive model. In this paper, we focus on feature selection for post-intervention outcome prediction from pre-intervention variables. We are motivated by healthcare settings, where the goal is often to select the treatment that will maximize a specific patient's outcome; however, we often do not have sufficient randomized control trial data to identify well the conditional treatment effect. We show how we can use observational data to improve feature selection and effect estimation in two cases: (a) using observational data when we know the causal graph, and (b) when we do not know the causal graph but have observational and limited experimental data. Our paper extends the notion of Markov boundary to treatment-outcome pairs. We provide theoretical guarantees for the methods we introduce. In simulated data, we show that combining observational and experimental data improves feature selection and effect estimation.
Submitted 12 March, 2021;
originally announced March 2021.
-
Subcritical dynamos in rapidly-rotating planar convection
Authors:
Robert G. Cooper,
Paul J. Bushby,
Celine Guervilly
Abstract:
We study dynamo action using numerical simulations of planar Boussinesq convection at rapid rotation (low Ekman numbers, Ek), focusing on subcritical dynamo action in which the dynamo is sustained for Rayleigh numbers, Ra, below the critical Rayleigh number for the onset of nonmagnetic convection, Ra$_c$. These solutions are found by first investigating the supercritical regime, in which the dynamo is able to generate a large-scale magnetic field that significantly influences the convective motions, with an associated Elsasser number of order Ek$^{1/3}$. Subcritical solutions are then found by tracking this solution branch into the subcritical regime, taking a supercritical solution and then gradually lowering the corresponding Rayleigh number. We show that decreasing the Ekman number leads to an extension of the subcritical range of Ra/Ra$_c$, down to an optimal value of Ek$=10^{-5}$. For magnetic Prandtl numbers of order unity, subcriticality is then hampered by the emergence of a large-scale mode at lower Ekman numbers when the dynamo driven by the smaller scale convection generates relatively stronger large-scale magnetic field. The inability of the large-scale mode to sustain dynamo action leads to an intermittent behaviour that appears to inhibit subcriticality. The subcritical solutions are also sensitive to the value of the magnetic Reynolds number (or equivalently, the magnetic Prandtl number, Pm), as values of the magnetic Reynolds number greater than 70 are required to produce dynamo action, but large values lead to fluctuations that are able to push the system too far from the subcritical branch and towards the trivial conducting state.
Submitted 12 November, 2020; v1 submitted 2 November, 2020;
originally announced November 2020.
-
D-dimensional oscillators in simplicial structures: odd and even dimensions display different synchronization scenarios
Authors:
X. Dai,
K. Kovalenko,
M. Molodyk,
Z. Wang,
X. Li,
D. Musatov,
A. M. Raigorodskii,
K. Alfaro-Bittner,
G. D. Cooper,
G. Bianconi,
S. Boccaletti
Abstract:
From biology to social science, the functioning of a wide range of systems is the result of elementary interactions which involve more than two constituents, so that their description unavoidably has to go beyond simple pairwise relationships. Simplicial complexes are therefore the mathematical objects providing a faithful representation of such systems. We here present a complete theory of synchronization of $D$-dimensional oscillators obeying an extended Kuramoto model, and interacting by means of 1- and 2-simplices. Not only does our theory fully describe and unveil the intimate reasons and mechanisms for what was observed so far with pairwise interactions, but it also offers predictions for a series of rich and novel behaviors in simplicial structures, which include: a) a discontinuous de-synchronization transition at positive values of the coupling strength for all dimensions, b) an extra discontinuous transition at zero coupling for all odd dimensions, and c) the occurrence of partially synchronized states at $D=2$ (and all odd $D$) even for negative values of the coupling strength, a feature which is inherently prohibited with pairwise interactions. Furthermore, our theory untangles several aspects of the emergent behavior: the system can never fully synchronize from disorder, and is characterized by an extreme multi-stability, in that the asymptotic stationary synchronized states always depend on the initial conditions. All our theoretical predictions are fully corroborated by extensive numerical simulations. Our results elucidate the dramatic and novel effects that higher-order interactions may induce in the collective dynamics of ensembles of coupled $D$-dimensional oscillators, and can therefore be of value and interest for the understanding of many phenomena observed in nature, such as the swarming and/or flocking processes unfolding in three or more dimensions.
Submitted 28 October, 2020;
originally announced October 2020.
-
Observations of Pressure Anisotropy Effects within Semi-Collisional Magnetized-Plasma Bubbles
Authors:
E. R. Tubman,
A. S. Joglekar,
A. F. A. Bott,
M. Borghesi,
B. Coleman,
G. Cooper,
C. N. Danson,
P. Durey,
J. M. Foster,
P. Graham,
G. Gregori,
E. T. Gumbrell,
M. P. Hill,
T. Hodge,
S. Kar,
R. J. Kingham,
M. Read,
C. P. Ridgers,
J. Skidmore,
C. Spindloe,
A. G. R. Thomas,
P. Treadwell,
S. Wilson,
L. Willingale,
N. C. Woolsey
Abstract:
Magnetized plasma interactions are ubiquitous in astrophysical and laboratory plasmas. Various physical effects have been shown to be important within colliding plasma flows influenced by opposing magnetic fields; however, experimental verification of the mechanisms within the interaction region has remained elusive. Here we discuss a laser-plasma experiment whereby experimental results verify that Biermann battery generated magnetic fields are advected by Nernst flows and anisotropic pressure effects dominate these flows in a reconnection region. These fields are mapped using time-resolved proton probing in multiple directions. Various experimental, modelling and analytical techniques demonstrate the importance of anisotropic pressure in semi-collisional, high-$β$ plasmas, causing a reduction in the magnitude of the reconnecting fields when compared to resistive processes. Anisotropic pressure dynamics are crucial in collisionless plasmas, but are often neglected in collisional plasmas. We show pressure anisotropy to be essential in maintaining the interaction layer, redistributing magnetic fields even for semi-collisional, high energy density physics (HEDP) regimes.
Submitted 19 October, 2020;
originally announced October 2020.
-
Fine Structure of the Isovector Giant Dipole Resonance in $^{142-150}$Nd and $^{152}$Sm
Authors:
L. M. Donaldson,
J. Carter,
P. von Neumann-Cosel,
V. O. Nesterenko,
R. Neveling,
P. -G. Reinhard,
I. T. Usman,
P. Adsley,
C. A. Bertulani,
J. W. Brümmer,
E. Z. Buthelezi,
G. R. J. Cooper,
R. W. Fearick,
S. V. Förtsch,
H. Fujita,
Y. Fujita,
M. Jingo,
N. Y. Kheswa,
W. Kleinig,
C. O. Kureba,
J. Kvasil,
M. Latif,
K. C. W. Li,
J. P. Mira,
F. Nemulodi
, et al. (13 additional authors not shown)
Abstract:
Background: Inelastic proton scattering at energies of a few hundred MeV and very-forward angles including $0^\circ$ has been established as a tool to study electric-dipole strength distributions in nuclei. The present work reports a systematic investigation of the chain of stable even-mass Nd isotopes representing a transition from spherical to quadrupole-deformed nuclei.
Purpose: Extraction of the equivalent photo-absorption cross sections and analysis of their fine structure in the energy region of the IsoVector Giant Dipole Resonance (IVGDR).
Method: Proton inelastic scattering reactions of 200 MeV protons were measured at iThemba LABS in Cape Town, South Africa. The scattering products were momentum-analysed by the K600 magnetic spectrometer positioned at $θ_{\mathrm{Lab}}=0^\circ$. Using dispersion-matching techniques, energy resolutions of $ΔE \approx 40 - 50$ keV were obtained. After subtraction of background and contributions from other multipoles, the spectra were converted to photo-absorption cross sections using the equivalent virtual-photon method.
Results: Wavelet-analysis techniques are used to extract characteristic energy scales of the fine structure of the IVGDR from the experimental data. Comparisons with the Quasiparticle-Phonon Model (QPM) and Skyrme Separable Random Phase Approximation (SSRPA) predictions provide insight into the role of different giant resonance damping mechanisms.
Conclusions: Fine structure is observed even for the most deformed nuclei studied. Fragmentation of the one particle-one hole ($1p1h$) strength seems to be the main source of fine structure in both spherical and deformed nuclei. Some impact of the spreading due to coupling of the two particle-two hole ($2p2h$) states to the $1p1h$ doorway states is seen in the spherical/transitional nuclei, where calculations beyond the $1p1h$ level are available.
Submitted 4 January, 2021; v1 submitted 2 October, 2020;
originally announced October 2020.
-
Learning Adjustment Sets from Observational and Limited Experimental Data
Authors:
Sofia Triantafillou,
Gregory Cooper
Abstract:
Estimating causal effects from observational data is not always possible due to confounding. Identifying a set of appropriate covariates (adjustment set) and adjusting for their influence can remove confounding bias; however, such a set is typically not identifiable from observational data alone. Experimental data do not have confounding bias, but are typically limited in sample size and can therefore yield imprecise estimates. Furthermore, experimental data often include a limited set of covariates, and therefore provide limited insight into the causal structure of the underlying system. In this work we introduce a method that combines large observational and limited experimental data to identify adjustment sets and improve the estimation of causal effects. The method identifies an adjustment set (if possible) by calculating the marginal likelihood for the experimental data given observationally-derived prior probabilities of potential adjustment sets. In this way, the method can make inferences that are not possible using only the conditional dependencies and independencies in all the observational and experimental data. We show that the method successfully identifies adjustment sets and improves causal effect estimation in simulated data, and it can sometimes make additional inferences when compared to state-of-the-art methods for combining experimental and observational data.
Submitted 17 November, 2020; v1 submitted 18 May, 2020;
originally announced May 2020.
-
Learning Latent Causal Structures with a Redundant Input Neural Network
Authors:
Jonathan D. Young,
Bryan Andrews,
Gregory F. Cooper,
Xinghua Lu
Abstract:
Most causal discovery algorithms find causal structure among a set of observed variables. Learning the causal structure among latent variables remains an important open problem, particularly when using high-dimensional data. In this paper, we address a problem for which it is known that inputs cause outputs, and these causal relationships are encoded by a causal network among a set of an unknown number of latent variables. We developed a deep learning model, which we call a redundant input neural network (RINN), with a modified architecture and a regularized objective function to find causal relationships between input, hidden, and output variables. More specifically, our model allows input variables to directly interact with all latent variables in a neural network to influence what information the latent variables should encode in order to generate the output variables accurately. In this setting, the direct connections between input and latent variables make the latent variables partially interpretable; furthermore, the connectivity among the latent variables in the neural network serves to model their potential causal relationships to each other and to the output variables. A series of simulation experiments provide support that the RINN method can successfully recover latent causal structure between input and output variables.
Submitted 8 September, 2020; v1 submitted 29 March, 2020;
originally announced March 2020.
-
Twisting moduli for GL(2)
Authors:
Benjamin Bedert,
George Cooper,
Thomas Oliver,
Pengcheng Zhang
Abstract:
We prove various converse theorems for automorphic forms on Γ_0(N), each assuming fewer twisted functional equations than the last. We show that no twisting at all is needed for holomorphic modular forms in the case that N is 18, 20, or 24 - these integers are the smallest multiples of 4 or 9 not covered by earlier work of Conrey-Farmer. This development is a consequence of finding generating sets for Γ_0(N) such that each generator can be written as a product of special matrices. As for real-analytic Maass forms of even (resp. odd) weight we prove the analogous statement for N=1,...,12,16,18 (resp. N=1,...,12,14,15,16,17,18,20,23,24).
Submitted 5 March, 2020;
originally announced March 2020.
-
Knot spectrum of turbulence
Authors:
R. G. Cooper,
M. Mesgarnezhad,
A. W. Baggaley,
C. F. Barenghi
Abstract:
Streamlines, vortex lines and magnetic flux tubes in turbulent fluids and plasmas display a great amount of coiling, twisting and linking, raising the question as to whether their topological complexity (continually created and destroyed by reconnections) can be quantified. In superfluid helium, the discrete (quantized) nature of vorticity can be exploited to associate to each vortex loop a knot invariant called the Alexander polynomial whose degree characterizes the topology of that vortex loop. By numerically simulating the dynamics of a tangle of quantum vortex lines, we find that this quantum turbulence always contains vortex knots of very large degree which keep forming, vanishing and reforming, creating a distribution of topologies which we quantify in terms of a knot spectrum and its scaling law. We also find results analogous to those in the wider literature, demonstrating that the knotting probability of the vortex tangle grows with the vortex length, as for macromolecules, and saturates above a characteristic length, as found for tumbled strings.
Submitted 8 July, 2019;
originally announced July 2019.
-
Studies of the Giant Dipole Resonance in $^{27}$Al, $^{40}$Ca, $^{56}$Fe, $^{58}$Ni and $^{208}$Pb with high energy-resolution inelastic proton scattering under 0$^\circ$
Authors:
M. Jingo,
E. Z. Buthelezi,
J. Carter,
G. R. J. Cooper,
R. W. Fearick,
S. V. Förtsch,
C. O. Kureba,
A. M. Krumbholz,
P. von Neumann-Cosel,
R. Neveling,
P. Papka,
I. Poltoratska,
V. Yu. Ponomarev,
A. Richter,
E. Sideras-Haddad,
F. D. Smit,
J. A. Swartz,
A. Tamii,
I. T. Usman
Abstract:
A survey of the fine structure of the Isovector Giant Dipole Resonance (IVGDR) was performed, using the recently commissioned zero-degree facility of the K600 magnetic spectrometer at iThemba LABS. Inelastic proton scattering at an incident energy of 200 MeV was measured on $^{27}$Al, $^{40}$Ca, $^{56}$Fe, $^{58}$Ni and $^{208}$Pb. A high energy resolution ($\Delta E \simeq 40$ keV FWHM) could be achieved after utilising faint-beam and dispersion-matching techniques. Considerable fine structure is observed in the energy region of the IVGDR and characteristic energy scales are extracted from the experimental data by means of a wavelet analysis. The comparison with Quasiparticle-Phonon Model (QPM) calculations provides insight into the relevance of different giant resonance decay mechanisms. Photoabsorption cross sections derived from the data assuming dominance of relativistic Coulomb excitation are in fair agreement with previous work using real photons.
Submitted 16 November, 2018; v1 submitted 7 August, 2018;
originally announced August 2018.
-
Obtaining Accurate Probabilistic Causal Inference by Post-Processing Calibration
Authors:
Fattaneh Jabbari,
Mahdi Pakdaman Naeini,
Gregory F. Cooper
Abstract:
Discovery of an accurate causal Bayesian network structure from observational data can be useful in many areas of science. Often the discoveries are made under uncertainty, which can be expressed as probabilities. To guide the use of such discoveries, including directing further investigation, it is important that those probabilities be well-calibrated. In this paper, we introduce a novel framework to derive calibrated probabilities of causal relationships from observational data. The framework consists of three components: (1) an approximate method for generating initial probability estimates of the edge types for each pair of variables, (2) the availability of a relatively small number of the causal relationships in the network for which the truth status is known, which we call a calibration training set, and (3) a calibration method for using the approximate probability estimates and the calibration training set to generate calibrated probabilities for the many remaining pairs of variables. We also introduce a new calibration method based on a shallow neural network. Our experiments on simulated data support that the proposed approach improves the calibration of causal edge predictions. The results also support that the approach often improves the precision and recall of predictions.
Submitted 22 December, 2017;
originally announced December 2017.
-
Wavelet signatures of $K$-splitting of the Isoscalar Giant Quadrupole Resonance in deformed nuclei from high-resolution (p,p$'$) scattering off $^{146,148,150}$Nd
Authors:
C. O. Kureba,
Z. Buthelezi,
J. Carter,
G. R. J. Cooper,
R. W. Fearick,
S. V. Förtsch,
M. Jingo,
W. Kleinig,
A. Krugmann,
A. M. Krumbholz,
J. Kvasil,
J. Mabiala,
J. P. Mira,
V. O. Nesterenko,
P. von Neumann-Cosel,
R. Neveling,
P. Papka,
P. -G. Reinhard,
A. Richter,
E. Sideras-Haddad,
F. D. Smit,
G. F. Steyn,
J. A. Swartz,
A. Tamii,
I. T. Usman
Abstract:
The phenomenon of fine structure of the Isoscalar Giant Quadrupole Resonance (ISGQR) has been studied with high energy-resolution proton inelastic scattering at iThemba LABS in the chain of stable even-mass Nd isotopes covering the transition from spherical to deformed ground states. A wavelet analysis of the background-subtracted spectra in the deformed 146,148,150Nd isotopes reveals characteristic scales in correspondence with scales obtained from a Skyrme RPA calculation using the SVmas10 parameterization. A semblance analysis shows that these scales arise from the energy shift between the main fragments of the K = 0, 1 and K = 2 components.
Submitted 4 March, 2018; v1 submitted 29 May, 2017;
originally announced May 2017.
-
Photofragmentation dynamics and dissociation energies of MoO and CrO
Authors:
Graham A. Cooper,
Alexander S. Gentleman,
Andreas Iskra,
Stuart R. Mackenzie
Abstract:
Neutral metal-containing molecules and clusters present a particular challenge to velocity map imaging techniques. Common methods of choice for producing such species, such as laser ablation or magnetron sputtering, typically generate a wide variety of metal-containing species and, without the possibility of mass-selection, even determining the identity of the dissociating moiety can be challenging. In recent years, we have developed a velocity map imaging spectrometer equipped with a laser ablation source explicitly for studying neutral metal-containing species. Here, we report the results of velocity map imaging photofragmentation studies of MoO and CrO. In both cases, dissociation at the two- and three-photon level leads to fragmentation into a range of product channels, some of which can be confidently assigned to particular Mo (Cr) and O atom quantum states. Analysis of the kinetic energy release spectra as a function of photon energy allows precise determination of the ground state dissociation energies of MoO and CrO.
Submitted 26 April, 2017;
originally announced April 2017.
-
Deformation dependence of the isovector giant dipole resonance: The neodymium isotopic chain revisited
Authors:
L. M. Donaldson,
C. A. Bertulani,
J. Carter,
V. O. Nesterenko,
P. von Neumann-Cosel,
R. Neveling,
P. -G. Reinhard,
I. T. Usman,
P. Adsley,
J. W. Brummer,
E. Z. Buthelezi,
G. R. J. Cooper,
R. W. Fearick,
S. V. Förtsch,
H. Fujita,
Y. Fujita,
M. Jingo,
W. Kleinig,
C. O. Kureba,
J. Kvasil,
M. Latif,
K. C. W. Li,
J. P. Mira,
F. Nemulodi,
P. Papka
, et al. (9 additional authors not shown)
Abstract:
Proton inelastic scattering experiments at energy E_p = 200 MeV and a spectrometer scattering angle of 0 degrees were performed on 144,146,148,150Nd and 152Sm exciting the IsoVector Giant Dipole Resonance (IVGDR). Comparison with results from photo-absorption experiments reveals a shift of resonance maxima towards higher energies for vibrational and transitional nuclei. The extracted photo-absorption cross sections in the most deformed nuclei, 150Nd and 152Sm, exhibit a pronounced asymmetry rather than a distinct double-hump structure expected as a signature of K-splitting. This behaviour can be related to the proximity of these nuclei to the critical point of the phase shape transition from vibrators to rotors with a soft quadrupole deformation potential. Self-consistent random-phase approximation (RPA) calculations using the SLy6 Skyrme force provide a relevant description of the IVGDR shapes deduced from the present data.
Submitted 3 November, 2017; v1 submitted 20 December, 2016;
originally announced December 2016.
-
Helicity and topology of a small region of quantum vorticity
Authors:
M. Mesgarnezhad,
R. G. Cooper,
A. W. Baggaley,
C. F. Barenghi
Abstract:
We numerically study the evolution of a small turbulent region of quantised vorticity in superfluid helium, a regime which can be realised in the laboratory. We show that the turbulence achieves a fluctuating steady-state in terms of dynamics (energy), geometry (length, writhing) and topology (linking). After defining the knot spectrum, we show that, at any instant, the turbulence consists of many unknots and few large loops of great geometrical and topological complexity.
Submitted 28 March, 2017; v1 submitted 28 October, 2016;
originally announced October 2016.
-
Binary Classifier Calibration using an Ensemble of Near Isotonic Regression Models
Authors:
Mahdi Pakdaman Naeini,
Gregory F. Cooper
Abstract:
Learning accurate probabilistic models from data is crucial in many practical tasks in data mining. In this paper we present a new non-parametric calibration method called \textit{ensemble of near isotonic regression} (ENIR). The method can be considered as an extension of BBQ, a recently proposed calibration method, as well as the commonly used calibration method based on isotonic regression. ENIR is designed to address the key limitation of isotonic regression which is the monotonicity assumption of the predictions. Similar to BBQ, the method post-processes the output of a binary classifier to obtain calibrated probabilities. Thus it can be combined with many existing classification models. We demonstrate the performance of ENIR on synthetic and real datasets for the commonly used binary classification models. Experimental results show that the method outperforms several common binary classifier calibration methods. In particular on the real data, ENIR commonly performs statistically significantly better than the other methods, and never worse. It is able to improve the calibration power of classifiers, while retaining their discrimination power. The method is also computationally tractable for large scale datasets, as it is $O(N \log N)$ time, where $N$ is the number of samples.
Submitted 16 November, 2015;
originally announced November 2015.
-
Dissociation energies of AgRG (RG = Ar, Kr, Xe) and AgO molecules from velocity map imaging studies
Authors:
Graham A. Cooper,
Aras Kartouzian,
Alexander S. Gentleman,
Andreas Iskra,
Robert van Wijk,
Stuart R. Mackenzie
Abstract:
The near ultraviolet photodissociation dynamics of silver atom rare gas dimers have been studied by velocity map imaging. AgRG (RG = Ar, Kr, Xe) species generated by laser ablation are excited in the region of the C <- X continuum leading to direct, near threshold dissociation generating Ag* (2P3/2) + RG (1S0) products. Images recorded at excitation wavelengths throughout the C <- X continuum, coupled with known atomic energy levels, permit determination of the ground X (2SIGMA+) state dissociation energies of 85.9 +/- 23.4 cm-1 (AgAr), 149.3 +/- 22.4 cm-1 (AgKr) and 256.3 +/- 16.0 cm-1 (AgXe). Three additional photolysis processes, each yielding Ag atom photoproducts, are observed in the same spectral region. Two of these are markedly enhanced in intensity upon seeding the molecular beam with nitrous oxide, and are assigned to photodissociation of AgO at the two photon level. These features yield an improved ground state dissociation energy for AgO of 15965 +/- 81 cm-1, which is in good agreement with high level calculations. The third process results in Ag atom fragments whose kinetic energy shows anomalously weak photon energy dependence and is assigned tentatively to dissociative ionization of the silver dimer Ag2.
Submitted 7 September, 2015;
originally announced September 2015.
-
Counting Markov Blanket Structures
Authors:
Shyam Visweswaran,
Gregory F. Cooper
Abstract:
Learning Markov blanket (MB) structures has proven useful in performing feature selection, learning Bayesian networks (BNs), and discovering causal relationships. We present a formula for efficiently determining the number of MB structures given a target variable and a set of other variables. As expected, the number of MB structures grows exponentially. However, we show quantitatively that there are many fewer MB structures that contain the target variable than there are BN structures that contain it. In particular, the ratio of BN structures to MB structures appears to increase exponentially in the number of variables.
Submitted 12 July, 2014; v1 submitted 9 July, 2014;
originally announced July 2014.
-
Binary Classifier Calibration: Non-parametric approach
Authors:
Mahdi Pakdaman Naeini,
Gregory F. Cooper,
Milos Hauskrecht
Abstract:
Accurate calibration of learned probabilistic predictive models is critical for many practical prediction and decision-making tasks. There are two main categories of methods for building calibrated classifiers. One approach is to develop methods for learning probabilistic models that are well calibrated ab initio. The other approach is to use post-processing methods that transform the output of a classifier to be well calibrated, for example histogram binning, Platt scaling, and isotonic regression. One advantage of the post-processing approach is that it can be applied to any existing probabilistic classification model that was constructed using any machine-learning method.
In this paper, we first introduce two measures for evaluating how well a classifier is calibrated. We prove three theorems showing that using a simple histogram binning post-processing method, it is possible to make a classifier be well calibrated while retaining its discrimination capability. Also, by casting the histogram binning method as a density-based non-parametric binary classifier, we can extend it using two simple non-parametric density estimation methods. We demonstrate the performance of the proposed calibration methods on synthetic and real datasets. Experimental results show that the proposed methods either outperform or are comparable to existing calibration methods.
Submitted 14 January, 2014;
originally announced January 2014.
-
Binary Classifier Calibration: Bayesian Non-Parametric Approach
Authors:
Mahdi Pakdaman Naeini,
Gregory F. Cooper,
Milos Hauskrecht
Abstract:
A set of probabilistic predictions is well calibrated if the events that are predicted to occur with probability p do in fact occur about p fraction of the time. Well calibrated predictions are particularly important when machine learning models are used in decision analysis. This paper presents two new non-parametric methods for calibrating outputs of binary classification models: a method based on the Bayes optimal selection and a method based on the Bayesian model averaging. The advantage of these methods is that they are independent of the algorithm used to learn a predictive model, and they can be applied in a post-processing step, after the model is learned. This makes them applicable to a wide variety of machine learning models and methods. These calibration methods, as well as other methods, are tested on a variety of datasets in terms of both discrimination and calibration performance. The results show the methods either outperform or are comparable in performance to the state-of-the-art calibration methods.
Submitted 13 January, 2014;
originally announced January 2014.
-
Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (1998)
Authors:
Gregory Cooper,
Serafin Moral
Abstract:
This is the Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, which was held in Madison, WI, July 24-26, 1998
Submitted 28 August, 2014; v1 submitted 13 April, 2013;
originally announced April 2013.
-
An Algorithm for Computing Probabilistic Propositions
Authors:
Gregory F. Cooper
Abstract:
A method for computing probabilistic propositions is presented. It assumes the availability of a single external routine for computing the probability of one instantiated variable, given a conjunction of other instantiated variables. In particular, the method allows belief network algorithms to calculate general probabilistic propositions over nodes in the network. Although in the worst case the time complexity of the method is exponential in the size of a query, it is polynomial in the size of a number of common types of queries.
Submitted 27 March, 2013;
originally announced April 2013.
-
Stochastic Simulation of Bayesian Belief Networks
Authors:
Homer L. Chin,
Gregory F. Cooper
Abstract:
This paper examines Bayesian belief network inference using simulation as a method for computing the posterior probabilities of network variables. Specifically, it examines the use of a method described by Henrion, called logic sampling, and a method described by Pearl, called stochastic simulation. We first review the conditions under which logic sampling is computationally infeasible. Such cases motivated the development of Pearl's stochastic simulation algorithm. We have found that this stochastic simulation algorithm, when applied to certain networks, leads to much slower than expected convergence to the true posterior probabilities. This behavior is a result of the tendency for local areas in the network to become fixed through many simulation cycles. The time required to obtain significant convergence can be made arbitrarily long by strengthening the probabilistic dependency between nodes. We propose the use of several forms of graph modification, such as graph pruning, arc reversal, and node reduction, in order to convert some networks into formats that are computationally more efficient for simulation.
Submitted 27 March, 2013;
originally announced April 2013.
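Logic sampling, as referred to in the abstract above, can be sketched as follows: sample every variable forward from its parents, discard samples that contradict the evidence, and estimate the posterior from the samples that remain. The two-node network (Rain -> WetGrass) and all probabilities below are hypothetical values chosen for illustration only; they do not come from the paper.

```python
import random

# Hypothetical network: Rain -> WetGrass (illustrative numbers only).
P_RAIN = 0.2
P_WET_GIVEN_RAIN = {True: 0.9, False: 0.1}


def sample_network(rng):
    """Draw one joint sample, visiting parents before children."""
    rain = rng.random() < P_RAIN
    wet = rng.random() < P_WET_GIVEN_RAIN[rain]
    return {"rain": rain, "wet": wet}


def logic_sampling(query, evidence, n_samples, seed=0):
    """Estimate P(query = True | evidence) by rejecting inconsistent samples.

    Returns the estimate and the number of samples that were kept.
    """
    rng = random.Random(seed)
    kept = 0
    hits = 0
    for _ in range(n_samples):
        sample = sample_network(rng)
        if any(sample[var] != value for var, value in evidence.items()):
            continue  # sample contradicts the evidence, so it is discarded
        kept += 1
        hits += sample[query]
    estimate = hits / kept if kept else float("nan")
    return estimate, kept


estimate, kept = logic_sampling("rain", {"wet": True}, n_samples=20000)
exact = (0.2 * 0.9) / (0.2 * 0.9 + 0.8 * 0.1)  # Bayes' rule, for comparison
```

The `kept` count shows why the method can become impractical: only samples that happen to match the evidence contribute, so the less probable the evidence, the more samples are thrown away.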
-
Updating Probabilities in Multiply-Connected Belief Networks
Authors:
Jaap Suermondt,
Gregory F. Cooper
Abstract:
This paper focuses on probability updates in multiply-connected belief networks. Pearl has designed the method of conditioning, which enables us to apply his algorithm for belief updates in singly-connected networks to multiply-connected belief networks by selecting a loop-cutset for the network and instantiating these loop-cutset nodes. We discuss conditions that need to be satisfied by the selected nodes. We present a heuristic algorithm for finding a loop-cutset that satisfies these conditions.
Submitted 27 March, 2013;
originally announced April 2013.
-
A Method for Using Belief Networks as Influence Diagrams
Authors:
Gregory F. Cooper
Abstract:
This paper demonstrates a method for using belief-network algorithms to solve influence diagram problems. In particular, both exact and approximation belief-network algorithms may be applied to solve influence-diagram problems. More generally, knowing the relationship between belief-network and influence-diagram problems may be useful in the design and development of more efficient influence diagram algorithms.
Submitted 27 March, 2013;
originally announced April 2013.
-
KNET: Integrating Hypermedia and Bayesian Modeling
Authors:
R. Martin Chavez,
Gregory F. Cooper
Abstract:
KNET is a general-purpose shell for constructing expert systems based on belief networks and decision networks. Such networks serve as graphical representations for decision models, in which the knowledge engineer must define clearly the alternatives, states, preferences, and relationships that constitute a decision basis. KNET contains a knowledge-engineering core written in Object Pascal and an interface that tightly integrates HyperCard, a hypertext authoring tool for the Apple Macintosh computer, into a novel expert-system architecture. Hypertext and hypermedia have become increasingly important in the storage, management, and retrieval of information. In broad terms, hypermedia deliver heterogeneous bits of information in dynamic, extensively cross-referenced packages. The resulting KNET system features a coherent probabilistic scheme for managing uncertainty, an object-oriented graphics editor for drawing and manipulating decision networks, and HyperCard's potential for quickly constructing flexible and friendly user interfaces. We envision KNET as a useful prototyping tool for our ongoing research on a variety of Bayesian reasoning problems, including tractable representation, inference, and explanation.
Submitted 27 March, 2013;
originally announced April 2013.
-
Bounded Conditioning: Flexible Inference for Decisions under Scarce Resources
Authors:
Eric J. Horvitz,
Jaap Suermondt,
Gregory F. Cooper
Abstract:
We introduce a graceful approach to probabilistic inference called bounded conditioning. Bounded conditioning monotonically refines the bounds on posterior probabilities in a belief network with computation, and converges on final probabilities of interest with the allocation of a complete resource fraction. The approach allows a reasoner to exchange arbitrary quantities of computational resource for incremental gains in inference quality. As such, bounded conditioning holds promise as a useful inference technique for reasoning under the general conditions of uncertain and varying reasoning resources. The algorithm solves a probabilistic bounding problem in complex belief networks by breaking the problem into a set of mutually exclusive, tractable subproblems and ordering their solution by the expected effect that each subproblem will have on the final answer. We introduce the algorithm, discuss its characterization, and present its performance on several belief networks, including a complex model for reasoning about problems in intensive-care medicine.
Submitted 27 March, 2013;
originally announced April 2013.
-
An Empirical Evaluation of a Randomized Algorithm for Probabilistic Inference
Authors:
R. Martin Chavez,
Gregory F. Cooper
Abstract:
In recent years, researchers in decision analysis and artificial intelligence (AI) have used Bayesian belief networks to build models of expert opinion. Using standard methods drawn from the theory of computational complexity, workers in the field have shown that the problem of probabilistic inference in belief networks is difficult and almost certainly intractable. KNET, a software environment for constructing knowledge-based systems within the axiomatic framework of decision theory, contains a randomized approximation scheme for probabilistic inference. The algorithm can, in many circumstances, perform efficient approximate inference in large and richly interconnected models of medical diagnosis. Unlike previously described stochastic algorithms for probabilistic inference, the randomized approximation scheme computes a priori bounds on running time by analyzing the structure and contents of the belief network. In this article, we describe a randomized algorithm for probabilistic inference and analyze its performance mathematically. Then, we devote the major portion of the paper to a discussion of the algorithm's empirical behavior. The results indicate that the generation of good trials (that is, trials whose distribution closely matches the true distribution), rather than the computation of numerous mediocre trials, dominates the performance of stochastic simulation. Key words: probabilistic inference, belief networks, stochastic simulation, computational complexity theory, randomized algorithms.
Submitted 27 March, 2013;
originally announced April 2013.
-
An Empirical Analysis of Likelihood-Weighting Simulation on a Large, Multiply-Connected Belief Network
Authors:
Michael Shwe,
Gregory F. Cooper
Abstract:
We analyzed the convergence properties of likelihood-weighting algorithms on a two-level, multiply connected, belief-network representation of the QMR knowledge base of internal medicine. Specifically, on two difficult diagnostic cases, we examined the effects of Markov blanket scoring and importance sampling, demonstrating that Markov blanket scoring and self-importance sampling significantly improve the convergence of the simulation on our model.
Submitted 27 March, 2013;
originally announced April 2013.
-
A Combination of Cutset Conditioning with Clique-Tree Propagation in the Pathfinder System
Authors:
Jaap Suermondt,
Gregory F. Cooper,
David Heckerman
Abstract:
Cutset conditioning and clique-tree propagation are two popular methods for performing exact probabilistic inference in Bayesian belief networks. Cutset conditioning is based on decomposition of a subset of network nodes, whereas clique-tree propagation depends on aggregation of nodes. We describe a means to combine cutset conditioning and clique-tree propagation in an approach called aggregation after decomposition (AD). We discuss the application of the AD method in the Pathfinder system, a medical expert system that offers assistance with diagnosis in hematopathology.
Submitted 27 March, 2013;
originally announced April 2013.
-
A Randomized Approximation Algorithm of Logic Sampling
Authors:
R. Martin Chavez,
Gregory F. Cooper
Abstract:
In recent years, researchers in decision analysis and artificial intelligence (AI) have used Bayesian belief networks to build models of expert opinion. Using standard methods drawn from the theory of computational complexity, workers in the field have shown that the problem of exact probabilistic inference on belief networks almost certainly requires exponential computation in the worst case [3]. We have previously described a randomized approximation scheme, called BN-RAS, for computation on belief networks [1, 2, 4]. We gave precise analytic bounds on the convergence of BN-RAS and showed how to trade running time for accuracy in the evaluation of posterior marginal probabilities. We now extend our previous results and demonstrate the generality of our framework by applying similar mathematical techniques to the analysis of convergence for logic sampling [7], an alternative simulation algorithm for probabilistic inference.
Submitted 27 March, 2013;
originally announced April 2013.
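For readers unfamiliar with logic sampling, the following is a minimal sketch of the general idea: forward-sample every node of the network, discard samples that contradict the evidence, and estimate the posterior from what remains. The two-node network (Rain -> WetGrass), its probabilities, and all function names are hypothetical illustrations and are not taken from the paper.

```python
import random

# Hypothetical two-node network: Rain -> WetGrass (numbers are illustrative only).
P_RAIN = 0.2
P_WET_GIVEN_RAIN = {True: 0.9, False: 0.1}

def sample_once(rng):
    """Forward-sample every node in topological order."""
    rain = rng.random() < P_RAIN
    wet = rng.random() < P_WET_GIVEN_RAIN[rain]
    return rain, wet

def logic_sampling(n_samples, seed=0):
    """Estimate P(Rain=True | WetGrass=True) by rejecting evidence-inconsistent samples."""
    rng = random.Random(seed)
    kept = hits = 0
    for _ in range(n_samples):
        rain, wet = sample_once(rng)
        if not wet:          # sample contradicts the evidence, so discard it
            continue
        kept += 1
        hits += rain
    return hits / kept if kept else float("nan")

estimate = logic_sampling(200_000)
exact = (0.2 * 0.9) / (0.2 * 0.9 + 0.8 * 0.1)  # Bayes' rule, for comparison
```

More samples tighten the estimate at the cost of running time, which is the kind of trade-off the abstract's convergence analysis addresses.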
-
Kutato: An Entropy-Driven System for Construction of Probabilistic Expert Systems from Databases
Authors:
Edward H. Herskovits,
Gregory F. Cooper
Abstract:
Kutato is a system that takes as input a database of cases and produces a belief network that captures many of the dependence relations represented by those data. This system incorporates a module for determining the entropy of a belief network and a module for constructing belief networks based on entropy calculations. Kutato constructs an initial belief network in which all variables in the database are assumed to be marginally independent. The entropy of this belief network is calculated, and the arc that minimizes the entropy of the resulting belief network is added. Conditional probabilities for an arc are obtained directly from the database. This process continues until an entropy-based threshold is reached. We have tested the system by generating databases from networks using the probabilistic logic-sampling method, and then using those databases as input to Kutato. The system consistently reproduces the original belief networks with high fidelity.
Submitted 27 March, 2013;
originally announced April 2013.
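The greedy loop described in the abstract above can be sketched roughly as follows. This is not Kutato's implementation: the network entropy is approximated here as the sum of empirical conditional entropies, acyclicity is enforced by an assumed variable ordering, and the stopping rule `min_gain`, the toy data, and all names are hypothetical.

```python
import math
from collections import Counter

def cond_entropy(data, child, parents):
    """Empirical conditional entropy H(child | parents) in bits."""
    joint = Counter(tuple(row[p] for p in parents) + (row[child],) for row in data)
    marg = Counter(tuple(row[p] for p in parents) for row in data)
    n = len(data)
    return -sum(c / n * math.log2(c / marg[key[:-1]]) for key, c in joint.items())

def network_entropy(data, parents):
    """Entropy of the network, taken as the sum of per-node conditional entropies."""
    return sum(cond_entropy(data, v, ps) for v, ps in parents.items())

def greedy_entropy_search(data, order, min_gain=0.05):
    """Start with no arcs; repeatedly add the arc that yields the lowest entropy."""
    parents = {v: [] for v in order}
    current = network_entropy(data, parents)
    while True:
        best = None
        for i, child in enumerate(order):
            for cand in order[:i]:          # assumed ordering keeps the graph acyclic
                if cand in parents[child]:
                    continue
                trial = dict(parents)
                trial[child] = parents[child] + [cand]
                h = network_entropy(data, trial)
                if best is None or h < best[0]:
                    best = (h, cand, child)
        # Stop when no arc is left or the entropy reduction falls below the threshold.
        if best is None or current - best[0] < min_gain:
            return parents
        current = best[0]
        parents[best[2]].append(best[1])

# Toy database: B copies A, C is independent of both.
rows = [(0, 0, 0), (0, 0, 1), (1, 1, 0), (1, 1, 1)] * 25
data = [dict(zip("ABC", r)) for r in rows]
learned = greedy_entropy_search(data, ["A", "B", "C"])
```

On this toy database only the arc A -> B reduces the entropy, so it is the only arc added before the threshold stops the search.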
-
A Bayesian Method for Constructing Bayesian Belief Networks from Databases
Authors:
Gregory F. Cooper,
Edward H. Herskovits
Abstract:
This paper presents a Bayesian method for constructing Bayesian belief networks from a database of cases. Potential applications include computer-assisted hypothesis testing, automated scientific discovery, and automated construction of probabilistic expert systems. Results are presented of a preliminary evaluation of an algorithm for constructing a belief network from a database of cases. We relate the methods in this paper to previous work, and we discuss open problems.
Submitted 20 March, 2013;
originally announced March 2013.
-
An Evaluation of an Algorithm for Inductive Learning of Bayesian Belief Networks Usin
Authors:
Constantin F. Aliferis,
Gregory F. Cooper
Abstract:
Bayesian learning of belief networks (BLN) is a method for automatically constructing belief networks (BNs) from data using search and Bayesian scoring techniques. K2 is a particular instantiation of the method that implements a greedy search strategy. To evaluate the accuracy of K2, we randomly generated a number of BNs and for each of those we simulated data sets. K2 was then used to induce the generating BNs from the simulated data. We examine the performance of the program, and the factors that influence it. We also present a simple BN model, developed from our results, which predicts the accuracy of K2, when given various characteristics of the data set.
Submitted 27 February, 2013;
originally announced February 2013.
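As an illustration of the kind of greedy search with Bayesian scoring that the abstract attributes to K2, here is a minimal sketch of parent selection for a single node. It uses the log of the K2 (Cooper-Herskovits) metric with uniform parameter priors; the function names, the `max_parents` limit, and the toy data are assumptions made for this example and do not reproduce the evaluation in the paper.

```python
import math
from collections import Counter

def k2_log_score(data, child, parents, arity):
    """Log K2 metric for one node: sum over parent configurations j of
    lgamma(r) - lgamma(N_ij + r) + sum_k lgamma(N_ijk + 1)."""
    counts = Counter((tuple(row[p] for p in parents), row[child]) for row in data)
    totals = Counter(tuple(row[p] for p in parents) for row in data)
    r = arity[child]
    # Unobserved parent configurations contribute zero, so only observed ones are summed.
    score = sum(math.lgamma(r) - math.lgamma(n_ij + r) for n_ij in totals.values())
    score += sum(math.lgamma(n_ijk + 1) for n_ijk in counts.values())
    return score

def k2_parents(data, child, predecessors, arity, max_parents=2):
    """Greedily add the predecessor that most improves the score; stop when none does."""
    parents = []
    best = k2_log_score(data, child, parents, arity)
    while len(parents) < max_parents:
        scored = [(k2_log_score(data, child, parents + [z], arity), z)
                  for z in predecessors if z not in parents]
        if not scored or max(scored)[0] <= best:
            break
        best, z = max(scored)
        parents.append(z)
    return parents

# Toy data: B copies A, C is independent of both.
rows = [(0, 0, 0), (0, 0, 1), (1, 1, 0), (1, 1, 1)] * 25
data = [dict(zip("ABC", r)) for r in rows]
arity = {"A": 2, "B": 2, "C": 2}
parents_b = k2_parents(data, "B", ["A"], arity)
parents_c = k2_parents(data, "C", ["A", "B"], arity)
```

The predecessor lists encode an assumed node ordering, which is what keeps the greedy search from creating cycles.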
-
A Structurally and Temporally Extended Bayesian Belief Network Model: Definitions, Properties, and Modeling Techniques
Authors:
Constantin F. Aliferis,
Gregory F. Cooper
Abstract:
We developed the language of Modifiable Temporal Belief Networks (MTBNs) as a structural and temporal extension of Bayesian Belief Networks (BNs) to facilitate normative temporal and causal modeling under uncertainty. In this paper we present definitions of the model, its components, and its fundamental properties. We also discuss how to represent various types of temporal knowledge, with an emphasis on hybrid temporal-explicit time modeling, dynamic structures, avoiding causal temporal inconsistencies, and dealing with models that simultaneously involve actions (decisions) and causal and non-causal associations. We examine the relationships among BNs, Modifiable Belief Networks, and MTBNs with a single temporal granularity, and suggest areas of application suitable to each one of them.
Submitted 13 February, 2013;
originally announced February 2013.
-
A Multivariate Discretization Method for Learning Bayesian Networks from Mixed Data
Authors:
Stefano Monti,
Gregory F. Cooper
Abstract:
In this paper we address the problem of discretization in the context of learning Bayesian networks (BNs) from data containing both continuous and discrete variables. We describe a new technique for multivariate discretization, whereby each continuous variable is discretized while taking into account its interaction with the other variables. The technique is based on the use of a Bayesian scoring metric that scores the discretization policy for a continuous variable given a BN structure and the observed data. Since the metric is relative to the BN structure currently being evaluated, the discretization of a variable needs to be dynamically adjusted as the BN structure changes.
Submitted 30 January, 2013;
originally announced January 2013.
-
A Bayesian Network Classifier that Combines a Finite Mixture Model and a Naive Bayes Model
Authors:
Stefano Monti,
Gregory F. Cooper
Abstract:
In this paper we present a new Bayesian network model for classification that combines the naive-Bayes (NB) classifier and the finite-mixture (FM) classifier. The resulting classifier aims at relaxing the strong assumptions on which the two component models are based, in an attempt to improve on their classification performance, both in terms of accuracy and in terms of calibration of the estimated probabilities. The proposed classifier is obtained by superimposing a finite mixture model on the set of feature variables of a naive Bayes model. We present experimental results that compare the predictive performance on real datasets of the new classifier with the predictive performance of the NB classifier and the FM classifier.
Submitted 23 January, 2013;
originally announced January 2013.
-
Causal Discovery from a Mixture of Experimental and Observational Data
Authors:
Gregory F. Cooper,
Changwon Yoo
Abstract:
This paper describes a Bayesian method for combining an arbitrary mixture of observational and experimental data in order to learn causal Bayesian networks. Observational data are passively observed. Experimental data, such as those produced by randomized controlled trials, result from the experimenter manipulating one or more variables (typically randomly) and observing the states of other variables. The paper presents a Bayesian method for learning the causal structure and parameters of the underlying causal process that is generating the data, given that (1) the data contain a mixture of observational and experimental case records, and (2) the causal process is modeled as a causal Bayesian network. This learning method was applied using as input various mixtures of experimental and observational data that were generated from the ALARM causal Bayesian network. In these experiments, the absolute and relative quantities of experimental and observational data were varied systematically. For each of these training datasets, the learning method was applied to predict the causal structure and to estimate the causal parameters that exist among randomly selected pairs of nodes in ALARM that are not confounded. The paper reports how these structure predictions and parameter estimates compare with the true causal structures and parameters as given by the ALARM network.
Submitted 23 January, 2013;
originally announced January 2013.
-
A Bayesian Method for Causal Modeling and Discovery Under Selection
Authors:
Gregory F. Cooper
Abstract:
This paper describes a Bayesian method for learning causal networks using samples that were selected in a non-random manner from a population of interest. Examples of data obtained by non-random sampling include convenience samples and case-control data in which a fixed number of samples with and without some condition is collected; such data are not uncommon. The paper describes a method for combining data under selection with prior beliefs in order to derive a posterior probability for a model of the causal processes that are generating the data in the population of interest. The priors include beliefs about the nature of the non-random sampling procedure. Although exact application of the method would be computationally intractable for most realistic datasets, efficient special-case and approximation methods are discussed. Finally, the paper describes how to combine learning under selection with previous methods for learning from observational and experimental data that are obtained on random samples of the population of interest. The net result is a Bayesian methodology that supports causal modeling and discovery from a rich mixture of different types of data.
Submitted 16 January, 2013;
originally announced January 2013.
-
A Bayesian Network Scoring Metric That Is Based On Globally Uniform Parameter Priors
Authors:
Mehmet Kayaalp,
Gregory F. Cooper
Abstract:
We introduce a new Bayesian network (BN) scoring metric called the Global Uniform (GU) metric. This metric is based on a particular type of default parameter prior. Such priors may be useful when a BN developer is not willing or able to specify domain-specific parameter priors. The GU parameter prior specifies that every prior joint probability distribution P consistent with a BN structure S is considered to be equally likely. Distribution P is consistent with S if P includes just the set of independence relations defined by S. We show that the GU metric addresses some undesirable behavior of the BDeu and K2 Bayesian network scoring metrics, which also use particular forms of default parameter priors. A closed-form formula for computing GU for special classes of BNs is derived. Efficiently computing GU for an arbitrary BN remains an open problem.
Submitted 12 December, 2012;
originally announced January 2013.
-
Bayesian Biosurveillance of Disease Outbreaks
Authors:
Gregory F. Cooper,
Denver Dash,
John Levander,
Weng-Keen Wong,
William Hogan,
Michael Wagner
Abstract:
Early, reliable detection of disease outbreaks is a critical problem today. This paper reports an investigation of the use of causal Bayesian networks to model spatio-temporal patterns of a non-contagious disease (respiratory anthrax infection) in a population of people. The number of parameters in such a network can become enormous, if not carefully managed. Also, inference needs to be performed in real time as population data stream in. We describe techniques we have applied to address both the modeling and inference challenges. A key contribution of this paper is the explication of assumptions and techniques that are sufficient to allow the scaling of Bayesian network modeling and inference to millions of nodes for real-time surveillance applications. The results reported here provide a proof-of-concept that Bayesian networks can serve as the foundation of a system that effectively performs Bayesian biosurveillance of disease outbreaks.
Submitted 11 July, 2012;
originally announced July 2012.
-
A theoretical study of Y structures for causal discovery
Authors:
Subramani Mani,
Peter L. Spirtes,
Gregory F. Cooper
Abstract:
There are several existing algorithms that, under appropriate assumptions, can reliably identify a subset of the underlying causal relationships from observational data. This paper introduces the first computationally feasible score-based algorithm that can reliably identify causal relationships in the large sample limit for discrete models, while allowing for the possibility that there are unobserved common causes. In doing so, the algorithm does not ever need to assign scores to causal structures with unobserved common causes. The algorithm is based on the identification of so-called Y substructures within Bayesian network structures that can be learned from observational data. An example of a Y substructure is A -> C, B -> C, C -> D. After providing background on causal discovery, the paper proves the conditions under which the algorithm is reliable in the large sample limit.
Submitted 27 June, 2012;
originally announced June 2012.
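To make the Y substructure example A -> C, B -> C, C -> D concrete, the sketch below scans a directed graph for that pattern. It assumes, beyond what the abstract states, that the two parents must not be adjacent to each other or to the final node; the function name and edge-list representation are hypothetical, and the sketch covers only pattern matching, not the score-based learning or the reliability proof.

```python
from itertools import combinations

def find_y_substructures(edges):
    """Return tuples (a, b, c, d) such that a -> c, b -> c, c -> d.

    Assumption of this sketch: a and b are non-adjacent, and neither
    is adjacent to d.
    """
    edges = set(edges)
    nodes = {n for e in edges for n in e}

    def adjacent(u, v):
        return (u, v) in edges or (v, u) in edges

    parents = {n: sorted(u for u, v in edges if v == n) for n in nodes}
    children = {n: sorted(v for u, v in edges if u == n) for n in nodes}

    found = []
    for c in sorted(nodes):
        for a, b in combinations(parents[c], 2):
            if adjacent(a, b):
                continue
            for d in children[c]:
                if not adjacent(a, d) and not adjacent(b, d):
                    found.append((a, b, c, d))
    return found

ys = find_y_substructures([("A", "C"), ("B", "C"), ("C", "D")])
no_ys = find_y_substructures([("A", "C"), ("B", "C"), ("C", "D"), ("A", "B")])
```

Adding the arc A -> B in the second call makes the two parents adjacent, so the pattern no longer qualifies under the assumed conditions.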
-
Level density of 2+ states in 40Ca from high energy-resolution (p,p') experiments
Authors:
I. Usman,
Z. Buthelezi,
J. Carter,
G. R. J. Cooper,
R. W. Fearick,
S. V. Förtsch,
H. Fujita,
Y. Kalmykov,
P. von Neumann-Cosel,
R. Neveling,
I. Poltoratska,
A. Richter,
A. Shevchenko,
E. Sideras-Haddad,
F. D. Smit,
J. Wambach
Abstract:
The level density of 2+ states in 40Ca has been extracted in the energy region of the isoscalar giant quadrupole resonance (ISGQR) from a fluctuation analysis of high energy-resolution (p,p') data taken at incident energies of 200 MeV at the K600 magnetic spectrometer of iThemba LABS, South Africa. Quasi-free scattering cross sections were calculated to estimate their role as a background contribution to the spectra and were found to be small. The shape of the background was determined from the discrete wavelet transform of the spectra using a biorthogonal wavelet function normalized at the lowest particle separation threshold. The experimental results are compared to widely used phenomenological and microscopic models.
Submitted 9 November, 2011; v1 submitted 7 October, 2011;
originally announced October 2011.