-
Chaos in undamped, forced oscillators via stroboscopic maps
Authors:
Ronaldo S. S. Vieira,
Luiz H. R. Daniel,
Marcus A. M. de Aguiar
Abstract:
Non-linear dynamics is not usually covered in undergraduate physics courses. However, its importance within classical mechanics and the general theory of dynamical systems is unquestionable. In this work we show that this subject can be included in the schedule of an introductory classical mechanics course without the need to develop a robust theory of chaotic dynamics. To do this, we take as examples conservative non-linear oscillators subject to time-dependent periodic forces. By introducing the concept of stroboscopic maps we show that it is possible to visualize the appearance of chaos in these systems. We also address the example of the forced simple pendulum, applying the same treatment. Finally, we briefly comment on the more general theory of chaos in conservative Hamiltonian systems.
Submitted 29 June, 2024;
originally announced July 2024.
-
Bouncing cosmologies in the presence of a Dirac-Born-Infeld field
Authors:
Mariam Campbell,
Richard Daniel,
Peter K. S. Dunsby,
Carsten van de Bruck
Abstract:
We perform a detailed dynamical system analysis for the behaviour of a Dirac-Born-Infeld (DBI) field in a spatially closed Friedmann-Lemaître-Robertson-Walker (FLRW) cosmology. The DBI field is characterised by a potential and brane tension. We study power-law or exponential functions for the potential and tension. We find that in a spatially closed FLRW cosmology, a DBI field in the ultra-relativistic limit allows for a broader range of initial conditions resulting in a bouncing universe than in the non-relativistic limit. We further note that the range of initial conditions allowing for a bounce is larger if we consider power-law functions for the potential and tension, compared to the exponential case. Our dynamical analysis shows that a DBI field does not exhibit stable cyclical behaviour, including the case in which a negative cosmological constant is present.
Submitted 9 May, 2024;
originally announced May 2024.
-
Hyperfine structure of the $\mathbf{A^{1}Π}$ state of AlCl and its relevance to laser cooling and trapping
Authors:
J. R. Daniel,
J. C. Shaw,
C. Wang,
L. -R. Liu,
B. K. Kendrick,
B. Hemmerling,
D. J. McCarron
Abstract:
The majority of molecules proposed for laser cooling and trapping experiments have $Σ$-type ground states. Specifically, $^2Σ$ states have cycling transitions analogous to D1-lines in alkali-metal atoms while $^1Σ$ states offer both strong and weak cycling transitions analogous to those in alkaline-earth atoms. Despite this proposed variety, to date, only molecules with $^2Σ$-type ground states have successfully been confined and cooled in magneto-optical traps. While none of the proposed $^1Σ$-type molecules have been successfully laser cooled and trapped, they are expected to have various advantages in terms of exhibiting a lower chemical reactivity and an internal structure that benefits the cooling schemes. Here, we present the prospects and strategies for optical cycling in AlCl -- a $^1Σ$ molecule -- and report on the characterization of the $A^{1}Π$ state hyperfine structure. Based on these results, we carry out detailed simulations on the expected capture velocity of a magneto-optical trap for AlCl. Finally, using {\it ab initio} calculations, we identify the photodissociation via a $3^1Π$ state and photoionization process via the $3^1Σ^+$ state as possible loss mechanisms for a magneto-optical trap of AlCl.
Submitted 19 December, 2023; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Forecasts on interacting dark energy with standard sirens
Authors:
Elsa M. Teixeira,
Richard Daniel,
Noemi Frusciante,
Carsten van de Bruck
Abstract:
We present the predictions with standard sirens at gravitational-wave detectors, such as the Laser Interferometer Space Antenna (LISA) and the Einstein Telescope (ET), for interacting dark energy theories. We focus on four models characterised by couplings between the dark energy field and the dark matter fluid arising from conformal or disformal transformations of the metric, along with an exponential self-interacting potential. To this end we construct mock catalogues and perform a Markov Chain Monte Carlo analysis by considering ET and LISA standard sirens, and also their combination with Baryon Acoustic Oscillations (BAO) and Supernovae Ia (SNIa) data. We find that in all four models considered, the accuracy on the $H_0$ parameter increases by one order of magnitude at 1$σ$ when compared to the SNIa+BAO data set, possibly shedding light in the future on the origin of the $H_0$-tension. The combination of standard sirens with SNIa+BAO improves the accuracy on some coupling and exponential parameters, hinting at future prospects for constraining interactions in the dark sector.
Submitted 1 November, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Estimating hypothetical estimands with causal inference and missing data estimators in a diabetes trial
Authors:
Camila Olarte Parra,
Rhian M. Daniel,
David Wright,
Jonathan W. Bartlett
Abstract:
The recently published ICH E9 addendum on estimands in clinical trials provides a framework for precisely defining the treatment effect that is to be estimated, but says little about estimation methods. Here we report analyses of a clinical trial in type 2 diabetes, targeting the effects of randomised treatment, handling rescue treatment and discontinuation of randomised treatment using the so-called hypothetical strategy. We show how this can be estimated using mixed models for repeated measures, multiple imputation, inverse probability of treatment weighting, G-formula and G-estimation. We describe their assumptions and practical details of their implementation using packages in R. We report the results of these analyses, broadly finding similar estimates and standard errors across the estimators. We discuss various considerations relevant when choosing an estimation approach, including computational time, how to handle missing data, whether to include post intercurrent event data in the analysis, whether and how to adjust for additional time-varying confounders, and whether and how to model different types of ICE separately.
Submitted 22 September, 2023; v1 submitted 24 August, 2023;
originally announced August 2023.
-
A Race Track Trapped-Ion Quantum Processor
Authors:
S. A. Moses,
C. H. Baldwin,
M. S. Allman,
R. Ancona,
L. Ascarrunz,
C. Barnes,
J. Bartolotta,
B. Bjork,
P. Blanchard,
M. Bohn,
J. G. Bohnet,
N. C. Brown,
N. Q. Burdick,
W. C. Burton,
S. L. Campbell,
J. P. Campora III,
C. Carron,
J. Chambers,
J. W. Chan,
Y. H. Chen,
A. Chernoguzov,
E. Chertkov,
J. Colina,
J. P. Curtis,
R. Daniel, et al. (71 additional authors not shown)
Abstract:
We describe and benchmark a new quantum charge-coupled device (QCCD) trapped-ion quantum computer based on a linear trap with periodic boundary conditions, which resembles a race track. The new system successfully incorporates several technologies crucial to future scalability, including electrode broadcasting, multi-layer RF routing, and magneto-optical trap (MOT) loading, while maintaining, and in some cases exceeding, the gate fidelities of previous QCCD systems. The system is initially operated with 32 qubits, but future upgrades will allow for more. We benchmark the performance of primitive operations, including an average state preparation and measurement error of 1.6(1)$\times 10^{-3}$, an average single-qubit gate infidelity of $2.5(3)\times 10^{-5}$, and an average two-qubit gate infidelity of $1.84(5)\times 10^{-3}$. The system-level performance of the quantum processor is assessed with mirror benchmarking, linear cross-entropy benchmarking, a quantum volume measurement of $\mathrm{QV}=2^{16}$, and the creation of 32-qubit entanglement in a GHZ state. We also tested application benchmarks including Hamiltonian simulation, QAOA, error correction on a repetition code, and dynamics simulations using qubit reuse. We also discuss future upgrades to the new system aimed at adding more qubits and capabilities.
Submitted 16 May, 2023; v1 submitted 5 May, 2023;
originally announced May 2023.
-
G-formula for causal inference via multiple imputation
Authors:
Jonathan W. Bartlett,
Camila Olarte Parra,
Emily Granger,
Ruth H. Keogh,
Erik W. van Zwet,
Rhian M. Daniel
Abstract:
G-formula is a popular approach for estimating treatment or exposure effects from longitudinal data that are subject to time-varying confounding. G-formula estimation is typically performed by Monte-Carlo simulation, with non-parametric bootstrapping used for inference. We show that G-formula can be implemented by exploiting existing methods for multiple imputation (MI) for synthetic data. This involves using an existing modified version of Rubin's variance estimator. In practice missing data is ubiquitous in longitudinal datasets. We show that such missing data can be readily accommodated as part of the MI procedure when using G-formula, and describe how MI software can be used to implement the approach. We explore its performance using a simulation study and an application from cystic fibrosis.
Submitted 11 October, 2023; v1 submitted 27 January, 2023;
originally announced January 2023.
-
Wildfire Smoke Detection with Computer Vision
Authors:
Eldan R. Daniel
Abstract:
Wildfires are becoming more frequent and their effects more devastating every day. Climate change has directly and indirectly affected their occurrence, and social phenomena have increased the vulnerability of people. Consequently, and given that wildfires are inevitable, it is important to have early warning systems that allow a timely and effective response. Artificial intelligence, machine learning and Computer Vision offer an effective and achievable alternative for timely detection of wildfires and thus reduce the risk of disasters. YOLOv7 offers a simple, fast, and efficient algorithm for training object detection models which can be used in early detection of smoke columns in the initial stage of wildfires. The developed model showed promising results, achieving a score of 0.74 on the F1 curve at a confidence level of 0.298; that is, a higher score was obtained at lower confidence levels, where conditions are favorable for false positives. The metrics demonstrate the resilience and effectiveness of the model in detecting smoke columns.
Submitted 12 January, 2023;
originally announced January 2023.
-
Transitioning from a bounce to $R^2$ inflation
Authors:
Richard Daniel,
Mariam Campbell,
Carsten van de Bruck,
Peter Dunsby
Abstract:
Non-singular bouncing cosmologies are well-motivated models for the early universe. Recent observational data are consistent with positive spatial curvature and allow for a natural collapsing and bouncing phase in the very early universe. Additionally, bouncing cosmologies have the potential to rectify conceptual shortcomings identified in the theory of inflation, such as the singularity problem. In this paper we present a classical bouncing model in the context of modified gravity, including an $R^2$-term in the action. We show that after the bounce, the universe naturally enters a period of inflation, driven by the $R^2$-term. We analyse the stability of the model and find that the scalaron assists the stability of the model.
Submitted 7 July, 2023; v1 submitted 2 December, 2022;
originally announced December 2022.
-
Optimizing Pulsed-Laser Ablation Production of AlCl Molecules for Laser Cooling
Authors:
Taylor N. Lewis,
Chen Wang,
John R. Daniel,
Madhav Dhital,
Christopher J. Bardeen,
Boerge Hemmerling
Abstract:
Aluminum monochloride (AlCl) has been proposed as a promising candidate for laser cooling to ultracold temperatures, and recent spectroscopy results support this prediction. It is challenging to produce large numbers of AlCl molecules because it is a highly reactive open-shell molecule and must be generated in situ. Here we show that pulsed-laser ablation of stable, non-toxic mixtures of Al with an alkali or alkaline earth chlorides, denoted XCln, can provide a robust and reliable source of cold AlCl molecules. Both the chemical identity of XCln and the Al:XCln molar ratio are varied, and the yield of AlCl is monitored using absorption spectroscopy in a cryogenic gas. For KCl, the production of Al and K atoms was also monitored. We model the AlCl production in the limits of nonequilibrium recombination dominated by first-encounter events. The non-equilibrium model is in agreement with the data and also reproduces the observed trend with different XCln precursors. We find that AlCl production is limited by the solid-state densities of Al and Cl atoms and the recondensation of Al atoms in the ablation plume. We suggest future directions for optimizing the production of cold AlCl molecules using laser ablation.
Submitted 5 October, 2021; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Hypothetical estimands in clinical trials: a unification of causal inference and missing data methods
Authors:
Camila Olarte Parra,
Rhian M. Daniel,
Jonathan W. Bartlett
Abstract:
The ICH E9 addendum introduces the term intercurrent event to refer to events that happen after randomisation and that can either preclude observation of the outcome of interest or affect its interpretation. It proposes five strategies for handling intercurrent events to form an estimand but does not suggest statistical methods for estimation. In this paper we focus on the hypothetical strategy, where the treatment effect is defined under the hypothetical scenario in which the intercurrent event is prevented. For its estimation, we consider causal inference and missing data methods. We establish that certain 'causal inference estimators' are identical to certain 'missing data estimators'. These links may help those familiar with one set of methods but not the other. Moreover, using potential outcome notation allows us to state more clearly the assumptions on which missing data methods rely to estimate hypothetical estimands. This helps to indicate whether estimating a hypothetical estimand is reasonable, and what data should be used in the analysis. We show that hypothetical estimands can be estimated by exploiting data after intercurrent event occurrence, which is typically not used. We also present Monte Carlo simulations that illustrate the implementation and performance of the methods in different settings.
Submitted 9 July, 2021;
originally announced July 2021.
-
What's in a Measurement? Using GPT-3 on SemEval 2021 Task 8 -- MeasEval
Authors:
Curt Kohler,
Ron Daniel Jr
Abstract:
In the summer of 2020 OpenAI released its GPT-3 autoregressive language model to much fanfare. While the model has shown promise on tasks in several areas, it has not always been clear when the results were cherry-picked or when they were the unvarnished output. We were particularly interested in what benefits GPT-3 could bring to the SemEval 2021 MeasEval task - identifying measurements and their associated attributes in scientific literature. We had already experimented with multi-turn question answering as a solution to this task. We wanted to see if we could use GPT-3's few-shot learning capabilities to more easily develop a solution that would have better performance than our prior work. Unfortunately, we have not been successful in that effort. This paper discusses the approach we used, challenges we encountered, and results we observed. Some of the problems we encountered were simply due to the state of the art. For example, the limits on the size of the prompt and answer limited the amount of the training signal that could be offered. Others are more fundamental. We are unaware of generative models that excel in retaining factual information. Also, the impact of changes in the prompts is unpredictable, making it hard to reliably improve performance.
Submitted 28 June, 2021;
originally announced June 2021.
-
Shall we count the living or the dead?
Authors:
Anders Huitfeldt,
Matthew P. Fox,
Eleanor J. Murray,
Asbjørn Hróbjartsson,
Rhian M. Daniel
Abstract:
In the 1958 paper "Shall we count the living or the dead?", Mindel C. Sheps proposed a principled solution to the familiar problem of asymmetry of the relative risk. We provide causal models to clarify the scope and limitations of Sheps' line of reasoning, and show that her preferred variant of the relative risk will be stable between patient groups under certain biologically interpretable conditions. Such stability is useful when findings from an intervention study must be generalized to support clinical decisions in patients whose risk profile differs from the participants in the study. We show that Sheps' approach is consistent with a substantial body of psychological and philosophical research on how human reasoners carry causal information from one context to another, and that it can be implemented in practice using van der Laan et al's Switch Relative Risk, or equivalently, using Baker and Jackson's Generalized Relative Risk Reduction (GRRR).
Submitted 9 July, 2022; v1 submitted 11 June, 2021;
originally announced June 2021.
-
Spectroscopy on the $A^1Π\leftarrow X^1Σ^+$ Transition of Buffer-Gas Cooled AlCl
Authors:
J. R. Daniel,
C. Wang,
K. Rodriguez,
T. Lewis,
A. Teplukhin,
B. Kendrick,
C. Bardeen,
B. Hemmerling
Abstract:
Aluminum monochloride (AlCl) has been proposed as an excellent candidate for laser cooling. Here we present absorption spectroscopy measurements on the $A^1Π\leftarrow X^1Σ^+$ transition in AlCl inside a cryogenic helium buffer-gas beam cell. The high resolution absorption data enables a rigorous, quantitative comparison with our high-level ab initio calculations of the electronic and rovibronic energies, providing a comprehensive picture of the AlCl quantum structure. The combination of high resolution spectral data and theory permits the evaluation of spectroscopic constants and associated properties, like equilibrium bond length, with an order of magnitude higher precision. Based on the measured molecular equilibrium constants of the $A^1Π$ state, we estimate a Franck-Condon factor of the $A^1Π\leftarrow X^1Σ^+$ of 99.88%, which confirms that AlCl is amenable to laser cooling.
Submitted 25 June, 2021; v1 submitted 3 May, 2021;
originally announced May 2021.
-
Inflation and Scale-invariant $R^2$-Gravity
Authors:
Carsten van de Bruck,
Richard Daniel
Abstract:
In scale-invariant models of fundamental physics, mass scales are generated by spontaneous symmetry breaking. In this work, we study inflation in scale-invariant $R^2$ gravity, in which the Planck mass is generated by a scalar field, which is responsible for spontaneous breaking of scale symmetry. If the self-interactions of the scalar field are non-zero, a cosmological constant is generated, which can be potentially quite large. To avoid fine-tuning at late times, we introduce another scalar field which drives the classical cosmological constant to zero during inflation. Working in the Einstein frame, we find that due to a conserved Noether current the corresponding three-field inflationary model (consisting of the two scalar fields plus the scalaron) becomes effectively a two-field model. The price to be paid for introducing the field which cancels the classical cosmological constant at the end of inflation is that the running of the spectral index and the running of the running can be quite large due to entropy perturbations during inflation, making the model testable with future cosmological experiments.
Submitted 26 March, 2021; v1 submitted 23 February, 2021;
originally announced February 2021.
-
Time-dependent mediators in survival analysis: Modelling direct and indirect effects with the additive hazards model
Authors:
Odd O. Aalen,
Mats J. Stensrud,
Vanessa Didelez,
Rhian Daniel,
Kjetil Røysland,
Susanne Strohmaier
Abstract:
We discuss causal mediation analyses for survival data and propose a new approach based on the additive hazards model. The emphasis is on a dynamic point of view, that is, understanding how the direct and indirect effects develop over time. Hence, importantly, we allow for a time varying mediator. To define direct and indirect effects in such a longitudinal survival setting we take an interventional approach (Didelez (2018)) where treatment is separated into one aspect affecting the mediator and a different aspect affecting survival. In general, this leads to a version of the non-parametric g-formula (Robins (1986)). In the present paper, we demonstrate that combining the g-formula with the additive hazards model and a sequential linear model for the mediator process results in simple and interpretable expressions for direct and indirect effects in terms of relative survival as well as cumulative hazards. Our results generalise and formalise the method of dynamic path analysis (Fosen et al. (2006), Strohmaier et al. (2015)). An application to data from a clinical trial on blood pressure medication is given.
Submitted 8 February, 2021; v1 submitted 26 November, 2020;
originally announced November 2020.
-
Analytical Approximation of the Second-Harmonic Conversion Efficiency
Authors:
John R. Daniel,
Shan-Wen Tsai,
Boerge Hemmerling
Abstract:
The second-harmonic generation process of a focused laser beam inside a nonlinear crystal is described by the Boyd-Kleinman theory. Calculating the actual conversion efficiency and upconverted power requires the solution of a double integral that is analytically intractable. We provide an expression that predicts the exact gain coefficient within an error margin of less than 2% over several orders of magnitude of the confocal parameter and as a function of the walk-off parameter. Our result allows for readily tuning the beam parameters to optimize the performance of the upconversion process and improve optical system designs.
Submitted 17 September, 2020;
originally announced September 2020.
-
A Bayesian model of microbiome data for simultaneous identification of covariate associations and prediction of phenotypic outcomes
Authors:
Matthew D. Koslovsky,
Kristi L. Hoffman,
Carrie R. Daniel,
Marina Vannucci
Abstract:
One of the major research questions regarding human microbiome studies is the feasibility of designing interventions that modulate the composition of the microbiome to promote health and cure disease. This requires extensive understanding of the modulating factors of the microbiome, such as dietary intake, as well as the relation between microbial composition and phenotypic outcomes, such as body mass index (BMI). Previous efforts have modeled these data separately, employing two-step approaches that can produce biased interpretations of the results. Here, we propose a Bayesian joint model that simultaneously identifies clinical covariates associated with microbial composition data and predicts a phenotypic response using information contained in the compositional data. Using spike-and-slab priors, our approach can handle high-dimensional compositional as well as clinical data. Additionally, we accommodate the compositional structure of the data via balances and overdispersion typically found in microbial samples. We apply our model to understand the relations between dietary intake, microbial samples, and BMI. In this analysis, we find numerous associations between microbial taxa and dietary factors that may lead to a microbiome that is generally more hospitable to the development of chronic diseases, such as obesity. Additionally, we demonstrate on simulated data how our method outperforms two-step approaches and also present a sensitivity analysis.
Submitted 17 November, 2021; v1 submitted 30 April, 2020;
originally announced April 2020.
-
Toward durable Al-InSb hybrid heterostructures via epitaxy of 2ML interfacial InAs screening layers
Authors:
Candice Thomas,
Rosa E. Diaz,
J. Houston Dycus,
Michael E. Salmon,
Roger E. Daniel,
Tiantian Wang,
Geoffrey C. Gardner,
Michael J. Manfra
Abstract:
The large Landé g-factor, high spin-orbit coupling, and low effective mass of the two-dimensional electron gas in InSb quantum wells combined with proximal superconductivity may realize a scalable platform for topological quantum computation. Aluminum thin films directly deposited on top of InSb planar structures result in the formation of a reactive AlInSb layer at the interface. This interlayer progressively consumes the whole Al film, resulting in a disordered AlInSb layer after a few months at room temperature. We report on a heterostructure design that significantly increases the durability of these hybrid Al-InSb heterostructures, preserving a pure Al film and a sharp superconductor-semiconductor interface for more than one year. Two monolayers of epitaxial InAs at the superconductor-semiconductor interface prevent interfacial reactivity, as evidenced by X-ray reflectivity and energy dispersive spectroscopy measurements. Structural characterizations of the Al films by transmission electron microscopy reveal the presence of grains tens of nanometers wide, predominantly oriented with Al(110) parallel to InSb(001).
Submitted 8 October, 2019;
originally announced October 2019.
-
BioFLAIR: Pretrained Pooled Contextualized Embeddings for Biomedical Sequence Labeling Tasks
Authors:
Shreyas Sharma,
Ron Daniel Jr
Abstract:
Biomedical Named Entity Recognition (NER) is a challenging problem in biomedical information processing due to the widespread ambiguity of out-of-context terms and extensive lexical variations. Performance on bioNER benchmarks continues to improve due to advances like BERT, GPT, and XLNet. FLAIR (1) is an alternative embedding model which is less computationally intensive than the others mentioned. We test FLAIR and its pretrained PubMed embeddings (which we term BioFLAIR) on a variety of bioNER tasks and compare those with results from BERT-type networks. We also investigate the effects of a small amount of additional pretraining on PubMed content, and of combining FLAIR and ELMO models. We find that with the provided embeddings, FLAIR performs on par with the BERT networks, even establishing a new state of the art on one benchmark. Additional pretraining did not provide a clear benefit, although this might change with even more pretraining. Stacking the FLAIR embeddings with others typically does provide a boost in the benchmark results.
Submitted 13 August, 2019;
originally announced August 2019.
-
End-to-End Learning for Answering Structured Queries Directly over Text
Authors:
Paul Groth,
Antony Scerri,
Ron Daniel, Jr.,
Bradley P. Allen
Abstract:
Structured queries expressed in languages such as SQL, SPARQL, or XQuery offer a convenient and explicit way for users to express their information needs for a number of tasks. In this work, we present an approach to answer these directly over text data without storing results in a database. We specifically look at the case of knowledge bases where queries are over entities and the relations between them. Our approach combines distributed query answering (e.g. Triple Pattern Fragments) with models built for extractive question answering. Importantly, by applying distributed query answering we are able to simplify the model learning problem. We train models for a large portion (572) of the relations within Wikidata and achieve an average F1 measure of 0.70 across all models. We also present a systematic method to construct the necessary training data for this task from knowledge graphs and describe a prototype implementation.
Submitted 16 November, 2018; v1 submitted 15 November, 2018;
originally announced November 2018.
-
Open Information Extraction on Scientific Text: An Evaluation
Authors:
Paul Groth,
Michael Lauruhn,
Antony Scerri,
Ron Daniel Jr
Abstract:
Open Information Extraction (OIE) is the task of the unsupervised creation of structured information from text. OIE is often used as a starting point for a number of downstream tasks including knowledge base construction, relation extraction, and question answering. While OIE methods are targeted at being domain independent, they have been evaluated primarily on newspaper, encyclopedic or general web text. In this article, we evaluate the performance of OIE on scientific texts originating from 10 different disciplines. To do so, we use two state-of-the-art OIE systems applying a crowd-sourcing approach. We find that OIE systems perform significantly worse on scientific text than on encyclopedic text. We also provide an error analysis and suggest areas of work to reduce errors. Our corpus of sentences and judgments is made available.
Submitted 4 June, 2018; v1 submitted 15 February, 2018;
originally announced February 2018.
-
Data-adaptive doubly robust instrumental variable methods for treatment effect heterogeneity
Authors:
Karla DiazOrdaz,
Rhian Daniel,
Noemi Kreif
Abstract:
We consider the estimation of the average treatment effect in the treated as a function of baseline covariates, where there is a valid (conditional) instrument.
We describe two doubly robust (DR) estimators: a locally efficient g-estimator, and a targeted minimum loss-based estimator (TMLE). These two DR estimators can be viewed as generalisations of the two-stage least squares (TSLS) method to semi-parametric models that make weaker assumptions. We exploit recent theoretical results that extend to the g-estimator the use of data-adaptive fits for the nuisance parameters.
A simulation study is used to compare standard TSLS with the two DR estimators' finite-sample performance, (1) when fitted using parametric nuisance models, and (2) using data-adaptive nuisance fits, obtained from the Super Learner, an ensemble machine learning method.
Data-adaptive DR estimators have lower bias and improved coverage, when compared to incorrectly specified parametric DR estimators and TSLS. When the parametric model for the treatment effect curve is correctly specified, the g-estimator outperforms all others, but when this model is misspecified, TMLE performs best, while TSLS can result in large biases and zero coverage.
Finally, we illustrate the methods by reanalysing the COPERS (Coping with persistent Pain, Effectiveness Research in Self-management) trial to make inference about the causal effect of treatment actually received, and the extent to which this is modified by depression at baseline.
Submitted 9 June, 2019; v1 submitted 8 February, 2018;
originally announced February 2018.
-
Measuring Spark on AWS: A Case Study on Mining Scientific Publications with Annotation Query
Authors:
Darin McBeath,
Ron Daniel Jr
Abstract:
Annotation Query (AQ) is a program that provides the ability to query many different types of NLP annotations on a text, as well as the original content and structure of the text. The query results may provide new annotations, or they may select subsets of the content and annotations for deeper processing. Like GATE's Mimir, AQ is based on region algebras. Our AQ is implemented to run on a Spark cluster. In this paper we look at how AQ's runtimes are affected by the size of the collection, the number of nodes in the cluster, the type of node, and the characteristics of the queries. Cluster size, of course, makes a large difference in performance so long as skew can be avoided. We find that there is minimal difference in performance when persisting annotations serialized to local SSD drives as opposed to deserialized into local memory. We also find that if the number of nodes is kept constant, then AWS' storage-optimized instance performs the best. But if we factor in total cost, the compute-optimized nodes provide the best performance relative to cost.
Submitted 2 February, 2018;
originally announced February 2018.
-
Missing at random: a stochastic process perspective
Authors:
Daniel Farewell,
Rhian Daniel,
Shaun Seaman
Abstract:
We offer a natural and extensible measure-theoretic treatment of missingness at random. Within the standard missing data framework, we give a novel characterisation of the observed data as a stopping-set sigma algebra. We demonstrate that the usual missingness at random conditions are equivalent to requiring particular stochastic processes to be adapted to a set-indexed filtration of the complete data: measurability conditions that suffice to ensure the likelihood factorisation necessary for ignorability. Our rigorous statement of the missing at random conditions also clarifies a common confusion: what is fixed, and what is random?
Submitted 20 January, 2018;
originally announced January 2018.
-
Complementary ab initio and X-ray nano-diffraction study of Ta2O5
Authors:
R. Hollerweger,
D. Holec,
J. Paulitsch,
M. Bartosik,
R. Daniel,
R. Rachbauer,
P. Polcik,
J. Keckes,
C. Krywka,
P. H. Mayrhofer
Abstract:
Numerous different crystal structures of Ta2O5 are reported in the literature. Although experimentally and computationally obtained lattice parameters and mechanical properties are in excellent agreement, there is a pronounced deviation when it comes to the electronic structures of Ta2O5. Based on ab initio studies and nano-beam X-ray diffraction of sputtered Ta2O5 thin films, we introduce an orthorhombic basic structure with a = 0.6425, b = 0.3769, and c = 0.7706 nm, which is stabilized by flipping of an oxygen atom in neighboring c-planes. The calculated energy of formation, -3.209 eV/atom, is almost as low as the -3.259 eV/atom of the well-known Stephenson superstructure. We propose the new structure because it allows for a good description of orthorhombic Ta2O5 even with a small and simple unit cell, which is especially advantageous for computational studies.
Submitted 16 April, 2014;
originally announced April 2014.
-
A Precision Measurement of pp Elastic Scattering Cross Sections at Intermediate Energies
Authors:
D. Albers,
F. Bauer,
J. Bisplinghoff,
R. Bollmann,
K. Buesser,
M. Busch,
R. Daniel,
O. Diehl,
F. Dohrmann,
H. P. Engelhardt,
J. Ernst,
P. D. Eversheim,
M. Gasthuber,
R. Gebel,
J. Greiff,
A. Gross,
R. Gross-Hardt,
S. Heider,
A. Heine,
F. Hinterberger,
T. Hueskes,
M. Igelbrink,
M. Jeske,
R. Langkau,
J. Lindlein,
et al. (19 additional authors not shown)
Abstract:
We have measured differential cross sections for pp elastic scattering with internal fiber targets in the recirculating beam of the proton synchrotron COSY. Measurements were made continuously during acceleration for projectile kinetic energies between 0.23 and 2.59 GeV in the angular range $30^\circ \leq \theta_{c.m.} \leq 90^\circ$. Details of the apparatus and the data analysis are given and the resulting excitation functions and angular distributions presented. The precision of each data point is typically better than 4%, and a relative normalization uncertainty of only 2.5% within an excitation function has been reached. The impact on phase shift analysis as well as upper bounds on possible resonant contributions in lower partial waves are discussed.
Submitted 23 March, 2004;
originally announced March 2004.