-
Scaling convolutional neural networks achieves expert-level seizure detection in neonatal EEG
Authors:
Robert Hogan,
Sean R. Mathieson,
Aurel Luca,
Soraia Ventura,
Sean Griffin,
Geraldine B. Boylan,
John M. O'Toole
Abstract:
Background: Neonatal seizures are a neurological emergency that require urgent treatment. They are hard to diagnose clinically and can go undetected if EEG monitoring is unavailable. EEG interpretation requires specialised expertise which is not widely available. Algorithms to detect EEG seizures can address this limitation but have yet to reach widespread clinical adoption.
Methods: Retrospecti…
▽ More
Background: Neonatal seizures are a neurological emergency that require urgent treatment. They are hard to diagnose clinically and can go undetected if EEG monitoring is unavailable. EEG interpretation requires specialised expertise which is not widely available. Algorithms to detect EEG seizures can address this limitation but have yet to reach widespread clinical adoption.
Methods: Retrospective EEG data from 332 neonates was used to develop and validate a seizure-detection model. The model was trained and tested with a development dataset ($n=202$) that was annotated with over 12k seizure events on a per-channel basis. This dataset was used to develop a convolutional neural network (CNN) using a modern architecture and training methods. The final model was then validated on two independent multi-reviewer datasets ($n=51$ and $n=79$).
Results: Increasing dataset and model size improved model performance: Matthews correlation coefficient (MCC) and Pearson's correlation ($r$) increased by up to 50% with data scaling and up to 15% with model scaling. Over 50k hours of annotated single-channel EEG was used for training a model with 21 million parameters. State-of-the-art was achieved on an open-access dataset (MCC=0.764, $r=0.824$, and AUC=0.982). The CNN attains expert-level performance on both held-out validation sets, with no significant difference in inter-rater agreement among the experts and among experts and algorithm ($Δκ< -0.095$, $p>0.05$).
Conclusion: With orders of magnitude increases in data and model scale we have produced a new state-of-the-art model for neonatal seizure detection. Expert-level equivalence on completely unseen data, a first in this field, provides a strong indication that the model is ready for further clinical validation.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Credentials in the Occupation Ontology
Authors:
John Beverley,
Robin McGill,
Sam Smith,
Jie Zheng,
Giacomo De Colle,
Finn Wilson,
Matthew Diller,
William D. Duncan,
William R. Hogan,
Yongqun He
Abstract:
The term credential encompasses educational certificates, degrees, certifications, and government-issued licenses. An occupational credential is a verification of an individuals qualification or competence issued by a third party with relevant authority. Job seekers often leverage such credentials as evidence that desired qualifications are satisfied by their holders. Many U.S. education and workf…
▽ More
The term credential encompasses educational certificates, degrees, certifications, and government-issued licenses. An occupational credential is a verification of an individuals qualification or competence issued by a third party with relevant authority. Job seekers often leverage such credentials as evidence that desired qualifications are satisfied by their holders. Many U.S. education and workforce development organizations have recognized the importance of credentials for employment and the challenges of understanding the value of credentials. In this study, we identified and ontologically defined credential and credential-related terms at the textual and semantic levels based on the Occupation Ontology (OccO), a BFO-based ontology. Different credential types and their authorization logic are modeled. We additionally defined a high-level hierarchy of credential related terms and relations among many terms, which were initiated in concert with the Alabama Talent Triad (ATT) program, which aims to connect learners, earners, employers and education/training providers through credentials and skills. To our knowledge, our research provides for the first time systematic ontological modeling of the important domain of credentials and related contents, supporting enhanced credential data and knowledge integration in the future.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
Modeling beam propagation in a moving nonlinear medium
Authors:
Ryan Hogan,
Giulia Marcucci,
Akbar Safari,
A. Nicholas Black,
Boris Braverman,
Jeremy Upham,
Robert W. Boyd
Abstract:
Fully describing light propagation in a rotating, anisotropic medium with thermal nonlinearity requires modeling the interplay between nonlinear refraction, birefringence, and the nonlinear group index. Incorporating these factors into a generalized nonlinear Schrödinger equation and fitting them to recent experimental results reveals two key relationships: the photon drag effect can have a nonlin…
▽ More
Fully describing light propagation in a rotating, anisotropic medium with thermal nonlinearity requires modeling the interplay between nonlinear refraction, birefringence, and the nonlinear group index. Incorporating these factors into a generalized nonlinear Schrödinger equation and fitting them to recent experimental results reveals two key relationships: the photon drag effect can have a nonlinear component that is dependent on the motion of the medium, and the temporal dynamics of the moving birefringent nonlinear medium create distorted figure-eight-like transverse trajectories at the output. The beam trajectory can be accurately modelled with a full understanding of the propagation effects. Efficiently modeling these effects and accurately predicting the beam's output position has implications for optimizing applications in velocimetry and beam-steering. Understanding the roles of competitive nonlinearities gives insight into the creation or suppression of nonlinear phenomena like self-action effects.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Working Backwards: Learning to Place by Picking
Authors:
Oliver Limoyo,
Abhisek Konar,
Trevor Ablett,
Jonathan Kelly,
Francois R. Hogan,
Gregory Dudek
Abstract:
We present placing via picking (PvP), a method to autonomously collect real-world demonstrations for a family of placing tasks in which objects must be manipulated to specific contact-constrained locations. With PvP, we approach the collection of robotic object placement demonstrations by reversing the gras** process and exploiting the inherent symmetry of the pick and place problems. Specifical…
▽ More
We present placing via picking (PvP), a method to autonomously collect real-world demonstrations for a family of placing tasks in which objects must be manipulated to specific contact-constrained locations. With PvP, we approach the collection of robotic object placement demonstrations by reversing the gras** process and exploiting the inherent symmetry of the pick and place problems. Specifically, we obtain placing demonstrations from a set of grasp sequences of objects initially located at their target placement locations. Our system can collect hundreds of demonstrations in contact-constrained environments without human intervention by combining two modules: tactile regras** and compliant control for grasps. We train a policy directly from visual observations through behavioral cloning, using the autonomously-collected demonstrations. By doing so, the policy can generalize to object placement scenarios outside of the training environment without privileged information (e.g., placing a plate picked up from a table). We validate our approach in home robotic scenarios that include dishwasher loading and table setting. Our approach yields robotic placing policies that outperform policies trained with kinesthetic teaching, both in terms of performance and data efficiency, while requiring no human supervision.
△ Less
Submitted 20 March, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
A Study of Human-Robot Handover through Human-Human Object Transfer
Authors:
Charlotte Morissette,
Bobak H. Baghi,
Francois R. Hogan,
Gregory Dudek
Abstract:
In this preliminary study, we investigate changes in handover behaviour when transferring hazardous objects with the help of a high-resolution touch sensor. Participants were asked to hand over a safe and hazardous object (a full cup and an empty cup) while instrumented with a modified STS sensor. Our data shows a clear distinction in the length of handover for the full cup vs the empty one, with…
▽ More
In this preliminary study, we investigate changes in handover behaviour when transferring hazardous objects with the help of a high-resolution touch sensor. Participants were asked to hand over a safe and hazardous object (a full cup and an empty cup) while instrumented with a modified STS sensor. Our data shows a clear distinction in the length of handover for the full cup vs the empty one, with the former being slower. Sensor data further suggests a change in tactile behaviour dependent on the object's risk factor. The results of this paper motivate a deeper study of tactile factors which could characterize a risky handover, allowing for safer human-robot interactions in the future.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
A Study of Generative Large Language Model for Medical Research and Healthcare
Authors:
Cheng Peng,
Xi Yang,
Aokun Chen,
Kaleb E Smith,
Nima PourNejatian,
Anthony B Costa,
Cheryl Martin,
Mona G Flores,
Ying Zhang,
Tanja Magoc,
Gloria Lipori,
Duane A Mitchell,
Naykky S Ospina,
Mustafa M Ahmed,
William R Hogan,
Elizabeth A Shenkman,
Yi Guo,
Jiang Bian,
Yonghui Wu
Abstract:
There is enormous enthusiasm and concerns in using large language models (LLMs) in healthcare, yet current assumptions are all based on general-purpose LLMs such as ChatGPT. This study develops a clinical generative LLM, GatorTronGPT, using 277 billion words of mixed clinical and English text with a GPT-3 architecture of 20 billion parameters. GatorTronGPT improves biomedical natural language proc…
▽ More
There is enormous enthusiasm and concerns in using large language models (LLMs) in healthcare, yet current assumptions are all based on general-purpose LLMs such as ChatGPT. This study develops a clinical generative LLM, GatorTronGPT, using 277 billion words of mixed clinical and English text with a GPT-3 architecture of 20 billion parameters. GatorTronGPT improves biomedical natural language processing for medical research. Synthetic NLP models trained using GatorTronGPT generated text outperform NLP models trained using real-world clinical text. Physicians Turing test using 1 (worst) to 9 (best) scale shows that there is no significant difference in linguistic readability (p = 0.22; 6.57 of GatorTronGPT compared with 6.93 of human) and clinical relevance (p = 0.91; 7.0 of GatorTronGPT compared with 6.97 of human) and that physicians cannot differentiate them (p < 0.001). This study provides insights on the opportunities and challenges of LLMs for medical research and healthcare.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
Clinical Concept and Relation Extraction Using Prompt-based Machine Reading Comprehension
Authors:
Cheng Peng,
Xi Yang,
Zehao Yu,
Jiang Bian,
William R. Hogan,
Yonghui Wu
Abstract:
Objective: To develop a natural language processing system that solves both clinical concept extraction and relation extraction in a unified prompt-based machine reading comprehension (MRC) architecture with good generalizability for cross-institution applications.
Methods: We formulate both clinical concept extraction and relation extraction using a unified prompt-based MRC architecture and exp…
▽ More
Objective: To develop a natural language processing system that solves both clinical concept extraction and relation extraction in a unified prompt-based machine reading comprehension (MRC) architecture with good generalizability for cross-institution applications.
Methods: We formulate both clinical concept extraction and relation extraction using a unified prompt-based MRC architecture and explore state-of-the-art transformer models. We compare our MRC models with existing deep learning models for concept extraction and end-to-end relation extraction using two benchmark datasets developed by the 2018 National NLP Clinical Challenges (n2c2) challenge (medications and adverse drug events) and the 2022 n2c2 challenge (relations of social determinants of health [SDoH]). We also evaluate the transfer learning ability of the proposed MRC models in a cross-institution setting. We perform error analyses and examine how different prompting strategies affect the performance of MRC models.
Results and Conclusion: The proposed MRC models achieve state-of-the-art performance for clinical concept and relation extraction on the two benchmark datasets, outperforming previous non-MRC transformer models. GatorTron-MRC achieves the best strict and lenient F1-scores for concept extraction, outperforming previous deep learning models on the two datasets by 1%~3% and 0.7%~1.3%, respectively. For end-to-end relation extraction, GatorTron-MRC and BERT-MIMIC-MRC achieve the best F1-scores, outperforming previous deep learning models by 0.9%~2.4% and 10%-11%, respectively. For cross-institution evaluation, GatorTron-MRC outperforms traditional GatorTron by 6.4% and 16% for the two datasets, respectively. The proposed method is better at handling nested/overlapped concepts, extracting relations, and has good portability for cross-institute applications.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies
Authors:
Zehao Yu,
Xi Yang,
Chong Dang,
Prakash Adekkanattu,
Braja Gopal Patra,
Yifan Peng,
Jyotishman Pathak,
Debbie L. Wilson,
Ching-Yuan Chang,
Wei-Hsuan Lo-Ciganic,
Thomas J. George,
William R. Hogan,
Yi Guo,
Jiang Bian,
Yonghui Wu
Abstract:
Objective: We aim to develop an open-source natural language processing (NLP) package, SODA (i.e., SOcial DeterminAnts), with pre-trained transformer models to extract social determinants of health (SDoH) for cancer patients, examine the generalizability of SODA to a new disease domain (i.e., opioid use), and evaluate the extraction rate of SDoH using cancer populations.
Methods: We identified S…
▽ More
Objective: We aim to develop an open-source natural language processing (NLP) package, SODA (i.e., SOcial DeterminAnts), with pre-trained transformer models to extract social determinants of health (SDoH) for cancer patients, examine the generalizability of SODA to a new disease domain (i.e., opioid use), and evaluate the extraction rate of SDoH using cancer populations.
Methods: We identified SDoH categories and attributes and developed an SDoH corpus using clinical notes from a general cancer cohort. We compared four transformer-based NLP models to extract SDoH, examined the generalizability of NLP models to a cohort of patients prescribed with opioids, and explored customization strategies to improve performance. We applied the best NLP model to extract 19 categories of SDoH from the breast (n=7,971), lung (n=11,804), and colorectal cancer (n=6,240) cohorts.
Results and Conclusion: We developed a corpus of 629 cancer patients notes with annotations of 13,193 SDoH concepts/attributes from 19 categories of SDoH. The Bidirectional Encoder Representations from Transformers (BERT) model achieved the best strict/lenient F1 scores of 0.9216 and 0.9441 for SDoH concept extraction, 0.9617 and 0.9626 for linking attributes to SDoH concepts. Fine-tuning the NLP models using new annotations from opioid use patients improved the strict/lenient F1 scores from 0.8172/0.8502 to 0.8312/0.8679. The extraction rates among 19 categories of SDoH varied greatly, where 10 SDoH could be extracted from >70% of cancer patients, but 9 SDoH had a low extraction rate (<70% of cancer patients). The SODA package with pre-trained transformer models is publicly available at https://github.com/uf-hobiinformatics-lab/SDoH_SODA.
△ Less
Submitted 18 May, 2023; v1 submitted 6 December, 2022;
originally announced December 2022.
-
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
Authors:
Sahand Rezaei-Shoshtari,
Charlotte Morissette,
Francois Robert Hogan,
Gregory Dudek,
David Meger
Abstract:
In this paper, hypernetworks are trained to generate behaviors across a range of unseen task conditions, via a novel TD-based training objective and data from a set of near-optimal RL solutions for training tasks. This work relates to meta RL, contextual RL, and transfer learning, with a particular focus on zero-shot performance at test time, enabled by knowledge of the task parameters (also known…
▽ More
In this paper, hypernetworks are trained to generate behaviors across a range of unseen task conditions, via a novel TD-based training objective and data from a set of near-optimal RL solutions for training tasks. This work relates to meta RL, contextual RL, and transfer learning, with a particular focus on zero-shot performance at test time, enabled by knowledge of the task parameters (also known as context). Our technical approach is based upon viewing each RL algorithm as a map** from the MDP specifics to the near-optimal value function and policy and seek to approximate it with a hypernetwork that can generate near-optimal value functions and policies, given the parameters of the MDP. We show that, under certain conditions, this map** can be considered as a supervised learning problem. We empirically evaluate the effectiveness of our method for zero-shot transfer to new reward and transition dynamics on a series of continuous control tasks from DeepMind Control Suite. Our method demonstrates significant improvements over baselines from multitask and meta RL approaches.
△ Less
Submitted 2 January, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Beam deflection and negative drag in a moving nonlinear medium
Authors:
Ryan Hogan,
Akbar Safari,
Giulia Marcucci,
Boris Braverman,
Robert W. Boyd
Abstract:
Light propagating in a moving medium with refractive index other than unity is subject to light drag. While the light drag effect due to the linear refractive index is often negligibly small, it can be enhanced in materials with a large group index. Here we show that the nonlinear refractive index can also play a crucial role in propagation of light in moving media and results in a beam deflection…
▽ More
Light propagating in a moving medium with refractive index other than unity is subject to light drag. While the light drag effect due to the linear refractive index is often negligibly small, it can be enhanced in materials with a large group index. Here we show that the nonlinear refractive index can also play a crucial role in propagation of light in moving media and results in a beam deflection that might be confused with the transverse drag effect. We perform an experiment with a rotating ruby crystal which exhibits a very large negative group index and a positive nonlinear refractive index. The negative group index drags the light opposite to the motion of the medium. However, the positive nonlinear refractive index deflects the beam towards the motion of the medium and hinders the observation of the negative drag effect. Hence, we show that it is necessary to measure not only the transverse shift of the beam, but also its output angle to discriminate the light-drag effect from beam deflection -- a crucial step missing in earlier experiments.
△ Less
Submitted 4 October, 2022;
originally announced October 2022.
-
Quench dynamics in the Jaynes-Cummings-Hubbard and Dicke models
Authors:
Andrew R. Hogan,
Andy M. Martin
Abstract:
Both the Jaynes-Cummings-Hubbard (JCH) and Dicke models can be thought of as idealised models of a quantum battery. In this paper we numerically investigate the charging properties of both of these models. The two models differ in how the two-level systems are contained in cavities. In the Dicke model, the $N$ two-level systems are contained in a single cavity, while in the JCH model the two-level…
▽ More
Both the Jaynes-Cummings-Hubbard (JCH) and Dicke models can be thought of as idealised models of a quantum battery. In this paper we numerically investigate the charging properties of both of these models. The two models differ in how the two-level systems are contained in cavities. In the Dicke model, the $N$ two-level systems are contained in a single cavity, while in the JCH model the two-level systems each have their own cavity and are able to pass photons between them. In each of these models we consider a scenario where the two-level systems start in the ground state and the coupling parameter between the photon and the two-level systems is quenched. Each of these models display a maximum charging power that scales with the size of the battery $N$ and no super charging was found. Charging power also scales with the square root of the average number of photons per two-level system $m$ for both models. Finally, in the JCH model, the power was found to charge inversely with the square root of the photon-cavity coupling $κ$.
△ Less
Submitted 9 May, 2023; v1 submitted 3 October, 2022;
originally announced October 2022.
-
Ontology Development Kit: a toolkit for building, maintaining, and standardising biomedical ontologies
Authors:
Nicolas Matentzoglu,
Damien Goutte-Gattat,
Shawn Zheng Kai Tan,
James P. Balhoff,
Seth Carbon,
Anita R. Caron,
William D. Duncan,
Joe E. Flack,
Melissa Haendel,
Nomi L. Harris,
William R Hogan,
Charles Tapley Hoyt,
Rebecca C. Jackson,
HyeongSik Kim,
Huseyin Kir,
Martin Larralde,
Julie A. McMurry,
James A. Overton,
Bjoern Peters,
Clare Pilgrim,
Ray Stefancsik,
Sofia MC Robb,
Sabrina Toro,
Nicole A Vasilevsky,
Ramona Walls
, et al. (2 additional authors not shown)
Abstract:
Similar to managing software packages, managing the ontology life cycle involves multiple complex workflows such as preparing releases, continuous quality control checking, and dependency management. To manage these processes, a diverse set of tools is required, from command line utilities to powerful ontology engineering environments such as ROBOT. Particularly in the biomedical domain, which has…
▽ More
Similar to managing software packages, managing the ontology life cycle involves multiple complex workflows such as preparing releases, continuous quality control checking, and dependency management. To manage these processes, a diverse set of tools is required, from command line utilities to powerful ontology engineering environments such as ROBOT. Particularly in the biomedical domain, which has developed a set of highly diverse yet inter-dependent ontologies, standardising release practices and metadata, and establishing shared quality standards, are crucial to enable interoperability. The Ontology Development Kit (ODK) provides a set of standardised, customisable, and automatically executable workflows, and packages all required tooling in a single Docker image. In this paper, we provide an overview of how the ODK works, show how it is used in practice, and describe how we envision it driving standardisation efforts in our community.
△ Less
Submitted 5 July, 2022;
originally announced July 2022.
-
GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records
Authors:
Xi Yang,
Aokun Chen,
Nima PourNejatian,
Hoo Chang Shin,
Kaleb E Smith,
Christopher Parisien,
Colin Compas,
Cheryl Martin,
Mona G Flores,
Ying Zhang,
Tanja Magoc,
Christopher A Harle,
Gloria Lipori,
Duane A Mitchell,
William R Hogan,
Elizabeth A Shenkman,
Jiang Bian,
Yonghui Wu
Abstract:
There is an increasing interest in develo** artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, the largest of which trained in the clinical domain is compar…
▽ More
There is an increasing interest in develo** artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, the largest of which trained in the clinical domain is comparatively small at 110 million parameters (compared with billions of parameters in the general domain). It is not clear how large clinical language models with billions of parameters can help medical AI systems utilize unstructured EHRs. In this study, we develop from scratch a large clinical language model - GatorTron - using >90 billion words of text (including >82 billion words of de-identified clinical text) and systematically evaluate it on 5 clinical NLP tasks including clinical concept extraction, medical relation extraction, semantic textual similarity, natural language inference (NLI), and medical question answering (MQA). We examine how (1) scaling up the number of parameters and (2) scaling up the size of the training data could benefit these NLP tasks. GatorTron models scale up the clinical language model from 110 million to 8.9 billion parameters and improve 5 clinical NLP tasks (e.g., 9.6% and 9.5% improvement in accuracy for NLI and MQA), which can be applied to medical AI systems to improve healthcare delivery. The GatorTron models are publicly available at: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/clara/models/gatortron_og.
△ Less
Submitted 16 December, 2022; v1 submitted 2 February, 2022;
originally announced March 2022.
-
Machine Learning Emulation of Urban Land Surface Processes
Authors:
David Meyer,
Sue Grimmond,
Peter Dueben,
Robin Hogan,
Maarten van Reeuwijk
Abstract:
Can we improve the modeling of urban land surface processes with machine learning (ML)? A prior comparison of urban land surface models (ULSMs) found that no single model is 'best' at predicting all common surface fluxes. Here, we develop an urban neural network (UNN) trained on the mean predicted fluxes from 22 ULSMs at one site. The UNN emulates the mean output of ULSMs accurately. When compared…
▽ More
Can we improve the modeling of urban land surface processes with machine learning (ML)? A prior comparison of urban land surface models (ULSMs) found that no single model is 'best' at predicting all common surface fluxes. Here, we develop an urban neural network (UNN) trained on the mean predicted fluxes from 22 ULSMs at one site. The UNN emulates the mean output of ULSMs accurately. When compared to a reference ULSM (Town Energy Balance; TEB), the UNN has greater accuracy relative to flux observations, less computational cost, and requires fewer input parameters. When coupled to the Weather Research Forecasting (WRF) model using TensorFlow bindings, WRF-UNN is stable and more accurate than the reference WRF-TEB. Although the application is currently constrained by the training data (1 site), we show a novel approach to improve the modeling of surface fluxes by combining the strengths of several ULSMs into one using ML.
△ Less
Submitted 15 March, 2022; v1 submitted 21 December, 2021;
originally announced December 2021.
-
A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models
Authors:
Zehao Yu,
Xi Yang,
Chong Dang,
Songzi Wu,
Prakash Adekkanattu,
Jyotishman Pathak,
Thomas J. George,
William R. Hogan,
Yi Guo,
Jiang Bian,
Yonghui Wu
Abstract:
Social and behavioral determinants of health (SBDoH) have important roles in sha** people's health. In clinical research studies, especially comparative effectiveness studies, failure to adjust for SBDoH factors will potentially cause confounding issues and misclassification errors in either statistical analyses and machine learning-based models. However, there are limited studies to examine SBD…
▽ More
Social and behavioral determinants of health (SBDoH) have important roles in sha** people's health. In clinical research studies, especially comparative effectiveness studies, failure to adjust for SBDoH factors will potentially cause confounding issues and misclassification errors in either statistical analyses and machine learning-based models. However, there are limited studies to examine SBDoH factors in clinical outcomes due to the lack of structured SBDoH information in current electronic health record (EHR) systems, while much of the SBDoH information is documented in clinical narratives. Natural language processing (NLP) is thus the key technology to extract such information from unstructured clinical text. However, there is not a mature clinical NLP system focusing on SBDoH. In this study, we examined two state-of-the-art transformer-based NLP models, including BERT and RoBERTa, to extract SBDoH concepts from clinical narratives, applied the best performing model to extract SBDoH concepts on a lung cancer screening patient cohort, and examined the difference of SBDoH information between NLP extracted results and structured EHRs (SBDoH information captured in standard vocabularies such as the International Classification of Diseases codes). The experimental results show that the BERT-based NLP model achieved the best strict/lenient F1-score of 0.8791 and 0.8999, respectively. The comparison between NLP extracted SBDoH information and structured EHRs in the lung cancer patient cohort of 864 patients with 161,933 various types of clinical notes showed that much more detailed information about smoking, education, and employment were only captured in clinical narratives and that it is necessary to use both clinical narratives and structured EHRs to construct a more complete picture of patients' SBDoH factors.
△ Less
Submitted 10 August, 2021;
originally announced August 2021.
-
Machine Learning Emulation of 3D Cloud Radiative Effects
Authors:
David Meyer,
Robin J. Hogan,
Peter D. Dueben,
Shannon L. Mason
Abstract:
The treatment of cloud structure in numerical weather and climate models is often greatly simplified to make them computationally affordable. Here we propose to correct the European Centre for Medium-Range Weather Forecasts 1D radiation scheme ecRad for 3D cloud effects using computationally cheap neural networks. 3D cloud effects are learned as the difference between ecRad's fast 1D Tripleclouds…
▽ More
The treatment of cloud structure in numerical weather and climate models is often greatly simplified to make them computationally affordable. Here we propose to correct the European Centre for Medium-Range Weather Forecasts 1D radiation scheme ecRad for 3D cloud effects using computationally cheap neural networks. 3D cloud effects are learned as the difference between ecRad's fast 1D Tripleclouds solver that neglects them and its 3D SPARTACUS (SPeedy Algorithm for Radiative TrAnsfer through CloUd Sides) solver that includes them but is about five times more computationally expensive. With typical errors between 20 % and 30 % of the 3D signal, neural networks improve Tripleclouds' accuracy for about 1 % increase in runtime. Thus, rather than emulating the whole of SPARTACUS, we keep Tripleclouds unchanged for cloud-free parts of the atmosphere and 3D-correct it elsewhere. The focus on the comparably small 3D correction instead of the entire signal allows us to improve predictions significantly if we assume a similar signal-to-noise ratio for both.
△ Less
Submitted 15 March, 2022; v1 submitted 22 March, 2021;
originally announced March 2021.
-
Learning Intuitive Physics with Multimodal Generative Models
Authors:
Sahand Rezaei-Shoshtari,
Francois Robert Hogan,
Michael Jenkin,
David Meger,
Gregory Dudek
Abstract:
Predicting the future interaction of objects when they come into contact with their environment is key for autonomous agents to take intelligent and anticipatory actions. This paper presents a perception framework that fuses visual and tactile feedback to make predictions about the expected motion of objects in dynamic scenes. Visual information captures object properties such as 3D shape and loca…
▽ More
Predicting the future interaction of objects when they come into contact with their environment is key for autonomous agents to take intelligent and anticipatory actions. This paper presents a perception framework that fuses visual and tactile feedback to make predictions about the expected motion of objects in dynamic scenes. Visual information captures object properties such as 3D shape and location, while tactile information provides critical cues about interaction forces and resulting object motion when it makes contact with the environment. Utilizing a novel See-Through-your-Skin (STS) sensor that provides high resolution multimodal sensing of contact surfaces, our system captures both the visual appearance and the tactile properties of objects. We interpret the dual stream signals from the sensor using a Multimodal Variational Autoencoder (MVAE), allowing us to capture both modalities of contacting objects and to develop a map** from visual to tactile interaction and vice-versa. Additionally, the perceptual system can be used to infer the outcome of future physical interactions, which we validate through simulated and real-world experiments in which the resting state of an object is predicted from given initial conditions.
△ Less
Submitted 19 January, 2021; v1 submitted 12 January, 2021;
originally announced January 2021.
-
Copula-based synthetic data augmentation for machine-learning emulators
Authors:
David Meyer,
Thomas Nagler,
Robin J. Hogan
Abstract:
Can we improve machine-learning (ML) emulators with synthetic data? If data are scarce or expensive to source and a physical model is available, statistically generated data may be useful for augmenting training sets cheaply. Here we explore the use of copula-based models for generating synthetically augmented datasets in weather and climate by testing the method on a toy physical model of downwel…
▽ More
Can we improve machine-learning (ML) emulators with synthetic data? If data are scarce or expensive to source and a physical model is available, statistically generated data may be useful for augmenting training sets cheaply. Here we explore the use of copula-based models for generating synthetically augmented datasets in weather and climate by testing the method on a toy physical model of downwelling longwave radiation and corresponding neural network emulator. Results show that for copula-augmented datasets, predictions are improved by up to 62 % for the mean absolute error (from 1.17 to 0.44 W m$^{-2}$).
△ Less
Submitted 26 September, 2021; v1 submitted 16 December, 2020;
originally announced December 2020.
-
Seeing Through your Skin: Recognizing Objects with a Novel Visuotactile Sensor
Authors:
Francois Robert Hogan,
Michael Jenkin,
Sahand Rezaei-Shoshtari,
Yogesh Girdhar,
David Meger,
Gregory Dudek
Abstract:
We introduce a new class of vision-based sensor and associated algorithmic processes that combine visual imaging with high-resolution tactile sending, all in a uniform hardware and computational architecture. We demonstrate the sensor's efficacy for both multi-modal object recognition and metrology. Object recognition is typically formulated as an unimodal task, but by combining two sensor modalit…
▽ More
We introduce a new class of vision-based sensor and associated algorithmic processes that combine visual imaging with high-resolution tactile sending, all in a uniform hardware and computational architecture. We demonstrate the sensor's efficacy for both multi-modal object recognition and metrology. Object recognition is typically formulated as an unimodal task, but by combining two sensor modalities we show that we can achieve several significant performance improvements. This sensor, named the See-Through-your-Skin sensor (STS), is designed to provide rich multi-modal sensing of contact surfaces. Inspired by recent developments in optical tactile sensing technology, we address a key missing feature of these sensors: the ability to capture a visual perspective of the region beyond the contact surface. Whereas optical tactile sensors are typically opaque, we present a sensor with a semitransparent skin that has the dual capabilities of acting as a tactile sensor and/or as a visual camera depending on its internal lighting conditions. This paper details the design of the sensor, showcases its dual sensing capabilities, and presents a deep learning architecture that fuses vision and touch. We validate the ability of the sensor to classify household objects, recognize fine textures, and infer their physical properties both through numerical simulations and experiments with a smart countertop prototype.
△ Less
Submitted 14 December, 2020; v1 submitted 18 November, 2020;
originally announced November 2020.
-
A Long Horizon Planning Framework for Manipulating Rigid Pointcloud Objects
Authors:
Anthony Simeonov,
Yilun Du,
Beomjoon Kim,
Francois R. Hogan,
Joshua Tenenbaum,
Pulkit Agrawal,
Alberto Rodriguez
Abstract:
We present a framework for solving long-horizon planning problems involving manipulation of rigid objects that operates directly from a point-cloud observation, i.e. without prior object models. Our method plans in the space of object subgoals and frees the planner from reasoning about robot-object interaction dynamics by relying on a set of generalizable manipulation primitives. We show that for…
▽ More
We present a framework for solving long-horizon planning problems involving manipulation of rigid objects that operates directly from a point-cloud observation, i.e. without prior object models. Our method plans in the space of object subgoals and frees the planner from reasoning about robot-object interaction dynamics by relying on a set of generalizable manipulation primitives. We show that for rigid bodies, this abstraction can be realized using low-level manipulation skills that maintain sticking contact with the object and represent subgoals as 3D transformations. To enable generalization to unseen objects and improve planning performance, we propose a novel way of representing subgoals for rigid-body manipulation and a graph-attention based neural network architecture for processing point-cloud inputs. We experimentally validate these choices using simulated and real-world experiments on the YuMi robot. Results demonstrate that our method can successfully manipulate new objects into target configurations requiring long-term planning. Overall, our framework realizes the best of the worlds of task-and-motion planning (TAMP) and learning-based approaches. Project website: https://anthonysimeonov.github.io/rpo-planning-framework/.
△ Less
Submitted 16 November, 2020;
originally announced November 2020.
-
Tactile Dexterity: Manipulation Primitives with Tactile Feedback
Authors:
Francois R. Hogan,
Jose Ballester,
Siyuan Dong,
Alberto Rodriguez
Abstract:
This paper develops closed-loop tactile controllers for dexterous robotic manipulation with a dual-palm robotic system. Tactile dexterity is an approach to dexterous manipulation that plans for robot/object interactions that render interpretable tactile information for control. We divide the role of tactile control into two goals: 1) control the contact state between the end-effector and the objec…
▽ More
This paper develops closed-loop tactile controllers for dexterous robotic manipulation with a dual-palm robotic system. Tactile dexterity is an approach to dexterous manipulation that plans for robot/object interactions that render interpretable tactile information for control. We divide the role of tactile control into two goals: 1) control the contact state between the end-effector and the object (contact/no-contact, stick/slip) by regulating the stability of planned contact configurations and monitoring undesired slip events; and 2) control the object state by tactile-based tracking and iterative replanning of the object and robot trajectories.
Key to this formulation is the decomposition of manipulation plans into sequences of manipulation primitives with simple mechanics and efficient planners. We consider the scenario of manipulating an object from an initial pose to a target pose on a flat surface while correcting for external perturbations and uncertainty in the initial pose of the object. We experimentally validate the approach with an ABB YuMi dual-arm robot and demonstrate the ability of the tactile controller to react to external perturbations.
△ Less
Submitted 30 April, 2020; v1 submitted 8 February, 2020;
originally announced February 2020.
-
Hybrid Differential Dynamic Programming for Planar Manipulation Primitives
Authors:
Neel Doshi,
Francois R. Hogan,
Alberto Rodriguez
Abstract:
We present a hybrid differential dynamic programming (DDP) algorithm for closed-loop execution of manipulation primitives with frictional contact switches. Planning and control of these primitives is challenging as they are hybrid, under-actuated, and stochastic. We address this by develo** hybrid DDP both to plan finite horizon trajectories with a few contact switches and to create linear stabi…
▽ More
We present a hybrid differential dynamic programming (DDP) algorithm for closed-loop execution of manipulation primitives with frictional contact switches. Planning and control of these primitives is challenging as they are hybrid, under-actuated, and stochastic. We address this by develo** hybrid DDP both to plan finite horizon trajectories with a few contact switches and to create linear stabilizing controllers. We evaluate the performance and computational cost of our framework in ablations studies for two primitives: planar pushing and planar pivoting. We find that generating pose-to-pose closed-loop trajectories from most configurations requires only a couple (one to two) hybrid switches and can be done in reasonable time (one to five seconds). We further demonstrate that our controller stabilizes these hybrid trajectories on a real pushing system. A video describing our work can be found at https://youtu.be/YGSe4cUfq6Q.
△ Less
Submitted 20 April, 2020; v1 submitted 31 October, 2019;
originally announced November 2019.
-
Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods
Authors:
Xi Yang,
Yan Gong,
Nida Waheed,
Keith March,
Jiang Bian,
William R. Hogan,
Yonghui Wu
Abstract:
Cardiotoxicity related to cancer therapies has become a serious issue, diminishing cancer treatment outcomes and quality of life. Early detection of cancer patients at risk for cardiotoxicity before cardiotoxic treatments and providing preventive measures are potential solutions to improve cancer patients's quality of life. This study focuses on predicting the development of heart failure in cance…
▽ More
Cardiotoxicity related to cancer therapies has become a serious issue, diminishing cancer treatment outcomes and quality of life. Early detection of cancer patients at risk for cardiotoxicity before cardiotoxic treatments and providing preventive measures are potential solutions to improve cancer patients's quality of life. This study focuses on predicting the development of heart failure in cancer patients after cancer diagnoses using historical electronic health record (EHR) data. We examined four machine learning algorithms using 143,199 cancer patients from the University of Florida Health (UF Health) Integrated Data Repository (IDR). We identified a total number of 1,958 qualified cases and matched them to 15,488 controls by gender, age, race, and major cancer type. Two feature encoding strategies were compared to encode variables as machine learning features. The gradient boosting (GB) based model achieved the best AUC score of 0.9077 (with a sensitivity of 0.8520 and a specificity of 0.8138), outperforming other machine learning methods. We also looked into the subgroup of cancer patients with exposure to chemotherapy drugs and observed a lower specificity score (0.7089). The experimental results show that machine learning methods are able to capture clinical factors that are known to be associated with heart failure and that it is feasible to use machine learning methods to identify cancer patients at risk for cancer therapy-related heart failure.
△ Less
Submitted 1 October, 2019;
originally announced October 2019.
-
Rainfall nowcasting by combining radars, microwave links and rain gauges
Authors:
Blandine Bianchi,
Peter Jan van Leeuwen,
Robin J. Hogan,
Alexis Berne
Abstract:
The objective of this work is to provide high-resolution rain rate maps at short lead-time forecasts (nowcasts) necessary to anticipate flooding and properly manage sewage systems in urban areas by combining radars, rain gauges, and operational microwave links, and taking into account their respective uncertainties. A variational approach (3D-Var) is used to find the best estimate for the rain rat…
▽ More
The objective of this work is to provide high-resolution rain rate maps at short lead-time forecasts (nowcasts) necessary to anticipate flooding and properly manage sewage systems in urban areas by combining radars, rain gauges, and operational microwave links, and taking into account their respective uncertainties. A variational approach (3D-Var) is used to find the best estimate for the rain rate, and its error covariance, from the different rain sensors. Short-term rain rate forecasts are then produced by assuming Lagrangian persistence. A velocity field is obtained from the operational radar-derived rain fields, and the rain rate field is advected using the Total Variance Diminishing (TVD) scheme. The error covariance associated to the estimated rain rate is also propagated, and we use these two in the 3D-Var at the next observation time step. This approach can be seen as a Variational Kalman Filter (VKF), in which the covariance of the prior is not constant but dependent on time. The proposed approach has been tested using data from 14 rain gauges, 14 microwave links and the operational radar rain product from MeteoSwiss in the area of Zurich (Switzerland). During the applications the assumption of the Lagrangian persistence appears to be valid up to 20 min (a bit longer for stratiform events). During convective events, the algorithm is less powerful and shorter lead times should be considered (i.e., 15 min). Although such lead times are short, they are still useful to various hydrological and outdoor applications.
△ Less
Submitted 28 October, 2018;
originally announced October 2018.
-
A Data-Efficient Approach to Precise and Controlled Pushing
Authors:
Maria Bauza,
Francois R. Hogan,
Alberto Rodriguez
Abstract:
Decades of research in control theory have shown that simple controllers, when provided with timely feedback, can control complex systems. Pushing is an example of a complex mechanical system that is difficult to model accurately due to unknown system parameters such as coefficients of friction and pressure distributions. In this paper, we explore the data-complexity required for controlling, rath…
▽ More
Decades of research in control theory have shown that simple controllers, when provided with timely feedback, can control complex systems. Pushing is an example of a complex mechanical system that is difficult to model accurately due to unknown system parameters such as coefficients of friction and pressure distributions. In this paper, we explore the data-complexity required for controlling, rather than modeling, such a system. Results show that a model-based control approach, where the dynamical model is learned from data, is capable of performing complex pushing trajectories with a minimal amount of training data (10 data points). The dynamics of pushing interactions are modeled using a Gaussian process (GP) and are leveraged within a model predictive control approach that linearizes the GP and imposes actuator and task constraints for a planar manipulation task.
△ Less
Submitted 9 October, 2018; v1 submitted 25 July, 2018;
originally announced July 2018.
-
Tactile Regrasp: Grasp Adjustments via Simulated Tactile Transformations
Authors:
Francois R. Hogan,
Maria Bauza,
Oleguer Canal,
Elliott Donlon,
Alberto Rodriguez
Abstract:
This paper presents a novel regrasp control policy that makes use of tactile sensing to plan local grasp adjustments. Our approach determines regrasp actions by virtually searching for local transformations of tactile measurements that improve the quality of the grasp. First, we construct a tactile-based grasp quality metric using a deep convolutional neural network trained on over 2800 grasps. Th…
▽ More
This paper presents a novel regrasp control policy that makes use of tactile sensing to plan local grasp adjustments. Our approach determines regrasp actions by virtually searching for local transformations of tactile measurements that improve the quality of the grasp. First, we construct a tactile-based grasp quality metric using a deep convolutional neural network trained on over 2800 grasps. The quality of each grasp, a continuous value between 0 and 1, is determined experimentally by measuring its resistance to external perturbations. Second, we simulate the tactile imprints associated with robot motions relative to the initial grasp by performing rigid-body transformations of the given tactile measurements. The newly generated tactile imprints are evaluated with the learned grasp quality network and the regrasp action is chosen to maximize the grasp quality.
Results show that the grasp quality network can predict the outcome of grasps with an average accuracy of 85% on known objects and 75% on a cross validation set of 12 objects. The regrasp control policy improves the success rate of grasp actions by an average relative increase of 70% on a test set of 8 objects.
△ Less
Submitted 9 October, 2018; v1 submitted 5 March, 2018;
originally announced March 2018.
-
Reactive Planar Manipulation with Convex Hybrid MPC
Authors:
Francois Robert Hogan,
Eudald Romo Grau,
Alberto Rodriguez
Abstract:
This paper presents a reactive controller for planar manipulation tasks that leverages machine learning to achieve real-time performance. The approach is based on a Model Predictive Control (MPC) formulation, where the goal is to find an optimal sequence of robot motions to achieve a desired object motion. Due to the multiple contact modes associated with frictional interactions, the resulting opt…
▽ More
This paper presents a reactive controller for planar manipulation tasks that leverages machine learning to achieve real-time performance. The approach is based on a Model Predictive Control (MPC) formulation, where the goal is to find an optimal sequence of robot motions to achieve a desired object motion. Due to the multiple contact modes associated with frictional interactions, the resulting optimization program suffers from combinatorial complexity when tasked with determining the optimal sequence of modes.
To overcome this difficulty, we formulate the search for the optimal mode sequences offline, separately from the search for optimal control inputs online. Using tools from machine learning, this leads to a convex hybrid MPC program that can be solved in real-time. We validate our algorithm on a planar manipulation experimental setup where results show that the convex hybrid MPC formulation with learned modes achieves good closed-loop performance on a trajectory tracking problem.
△ Less
Submitted 4 September, 2018; v1 submitted 16 October, 2017;
originally announced October 2017.
-
Robotic Pick-and-Place of Novel Objects in Clutter with Multi-Affordance Gras** and Cross-Domain Image Matching
Authors:
Andy Zeng,
Shuran Song,
Kuan-Ting Yu,
Elliott Donlon,
Francois R. Hogan,
Maria Bauza,
Daolin Ma,
Orion Taylor,
Melody Liu,
Eudald Romo,
Nima Fazeli,
Ferran Alet,
Nikhil Chavan Dafle,
Rachel Holladay,
Isabella Morona,
Prem Qu Nair,
Druck Green,
Ian Taylor,
Weber Liu,
Thomas Funkhouser,
Alberto Rodriguez
Abstract:
This paper presents a robotic pick-and-place system that is capable of gras** and recognizing both known and novel objects in cluttered environments. The key new feature of the system is that it handles a wide range of object categories without needing any task-specific training data for novel objects. To achieve this, it first uses a category-agnostic affordance prediction algorithm to select a…
▽ More
This paper presents a robotic pick-and-place system that is capable of gras** and recognizing both known and novel objects in cluttered environments. The key new feature of the system is that it handles a wide range of object categories without needing any task-specific training data for novel objects. To achieve this, it first uses a category-agnostic affordance prediction algorithm to select and execute among four different gras** primitive behaviors. It then recognizes picked objects with a cross-domain image classification framework that matches observed images to product images. Since product images are readily available for a wide range of objects (e.g., from the web), the system works out-of-the-box for novel objects without requiring any additional training data. Exhaustive experimental results demonstrate that our multi-affordance gras** achieves high success rates for a wide variety of objects in clutter, and our recognition algorithm achieves high accuracy for both known and novel grasped objects. The approach was part of the MIT-Princeton Team system that took 1st place in the stowing task at the 2017 Amazon Robotics Challenge. All code, datasets, and pre-trained models are available online at http://arc.cs.princeton.edu
△ Less
Submitted 30 May, 2020; v1 submitted 3 October, 2017;
originally announced October 2017.
-
Feedback Control of the Pusher-Slider System: A Story of Hybrid and Underactuated Contact Dynamics
Authors:
Francois Robert Hogan,
Alberto Rodriguez
Abstract:
This paper investigates real-time control strategies for dynamical systems that involve frictional contact interactions. Hybridness and underactuation are key characteristics of these systems that complicate the design of feedback controllers. In this research, we examine and test a novel feedback controller design on a planar pushing system, where the purpose is to control the motion of a sliding…
▽ More
This paper investigates real-time control strategies for dynamical systems that involve frictional contact interactions. Hybridness and underactuation are key characteristics of these systems that complicate the design of feedback controllers. In this research, we examine and test a novel feedback controller design on a planar pushing system, where the purpose is to control the motion of a sliding object on a flat surface using a point robotic pusher. The pusher-slider is a simple dynamical system that retains many of the challenges that are typical of robotic manipulation tasks.
Our results show that a model predictive control approach used in tandem with integer programming offers a powerful solution to capture the dynamic constraints associated with the friction cone as well as the hybrid nature of the contact. In order to achieve real-time control, simplifications are proposed to speed up the integer program. The concept of Family of Modes (FOM) is introduced to solve an online convex optimization problem by selecting a set of contact mode schedules that spans a large set of dynamic behaviors that can occur during the prediction horizon. The controller design is applied to stabilize the motion of a sliding object about a nominal trajectory, and to re-plan its trajectory in real-time to follow a moving target. We validate the controller design through numerical simulations and experimental results on an industrial ABB IRB 120 robotic arm.
△ Less
Submitted 24 November, 2016;
originally announced November 2016.
-
GAz: A Genetic Algorithm for Photometric Redshift Estimation
Authors:
Robert Hogan,
Malcolm Fairbairn,
Navin Seeburn
Abstract:
We present a new approach to the problem of estimating the redshift of galaxies from photometric data. The approach uses a genetic algorithm combined with non-linear regression to model the 2SLAQ LRG data set with SDSS DR7 photometry. The genetic algorithm explores the very large space of high order polynomials while only requiring optimisation of a small number of terms. We find a…
▽ More
We present a new approach to the problem of estimating the redshift of galaxies from photometric data. The approach uses a genetic algorithm combined with non-linear regression to model the 2SLAQ LRG data set with SDSS DR7 photometry. The genetic algorithm explores the very large space of high order polynomials while only requiring optimisation of a small number of terms. We find a $σ_{\text{rms}}=0.0408\pm 0.0006$ for redshifts in the range $0.4<z< 0.7$. These results are competitive with the current state-of-the-art but can be presented simply as a polynomial which does not require the user to run any code. We demonstrate that the method generalises well to other data sets and redshift ranges by testing it on SDSS DR11 and on simulated data. For other datasets or applications the code has been made available at https://github.com/rbrthogan/GAz.
△ Less
Submitted 16 March, 2015; v1 submitted 17 December, 2014;
originally announced December 2014.
-
Cause-of-death estimates for the early and late neonatal periods for 194 countries from 2000-2013
Authors:
Shefali Oza,
Joy E Lawn,
Daniel R Hogan,
Colin Mathers,
Simon Cousens
Abstract:
Objective: Cause-of-death distributions are important for prioritising interventions. We estimated proportions, risks, and numbers of deaths (with uncertainty) for programme-relevant causes of neonatal death for 194 countries for 2000-2013, differentiating between the early (days 0-6) and late (days 7-27) neonatal periods.
Methods: For 65 high-quality VR countries, we used the observed early and…
▽ More
Objective: Cause-of-death distributions are important for prioritising interventions. We estimated proportions, risks, and numbers of deaths (with uncertainty) for programme-relevant causes of neonatal death for 194 countries for 2000-2013, differentiating between the early (days 0-6) and late (days 7-27) neonatal periods.
Methods: For 65 high-quality VR countries, we used the observed early and late neonatal proportional cause distributions. For the remaining 129 countries, we used multinomial logistic models to estimate the early and late proportional cause distributions. We used separate models, with different inputs, for low and high neonatal mortality countries. We applied these cause-specific proportions to neonatal death estimates from the United Nations by country/year to estimate cause-specific risks and numbers of deaths.
Findings: Of the 2.76 million neonatal deaths in 2013, 0.99 (uncertainty: 0.70-1.31) million (35.7%) were estimated to be from preterm complications, 0.64 (uncertainty: 0.46-0.84) million (23.4%) from intrapartum-related complications, and 0.43 (0.22-0.66) million (15.6%) from sepsis. Preterm (40.8%) and intrapartum-related (27.0%) complications accounted for the majority of early neonatal deaths while infections caused nearly half of late neonatal deaths. In every region, preterm was the leading cause of neonatal death, with the highest risks in Southern Asia (11.9 per 1000 livebirths) and Sub-Saharan Africa (9.5).
Conclusion: The neonatal cause-of-death distribution differs between the early and late periods, and varies with NMR level and over time. To reduce neonatal deaths, this knowledge must be incorporated into policy decisions. The Every Newborn Action Plan provides stimulus for countries to update national strategies and include high-impact interventions to address these causes.
△ Less
Submitted 14 November, 2014;
originally announced November 2014.
-
Unifying inflation and dark matter with the Peccei-Quinn field: observable axions and observable tensors
Authors:
Malcolm Fairbairn,
Robert Hogan,
David J. E. Marsh
Abstract:
A model of high scale inflation is presented where the radial part of the Peccei-Quinn (PQ) field with a non-minimal coupling to gravity plays the role of the inflaton, and the QCD axion is the dark matter. A quantum fluctuation of $\mathcal{O}(H/2π)$ in the axion field will result in a smaller angular fluctuation if the PQ field is sitting at a larger radius during inflation than in the vacuum. T…
▽ More
A model of high scale inflation is presented where the radial part of the Peccei-Quinn (PQ) field with a non-minimal coupling to gravity plays the role of the inflaton, and the QCD axion is the dark matter. A quantum fluctuation of $\mathcal{O}(H/2π)$ in the axion field will result in a smaller angular fluctuation if the PQ field is sitting at a larger radius during inflation than in the vacuum. This changes the effective axion decay constant, $f_a$, during inflation and dramatically reduces the production of isocurvature modes. This mechanism opens up a new window in parameter space where an axion decay constant in the range $10^{12}\text{ GeV}\lesssim f_a\lesssim 10^{15}\text{ GeV}$ is compatible with observably large $r$. The exact range allowed for $f_a$ depends on the efficiency of reheating. This model also predicts a minimum possible value of $r=10^{-3}$. The new window can be explored by a measurement of $r$ possible with \textsc{Spider} and the proposed CASPEr experiment search for high $f_a$ axions.
△ Less
Submitted 7 October, 2014;
originally announced October 2014.
-
The Problem with False Vacuum Higgs Inflation
Authors:
Malcolm Fairbairn,
Philipp Grothaus,
Robert Hogan
Abstract:
We investigate the possibility of using the only known fundamental scalar, the Higgs, as an inflaton with minimal coupling to gravity. The peculiar appearance of a plateau or a false vacuum in the renormalised effective scalar potential suggests that the Higgs might drive inflation. For the case of a false vacuum we use an additional singlet scalar field, motivated by the strong CP problem, and it…
▽ More
We investigate the possibility of using the only known fundamental scalar, the Higgs, as an inflaton with minimal coupling to gravity. The peculiar appearance of a plateau or a false vacuum in the renormalised effective scalar potential suggests that the Higgs might drive inflation. For the case of a false vacuum we use an additional singlet scalar field, motivated by the strong CP problem, and its coupling to the Higgs to lift the barrier allowing for a graceful exit from inflation by mimicking hybrid inflation. We find that this scenario is incompatible with current measurements of the Higgs mass and the QCD coupling constant and conclude that the Higgs can only be the inflaton in more complicated scenarios.
△ Less
Submitted 28 March, 2014;
originally announced March 2014.
-
Electroweak Vacuum Stability in light of BICEP2
Authors:
Malcolm Fairbairn,
Robert Hogan
Abstract:
We consider the effect of a period of inflation with a high energy density upon the stability of the Higgs potential in the early universe. The recent measurement of a large tensor-to-scalar ratio, $r_T \sim 0.16$, by the BICEP-2 experiment possibly implies that the energy density during inflation was very high, comparable with the GUT scale. Given that the standard model Higgs potential is known…
▽ More
We consider the effect of a period of inflation with a high energy density upon the stability of the Higgs potential in the early universe. The recent measurement of a large tensor-to-scalar ratio, $r_T \sim 0.16$, by the BICEP-2 experiment possibly implies that the energy density during inflation was very high, comparable with the GUT scale. Given that the standard model Higgs potential is known to develop an instability at $Λ\sim 10^{10}$ GeV this means that the resulting large quantum fluctuations of the Higgs field could destabilize the vacuum during inflation, even if the Higgs field starts at zero expectation value. We estimate the probability of such a catastrophic destabilisation given such an inflationary scenario and calculate that for a Higgs mass of $m_h=125.5$ GeV that the top mass must be less than $m_t\sim 172$ GeV. We present two possible cures: a direct coupling between the Higgs and the inflaton and a non-zero temperature from dissipation during inflation.
△ Less
Submitted 30 April, 2014; v1 submitted 26 March, 2014;
originally announced March 2014.
-
Singlet Fermionic Dark Matter and the Electroweak Phase Transition
Authors:
Malcolm Fairbairn,
Robert Hogan
Abstract:
We consider a model with a gauge singlet Dirac fermion as a cold dark matter candidate. The dark matter particle communicates with the Standard Model via a gauge singlet scalar mediator that couples to the Higgs. The scalar mediator also serves to create a tree-level barrier in the scalar potential which leads to a strongly first order electroweak phase transition as required for Electroweak Baryo…
▽ More
We consider a model with a gauge singlet Dirac fermion as a cold dark matter candidate. The dark matter particle communicates with the Standard Model via a gauge singlet scalar mediator that couples to the Higgs. The scalar mediator also serves to create a tree-level barrier in the scalar potential which leads to a strongly first order electroweak phase transition as required for Electroweak Baryogenesis. We find a large number of models that can account for all the dark matter and provide a strong phase transition while avoiding constraints from dark matter direct detection, electroweak precision data, and the latest Higgs data from the LHC. The next generation of direct detection experiments could rule out a large region of the parameter space but can be evaded in some regions when the Higgs-singlet mixing is very small.
△ Less
Submitted 17 September, 2013; v1 submitted 15 May, 2013;
originally announced May 2013.
-
Statistical Tests of Chondrule Sorting
Authors:
S. A. Teitler,
J. M. Paque,
J. N. Cuzzi,
R. C. Hogan
Abstract:
The variation in sizes of chondrules from one chondrite to the next is thought to be due to some sorting process in the early solar nebula. Hypotheses for the sorting process include chondrule sorting by mass and sorting by some aerodynamic mechanism; one such aerodynamic mechanism is the process of turbulent concentration (TC). We present the results of a series of statistical tests of chondrule…
▽ More
The variation in sizes of chondrules from one chondrite to the next is thought to be due to some sorting process in the early solar nebula. Hypotheses for the sorting process include chondrule sorting by mass and sorting by some aerodynamic mechanism; one such aerodynamic mechanism is the process of turbulent concentration (TC). We present the results of a series of statistical tests of chondrule data from several different chondrites. The data do not clearly distinguish between various options for the sorting parameter, but we find that the data are inconsistent with being drawn from lognormal or (three-parameter) Weibull distributions in chondrule radius. We also find that all but one of the chondrule data sets tested are consistent with being drawn from the TC distribution.
△ Less
Submitted 18 October, 2011;
originally announced October 2011.
-
Towards Initial Mass Functions for Asteroids and Kuiper Belt Objects
Authors:
Jeffrey N. Cuzzi,
Robert C. Hogan,
William F. Bottke
Abstract:
Our goal is to understand primary accretion of the first planetesimals. The primitive meteorite record suggests that sizeable planetesimals formed in the asteroid belt over a period longer than a million years, each composed entirely of an unusual, but homogeneous, mixture of mm-size particles. We sketch a scenario in which primary accretion of 10-100km size planetesimals proceeds directly, if spo…
▽ More
Our goal is to understand primary accretion of the first planetesimals. The primitive meteorite record suggests that sizeable planetesimals formed in the asteroid belt over a period longer than a million years, each composed entirely of an unusual, but homogeneous, mixture of mm-size particles. We sketch a scenario in which primary accretion of 10-100km size planetesimals proceeds directly, if sporadically, from aerodynamically-sorted mm-size particles (generically "chondrules"). These planetesimal sizes are in general agreement with the currently observed asteroid mass peak near 100km diameter, which has been identified as a "fossil" property of the pre-erosion, pre-depletion population. We extend our primary accretion theory to make predictions for outer solar system planetesimals, which may also have a preferred size in the 100km diameter range. We estimate formation rates of planetesimals and assess the conditions needed to match estimates of both asteroid and Kuiper Belt Object (KBO) formation rates. For nebula parameters that satisfy observed mass accretion rates of Myr-old protoplanetary nebulae, the scenario is roughly consistent with not only the "fossil" sizes of the asteroids, and their estimated production rates, but also with the observed spread in formation ages of chondrules in a given chondrite, and with a tolerably small radial diffusive mixing during this time between formation and accretion (the model naturally helps explain the peculiar size distribution of chondrules within such objects). The scenario also produces 10-100km diameter primary KBOs. The optimum range of parameters, however, represents a higher gas density and fractional abundance of solids, and a smaller difference between keplerian and pressure-supported orbital velocities, than "canonical" models of the solar nebula. We discuss several potential explanations for these differences.
△ Less
Submitted 1 April, 2010;
originally announced April 2010.
-
Doppler lidar measurements of oriented planar ice crystals falling from supercooled and glaciated layer clouds
Authors:
C. D. Westbrook,
A. J. Illingworth,
E. J. O'Connor,
R. J. Hogan
Abstract:
The properties of planar ice crystals settling horizontally have been investigated using a vertically-pointing Doppler lidar. Strong specular reflections were observed from their oriented basal facets, identified by comparison with a second lidar pointing 4deg from zenith. Analysis of 17 months of continuous high-resolution observations reveal that these pristine crystals are frequently observed…
▽ More
The properties of planar ice crystals settling horizontally have been investigated using a vertically-pointing Doppler lidar. Strong specular reflections were observed from their oriented basal facets, identified by comparison with a second lidar pointing 4deg from zenith. Analysis of 17 months of continuous high-resolution observations reveal that these pristine crystals are frequently observed in ice falling from mid-level mixed-phase layer clouds (85% of the time for layers at -15C). Detailed analysis of a case study indicates that the crystals are nucleated and grow rapidly within the supercooled layer, then fall out, forming well-defined layers of specular reflection. From the lidar alone the fraction of oriented crystals cannot be quantified, but polarimetric radar measurements confirmed that a substantial fraction of the crystal population was well oriented. As the crystals fall into subsaturated air, specular reflection is observed to switch off as the crystal faces become rounded and lose their faceted structure. Specular reflection in ice falling from supercooled layers colder than -22C was also observed, but was much less pronounced than at warmer temperatures: we suggest that in cold clouds it is the small droplets in the distribution that freeze into plates and produce specular reflection, whilst larger droplets freeze into complex polycrystals. The lidar Doppler measurements show that typical fall speeds for the oriented crystals are 0.3m/s, with a weak temperature correlation; the corresponding Reynolds number is Re~10, in agreement with light-pillar measurements. Coincident Doppler radar observations show no correlation between the specular enhancement and eddy dissipation rate, indicating that turbulence does not control crystal orientation in these clouds.
△ Less
Submitted 20 August, 2009; v1 submitted 3 June, 2009;
originally announced June 2009.
-
Towards planetesimals: dense chondrule clumps in the protoplanetary nebula
Authors:
Jeffrey N. Cuzzi,
Robert C. Hogan,
Karim Shariff
Abstract:
We outline a scenario which traces a direct path from freely-floating nebula particles to the first 10-100km-sized bodies in the terrestrial planet region, producing planetesimals which have properties matching those of primitive meteorite parent bodies. We call this "primary accretion". The scenario draws on elements of previous work, and introduces a new critical threshold for planetesimal for…
▽ More
We outline a scenario which traces a direct path from freely-floating nebula particles to the first 10-100km-sized bodies in the terrestrial planet region, producing planetesimals which have properties matching those of primitive meteorite parent bodies. We call this "primary accretion". The scenario draws on elements of previous work, and introduces a new critical threshold for planetesimal formation. We presume the nebula to be weakly turbulent, which leads to dense concentrations of aerodynamically size-sorted particles having properties like those observed in chondrites. The fractional volume of the nebula occupied by these dense zones or clumps obeys a probability distribution as a function of their density, and the densest concentrations have particle mass density 100 times that of the gas. However, even these densest clumps are prevented by gas pressure from undergoing gravitational instability in the traditional sense (on a dynamical timescale). While in this state of arrested development, they are susceptible to disruption by the ram pressure of the differentially orbiting nebula gas. However, self-gravity can preserve sufficiently large and dense clumps from ram pressure disruption, allowing their entrained particles to sediment gently but inexorably towards their centers, producing 10-100 km "sandpile" planetesimals. Localized radial pressure fluctuations in the nebula, and interactions between differentially moving dense clumps, will also play a role that must be allowed for in future studies. The scenario is readily extended from meteorite parent bodies to primary accretion throughout the solar system.
△ Less
Submitted 21 April, 2008;
originally announced April 2008.
-
A Cascade Model for Particle Concentration and Enstrophy in Fully Developed Turbulence with Mass Loading Feedback
Authors:
Robert C. Hogan,
Jeffrey N. Cuzzi
Abstract:
A cascade model is described based on multiplier distributions determined from 3D direct numerical simulations (DNS) of turbulent particle laden flows, which include two-way coupling between the phases at global mass loadings equal to unity. The governing Eulerian equations are solved using pseudo-spectral methods on up to 512**3 computional grid points. DNS results for particle concentration an…
▽ More
A cascade model is described based on multiplier distributions determined from 3D direct numerical simulations (DNS) of turbulent particle laden flows, which include two-way coupling between the phases at global mass loadings equal to unity. The governing Eulerian equations are solved using pseudo-spectral methods on up to 512**3 computional grid points. DNS results for particle concentration and enstrophy at Taylor microscale Reynolds numbers in the range 34 - 170 were used to directly determine multiplier distributions (PDFs) on spatial scales 3 times the Kolmogorov length scale. The width of the PDFs, which is a measure of intermittency, decreases with increasing mass loading within the local region where the multipliers are measured. The functional form of this dependence is not sensitive to Reynolds numbers in the range considered. A partition correlation probability is included in the cascade model to account for the observed spatial anticorrelation between particle concentration and enstrophy. Joint probability distribution functions of concentration and enstrophy generated using the cascade model are shown to be in excellent agreement with those derived directly from our 3D simulations. Probabilities predicted by the cascade model are presented at Reynolds numbers well beyond what is achievable by direct simulation. These results clearly indicate that particle mass loading significantly reduces the probabilities of high particle concentration and enstrophy relative to those resulting from unloaded runs. Particle mass density appears to reach a limit at around 100 times the gas density. This approach has promise for significant computational savings in certain applications.
△ Less
Submitted 13 April, 2007; v1 submitted 13 April, 2007;
originally announced April 2007.
-
The capacitance of pristine ice crystals and aggregate snowflakes
Authors:
C. D. Westbrook,
R. J. Hogan,
A. J. Illingworth
Abstract:
A new method of accurately calculating the capacitance of realistic ice particles is described: such values are key to accurate estimates of deposition and evaporation rates in NWP models. The trajectories of diffusing water molecules are directly sampled, using random `walkers'. By counting how many of these trajectories intersect the surface of the ice particle (which may be any shape) and how…
▽ More
A new method of accurately calculating the capacitance of realistic ice particles is described: such values are key to accurate estimates of deposition and evaporation rates in NWP models. The trajectories of diffusing water molecules are directly sampled, using random `walkers'. By counting how many of these trajectories intersect the surface of the ice particle (which may be any shape) and how many escape outside a spherical boundary far from the particle, the capacitance of a number of model ice particle habits have been estimated, including hexagonal columns and plates, `scalene' columns and plates, bullets, bullet-rosettes, dendrites, and realistic aggregate snowflakes. For ice particles with sharp edges and corners this method is an efficient and straightforward way of solving Laplace's equation for the capacitance. Provided that a large enough number of random walkers are used to sample the particle geometry the authors expect the calculated capacitances to be accurate to within ~1%. The capacitance for our modelled aggregate snowflakes (C/Dmax=0.25, normalised by the maximum dimension Dmax) is shown to be in close agreement with recent aircraft measurements of snowflake sublimation rates. This result shows that the capacitance of a sphere (C/Dmax=0.5) which is commonly used in numerical models, overestimates the evaporation rate by a factor of 2. The effect of vapor `screening' by crystals growing in the vicinity of one another has also been investigated. The results clearly show that neighbouring crystals growing on a filament in cloud chamber experiments can strongly constrict the vapor supply to each other, and the resulting growth rate measurements may severely underestimate the rate for a single crystal in isolation (by a factor of 3 in our model setup).
△ Less
Submitted 17 July, 2007; v1 submitted 6 October, 2006;
originally announced October 2006.
-
Theory and observations of ice particle evolution in cirrus using Doppler radar: evidence for aggregation
Authors:
C. D. Westbrook,
R. J. Hogan,
A. J. Illingworth,
E. J. O'Connor
Abstract:
Vertically pointing Doppler radar has been used to study the evolution of ice particles as they sediment through a cirrus cloud. The measured Doppler fall speeds, together with radar-derived estimates for the altitude of cloud top, are used to estimate a characteristic fall time tc for the `average' ice particle. The change in radar reflectivity Z is studied as a function of tc, and is found to…
▽ More
Vertically pointing Doppler radar has been used to study the evolution of ice particles as they sediment through a cirrus cloud. The measured Doppler fall speeds, together with radar-derived estimates for the altitude of cloud top, are used to estimate a characteristic fall time tc for the `average' ice particle. The change in radar reflectivity Z is studied as a function of tc, and is found to increase exponentially with fall time. We use the idea of dynamically scaling particle size distributions to show that this behaviour implies exponential growth of the average particle size, and argue that this exponential growth is a signature of ice crystal aggregation.
△ Less
Submitted 7 December, 2006; v1 submitted 14 August, 2006;
originally announced August 2006.
-
Size-selective concentration of chondrules and other small particles in protoplanetary nebula turbulence
Authors:
Jeffrey N. Cuzzi,
Robert C. Hogan,
Julie M. Paque,
Anthony R. Dobrovolskis
Abstract:
Size-selective concentration of particles in a weakly turbulent protoplanetary nebula may be responsible for the initial collection of chondrules and other constituents into primitive body precursors. This paper presents the main elements of this process of turbulent concentration. In the terrestrial planet region, both the characteristic size and size distribution of chondrules are explained. "…
▽ More
Size-selective concentration of particles in a weakly turbulent protoplanetary nebula may be responsible for the initial collection of chondrules and other constituents into primitive body precursors. This paper presents the main elements of this process of turbulent concentration. In the terrestrial planet region, both the characteristic size and size distribution of chondrules are explained. "Fluffier" particles would be concentrated in nebula regions which were at a lower gas density and/or more intensely turbulent. The spatial distribution of concentrated particle density obeys multifractal scaling}, suggesting a close tie to the turbulent cascade process. This scaling behavior allows predictions of the probability distributions for concentration in the protoplanetary nebula to be made. Large concentration factors (>10^5) are readily obtained, implying that numerous zones of particle density significantly exceeding the gas density could exist. If most of the available solids were actually in chondrule sized particles, the ensuing particle mass density would become so large that the feedback effects on gas turbulence due to mass loading could no longer be neglected. This paper describes the process, presenting its basic elements and some implications, without including the effects of mass loading.
△ Less
Submitted 13 September, 2000;
originally announced September 2000.