-
Highly Accurate Disease Diagnosis and Highly Reproducible Biomarker Identification with PathFormer
Authors:
Zehao Dong,
Qihang Zhao,
Philip R. O. Payne,
Michael A Province,
Carlos Cruchaga,
Muhan Zhang,
Tianyu Zhao,
Yixin Chen,
Fuhai Li
Abstract:
Biomarker identification is critical for precise disease diagnosis and for understanding disease pathogenesis in omics data analysis, for example via fold change and regression analysis. Graph neural networks (GNNs) have been the dominant deep learning model for analyzing graph-structured data. However, we found two major limitations of existing GNNs in omics data analysis, i.e., limited prediction (diagnosis) accuracy and limited reproducibility of biomarker identification across multiple datasets. The root of the challenges is the unique graph structure of biological signaling pathways, which consists of a large number of targets and intensive and complex signaling interactions among these targets. To resolve these two challenges, in this study, we present a novel GNN model architecture, named PathFormer, which systematically integrates signaling networks, prior knowledge, and omics data to rank biomarkers and predict disease diagnosis. In the comparison results, PathFormer significantly outperformed existing GNN models in terms of prediction accuracy (30% accuracy improvement in disease diagnosis compared with existing GNN models) and reproducibility of biomarker ranking across different datasets. The improvement was confirmed using two independent Alzheimer's Disease (AD) and cancer transcriptomic datasets. The PathFormer model can be directly applied to other omics data analysis studies.
Submitted 11 February, 2024;
originally announced February 2024.
-
Rethinking the Power of Graph Canonization in Graph Representation Learning with Stability
Authors:
Zehao Dong,
Muhan Zhang,
Philip R. O. Payne,
Michael A Province,
Carlos Cruchaga,
Tianyu Zhao,
Fuhai Li,
Yixin Chen
Abstract:
The expressivity of Graph Neural Networks (GNNs) has been studied broadly in recent years to reveal the design principles for more powerful GNNs. Graph canonization is known as a typical approach to distinguish non-isomorphic graphs, yet it is rarely adopted when developing expressive GNNs. This paper proposes to maximize the expressivity of GNNs by graph canonization; the power of such GNNs is then studied from the perspective of model stability. A stable GNN will map similar graphs to close graph representations in the vectorial space, and the stability of GNNs is critical to generalize their performance to unseen graphs. We theoretically reveal the trade-off of expressivity and stability in graph-canonization-enhanced GNNs. Then we introduce a notion of universal graph canonization as the general solution to address the trade-off and characterize a widely applicable sufficient condition to solve the universal graph canonization. A comprehensive set of experiments demonstrates the effectiveness of the proposed method. In many popular graph benchmark datasets, graph canonization successfully enhances GNNs and provides highly competitive performance, indicating the capability and great potential of the proposed method in general graph representation learning. In graph datasets where the sufficient condition holds, GNNs enhanced by universal graph canonization consistently outperform GNN baselines and successfully improve the SOTA performance by up to $31\%$, providing the optimal solution to numerous challenging real-world graph analytical tasks like gene network representation learning in bioinformatics.
Submitted 9 February, 2024; v1 submitted 1 September, 2023;
originally announced September 2023.
-
Comparison principles for nonlinear potential theories and PDEs with fiberegularity and sufficient monotonicity
Authors:
Marco Cirant,
Kevin R. Payne,
Davide F. Redaelli
Abstract:
We present some recent advances in the productive and symbiotic interplay between general potential theories (subharmonic functions associated to closed subsets $\mathcal{F} \subset \mathcal{J}^2(X)$ of the 2-jets on $X \subset \mathbb{R}^n$ open) and subsolutions of degenerate elliptic and parabolic PDEs of the form $F(x,u,Du,D^2u) = 0$. We will implement the monotonicity-duality method begun by Harvey and Lawson in 2009 (in the pure second order constant coefficient case) for proving comparison principles for potential theories where $\mathcal{F}$ has sufficient monotonicity and fiberegularity (in variable coefficient settings) and which carry over to all differential operators $F$ which are compatible with $\mathcal{F}$ in a precise sense for which the correspondence principle holds. We will consider both elliptic and parabolic versions of the comparison principle in which the effect of boundary data is seen on the entire boundary or merely on a proper subset of the boundary. Particular attention will be given to gradient dependent examples with the requisite sufficient monotonicity of proper ellipticity and directionality in the gradient. Example operators we will discuss include the degenerate elliptic operators of optimal transport in which the target density is strictly increasing in some directions as well as operators which are weakly parabolic in the sense of Krylov. Further examples, modeled on hyperbolic polynomials in the sense of Gårding, give a rich class of examples with directionality in the gradient. Moreover, we present a model example in which the comparison principle holds, but standard viscosity structural conditions fail to hold.
Submitted 20 May, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
A primer on quasi-convex functions in nonlinear potential theories
Authors:
Kevin R. Payne,
Davide F. Redaelli
Abstract:
We present a self-contained treatment of the fundamental role that quasi-convex functions play in general (nonlinear second order) potential theories, which concern the study of generalized subharmonics associated to a suitable closed subset (subequation) of the space of 2-jets. Quasi-convex functions build a bridge between classical and viscosity notions of solutions of the natural Dirichlet problem in any potential theory. Moreover, following a program initiated by Harvey and Lawson in [arXiv:0710.3991], a potential-theoretic approach is widely being applied for treating nonlinear partial differential equations (PDEs). This viewpoint revisits the conventional viscosity approach to nonlinear PDEs [arXiv:math/9207212] under a more geometric perspective inspired by Krylov (1995) and takes much insight from classical pluripotential theory. The possibility of a symbiotic and productive relationship between general potential theories and nonlinear PDEs relies heavily on the class of quasi-convex functions, which are themselves the generalized subharmonics of a pure second order constant coefficient potential theory.
Submitted 26 February, 2024; v1 submitted 25 March, 2023;
originally announced March 2023.
-
Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19
Authors:
Davide Pigoli,
Kieran Baker,
Jobie Budd,
Lorraine Butler,
Harry Coppock,
Sabrina Egglestone,
Steven G. Gilmour,
Chris Holmes,
David Hurley,
Radka Jersakova,
Ivan Kiskin,
Vasiliki Koutra,
Jonathon Mellor,
George Nicholson,
Joe Packham,
Selina Patel,
Richard Payne,
Stephen J. Roberts,
Björn W. Schuller,
Ana Tendero-Cañadas,
Tracey Thornley,
Alexander Titcomb
Abstract:
Since early in the coronavirus disease 2019 (COVID-19) pandemic, there has been interest in using artificial intelligence methods to predict COVID-19 infection status based on vocal audio signals, for example cough recordings. However, existing studies have limitations in terms of data collection and of the assessment of the performances of the proposed predictive models. This paper rigorously assesses state-of-the-art machine learning techniques used to predict COVID-19 infection status based on vocal audio signals, using a dataset collected by the UK Health Security Agency. This dataset includes acoustic recordings and extensive study participant meta-data. We provide guidelines on testing the performance of methods to classify COVID-19 infection status based on acoustic features and we discuss how these can be extended more generally to the development and assessment of predictive methods based on public health datasets.
Submitted 27 February, 2023; v1 submitted 15 December, 2022;
originally announced December 2022.
-
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers
Authors:
Harry Coppock,
George Nicholson,
Ivan Kiskin,
Vasiliki Koutra,
Kieran Baker,
Jobie Budd,
Richard Payne,
Emma Karoune,
David Hurley,
Alexander Titcomb,
Sabrina Egglestone,
Ana Tendero Cañadas,
Lorraine Butler,
Radka Jersakova,
Jonathon Mellor,
Selina Patel,
Tracey Thornley,
Peter Diggle,
Sylvia Richardson,
Josef Packham,
Björn W. Schuller,
Davide Pigoli,
Steven Gilmour,
Stephen Roberts,
Chris Holmes
Abstract:
Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection status. Here, we undertake a large-scale study of audio-based deep learning classifiers, as part of the UK government's pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata, including reverse transcription polymerase chain reaction (PCR) test outcomes, of whom 23,514 tested positive for SARS-CoV-2. Subjects were recruited via the UK government's National Health Service Test-and-Trace programme and the REal-time Assessment of Community Transmission (REACT) randomised surveillance survey. In an unadjusted analysis of our dataset, AI classifiers predict SARS-CoV-2 infection status with high accuracy (Receiver Operating Characteristic Area Under the Curve (ROC-AUC) 0.846 [0.838, 0.854]), consistent with the findings of previous studies. However, after matching on measured confounders, such as age, gender, and self-reported symptoms, our classifiers' performance is much weaker (ROC-AUC 0.619 [0.594, 0.644]). Upon quantifying the utility of audio-based classifiers in practical settings, we find them to be outperformed by simple predictive scores based on user-reported symptoms.
Submitted 2 March, 2023; v1 submitted 15 December, 2022;
originally announced December 2022.
-
A large-scale and PCR-referenced vocal audio dataset for COVID-19
Authors:
Jobie Budd,
Kieran Baker,
Emma Karoune,
Harry Coppock,
Selina Patel,
Ana Tendero Cañadas,
Alexander Titcomb,
Richard Payne,
David Hurley,
Sabrina Egglestone,
Lorraine Butler,
Jonathon Mellor,
George Nicholson,
Ivan Kiskin,
Vasiliki Koutra,
Radka Jersakova,
Rachel A. McKendry,
Peter Diggle,
Sylvia Richardson,
Björn W. Schuller,
Steven Gilmour,
Davide Pigoli,
Stephen Roberts,
Josef Packham,
Tracey Thornley
, et al. (1 additional authors not shown)
Abstract:
The UK COVID-19 Vocal Audio Dataset is designed for the training and evaluation of machine learning models that classify SARS-CoV-2 infection status or associated respiratory symptoms using vocal audio. The UK Health Security Agency recruited voluntary participants through the national Test and Trace programme and the REACT-1 survey in England from March 2021 to March 2022, during dominant transmission of the Alpha and Delta SARS-CoV-2 variants and some Omicron variant sublineages. Audio recordings of volitional coughs, exhalations, and speech were collected in the 'Speak up to help beat coronavirus' digital survey alongside demographic, self-reported symptom and respiratory condition data, and linked to SARS-CoV-2 test results. The UK COVID-19 Vocal Audio Dataset represents the largest collection of SARS-CoV-2 PCR-referenced audio recordings to date. PCR results were linked to 70,794 of 72,999 participants and 24,155 of 25,776 positive cases. Respiratory symptoms were reported by 45.62% of participants. This dataset has additional potential uses for bioacoustics research, with 11.30% of participants reporting asthma, and 27.20% with linked influenza PCR test results.
Submitted 3 November, 2023; v1 submitted 15 December, 2022;
originally announced December 2022.
-
Interpreting the Mechanism of Synergism for Drug Combinations Using Attention-Based Hierarchical Graph Pooling
Authors:
Zehao Dong,
Heming Zhang,
Yixin Chen,
Philip R. O. Payne,
Fuhai Li
Abstract:
Synergistic drug combinations offer great potential to enhance therapeutic efficacy and to reduce adverse reactions. However, effective and synergistic drug combination prediction remains an open question because of the unknown causal disease signaling pathways. Though various deep learning (AI) models have been proposed to quantitatively predict the synergism of drug combinations, the major limitation of existing deep learning methods is that they are inherently not interpretable, which makes the conclusions of AI models opaque to human experts, thereby limiting the robustness of the model conclusions and the applicability of these models in real-world human-AI healthcare. In this paper, we develop an interpretable graph neural network (GNN) that reveals the underlying essential therapeutic targets and the mechanism of the synergy (MoS) by mining the sub-molecular network of great importance. The key point of the interpretable GNN prediction model is a novel graph pooling layer, a self-attention-based node and edge pool (henceforth SANEpool), that can compute the attention score (importance) of genes and connections based on the genomic features and topology. As such, the proposed GNN model provides a systematic way to predict and interpret the drug combination synergism based on the detected crucial sub-molecular network. Experiments on various well-adopted drug-synergy-prediction datasets demonstrate that (1) the SANEpool model has superior predictive ability to generate accurate synergy score prediction, and (2) the sub-molecular networks detected by the SANEpool are self-explainable and salient for identifying synergistic drug combinations.
Submitted 22 August, 2023; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Hot Earth or Young Venus? A nearby transiting rocky planet mystery
Authors:
L. Kaltenegger,
R. C. Payne,
Z. Lin,
J. Kasting,
L. Delrez
Abstract:
Venus and Earth provide astonishingly different views of the evolution of a rocky planet, raising the question of why these two rocky worlds evolved so differently. The recently discovered transiting Super-Earth LP 890-9c (TOI-4306c, SPECULOOS-2c) is a key to the question. It circles a nearby M6V star in 8.46 days. LP 890-9c receives a flux similar to that of modern Earth, which puts it very close to the inner edge of the Habitable Zone (HZ), where models differ strongly in their prediction of how long rocky planets can hold onto their water. We model the atmosphere of a hot LP 890-9c at the inner edge of the HZ, where the planet could sustain several very different environments. The resulting transmission spectra differ considerably between a hot, wet exo-Earth, a steamy planet caught in a runaway greenhouse, and an exo-Venus. Distinguishing these scenarios from the planet's spectra will provide critical new insights into the evolution of hot terrestrial planets into exo-Venuses. Our model and spectra are available online as a tool to plan observations. They show that observing LP 890-9c can provide key insights into the evolution of a rocky planet at the inner edge of the HZ as well as the long-term future of Earth.
Submitted 28 November, 2022; v1 submitted 7 September, 2022;
originally announced September 2022.
-
The Mira-Titan Universe IV. High Precision Power Spectrum Emulation
Authors:
Kelly R. Moran,
Katrin Heitmann,
Earl Lawrence,
Salman Habib,
Derek Bingham,
Amol Upadhye,
Juliana Kwan,
David Higdon,
Richard Payne
Abstract:
Modern cosmological surveys are delivering datasets characterized by unprecedented quality and statistical completeness; this trend is expected to continue into the future as new ground- and space-based surveys come online. In order to maximally extract cosmological information from these observations, matching theoretical predictions are needed. At low redshifts, the surveys probe the nonlinear regime of structure formation where cosmological simulations are the primary means of obtaining the required information. The computational cost of sufficiently resolved large-volume simulations makes it prohibitive to run very large ensembles. Nevertheless, precision emulators built on a tractable number of high-quality simulations can be used to build very fast prediction schemes to enable a variety of cosmological inference studies. We have recently introduced the Mira-Titan Universe simulation suite designed to construct emulators for a range of cosmological probes. The suite covers the standard six cosmological parameters $\{\omega_m, \omega_b, \sigma_8, h, n_s, w_0\}$ and, in addition, includes massive neutrinos and a dynamical dark energy equation of state, $\{\omega_\nu, w_a\}$. In this paper we present the final emulator for the matter power spectrum based on 111 cosmological simulations, each covering a $(2.1\,\mathrm{Gpc})^3$ volume and evolving $3200^3$ particles. An additional set of 1776 lower-resolution simulations and TimeRG perturbation theory results for the power spectrum are used to cover scales straddling the linear to mildly nonlinear regimes. The emulator provides predictions at the two to three percent level of accuracy over a wide range of cosmological parameters and is publicly released as part of this paper.
Submitted 25 July, 2022;
originally announced July 2022.
-
A Bayesian Survival Tree Partition Model Using Latent Gaussian Processes
Authors:
Richard D. Payne,
Nilabja Guha,
Bani K. Mallick
Abstract:
Survival models are used to analyze time-to-event data in a variety of disciplines. Proportional hazard models provide interpretable parameter estimates, but proportional hazards assumptions are not always appropriate. Non-parametric models are more flexible but often lack a clear inferential framework. We propose a Bayesian tree partition model which is both flexible and inferential. Inference is obtained through the posterior tree structure and flexibility is preserved by modeling the hazard function in each partition using a latent exponentiated Gaussian process. An efficient reversible jump Markov chain Monte Carlo algorithm is obtained by marginalizing the parameters in each partition element via a Laplace approximation. Consistency properties for the estimator are established. The method can be used to help determine subgroups as well as prognostic and/or predictive biomarkers in time-to-event data. The method is applied to a liver survival dataset and is compared with some existing methods on simulated data.
Submitted 7 July, 2022;
originally announced July 2022.
-
OntoMerger: An Ontology Integration Library for Deduplicating and Connecting Knowledge Graph Nodes
Authors:
David Geleta,
Andriy Nikolov,
Mark ODonoghue,
Benedek Rozemberczki,
Anna Gogleva,
Valentina Tamma,
Terry R. Payne
Abstract:
Duplication of nodes is a common problem encountered when building knowledge graphs (KGs) from heterogeneous datasets, where it is crucial to be able to merge nodes having the same meaning. OntoMerger is a Python ontology integration library whose functionality is to deduplicate KG nodes. Our approach takes a set of KG nodes, mappings and disconnected hierarchies and generates a set of merged nodes together with a connected hierarchy. In addition, the library provides analytic and data testing functionalities that can be used to fine-tune the inputs, further reducing duplication, and to increase connectivity of the output graph. OntoMerger can be applied to a wide variety of ontologies and KGs. In this paper we introduce OntoMerger and illustrate its functionality on a real-world biomedical KG.
Submitted 5 June, 2022;
originally announced June 2022.
-
Interplay between nonlinear potential theory and fully nonlinear elliptic PDEs
Authors:
F. Reese Harvey,
Kevin R. Payne
Abstract:
We discuss one of the many topics that illustrate the interaction of Blaine Lawson's deep geometric and analytic insights. The first author is extremely grateful to have had the pleasure of collaborating with Blaine over many enjoyable years. The topic to be discussed concerns the fruitful interplay between nonlinear potential theory, that is, the study of subharmonics with respect to a general constraint set in the 2-jet bundle, and the study of subsolutions and supersolutions of a nonlinear (degenerate) elliptic PDE. The main results include (but are not limited to) the validity of the comparison principle and the existence and uniqueness of solutions to the relevant Dirichlet problems on domains which are suitably "pseudoconvex". The methods employed are geometric and flexible as well as being very general on the potential theory side, which is interesting in its own right. Moreover, in many important geometric contexts no natural operator may be present. On the other hand, the potential theoretic approach can yield results on the PDE side in terms of nonstandard structural conditions on a given differential operator.
Submitted 26 March, 2022;
originally announced March 2022.
-
Governance of Autonomous Agents on the Web: Challenges and Opportunities
Authors:
Timotheus Kampik,
Adnane Mansour,
Olivier Boissier,
Sabrina Kirrane,
Julian Padget,
Terry R. Payne,
Munindar P. Singh,
Valentina Tamma,
Antoine Zimmermann
Abstract:
The study of autonomous agents has a long tradition in the Multiagent Systems and the Semantic Web communities, with applications ranging from automating business processes to personal assistants. More recently, the Web of Things (WoT), which is an extension of the Internet of Things (IoT) with metadata expressed in Web standards, and its community provide further motivation for pushing the autonomous agents research agenda forward. Although representing and reasoning about norms, policies and preferences is crucial to ensuring that autonomous agents act in a manner that satisfies stakeholder requirements, normative concepts, policies and preferences have yet to be considered as first-class abstractions in Web-based multiagent systems. Towards this end, this paper motivates the need for alignment and joint research across the Multiagent Systems, Semantic Web, and WoT communities, introduces a conceptual framework for governance of autonomous agents on the Web, and identifies several research challenges and opportunities.
Submitted 5 February, 2022;
originally announced February 2022.
-
MLFC: From 10 to 50 Planners in the Multi-Agent Programming Contest
Authors:
Rafael C. Cardoso,
Angelo Ferrando,
Fabio Papacchini,
Matt Luckcuck,
Sven Linker,
Terry R. Payne
Abstract:
In this paper, we describe the strategies used by our team, MLFC, that led us to achieve 2nd place in the 15th edition of the Multi-Agent Programming Contest. The scenario used in the contest is an extension of the previous edition (14th) "Agents Assemble" wherein two teams of agents move around a 2D grid and compete to assemble complex block structures. We discuss the languages and tools used during the development of our team. Then, we summarise the main strategies that were carried over from our previous participation in the 14th edition and list the limitations (if any) of using these strategies in the latest contest edition. We also developed new strategies that were made specifically for the extended scenario: cartography (determining the size of the map); formal verification of the map merging protocol (to provide assurances that it works when increasing the number of agents); plan cache (efficiently scaling the number of planners); task achievement (forming groups of agents to achieve tasks); and bullies (agents that focus on stopping agents from the opposing team). Finally, we give a brief overview of our performance in the contest and discuss what we believe were our shortcomings.
Submitted 18 October, 2021; v1 submitted 15 October, 2021;
originally announced October 2021.
-
Self-explaining Neural Network with Concept-based Explanations for ICU Mortality Prediction
Authors:
Sayantan Kumar,
Sean C. Yu,
Thomas Kannampallil,
Zachary Abrams,
Andrew Michelson,
Philip R. O. Payne
Abstract:
Complex deep learning models show high prediction performance in various clinical prediction tasks, but their inherent complexity makes it more challenging to explain model predictions for clinicians and healthcare providers. Existing research on explainability of deep learning models in healthcare has two major limitations: using post-hoc explanations and using raw clinical variables as units of explanation, both of which are often difficult for humans to interpret. In this work, we designed a self-explaining deep learning framework using expert-knowledge-driven clinical concepts or intermediate features as units of explanation. The self-explaining nature of our proposed model comes from generating both explanations and predictions within the same architectural framework via joint training. We tested our proposed approach on a publicly available Electronic Health Records (EHR) dataset for predicting patient mortality in the ICU. In order to analyze the performance-interpretability trade-off, we compared our proposed model with a baseline having the same set-up but without the explanation components. Experimental results suggest that adding explainability components to a deep learning framework does not impact prediction performance and the explanations generated by the model can provide insights to the clinicians to understand the possible reasons behind patient mortality.
Submitted 17 November, 2022; v1 submitted 9 October, 2021;
originally announced October 2021.
-
Machine learning for modeling the progression of Alzheimer disease dementia using clinical data: a systematic literature review
Authors:
Sayantan Kumar,
Inez Oh,
Suzanne Schindler,
Albert M Lai,
Philip R O Payne,
Aditi Gupta
Abstract:
Objective: Alzheimer disease (AD) is the most common cause of dementia, a syndrome characterized by cognitive impairment severe enough to interfere with activities of daily life. We aimed to conduct a systematic literature review (SLR) of studies that applied machine learning (ML) methods to clinical data derived from electronic health records in order to model risk for progression of AD dementia.
Materials and Methods: We searched for articles published between January 1, 2010, and May 31, 2020, in PubMed, Scopus, ScienceDirect, IEEE Explore Digital Library, Association for Computing Machinery Digital Library, and arXiv. We used predefined criteria to select relevant articles and summarized them according to key components of ML analysis such as data characteristics, computational algorithms, and research focus.
Results: There has been a considerable rise over the past 5 years in the number of research papers using ML-based analysis for AD dementia modeling. We reviewed 64 relevant articles in our SLR. The results suggest that the majority of existing research has focused on predicting progression of AD dementia using publicly available datasets containing both neuroimaging and clinical data (neurobehavioral status exam scores, patient demographics, neuroimaging data, and laboratory test values).
Discussion: Identifying individuals at risk for progression of AD dementia could potentially help to personalize disease management to plan future care. Clinical data consisting of both structured data tables and clinical notes can be effectively used in ML-based approaches to model risk for AD dementia progression. Data sharing and reproducibility of results can enhance the impact, adaptation, and generalizability of this research.
Submitted 5 August, 2021;
originally announced August 2021.
-
Cetacean Translation Initiative: a roadmap to deciphering the communication of sperm whales
Authors:
Jacob Andreas,
Gašper Beguš,
Michael M. Bronstein,
Roee Diamant,
Denley Delaney,
Shane Gero,
Shafi Goldwasser,
David F. Gruber,
Sarah de Haas,
Peter Malkin,
Roger Payne,
Giovanni Petri,
Daniela Rus,
Pratyusha Sharma,
Dan Tchernov,
Pernille Tønnesen,
Antonio Torralba,
Daniel Vogt,
Robert J. Wood
Abstract:
The past decade has witnessed a groundbreaking rise of machine learning for human language analysis, with current methods capable of automatically and accurately recovering various aspects of syntax and semantics - including sentence structure and grounded word meaning - from large data collections. Recent research showed the promise of such tools for analyzing acoustic communication in nonhuman species. We posit that machine learning will be the cornerstone of future collection, processing, and analysis of multimodal streams of data in animal communication studies, including bioacoustic, behavioral, biological, and environmental data. Cetaceans are unique non-human model species as they possess sophisticated acoustic communications, but utilize a very different encoding system that evolved in an aquatic rather than terrestrial medium. Sperm whales, in particular, with their highly developed neuroanatomical features, cognitive abilities, social structures, and discrete click-based encoding make for an excellent starting point for advanced machine learning tools that can be applied to other animals in the future. This paper details a roadmap toward this goal based on currently existing technology and multidisciplinary scientific community effort. We outline the key elements required for the collection and processing of massive bioacoustic data of sperm whales, detecting their basic communication units and language-like higher-level structures, and validating these models through interactive playback experiments. The technological capabilities developed by such an undertaking are likely to yield cross-applications and advancements in broader communities investigating non-human communication and animal behavioral research.
Submitted 17 April, 2021;
originally announced April 2021.
-
Comparison principles by monotonicity and duality for constant coefficient nonlinear potential theory and PDEs
Authors:
Marco Cirant,
F. Reese Harvey,
H. Blaine Lawson, Jr,
Kevin R. Payne
Abstract:
We prove comparison principles for nonlinear potential theories in Euclidean spaces in a very straightforward manner from duality and monotonicity. We shall also show how to deduce comparison principles for nonlinear differential operators, a program seemingly different from the first. However, we shall marry these two points of view, for a wide variety of equations, under something called the correspondence principle. In potential theory one is given a constraint set F on the 2-jets of a function, and the boundary of F gives a differential equation. There are many differential operators, suitably organized around F, which give the same equation. So potential theory gives a great strengthening and simplification to the operator theory. Conversely, the set of operators associated to F can have much to say about the potential theory. An object of central interest here is that of monotonicity, which explains and unifies much of the theory. We shall always assume that the maximal monotonicity cone for a potential theory has interior. This is automatic for gradient-free equations where monotonicity is simply the standard degenerate ellipticity and properness assumptions. We show that for each such potential theory F there is an associated canonical operator, defined on the entire 2-jet space and having all the desired properties. Furthermore, comparison holds for this operator on any domain which admits a regular strictly M-subharmonic function, where M is a monotonicity subequation for F. On the operator side there is an important dichotomy into the unconstrained cases and constrained cases, where the operator must be restricted to a proper subset of 2-jet space. These two cases are best illustrated by the canonical operators and Dirichlet-Garding operators, respectively. The article gives many examples from pure and applied mathematics, and also from theoretical physics.
Submitted 3 September, 2020;
originally announced September 2020.
-
Repurposing drugs for COVID-19 based on transcriptional response of host cells to SARS-CoV-2
Authors:
Fuhai Li,
Andrew P. Michelson,
Randi Foraker,
Ming Zhan,
Philip R. O. Payne
Abstract:
The Coronavirus Disease 2019 (COVID-19) pandemic has infected over 10 million people globally with a relatively high mortality rate. There are many therapeutics undergoing clinical trials, but there is no effective vaccine or therapy for treatment thus far. After infection by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), molecular signaling of host cells plays critical roles during the life cycle of SARS-CoV-2. Thus, it is important to identify the involved molecular signaling pathways within the host cells, and drugs targeting these molecular signaling pathways could be potentially effective for COVID-19 treatment. In this study, we aimed to identify these potential molecular signaling pathways, and repurpose existing drugs as a potentially effective treatment of COVID-19 to facilitate the therapeutic discovery, based on the transcriptional response of host cells. We first identified dysfunctional signaling pathways associated with the infection caused by SARS-CoV-2 in human lung epithelial cells through analysis of the altered gene expression profiles. In addition to the signaling pathway analysis, the activated gene ontologies (GOs) and super gene ontologies were identified. Signaling pathways and GOs such as MAPK, JNK, STAT, ERK, JAK-STAT, IRF7-NFkB signaling, and MYD88/CXCR6 immune signaling were particularly identified. Based on the identified signaling pathways and GOs, a set of potentially effective drugs were repurposed by integrating the drug-target and reverse gene expression data resources. Dexamethasone, the first drug reported to significantly reduce the death rate of COVID-19 patients receiving respiratory support, was top-ranked in the prediction. The results can be helpful to understand the associated molecular signaling pathways within host cells, and facilitate the discovery of effective drugs for COVID-19 treatment.
Submitted 29 June, 2020; v1 submitted 1 June, 2020;
originally announced June 2020.
-
Warming Early Mars with Climate Cycling: The Effect of CO2-H2 Collision-induced Absorption
Authors:
Benjamin P. C. Hayworth,
Ravi Kumar Kopparapu,
Jacob Haqq-Misra,
Natasha E. Batalha,
Rebecca C. Payne,
Bradford J. Foley,
Mma Ikwut-Ukwa,
James F. Kasting
Abstract:
Explaining the evidence for surface liquid water on early Mars has been a challenge for climate modelers, as the sun was ~30% less luminous during the late-Noachian. We propose that the additional greenhouse forcing of CO2-H2 collision-induced absorption is capable of bringing the surface temperature above freezing and can put early Mars into a limit-cycling regime. Limit cycles occur when insolation is low and CO2 outgassing rates are unable to balance with the rapid drawdown of CO2 during warm weathering periods. Planets in this regime will alternate between global glaciation and transient warm climate phases. This mechanism is capable of explaining the geomorphological evidence for transient warm periods in the martian record. Previous work has shown that collision-induced absorption of CO2-H2 was capable of deglaciating early Mars, but only with high H2 outgassing rates (greater than ~600 Tmol/yr) and at high surface pressures (between 3 and 4 bars). We used new theoretically derived collision-induced absorption coefficients for CO2-H2 to reevaluate the climate limit cycling hypothesis for early Mars. Using the new and stronger absorption coefficients in our 1-dimensional radiative convective model as well as our energy balance model, we find that limit cycling can occur with an H2 outgassing rate as low as ~300 Tmol/yr at surface pressures below 3 bars. Our results agree more closely with paleoparameters for early martian surface pressure and hydrogen abundance.
Submitted 27 April, 2020; v1 submitted 20 April, 2020;
originally announced April 2020.
-
Comparison principles for viscosity solutions of elliptic branches of fully nonlinear equations independent of the gradient
Authors:
Marco Cirant,
Kevin R. Payne
Abstract:
The validity of the comparison principle in variable coefficient fully nonlinear gradient free potential theory is examined and then used to prove the comparison principle for fully nonlinear partial differential equations which determine a suitable potential theory. The approach combines the notions of proper elliptic branches inspired by Krylov (Trans. Amer. Math. Soc. 1995) with the monotonicity-duality method initiated by Harvey and Lawson (Comm. Pure Appl. Math. 2009). In the variable coefficient nonlinear potential theory, a special role is played by the Hausdorff continuity of the proper elliptic map $Θ$ which defines the potential theory. In the applications to nonlinear equations defined by an operator $F$, structural conditions on $F$ will be determined for which there is a correspondence principle between $Θ$-subharmonics/superharmonics and admissible viscosity sub and supersolutions of the nonlinear equation and for which comparison for the equation follows from the associated compatible potential theory. General results and explicit models of interest from differential geometry will be examined.
Submitted 25 February, 2020; v1 submitted 27 January, 2020;
originally announced January 2020.
-
Principal eigenvalues for k-Hessian operators by maximum principle methods
Authors:
Isabeau Birindelli,
Kevin R. Payne
Abstract:
For fully nonlinear $k$-Hessian operators on bounded strictly $(k-1)$-convex domains $Ω$ in ${\mathbb R}^N$, a characterization of the principal eigenvalue associated to a $k$-convex and negative principal eigenfunction will be given as the supremum over values of a spectral parameter for which admissible viscosity supersolutions obey a minimum principle. The admissibility condition is phrased in terms of the natural closed convex cone $Σ_k$ in the space of symmetric N by N matrices, which is an elliptic set in the sense of Krylov [Trans. AMS, 1995] and which corresponds to using $k$-convex functions as admissibility constraints in the formulation of viscosity subsolutions and supersolutions. Moreover, the associated principal eigenfunction is constructed by an iterative viscosity solution technique, which exploits a compactness property which results from the establishment of a global Hölder estimate for the unique $k$-convex solutions of the approximating equations.
Submitted 19 December, 2019;
originally announced December 2019.
-
Large-scale inference of correlation among mixed-type biological traits with phylogenetic multivariate probit models
Authors:
Zhenyu Zhang,
Akihiko Nishimura,
Paul Bastide,
Xiang Ji,
Rebecca P. Payne,
Philip Goulder,
Philippe Lemey,
Marc A. Suchard
Abstract:
Inferring concerted changes among biological traits along an evolutionary history remains an important yet challenging problem. Besides adjusting for spurious correlation induced from the shared history, the task also requires sufficient flexibility and computational efficiency to incorporate multiple continuous and discrete traits as data size increases. To accomplish this, we jointly model mixed-type traits by assuming latent parameters for binary outcome dimensions at the tips of an unknown tree informed by molecular sequences. This gives rise to a phylogenetic multivariate probit model. With large sample sizes, posterior computation under this model is problematic, as it requires repeated sampling from a high-dimensional truncated normal distribution. Current best practices employ multiple-try rejection sampling that suffers from slow-mixing and a computational cost that scales quadratically in sample size. We develop a new inference approach that exploits 1) the bouncy particle sampler (BPS) based on piecewise deterministic Markov processes to simultaneously sample all truncated normal dimensions, and 2) novel dynamic programming that reduces the cost of likelihood and gradient evaluations for BPS to linear in sample size. In an application with 535 HIV viruses and 24 traits that necessitates sampling from a 12,840-dimensional truncated normal, our method makes it possible to estimate the across-trait correlation and detect factors that affect the pathogen's capacity to cause disease. This inference framework is also applicable to a broader class of covariance structures beyond comparative biology.
Submitted 23 September, 2020; v1 submitted 19 December, 2019;
originally announced December 2019.
-
Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models
Authors:
Tianyu Zhang,
Liwei Zhang,
Philip R. O. Payne,
Fuhai Li
Abstract:
Drug resistance is still a major challenge in cancer therapy. Drug combination is expected to overcome drug resistance. However, the number of possible drug combinations is enormous, and thus it is infeasible to experimentally screen all effective drug combinations considering the limited resources. Therefore, computational models to predict and prioritize effective drug combinations are important for combinatory therapy discovery in cancer. In this study, we proposed a novel deep learning model, AuDNNsynergy, to predict drug combinations by integrating multi-omics data and chemical structure data. Specifically, three autoencoders were trained using the gene expression, copy number and genetic mutation data of all tumor samples from The Cancer Genome Atlas. Then the physicochemical properties of drugs combined with the output of the three autoencoders, characterizing the individual cancer cell-lines, were used as the input of a deep neural network that predicts the synergy value of given pair-wise drug combinations against the specific cancer cell-lines. The comparison results showed the proposed AuDNNsynergy model outperforms four state-of-the-art approaches, namely DeepSynergy, Gradient Boosting Machines, Random Forests, and Elastic Nets. Moreover, we conducted the interpretation analysis of the deep learning model to investigate potential vital genetic predictors and the underlying mechanism of synergistic drug combinations on specific cancer cell-lines.
Submitted 16 November, 2018;
originally announced November 2018.
-
Computationally efficient methods for fitting mixed models to electronic health records data
Authors:
Kirsty Rhodes,
Rebecca Turner,
Rupert Payne,
Ian White
Abstract:
Motivated by two case studies using primary care records from the Clinical Practice Research Datalink, we describe statistical methods that facilitate the analysis of tall data, with very large numbers of observations. Our focus is on investigating the association between patient characteristics and an outcome of interest, while allowing for variation among general practices. We explore ways to fit mixed effects models to tall data, including predictors of interest and confounding factors as covariates, and including random intercepts to allow for heterogeneity in outcome among practices. We introduce: (1) weighted regression and (2) meta-analysis of estimated regression coefficients from each practice. Both methods reduce the size of the dataset, thus decreasing the time required for statistical analysis. We compare the methods to an existing subsampling approach. All methods give similar point estimates, and weighted regression and meta-analysis give similar standard errors for point estimates to analysis of the entire dataset, but the subsampling method gives larger standard errors. Where all data are discrete, weighted regression is equivalent to fitting the mixed model to the entire dataset. In the presence of a continuous covariate, meta-analysis is useful. Both methods are easy to implement in standard statistical software.
Submitted 11 May, 2018; v1 submitted 20 April, 2017;
originally announced April 2017.
-
Modelling System of Systems Interface Contract Behaviour
Authors:
Oldrich Faldik,
Richard Payne,
John Fitzgerald,
Barbora Buhnova
Abstract:
A key challenge in System of Systems (SoS) engineering is the analysis and maintenance of global properties under SoS evolution, and the integration of new constituent elements. There is a need to model the constituent systems composing a SoS in order to allow the analysis of emergent behaviours at the SoS boundary. The Contract pattern allows the engineer to specify constrained behaviours to which constituent systems are required to conform in order to be a part of the SoS. However, the Contract pattern faces some limitations in terms of its accessibility and suitability for verifying contract compatibility. To address these deficiencies, we propose the enrichment of the Contract pattern, which hitherto has been defined using SysML and the COMPASS Modelling Language (CML), by utilising SysML and Object Constraint Language (OCL). In addition, we examine the potential of interface automata, a notation for improving loose coupling between interfaces of constituent systems defined according to the contract, as a means of enabling the verification of contract compatibility. The approach is demonstrated using a case study in audio/video content streaming.
Submitted 20 March, 2017;
originally announced March 2017.
-
A Conditional Density Estimation Partition Model Using Logistic Gaussian Processes
Authors:
Richard D. Payne,
Nilabja Guha,
Yu Ding,
Bani K. Mallick
Abstract:
Conditional density estimation (density regression) estimates the distribution of a response variable y conditional on covariates x. Utilizing a partition model framework, a conditional density estimation method is proposed using logistic Gaussian processes. The partition is created using a Voronoi tessellation and is learned from the data using a reversible jump Markov chain Monte Carlo algorithm. The Markov chain Monte Carlo algorithm is made possible through a Laplace approximation on the latent variables of the logistic Gaussian process model. This approximation marginalizes the parameters in each partition element, allowing an efficient search of the posterior distribution of the tessellation. The method has desirable consistency properties. In simulation and applications, the model successfully estimates the partition structure and conditional distribution of y.
Submitted 20 March, 2017;
originally announced March 2017.
-
On viscosity solutions to the Dirichlet problem for elliptic branches of nonhomogeneous fully nonlinear equations
Authors:
Marco Cirant,
Kevin R. Payne
Abstract:
For scalar fully nonlinear partial differential equations depending on the Hessian and spatial coordinates, we present a general theory for obtaining comparison principles and well posedness for the associated Dirichlet problem with continuous boundary data. In particular, we treat admissible viscosity solutions of elliptic branches of the equation, where the nonlinearity need not be monotone on all of the space of symmetric N by N matrices. An elliptic branch (in the sense of Krylov, 1995) of the equation is encoded by a set valued map from the coordinate domain into the elliptic subsets of the symmetric matrices (an elliptic map). The nonlinearity will be monotone along this map and the degenerate elliptic PDE is replaced by a differential inclusion. Weak solutions to such differential inclusions are defined by using the notion given by Harvey-Lawson (2009) in a pointwise manner. If the elliptic map is uniformly upper semicontinuous, we show that the comparison principle holds for these weak solutions and that Perron's method yields a unique continuous solution to the associated abstract Dirichlet problem provided that the boundary is suitably convex with respect to the elliptic map and its dual in the sense of Harvey and Lawson. When the map encodes an elliptic branch of a given PDE, these solutions are shown to be admissible viscosity solutions of the PDE problem. Various applications are described in terms of structural conditions which ensure the existence of the needed elliptic map. Examples include non-totally degenerate equations and equations involving the eigenvalues of the Hessian and their perturbations. In certain situations, the methods employed here will be shown to operate freely, while classical viscosity approaches may not.
Submitted 8 May, 2015;
originally announced May 2015.
-
Two-Stage Metropolis-Hastings for Tall Data
Authors:
Richard D. Payne,
Bani K. Mallick
Abstract:
This paper discusses the challenges presented by tall data problems associated with Bayesian classification (specifically binary classification) and the existing methods to handle them. Current methods include parallelizing the likelihood, subsampling, and consensus Monte Carlo. A new method based on the two-stage Metropolis-Hastings algorithm is also proposed. The purpose of this algorithm is to reduce the exact likelihood computational cost in the tall data situation. In the first stage, a new proposal is tested by the approximate likelihood based model. The full likelihood based posterior computation will be conducted only if the proposal passes the first stage screening. Furthermore, this method can be adopted into the consensus Monte Carlo framework. The two-stage method is applied to logistic regression, hierarchical logistic regression, and Bayesian multivariate adaptive regression splines.
Submitted 20 March, 2017; v1 submitted 20 November, 2014;
originally announced November 2014.
-
Towards Verification of Constituent Systems through Automated Proof
Authors:
Luis Diogo Couto,
Simon Foster,
Richard Payne
Abstract:
This paper explores verification of constituent systems within the context of the Symphony tool platform for Systems of Systems (SoS). Our SoS modelling language, CML, supports various contractual specification elements, such as state invariants and operation preconditions, which can be used to specify contractual obligations on the constituent systems of a SoS. To support verification of these obligations we have developed a proof obligation generator and theorem prover plugin for Symphony. The latter uses the Isabelle/HOL theorem prover to automatically discharge the proof obligations arising from a CML model. Our hope is that the resulting proofs can then be used to formally verify the conformance of each constituent system, which in turn would result in a dependable SoS.
Submitted 7 May, 2014; v1 submitted 30 April, 2014;
originally announced April 2014.
-
Fault Modelling in System-of-Systems Contracts
Authors:
Zoe Andrews,
Jeremy Bryans,
Richard Payne,
Klaus Kristensen
Abstract:
The nature of Systems of Systems (SoSs), large complex systems composed of independent, geographically distributed and continuously evolving constituent systems, means that faults are unavoidable. Previous work on defining contractual specifications of the constituent systems of SoSs does not provide any explicit consideration for faults. In this paper we address that gap by extending an existing pattern for modelling contracts with fault modelling concepts. The proposed extensions are introduced with respect to an Audio Visual SoS case study from Bang and Olufsen, before discussing how they relate to previous work on modelling faults in SoSs.
Submitted 7 October, 2014; v1 submitted 30 April, 2014;
originally announced April 2014.
-
Final Analysis and Results of the Phase II SIMPLE Dark Matter Search
Authors:
M. Felizardo,
T. A. Girard,
T. Morlat,
A. C. Fernandes,
A. R. Ramos,
J. G. Marques,
M. Auguste,
D. Boyer,
A. Cavaillou,
J. Poupeney,
C. Sudre,
J. Puibasset,
H. S. Miley,
R. F. Payne,
F. P. Carvalho,
M. I. Prudêncio,
R. Marques
Abstract:
We report the final results of the Phase II SIMPLE measurements, comprising two run stages of 15 superheated droplet detectors each, the second stage including improved neutron shielding. The analysis includes a refined signal analysis and a revised nucleation efficiency based on reanalysis of previously reported monochromatic neutron irradiations. The combined results yield a contour minimum of σ_{p} = 4.2 x 10^-3 pb at 35 GeV/c^2 on the spin-dependent sector of WIMP-proton interactions, the most restrictive to date from a direct search experiment and overlapping for the first time results previously obtained only indirectly. In the spin-independent sector, a minimum of 3.6 x 10^-6 pb at 35 GeV/c^2 is achieved, with the exclusion contour challenging the recent CoGeNT region of current interest.
Submitted 9 April, 2012; v1 submitted 15 June, 2011;
originally announced June 2011.
-
First Results of the Phase II SIMPLE Dark Matter Search
Authors:
M. Felizardo,
T. Morlat,
A. C. Fernandes,
TA Girard,
J. G. Marques,
A. R. Ramos,
M. Auguste,
D. Boyer,
A. Cavaillou,
C. Sudre,
J. Poupeney,
R. F. Payne,
H. S. Miley,
J. Puibasset
Abstract:
We report results of a 14.1 kgd measurement with 15 superheated droplet detectors of total active mass 0.208 kg, comprising the first stage of a 30 kgd Phase II experiment. In combination with the results of the neutron-spin sensitive XENON10 experiment, these results yield a limit of |a_p| < 0.32 for M_W = 50 GeV/c^2 on the spin-dependent sector of weakly interacting massive particle-nucleus interactions, with a 50% reduction in the previously allowed region of the phase space formerly defined by XENON, KIMS and PICASSO. In the spin-independent sector, a limit of 2.3 x 10^-5 pb at M_W = 45 GeV/c^2 is obtained.
Submitted 20 October, 2010; v1 submitted 15 March, 2010;
originally announced March 2010.
-
A CF3I-based SDD Prototype for Spin-independent Dark Matter Searches
Authors:
T. Morlata,
M. Felizardo,
F. Giuliani,
TA Girard,
G. Waysand,
R. F. Payne,
H. S. Miley,
A. R. Ramos,
J. G. Marques,
R. C. Martins,
D. Limagne
Abstract:
The application of Superheated Droplet Detectors (SDDs) to dark matter searches has so far been confined to the light nuclei refrigerants C2ClF5 and C4F10 (SIMPLE and PICASSO, respectively), with a principal sensitivity to spin-dependent interactions. Given the competitive results of these devices, as a result of their intrinsic insensitivity to backgrounds, we have developed a prototype trifluoroiodomethane (CF3I)-loaded SDD with increased sensitivity to spin-independent interactions as well. A low-exposure (0.102 kgd) test operation of two high-concentration, 1-liter devices is described, and the results are compared with leading experiments in both spin-dependent and spin-independent sectors. Although the prototype is competitive in both sectors when the difference in exposures is accounted for, a problem with fracturing of the detector gel must be addressed before significantly larger exposures can be envisioned.
Submitted 26 August, 2008; v1 submitted 16 April, 2007;
originally announced April 2007.