-
A Differentially Private Blockchain-Based Approach for Vertical Federated Learning
Authors:
Linh Tran,
Sanjay Chari,
Md. Saikat Islam Khan,
Aaron Zachariah,
Stacy Patterson,
Oshani Seneviratne
Abstract:
We present the Differentially Private Blockchain-Based Vertical Federal Learning (DP-BBVFL) algorithm that provides verifiability and privacy guarantees for decentralized applications. DP-BBVFL uses a smart contract to aggregate the feature representations, i.e., the embeddings, from clients transparently. We apply local differential privacy to provide privacy for embeddings stored on a blockchain…
▽ More
We present the Differentially Private Blockchain-Based Vertical Federal Learning (DP-BBVFL) algorithm that provides verifiability and privacy guarantees for decentralized applications. DP-BBVFL uses a smart contract to aggregate the feature representations, i.e., the embeddings, from clients transparently. We apply local differential privacy to provide privacy for embeddings stored on a blockchain, hence protecting the original data. We provide the first prototype application of differential privacy with blockchain for vertical federated learning. Our experiments with medical data show that DP-BBVFL achieves high accuracy with a tradeoff in training time due to on-chain aggregation. This innovative fusion of differential privacy and blockchain technology in DP-BBVFL could herald a new era of collaborative and trustworthy machine learning applications across several decentralized application domains.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
Informing clinical assessment by contextualizing post-hoc explanations of risk prediction models in type-2 diabetes
Authors:
Shruthi Chari,
Prasant Acharya,
Daniel M. Gruen,
Olivia Zhang,
Elif K. Eyigoz,
Mohamed Ghalwash,
Oshani Seneviratne,
Fernando Suarez Saiz,
Pablo Meyer,
Prithwish Chakraborty,
Deborah L. McGuinness
Abstract:
Medical experts may use Artificial Intelligence (AI) systems with greater trust if these are supported by contextual explanations that let the practitioner connect system inferences to their context of use. However, their importance in improving model usage and understanding has not been extensively studied. Hence, we consider a comorbidity risk prediction scenario and focus on contexts regarding…
▽ More
Medical experts may use Artificial Intelligence (AI) systems with greater trust if these are supported by contextual explanations that let the practitioner connect system inferences to their context of use. However, their importance in improving model usage and understanding has not been extensively studied. Hence, we consider a comorbidity risk prediction scenario and focus on contexts regarding the patients clinical state, AI predictions about their risk of complications, and algorithmic explanations supporting the predictions. We explore how relevant information for such dimensions can be extracted from Medical guidelines to answer typical questions from clinical practitioners. We identify this as a question answering (QA) task and employ several state-of-the-art LLMs to present contexts around risk prediction model inferences and evaluate their acceptability. Finally, we study the benefits of contextual explanations by building an end-to-end AI pipeline including data cohorting, AI risk modeling, post-hoc model explanations, and prototyped a visual dashboard to present the combined insights from different context dimensions and data sources, while predicting and identifying the drivers of risk of Chronic Kidney Disease - a common type-2 diabetes comorbidity. All of these steps were performed in engagement with medical experts, including a final evaluation of the dashboard results by an expert medical panel. We show that LLMs, in particular BERT and SciBERT, can be readily deployed to extract some relevant explanations to support clinical usage. To understand the value-add of the contextual explanations, the expert panel evaluated these regarding actionable insights in the relevant clinical setting. Overall, our paper is one of the first end-to-end analyses identifying the feasibility and benefits of contextual explanations in a real-world clinical use case.
△ Less
Submitted 11 February, 2023;
originally announced February 2023.
-
Automated Routing of Droplets for DNA Storage on a Digital Microfluidics Platform
Authors:
Ajay Manicka,
Andrew Stephan,
Sriram Chari,
Gemma Mendonsa,
Peyton Okubo,
John Stolzberg-Schray,
Anil Reddy,
Marc Riedel
Abstract:
Technologies for sequencing (reading) and synthesizing (writing) DNA have progressed on a Moore's law-like trajectory over the last three decades. This has motivated the idea of using DNA for data storage. Theoretically, DNA-based storage systems could out-compete all existing forms of archival storage. However, a large gap exists between what is theoretically possible in terms of read and write s…
▽ More
Technologies for sequencing (reading) and synthesizing (writing) DNA have progressed on a Moore's law-like trajectory over the last three decades. This has motivated the idea of using DNA for data storage. Theoretically, DNA-based storage systems could out-compete all existing forms of archival storage. However, a large gap exists between what is theoretically possible in terms of read and write speeds and what has been practically demonstrated with DNA. This paper introduces a novel approach to DNA storage, with automated assembly on a digital microfluidic biochip. This technology offers unprecedented parallelism in DNA assembly using a dual library of "symbols" and "linkers". An algorithmic solution is discussed for the problem of managing droplet traffic on the device, with prioritized three-dimensional "A*" routing. An overview is given of the software that was developed for routing a large number of droplets in parallel on the device, minimizing congestion and maximizing throughput.
△ Less
Submitted 5 July, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Higher dimensional origami constructions
Authors:
Deveena R. Banerjee,
Sara Chari,
Adriana Salerno
Abstract:
Origami is an ancient art that continues to yield both artistic and scientific insights to this day. In 2012, Buhler, Butler, de Launey, and Graham extended these ideas even further by develo** a mathematical construction inspired by origami -- one in which we iteratively construct points on the complex plane (the "paper") from a set of starting points (or "seed points") and lines through those…
▽ More
Origami is an ancient art that continues to yield both artistic and scientific insights to this day. In 2012, Buhler, Butler, de Launey, and Graham extended these ideas even further by develo** a mathematical construction inspired by origami -- one in which we iteratively construct points on the complex plane (the "paper") from a set of starting points (or "seed points") and lines through those points with prescribed angles (or the allowable "folds" on our paper). Any two lines with these prescribed angles through the seed points that intersect generate a new point, and by iterating this process for each pair of points formed, we generate a subset of the complex plane. We extend previously known results about the algebraic and geometric structure of these sets to higher dimensions. In the case when the set obtained is a lattice, we explore the relationship between the set of angles and the generators of the lattice and determine how introducing a new angle alters the lattice.
△ Less
Submitted 26 May, 2022; v1 submitted 15 October, 2021;
originally announced October 2021.
-
Leveraging Clinical Context for User-Centered Explainability: A Diabetes Use Case
Authors:
Shruthi Chari,
Prithwish Chakraborty,
Mohamed Ghalwash,
Oshani Seneviratne,
Elif K. Eyigoz,
Daniel M. Gruen,
Fernando Suarez Saiz,
Ching-Hua Chen,
Pablo Meyer Rojas,
Deborah L. McGuinness
Abstract:
Academic advances of AI models in high-precision domains, like healthcare, need to be made explainable in order to enhance real-world adoption. Our past studies and ongoing interactions indicate that medical experts can use AI systems with greater trust if there are ways to connect the model inferences about patients to explanations that are tied back to the context of use. Specifically, risk pred…
▽ More
Academic advances of AI models in high-precision domains, like healthcare, need to be made explainable in order to enhance real-world adoption. Our past studies and ongoing interactions indicate that medical experts can use AI systems with greater trust if there are ways to connect the model inferences about patients to explanations that are tied back to the context of use. Specifically, risk prediction is a complex problem of diagnostic and interventional importance to clinicians wherein they consult different sources to make decisions. To enable the adoption of the ever improving AI risk prediction models in practice, we have begun to explore techniques to contextualize such models along three dimensions of interest: the patients' clinical state, AI predictions about their risk of complications, and algorithmic explanations supporting the predictions. We validate the importance of these dimensions by implementing a proof-of-concept (POC) in type-2 diabetes (T2DM) use case where we assess the risk of chronic kidney disease (CKD) - a common T2DM comorbidity. Within the POC, we include risk prediction models for CKD, post-hoc explainers of the predictions, and other natural-language modules which operationalize domain knowledge and CPGs to provide context. With primary care physicians (PCP) as our end-users, we present our initial results and clinician feedback in this paper. Our POC approach covers multiple knowledge sources and clinical scenarios, blends knowledge to explain data and predictions to PCPs, and received an enthusiastic response from our medical expert.
△ Less
Submitted 15 July, 2021; v1 submitted 5 July, 2021;
originally announced July 2021.
-
Semantic Modeling for Food Recommendation Explanations
Authors:
Ishita Padhiar,
Oshani Seneviratne,
Shruthi Chari,
Daniel Gruen,
Deborah L. McGuinness
Abstract:
With the increased use of AI methods to provide recommendations in the health, specifically in the food dietary recommendation space, there is also an increased need for explainability of those recommendations. Such explanations would benefit users of recommendation systems by empowering them with justifications for following the system's suggestions. We present the Food Explanation Ontology (FEO)…
▽ More
With the increased use of AI methods to provide recommendations in the health, specifically in the food dietary recommendation space, there is also an increased need for explainability of those recommendations. Such explanations would benefit users of recommendation systems by empowering them with justifications for following the system's suggestions. We present the Food Explanation Ontology (FEO) that provides a formalism for modeling explanations to users for food-related recommendations. FEO models food recommendations, using concepts from the explanation domain to create responses to user questions about food recommendations they receive from AI systems such as personalized knowledge base question answering systems. FEO uses a modular, extensible structure that lends itself to a variety of explanations while still preserving important semantic details to accurately represent explanations of food recommendations. In order to evaluate this system, we used a set of competency questions derived from explanation types present in literature that are relevant to food recommendations. Our motivation with the use of FEO is to empower users to make decisions about their health, fully equipped with an understanding of the AI recommender systems as they relate to user questions, by providing reasoning behind their recommendations in the form of explanations.
△ Less
Submitted 3 May, 2021;
originally announced May 2021.
-
Explanation Ontology: A Model of Explanations for User-Centered AI
Authors:
Shruthi Chari,
Oshani Seneviratne,
Daniel M. Gruen,
Morgan A. Foreman,
Amar K. Das,
Deborah L. McGuinness
Abstract:
Explainability has been a goal for Artificial Intelligence (AI) systems since their conception, with the need for explainability growing as more complex AI models are increasingly used in critical, high-stakes settings such as healthcare. Explanations have often added to an AI system in a non-principled, post-hoc manner. With greater adoption of these systems and emphasis on user-centric explainab…
▽ More
Explainability has been a goal for Artificial Intelligence (AI) systems since their conception, with the need for explainability growing as more complex AI models are increasingly used in critical, high-stakes settings such as healthcare. Explanations have often added to an AI system in a non-principled, post-hoc manner. With greater adoption of these systems and emphasis on user-centric explainability, there is a need for a structured representation that treats explainability as a primary consideration, map** end user needs to specific explanation types and the system's AI capabilities. We design an explanation ontology to model both the role of explanations, accounting for the system and user attributes in the process, and the range of different literature-derived explanation types. We indicate how the ontology can support user requirements for explanations in the domain of healthcare. We evaluate our ontology with a set of competency questions geared towards a system designer who might use our ontology to decide which explanation types to include, given a combination of users' needs and a system's capabilities, both in system design settings and in real-time operations. Through the use of this ontology, system designers will be able to make informed choices on which explanations AI systems can and should provide.
△ Less
Submitted 3 October, 2020;
originally announced October 2020.
-
Explanation Ontology in Action: A Clinical Use-Case
Authors:
Shruthi Chari,
Oshani Seneviratne,
Daniel M. Gruen,
Morgan A. Foreman,
Amar K. Das,
Deborah L. McGuinness
Abstract:
We addressed the problem of a lack of semantic representation for user-centric explanations and different explanation types in our Explanation Ontology (https://purl.org/heals/eo). Such a representation is increasingly necessary as explainability has become an important problem in Artificial Intelligence with the emergence of complex methods and an uptake in high-precision and user-facing settings…
▽ More
We addressed the problem of a lack of semantic representation for user-centric explanations and different explanation types in our Explanation Ontology (https://purl.org/heals/eo). Such a representation is increasingly necessary as explainability has become an important problem in Artificial Intelligence with the emergence of complex methods and an uptake in high-precision and user-facing settings. In this submission, we provide step-by-step guidance for system designers to utilize our ontology, introduced in our resource track paper, to plan and model for explanations during the design of their Artificial Intelligence systems. We also provide a detailed example with our utilization of this guidance in a clinical setting.
△ Less
Submitted 3 October, 2020;
originally announced October 2020.
-
Directions for Explainable Knowledge-Enabled Systems
Authors:
Shruthi Chari,
Daniel M. Gruen,
Oshani Seneviratne,
Deborah L. McGuinness
Abstract:
Interest in the field of Explainable Artificial Intelligence has been growing for decades and has accelerated recently. As Artificial Intelligence models have become more complex, and often more opaque, with the incorporation of complex machine learning techniques, explainability has become more critical. Recently, researchers have been investigating and tackling explainability with a user-centric…
▽ More
Interest in the field of Explainable Artificial Intelligence has been growing for decades and has accelerated recently. As Artificial Intelligence models have become more complex, and often more opaque, with the incorporation of complex machine learning techniques, explainability has become more critical. Recently, researchers have been investigating and tackling explainability with a user-centric focus, looking for explanations to consider trustworthiness, comprehensibility, explicit provenance, and context-awareness. In this chapter, we leverage our survey of explanation literature in Artificial Intelligence and closely related fields and use these past efforts to generate a set of explanation types that we feel reflect the expanded needs of explanation for today's artificial intelligence applications. We define each type and provide an example question that would motivate the need for this style of explanation. We believe this set of explanation types will help future system designers in their generation and prioritization of requirements and further help generate explanations that are better aligned to users' and situational needs.
△ Less
Submitted 17 March, 2020;
originally announced March 2020.
-
Foundations of Explainable Knowledge-Enabled Systems
Authors:
Shruthi Chari,
Daniel M. Gruen,
Oshani Seneviratne,
Deborah L. McGuinness
Abstract:
Explainability has been an important goal since the early days of Artificial Intelligence. Several approaches for producing explanations have been developed. However, many of these approaches were tightly coupled with the capabilities of the artificial intelligence systems at the time. With the proliferation of AI-enabled systems in sometimes critical settings, there is a need for them to be expla…
▽ More
Explainability has been an important goal since the early days of Artificial Intelligence. Several approaches for producing explanations have been developed. However, many of these approaches were tightly coupled with the capabilities of the artificial intelligence systems at the time. With the proliferation of AI-enabled systems in sometimes critical settings, there is a need for them to be explainable to end-users and decision-makers. We present a historical overview of explainable artificial intelligence systems, with a focus on knowledge-enabled systems, spanning the expert systems, cognitive assistants, semantic applications, and machine learning domains. Additionally, borrowing from the strengths of past approaches and identifying gaps needed to make explanations user- and context-focused, we propose new definitions for explanations and explainable knowledge-enabled systems.
△ Less
Submitted 17 March, 2020;
originally announced March 2020.
-
Metacommutation of primes in Eichler orders
Authors:
Angelica Babei,
Sara Chari
Abstract:
In this article, we study the metacommutation problem in locally Eichler orders. From this arises a permutation of the set of locally principal left ideals of a given prime reduced norm. Previous results on the cycle structure were determined for locally maximal orders. As we extend these results, we present an alternative, combinatorial description of the metacommutation permutation as an action…
▽ More
In this article, we study the metacommutation problem in locally Eichler orders. From this arises a permutation of the set of locally principal left ideals of a given prime reduced norm. Previous results on the cycle structure were determined for locally maximal orders. As we extend these results, we present an alternative, combinatorial description of the metacommutation permutation as an action on the Bruhat-Tits tree.
△ Less
Submitted 27 September, 2019;
originally announced September 2019.
-
Making Study Populations Visible through Knowledge Graphs
Authors:
Shruthi Chari,
Miao Qi,
Nkcheniyere N. Agu,
Oshani Seneviratne,
James P. McCusker,
Kristin P. Bennett,
Amar K. Das,
Deborah L. McGuinness
Abstract:
Treatment recommendations within Clinical Practice Guidelines (CPGs) are largely based on findings from clinical trials and case studies, referred to here as research studies, that are often based on highly selective clinical populations, referred to here as study cohorts. When medical practitioners apply CPG recommendations, they need to understand how well their patient population matches the ch…
▽ More
Treatment recommendations within Clinical Practice Guidelines (CPGs) are largely based on findings from clinical trials and case studies, referred to here as research studies, that are often based on highly selective clinical populations, referred to here as study cohorts. When medical practitioners apply CPG recommendations, they need to understand how well their patient population matches the characteristics of those in the study cohort, and thus are confronted with the challenges of locating the study cohort information and making an analytic comparison. To address these challenges, we develop an ontology-enabled prototype system, which exposes the population descriptions in research studies in a declarative manner, with the ultimate goal of allowing medical practitioners to better understand the applicability and generalizability of treatment recommendations. We build a Study Cohort Ontology (SCO) to encode the vocabulary of study population descriptions, that are often reported in the first table in the published work, thus they are often referred to as Table 1. We leverage the well-used Semanticscience Integrated Ontology (SIO) for defining property associations between classes. Further, we model the key components of Table 1s, i.e., collections of study subjects, subject characteristics, and statistical measures in RDF knowledge graphs. We design scenarios for medical practitioners to perform population analysis, and generate cohort similarity visualizations to determine the applicability of a study population to the clinical population of interest. Our semantic approach to make study populations visible, by standardized representations of Table 1s, allows users to quickly derive clinically relevant inferences about study populations.
△ Less
Submitted 9 July, 2019;
originally announced July 2019.
-
Linked Open Data Validity -- A Technical Report from ISWS 2018
Authors:
Tayeb Abderrahmani Ghor,
Esha Agrawal,
Mehwish Alam,
Omar Alqawasmeh,
Claudia D'amato,
Amina Annane,
Amr Azzam,
Andrew Berezovskyi,
Russa Biswas,
Mathias Bonduel,
Quentin Brabant,
Cristina-iulia Bucur,
Elena Camossi,
Valentina Anita Carriero,
Shruthi Chari,
David Chaves Fraga,
Fiorela Ciroku,
Michael Cochez,
Hubert Curien,
Vincenzo Cutrona,
Rahma Dandan,
Danilo Dess,
Valerio Di Carlo,
Ahmed El Amine Djebri,
Marieke Van Erp
, et al. (46 additional authors not shown)
Abstract:
Linked Open Data (LOD) is the publicly available RDF data in the Web. Each LOD entity is identfied by a URI and accessible via HTTP. LOD encodes globalscale knowledge potentially available to any human as well as artificial intelligence that may want to benefit from it as background knowledge for supporting their tasks. LOD has emerged as the backbone of applications in diverse fields such as Natu…
▽ More
Linked Open Data (LOD) is the publicly available RDF data in the Web. Each LOD entity is identfied by a URI and accessible via HTTP. LOD encodes globalscale knowledge potentially available to any human as well as artificial intelligence that may want to benefit from it as background knowledge for supporting their tasks. LOD has emerged as the backbone of applications in diverse fields such as Natural Language Processing, Information Retrieval, Computer Vision, Speech Recognition, and many more. Nevertheless, regardless of the specific tasks that LOD-based tools aim to address, the reuse of such knowledge may be challenging for diverse reasons, e.g. semantic heterogeneity, provenance, and data quality. As aptly stated by Heath et al. Linked Data might be outdated, imprecise, or simply wrong": there arouses a necessity to investigate the problem of linked data validity. This work reports a collaborative effort performed by nine teams of students, guided by an equal number of senior researchers, attending the International Semantic Web Research School (ISWS 2018) towards addressing such investigation from different perspectives coupled with different approaches to tackle the issue.
△ Less
Submitted 26 March, 2019;
originally announced March 2019.
-
On basic and Bass quaternion orders
Authors:
Sara Chari,
Daniel Smertnig,
John Voight
Abstract:
A quaternion order O over a Dedekind domain R is Bass if every R-superorder is Gorenstein, and O is basic if it contains an integrally closed quadratic R-order. In this article, we show that these conditions are equivalent in local and global settings: a quaternion order is Bass if and only if it is basic. In particular, we show that the property of being basic is a local property of a quaternion…
▽ More
A quaternion order O over a Dedekind domain R is Bass if every R-superorder is Gorenstein, and O is basic if it contains an integrally closed quadratic R-order. In this article, we show that these conditions are equivalent in local and global settings: a quaternion order is Bass if and only if it is basic. In particular, we show that the property of being basic is a local property of a quaternion order.
△ Less
Submitted 1 March, 2019;
originally announced March 2019.
-
Metacommutation in Central Simple Algebras
Authors:
Sara Chari
Abstract:
In a quaternion order of class number one, an element can be factored in multiple ways depending on the order of the factorization of its reduced norm. The fact that multiplication is not commutative causes an element to induce a permutation on the set of primes of a given reduced norm. We discuss this permutation and previously known results about the cycle structure, sign, and number of fixed po…
▽ More
In a quaternion order of class number one, an element can be factored in multiple ways depending on the order of the factorization of its reduced norm. The fact that multiplication is not commutative causes an element to induce a permutation on the set of primes of a given reduced norm. We discuss this permutation and previously known results about the cycle structure, sign, and number of fixed points for quaternion orders. We generalize these results to other orders in central simple algebras over global fields.
△ Less
Submitted 1 November, 2018; v1 submitted 19 September, 2018;
originally announced September 2018.
-
Knowledge Integration for Disease Characterization: A Breast Cancer Example
Authors:
Oshani Seneviratne,
Sabbir M. Rashid,
Shruthi Chari,
James P. McCusker,
Kristin P. Bennett,
James A. Hendler,
Deborah L. McGuinness
Abstract:
With the rapid advancements in cancer research, the information that is useful for characterizing disease, staging tumors, and creating treatment and survivorship plans has been changing at a pace that creates challenges when physicians try to remain current. One example involves increasing usage of biomarkers when characterizing the pathologic prognostic stage of a breast tumor. We present our se…
▽ More
With the rapid advancements in cancer research, the information that is useful for characterizing disease, staging tumors, and creating treatment and survivorship plans has been changing at a pace that creates challenges when physicians try to remain current. One example involves increasing usage of biomarkers when characterizing the pathologic prognostic stage of a breast tumor. We present our semantic technology approach to support cancer characterization and demonstrate it in our end-to-end prototype system that collects the newest breast cancer staging criteria from authoritative oncology manuals to construct an ontology for breast cancer. Using a tool we developed that utilizes this ontology, physician-facing applications can be used to quickly stage a new patient to support identifying risks, treatment options, and monitoring plans based on authoritative and best practice guidelines. Physicians can also re-stage existing patients or patient populations, allowing them to find patients whose stage has changed in a given patient cohort. As new guidelines emerge, using our proposed mechanism, which is grounded by semantic technologies for ingesting new data from staging manuals, we have created an enriched cancer staging ontology that integrates relevant data from several sources with very little human intervention.
△ Less
Submitted 20 July, 2018;
originally announced July 2018.
-
Measurable Riemannian structure on higher dimensional harmonic Sierpinski gaskets
Authors:
Sara Chari,
Joshua Frisch,
Daniel J. Kelleher,
Luke G. Rogers
Abstract:
We prove existence of a measurable Riemannian structure on higher-dimensional harmonic Sierpinski gasket fractals and deduce Gaussian heat kernel bounds in the geodesic metric. Our proof differs from that given by Kigami for the usual Sierpinski gasket in that we show the geodesics are de Rham curves, for which there is an extensive regularity theory.
We prove existence of a measurable Riemannian structure on higher-dimensional harmonic Sierpinski gasket fractals and deduce Gaussian heat kernel bounds in the geodesic metric. Our proof differs from that given by Kigami for the usual Sierpinski gasket in that we show the geodesics are de Rham curves, for which there is an extensive regularity theory.
△ Less
Submitted 9 March, 2017;
originally announced March 2017.
-
A new truncation scheme for BBGKY hierarchy: conservation of energy and time reversibility
Authors:
S. Siva Nasarayya Chari,
Ramarao Inguva,
K. P. N. Murthy
Abstract:
We propose a new truncation scheme for Bogoliubov-Born-Green-Kirkwood-Yvon (BBGKY) hierarchy. We approximate the three particle distribution function $f_{3}(1,2,3,t)$ in terms of $f_{2}(1,2,t)$, $f_{1}(3,t)$ and two point correlation functions $\left\lbrace g_{2}(1,3,t), g_{2}(2,3,t)\right\rbrace $. Further $f_{2}$ is expressed in terms of $f_{1}(1,t)$ and $g_{2}(1,2,t)$ to close the hierarchy, re…
▽ More
We propose a new truncation scheme for Bogoliubov-Born-Green-Kirkwood-Yvon (BBGKY) hierarchy. We approximate the three particle distribution function $f_{3}(1,2,3,t)$ in terms of $f_{2}(1,2,t)$, $f_{1}(3,t)$ and two point correlation functions $\left\lbrace g_{2}(1,3,t), g_{2}(2,3,t)\right\rbrace $. Further $f_{2}$ is expressed in terms of $f_{1}(1,t)$ and $g_{2}(1,2,t)$ to close the hierarchy, resulting a set of coupled kinetic equations for $f_{1}$ and $g_{2}$. In this paper we show that, for velocity independent correlations, the kinetic equation for $f_{1}$ reduces to the model proposed by Martys[Martys N S 1999 \textit{IJMPC} \textbf{10} 1367-1382]. In the steady state limit, the kinetic equation for $g_{2}$ reduces to Born-Green-Yvon (BGY) hierarchy for homogeneous density. We also prove that the present scheme respects the energy conservation and under specific circumstances, time symmetry \textit{i.e.,} $\displaystyle \frac{dH(t)}{dt} = 0$ where $H(t)$ refers to the Boltzmann's H-function.
△ Less
Submitted 8 August, 2016;
originally announced August 2016.
-
DinTucker: Scaling up Gaussian process models on multidimensional arrays with billions of elements
Authors:
Shandian Zhe,
Yuan Qi,
Youngja Park,
Ian Molloy,
Suresh Chari
Abstract:
Infinite Tucker Decomposition (InfTucker) and random function prior models, as nonparametric Bayesian models on infinite exchangeable arrays, are more powerful models than widely-used multilinear factorization methods including Tucker and PARAFAC decomposition, (partly) due to their capability of modeling nonlinear relationships between array elements. Despite their great predictive performance an…
▽ More
Infinite Tucker Decomposition (InfTucker) and random function prior models, as nonparametric Bayesian models on infinite exchangeable arrays, are more powerful models than widely-used multilinear factorization methods including Tucker and PARAFAC decomposition, (partly) due to their capability of modeling nonlinear relationships between array elements. Despite their great predictive performance and sound theoretical foundations, they cannot handle massive data due to a prohibitively high training time. To overcome this limitation, we present Distributed Infinite Tucker (DINTUCKER), a large-scale nonlinear tensor decomposition algorithm on MAPREDUCE. While maintaining the predictive accuracy of InfTucker, it is scalable on massive data. DINTUCKER is based on a new hierarchical Bayesian model that enables local training of InfTucker on subarrays and information integration from all local training results. We use distributed stochastic gradient descent, coupled with variational inference, to train this model. We apply DINTUCKER to multidimensional arrays with billions of elements from applications in the "Read the Web" project (Carlson et al., 2010) and in information security and compare it with the state-of-the-art large-scale tensor decomposition method, GigaTensor. On both datasets, DINTUCKER achieves significantly higher prediction accuracy with less computational time.
△ Less
Submitted 1 February, 2014; v1 submitted 11 November, 2013;
originally announced November 2013.
-
Causes and Consequences of genetic background effects illuminated by integrative genomic analysis
Authors:
Christopher H. Chandler,
Sudarshan Chari,
David Tack,
Ian Dworkin
Abstract:
The phenotypic consequences of individual mutations are modulated by the wild type genetic background in which they occur.Although such background dependence is widely observed, we do not know whether general patterns across species and traits exist, nor about the mechanisms underlying it. We also lack knowledge on how mutations interact with genetic background to influence gene expression, and ho…
▽ More
The phenotypic consequences of individual mutations are modulated by the wild type genetic background in which they occur.Although such background dependence is widely observed, we do not know whether general patterns across species and traits exist, nor about the mechanisms underlying it. We also lack knowledge on how mutations interact with genetic background to influence gene expression, and how this in turn mediates mutant phenotypes. Furthermore, how genetic background influences patterns of epistasis remains unclear. To investigate the genetic basis and genomic consequences of genetic background dependence of the scallopedE3 allele on the Drosophila melanogaster wing, we generated multiple novel genome level datasets from a map** by introgression experiment and a tagged RNA gene expression dataset. In addition we used whole genome re-sequencing of the parental lines two commonly used laboratory strains to predict polymorphic transcription factor binding sites for SD. We integrated these data with previously published genomic datasets from expression microarrays and a modifier mutation screen. By searching for genes showing a congruent signal across multiple datasets, we were able to identify a robust set of candidate loci contributing to the background dependent effects of mutations in sd. We also show that the majority of background-dependent modifiers previously reported are caused by higher-order epistasis, not quantitative non-complementation. These findings provide a useful foundation for more detailed investigations of genetic background dependence in this system, and this approach is likely to prove useful in exploring the genetic basis of other traits as well.
△ Less
Submitted 1 February, 2014; v1 submitted 2 September, 2013;
originally announced September 2013.
-
Does your gene need a background check? How genetic background impacts the analysis of mutations, genes, and evolution
Authors:
Chris H. Chandler,
Sudarshan Chari,
Ian Dworkin
Abstract:
The premise of genetic analysis is that a causal link exists between phenotypic and allelic variation. Yet it has long been documented that mutant phenotypes are not a simple result of a single DNA lesion, but rather are due to interactions of the focal allele with other genes and the environment. Although an experimentally rigorous approach, focusing on individual mutations and isogenic control s…
▽ More
The premise of genetic analysis is that a causal link exists between phenotypic and allelic variation. Yet it has long been documented that mutant phenotypes are not a simple result of a single DNA lesion, but rather are due to interactions of the focal allele with other genes and the environment. Although an experimentally rigorous approach, focusing on individual mutations and isogenic control strains, has facilitated amazing progress within genetics and related fields, a glimpse back suggests that a vast complexity has been omitted from our current understanding of allelic effects. Armed with traditional genetic analyses and the foundational knowledge they have provided, we argue that the time and tools are ripe to return to the under-explored aspects of gene function and embrace the context-dependent nature of genetic effects. We assert that a broad understanding of genetic effects and the evolutionary dynamics of alleles requires identifying how mutational outcomes depend upon the wild-type genetic background. Furthermore, we discuss how best to exploit genetic background effects to broaden genetic research programs.
△ Less
Submitted 12 January, 2013;
originally announced January 2013.