-
Structure and reaction study of Z=120 isotopes using non-relativistic and relativistic mean-field formalism
Authors:
Jeet Amrit Pattnaik,
K. C. Naik,
R. N. Panda,
M. Bhuyan,
S. K. Patra
Abstract:
An extensive study is carried out for the island of stability in the superheavy nuclei of Z=120 and N=164-220 within the effective field theory motivated relativistic mean-field (E-RMF) and the non-relativistic Skyrme-Hartree-Fock (SHF) approaches. The relativistic G3 and IOPB-I and non-relativistic SLy4 and SkMP parameter sets are used for the investigations. Surface properties such as symmetry energy, neutron pressure and the curvature coefficient of symmetry energy are discussed within the coherent density fluctuation model (CDFM) using the Skyrme and the Brückner energy density functionals. The volume and surface contributions of symmetry energy are evaluated using Danielewicz's liquid drop approximation within the CDFM. The total nuclear reaction and elastic differential cross-sections are also obtained for both SHF and E-RMF within the Glauber model. Peaks in the symmetry energy are seen at N=182 for SHF and N=184 for E-RMF, which are absent in the Brückner functional. The shifting of the peak in the symmetry energy with the Brückner functional can be correlated to the Coester-band problem. The enhanced total reaction cross-section for the relativistic density of $^{304}120$ suggests the extra stability of this nucleus. This further confirms the shell/sub-shell closure of N=184 in the E-RMF force. The differential cross-section shows its force-independent nature and a significant increase with the scattering angle.
Submitted 10 November, 2022;
originally announced November 2022.
-
Impact of Geographic Diversity on Citation of Collaborative Research
Authors:
Cian Naik,
Cassidy R. Sugimoto,
Vincent Larivière,
Chenlei Leng,
Weisi Guo
Abstract:
Diversity in human capital is widely seen as critical to creating holistic and high-quality research, especially in areas that engage with diverse cultures, environments, and challenges. Quantification of diverse academic collaborations and their effect on research quality is lacking, especially at international scale and across different domains. Here, we present the first effort to measure the impact of geographic diversity in coauthorships on the citation of their papers across different academic domains. Our results unequivocally show that geographic coauthor diversity improves paper citation, but very long distance collaborations have variable impact. We also discover "well-trodden" collaboration circles that yield much less impact than similar travel distances. These relationships are observed to exist across different subject areas, but with varying strengths. These findings can help academics identify new opportunities from a diversity perspective, as well as inform funders on areas that require additional mobility support.
Submitted 25 April, 2022;
originally announced April 2022.
-
Fast Bayesian Coresets via Subsampling and Quasi-Newton Refinement
Authors:
Cian Naik,
Judith Rousseau,
Trevor Campbell
Abstract:
Bayesian coresets approximate a posterior distribution by building a small weighted subset of the data points. Any inference procedure that is too computationally expensive to be run on the full posterior can instead be run inexpensively on the coreset, with results that approximate those on the full data. However, current approaches are limited by either a significant run-time or the need for the user to specify a low-cost approximation to the full posterior. We propose a Bayesian coreset construction algorithm that first selects a uniformly random subset of data, and then optimizes the weights using a novel quasi-Newton method. Our algorithm is a simple-to-implement, black-box method that does not require the user to specify a low-cost posterior approximation. It is the first to come with a general high-probability bound on the KL divergence of the output coreset posterior. Experiments demonstrate that our method provides significant improvements in coreset quality against alternatives with comparable construction times, with far less storage cost and user input required.
Submitted 15 January, 2023; v1 submitted 17 March, 2022;
originally announced March 2022.
-
Analysis of Software Engineering for Agile Machine Learning Projects
Authors:
Kushal Singla,
Joy Bose,
Chetan Naik
Abstract:
The number of machine learning, artificial intelligence or data science related software engineering projects using Agile methodology is increasing. However, there are very few studies on how such projects work in practice. In this paper, we analyze project issue-tracking data taken from Scrum (a popular tool for Agile) for several machine learning projects. We compare this data with corresponding data from non-machine learning projects, in an attempt to analyze how machine learning projects are executed differently from normal software engineering projects. On analysis, we find that machine learning project issues use different kinds of words to describe issues, have a higher number of exploratory or research-oriented tasks as compared to implementation tasks, and have a higher number of issues in the product backlog after each sprint, denoting that it is more difficult to estimate the duration of machine learning project related tasks in advance. After analyzing this data, we propose a few ways in which Agile machine learning projects can be better logged and executed, given their differences from normal software engineering projects.
Submitted 16 December, 2019;
originally announced December 2019.
-
Sparse Networks with Core-Periphery Structure
Authors:
Cian Naik,
François Caron,
Judith Rousseau
Abstract:
We propose a statistical model for graphs with a core-periphery structure. To do this we define a precise notion of what it means for a graph to have this structure, based on the sparsity properties of the subgraphs of core and periphery nodes. We present a class of sparse graphs with such properties, and provide methods to simulate from this class, and to perform posterior inference. We demonstrate that our model can detect core-periphery structure in simulated and real-world networks.
Submitted 21 October, 2019;
originally announced October 2019.
-
Improving Long Distance Slot Carryover in Spoken Dialogue Systems
Authors:
Tongfei Chen,
Chetan Naik,
Hua He,
Pushpendre Rastogi,
Lambert Mathias
Abstract:
Tracking the state of the conversation is a central component in task-oriented spoken dialogue systems. One such approach for tracking the dialogue state is slot carryover, where a model makes a binary decision on whether a slot from the context is relevant to the current turn. Previous work on the slot carryover task used models that made independent decisions for each slot. A close analysis of the results shows that this approach results in poor performance over longer context dialogues. In this paper, we propose to jointly model the slots. We propose two neural network architectures, one based on pointer networks that incorporate slot ordering information, and the other based on transformer networks that use a self-attention mechanism to model the slot interdependencies. Our experiments on an internal dialogue benchmark dataset and on the public DSTC2 dataset demonstrate that our proposed models are able to resolve longer distance slot references and are able to achieve competitive performance.
Submitted 3 June, 2019;
originally announced June 2019.
-
Efficient Feature Selection of Power Quality Events using Two Dimensional (2D) Particle Swarms
Authors:
Faizal Hafiz,
Akshya Swain,
Chirag Naik,
Nitish Patel
Abstract:
A novel two-dimensional (2D) learning framework has been proposed to address the feature selection problem in Power Quality (PQ) events. Unlike the existing feature selection approaches, the proposed 2D learning explicitly incorporates the information about the subset cardinality (i.e., the number of features) as an additional learning dimension to effectively guide the search process. The efficacy of this approach has been demonstrated considering fourteen distinct classes of PQ events which conform to the IEEE Standard 1159. The search performance of the 2D learning approach has been compared to the other six well-known feature selection wrappers by considering two induction algorithms: Naive Bayes (NB) and k-Nearest Neighbors (k-NN). Further, the robustness of the selected/reduced feature subsets has been investigated considering seven different levels of noise. The results of this investigation convincingly demonstrate that the proposed 2D learning can identify significantly better and robust feature subsets for PQ events.
Submitted 15 April, 2019;
originally announced April 2019.
-
Cross-Lingual Approaches to Reference Resolution in Dialogue Systems
Authors:
Amr Sharaf,
Arpit Gupta,
Hancheng Ge,
Chetan Naik,
Lambert Mathias
Abstract:
In the slot-filling paradigm, where a user can refer back to slots in the context during the conversation, the goal of the contextual understanding system is to resolve the referring expressions to the appropriate slots in the context. In this paper, we build on the context carryover system (Naik et al., 2018), which provides a scalable multi-domain framework for resolving references. However, scaling this approach across languages is not a trivial task, due to the large demand on acquisition of annotated data in the target language. Our main focus is on cross-lingual methods for reference resolution as a way to alleviate the need for annotated data in the target language. In the cross-lingual setup, we assume there is access to annotated resources as well as a well-trained model in the source language and little to no annotated data in the target language. In this paper, we explore three different approaches for cross-lingual transfer: delexicalization as data augmentation, multilingual embeddings and machine translation. We compare these approaches in both a low-resource setting and a large-resource setting. Our experiments show that multilingual embeddings and delexicalization via data augmentation have a significant impact in the low-resource setting, but the gains diminish as the amount of available data in the target language increases. Furthermore, when combined with machine translation we can get performance very close to actual live data in the target language, with only 25% of the data projected into the target language.
Submitted 27 November, 2018;
originally announced November 2018.
-
A Two-Dimensional (2-D) Learning Framework for Particle Swarm based Feature Selection
Authors:
Faizal Hafiz,
Akshya Swain,
Nitish Patel,
Chirag Naik
Abstract:
This paper proposes a new generalized two-dimensional learning approach for particle swarm based feature selection. The core idea of the proposed approach is to include the information about the subset cardinality into the learning framework by extending the dimension of the velocity. The 2D-learning framework retains all the key features of the original PSO, despite the extra learning dimension. Most of the popular variants of PSO can easily be adapted into this 2D learning framework for feature selection problems. The efficacy of the proposed learning approach has been evaluated considering several benchmark datasets and two induction algorithms: Naive-Bayes and k-Nearest Neighbor. The results of the comparative investigation, including the time-complexity analysis with GA, ACO and five other PSO variants, illustrate that the proposed 2D learning approach gives feature subsets with relatively smaller cardinality and better classification performance with shorter run times.
Submitted 3 August, 2018;
originally announced August 2018.
-
Contextual Slot Carryover for Disparate Schemas
Authors:
Chetan Naik,
Arpit Gupta,
Hancheng Ge,
Lambert Mathias,
Ruhi Sarikaya
Abstract:
In the slot-filling paradigm, where a user can refer back to slots in the context during a conversation, the goal of the contextual understanding system is to resolve the referring expressions to the appropriate slots in the context. In large-scale multi-domain systems, this presents two challenges: scaling to a very large and potentially unbounded set of slot values, and dealing with diverse schemas. We present a neural network architecture that addresses the slot value scalability challenge by reformulating the contextual interpretation as a decision to carry over a slot from a set of possible candidates. To deal with heterogeneous schemas, we introduce a simple data-driven method for transforming the candidate slots. Our experiments show that our approach can scale to multiple domains and provides competitive results over a strong baseline.
Submitted 5 June, 2018;
originally announced June 2018.
-
Study of hot thermally fissile nuclei using relativistic mean field theory
Authors:
Abdul Quddus,
K. C. Naik,
S. K. Patra
Abstract:
We have studied the properties of hot $^{234,236}$U and $^{240}$Pu nuclei in the framework of relativistic mean field formalism. The recently developed FSUGarnet and IOPB-I parameter sets are applied for the first time to deformed nuclei at finite temperature. The results are compared with the well-known NL3 set. The said isotopes are structurally important because of the thermally fissile nature of $^{233,235}$U and $^{239}$Pu: these nuclei ($^{234,236}$U and $^{240}$Pu) are formed after the absorption of a thermal neutron and subsequently undergo fission. Here, we have evaluated the nuclear properties, such as shell correction energy, neutron-skin thickness, quadrupole and hexadecapole deformation parameters and asymmetry energy coefficient for these nuclei as a function of temperature.
Submitted 20 June, 2018; v1 submitted 6 April, 2018;
originally announced April 2018.
-
Quantifying the causal effect of speed cameras on road traffic accidents via an approximate Bayesian doubly robust estimator
Authors:
Daniel J Graham,
Cian Naik,
Emma J McCoy,
Haojie Li
Abstract:
This paper quantifies the effect of speed cameras on road traffic collisions using an approximate Bayesian doubly-robust (DR) causal inference estimation method. Previous empirical work on this topic, which shows a diverse range of estimated effects, is based largely on outcome regression (OR) models using the Empirical Bayes approach or on simple before-and-after comparisons. Issues of causality and confounding have received little formal attention. A causal DR approach combines propensity score (PS) and OR models to give an average treatment effect (ATE) estimator that is consistent and asymptotically normal under correct specification of either of the two component models. We develop this approach within a novel approximate Bayesian framework to derive posterior predictive distributions for the ATE of speed cameras on road traffic collisions. Our results for England indicate significant reductions in the number of collisions at speed camera sites (mean ATE = -15%). Our proposed method offers a promising approach for evaluation of transport safety interventions.
Submitted 16 August, 2019; v1 submitted 17 March, 2017;
originally announced March 2017.
-
Multiply robust dose-response estimation for multivalued causal inference problems
Authors:
Cian Naik,
Emma J. McCoy,
Daniel J. Graham
Abstract:
This paper develops a multiply robust (MR) dose-response estimator for causal inference problems involving multivalued treatments. We combine a family of generalised propensity score (GPS) models and a family of outcome regression (OR) models to achieve an average potential outcomes estimator that is consistent if just one of the GPS or OR models in each family is correctly specified. We provide proofs and simulations that demonstrate multiple robustness in the context of multivalued causal inference problems.
Submitted 17 May, 2017; v1 submitted 8 November, 2016;
originally announced November 2016.
-
Bayesian Nonparametrics for Sparse Dynamic Networks
Authors:
Cian Naik,
Francois Caron,
Judith Rousseau,
Yee Whye Teh,
Konstantina Palla
Abstract:
In this paper we propose a Bayesian nonparametric approach to modelling sparse time-varying networks. A positive parameter is associated with each node of a network, which models the sociability of that node. Sociabilities are assumed to evolve over time, and are modelled via a dynamic point process model. The model is able to capture long term evolution of the sociabilities. Moreover, it yields sparse graphs, where the number of edges grows subquadratically with the number of nodes. The evolution of the sociabilities is described by a tractable time-varying generalised gamma process. We provide some theoretical insights into the model and apply it to three datasets: a simulated network, a network of hyperlinks between communities on Reddit, and a network of co-occurrences of words in Reuters news articles after the September 11th attacks.
Submitted 14 April, 2022; v1 submitted 6 July, 2016;
originally announced July 2016.
-
A flat space-time relativistic explanation for the perihelion advance of Mercury
Authors:
Harihar Behera,
P. C. Naik
Abstract:
Starting with the flat space-time relativistic versions of Maxwell-Heaviside's toy model vector theory of gravity and introducing the gravitational analogues for the electromagnetic Liénard-Wiechert potentials together with the notion of a gravitational Thomas precession, the observed anomalous perihelion advance of Mercury's orbit is here explained as a relativistic effect in flat (Minkowski) space-time, unlike Einstein's curved space-time relativistic explanation. In this new explanation for the old paradoxical observation of Mercury's perihelion shift, the predicted value of the effect happens to coincide with Einstein's predicted value in general relativity.
Submitted 29 June, 2003;
originally announced June 2003.
-
Gravitomagnetic Moments and Dynamics of Dirac's (spin 1/2) fermions in flat space-time Maxwellian Gravity
Authors:
Harihar Behera,
P. C. Naik
Abstract:
Gravitational effects in relativistic quantum mechanics are investigated in a relativistically derived version of Heaviside's speculative gravity (in flat space-time), named here Maxwellian Gravity. The standard Dirac approach to intrinsic spin in the fields of Maxwellian Gravity yields a gravitomagnetic moment of a Dirac (spin 1/2) particle exactly equal to its intrinsic spin. Violation of the equivalence principle (at both the classical and the quantum mechanical level) in the relativistic domain is also reported in this work.
Submitted 22 April, 2003; v1 submitted 22 April, 2003;
originally announced April 2003.