-
Differentially Processed Optimized Collaborative Rich Text Editor
Authors:
Nishtha Jatana,
Mansehej Singh,
Charu Gupta,
Geetika Dhand,
Shaily Malik,
Pankaj Dadheech,
Nagender Aneja,
Sandhya Aneja
Abstract:
A collaborative real-time text editor is an application that allows multiple users to edit a document simultaneously and merge their contributions automatically. It can be made collaborative by implementing a conflict resolution algorithm either on the client side (in peer-to-peer collaboration) or on the server side (when using web sockets and a central server to monitor state changes). Although web sockets are ideal for real-time text editors, using multiple collaborative editors on one connection can create problems. This is because a single web connection cannot monitor which user is collaborating on which application state, leading to unnecessary network queries and data being delivered to the wrong state. To address this issue, the current solution is to open multiple web socket connections, with one web socket per collaboration application. However, this can add significant overhead proportional to the number of apps utilized. In this study, we demonstrate an algorithm that enables using a single web socket for multiple collaborative applications in a collaborative editor. Our method involves modifying the socket's code to track which application's shared state is being worked on and by whom. This allows for the simultaneous collaboration of multiple states in real-time, with infinite users, without opening a different socket for each application. Our optimized editor showed an efficiency improvement of over 96% in access time duration. This approach can be implemented in other collaborative editors and web applications with similar architecture to improve performance and eliminate issues arising from network overload.
Submitted 3 July, 2024;
originally announced July 2024.
-
Extrinsic Evaluation of Cultural Competence in Large Language Models
Authors:
Shaily Bhatt,
Fernando Diaz
Abstract:
Productive interactions between diverse users and language technologies require outputs from the latter to be culturally relevant and sensitive. Prior works have evaluated models' knowledge of cultural norms, values, and artifacts, without considering how this knowledge manifests in downstream applications. In this work, we focus on extrinsic evaluation of cultural competence in two text generation tasks, open-ended question answering and story generation. We quantitatively and qualitatively evaluate model outputs when an explicit cue of culture, specifically nationality, is perturbed in the prompts. Although we find that model outputs do vary when varying nationalities and feature culturally relevant words, we also find weak correlations between text similarity of outputs for different countries and the cultural values of these countries. Finally, we discuss important considerations in designing comprehensive evaluation of cultural competence in user-facing tasks.
Submitted 19 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Fairness Without Demographics in Human-Centered Federated Learning
Authors:
Shaily Roy,
Harshit Sharma,
Asif Salekin
Abstract:
Federated learning (FL) enables collaborative model training while preserving data privacy, making it suitable for decentralized human-centered AI applications. However, a significant research gap remains in ensuring fairness in these systems. Current fairness strategies in FL require knowledge of bias-creating/sensitive attributes, clashing with FL's privacy principles. Moreover, in human-centered datasets, sensitive attributes may remain latent. To tackle these challenges, we present a novel bias mitigation approach inspired by "Fairness without Demographics" in machine learning. The presented approach achieves fairness without needing knowledge of sensitive attributes by minimizing the top eigenvalue of the Hessian matrix during training, ensuring equitable loss landscapes across FL participants. Notably, we introduce a novel FL aggregation scheme that promotes participating models based on error rates and loss landscape curvature attributes, fostering fairness across the FL system. This work represents the first approach to attaining "Fairness without Demographics" in human-centered FL. Through comprehensive evaluation, our approach demonstrates effectiveness in balancing fairness and efficacy across various real-world applications, FL setups, and scenarios involving single and multiple bias-inducing factors, representing a significant advancement in human-centered FL.
Submitted 15 May, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
Stability analysis of a dark energy model in Rastall gravity
Authors:
Shaily,
Akanksha Singh,
J. K. Singh,
Saddam Hussain,
Ratbay Myrzakulov
Abstract:
We study a cosmological model in Rastall's theory of gravity in the framework of the flat FLRW metric. We formulate the value of the Hubble parameter, which contains two model parameters, $ α$ and $ j $. Employing the Markov Chain Monte Carlo (MCMC) sampling technique, we determine the values of these model parameters along with their uncertainties. Moreover, we derive the equation of state (EoS) parameter, which converges around the quintessence region. We perform a dynamical system analysis using the linearization technique to validate the results independently. Also, we discuss various physical attributes of the model, highlighting the transition to acceleration and the violation of the strong energy condition (SEC) in the late stages of evolution. In conclusion, our model mimics the behavior of a dark matter fluid during the past epoch and transitions into a quintessence dark energy model in the future epoch.
Submitted 27 April, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Late time phantom characteristic of the model in $f(R,T)$ gravity with quadratic curvature term
Authors:
Shaily,
Akanksha Singh,
J. K. Singh,
Saibal Ray
Abstract:
We propose a novel cosmological framework within the $f(R,T)$ type modified gravity theory, incorporating a non-minimal coupling with higher orders of the Ricci scalar ($R$) as well as the trace of the energy-momentum tensor ($T$). Therefore, our well-motivated chosen $f(R,T)$ expression is $ R + R^m + 2 λT^n$, where $λ$, $m$, and $n$ are arbitrary constants. Taking a constant jerk parameter ($j$), we derive expressions for the deceleration parameter ($q$) and the Hubble parameter ($H$) as functions of the redshift $z$. We constrain our model with the recent Observational Hubble Dataset (OHD), $Pantheon$, and $ Pantheon $ + OHD datasets using Markov Chain Monte Carlo (MCMC) analysis. Our model shows early deceleration followed by late-time acceleration, with the transition occurring in the redshift range $1.10 \leq z_{tr} \leq 1.15$. Our findings suggest that this higher-order model of $f(R,T)$ gravity theory can efficiently provide a dark energy model for addressing the current scenario of cosmic acceleration.
Submitted 1 February, 2024;
originally announced February 2024.
-
EDSFD parametrization in $ f(R,T) $ gravity with linear curvature terms
Authors:
J. K. Singh,
Shaily,
Harshna Balhara,
Sushant G. Ghosh,
Sunil D. Maharaj
Abstract:
This paper investigates the flat Friedmann-Lemaître-Robertson-Walker (FLRW) cosmological model using a suitable parameterization represented as a differential equation concerning the energy density of the scalar field, $ρ_φ$, in the context of $f(R,T)$ gravity theory. This parameterization is known as the Energy Density Scalar Field Differential Equation (EDSFD) parametrization. It results in a solution of the Hubble parameter containing four model parameters, namely, $Ω_{m0},Ω_{φ0}, H_0,$ and $α$. To constrain the model parameters, $77$ data points from the Hubble dataset, $1048$ points from the Pantheon dataset, and $6$ data points from BAO are used. Using the constrained values, we analyze and compare our model with the standard $Λ$CDM model. The evolution of the physical parameters, including the deceleration parameter, density parameter, Equation of State (EoS) for Dark Energy, and $Om(z)$ diagnostic, is discussed.
Submitted 28 April, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Fair Allocation of goods and chores -- Tutorial and Survey of Recent Results
Authors:
Shaily Mishra,
Manisha Padala,
Sujit Gujar
Abstract:
Fair resource allocation is an important problem in many real-world scenarios, where resources such as goods and chores must be allocated among agents. In this survey, we delve into the intricacies of fair allocation, focusing specifically on the challenges associated with indivisible resources. We define fairness and efficiency within this context and thoroughly survey existential results, algorithms, and approximations that satisfy various fairness criteria, including envy-freeness, proportionality, MMS, and their relaxations. Additionally, we discuss algorithms that achieve fairness and efficiency, such as Pareto Optimality and Utilitarian Welfare. We also study the computational complexity of these algorithms, the likelihood of finding fair allocations, and the price of fairness for each fairness notion. We also cover mixed instances of indivisible and divisible items and investigate different valuation and allocation settings. By summarizing the state-of-the-art research, this survey provides valuable insights into fair resource allocation of indivisible goods and chores, highlighting computational complexities, fairness guarantees, and trade-offs between fairness and efficiency. It serves as a foundation for future advancements in this vital field.
Submitted 21 July, 2023; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generation
Authors:
Rishabh Gupta,
Shaily Desai,
Manvi Goel,
Anil Bandhakavi,
Tanmoy Chakraborty,
Md. Shad Akhtar
Abstract:
Counterspeech has been demonstrated to be an efficacious approach for combating hate speech. While various conventional and controlled approaches have been studied in recent years to generate counterspeech, a counterspeech with a certain intent may not be sufficient in every scenario. Due to the complex and multifaceted nature of hate speech, utilizing multiple forms of counter-narratives with varying intents may be advantageous in different circumstances. In this paper, we explore intent-conditioned counterspeech generation. At first, we develop IntentCONAN, a diversified intent-specific counterspeech dataset with 6831 counterspeeches conditioned on five intents, i.e., informative, denouncing, question, positive, and humour. Subsequently, we propose QUARC, a two-stage framework for intent-conditioned counterspeech generation. QUARC leverages vector-quantized representations learned for each intent category along with PerFuMe, a novel fusion module to incorporate intent-specific information into the model. Our evaluation demonstrates that QUARC outperforms several baselines by an average of 10% across evaluation metrics. An extensive human evaluation supplements our hypothesis of better and more appropriate responses than comparative systems.
Submitted 23 May, 2023;
originally announced May 2023.
-
A non-singular bouncing cosmology in $ f(R,T) $ gravity
Authors:
J. K. Singh,
Shaily,
Akanksha Singh,
Aroonkumar Beesham,
Hamid Shabani
Abstract:
We investigate a bounce realization in the framework of higher order curvature in $ f(R,T) $ modified theory of gravity. We perform a detailed analysis of the cosmological parameters to explain the contraction phase, the bounce phase, and the expansion phase. Furthermore, we observe a violation of the null energy condition, instability of the model, and a singularity upon deceleration at the bouncing point, which are the supporting results for a bouncing cosmology. The outcome of the slow roll parameters is satisfactory to understand the inflation era and the equation of state parameter exhibits a ghost condensate behavior of the model near the bounce. Additionally, we discuss the stability of the model using linear perturbations in the Hubble parameter as well as the energy density.
Submitted 23 April, 2023;
originally announced April 2023.
-
Power law cosmology in modified theory with higher order curvature term
Authors:
J. K. Singh,
Shaily,
Anirudh Pradhan,
Aroonkumar Beesham
Abstract:
In this paper, we consider a cosmological model in $ f(R,G) $ gravity in flat space-time, where $ R $ is the Ricci scalar and $ G $ is the Gauss-Bonnet invariant. Here, the function $ f(R,G) $ is taken as a linear combination of $ R $ and an exponential function of $ G $. We analyze the observational constraints under a power law cosmology which depends on two parameters, viz., the Hubble constant $ H_0 $ and the deceleration parameter $ q $. We examine the three sets of constraints $ H_0=68.119_{-0.12}^{+0.028} $ $ km s^{-1} Mpc^{-1} $, $ q=-0.109_{-0.014}^{+0.014} $; $ H_0=70.5_{-0.98}^{+1.3} $ $ km s^{-1} Mpc^{-1} $, $ q=-0.25_{-0.15}^{+0.15} $ and $ H_0=69.103_{-0.10}^{+0.019} $ $ km s^{-1} Mpc^{-1} $, $ q=-0.132_{-0.014}^{+0.014} $, obtained by using the latest 77 points of the $ H(z) $ data, 1048 points of the $ Pantheon $ data and the joint data of $ H(z)+Pantheon $ at the $ 1σ$ level, respectively. We compare our results with the results of the $ Λ$CDM model. We find that our estimate of $ H_0 $ is in very close agreement with some of the latest results from the Planck Collaboration that assume the $ Λ$CDM model. Our work in power law cosmology provides a better fit to the $ Pantheon $ data than the earlier analysis. We also discuss statefinder diagnostics and see that the power law models approach the standard $Λ$CDM model ($ q\rightarrow -0.5 $). Finally, we conclude that in $ f(R,G) $ gravity, power law cosmology explains most of the distinguished attributes of evolution in cosmology.
Submitted 19 April, 2023;
originally announced April 2023.
-
Parameterized algorithms for Eccentricity Shortest Path Problem
Authors:
Sriram Bhyravarapu,
Satyabrata Jana,
Lawqueen Kanesh,
Saket Saurabh,
Shaily Verma
Abstract:
Given an undirected graph $G=(V,E)$ and an integer $\ell$, the Eccentricity Shortest Path (ESP) asks to find a shortest path $P$ such that for every vertex $v\in V(G)$, there is a vertex $w\in P$ such that $d_G(v,w)\leq \ell$, where $d_G(v,w)$ represents the distance between $v$ and $w$ in $G$. Dragan and Leitert [Theor. Comput. Sci. 2017] showed that the optimization version of this problem, which asks to find the minimum $\ell$ for the ESP problem, is NP-hard even on planar bipartite graphs with maximum degree 3. They also showed that ESP is W[2]-hard when parameterized by $\ell$. On the positive side, Kučera and Suchý [IWOCA 2021] showed that the problem exhibits fixed parameter tractable (FPT) behavior when parameterized by modular width, cluster vertex deletion set, maximum leaf number, or the combined parameters disjoint paths deletion set and $\ell$. It was asked as an open question in the above paper, if ESP is FPT parameterized by disjoint paths deletion set or feedback vertex set. We answer these questions partially and obtain the following results: - ESP is FPT when parameterized by disjoint paths deletion set, split vertex deletion set or the combined parameters feedback vertex set and eccentricity of the graph. - We design a $(1+ε)$-factor FPT approximation algorithm when parameterized by the feedback vertex set number. - ESP is W[2]-hard when parameterized by the chordal vertex deletion set.
Submitted 6 April, 2023;
originally announced April 2023.
-
The cosmological model in $ f(R,T^φ) $ gravity with Scalar Field conformity
Authors:
J. K. Singh,
Akanksha Singh,
Shaily,
J. Jena
Abstract:
The homogeneous and isotropic cosmological model in generalized $ f(R,T^φ) $ theories associated with a scalar field is discussed, which is motivated by the $ f(R,T) $ theory of gravity studied by Harko et al. \cite{Harko:2011kv, Harko:2014pqa}. The $ f(R,T^φ) $ gravity can be explained as $ f(R,T) $ gravity with a self-interacting scalar field $ φ$, where $ T^φ$ is the trace of the energy-momentum tensor. The parametrization of the Hubble parameter $ H(t) $ is taken as $ α-βe^{-γt} $, where $ α$, $β$ and $γ$ are arbitrary constants such that $ α, γ>0 $ and $ β<0 $. The model shows no space-time singularity and the expansion of the universe will continue forever, i.e., the future scenario of the universe attains a Big Freeze. The model predicts a moderate inflationary scenario at the time of the evolution of the universe and it is consistent with $ Λ$CDM in late times. The consistency of the model has also been examined using recent observational Hubble and supernovae datasets. Finally, the physical features of the model have been discussed in some detail.
Submitted 22 December, 2022;
originally announced December 2022.
-
A constrained cosmological model in $f(R,L_m)$ gravity
Authors:
J. K. Singh,
Shaily,
Ratbay Myrzakulov,
Harshna Balhara
Abstract:
In this article, we study the expanding nature of the universe in the context of $f(R,L_m)$ gravity theory, where $ R $ represents the Ricci scalar and $ L_m $ is the matter Lagrangian density. With a specific form of $ f(R,L_m) $, we obtain the field equations for the flat FLRW metric. We parametrize the deceleration parameter in terms of the Hubble parameter and from here we find four free parameters, which are constrained and estimated by using $H(z)$, $Pantheon$, and their joint data sets. Further, we investigate the evolution of the deceleration parameter, which depicts a transition from the deceleration to acceleration phases of the universe. The evolution behaviour of energy density, pressure, and EoS parameters shows that the present model is an accelerated quintessence dark energy model. To compare our model with the $ Λ$CDM model, we use some of the diagnostic techniques. Thus, we find that our model in $ f(R,L_m) $ gravity supports the recent standard observational studies and delineates the late-time cosmic acceleration.
Submitted 22 December, 2022;
originally announced December 2022.
-
Cultural Re-contextualization of Fairness Research in Language Technologies in India
Authors:
Shaily Bhatt,
Sunipa Dev,
Partha Talukdar,
Shachi Dave,
Vinodkumar Prabhakaran
Abstract:
Recent research has revealed undesirable biases in NLP data and models. However, these efforts largely focus on social disparities in the West, and are not directly portable to other geo-cultural contexts. In this position paper, we outline a holistic research agenda to re-contextualize NLP fairness research for the Indian context, accounting for Indian societal context, bridging technological gaps in capability and resources, and adapting to Indian cultural values. We also summarize findings from an empirical study on various social biases along different axes of disparities relevant to India, demonstrating their prevalence in corpora and models.
Submitted 21 November, 2022;
originally announced November 2022.
-
Re-contextualizing Fairness in NLP: The Case of India
Authors:
Shaily Bhatt,
Sunipa Dev,
Partha Talukdar,
Shachi Dave,
Vinodkumar Prabhakaran
Abstract:
Recent research has revealed undesirable biases in NLP data and models. However, these efforts focus on social disparities in the West, and are not directly portable to other geo-cultural contexts. In this paper, we focus on NLP fairness in the context of India. We start with a brief account of the prominent axes of social disparities in India. We build resources for fairness evaluation in the Indian context and use them to demonstrate prediction biases along some of the axes. We then delve deeper into social stereotypes for Region and Religion, demonstrating their prevalence in corpora and models. Finally, we outline a holistic research agenda to re-contextualize NLP fairness research for the Indian context, accounting for Indian societal context, bridging technological gaps in NLP capabilities and resources, and adapting to Indian cultural values. While we focus on India, this framework can be generalized to other geo-cultural contexts.
Submitted 21 November, 2022; v1 submitted 25 September, 2022;
originally announced September 2022.
-
The constrained cosmological model in Lyra geometry
Authors:
J. K. Singh,
Shaily,
Shri Ram,
Joao R. L. Santos,
Jéferson A. S. Fortunato
Abstract:
In this article, we study a flat homogeneous FLRW model in Lyra geometry which is described by a time-dependent displacement vector. We consider an appropriate parametrization of the energy density of scalar field $ ρ_φ$ in terms of the cosmic scale factor. The result shows two transitions from deceleration to acceleration. Furthermore, we constrain the model parameter $ α$ and the displacement field vector $ β$ using the recent supernovae data, Hubble data set of 77 points, and their joint data which predicts the accelerated expanding phase of the universe in late times. The effective equation of state parameter $ ω_{eff} $ speculate $ Λ$CDM in late times. Finally, we use the statefinder diagnostic to differentiate our model from the various dark energy models.
Submitted 3 April, 2023; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Dynamical analysis of a hyperbolic solution in Scale-covariant theory
Authors:
Shaily,
J. K. Singh,
Joao R. L. Santos,
M. Zeyauddin
Abstract:
We study an isotropic flat FLRW-model in Scale-covariant theory of gravity $ f_{γδ}(φ) $ \cite{Canuto:1977zz} which is explained in terms of ordinary and covariant differentiation of scalar field $ φ$. As we know the deceleration parameter is time-dependent, so we consider the deceleration parameter $ q $ as the function of $ t $. Using this methodology, we find all the important cosmological factors in terms of a hyperbolic function of the cosmic time $ t $. In turn, we create the model having the behavior of the late-time universe, which is ever accelerated expanding and faces a Big Freeze at the end. The model shows the quintessence dark energy model from early to late times. We compute the constrained values of Hubble parameter $ H_0=70.979^{+0.021}_{-0.0043} $ and the model parameter $ n=1.24079^{+0.00015}_{-0.00079} $ using joint analysis of the $ OHD $ data of 77-points and Pantheon bin data. The model exhibits point-type singularity, beginning with a point of zero volume, infinite energy density, and temperature. Furthermore, we obtain the present deceleration parameter $ (q_0) \approx {-0.54} $. Also, we examine the ultimate behavior of our model by properly analyzing energy conditions, cosmographical parameters, and Statefinder diagnostic. Finally, the proposed model behaves like a quintessence dark energy model.
Submitted 29 April, 2024; v1 submitted 11 July, 2022;
originally announced July 2022.
-
Leveraging Emotion-specific Features to Improve Transformer Performance for Emotion Classification
Authors:
Shaily Desai,
Atharva Kshirsagar,
Aditi Sidnerlikar,
Nikhil Khodake,
Manisha Marathe
Abstract:
This paper describes the approach to the Emotion Classification shared task held at WASSA 2022 by team PVGs AI Club. This Track 2 sub-task focuses on building models which can predict a multi-class emotion label based on essays from news articles where a person, group or another entity is affected. Baseline transformer models have been demonstrating good results on sequence classification tasks, and we aim to improve this performance with the help of ensembling techniques, and by leveraging two variations of emotion-specific representations. We observe better results than our baseline models and achieve an accuracy of 0.619 and a macro F1 score of 0.520 on the emotion classification task.
Submitted 30 April, 2022;
originally announced May 2022.
-
Bouncing universe in Gauss-Bonnet gravity
Authors:
J. K. Singh,
Shaily,
Kazuharu Bamba
Abstract:
In this paper, a bouncing cosmological scenario is studied in the background of a flat FLRW model with a specific parametrized hyperbolic form of the scale factor $ a $ in terms of $ t $, where $ λ$ is taken as the model parameter. This model is discussed in the $ f(R,G) $ formalism, structured as $ f(R,G)=R+F(G) $, where $ R $ is the Ricci scalar and $ G $ is the Gauss-Bonnet invariant. The proposed functional form of the Hubble parameter is considered in such a way that it satisfies the bouncing criteria of the model, which is free from the initial singularity. The physical consequences of the model are discussed. In this model, one can see that the EoS parameter crosses the quintom line $ ω=-1 $ in the neighborhood of the bouncing point $ t\approx0 $, which is a very strong criterion for a successful bouncing cosmological model. Finally, we find that all the essential features of the bouncing model are satisfied successfully.
Submitted 10 August, 2022; v1 submitted 13 April, 2022;
originally announced April 2022.
-
Multilingual CheckList: Generation and Evaluation
Authors:
Karthikeyan K,
Shaily Bhatt,
Pankaj Singh,
Somak Aditya,
Sandipan Dandapat,
Sunayana Sitaram,
Monojit Choudhury
Abstract:
Multilingual evaluation benchmarks usually contain limited high-resource languages and do not test models for specific linguistic capabilities. CheckList is a template-based evaluation approach that tests models for specific capabilities. The CheckList template creation process requires native speakers, posing a challenge in scaling to hundreds of languages. In this work, we explore multiple approaches to generate Multilingual CheckLists. We devise an algorithm, the Template Extraction Algorithm (TEA), for automatically extracting target-language CheckList templates from machine-translated instances of source-language templates. We compare the TEA CheckLists with CheckLists created with different levels of human intervention. We further introduce metrics along the dimensions of cost, diversity, utility, and correctness to compare the CheckLists. We thoroughly analyze different approaches to creating CheckLists in Hindi. Furthermore, we experiment with 9 more languages. We find that TEA followed by human verification is ideal for scaling CheckList-based evaluation to multiple languages, while TEA alone gives good estimates of model performance.
Submitted 11 October, 2022; v1 submitted 24 March, 2022;
originally announced March 2022.
-
EEF1-NN: Efficient and EF1 allocations through Neural Networks
Authors:
Shaily Mishra,
Manisha Padala,
Sujit Gujar
Abstract:
Neural networks have shown state-of-the-art performance in designing auctions, where the network learns the optimal allocations and payment rule to ensure desirable properties. Motivated by the same, we focus on learning fair division of resources, with no payments involved. Our goal is to allocate the items, goods and/or chores, efficiently among the fair allocations. By fair, we mean an allocation that is Envy-free (EF). However, such an allocation may not always exist for indivisible resources. Therefore, we consider the relaxed notion, Envy-freeness up to one item (EF1), that is guaranteed to exist. However, it is not enough to guarantee EF1, since the allocation of empty bundles is also EF1. Hence, we add the further constraint of efficiency, maximum utilitarian social welfare (USW). In general, finding USW allocations among EF1 allocations is an NP-hard problem even when valuations are additive. In this work, we design a network for this task, which we refer to as EEF1-NN. We propose a UNet-inspired architecture, a Lagrangian loss function, and a training procedure to obtain the desired results. We show that EEF1-NN finds allocations close to the optimal USW allocation and ensures EF1 with a high probability for different distributions over input valuations. Compared to existing approaches, EEF1-NN empirically guarantees higher USW. Moreover, EEF1-NN is scalable and determines the allocations much faster than solving it as a constrained optimization problem.
Submitted 10 December, 2021;
originally announced December 2021.
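The two properties named in the EEF1-NN abstract above, EF1 and utilitarian social welfare, can be illustrated with a minimal sketch. It is not the paper's network or code: it only checks the standard textbook definitions, assuming goods only (non-negative values) and additive valuations. The function names and the example numbers are hypothetical.

```python
from typing import List, Set

Valuations = List[List[int]]   # valuations[i][g] = value of item g to agent i
Allocation = List[Set[int]]    # allocation[i] = set of items given to agent i


def bundle_value(valuations: Valuations, agent: int, bundle: Set[int]) -> int:
    """Additive value that `agent` assigns to `bundle`."""
    return sum(valuations[agent][g] for g in bundle)


def is_ef1(valuations: Valuations, allocation: Allocation) -> bool:
    """EF1 for goods: any envy of i towards j vanishes after removing
    some single item from j's bundle."""
    n = len(allocation)
    for i in range(n):
        own = bundle_value(valuations, i, allocation[i])
        for j in range(n):
            if i == j:
                continue
            other = bundle_value(valuations, i, allocation[j])
            if own >= other:
                continue  # agent i does not envy agent j
            # Envy implies j's bundle is non-empty (values are non-negative).
            best_removal = max(valuations[i][g] for g in allocation[j])
            if own < other - best_removal:
                return False
    return True


def utilitarian_welfare(valuations: Valuations, allocation: Allocation) -> int:
    """USW: sum over agents of the value of their own bundle."""
    return sum(bundle_value(valuations, i, allocation[i])
               for i in range(len(allocation)))


# Hypothetical instance: 2 agents, 3 goods.
valuations = [[5, 3, 1], [4, 4, 2]]
balanced = [{0}, {1, 2}]
empty = [set(), set()]
lopsided = [{0, 1, 2}, set()]
```

On this instance the empty allocation passes the EF1 check but has zero welfare, which is the reason the abstract gives for adding an efficiency constraint on top of EF1.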
-
Multitask Finetuning for Improving Neural Machine Translation in Indian Languages
Authors:
Shaily Desai,
Atharva Kshirsagar,
Manisha Marathe
Abstract:
Transformer-based language models have led to impressive results across all domains in Natural Language Processing. Pretraining these models on language modeling tasks and finetuning them on downstream tasks such as Text Classification, Question Answering and Neural Machine Translation has consistently shown exemplary results. In this work, we propose a Multitask Finetuning methodology which combines the Bilingual Machine Translation task with an auxiliary Causal Language Modeling task to improve performance on the former task for Indian languages. We conduct an empirical study on three language pairs, Marathi-Hindi, Marathi-English and Hindi-English, where we compare the multitask finetuning approach to the standard finetuning approach, for which we use the mBART50 model. Our study indicates that the multitask finetuning method could be a better technique than standard finetuning, and could improve Bilingual Machine Translation across language pairs.
Submitted 3 December, 2021;
originally announced December 2021.
-
An End-to-End Authentication Mechanism for Wireless Body Area Networks
Authors:
Mosarrat Jahan,
Fatema Tuz Zohra,
Md. Kamal Parvez,
Upama Kabir,
Abdul Mohaimen Al Radi,
Shaily Kabir
Abstract:
A Wireless Body Area Network (WBAN) ensures high-quality healthcare services by enabling remote and continual monitoring of patients' health conditions. The security and privacy of the sensitive health-related data transmitted through the WBAN should be preserved to maximize its benefits. In this regard, user authentication, which verifies the identities of entities involved in the communication process, is one of the primary mechanisms to protect health data. Since a WBAN carries crucial health data, every entity engaged in the data transfer process must be authenticated. In the literature, an end-to-end user authentication mechanism covering each communicating party is absent. Besides, most of the existing user authentication mechanisms are designed assuming that the patient's mobile phone is trusted. In reality, a patient's mobile phone can be stolen or compromised by malware and thus behave maliciously. Our work addresses these drawbacks and proposes an end-to-end user authentication and session key agreement scheme between sensor nodes and medical experts in a scenario where the patient's mobile phone is semi-trusted. We present a formal security analysis using BAN logic. We also provide an informal security analysis of the proposed scheme. Both studies indicate that our method is robust against well-known security attacks. In addition, our scheme achieves computation and communication costs comparable to those of related existing works. The simulation shows that our method preserves satisfactory network performance.
Submitted 11 November, 2021;
originally announced November 2021.
-
Fake News Detection: Experiments and Approaches beyond Linguistic Features
Authors:
Shaily Bhatt,
Sakshi Kalra,
Naman Goenka,
Yashvardhan Sharma
Abstract:
Easier access to the internet and social media has made disseminating information through online sources very easy. Sources like Facebook, Twitter, online news sites and personal blogs of self-proclaimed journalists have become significant players in providing news content. The sheer amount of information and the speed at which it is generated online make it practically beyond the scope of human verification. There is, hence, a pressing need to develop technologies that can assist humans with automatic fact-checking and reliable identification of fake news. This paper summarizes the multiple approaches that were undertaken and the experiments that were carried out for the task. Credibility information and metadata associated with the news article have been used for improved results. The experiments also show how modelling justification or evidence can lead to improved results. Additionally, the use of visual features in addition to linguistic features is demonstrated. A detailed comparison of the results is presented, showing that our models perform well when compared to robust baselines as well as state-of-the-art models.
Submitted 27 September, 2021;
originally announced September 2021.
-
On the Universality of Deep Contextual Language Models
Authors:
Shaily Bhatt,
Poonam Goyal,
Sandipan Dandapat,
Monojit Choudhury,
Sunayana Sitaram
Abstract:
Deep Contextual Language Models (LMs) like ELMO, BERT, and their successors dominate the landscape of Natural Language Processing due to their ability to scale across multiple tasks rapidly by pre-training a single model, followed by task-specific fine-tuning. Furthermore, multilingual versions of such models like XLM-R and mBERT have given promising results in zero-shot cross-lingual transfer, potentially enabling NLP applications in many under-served and under-resourced languages. Due to this initial success, pre-trained models are being used as `Universal Language Models' as the starting point across diverse tasks, domains, and languages. This work explores the notion of `Universality' by identifying seven dimensions across which a universal model should be able to scale, that is, perform equally well or reasonably well, to be useful across diverse settings. We outline the current theoretical and empirical results that support model performance across these dimensions, along with extensions that may help address some of their current limitations. Through this survey, we lay the foundation for understanding the capabilities and limitations of massive contextual language models and help discern research gaps and directions for future work to make these LMs inclusive and fair to diverse applications, users, and linguistic phenomena.
Submitted 18 December, 2021; v1 submitted 15 September, 2021;
originally announced September 2021.
-
Fair Allocation with Special Externalities
Authors:
Shaily Mishra,
Manisha Padala,
Sujit Gujar
Abstract:
Most of the existing algorithms for fair division do not consider externalities. Under externalities, the utility an agent obtains depends not only on its allocation but also on the allocation of other agents. An agent has a positive (negative) value for the assigned goods (chores). This work focuses on a special case of externality, i.e., an agent receives positive or negative value for unassigned items independent of which other agent gets them. We show that it is possible to adapt existing algorithms using a transformation to ensure certain fairness and efficiency notions in this setting. Despite the positive results, fairness notions like proportionality need to be re-defined. Further, we prove that maximin share (MMS) may not have any multiplicative approximation in this setting. Studying this domain is a stepping stone towards full externalities, where ensuring fairness is much more challenging.
Submitted 25 February, 2022; v1 submitted 29 August, 2021;
originally announced August 2021.
-
Towards Handling Uncertainty-at-Source in AI -- A Review and Next Steps for Interval Regression
Authors:
Shaily Kabir,
Christian Wagner,
Zack Ellerby
Abstract:
Most of statistics and AI draw insights through modelling discord or variance between sources of information (i.e., inter-source uncertainty). Increasingly, however, research is focusing upon uncertainty arising at the level of individual measurements (i.e., within- or intra-source), such as for a given sensor output or human response. Here, adopting intervals rather than numbers as the fundamental data-type provides an efficient, powerful, yet challenging way forward -- offering systematic capture of uncertainty-at-source, increasing informational capacity, and ultimately potential for insight. Following recent progress in the capture of interval-valued data, including from human participants, conducting machine learning directly upon intervals is a crucial next step. This paper focuses on linear regression for interval-valued data as a recent growth area, providing an essential foundation for broader use of intervals in AI. We conduct an in-depth analysis of state-of-the-art methods, elucidating their behaviour, advantages, and pitfalls when applied to datasets with different properties. Specific emphasis is given to the challenge of preserving mathematical coherence -- i.e., ensuring that models maintain fundamental mathematical properties of intervals throughout -- and the paper puts forward extensions to an existing approach to guarantee this. Carefully designed experiments, using both synthetic and real-world data, are conducted -- with findings presented alongside novel visualizations for interval-valued regression outputs, designed to maximise model interpretability. Finally, the paper makes recommendations concerning method suitability for data sets with specific properties and highlights remaining challenges and important next steps for developing AI with the capacity to handle uncertainty-at-source.
Submitted 27 February, 2023; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Combining Context-Free and Contextualized Representations for Arabic Sarcasm Detection and Sentiment Identification
Authors:
Amey Hengle,
Atharva Kshirsagar,
Shaily Desai,
Manisha Marathe
Abstract:
Since their inception, transformer-based language models have led to impressive performance gains across multiple natural language processing tasks. For Arabic, the current state-of-the-art results on most datasets are achieved by the AraBERT language model. Notwithstanding these recent advancements, sarcasm and sentiment detection remain challenging tasks in Arabic, given the language's rich morphology, linguistic disparity and dialectal variations. This paper presents team SPPU-AASM's submission for the WANLP ArSarcasm shared-task 2021, which centers around the sarcasm and sentiment polarity detection of Arabic tweets. The study proposes a hybrid model, combining sentence representations from AraBERT with static word vectors trained on Arabic social media corpora. The proposed system achieves an F1-sarcastic score of 0.62 and an F-PN score of 0.715 for the sarcasm and sentiment detection tasks, respectively. Simulation results show that the proposed system outperforms multiple existing approaches for both tasks, suggesting that the amalgamation of context-free and context-dependent text representations can help capture complementary facets of word meaning in Arabic. The system ranked second and tenth in the respective sub-tasks of sarcasm detection and sentiment identification.
Submitted 9 March, 2021;
originally announced March 2021.
-
A Resolution for Shared Memory Conflict in Multiprocessor System-on-a-Chip
Authors:
Shaily Mittal,
Nitin
Abstract:
Nowadays, manufacturers are focusing on increasing the concurrency in multiprocessor system-on-a-chip (MPSoC) architecture, instead of increasing clock speed, for embedded systems. Traditionally, lock-based synchronization is provided to support concurrency, but managing locks can be very difficult and error-prone. Transactional memories and lock-based systems have been extensively used to provide synchronization between multiple processors [1] in general-purpose systems. It has been shown that locks have numerous shortcomings compared with transactional memory in terms of power consumption, ease of programming and performance. In this paper, we propose a new semaphore scheme for synchronization in shared cache memory in an MPSoC. Moreover, we have evaluated and compared our scheme with locks and transactions in terms of energy consumption and cache miss rate using the SimpleScalar functional simulator.
Submitted 3 February, 2012;
originally announced February 2012.