Search | arXiv e-print repository

Effect of dimensionality change on the bias of word embeddings

Abstract: Word embedding methods (WEMs) are extensively used for representing text data. The dimensionality of these embeddings varies across various tasks and implementations. The effect of dimensionality change on the accuracy of the downstream task is a well-explored question. However, how the dimensionality change affects the bias of word embeddings needs to be investigated. Using the English Wikipedia corpus, we study this effect for two static (Word2Vec and fastText) and two context-sensitive (ELMo and BERT) WEMs. We have two observations. First, there is a significant variation in the bias of word embeddings with the dimensionality change. Second, there is no uniformity in how the dimensionality change affects the bias of word embeddings. These factors should be considered while selecting the dimensionality of word embeddings.

Submitted 28 December, 2023; originally announced December 2023.

Comments: Accepted for publication in the Young Research Symposium Track of ACM CODS-COMADS 2024. 2 pages

arXiv:2207.11596 [pdf, ps, other]

Constructive comparison in bidding combinatorial games

Authors: Prem Kant, Urban Larsson, Ravi K. Rai, Akshay V. Upasany

Abstract: A class of discrete Bidding Combinatorial Games that generalize alternating normal play was introduced by Kant, Larsson, Rai, and Upasany (2022). The major questions concerning optimal outcomes were resolved. By generalizing standard game comparison techniques from alternating normal play, we propose an algorithmic play-solution to the problem of game comparison for bidding games. We demonstrate some consequences of this result that generalize classical results in alternating play (from Winning Ways 1982 and On Numbers and Games 1976). In particular, integers, dyadics and numbers have many nice properties, such as group structures, but on the other hand the game * is non-invertible. We state a couple of thrilling conjectures and open problems for readers to dive into this promising path of bidding combinatorial games.

Submitted 30 October, 2023; v1 submitted 23 July, 2022; originally announced July 2022.

Comments: 23 pages, 1 figure

MSC Class: 91A46; 91A05

arXiv:2207.08073 [pdf, ps, other]

Bidding combinatorial games

Authors: Prem Kant, Urban Larsson, Ravi K. Rai, Akshay V. Upasany

Abstract: Combinatorial Game Theory is a branch of mathematics and theoretical computer science that studies sequential 2-player games with perfect information. Normal play is the convention where a player who cannot move loses. Here, we generalize the classical alternating normal play to infinitely many game families, by means of discrete Richman auctions (Develin et al. 2010, Larsson et al. 2021, Lazarus et al. 1996). We generalize the notion of a perfect play outcome, and find an exact characterization of outcome feasibility. As a main result, we prove existence of a game form for each such outcome class; then we describe their lattice structures. By imposing restrictions to the general families, such as impartial and symmetric termination, we find surprising analogies with alternating play.

Submitted 30 October, 2023; v1 submitted 17 July, 2022; originally announced July 2022.

Comments: 5 figures

MSC Class: 91A46; 91A05

arXiv:2206.07772 [pdf, other]

Deep Learning and Handheld Augmented Reality Based System for Optimal Data Collection in Fault Diagnostics Domain

Authors: Ryan Nguyen, Rahul Rai

Abstract: Compared to current AI or robotic systems, humans navigate their environment with ease, making tasks such as data collection trivial. However, humans find it harder to model complex relationships hidden in the data. AI systems, especially deep learning (DL) algorithms, impressively capture those complex relationships. Symbiotically coupling humans and computational machines' strengths can simultaneously minimize the collected data required and build complex input-to-output mapping models. This paper enables this coupling by presenting a novel human-machine interaction framework to perform fault diagnostics with minimal data. Collecting data for diagnosing faults for complex systems is difficult and time-consuming. Minimizing the required data will increase the practicability of data-driven models in diagnosing faults. The framework provides instructions to a human user to collect data that mitigates the difference between the data used to train and test the fault diagnostics model. The framework is composed of three components: (1) a reinforcement learning algorithm for data collection to develop a training dataset, (2) a deep learning algorithm for diagnosing faults, and (3) a handheld augmented reality application for data collection for testing data. The proposed framework has provided above 100% precision and recall on a novel dataset with only one instance of each fault condition. Additionally, a usability study was conducted to gauge the user experience of the handheld augmented reality application, and all users were able to follow the provided steps.

Submitted 15 June, 2022; originally announced June 2022.

arXiv:2206.07762 [pdf, other]

doi 10.1016/j.ymssp.2022.109611

Physics-Infused Fuzzy Generative Adversarial Network for Robust Failure Prognosis

Authors: Ryan Nguyen, Shubhendu Kumar Singh, Rahul Rai

Abstract: Prognostics aid in the longevity of fielded systems or products. Quantifying the system's current health enables prognosis to enhance the operator's decision-making to preserve the system's health. Creating a prognosis for a system can be difficult due to (a) unknown physical relationships and/or (b) irregularities in data appearing well beyond the initiation of a problem. Traditionally, three different modeling paradigms have been used to develop a prognostics model: physics-based (PbM), data-driven (DDM), and hybrid modeling. Recently, the hybrid modeling approach that combines the strength of both PbM and DDM based approaches and alleviates their limitations is gaining traction in the prognostics domain. In this paper, a novel hybrid modeling approach for prognostics applications based on combining concepts from fuzzy logic and generative adversarial networks (GANs) is outlined. The FuzzyGAN based method embeds a physics-based model in the aggregation of the fuzzy implications. This technique constrains the output of the learning method to a realistic solution. Results on a bearing problem showcase the efficacy of adding a physics-based aggregation in a fuzzy logic model to improve GAN's ability to model health and give a more accurate system prognosis.

Submitted 15 June, 2022; originally announced June 2022.

arXiv:2112.01845 [pdf, other]

Semantic Map Injected GAN Training for Image-to-Image Translation

Authors: Balaram Singh Kshatriya, Shiv Ram Dubey, Himangshu Sarma, Kunal Chaudhary, Meva Ram Gurjar, Rahul Rai, Sunny Manchanda

Abstract: Image-to-image translation is the recent trend to transform images from one domain to another domain using a generative adversarial network (GAN). The existing GAN models perform the training by only utilizing the input and output modalities of transformation. In this paper, we perform the semantic injected training of GAN models. Specifically, we train with original input and output modalities and inject a few epochs of training for translation from input to semantic map. Let us refer to the original training as the training for the translation of input image into target domain. The injection of semantic training in the original training improves the generalization capability of the trained GAN model. Moreover, it also preserves the categorical information in a better way in the generated image. The semantic map is only utilized at the training time and is not required at the test time. The experiments are performed using state-of-the-art GAN models over CityScapes and RGB-NIR stereo datasets. We observe the improved performance in terms of the SSIM, FID and KID scores after injecting semantic training as compared to original training.

Submitted 3 December, 2021; originally announced December 2021.

Comments: Accepted in Fourth Workshop on Computer Vision Applications (WCVA) at ICVGIP 2021

arXiv:2110.14588 [pdf, other]

Fuzzy Generative Adversarial Networks

Authors: Ryan Nguyen, Shubhendu Kumar Singh, Rahul Rai

Abstract: Generative Adversarial Networks (GANs) are well-known tools for data generation and semi-supervised classification. GANs, with less labeled data, outperform Deep Neural Networks (DNNs) and Convolutional Neural Networks (CNNs) in classification across various tasks; this shows promise for developing GANs capable of trespassing into the domain of semi-supervised regression. However, developing GANs for regression introduces two major challenges: (1) inherent instability in the GAN formulation and (2) performing regression and achieving stability simultaneously. This paper introduces techniques that show improvement in the GANs' regression capability through mean absolute error (MAE) and mean squared error (MSE). We bake a differentiable fuzzy logic system at multiple locations in a GAN because fuzzy logic systems have demonstrated high efficacy in classification and regression settings. The fuzzy logic takes the output of either or both the generator and the discriminator to either or both predict the output, $y$, and evaluate the generator's performance. We outline the results of applying the fuzzy logic system to CGAN and summarize each approach's efficacy. This paper shows that adding a fuzzy logic layer can enhance GAN's ability to perform regression; the most desirable injection location is problem-specific, and we show this through experiments over various datasets. Besides, we demonstrate empirically that the fuzzy-infused GAN is competitive with DNNs.

Submitted 27 October, 2021; originally announced October 2021.

arXiv:2103.02588 [pdf, other]

doi 10.1016/j.cma.2022.115060

IH-GAN: A Conditional Generative Model for Implicit Surface-Based Inverse Design of Cellular Structures

Authors: Jun Wang, Wei Wayne Chen, Daicong Da, Mark Fuge, Rahul Rai

Abstract: Variable-density cellular structures can overcome connectivity and manufacturability issues of topologically optimized structures, particularly those represented as discrete density maps. However, the optimization of such cellular structures is challenging due to the multiscale design problem. Past work addressing this problem generally either only optimizes the volume fraction of single-type unit cells but ignores the effects of unit cell geometry on properties, or considers the geometry-property relation but builds this relation via heuristics. In contrast, we propose a simple yet more principled way to accurately model the property to geometry mapping using a conditional deep generative model, named Inverse Homogenization Generative Adversarial Network (IH-GAN). It learns the conditional distribution of unit cell geometries given properties and can realize the one-to-many mapping from properties to geometries. We further reduce the complexity of IH-GAN by using the implicit function parameterization to represent unit cell geometries. Results show that our method can 1) generate various unit cells that satisfy given material properties with high accuracy ($R^2$-scores between target properties and properties of generated unit cells $>98\%$) and 2) improve the optimized structural performance over the conventional variable-density single-type structure. In the minimum compliance example, our IH-GAN generated structure achieves a $79.7\%$ reduction in concentrated stress and an extra $3.03\%$ reduction in displacement. In the target deformation examples, our IH-GAN generated structure reduces the target matching error by $86.4\%$ and $79.6\%$ for two test cases, respectively. We also demonstrated that the connectivity issue for multi-type unit cells can be solved by transition layer blending.

Submitted 11 May, 2022; v1 submitted 3 March, 2021; originally announced March 2021.

Comments: Accepted by Computer Methods in Applied Mechanics and Engineering (CMAME)

arXiv:2103.02083 [pdf, other]

Uncertainty guided semi-supervised segmentation of retinal layers in OCT images

Authors: Suman Sedai, Bhavna Antony, Ravneet Rai, Katie Jones, Hiroshi Ishikawa, Joel Schuman, Wollstein Gadi, Rahil Garnavi

Abstract: Deep convolutional neural networks have shown outstanding performance in medical image segmentation tasks. The usual problem when training supervised deep learning methods is the lack of labeled data, which is time-consuming and costly to obtain. In this paper, we propose a novel uncertainty-guided semi-supervised learning based on a student-teacher approach for training the segmentation network using limited labeled samples and a large number of unlabeled images. First, a teacher segmentation model is trained from the labeled samples using Bayesian deep learning. The trained model is used to generate soft segmentation labels and uncertainty maps for the unlabeled set. The student model is then updated using the softly segmented samples and the corresponding pixel-wise confidence of the segmentation quality estimated from the uncertainty of the teacher model using a newly designed loss function. Experimental results on a retinal layer segmentation task show that the proposed method improves the segmentation performance in comparison to the fully supervised approach and is on par with the expert annotator. The proposed semi-supervised segmentation framework is a key contribution and applicable for biomedical image segmentation across various imaging modalities where access to annotated medical images is challenging.

Submitted 2 March, 2021; originally announced March 2021.

Comments: MICCAI,19

Journal ref: MICCAI 2019 pp 282-290

arXiv:2012.04794 [pdf, other]

Deep Learning based Multi-Modal Sensing for Tracking and State Extraction of Small Quadcopters

Authors: Zhibo Zhang, Chen Zeng, Maulikkumar Dhameliya, Souma Chowdhury, Rahul Rai

Abstract: This paper proposes a multi-sensor based approach to detect, track, and localize a quadcopter unmanned aerial vehicle (UAV). Specifically, a pipeline is developed to process monocular RGB and thermal video (captured from a fixed platform) to detect and track the UAV in our FoV. Subsequently, a 2D planar lidar is used to allow conversion of pixel data to actual distance measurements, and thereby enable localization of the UAV in global coordinates. The monocular data is processed through a deep learning-based object detection method that computes an initial bounding box for the UAV. The thermal data is processed through a thresholding and Kalman filter approach to detect and track the bounding box. Training and testing data are prepared by combining a set of original experiments conducted in a motion capture environment and publicly available UAV image data. The new pipeline compares favorably to existing methods and demonstrates promising tracking and localization capacity of sample experiments.

Submitted 8 December, 2020; originally announced December 2020.

arXiv:2004.08417 [pdf, other]

Identifying Weakly Connected Subsystems in Building Energy Model for Effective Load Estimation in Presence of Parametric Uncertainty

Authors: Arpan Mukherjee, Anna Kuechle Szweda, Andrew Alegria, Rahul Rai, Tarunraj Singh

Abstract: It is necessary to estimate the expected energy usage of a building to determine how to reduce energy usage. The expected energy usage of a building can be reliably simulated using a Building Energy Model (BEM). Many of the numerous input parameters in a BEM are uncertain. To ensure that the building simulation is sufficiently accurate, and to better understand the impact of imprecisions in the input parameters and calculation methods, it is desirable to quantify uncertainty in the BEM throughout the modeling process. Uncertainty quantification (UQ) typically requires a large number of simulations to produce meaningful data, which, due to the vast number of input parameters and the dynamic nature of building simulation, is computationally expensive. Uncertainty Quantification (UQ) in BEM domain is thus intractable due to the size of the problem and parameters involved and hence it needs an advanced methodology for analysis. The current paper outlines a novel Weakly-Connected-Systems (WCSs) identification-based UQ framework developed to propagate the quantifiable uncertainty in the BEM. The overall approach is demonstrated on the physics-based thermal model of an actual building in Central New York.

Submitted 17 April, 2020; originally announced April 2020.

arXiv:2003.10025 [pdf, other]

Interpretable machine learning models: a physics-based view

Authors: Ion Matei, Johan de Kleer, Christoforos Somarakis, Rahul Rai, John S. Baras

Abstract: To understand changes in physical systems and facilitate decisions, explaining how model predictions are made is crucial. We use model-based interpretability, where models of physical systems are constructed by composing basic constructs that explain locally how energy is exchanged and transformed. We use the port Hamiltonian (p-H) formalism to describe the basic constructs that contain physically interpretable processes commonly found in the behavior of physical systems. We describe how we can build models out of the p-H constructs and how we can train them. In addition we show how we can impose physical properties such as dissipativity that ensure numerical stability of the training process. We give examples on how to build and train models for describing the behavior of two physical systems: the inverted pendulum and swarm dynamics.

Submitted 22 March, 2020; originally announced March 2020.

arXiv:2003.05635 [pdf, ps, other]

Discrete Richman-bidding Scoring Games

Authors: Urban Larsson, Neel Patel, Ravi Kant Rai

Abstract: We study zero-sum (combinatorial) games, within the framework of so-called Richman auctions (Lazarus et al. 1996) namely, we modify the alternating play scoring ruleset Cumulative Subtraction (CS) (Cohensius et al. 2019), to a discrete bidding scheme (similar to Develin and Payne 2010). Players bid to move and the player with the highest bid wins the move, and hands over the winning bidding amount to the other player. The new game is dubbed Bidding Cumulative Subtraction (BCS). In so-called unitary games, players remove exactly one item out of a single heap of identical items, until the heap is empty, and their actions contribute to a common score, which increases or decreases by one unit depending on whether the maximizing player won the turn or not. We show that there is a unique bidding equilibrium for a much larger class of games, that generalize standard scoring play in the literature. We prove that for all sufficiently large heap sizes, the equilibrium outcomes of unitary BCS are eventually periodic, with period 2, and we show that the periodicity appears at the latest for heaps of sizes quadratic in the total budget.

Submitted 12 March, 2020; originally announced March 2020.

Comments: 26 Pages, 6 figures

MSC Class: 91A46; 11B75; 91A05

arXiv:2003.02671 [pdf, other]

Hybrid modeling: Applications in real-time diagnosis

Authors: Ion Matei, Johan de Kleer, Alexander Feldman, Rahul Rai, Souma Chowdhury

Abstract: Reduced-order models that accurately abstract high fidelity models and enable faster simulation are vital for real-time, model-based diagnosis applications. In this paper, we outline a novel hybrid modeling approach that combines machine learning inspired models and physics-based models to generate reduced-order models from high fidelity models. We are using such models for real-time diagnosis applications. Specifically, we have developed machine learning inspired representations to generate reduced order component models that preserve, in part, the physical interpretation of the original high fidelity component models. To ensure the accuracy, scalability and numerical stability of the learning algorithms when training the reduced-order models we use optimization platforms featuring automatic differentiation. Training data is generated by simulating the high-fidelity model. We showcase our approach in the context of fault diagnosis of a rail switch system. Three new model abstractions whose complexities are two orders of magnitude smaller than the complexity of the high fidelity model, both in the number of equations and simulation time are shown. The numerical experiments and results demonstrate the efficacy of the proposed hybrid modeling approach.

Submitted 3 March, 2020; originally announced March 2020.

arXiv:1808.01729 [pdf, other]

Executable Trigger-Action Comments

Authors: Pengyu Nie, Rishabh Rai, Junyi Jessy Li, Sarfraz Khurshid, Raymond J. Mooney, Milos Gligoric

Abstract: Natural language elements, e.g., todo comments, are frequently used to communicate among the developers and to describe tasks that need to be performed (actions) when specific conditions hold in the code repository (triggers). As projects evolve, development processes change, and development teams reorganize, these comments, because of their informal nature, frequently become irrelevant or forgotten. We present the first technique, dubbed TrigIt, to specify trigger-action todo comments as executable statements. Thus, actions are executed automatically when triggers evaluate to true. TrigIt specifications are written in the host language (e.g., Java) and are evaluated as part of the build process. The triggers are specified as query statements over abstract syntax trees and abstract representation of build configuration scripts, and the actions are specified as code transformation steps. We implemented TrigIt for the Java programming language and migrated 20 existing trigger-action comments from 8 popular open-source projects. We evaluate the cost of using TrigIt in terms of the number of tokens in the executable comments and the time overhead introduced in the build process.

Submitted 6 August, 2018; originally announced August 2018.

arXiv:1309.2426 [pdf]

doi 10.5121/ijccsa.2013.3401

Securing Software as a Service Model of Cloud Computing: Issues and Solutions

Authors: Rashmi Rai, G. Sahoo, S. Mehfuz

Abstract: Cloud computing, undoubtedly, has become the buzzword in the IT industry today. Looking at the potential impact it has on numerous business applications as well as in our everyday life, it can certainly be said that this disruptive technology is here to stay. Many of the features that make cloud computing attractive have not just challenged the existing security system, but have also revealed new security issues. This paper provides an insightful analysis of the existing status on cloud computing security issues based on a detailed survey carried out by the author. It also makes an attempt to describe the security challenges in Software as a Service (SaaS) model of cloud computing and also endeavors to provide future security research directions.

Submitted 10 September, 2013; originally announced September 2013.

Journal ref: International Journal on Cloud Computing: Services and Architecture (IJCCSA), Vol. 3, No. 4, August 2013

Showing 1–16 of 16 results for author: Rai, R