-
Structural, magnetic and x-ray absorption spectroscopy studies of new Cr-based low, medium and high-entropy spinel oxides
Authors:
Sushanta Mandal,
Jyoti Sharma,
Tirthankar Chakraborty,
Sanjoy Kr. Mahatha,
Sourav Marik
Abstract:
The emergence of high-entropy oxides has spurred significant research interest in recent times. These compounds exhibit exotic functional properties that often transcend simple linear combinations of their constituent elements. Herein, we present a new series of Cr-based low, medium, and high entropy spinel oxides with composition NiCr2O4, [Ni0.5Mn0.5]Cr2O4, [Ni0.33Mn0.33Co0.33]Cr2O4, [Ni0.25Mn0.25Co0.25Cu0.25]Cr2O4, [Ni0.2Mn0.2Co0.2Cu0.2Zn0.2]Cr2O4, and [Ni0.2Mg0.2Co0.2Cu0.2Zn0.2]Cr2O4. We conducted detailed structural (X-ray and Neutron diffraction), microstructural, Raman spectroscopy, magnetic, and X-ray absorption spectroscopy measurements on these materials. Our study reveals that the incorporation of multiple cations at the A-site of the structure (AB2O4) significantly modulates the magnetic properties. These compounds exhibit transitions from complex ferrimagnetic ([Ni0.2Mn0.2Co0.2Cu0.2Zn0.2]Cr2O4) to antiferromagnetic ([Ni0.2Mg0.2Co0.2Cu0.2Zn0.2]Cr2O4) states, with remarkable coercivity variations, demonstrating the ability to tailor magnetic responses through compositional design.
Submitted 10 June, 2024;
originally announced June 2024.
-
Feature Selection using the concept of Peafowl Mating in IDS
Authors:
Partha Ghosh,
Joy Sharma,
Nilesh Pandey
Abstract:
Cloud computing has high applicability as an Internet-based service that relies on sharing computing resources. It provides services that are infrastructure-based, platform-based and software-based. The popularity of this technology is due to its strong performance, high level of computing ability, low cost of services, scalability, availability and flexibility. The availability and openness of data in a cloud environment make it vulnerable to cyber-attacks. An Intrusion Detection System (IDS) is used to detect these attacks, identifying them and ensuring information security. A coherent and proficient IDS is proposed in this paper to achieve higher certainty levels regarding safety in a cloud environment. The mating behavior of peafowl is incorporated into an optimization algorithm, which in turn is used as a feature selection algorithm. The algorithm reduces the huge size of cloud data so that the IDS can work efficiently on the cloud to detect intrusions. The proposed model has been evaluated on the NSL-KDD dataset as well as the Kyoto dataset and has proved to be a better and more efficient IDS.
Submitted 3 February, 2024;
originally announced February 2024.
-
Searching, fast and slow, through product catalogs
Authors:
Dayananda Ubrangala,
Juhi Sharma,
Sharath Kumar Rangappa,
Kiran R,
Ravi Prasad Kondapalli,
Laurent Boué
Abstract:
String matching algorithms in the presence of abbreviations, such as in Stock Keeping Unit (SKU) product catalogs, remain a relatively unexplored topic. In this paper, we present a unified architecture for SKU search that provides both a real-time suggestion system (based on a Trie data structure) as well as a lower latency search system (making use of character level TF-IDF in combination with language model vector embeddings) where users initiate the search process explicitly. We carry out ablation studies that justify designing a complex search system composed of multiple components to address the delicate trade-off between speed and accuracy. Using SKU search in the Dynamics CRM as an example, we show how our system vastly outperforms, in all aspects, the results provided by the default search engine. Finally, we show how SKU descriptions may be enhanced via generative text models (using gpt-3.5-turbo) so that the consumers of the search results may get more context and a generally better experience when presented with the results of their SKU search.
Submitted 1 January, 2024;
originally announced January 2024.
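The Trie-based suggestion component mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the class names, the `suggest` method, the lower-casing step and the sample catalog entries are all assumptions made for illustration.

```python
class TrieNode:
    def __init__(self):
        self.children = {}    # maps a character to the next TrieNode
        self.is_end = False   # True when a catalog entry ends at this node


class Trie:
    """Hypothetical prefix index for real-time suggestions."""

    def __init__(self):
        self.root = TrieNode()

    def insert(self, entry):
        node = self.root
        for ch in entry.lower():
            node = node.children.setdefault(ch, TrieNode())
        node.is_end = True

    def suggest(self, prefix, limit=5):
        # Walk down to the node that represents the prefix.
        node = self.root
        for ch in prefix.lower():
            node = node.children.get(ch)
            if node is None:
                return []
        # Depth-first traversal; children are pushed in reverse sorted
        # order so that entries are popped in alphabetical order.
        results = []
        stack = [(node, prefix.lower())]
        while stack and len(results) < limit:
            current, text = stack.pop()
            if current.is_end:
                results.append(text)
            for ch in sorted(current.children, reverse=True):
                stack.append((current.children[ch], text + ch))
        return results


# Made-up catalog entries, not taken from the paper.
catalog = Trie()
for sku in ["office 365 e3", "office 365 e5", "dynamics crm", "dyn 365 sales"]:
    catalog.insert(sku)

print(catalog.suggest("off"))
```

Lookup cost depends on the prefix length and the number of suggestions returned rather than on catalog size, which is why a structure of this kind suits as-you-type suggestions.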
-
Generalized Convergence Analysis of Tsetlin Machines: A Probabilistic Approach to Concept Learning
Authors:
Mohamed-Bachir Belaid,
Jivitesh Sharma,
Lei Jiao,
Ole-Christoffer Granmo,
Per-Arne Andersen,
Anis Yazidi
Abstract:
Tsetlin Machines (TMs) have garnered increasing interest for their ability to learn concepts via propositional formulas and their proven efficiency across various application domains. Despite this, the convergence proof for the TMs, particularly for the AND operator (conjunction of literals), in the generalized case (inputs greater than two bits) remains an open problem. This paper aims to fill this gap by presenting a comprehensive convergence analysis of Tsetlin automaton-based Machine Learning algorithms. We introduce a novel framework, referred to as Probabilistic Concept Learning (PCL), which simplifies the TM structure while incorporating dedicated feedback mechanisms and dedicated inclusion/exclusion probabilities for literals. Given $n$ features, PCL aims to learn a set of conjunction clauses $C_i$ each associated with a distinct inclusion probability $p_i$. Most importantly, we establish a theoretical proof confirming that, for any clause $C_k$, PCL converges to a conjunction of literals when $0.5<p_k<1$. This result serves as a stepping stone for future research on the convergence properties of Tsetlin automaton-based learning algorithms. Our findings not only contribute to the theoretical understanding of Tsetlin Machines but also have implications for their practical application, potentially leading to more robust and interpretable machine learning models.
Submitted 3 October, 2023;
originally announced October 2023.
-
CorrEmbed: Evaluating Pre-trained Model Image Similarity Efficacy with a Novel Metric
Authors:
Karl Audun Kagnes Borgersen,
Morten Goodwin,
Jivitesh Sharma,
Tobias Aasmoe,
Mari Leonhardsen,
Gro Herredsvela Rørvik
Abstract:
Detecting visually similar images is a particularly useful attribute to look to when calculating product recommendations. Embedding similarity, which utilizes pre-trained computer vision models to extract high-level image features, has demonstrated remarkable efficacy in identifying images with similar compositions. However, there is a lack of methods for evaluating the embeddings generated by these models, as conventional loss and performance metrics do not adequately capture their performance in image similarity search tasks.
In this paper, we evaluate the viability of the image embeddings from numerous pre-trained computer vision models using a novel approach named CorrEmbed. Our approach computes the correlation between distances in image embeddings and distances in human-generated tag vectors. We extensively evaluate numerous pre-trained Torchvision models using this metric, revealing an intuitive relationship of linear scaling between ImageNet1k accuracy scores and tag-correlation scores. Importantly, our method also identifies deviations from this pattern, providing insights into how different models capture high-level image features.
By offering a robust performance evaluation of these pre-trained models, CorrEmbed serves as a valuable tool for researchers and practitioners seeking to develop effective, data-driven approaches to similar item recommendations in fashion retail.
Submitted 30 August, 2023;
originally announced August 2023.
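The idea of correlating distances in embedding space with distances in tag space can be sketched as follows. The abstract does not specify the distance measure or the correlation coefficient, so Euclidean distance and Pearson correlation are assumptions here, as are the function names and the toy vectors.

```python
import math
from itertools import combinations


def pairwise_distances(vectors):
    """Euclidean distance for every unordered pair of vectors."""
    return [math.dist(a, b) for a, b in combinations(vectors, 2)]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def tag_correlation(embeddings, tag_vectors):
    """Correlate embedding distances with tag-vector distances."""
    return pearson(pairwise_distances(embeddings),
                   pairwise_distances(tag_vectors))


# Toy data (hypothetical): the tag vectors are a scaled copy of the
# embeddings, so all pairwise distances scale by the same factor.
embeddings = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
tag_vectors = [(0.0, 0.0), (3.0, 0.0), (0.0, 6.0)]
score = tag_correlation(embeddings, tag_vectors)
print(round(score, 6))
```

A score near 1 means that items far apart in embedding space also tend to be far apart in tag space, which is the property such a metric is meant to reward.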
-
Domain specificity and data efficiency in typo tolerant spell checkers: the case of search in online marketplaces
Authors:
Dayananda Ubrangala,
Juhi Sharma,
Ravi Prasad Kondapalli,
Kiran R,
Amit Agarwala,
Laurent Boué
Abstract:
Typographical errors are a major source of frustration for visitors of online marketplaces. Because of the domain-specific nature of these marketplaces and the very short queries users tend to search for, traditional spell checking solutions do not perform well in correcting typos. We present a data augmentation method to address the lack of annotated typo data and train a recurrent neural network to learn context-limited domain-specific embeddings. Those embeddings are deployed in a real-time inferencing API for the Microsoft AppSource marketplace to find the closest match between a misspelled user query and the available product names. Our data efficient solution shows that controlled high quality synthetic data may be a powerful tool especially considering the current climate of large language models which rely on prohibitively huge and often uncontrolled datasets.
Submitted 3 August, 2023;
originally announced August 2023.
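Synthetic typo data of the kind described in the abstract above could be generated along the following lines. The abstract does not describe the actual augmentation procedure, so the four edit operations, the function names and the sample product names are assumptions chosen only to illustrate the general idea.

```python
import random

ALPHABET = "abcdefghijklmnopqrstuvwxyz"


def make_typo(word, rng):
    """Apply one random character-level edit to `word` (hypothetical scheme)."""
    if len(word) < 2:
        return word
    op = rng.choice(["delete", "swap", "substitute", "insert"])
    i = rng.randrange(len(word) - 1)   # position of the edit
    letter = rng.choice(ALPHABET)      # used by substitute and insert
    if op == "delete":
        return word[:i] + word[i + 1:]
    if op == "swap":
        return word[:i] + word[i + 1] + word[i] + word[i + 2:]
    if op == "substitute":
        return word[:i] + letter + word[i + 1:]
    return word[:i] + letter + word[i:]


def augment(names, per_name=3, seed=0):
    """Return (noisy, clean) training pairs; a fixed seed keeps runs repeatable."""
    rng = random.Random(seed)
    return [(make_typo(name, rng), name)
            for name in names for _ in range(per_name)]


# Made-up product names, not taken from the paper.
pairs = augment(["power bi", "teams"], per_name=3, seed=0)
for noisy, clean in pairs:
    print(noisy, "->", clean)
```

A substitution or swap can occasionally reproduce the original string, so a real pipeline would likely filter out pairs where the noisy and clean forms are identical.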
-
Verifying Properties of Tsetlin Machines
Authors:
Emilia Przybysz,
Bimal Bhattarai,
Cosimo Persia,
Ana Ozaki,
Ole-Christoffer Granmo,
Jivitesh Sharma
Abstract:
Tsetlin Machines (TsMs) are a promising and interpretable machine learning method which can be applied for various classification tasks. We present an exact encoding of TsMs into propositional logic and formally verify properties of TsMs using a SAT solver. In particular, we introduce in this work a notion of similarity of machine learning models and apply our notion to check for similarity of TsMs. We also consider notions of robustness and equivalence from the literature and adapt them for TsMs. Then, we show the correctness of our encoding and provide results for the properties: adversarial robustness, equivalence, and similarity of TsMs. In our experiments, we employ the MNIST and IMDB datasets for (respectively) image and sentiment classification. We compare the robustness verification results obtained with TsMs against those in the literature obtained with Binarized Neural Networks on MNIST.
Submitted 2 July, 2023; v1 submitted 25 March, 2023;
originally announced March 2023.
-
Building Concise Logical Patterns by Constraining Tsetlin Machine Clause Size
Authors:
K. Darshana Abeyrathna,
Ahmed Abdulrahem Othman Abouzeid,
Bimal Bhattarai,
Charul Giri,
Sondre Glimsdal,
Ole-Christoffer Granmo,
Lei Jiao,
Rupsa Saha,
Jivitesh Sharma,
Svein Anders Tunheim,
Xuan Zhang
Abstract:
Tsetlin machine (TM) is a logic-based machine learning approach with the crucial advantages of being transparent and hardware-friendly. While TMs match or surpass deep learning accuracy for an increasing number of applications, large clause pools tend to produce clauses with many literals (long clauses). As such, they become less interpretable. Further, longer clauses increase the switching activity of the clause logic in hardware, consuming more power. This paper introduces a novel variant of TM learning - Clause Size Constrained TMs (CSC-TMs) - where one can set a soft constraint on the clause size. As soon as a clause includes more literals than the constraint allows, it starts expelling literals. Accordingly, oversized clauses only appear transiently. To evaluate CSC-TM, we conduct classification, clustering, and regression experiments on tabular data, natural language text, images, and board games. Our results show that CSC-TM maintains accuracy with up to 80 times fewer literals. Indeed, the accuracy increases with shorter clauses for TREC, IMDb, and BBC Sports. After the accuracy peaks, it drops gracefully as the clause size approaches a single literal. We finally analyze CSC-TM power consumption and derive new convergence properties.
Submitted 19 January, 2023;
originally announced January 2023.
-
Tsetlin Machine Embedding: Representing Words Using Logical Expressions
Authors:
Bimal Bhattarai,
Ole-Christoffer Granmo,
Lei Jiao,
Rohan Yadav,
Jivitesh Sharma
Abstract:
Embedding words in vector space is a fundamental first step in state-of-the-art natural language processing (NLP). Typical NLP solutions employ pre-defined vector representations to improve generalization by co-locating similar words in vector space. For instance, Word2Vec is a self-supervised predictive model that captures the context of words using a neural network. Similarly, GloVe is a popular unsupervised model incorporating corpus-wide word co-occurrence statistics. Such word embeddings have significantly boosted important NLP tasks, including sentiment analysis, document classification, and machine translation. However, the embeddings are dense floating-point vectors, making them expensive to compute and difficult to interpret. In this paper, we instead propose to represent the semantics of words with a few defining words that are related using propositional logic. To produce such logical embeddings, we introduce a Tsetlin Machine-based autoencoder that learns logical clauses self-supervised. The clauses consist of contextual words like "black," "cup," and "hot" to define other words like "coffee," thus being human-understandable. We evaluate our embedding approach on several intrinsic and extrinsic benchmarks, outperforming GloVe on six classification tasks. Furthermore, we investigate the interpretability of our embedding using the logical representations acquired during training. We also visualize word clusters in vector space, demonstrating how our logical embedding co-locates similar words.
Submitted 2 January, 2023;
originally announced January 2023.
-
On the Equivalence of the Weighted Tsetlin Machine and the Perceptron
Authors:
Jivitesh Sharma,
Ole-Christoffer Granmo,
Lei Jiao
Abstract:
Tsetlin Machine (TM) has been gaining popularity as an inherently interpretable machine learning method that is able to achieve promising performance with low computational complexity on a variety of applications. The interpretability and the low computational complexity of the TM are inherited from the Boolean expressions for representing various sub-patterns. Although possessing favorable properties, TM has not been the go-to method for AI applications, mainly due to its conceptual and theoretical differences compared with perceptrons and neural networks, which are more widely known and well understood. In this paper, we provide detailed insights for the operational concept of the TM, and try to bridge the gap in the theoretical understanding between the perceptron and the TM. More specifically, we study the operational concept of the TM following the analytical structure of perceptrons, showing the resemblance between the perceptrons and the TM. Through the analysis, we show that the TM's weight update can be considered as a special case of the gradient weight update. We also perform an empirical analysis of TM by showing the flexibility in determining the clause length, visualization of decision boundaries and obtaining interpretable Boolean expressions from TM. In addition, we also discuss the advantages of TM in terms of its structure and its ability to solve more complex problems.
Submitted 27 December, 2022;
originally announced December 2022.
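The resemblance between a weighted TM and a perceptron can be seen in a small inference-only sketch: each clause is a conjunction of literals, and the prediction thresholds a weighted sum of clause outputs, much as a perceptron thresholds a weighted sum of inputs. The function names, the folding of clause polarity into signed weights, the threshold at zero and the hand-picked XOR clauses are all simplifying assumptions; the learning rule discussed in the paper is not reproduced here.

```python
def clause_output(x, include_pos, include_neg):
    """AND of the included literals.

    include_pos lists feature indices that must be 1;
    include_neg lists feature indices that must be 0 (negated literals).
    """
    return int(all(x[i] == 1 for i in include_pos)
               and all(x[i] == 0 for i in include_neg))


def weighted_vote(x, clauses):
    """Weighted sum of clause outputs, analogous to a perceptron's pre-activation."""
    return sum(weight * clause_output(x, pos, neg)
               for weight, pos, neg in clauses)


def predict(x, clauses):
    """Class 1 if the weighted vote is positive, else class 0 (assumed threshold)."""
    return int(weighted_vote(x, clauses) > 0)


# Hand-written clauses for XOR, as (signed weight, positive literals, negated literals).
xor_clauses = [
    (+2, [0], [1]),     # x0 AND NOT x1
    (+2, [1], [0]),     # x1 AND NOT x0
    (-1, [0, 1], []),   # x0 AND x1
    (-1, [], [0, 1]),   # NOT x0 AND NOT x1
]

inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]
print([predict(x, xor_clauses) for x in inputs])
```

Each clause remains readable as a Boolean expression, which is the source of the interpretability the abstract refers to.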
-
A Comparison Between Tsetlin Machines and Deep Neural Networks in the Context of Recommendation Systems
Authors:
Karl Audun Borgersen,
Morten Goodwin,
Jivitesh Sharma
Abstract:
Recommendation Systems (RSs) are ubiquitous in modern society and are one of the largest points of interaction between humans and AI. Modern RSs are often implemented using deep learning models, which are infamously difficult to interpret. This problem is particularly exacerbated in the context of recommendation scenarios, as it erodes the user's trust in the RS. In contrast, the newly introduced Tsetlin Machines (TM) possess some valuable properties due to their inherent interpretability. TMs are still fairly young as a technology. As no RS has been developed for TMs before, it has become necessary to perform some preliminary research regarding the practicality of such a system. In this paper, we develop the first RS based on TMs to evaluate its practicality in this application domain. This paper compares the viability of TMs with other machine learning models prevalent in the field of RS. We train and investigate the performance of the TM compared with a vanilla feed-forward deep learning model. These comparisons are based on model performance, interpretability/explainability, and scalability. Further, we provide some benchmark performance comparisons to similar machine learning solutions relevant to RSs.
Submitted 20 December, 2022;
originally announced December 2022.
-
Evaluating CNN with Oscillatory Activation Function
Authors:
Jeevanshi Sharma
Abstract:
The reason behind CNNs' capability to learn high-dimensional complex features from images is the non-linearity introduced by the activation function. Several advanced activation functions have been discovered to improve the training process of neural networks, as choosing an activation function is a crucial step in the modeling. Recent research has proposed using an oscillating activation function, inspired by the human brain cortex, to solve classification problems. This paper explores the performance of the CNN architecture AlexNet on the MNIST and CIFAR10 datasets using an oscillatory activation function (GCU) and other commonly used activation functions such as ReLU, PReLU, and Mish.
Submitted 13 November, 2022;
originally announced November 2022.
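For reference, the four activation functions named in the abstract above can be written as scalar functions. The standard definitions are used, taking GCU as the Growing Cosine Unit, x·cos(x). The PReLU slope is normally a learned parameter, so the fixed value 0.25 is only an illustrative assumption.

```python
import math


def relu(x):
    """Rectified linear unit: max(0, x)."""
    return max(0.0, x)


def prelu(x, slope=0.25):
    """Parametric ReLU; `slope` is learned in practice, fixed here for illustration."""
    return x if x >= 0 else slope * x


def mish(x):
    """Mish: x * tanh(softplus(x)), with softplus(x) = ln(1 + e^x)."""
    return x * math.tanh(math.log1p(math.exp(x)))


def gcu(x):
    """Growing Cosine Unit: x * cos(x), an oscillatory activation."""
    return x * math.cos(x)


for value in (-2.0, 0.0, 2.0, math.pi):
    print(value, relu(value), prelu(value), mish(value), gcu(value))
```

Unlike the other three, GCU is non-monotonic and changes sign repeatedly as the input grows, which is what "oscillatory" refers to.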
-
An RNA Sequencing Analysis of Glaucoma Genesis in Mice
Authors:
Jai Sharma,
Vidhyacharan Bhaskar
Abstract:
Glaucoma is the leading cause of irreversible blindness in people over the age of 60, accounting for 6.6 to 8% of all blindness in 2010, but there is still much to be learned about the genetic origins of the eye disease. With the modern development of Next-Generation Sequencing (NGS) technologies, scientists are starting to learn more about the genetic origins of Glaucoma. This research uses differential expression (DE) and gene ontology (GO) analyses to study the genetic differences between mice with severe Glaucoma and multiple control groups. Optic nerve head (ONH) and retina data samples of genome-wide RNA expression from NCBI (NIH) are used for pairwise comparison experimentation. In addition, principal component analysis (PCA) and dispersion visualization methods are employed to perform quality control tests of the sequenced data. Genes with skewed gene counts are also identified, as they may be marker genes for a particular severity of Glaucoma. The gene ontologies found in this experiment support existing knowledge of Glaucoma genesis, providing confidence that the results were valid. Future researchers can thoroughly study the gene lists generated by the DE and GO analyses to find potential activator or protector genes for Glaucoma in mice to develop drug treatments or gene therapies to slow or stop the progression of the disease. The overall goal is that in the future, such treatments can be made for humans as well to improve the quality of life for human patients with Glaucoma and reduce Glaucoma blindness rates.
Submitted 2 October, 2022;
originally announced October 2022.
-
Machine Learning For Classification Of Antithetical Emotional States
Authors:
Jeevanshi Sharma,
Rajat Maheshwari,
Yusuf Uzzaman Khan
Abstract:
Emotion classification through EEG signals has seen many advancements. However, problems such as the lack of data and the difficulty of learning the important features and patterns have always been areas with scope for improvement, both computationally and in prediction accuracy. This work analyses the performance of baseline machine learning classifiers on the DEAP dataset, along with a tabular learning approach that provided results comparable to the state of the art, leveraging the performance boost due to its deep learning architecture without deploying heavy neural networks.
Submitted 6 September, 2022;
originally announced September 2022.
-
A Deep Learning Ensemble Framework for Off-Nadir Geocentric Pose Prediction
Authors:
Christopher Sun,
Jai Sharma,
Milind Maiti
Abstract:
Computational methods to accelerate natural disaster response include change detection, map alignment, and vision-aided navigation. Current software functions optimally only on near-nadir images, though off-nadir images are often the first sources of information following a natural disaster. The use of off-nadir images for the aforementioned tasks requires the computation of geocentric pose, which is an aerial vehicle's spatial orientation with respect to gravity. This study proposes a deep learning ensemble framework to predict geocentric pose using 5,923 near-nadir and off-nadir RGB satellite images of cities worldwide. First, a U-Net Fully Convolutional Neural Network predicts the pixel-wise above-ground elevation mask of the RGB images. Then, the elevation masks are concatenated with the RGB images to form four-channel inputs fed into a second convolutional model, which predicts orientation angle and magnification scale. A performance accuracy of $R^2=0.917$ significantly outperforms previous methodologies. In addition, outlier removal is performed through supervised interpolation, and a sensitivity analysis of elevation masks is conducted to gauge the usefulness of data features, motivating future avenues of feature engineering. The high-accuracy software built in this study contributes to mapping and navigation procedures for effective disaster response to save lives.
Submitted 6 August, 2022; v1 submitted 4 May, 2022;
originally announced May 2022.
-
Controllability Analysis of Motion of Artificial Satellite Under the Effect of Oblateness of the Earth
Authors:
Jaita Sharma,
B. S. Ratanpal,
Shivam Munshi,
Vishant Shah
Abstract:
In this article we study the controllability of an artificial satellite under the effect of the zonal harmonic $J_2$ in a cylindrical polar coordinate system. Seven different cases of thrusters in various directions have been analyzed, and it is found that the system is controllable if we apply thrusters in either the $r$, $θ$ and $z$ directions or the $θ$ and $z$ directions. The equations governing the motion of the satellite have been linearized, and the Kalman controllability test is applied to check the controllability of the system. We have also derived the controller $u$ for the linearized system. The trajectory of the system has been plotted to show the controllability of the system.
Submitted 28 February, 2022;
originally announced March 2022.
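The Kalman controllability test mentioned in the abstract above checks whether the matrix $[B, AB, \dots, A^{n-1}B]$ has full rank $n$ for a linear system $\dot{x} = Ax + Bu$. The sketch below applies it to a double integrator, which is a textbook stand-in and not the linearized satellite model from the paper. The helper names are assumptions, and exact rational arithmetic is used to avoid floating-point rank issues.

```python
from fractions import Fraction


def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def rank(matrix):
    """Rank via Gauss-Jordan elimination over exact fractions."""
    m = [[Fraction(v) for v in row] for row in matrix]
    rows, cols = len(m), len(m[0])
    r = 0
    for c in range(cols):
        pivot = next((i for i in range(r, rows) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(rows):
            if i != r and m[i][c] != 0:
                factor = m[i][c] / m[r][c]
                m[i] = [x - factor * y for x, y in zip(m[i], m[r])]
        r += 1
        if r == rows:
            break
    return r


def controllability_matrix(a, b):
    """Horizontally stack B, AB, ..., A^(n-1)B."""
    n = len(a)
    blocks = [b]
    for _ in range(n - 1):
        blocks.append(matmul(a, blocks[-1]))
    return [sum((block[i] for block in blocks), []) for i in range(n)]


def is_controllable(a, b):
    return rank(controllability_matrix(a, b)) == len(a)


# Double integrator: state (position, velocity).
A = [[0, 1], [0, 0]]
B_force = [[0], [1]]      # input drives the velocity equation
B_position = [[1], [0]]   # input drives only the position equation

print(is_controllable(A, B_force), is_controllable(A, B_position))
```

Analysing different thruster configurations, as the abstract describes, corresponds to repeating this rank check with a different input matrix for each case.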
-
T-Controllability of Evolution Systems having Non-instantaneous Impulses
Authors:
Vishant Shah,
Kalpesh Bharvad,
Dipsha Bhadrecha,
Aashi Maratha,
Jaita Sharma,
Dhanesh Patel
Abstract:
In this manuscript, we have considered the system governed by a noninstantaneous impulsive dynamical system of integer order with classical and nonlocal conditions and derived sufficient conditions for the trajectory controllability of the system on the Banach space. The conditions were obtained through the concept of semigroup properties of operator and Gronwall's inequality. Finally, illustrations with classical and nonlocal conditions were also added to validate the derived conditions.
Submitted 13 February, 2022;
originally announced February 2022.
-
Tsetlin Machine for Solving Contextual Bandit Problems
Authors:
Raihan Seraj,
Jivitesh Sharma,
Ole-Christoffer Granmo
Abstract:
This paper introduces an interpretable contextual bandit algorithm using Tsetlin Machines, which solves complex pattern recognition tasks using propositional logic. The proposed bandit learning algorithm relies on straightforward bit manipulation, thus simplifying computation and interpretation. We then present a mechanism for performing Thompson sampling with Tsetlin Machine, given its non-parametric nature. Our empirical analysis shows that Tsetlin Machine as a base contextual bandit learner outperforms other popular base learners on eight out of nine datasets. We further analyze the interpretability of our learner, investigating how arms are selected based on propositional expressions that model the context.
Submitted 3 February, 2022;
originally announced February 2022.
-
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Authors:
Kristen Grauman,
Andrew Westbury,
Eugene Byrne,
Zachary Chavis,
Antonino Furnari,
Rohit Girdhar,
Jackson Hamburger,
Hao Jiang,
Miao Liu,
Xingyu Liu,
Miguel Martin,
Tushar Nagarajan,
Ilija Radosavovic,
Santhosh Kumar Ramakrishnan,
Fiona Ryan,
Jayant Sharma,
Michael Wray,
Mengmeng Xu,
Eric Zhongcong Xu,
Chen Zhao,
Siddhant Bansal,
Dhruv Batra,
Vincent Cartillier,
Sean Crane,
Tien Do
, et al. (60 additional authors not shown)
Abstract:
We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards with consenting participants and robust de-identification procedures where relevant. Ego4D dramatically expands the volume of diverse egocentric video footage publicly available to the research community. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. Furthermore, we present a host of new benchmark challenges centered around understanding the first-person visual experience in the past (querying an episodic memory), present (analyzing hand-object manipulation, audio-visual conversation, and social interactions), and future (forecasting activities). By publicly sharing this massive annotated dataset and benchmark suite, we aim to push the frontier of first-person perception. Project page: https://ego4d-data.org/
Submitted 11 March, 2022; v1 submitted 13 October, 2021;
originally announced October 2021.
-
Challenges and Solutions in DeepFakes
Authors:
Jatin Sharma,
Sahil Sharma
Abstract:
Deep learning has been successfully applied to solve various complex problems in areas ranging from big data analytics to computer vision. A recently emerged deep learning-powered application is Deep Fake. It helps create fake images and videos that humans cannot distinguish from real ones, and it is a recent off-the-shelf manipulation technique that allows swapping two identities in a single video. The technology is controversial, with many wide-reaching issues impacting society. To counter this emerging problem, we introduce a dataset of 140k real and fake faces, which contains 70k real faces from the Flickr dataset collected by Nvidia as well as 70k fake faces sampled from 1 million fake faces generated by StyleGAN. We will train our model on the dataset so that it can identify real or fake faces.
Submitted 26 September, 2021; v1 submitted 11 September, 2021;
originally announced September 2021.
-
Investigating the Relationship Between Dropout Regularization and Model Complexity in Neural Networks
Authors:
Christopher Sun,
Jai Sharma,
Milind Maiti
Abstract:
Dropout Regularization, serving to reduce variance, is nearly ubiquitous in Deep Learning models. We explore the relationship between the dropout rate and model complexity by training 2,000 neural networks configured with random combinations of the dropout rate and the number of hidden units in each dense layer, on each of the three data sets we selected. The generated figures, with binary cross entropy loss and binary accuracy on the z-axis, question the common assumption that adding depth to a dense layer while increasing the dropout rate will certainly enhance performance. We also discover a complex correlation between the two hyperparameters that we proceed to quantify by building additional machine learning and Deep Learning models which predict the optimal dropout rate given the number of hidden units in each dense layer. Linear regression and polynomial logistic regression require the use of arbitrary thresholds to select the cost data points included in the regression and to assign the cost data points a binary classification, respectively. These machine learning models have mediocre performance because their naive nature prevents the modeling of complex decision boundaries. Turning to Deep Learning models, we build neural networks that predict the optimal dropout rate given the number of hidden units in each dense layer, the desired cost, and the desired accuracy of the model. However, this attempt encounters a mathematical error that can be attributed to the failure of the vertical line test. The ultimate Deep Learning model is a neural network whose decision boundary represents the 2,000 previously generated data points. This final model leads us to devise a promising method for tuning hyperparameters to minimize computational expense yet maximize performance. The strategy can be applied to any model hyperparameters, with the prospect of more efficient tuning in industrial models.
Submitted 25 August, 2021; v1 submitted 14 August, 2021;
originally announced August 2021.
-
Differential rotation of the solar transition region from STEREO/EUVI 30.4 nm images
Authors:
Jaidev Sharma,
Brajesh Kumar,
Anil K Malik,
Hari Om Vats
Abstract:
The solar photosphere, chromosphere and corona are known to rotate differentially as a function of latitude. To date, it is unclear if the solar transition region also rotates differentially. In this paper, we investigate the differential rotation profile of the solar transition region as a function of latitude, using solar full disk (SFD) images at 30.4 nm wavelength recorded by the Extreme Ultraviolet Imager (EUVI) onboard the Solar Terrestrial Relations Observatory (STEREO) space mission for the period from 2008 to 2018 (Solar Cycle 24). Our investigations show that the solar transition region rotates differentially. The sidereal rotation rate obtained in the +/- 5 degree equatorial band is quite high (~ 14.7 degree/day) and drops to ~ 13.6 degree/day towards both polar regions. We also find that the rotational differentiality is low during the period of high solar activity (rotation rate varies from 14.86 to 14.27 degree/day), while it increases during the ascending and descending phases of the 24th solar cycle (rotation rate varies from 14.56 to 13.56 degree/day in 2008 and 14.6 to 13.1 degree/day in 2018). The average sidereal rotation rate (over SFD) follows the trend of solar activity (maximum ~ 14.97 degree/day during the peak phase of solar activity, which slowly decreases to a minimum of ~ 13.9 degree/day during the ascending and descending phases of the 24th solar cycle). We also observe that the solar transition region rotates less differentially than the corona.
Submitted 12 July, 2021;
originally announced July 2021.
-
Drop Clause: Enhancing Performance, Interpretability and Robustness of the Tsetlin Machine
Authors:
Jivitesh Sharma,
Rohan Yadav,
Ole-Christoffer Granmo,
Lei Jiao
Abstract:
In this article, we introduce a novel variant of the Tsetlin machine (TM) that randomly drops clauses, the key learning elements of a TM. In effect, TM with drop clause ignores a random selection of the clauses in each epoch, selected according to a predefined probability. In this way, additional stochasticity is introduced in the learning phase of TM. To explore the effects drop clause has on accuracy, training time, interpretability and robustness, we conduct extensive experiments on nine benchmark datasets in natural language processing (NLP) (IMDb, R8, R52, MR and TREC) and image classification (MNIST, Fashion MNIST, CIFAR-10 and CIFAR-100). Our proposed model outperforms baseline machine learning algorithms by a wide margin and achieves competitive performance in comparison with recent deep learning models such as BERT and AlexNET-DFA. In brief, we observe up to +10% increase in accuracy and 2x to 4x faster learning compared with standard TM. We further employ the Convolutional TM to document interpretable results on the CIFAR datasets, visualizing how the heatmaps produced by the TM become more interpretable with drop clause. We also evaluate how drop clause affects learning robustness by introducing corruptions and alterations in the image/language test data. Our results show that drop clause makes TM more robust towards such changes.
Submitted 14 January, 2022; v1 submitted 30 May, 2021;
originally announced May 2021.
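The clause-dropping step described in the abstract above can be pictured with a minimal sketch. It assumes that each clause is dropped independently with the predefined probability at the start of every epoch; the function name `sample_active_clauses` and the parameter values are hypothetical and are not taken from the paper or from any Tsetlin machine library.

```python
import random

def sample_active_clauses(num_clauses, drop_probability, rng):
    """Return the indices of the clauses kept for one epoch.

    Assumption: every clause is dropped independently with
    probability `drop_probability`.
    """
    return [j for j in range(num_clauses) if rng.random() >= drop_probability]

rng = random.Random(0)
for epoch in range(3):
    # A fresh random subset is drawn per epoch; only these clauses
    # would take part in training during that epoch.
    active = sample_active_clauses(num_clauses=10, drop_probability=0.25, rng=rng)
    print(f"epoch {epoch}: training with clauses {active}")
```

Redrawing the subset every epoch is what adds the extra stochasticity to the learning phase; how the surviving clauses are then updated is outside the scope of this sketch.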
-
Draw your Neural Networks
Authors:
Jatin Sharma,
Shobha Lata
Abstract:
Deep Neural Networks are the basic building blocks of modern Artificial Intelligence. They are increasingly replacing or augmenting existing software systems due to their ability to learn directly from the data and their superior accuracy on a variety of tasks. Existing Software Development Life Cycle (SDLC) methodologies fall short in representing the unique capabilities and requirements of AI development and must be replaced with Artificial Intelligence Development Life Cycle (AIDLC) methodologies. In this paper, we discuss an alternative and more natural approach to developing neural networks that involves drawing them with intuitive GUI elements such as blocks and lines instead of complex computer programming. We present the Sketch framework, which uses this GUI-based approach to design and modify neural networks and provides interoperability with traditional frameworks. The system provides popular layers and operations out-of-the-box and can import any supported pre-trained model, making it a faster method to design and train complex neural networks and ultimately democratizing AI by removing the learning curve.
Submitted 12 December, 2020;
originally announced December 2020.
-
Towards Hardware-Agnostic Gaze-Trackers
Authors:
Jatin Sharma,
Jon Campbell,
Pete Ansell,
Jay Beavers,
Christopher O'Dowd
Abstract:
Gaze-tracking is a novel way of interacting with computers which allows new scenarios, such as enabling people with motor-neuron disabilities to control their computers or doctors to interact with patient information without touching a screen or keyboard. Further, there are emerging applications of gaze-tracking in interactive gaming, user experience research, human attention analysis and behavioral studies. Accurate estimation of the gaze may involve accounting for head-pose, head-position, eye rotation, distance from the object as well as operating conditions such as illumination, occlusion, background noise and various biological aspects of the user. Commercially available gaze-trackers utilize specialized sensor assemblies that usually consist of an infrared light source and camera. There are several challenges in the universal proliferation of gaze-tracking as an accessibility technology, specifically its affordability, reliability, and ease-of-use. In this paper, we try to address these challenges through the development of a hardware-agnostic gaze-tracker. We present a deep neural network architecture as an appearance-based method for constrained gaze-tracking that utilizes facial imagery captured on an ordinary RGB camera ubiquitous in all modern computing devices. Our system achieved an error of 1.8073 cm on the GazeCapture dataset without any calibration or device-specific fine-tuning. This research shows promise that one day soon any computer, tablet, or phone will be controllable using just your eyes due to the prediction capabilities of deep neural networks.
Submitted 10 October, 2020;
originally announced October 2020.
-
Evidence of strong relationship between hemispheric asymmetry in solar coronal rotation and solar activity during solar cycle 24
Authors:
Jaidev Sharma,
Anil K Malik,
Brajesh Kumar,
Hari Om Vats
Abstract:
In this article, we report evidence of a very strong and statistically significant relationship between hemispheric asymmetry in the solar coronal rotation rate and solar activity. Our approach is based on cross correlation of the hemispheric asymmetry index (AI) in rotation rate with annual solar activity indicators. To obtain the hemispheric asymmetry in solar rotation rate, we use solar full disc (SFD) images at 30.4 nm, 19.5 nm, and 28.4 nm wavelengths for the 24th Solar Cycle, i.e., for the period from 2008 to 2018, as recorded by the Solar Terrestrial Relations Observatory (STEREO) space mission. Our analysis shows that hemispheric asymmetry in rotation rate is high during the solar maximum from 2011 to 2014. On the other hand, hemispheric asymmetry drops gradually on both sides (i.e., from 2008 to 2011 and from 2014 to 2018). The results show that the asymmetry index (AI) leads sunspot numbers by ~1.56 years. This gives a clear indication that hemispheric asymmetry triggers the formation of sunspots working together with the differential rotation of the Sun.
Submitted 17 September, 2020;
originally announced September 2020.
-
RapidLearn: A General Purpose Toolkit for Autonomic Networking
Authors:
Jatin Sharma,
Nikhilesh Behera,
Priya Venkatraman,
Boon Thau Loo
Abstract:
Software Defined Networking has opened a new area of opportunity in distributed networking and intelligent networks. There has been great interest in performing machine learning in a distributed setting, exploiting the abstraction of SDN, which makes it easier to write complex ML queries on a standard control plane. However, most of the research has been directed towards specialized problems (security, performance improvement, middlebox management, etc.) and not towards a generic framework. Also, existing tools and software require specialized knowledge of the algorithm/network to operate or monitor these systems. We built a generic toolkit which abstracts out the underlying structure, algorithms and other intricacies and gives an intuitive way for a common user to create and deploy distributed machine learning network applications. Decisions are made at the local level by the switches and communicated to other switches to improve upon these decisions. Finally, a global decision is taken by the controller based on another algorithm (in our case, voting). We demonstrate the efficacy of the framework through a simple DDoS detection algorithm.
Submitted 9 September, 2020;
originally announced September 2020.
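The two-level decision flow in the abstract above (local decisions at the switches, then a vote at the controller) can be sketched as follows. This is only an illustration under stated assumptions: the rate-threshold rule, the labels, the switch names, and both function names are hypothetical and are not RapidLearn's API, and the step where switches exchange decisions with each other is left out.

```python
from collections import Counter

def local_decision(packets_per_second, threshold=1000.0):
    """Hypothetical per-switch rule: flag traffic above a rate threshold."""
    return "attack" if packets_per_second > threshold else "normal"

def global_decision(local_decisions):
    """Controller-side majority vote over the switches' local decisions."""
    counts = Counter(local_decisions)
    label, _ = counts.most_common(1)[0]
    return label

# Assumed observations: packet rate seen by each of three switches.
rates = {"s1": 1500.0, "s2": 200.0, "s3": 2400.0}
votes = [local_decision(rate) for rate in rates.values()]
print(global_decision(votes))
```

Here two of the three switches flag the traffic, so the vote reports an attack. Ties are not handled specially in this sketch.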
-
Development of Adaptive Frame Reservation Scheme and Naive Persistent State Co-Located Coexistence Controller
Authors:
Jatin Sharma,
Vishwanath Sinha
Abstract:
Future broadband networks need to provide high capacity at low cost with increased revenue through enhanced services. WiMAX emerged as one of the leading technologies; however, the allocated 2.3 GHz and 2.5 GHz frequency bands create two serious coexistence issues with the adjacent 2.4 GHz ISM band. The first problem concerns radio interfaces that are located on two independent platforms and still possess the potential for mutual interference owing to their close proximity to each other. The Adaptive Frame Reservation Scheme presented here extends the CTS frame reservation signaling defined in the 802.11 specifications to a demand-based and adaptive scheme. The second issue is the coexistence problem in multi-radio platforms where two or more radios are co-located, creating an even worse interference scenario. This can be managed by hardware signaling that can be made available between radio interfaces through OS control. The development of a smart Co-located Coexistence Controller is explored, which continuously receives transmission, reception and sleep requests from attached interfaces and grants permissions in return.
Submitted 11 September, 2020; v1 submitted 31 August, 2020;
originally announced September 2020.
-
On the variation of solar coronal rotation using SDO/AIA observations
Authors:
Jaidev Sharma,
Brajesh Kumar,
Anil K. Malik,
Hari Om Vats
Abstract:
We report on the variability of the rotation periods of solar coronal layers with respect to temperature (or height). For this purpose, we use observations from the Atmospheric Imaging Assembly (AIA) telescope on board the Solar Dynamics Observatory (SDO) space mission. The images used are at the wavelengths 94 Å, 131 Å, 171 Å, 193 Å, 211 Å, and 335 Å for the period from 2012 to 2018. Analysis of solar full disk images obtained at these wavelengths by AIA is carried out using the flux modulation method. Seventeen rectangular strips/bins at equal intervals of 10 degrees (extending from 80 degrees South to 80 degrees North on the Sun) are selected to extract time series of extreme ultraviolet (EUV) intensity variations, from which auto-correlation coefficients are obtained. The peak of a Gaussian fit to the first secondary maximum in the autocorrelogram gives the synodic rotation period. Our analysis shows differential rotation with respect to latitude as well as temperature (or height). In the present study, we find that the sidereal rotation periods of different coronal layers decrease with increasing temperature (or height). The average sidereal rotation period at the lowest temperature (~ 600000 Kelvin), corresponding to AIA-171 Å, which originates from the upper transition region/quiet corona, is 27.03 days. The sidereal rotation period decreases with temperature (or height) to 25.47 days at the higher temperature (~10 million Kelvin) corresponding to the flaring regions of the solar corona as seen in AIA-131 Å observations.
Submitted 20 January, 2020;
originally announced January 2020.
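The autocorrelation step in the abstract above can be illustrated with a short sketch on synthetic data. Several simplifications are assumed here and are not the authors' pipeline: the first secondary maximum is read off as a discrete lag rather than from a Gaussian fit, the input is an artificial daily series with a 27-day modulation, and the synodic-to-sidereal conversion uses the standard relation 1/P_sid = 1/P_syn + 1/P_orbit with an assumed orbital period of 365.25 days for a near-Earth observer. All function names are hypothetical.

```python
import math

EARTH_ORBIT_DAYS = 365.25  # assumed orbital period of the observer

def autocorrelation(series, max_lag):
    """Normalized autocorrelation coefficients for lags 0..max_lag."""
    n = len(series)
    mean = sum(series) / n
    dev = [x - mean for x in series]
    var = sum(d * d for d in dev)
    return [
        sum(dev[i] * dev[i + lag] for i in range(n - lag)) / var
        for lag in range(max_lag + 1)
    ]

def first_secondary_maximum(acf):
    """Lag of the first local maximum after lag 0, or None if there is none."""
    for lag in range(1, len(acf) - 1):
        if acf[lag - 1] < acf[lag] >= acf[lag + 1]:
            return lag
    return None

def synodic_to_sidereal(p_syn):
    """Convert a synodic rotation period (days) to a sidereal one."""
    return 1.0 / (1.0 / p_syn + 1.0 / EARTH_ORBIT_DAYS)

# Synthetic daily EUV flux with a 27-day modulation (illustrative only).
series = [math.sin(2 * math.pi * t / 27.0) for t in range(270)]
acf = autocorrelation(series, max_lag=60)
lag = first_secondary_maximum(acf)
print(lag, synodic_to_sidereal(float(lag)))
```

With real data the peak would be noisy, which is presumably why the abstract fits a Gaussian to it instead of taking the raw maximum.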
-
Survival of current in a periodically driven hard-core bosonic system
Authors:
Rashmi J. Sharma,
Jayendra N. Bandyopadhyay
Abstract:
We study the survival of the current induced initially by applying a twist at the boundary of a chain of hard-core bosons (HCBs), subject to periodic double $δ$-function kicks in the staggered on-site potential. We study the current flow and the work-done on the system at the long-time limit as a function of the driving frequency. Like a recent observation in the HCB chain with a single $δ$-function kick in the staggered on-site potential, here we also observe many dips in the current flow and concurrently many peaks in the work-done on the system at some specific values of the driving frequency. However, unlike the single kicked case, here we do not observe a complete disappearance of the current in the limit of a high driving frequency, which shows the absence of any dynamical localization in the double $δ$-function kicked HCB chain. Our analytical estimations of the saturated current and the saturated work-done, defined at the limit of a large time together with a high driving frequency, match very well with the exact numerics. In the case of a very small initial current, induced by a very small twist $ν$, we observe that the saturated current is proportional to $ν$. Finally, we study the time-evolution of the half-filled HCB chain where the particles are localized in the central part of the chain. We observe that the particles spread linearly in a light-cone-like region at the rate determined by the maximum value of the group velocity. Except for a very trivial case, the maximum group velocity never vanishes, and therefore we do not observe any dynamical localization in the system.
Submitted 10 October, 2019;
originally announced October 2019.
-
Environment Sound Classification using Multiple Feature Channels and Attention based Deep Convolutional Neural Network
Authors:
Jivitesh Sharma,
Ole-Christoffer Granmo,
Morten Goodwin
Abstract:
In this paper, we propose a model for the Environment Sound Classification Task (ESC) that consists of multiple feature channels given as input to a Deep Convolutional Neural Network (CNN) with an Attention mechanism. The novelty of the paper lies in using multiple feature channels consisting of Mel-Frequency Cepstral Coefficients (MFCC), Gammatone Frequency Cepstral Coefficients (GFCC), the Constant Q-transform (CQT) and Chromagram. Such multiple features have never been used before for signal or audio processing. We also employ a deeper CNN (DCNN) compared to previous models, consisting of spatially separable convolutions working on the time and feature domains separately. In addition, we use attention modules that perform channel and spatial attention together. We use some data augmentation techniques to further boost performance. Our model is able to achieve state-of-the-art performance on all three benchmark environment sound classification datasets, i.e. the UrbanSound8K (97.52%), ESC-10 (95.75%) and ESC-50 (88.50%). To the best of our knowledge, this is the first time that a single environment sound classification model is able to achieve state-of-the-art results on all three datasets. For the ESC-10 and ESC-50 datasets, the accuracy achieved by the proposed model is beyond the human accuracy of 95.7% and 81.3% respectively.
Submitted 8 December, 2020; v1 submitted 28 August, 2019;
originally announced August 2019.
-
Reentrant spin-glass and transport behavior of Gd4PtAl, a compound with three sites for Gd
Authors:
Ram Kumar,
Jyoti Sharma,
Kartik K Iyer,
E. V. Sampathkumaran
Abstract:
We report the temperature (T) dependence (2-330 K) of DC and AC magnetization (M), isothermal remanent magnetization (M_IRM), heat capacity (C), electrical resistivity (rho), and magnetoresistance (MR) of a ternary intermetallic compound, Gd4PtAl, crystallizing in a cubic (space group F-43m) structure. In this structure, there are three sites for the rare-earth. The magnetization data reveal that, in addition to a magnetic transition at 64 K, there is another magnetic feature below 20 K. The C(T) data reveal an upturn below 64 K, shifting to a lower temperature with increasing field, which establishes that the onset of magnetic order is of an antiferromagnetic type. However, there is no worthwhile feature near 20 K in the C(T) curve. The AC susceptibility peak undergoes an observable change with frequency and, in particular, the peak around 20 K gets suppressed with the application of a dc magnetic field; in addition, M_IRM undergoes a slow decay with time and isothermal M exhibits low-field hysteresis below 20 K only, which is typical of spin-glasses. The results overall suggest that this compound is a reentrant spin-glass in zero-field. There are experimental signatures pointing to the existence of both antiferromagnetic and ferromagnetic components, competing with the variation of temperature and magnetic field, as a result of which electrical and magnetoresistance behaviors are peculiar. The results overall suggest this compound exhibits interesting magnetic and transport properties.
Submitted 5 July, 2019;
originally announced July 2019.
-
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Authors:
Jivitesh Sharma,
Per-Arne Andersen,
Ole-Christoffer Granmo,
Morten Goodwin
Abstract:
We focus on the important problem of emergency evacuation, which could clearly benefit from reinforcement learning but has been largely unaddressed. Emergency evacuation is a complex task which is difficult to solve with reinforcement learning, since an emergency situation is highly dynamic, with many changing variables and complex constraints that make it difficult to train on. In this paper, we propose the first fire evacuation environment to train reinforcement learning agents for evacuation planning. The environment is modelled as a graph capturing the building structure. It includes realistic features like fire spread, uncertainty and bottlenecks. We have implemented the environment in the OpenAI gym format to facilitate future research. We also propose a new reinforcement learning approach that entails pretraining the network weights of a DQN-based agent to incorporate information on the shortest path to the exit. We achieve this by using tabular Q-learning to learn the shortest path on the building model's graph. This information is transferred to the network by deliberately overfitting it on the Q-matrix. Then, the pretrained DQN model is trained on the fire evacuation environment to generate the optimal evacuation path under time-varying conditions. We compare the proposed approach with state-of-the-art reinforcement learning algorithms like PPO, VPG, SARSA, A2C and ACKTR. The results show that our method outperforms state-of-the-art models by a huge margin, including the original DQN-based models. Finally, we test our model on a large and complex real building consisting of 91 rooms, with the possibility to move to any other room, hence giving 8281 actions. We use an attention-based mechanism to deal with large action spaces. Our model achieves near-optimal performance on the real-world emergency environment.
Submitted 29 May, 2019; v1 submitted 23 May, 2019;
originally announced May 2019.
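The tabular Q-learning stage mentioned in the abstract above (learning the shortest path to the exit on the building graph) can be sketched on a toy example. Everything concrete here is an assumption for illustration: the seven-room graph, the reward of -1 per move, the hyperparameters, and the function names are hypothetical, and the later transfer of the Q-matrix into a DQN is not shown.

```python
import random

# Hypothetical building graph: room -> neighbouring rooms; room 4 is the exit.
GRAPH = {
    0: [1, 3],
    1: [0, 2],
    2: [1, 4],
    3: [0, 5],
    5: [3, 6],
    6: [5, 4],
    4: [],
}
EXIT = 4

def train_q_table(graph, exit_room, episodes=2000, alpha=0.5, gamma=0.9, seed=0):
    """Tabular Q-learning with a reward of -1 per move (assumed reward design)."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in nbrs} for s, nbrs in graph.items()}
    starts = [s for s in graph if s != exit_room]
    for _ in range(episodes):
        state = rng.choice(starts)
        for _ in range(50):  # cap on episode length
            # Purely random exploration; Q-learning is off-policy, so the
            # greedy values are still learned.
            action = rng.choice(graph[state])
            next_value = 0.0 if action == exit_room else max(q[action].values())
            target = -1.0 + gamma * next_value
            q[state][action] += alpha * (target - q[state][action])
            state = action
            if state == exit_room:
                break
    return q

def greedy_path(q, start, exit_room, max_steps=20):
    """Follow the highest-valued action from `start` until the exit is reached."""
    path = [start]
    while path[-1] != exit_room and len(path) <= max_steps:
        state = path[-1]
        path.append(max(q[state], key=q[state].get))
    return path

q_table = train_q_table(GRAPH, EXIT)
print(greedy_path(q_table, start=0, exit_room=EXIT))
```

In this toy graph the route through rooms 1 and 2 takes three moves while the route through rooms 3, 5 and 6 takes four, so the learned greedy policy should prefer the former.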
-
Uncertainty principles on nilpotent Lie groups
Authors:
Jyoti Sharma,
Ajay Kumar
Abstract:
A Hardy-type uncertainty principle for the Fourier transform is proved on connected nilpotent Lie groups. An analogue of Hardy's theorem for the Gabor transform is established for connected and simply connected nilpotent Lie groups. Finally, Beurling's theorem for the Gabor transform is discussed for groups of the form $\mathbb{R}_n \times K$, where $K$ is a compact group.
Submitted 7 January, 2019;
originally announced January 2019.
-
Continuous abstract wavelet transform on homogeneous space
Authors:
Jyoti Sharma,
Ajay Kumar
Abstract:
The support of the wavelet transform associated with a square integrable irreducible representation of a homogeneous space is shown to have infinite measure. The pointwise homogeneous approximation property for the wavelet transform is investigated. An analogue of a Heisenberg-type inequality is also obtained for the wavelet transform.
Submitted 7 January, 2019;
originally announced January 2019.
-
Improving Similarity Search with High-dimensional Locality-sensitive Hashing
Authors:
Jaiyam Sharma,
Saket Navlakha
Abstract:
We propose a new class of data-independent locality-sensitive hashing (LSH) algorithms based on the fruit fly olfactory circuit. The fundamental difference of this approach is that, instead of assigning hashes as dense points in a low dimensional space, hashes are assigned in a high dimensional space, which enhances their separability. We show theoretically and empirically that this new family of hash functions is locality-sensitive and preserves rank similarity for inputs in any $\ell_p$ space. We then analyze different variations on this strategy and show empirically that they outperform existing LSH methods for nearest-neighbors search on six benchmark datasets. Finally, we propose a multi-probe version of our algorithm that achieves higher performance for the same query time, or conversely, that maintains performance of prior approaches while taking significantly less indexing time and memory. Overall, our approach leverages the advantages of separability provided by high-dimensional spaces, while still remaining computationally efficient.
Submitted 5 December, 2018;
originally announced December 2018.
-
ECO: Egocentric Cognitive Mapping
Authors:
Jayant Sharma,
Zixing Wang,
Alberto Speranzon,
Vijay Venkataraman,
Hyun Soo Park
Abstract:
We present a new method to localize a camera within a previously unseen environment perceived from an egocentric point of view. Although this is, in general, an ill-posed problem, humans can effortlessly and efficiently determine their relative location and orientation and navigate previously unseen environments, e.g., finding a specific item in a new grocery store. To enable such a capability, we design a new egocentric representation, which we call ECO (Egocentric COgnitive map). ECO is biologically inspired by the cognitive map that allows human navigation, and it encodes the surrounding visual semantics with respect to both distance and orientation. ECO possesses three main properties: (1) reconfigurability: complex semantics and geometry are captured via the synthesis of atomic visual representations (e.g., image patches); (2) robustness: the visual semantics are registered in a geometrically consistent way (e.g., aligning with respect to the gravity vector, frontalizing, and rescaling to canonical depth), thus enabling us to learn meaningful atomic representations; (3) adaptability: a domain adaptation framework is designed to generalize the learned representation without manual calibration. As a proof of concept, we use ECO to localize a camera within real-world scenes (various grocery stores) and demonstrate performance improvements when compared to existing semantic localization approaches.
Submitted 1 December, 2018;
originally announced December 2018.
-
Qualitative uncertainty principle for Gabor transform on certain locally compact groups
Authors:
Jyoti Sharma,
Ajay Kumar
Abstract:
Classes of locally compact groups having a qualitative uncertainty principle for the Gabor transform have been investigated. These include Moore groups, the Heisenberg group $\mathbb{H}_n$, $\mathbb{H}_n \times D$, where $D$ is a discrete group, and other low-dimensional nilpotent Lie groups.
Submitted 31 March, 2017;
originally announced March 2017.
-
Motion of Satellite under the Effect of Oblateness of Earth and Atmospheric Drag
Authors:
Jaita Sharma,
B. S. Ratanpal,
U. M. Pirzada,
Vishant Shah
Abstract:
The equations governing the motion of a satellite under the effects of Earth's oblateness and atmospheric drag have been simulated, for a fixed initial position and three different initial velocities, until the satellite falls to Earth. A simulation of the motion of an artificial Earth satellite subject to the combined effects of an oblate Earth and atmospheric drag is presented. The atmospheric model considered here takes into account the exponential variation of the density with the initial distance of the satellite from Earth's surface, the scale height, and the radial distance. The minimum and maximum values of the orbital elements and their variation over time for different initial velocities are reported.
Submitted 29 January, 2020; v1 submitted 7 October, 2016;
originally announced October 2016.
-
Charged Anisotropic Star on Paraboloidal Spacetime
Authors:
B. S. Ratanpal,
Jaita Sharma
Abstract:
A charged anisotropic star on paraboloidal spacetime is reported by choosing a particular form of the radial pressure and electric field intensity. A non-singular solution of the Einstein-Maxwell system of equations has been derived, and it is shown that the model satisfies all the physical plausibility conditions. It is observed that, in the absence of electric field intensity, the model reduces to a particular case of the uncharged Sharma & Ratanpal model. It is also observed that the parameter used in the electric field intensity directly affects the mass of the star.
Submitted 28 August, 2015;
originally announced August 2015.
-
Self-similar spectrum in effective time independent Hamiltonians for kicked systems
Authors:
Rashmi Jangid Sharma,
Jayendra N. Bandyopadhyay,
Tapomoy Guha Sarkar
Abstract:
We study multifractal properties in the spectrum of effective time-independent Hamiltonians obtained using a perturbative method for a class of delta-kicked systems. The evolution operator in the time-dependent problem is factorized into an initial kick, an evolution dictated by a time-independent Hamiltonian, and a final kick. We have used the double kicked $SU(2)$ system and the kicked Harper model to study butterfly spectrum in the corresponding effective Hamiltonians. We have obtained a generic class of $SU(2)$ Hamiltonians showing self-similar spectrum. The statistics of the generalized fractal dimension is studied for a quantitative characterization of the spectra.
Submitted 23 April, 2015;
originally announced April 2015.
-
Selection of Arginine-Rich Anti-Gold Antibodies Engineered for Plasmonic Colloid Self-Assembly
Authors:
Purvi Jain,
Anandakumar Soshee,
S Shankara Narayanan,
Jadab Sharma,
Christian Girard,
Erik Dujardin,
Clément Nizak
Abstract:
Antibodies are affinity proteins with a wide spectrum of applications in analytical and therapeutic biology. Proteins showing specific recognition for a chosen molecular target can be isolated and their encoding sequence identified in vitro from a large and diverse library by phage display selection. In this work, we show that this standard biochemical technique rapidly yields a collection of antibody protein binders for an inorganic target of major technological importance: crystalline metallic gold surfaces. 21 distinct anti-gold antibody proteins emerged from a large random library of antibodies and were sequenced. The systematic statistical analysis of all the protein sequences reveals a strong occurrence of arginine in anti-gold antibodies, which corroborates recent molecular dynamics predictions on the crucial role of arginine in protein/gold interactions. Once tethered to small gold nanoparticles using histidine tag chemistry, the selected antibodies could drive the self-assembly of the colloids onto the surface of single crystalline gold platelets as a first step towards programmable protein-driven construction of complex plasmonic architectures. Electrodynamic simulations based on the Green Dyadic Method suggest that the antibody-driven assembly demonstrated here could be exploited to significantly modify the plasmonic modal properties of the gold platelets. Our work shows that molecular biology tools can be used to design the interaction between fully folded proteins and inorganic surfaces with potential applications in the bottom-up construction of plasmonic hybrid nanomaterials.
Submitted 13 May, 2014;
originally announced May 2014.
-
Synthesis and Structural Characterization of Highly Tetragonal (1-x)Bi(Zn1/2Ti1/2)O3-xPbTiO3 Piezoceramics
Authors:
Jyoti Sharma,
Rishikesh Pandey,
Akhilesh Kumar Singh
Abstract:
We present comprehensive X-ray diffraction (XRD) and dielectric measurements of (1-x)Bi(Zn1/2Ti1/2)O3-xPbTiO3 (BZT-xPT) piezoceramics with x=0.65, 0.70, 0.75 and 0.80. Powder X-ray diffraction data reveal the tetragonal structure (space group P4mm) of BZT-xPT ceramics for all the compositions.
Submitted 16 November, 2013;
originally announced November 2013.
-
Non-linear stability in photogravitational non-planar restricted three body problem with oblate smaller primary
Authors:
B. Ishwar,
J. P. Sharma
Abstract:
We have discussed non-linear stability in the photogravitational non-planar restricted three body problem with an oblate smaller primary. By photogravitational we mean that both primaries are radiating. We normalised the Hamiltonian using the Lie transform as in Coppola and Rand (1989). We transformed the system into Birkhoff's normal form. Lie transforms reduce the system to an equivalent simpler system which is immediately solvable. Applying Arnold's theorem, we have found non-linear stability criteria. We conclude that $L_6$ is stable. We plotted graphs for $(\omega_1, D_2)$. They are rectangular hyperbolas.
Submitted 20 September, 2011;
originally announced September 2011.
-
Revelation of double magicity in N=Z nuclei in the rp-process region
Authors:
M. M. Sharma,
J. K. Sharma
Abstract:
In rapid-proton capture (rp-process), N=Z nuclei above Ni are understood to act as waiting-point nuclei. The N=Z nuclei 68Se, 72Kr, 76Sr and 80Zr among others are known to give rise to a large-energy x-ray flux and peaks in abundances of these nuclei synthesized in the astrophysical rp-process. Investigating the experimental isotope shifts in Kr isotopes near the proton drip-line within the framework of the deformed Relativistic Hartree-Bogoliubov theory, we have discovered that N=Z rp-process nuclei 68Se, 72Kr, 76Sr and 80Zr exhibit large shell gap both at the proton and neutron numbers in the deformed space with the consequence that pairing correlations for protons and neutrons vanish. This lends a doubly magic character to these nuclei. A significant number of nuclei in this region are also shown to exhibit neutron magicity at N=34, 36, 38, and 40 in the deformed space. A unique case of concomitance of the double magicity and the shape-coexistence is found for 68Se.
Submitted 7 October, 2010;
originally announced October 2010.
-
Evidence of double magicity of N=Z nuclei near the rp-process path
Authors:
M. M. Sharma,
J. K. Sharma
Abstract:
N=Z nuclei above Ni are understood to be waiting-point nuclei in the rp-process nucleosynthesis. Investigating the experimental isotope shifts in Kr isotopes near the proton drip-line, we have discovered that N=Z rp-process nuclei $^{68}$Se, $^{72}$Kr, $^{76}$Sr and $^{80}$Zr exhibit a significant shell gap both at the proton and neutron numbers in the deformed space with the consequence that pairing correlations for protons and neutrons vanish, thus lending a double-magic character to these nuclei. A significant number of nuclei in this region are also shown to exhibit neutron magicity at N = 34, 36, 38, and 40 in the deformed space.
Submitted 6 July, 2009;
originally announced July 2009.
-
Higher Order Normalizations in the Generalized Photogravitational Restricted Three Body Problem with Poynting-Robertson Drag
Authors:
B. S. Kushvah,
J. P. Sharma,
B. Ishwar
Abstract:
Higher order normalizations are performed in the generalized photogravitational restricted three body problem with Poynting-Robertson drag. In this problem we have taken the bigger primary as a source of radiation and the smaller primary as an oblate spheroid. Whittaker's method is used to transform the second order part of the Hamiltonian into the normal form. We have also performed Birkhoff's normalization of the Hamiltonian. For this we have utilized Henrard's method and expanded the coordinates of the infinitesimal body in double D'Alembert series. We have found the values of the first and second order components. They are affected by radiation pressure, oblateness and P-R drag. Finally, we found the third order part of the Hamiltonian to be zero.
Keywords: Higher Order Normalization, Generalized Photogravitational, RTBP, P-R drag
Submitted 29 September, 2007;
originally announced October 2007.
-
Nonlinear Stability in the Generalised Photogravitational Restricted Three Body Problem with Poynting-Robertson Drag
Authors:
B. S. Kushvah,
J. P. Sharma,
B. Ishwar
Abstract:
The nonlinear stability of the triangular equilibrium points has been discussed in the generalised photogravitational restricted three body problem with Poynting-Robertson drag. The problem is generalised in the sense that the smaller primary is supposed to be an oblate spheroid. The bigger primary is considered as radiating. We have performed first and second order normalization of the Hamiltonian of the problem. We have applied the KAM theorem to examine the condition of non-linear stability. We have found three critical mass ratios. Finally, we conclude that the triangular points are stable in the nonlinear sense except at three critical mass ratios at which the KAM theorem fails.
Submitted 12 November, 2007; v1 submitted 20 September, 2006;
originally announced September 2006.
-
Critical Integrated Raman Scattering Intensity near the cubic-tetragonal phase transition in Strontium Titanate
Authors:
Jyoti Dhar Sharma
Abstract:
Emphasizing the contribution of Professor Roger A Cowley, FRS to the Theory of Raman Scattering from crystals, the development of the Theory of Raman Scattering since 1928 has been briefly discussed. Some experimental studies of Strontium Titanate using Inelastic Neutron Scattering, Raman Scattering, Electron paramagnetic resonance measurement and X-ray & Gamma Ray techniques have been briefly discussed. Using Schwabl's semi-phenomenological theory for the soft mode and central peak, we have developed (a) a one-phonon Green's function exhibiting the three-peaked structure and (b) a two-phonon Green's function involving one hard-mode underdamped quasiharmonic phonon and one three-peaked soft-mode phonon. We have developed the precursor-order-induced Raman scattering near the displacive phase transition in terms of Green's functions. Using Group Theory, we have predicted the Raman-active modes in Strontium Titanate contributing to Critical Raman Scattering near hard-mode frequencies above and below the critical temperature. We have calculated the Critical Integrated Raman Scattering Intensity and the Two-phonon Background Raman Scattering Intensity near hard-mode frequencies above and below the critical temperature. The results show the same trends as seen in some of the experimental observations.
Submitted 14 July, 2006;
originally announced July 2006.
-
Normalization of Hamiltonian in the Generalized Photogravitational Restricted Three Body Problem with Poynting-Robertson Drag
Authors:
B. S. Kushvah,
J. P. Sharma,
B. Ishwar
Abstract:
We have performed normalization of the Hamiltonian in the generalized photogravitational restricted three body problem with Poynting-Robertson drag. In this problem we have taken the bigger primary as a source of radiation and the smaller primary as an oblate spheroid. Whittaker's method is used to transform the second order part of the Hamiltonian into the normal form.
Keywords: Normalization / Generalised Photogravitational / RTBP / P-R Drag.
Submitted 18 May, 2006;
originally announced May 2006.