-
HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs
Authors:
Pranoy Panda,
Ankush Agarwal,
Chaitanya Devaguptapu,
Manohar Kaul,
Prathosh A P
Abstract:
Given unstructured text, Large Language Models (LLMs) are adept at answering simple (single-hop) questions. However, as the complexity of the questions increase, the performance of LLMs degrade. We believe this is due to the overhead associated with understanding the complex question followed by filtering and aggregating unstructured information in the raw text. Recent methods try to reduce this b…
▽ More
Given unstructured text, Large Language Models (LLMs) are adept at answering simple (single-hop) questions. However, as the complexity of the questions increase, the performance of LLMs degrade. We believe this is due to the overhead associated with understanding the complex question followed by filtering and aggregating unstructured information in the raw text. Recent methods try to reduce this burden by integrating structured knowledge triples into the raw text, aiming to provide a structured overview that simplifies information processing. However, this simplistic approach is query-agnostic and the extracted facts are ambiguous as they lack context. To address these drawbacks and to enable LLMs to answer complex (multi-hop) questions with ease, we propose to use a knowledge graph (KG) that is context-aware and is distilled to contain query-relevant information. The use of our compressed distilled KG as input to the LLM results in our method utilizing up to $67\%$ fewer tokens to represent the query relevant information present in the supporting documents, compared to the state-of-the-art (SoTA) method. Our experiments show consistent improvements over the SoTA across several metrics (EM, F1, BERTScore, and Human Eval) on two popular benchmark datasets (HotpotQA and MuSiQue).
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Deep Learning-Based Brain Image Segmentation for Automated Tumour Detection
Authors:
Suman Sourabh,
Murugappan Valliappan,
Narayana Darapaneni,
Anwesh R P
Abstract:
Introduction: The present study on the development and evaluation of an automated brain tumor segmentation technique based on deep learning using the 3D U-Net model. Objectives: The objective is to leverage state-of-the-art convolutional neural networks (CNNs) on a large dataset of brain MRI scans for segmentation. Methods: The proposed methodology applies pre-processing techniques for enhanced pe…
▽ More
Introduction: The present study on the development and evaluation of an automated brain tumor segmentation technique based on deep learning using the 3D U-Net model. Objectives: The objective is to leverage state-of-the-art convolutional neural networks (CNNs) on a large dataset of brain MRI scans for segmentation. Methods: The proposed methodology applies pre-processing techniques for enhanced performance and generalizability. Results: Extensive validation on an independent dataset confirms the model's robustness and potential for integration into clinical workflows. The study emphasizes the importance of data pre-processing and explores various hyperparameters to optimize the model's performance. The 3D U-Net, has given IoUs for training and validation dataset have been 0.8181 and 0.66 respectively. Conclusion: Ultimately, this comprehensive framework showcases the efficacy of deep learning in automating brain tumour detection, offering valuable support in clinical practice.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Music Recommendation Based on Facial Emotion Recognition
Authors:
Rajesh B,
Keerthana V,
Narayana Darapaneni,
Anwesh Reddy P
Abstract:
Introduction: Music provides an incredible avenue for individuals to express their thoughts and emotions, while also serving as a delightful mode of entertainment for enthusiasts and music lovers. Objectives: This paper presents a comprehensive approach to enhancing the user experience through the integration of emotion recognition, music recommendation, and explainable AI using GRAD-CAM. Methods:…
▽ More
Introduction: Music provides an incredible avenue for individuals to express their thoughts and emotions, while also serving as a delightful mode of entertainment for enthusiasts and music lovers. Objectives: This paper presents a comprehensive approach to enhancing the user experience through the integration of emotion recognition, music recommendation, and explainable AI using GRAD-CAM. Methods: The proposed methodology utilizes a ResNet50 model trained on the Facial Expression Recognition (FER) dataset, consisting of real images of individuals expressing various emotions. Results: The system achieves an accuracy of 82% in emotion classification. By leveraging GRAD-CAM, the model provides explanations for its predictions, allowing users to understand the reasoning behind the system's recommendations. The model is trained on both FER and real user datasets, which include labelled facial expressions, and real images of individuals expressing various emotions. The training process involves pre-processing the input images, extracting features through convolutional layers, reasoning with dense layers, and generating emotion predictions through the output layer. Conclusion: The proposed methodology, leveraging the Resnet50 model with ROI-based analysis and explainable AI techniques, offers a robust and interpretable solution for facial emotion detection paper.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
A Deep Look Into -- Automated Lung X-Ray Abnormality Detection System
Authors:
Nagullas KS,
Vivekanand. V,
Narayana Darapaneni,
Anwesh R P
Abstract:
Introduction: Automated Lung X-Ray Abnormality Detection System is the application which distinguish the normal x-ray images from infected x-ray images and highlight area considered for prediction, with the recent pandemic a need to have a non-conventional method and faster detecting diseases, for which X ray serves the purpose. Obectives: As of current situation any viral disease that is infectio…
▽ More
Introduction: Automated Lung X-Ray Abnormality Detection System is the application which distinguish the normal x-ray images from infected x-ray images and highlight area considered for prediction, with the recent pandemic a need to have a non-conventional method and faster detecting diseases, for which X ray serves the purpose. Obectives: As of current situation any viral disease that is infectious is potential pandemic, so there is need for cheap and early detection system. Methods: This research will help to eases the work of expert to do further analysis. Accuracy of three different preexisting models such as DenseNet, MobileNet and VGG16 were high but models over-fitted primarily due to black and white images. Results: This led to building up new method such as as V-BreathNet which gave more than 96% percent accuracy. Conclusion: Thus, it can be stated that not all state-of art CNN models can be used on B/W images. In conclusion not all state-of-art CNN models can be used on B/W images.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Partially Blinded Unlearning: Class Unlearning for Deep Networks a Bayesian Perspective
Authors:
Subhodip Panda,
Shashwat Sourav,
Prathosh A. P
Abstract:
In order to adhere to regulatory standards governing individual data privacy and safety, machine learning models must systematically eliminate information derived from specific subsets of a user's training data that can no longer be utilized. The emerging discipline of Machine Unlearning has arisen as a pivotal area of research, facilitating the process of selectively discarding information design…
▽ More
In order to adhere to regulatory standards governing individual data privacy and safety, machine learning models must systematically eliminate information derived from specific subsets of a user's training data that can no longer be utilized. The emerging discipline of Machine Unlearning has arisen as a pivotal area of research, facilitating the process of selectively discarding information designated to specific sets or classes of data from a pre-trained model, thereby eliminating the necessity for extensive retraining from scratch. The principal aim of this study is to formulate a methodology tailored for the purposeful elimination of information linked to a specific class of data from a pre-trained classification network. This intentional removal is crafted to degrade the model's performance specifically concerning the unlearned data class while concurrently minimizing any detrimental impacts on the model's performance in other classes. To achieve this goal, we frame the class unlearning problem from a Bayesian perspective, which yields a loss function that minimizes the log-likelihood associated with the unlearned data with a stability regularization in parameter space. This stability regularization incorporates Mohalanobis distance with respect to the Fisher Information matrix and $l_2$ distance from the pre-trained model parameters. Our novel approach, termed \textbf{Partially-Blinded Unlearning (PBU)}, surpasses existing state-of-the-art class unlearning methods, demonstrating superior effectiveness. Notably, PBU achieves this efficacy without requiring awareness of the entire training dataset but only to the unlearned data points, marking a distinctive feature of its performance.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks
Authors:
Aman Kumar,
Khushboo Anand,
Shubham Mandloi,
Ashutosh Mishra,
Avinash Thakur,
Neeraj Kasera,
Prathosh A P
Abstract:
Generative Adversarial Networks (GANs) have proven to exhibit remarkable performance and are widely used across many generative computer vision applications. However, the unprecedented demand for the deployment of GANs on resource-constrained edge devices still poses a challenge due to huge number of parameters involved in the generation process. This has led to focused attention on the area of co…
▽ More
Generative Adversarial Networks (GANs) have proven to exhibit remarkable performance and are widely used across many generative computer vision applications. However, the unprecedented demand for the deployment of GANs on resource-constrained edge devices still poses a challenge due to huge number of parameters involved in the generation process. This has led to focused attention on the area of compressing GANs. Most of the existing works use knowledge distillation with the overhead of teacher dependency. Moreover, there is no ability to control the degree of compression in these methods. Hence, we propose CoroNet-GAN for compressing GAN using the combined strength of differentiable pruning method via hypernetworks. The proposed method provides the advantage of performing controllable compression while training along with reducing training time by a substantial factor. Experiments have been done on various conditional GAN architectures (Pix2Pix and CycleGAN) to signify the effectiveness of our approach on multiple benchmark datasets such as Edges-to-Shoes, Horse-to-Zebra and Summer-to-Winter. The results obtained illustrate that our approach succeeds to outperform the baselines on Zebra-to-Horse and Summer-to-Winter achieving the best FID score of 32.3 and 72.3 respectively, yielding high-fidelity images across all the datasets. Additionally, our approach also outperforms the state-of-the-art methods in achieving better inference time on various smart-phone chipsets and data-types making it a feasible solution for deployment on edge devices.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Leveraging Internal Representations of Model for Magnetic Image Classification
Authors:
Adarsh N L,
Arun P V,
Alok Porwal,
Malcolm Aranha
Abstract:
Data generated by edge devices has the potential to train intelligent autonomous systems across various domains. Despite the emergence of diverse machine learning approaches addressing privacy concerns and utilizing distributed data, security issues persist due to the sensitive storage of data shards in disparate locations. This paper introduces a potentially groundbreaking paradigm for machine le…
▽ More
Data generated by edge devices has the potential to train intelligent autonomous systems across various domains. Despite the emergence of diverse machine learning approaches addressing privacy concerns and utilizing distributed data, security issues persist due to the sensitive storage of data shards in disparate locations. This paper introduces a potentially groundbreaking paradigm for machine learning model training, specifically designed for scenarios with only a single magnetic image and its corresponding label image available. We harness the capabilities of Deep Learning to generate concise yet informative samples, aiming to overcome data scarcity. Through the utilization of deep learning's internal representations, our objective is to efficiently address data scarcity issues and produce meaningful results. This methodology presents a promising avenue for training machine learning models with minimal data.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Enhancing Image Caption Generation Using Reinforcement Learning with Human Feedback
Authors:
Adarsh N L,
Arun P V,
Aravindh N L
Abstract:
Research on generative models to produce human-aligned / human-preferred outputs has seen significant recent contributions. Between text and image-generative models, we narrowed our focus to text-based generative models, particularly to produce captions for images that align with human preferences. In this research, we explored a potential method to amplify the performance of the Deep Neural Netwo…
▽ More
Research on generative models to produce human-aligned / human-preferred outputs has seen significant recent contributions. Between text and image-generative models, we narrowed our focus to text-based generative models, particularly to produce captions for images that align with human preferences. In this research, we explored a potential method to amplify the performance of the Deep Neural Network Model to generate captions that are preferred by humans. This was achieved by integrating Supervised Learning and Reinforcement Learning with Human Feedback (RLHF) using the Flickr8k dataset. Also, a novel loss function that is capable of optimizing the model based on human feedback is introduced. In this paper, we provide a concise sketch of our approach and results, ho** to contribute to the ongoing advances in the field of human-aligned generative AI models.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Cyclic Characters of Alternating Groups
Authors:
Amrutha P,
Amritanshu Prasad,
Velmurugan S
Abstract:
We determine the decomposition of cyclic characters of alternating groups into irreducible characters. As an application, we characterize pairs $(w, V)$, where $w\in A_n$ and $V$ is an irreducible representation of $A_n$ such that $w$ admits a non-zero invariant vector in $V$. We also establish new global conjugacy classes for alternating groups, thereby giving a new proof of a result of Heide and…
▽ More
We determine the decomposition of cyclic characters of alternating groups into irreducible characters. As an application, we characterize pairs $(w, V)$, where $w\in A_n$ and $V$ is an irreducible representation of $A_n$ such that $w$ admits a non-zero invariant vector in $V$. We also establish new global conjugacy classes for alternating groups, thereby giving a new proof of a result of Heide and Zalessky on the existence of such classes.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Authors:
Saurabh Srivastava,
Annarose M B,
Anto P V,
Shashank Menon,
Ajay Sukumar,
Adwaith Samod T,
Alan Philipose,
Stevin Prince,
Sooraj Thomas
Abstract:
We propose a framework for robust evaluation of reasoning capabilities of language models, using functional variants of benchmarks. Models that solve a reasoning test should exhibit no difference in performance over the static version of a problem compared to a snapshot of the functional variant. We have rewritten the relevant fragment of the MATH benchmark into its functional variant MATH(), with…
▽ More
We propose a framework for robust evaluation of reasoning capabilities of language models, using functional variants of benchmarks. Models that solve a reasoning test should exhibit no difference in performance over the static version of a problem compared to a snapshot of the functional variant. We have rewritten the relevant fragment of the MATH benchmark into its functional variant MATH(), with functionalization of other benchmarks to follow. When evaluating current state-of-the-art models over snapshots of MATH(), we find a reasoning gap -- the percentage difference between the static and functional accuracies. We find reasoning gaps from 58.35% to 80.31% among the state-of-the-art closed and open weights models that perform well on static benchmarks, with the caveat that the gaps are likely to be smaller with more sophisticated prompting strategies. Here we show that models which anecdotally have good reasoning performance over real-world tasks, have quantifiable lower gaps, motivating the open problem of building "gap 0" models. Code for evaluation and new evaluation datasets, three MATH() snapshots, are publicly available at https://github.com/consequentai/fneval/.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Bivariate Bernstein Fractal Interpolation and Numerical Integration on Triangular Domains
Authors:
Aparna M. P.,
P. Paramanathan
Abstract:
The fundamental aim of this paper is to provide the approximation and numerical integration of a discrete set of data points with Bernstein fractal approach. Using Bernstein polynomials in the iterated function system, the paper initially proposes the numerical integration formula for the data set corresponding to univariate functions. The proposed formula of integration is shown to be convergent…
▽ More
The fundamental aim of this paper is to provide the approximation and numerical integration of a discrete set of data points with Bernstein fractal approach. Using Bernstein polynomials in the iterated function system, the paper initially proposes the numerical integration formula for the data set corresponding to univariate functions. The proposed formula of integration is shown to be convergent by examining the data sets of certain weierstrass functions.
The paper then extends the Bernstein fractal approximation and numerical integration technique to two dimensional interpolating regions. Bernstein polynomials defined over triangular domain has been used for the purpose. The triangular domain has been partitioned and the newly generated points are assigned colors in a particular manner to maintain the chromatic number as 3. Following the above mentioned construction and approximation of bivariate Bernstein fractal interpolation functions, the paper introduces the numerical double integration formula using the constructed functions. The convergence of the double integration formula towards the actual integral value of the data sets is displayed with the help of some examples including the benchmark functions. Both the newly introduced iterated function systems are verified for their hyperbolicity and the resultant fractal interpolation functions are shown to be continuous.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Hybrid subterahertz atmospheric pressure plasmatron for plasma chemical applications
Authors:
Sintsov S. V.,
Vodopyanov A. V.,
Mansfeld D. A.,
Fokin A. P.,
Ananichev A. A.,
Goryunov A. A.,
Preobrazhensky E. I.,
Chekmarev N. V.,
Glyavin M. Yu
Abstract:
This paper presents the results of an experimental study of a new hybrid plasmatron scheme, which was used to realize a gas discharge at atmospheric pressure supported by continuous focused submillimeter radiation with a frequency of 263 GHz. The implemented design allowed organizing a self-consistent interaction between submillimeter radiation and the supercritical plasma in a localized area both…
▽ More
This paper presents the results of an experimental study of a new hybrid plasmatron scheme, which was used to realize a gas discharge at atmospheric pressure supported by continuous focused submillimeter radiation with a frequency of 263 GHz. The implemented design allowed organizing a self-consistent interaction between submillimeter radiation and the supercritical plasma in a localized area both in terms of gas flow and electrodynamic. It is experimentally shown that the gas discharge absorbs up to 80% of the introduced submillimeter radiation power. The hybrid subterahertz plasmatron as an effective reactor for non-equilibrium plasma chemical processes was tested for the atmospheric nitrogen fixation.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Dynamic Multi Color Switching using Ultrathin Vanadium Oxide on Aluminium based Asymmetric Fabry-Perot Resonant Structure
Authors:
Shubhangi Saini,
Ashok P,
Amit Verma
Abstract:
Vanadium dioxide ($VO_{2}$) exhibits strong infrared optical switching due to its insulator-metal phase-transition property. However, in the visible wavelengths, it's intrinsic optical switching is quite low. Current research explores solutions like multilayering, intricate structural patterning, high thermal budget processes and costly metals for improved color switching. Nonetheless, the color g…
▽ More
Vanadium dioxide ($VO_{2}$) exhibits strong infrared optical switching due to its insulator-metal phase-transition property. However, in the visible wavelengths, it's intrinsic optical switching is quite low. Current research explores solutions like multilayering, intricate structural patterning, high thermal budget processes and costly metals for improved color switching. Nonetheless, the color gamut coverage with these methodologies remains notably limited. This work overcomes these limitations and demonstrates dynamic multi-colour switching covering a large color gamut using a simple, unpatterned, ultrathin ($\sim$ $\fracλ{14}$, where wavelength $λ$ is taken as 575 nm at the center of visible spectrum) asymmetric Fabry-Pérot structure of $VO_{2}$ on Aluminium (Al). We use the transfer matrix method to design the $VO_{2}/Aluminium\,(Al)/Sapphire$ structure for maximum visible reflectance switching. $VO_{2}$ films are synthesized using a simple, low thermal budget atmospheric oxidation of Vanadium (V). With varying oxidation durations, different colors of the oxidized samples are observed. Consistent and reversible color-switching is observed visibly and in reflectance measurements with the change in temperature from low (RT $\sim$ 30$^{\circ}$C) to high (HT $\sim$ 100$^{\circ}$C) or vice versa due to the phase transition property of the $VO_{2}$ layer in the structure. Compared to the existing studies, this work shows a significant change in chromaticities and covers a large color gamut when plotted on the CIE chromaticity diagram. This work has potential applications in the fields of display, thermochromic structures, and visible camouflage.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Bilayer Vanadium Dioxide Thin Film with Elevated Transition Temperatures and High Resistance Switching
Authors:
Achintya Dutta,
Ashok P,
Amit Verma
Abstract:
Despite widespread interest in the phase-change applications of vanadium dioxide (VO$_2$), the fabrication of high-quality VO$_2$ thin films with elevated transition temperatures (TIMT) and high Insulator-Metal-Transition resistance switching still remains a challenge. This study introduces a two-step atmospheric oxidation approach to fabricate bilayer VO$_{2-x}$/VO$_2$ films on a c-plane sapphire…
▽ More
Despite widespread interest in the phase-change applications of vanadium dioxide (VO$_2$), the fabrication of high-quality VO$_2$ thin films with elevated transition temperatures (TIMT) and high Insulator-Metal-Transition resistance switching still remains a challenge. This study introduces a two-step atmospheric oxidation approach to fabricate bilayer VO$_{2-x}$/VO$_2$ films on a c-plane sapphire substrate. To quantify the impact of the VO$_2$ buffer layer, a single-layer VO$_2$ film of the same thickness was also fabricated. The bilayer VO$_{2-x}$/VO$_2$ films wherein the top VO$_{2-x}$ film was under-oxidized demonstrated an elevation in TIMT reaching ~97 $^\circ$C, one of the highest reported to date for VO$_2$ films and is achieved in a do**-free manner. Our results also reveal a one-order increase in resistance switching, with the optimum bilayer VO$_2$/VO$_2$ film exhibiting ~3.6 orders of switching from 25 $^\circ$C to 110 $^\circ$C, compared to the optimum single-layer VO$_2$ reference film. This is accompanied by a one-order decrease in the on-state resistance in its metallic phase. The elevation in TIMT, coupled with increased strain extracted from the XRD characterization of the bilayer film, suggests the possibility of compressive strain along the c-axis. These VO$_{2-x}$/VO$_2$ films also demonstrate a significant change in the slope of their resistance vs temperature curves contrary to the conventional smooth transition. This feature was ascribed to the rutile/monoclinic quasi-heterostructure formed due to the top VO$_{2-x}$ film having a reduced TIMT. Our findings carry significant implications for both the lucid fabrication of VO$_2$ thin film devices as well as the study of phase transitions in correlated oxides.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Guided Prompting in SAM for Weakly Supervised Cell Segmentation in Histopathological Images
Authors:
Aayush Kumar Tyagi,
Vaibhav Mishra,
Prathosh A. P.,
Mausam
Abstract:
Cell segmentation in histopathological images plays a crucial role in understanding, diagnosing, and treating many diseases. However, data annotation for this is expensive since there can be a large number of cells per image, and expert pathologists are needed for labelling images. Instead, our paper focuses on using weak supervision -- annotation from related tasks -- to induce a segmenter. Recen…
▽ More
Cell segmentation in histopathological images plays a crucial role in understanding, diagnosing, and treating many diseases. However, data annotation for this is expensive since there can be a large number of cells per image, and expert pathologists are needed for labelling images. Instead, our paper focuses on using weak supervision -- annotation from related tasks -- to induce a segmenter. Recent foundation models, such as Segment Anything (SAM), can use prompts to leverage additional supervision during inference. SAM has performed remarkably well in natural image segmentation tasks; however, its applicability to cell segmentation has not been explored.
In response, we investigate guiding the prompting procedure in SAM for weakly supervised cell segmentation when only bounding box supervision is available. We develop two workflows: (1) an object detector's output as a test-time prompt to SAM (D-SAM), and (2) SAM as pseudo mask generator over training data to train a standalone segmentation model (SAM-S). On finding that both workflows have some complementary strengths, we develop an integer programming-based approach to reconcile the two sets of segmentation masks, achieving yet higher performance. We experiment on three publicly available cell segmentation datasets namely, ConSep, MoNuSeg, and TNBC, and find that all SAM-based solutions hugely outperform existing weakly supervised image segmentation models, obtaining 9-15 pt Dice gains.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
MalFake: A Multimodal Fake News Identification for Malayalam using Recurrent Neural Networks and VGG-16
Authors:
Adhish S. Sujan,
Ajitha. V,
Aleena Benny,
Amiya M. P.,
V. S. Anoop
Abstract:
The amount of news being consumed online has substantially expanded in recent years. Fake news has become increasingly common, especially in regional languages like Malayalam, due to the rapid publication and lack of editorial standards on some online sites. Fake news may have a terrible effect on society, causing people to make bad judgments, lose faith in authorities, and even engage in violent…
▽ More
The amount of news being consumed online has substantially expanded in recent years. Fake news has become increasingly common, especially in regional languages like Malayalam, due to the rapid publication and lack of editorial standards on some online sites. Fake news may have a terrible effect on society, causing people to make bad judgments, lose faith in authorities, and even engage in violent behavior. When we take into the context of India, there are many regional languages, and fake news is spreading in every language. Therefore, providing efficient techniques for identifying false information in regional tongues is crucial. Until now, little to no work has been done in Malayalam, extracting features from multiple modalities to classify fake news. Multimodal approaches are more accurate in detecting fake news, as features from multiple modalities are extracted to build the deep learning classification model. As far as we know, this is the first piece of work in Malayalam that uses multimodal deep learning to tackle false information. Models trained with more than one modality typically outperform models taught with only one modality. Our study in the Malayalam language utilizing multimodal deep learning is a significant step toward more effective misinformation detection and mitigation.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
CoNO: Complex Neural Operator for Continuous Dynamical Systems
Authors:
Karn Tiwari,
N M Anoop Krishnan,
Prathosh A P
Abstract:
Neural operators extend data-driven models to map between infinite-dimensional functional spaces. These models have successfully solved continuous dynamical systems represented by differential equations, viz weather forecasting, fluid flow, or solid mechanics. However, the existing operators still rely on real space, thereby losing rich representations potentially captured in the complex space by…
▽ More
Neural operators extend data-driven models to map between infinite-dimensional functional spaces. These models have successfully solved continuous dynamical systems represented by differential equations, viz weather forecasting, fluid flow, or solid mechanics. However, the existing operators still rely on real space, thereby losing rich representations potentially captured in the complex space by functional transforms. In this paper, we introduce a Complex Neural Operator (CoNO), that parameterizes the integral kernel in the complex fractional Fourier domain. Additionally, the model employing a complex-valued neural network along with aliasing-free activation functions preserves the complex values and complex algebraic properties, thereby enabling improved representation, robustness to noise, and generalization. We show that the model effectively captures the underlying partial differential equation with a single complex fractional Fourier transform. We perform an extensive empirical evaluation of CoNO on several datasets and additional tasks such as zero-shot super-resolution, evaluation of out-of-distribution data, data efficiency, and robustness to noise. CoNO exhibits comparable or superior performance to all the state-of-the-art models in these tasks. Altogether, CoNO presents a robust and superior model for modeling continuous dynamical systems, providing a fillip to scientific machine learning.
△ Less
Submitted 4 October, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
CoDBench: A Critical Evaluation of Data-driven Models for Continuous Dynamical Systems
Authors:
Priyanshu Burark,
Karn Tiwari,
Meer Mehran Rashid,
Prathosh A P,
N M Anoop Krishnan
Abstract:
Continuous dynamical systems, characterized by differential equations, are ubiquitously used to model several important problems: plasma dynamics, flow through porous media, weather forecasting, and epidemic dynamics. Recently, a wide range of data-driven models has been used successfully to model these systems. However, in contrast to established fields like computer vision, limited studies are a…
▽ More
Continuous dynamical systems, characterized by differential equations, are ubiquitously used to model several important problems: plasma dynamics, flow through porous media, weather forecasting, and epidemic dynamics. Recently, a wide range of data-driven models has been used successfully to model these systems. However, in contrast to established fields like computer vision, limited studies are available analyzing the strengths and potential applications of different classes of these models that could steer decision-making in scientific machine learning. Here, we introduce CodBench, an exhaustive benchmarking suite comprising 11 state-of-the-art data-driven models for solving differential equations. Specifically, we comprehensively evaluate 4 distinct categories of models, viz., feed forward neural networks, deep operator regression models, frequency-based neural operators, and transformer architectures against 8 widely applicable benchmark datasets encompassing challenges from fluid and solid mechanics. We conduct extensive experiments, assessing the operators' capabilities in learning, zero-shot super-resolution, data efficiency, robustness to noise, and computational efficiency. Interestingly, our findings highlight that current operators struggle with the newer mechanics datasets, motivating the need for more robust neural operators. All the datasets and codes will be shared in an easy-to-use fashion for the scientific community. We hope this resource will be an impetus for accelerated progress and exploration in modeling dynamical systems.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Adapt then Unlearn: Exploiting Parameter Space Semantics for Unlearning in Generative Adversarial Networks
Authors:
Piyush Tiwary,
Atri Guha,
Subhodip Panda,
Prathosh A. P
Abstract:
The increased attention to regulating the outputs of deep generative models, driven by growing concerns about privacy and regulatory compliance, has highlighted the need for effective control over these models. This necessity arises from instances where generative models produce outputs containing undesirable, offensive, or potentially harmful content. To tackle this challenge, the concept of mach…
▽ More
The increased attention to regulating the outputs of deep generative models, driven by growing concerns about privacy and regulatory compliance, has highlighted the need for effective control over these models. This necessity arises from instances where generative models produce outputs containing undesirable, offensive, or potentially harmful content. To tackle this challenge, the concept of machine unlearning has emerged, aiming to forget specific learned information or to erase the influence of undesired data subsets from a trained model. The objective of this work is to prevent the generation of outputs containing undesired features from a pre-trained GAN where the underlying training data set is inaccessible. Our approach is inspired by a crucial observation: the parameter space of GANs exhibits meaningful directions that can be leveraged to suppress specific undesired features. However, such directions usually result in the degradation of the quality of generated samples. Our proposed method, known as 'Adapt-then-Unlearn,' excels at unlearning such undesirable features while also maintaining the quality of generated samples. This method unfolds in two stages: in the initial stage, we adapt the pre-trained GAN using negative samples provided by the user, while in the subsequent stage, we focus on unlearning the undesired feature. During the latter phase, we train the pre-trained GAN using positive samples, incorporating a repulsion regularizer. This regularizer encourages the model's parameters to be away from the parameters associated with the adapted model from the first stage while also maintaining the quality of generated samples. To the best of our knowledge, our approach stands as first method addressing unlearning in GANs. We validate the effectiveness of our method through comprehensive experiments.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
SN 2022jli: a type Ic supernova with periodic modulation of its light curve and an unusually long rise
Authors:
Moore T.,
Smartt S. J.,
Nicholl M.,
Srivastav S.,
Stevance H. F.,
Jess D. B.,
Grant S. D. T.,
Fulton M. D.,
Rhodes L.,
Sim S. A.,
Hirai R.,
Podsiadlowski P.,
Anderson J. P.,
Ashall C.,
Bate W.,
Fender R.,
Gutierrez C. P.,
Howell D. A.,
Huber M. E.,
Inserra C.,
Leloudas G.,
Monard L. A. G.,
Muller-Bravo T. E.,
Shappee B. J.,
Smith K. W.
, et al. (20 additional authors not shown)
Abstract:
We present multi-wavelength photometry and spectroscopy of SN 2022jli, an unprecedented Type Ic supernova discovered in the galaxy NGC 157 at a distance of $\approx$ 23 Mpc. The multi-band light curves reveal many remarkable characteristics. Peaking at a magnitude of $g=15.11\pm0.02$, the high-cadence photometry reveals 12.5$\pm0.2\ $day periodic undulations superimposed on the 200 day supernova d…
▽ More
We present multi-wavelength photometry and spectroscopy of SN 2022jli, an unprecedented Type Ic supernova discovered in the galaxy NGC 157 at a distance of $\approx$ 23 Mpc. The multi-band light curves reveal many remarkable characteristics. Peaking at a magnitude of $g=15.11\pm0.02$, the high-cadence photometry reveals 12.5$\pm0.2\ $day periodic undulations superimposed on the 200 day supernova decline. This periodicity is observed in the light curves from nine separate filter and instrument configurations with peak-to-peak amplitudes of $\simeq$ 0.1 mag. This is the first time that repeated periodic oscillations, over many cycles, have been detected in a supernova light curve. SN 2022jli also displays an extreme early excess which fades over $\approx$ 25 days followed by a rise to a peak luminosity of $L_{\rm opt} = 10^{42.1}$ erg s$^{-1}$. Although the exact explosion epoch is not constrained by data, the time from explosion to maximum light is $\gtrsim$ 59 days. The luminosity can be explained by a large ejecta mass ($M_{\rm ej}\approx12\pm6$M$_{\odot}$) powered by $^{56}$Ni but we find difficulty in quantitatively modelling the early excess with circumstellar interaction and cooling. Collision between the supernova ejecta and a binary companion is a possible source of this emission. We discuss the origin of the periodic variability in the light curve, including interaction of the SN ejecta with nested shells of circumstellar matter and neutron stars colliding with binary companions.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
HiveLink, an IoT based Smart Bee Hive Monitoring System
Authors:
Ajwin Dsouza,
Aditya P,
Sameer Hegde
Abstract:
HiveLink, the IoT-based Smart Bee Hive Monitoring System addresses the challenges faced by beekeepers in managing the influence of environmental impact, diseases, and collapse in honey bee colonies. Integrated with advanced sensors, the system monitors temperature, humidity, hive weight, and diurnal cycle. Leveraging IoT technology, the system provides real-time data, remote connectivity, and acti…
▽ More
HiveLink, the IoT-based Smart Bee Hive Monitoring System addresses the challenges faced by beekeepers in managing the influence of environmental impact, diseases, and collapse in honey bee colonies. Integrated with advanced sensors, the system monitors temperature, humidity, hive weight, and diurnal cycle. Leveraging IoT technology, the system provides real-time data, remote connectivity, and actionable insights for beekeepers. Monitoring the hive with the system enables early disease detection, proactive interventions, and optimized hive management. Minimizing manual inspections, enhancing productivity, and promoting sustainable practices to mitigate environmental impact and support honey bee populations. Therefore, this system is a demonstration of technology-driven solution to ensure the well-being of bee hives by facilitating data-driven decision-making and contributes to the resilience of beekee** in the face of diverse challenges.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Enhancing Knee Osteoarthritis severity level classification using diffusion augmented images
Authors:
Paleti Nikhil Chowdary,
Gorantla V N S L Vishnu Vardhan,
Menta Sai Akshay,
Menta Sai Aashish,
Vadlapudi Sai Aravind,
Garapati Venkata Krishna Rayalu,
Aswathy P
Abstract:
This research paper explores the classification of knee osteoarthritis (OA) severity levels using advanced computer vision models and augmentation techniques. The study investigates the effectiveness of data preprocessing, including Contrast-Limited Adaptive Histogram Equalization (CLAHE), and data augmentation using diffusion models. Three experiments were conducted: training models on the origin…
▽ More
This research paper explores the classification of knee osteoarthritis (OA) severity levels using advanced computer vision models and augmentation techniques. The study investigates the effectiveness of data preprocessing, including Contrast-Limited Adaptive Histogram Equalization (CLAHE), and data augmentation using diffusion models. Three experiments were conducted: training models on the original dataset, training models on the preprocessed dataset, and training models on the augmented dataset. The results show that data preprocessing and augmentation significantly improve the accuracy of the models. The EfficientNetB3 model achieved the highest accuracy of 84\% on the augmented dataset. Additionally, attention visualization techniques, such as Grad-CAM, are utilized to provide detailed attention maps, enhancing the understanding and trustworthiness of the models. These findings highlight the potential of combining advanced models with augmented data and attention visualization for accurate knee OA severity classification.
△ Less
Submitted 17 September, 2023;
originally announced September 2023.
-
Neural Discovery of Permutation Subgroups
Authors:
Pavan Karjol,
Rohan Kashyap,
Prathosh A P
Abstract:
We consider the problem of discovering subgroup $H$ of permutation group $S_{n}$. Unlike the traditional $H$-invariant networks wherein $H$ is assumed to be known, we present a method to discover the underlying subgroup, given that it satisfies certain conditions. Our results show that one could discover any subgroup of type $S_{k} (k \leq n)$ by learning an $S_{n}$-invariant function and a linear…
▽ More
We consider the problem of discovering subgroup $H$ of permutation group $S_{n}$. Unlike the traditional $H$-invariant networks wherein $H$ is assumed to be known, we present a method to discover the underlying subgroup, given that it satisfies certain conditions. Our results show that one could discover any subgroup of type $S_{k} (k \leq n)$ by learning an $S_{n}$-invariant function and a linear transformation. We also prove similar results for cyclic and dihedral subgroups. Finally, we provide a general theorem that can be extended to discover other subgroups of $S_{n}$. We also demonstrate the applicability of our results through numerical experiments on image-digit sum and symmetric polynomial regression tasks.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
A Unified Framework for Discovering Discrete Symmetries
Authors:
Pavan Karjol,
Rohan Kashyap,
Aditya Gopalan,
Prathosh A. P
Abstract:
We consider the problem of learning a function respecting a symmetry from among a class of symmetries. We develop a unified framework that enables symmetry discovery across a broad range of subgroups including locally symmetric, dihedral and cyclic subgroups. At the core of the framework is a novel architecture composed of linear, matrix-valued and non-linear functions that expresses functions inv…
▽ More
We consider the problem of learning a function respecting a symmetry from among a class of symmetries. We develop a unified framework that enables symmetry discovery across a broad range of subgroups including locally symmetric, dihedral and cyclic subgroups. At the core of the framework is a novel architecture composed of linear, matrix-valued and non-linear functions that expresses functions invariant to these subgroups in a principled manner. The structure of the architecture enables us to leverage multi-armed bandit algorithms and gradient descent to efficiently optimize over the linear and the non-linear functions, respectively, and to infer the symmetry that is ultimately learnt. We also discuss the necessity of the matrix-valued functions in the architecture. Experiments on image-digit sum and polynomial regression tasks demonstrate the effectiveness of our approach.
△ Less
Submitted 27 October, 2023; v1 submitted 6 September, 2023;
originally announced September 2023.
-
GenSelfDiff-HIS: Generative Self-Supervision Using Diffusion for Histopathological Image Segmentation
Authors:
Vishnuvardhan Purma,
Suhas Srinath,
Seshan Srirangarajan,
Aanchal Kakkar,
Prathosh A. P
Abstract:
Histopathological image segmentation is a laborious and time-intensive task, often requiring analysis from experienced pathologists for accurate examinations. To reduce this burden, supervised machine-learning approaches have been adopted using large-scale annotated datasets for histopathological image analysis. However, in several scenarios, the availability of large-scale annotated data is a bot…
▽ More
Histopathological image segmentation is a laborious and time-intensive task, often requiring analysis from experienced pathologists for accurate examinations. To reduce this burden, supervised machine-learning approaches have been adopted using large-scale annotated datasets for histopathological image analysis. However, in several scenarios, the availability of large-scale annotated data is a bottleneck while training such models. Self-supervised learning (SSL) is an alternative paradigm that provides some respite by constructing models utilizing only the unannotated data which is often abundant. The basic idea of SSL is to train a network to perform one or many pseudo or pretext tasks on unannotated data and use it subsequently as the basis for a variety of downstream tasks. It is seen that the success of SSL depends critically on the considered pretext task. While there have been many efforts in designing pretext tasks for classification problems, there haven't been many attempts on SSL for histopathological segmentation. Motivated by this, we propose an SSL approach for segmenting histopathological images via generative diffusion models in this paper. Our method is based on the observation that diffusion models effectively solve an image-to-image translation task akin to a segmentation task. Hence, we propose generative diffusion as the pretext task for histopathological image segmentation. We also propose a multi-loss function-based fine-tuning for the downstream task. We validate our method using several metrics on two publically available datasets along with a newly proposed head and neck (HN) cancer dataset containing hematoxylin and eosin (H\&E) stained images along with annotations. Codes will be made public at https://github.com/PurmaVishnuVardhanReddy/GenSelfDiff-HIS.git.
△ Less
Submitted 4 September, 2023;
originally announced September 2023.
-
On the Existence of Elementwise Invariant Vectors in Representations of Symmetric Groups
Authors:
Amrutha P,
Amritanshu Prasad,
Velmurugan S
Abstract:
We determine when a permutation with cycle type $μ$ admits a non-zero invariant vector in the irreducible representation $V_λ$ of the symmetric group. We find that a majority of pairs $(λ,μ)$ have this property, with only a few simple exceptions.
We determine when a permutation with cycle type $μ$ admits a non-zero invariant vector in the irreducible representation $V_λ$ of the symmetric group. We find that a majority of pairs $(λ,μ)$ have this property, with only a few simple exceptions.
△ Less
Submitted 30 October, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
Correlating Medi-Claim Service by Deep Learning Neural Networks
Authors:
Jayanthi Vajiram,
Negha Senthil,
Nean Adhith. P
Abstract:
Medical insurance claims are of organized crimes related to patients, physicians, diagnostic centers, and insurance providers, forming a chain reaction that must be monitored constantly. These kinds of frauds affect the financial growth of both insured people and health insurance companies. The Convolution Neural Network architecture is used to detect fraudulent claims through a correlation study…
▽ More
Medical insurance claims are of organized crimes related to patients, physicians, diagnostic centers, and insurance providers, forming a chain reaction that must be monitored constantly. These kinds of frauds affect the financial growth of both insured people and health insurance companies. The Convolution Neural Network architecture is used to detect fraudulent claims through a correlation study of regression models, which helps to detect money laundering on different claims given by different providers. Supervised and unsupervised classifiers are used to detect fraud and non-fraud claims.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Exploration of legal implications of air and space travel for international and domestic travel and the Environment
Authors:
Jayanthi Vajiram,
Negha Senthil,
Nean Adhith. P,
Ritikaa. VN
Abstract:
The rapid growth of air and space travel in recent years has resulted in an increased demand for legal regulation in the aviation and aerospace fields. This paper provides an overview of air and space law, including the topics of aircraft accident investigations, air traffic control, international borders and law, and the regulation of space activities. With the increasing complexity of air and sp…
▽ More
The rapid growth of air and space travel in recent years has resulted in an increased demand for legal regulation in the aviation and aerospace fields. This paper provides an overview of air and space law, including the topics of aircraft accident investigations, air traffic control, international borders and law, and the regulation of space activities. With the increasing complexity of air and space travel, it is important to understand the legal implications of these activities. This paper examines the various legal aspects of air and space law, including the roles of national governments, international organizations, and private entities. It also provides an overview of the legal frameworks that govern these activities and the implications of international law. Finally, it considers the potential for future developments in the field of air and space law. This paper provides a comprehensive overview of the legal aspects of air and space travel and their implications for international and domestic travel, as well as for international business and other activities in the air and space domains.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
Multi-mission view of low-luminosity 'obscured' phase of GRS 1915+105
Authors:
Athulya M. P.,
Anuj Nandi
Abstract:
GRS 1915+105 is observed in an 'obscured' phase since May 2019, exhibiting steady and low X-ray luminosities, while being intervened by sporadic re-brightenings. In this work, we perform a comprehensive and wide-band analysis of the spectral and timing properties of the source during the period $2019-2021$ using AstroSat (SXT: $0.5-8$ keV; LAXPC: $3-60$ keV), NICER ($0.5-12$ keV), and NuSTAR (…
▽ More
GRS 1915+105 is observed in an 'obscured' phase since May 2019, exhibiting steady and low X-ray luminosities, while being intervened by sporadic re-brightenings. In this work, we perform a comprehensive and wide-band analysis of the spectral and timing properties of the source during the period $2019-2021$ using AstroSat (SXT: $0.5-8$ keV; LAXPC: $3-60$ keV), NICER ($0.5-12$ keV), and NuSTAR ($3-60$ keV) observations. Spectral analysis reveals the presence of a highly variable obscurer (N$_{H_{1}}\sim~10^{22} - 10^{24}$ atoms cm$^{-2}$) throughout the observation period. Source is detected in the Low/Hard state for most of the time, with the spectra being described by a Comptonised component ($Γ\sim 1.16 - 1.79$, kT$_{e}\sim 2-31$ keV). The source spectra steepen ($Γ\sim2.5$) indicating softening of the spectrum during the rise of the re-brightenings. Various emission and absorption lines corresponding to the neutral Fe-K$α$, Fe-XXV K$α$, Fe-XXVI K$α$, and the Ni-XXVIII K$α$ were detected with equivalent widths varying between 70 eV $-$ 3.5 keV. The column density of the absorbing plasma varied between $10^{16} - 10^{18}$ atoms cm$^{-2}$ at a distance $\leq2\times$10$^{10}$ cm. Interestingly, the source is also seen exhibiting various variability classes ($ρ, λ, δ, χ$) at relatively low luminosities ($\sim$0.01L$_{Edd}$) during the re-brightening phases. Different variability classes show signature of QPOs ($ν_{QPO}$: 20--180 mHz, rms$_{QPO}$: 7.5% - 16%). The source showed a maximum bolometric luminosity {(L$_{bol}$)} of $\sim$0.01L$_{Edd}$ (Re-brightening phases) and a minimum L$_{bol}$ of 0.004L$_{Edd}$ (Quiet phase) during the period. We discuss the possible disc dynamics around the black hole during this low-luminosity `obscured' phase.
△ Less
Submitted 9 July, 2023;
originally announced July 2023.
-
Hypergraph representation in brain network analysis
Authors:
Anagha P,
Selvakumar R
Abstract:
For the study of functional aspects of the brain network. This paper is a study on the hypergraph representation, based on the functional regions of the brain network. A new parameter that can measure how many multifunctioning regions each function contains and thereby the correlation of other functions with each function.
For the study of functional aspects of the brain network. This paper is a study on the hypergraph representation, based on the functional regions of the brain network. A new parameter that can measure how many multifunctioning regions each function contains and thereby the correlation of other functions with each function.
△ Less
Submitted 19 December, 2023; v1 submitted 5 May, 2023;
originally announced May 2023.
-
Analyzing travel time reliability of a bus route in a limited data set scenario: A case study
Authors:
Ashwini B P,
R Sumathi,
Sudhira H S
Abstract:
In this information era commuters prefer to know a reliable travel time to plan ahead of their journey using both public and private modes. In this direction reliability analysis using the location data of the buses is conducted in two folds in the current work; (i) Reliability analysis of a public transit service at route level, and (ii) Travel time reliability analysis of a route utilizing the l…
▽ More
In this information era commuters prefer to know a reliable travel time to plan ahead of their journey using both public and private modes. In this direction reliability analysis using the location data of the buses is conducted in two folds in the current work; (i) Reliability analysis of a public transit service at route level, and (ii) Travel time reliability analysis of a route utilizing the location data of the buses. The reliability parameters assessed for public transit service are headway, passenger waiting time, travel speed, and travel time as per the Service Level Benchmarks for Urban Transport by the National Urban Transport Policy, Government of India. And travel time reliability parameters such as Buffer Time Index, Travel Time Index, and Planning Time Index are assessed as per Federal Highway Administration, Department of Transportation, U S. The study is conducted in Tumakuru city, India for a significant bus route in a limited data sources scenario. The results suggest that (i) the Level of Service of the public transit service needs improvement. (ii)around 30% excess of average travel time is needed as buffer time. (iii) more than double the amount of free flow travel time must be planned during peak hours and in the worst case. In the future, the analysis conducted for the route can be extended for citywide performance analysis in both folds. Also, the same method can be applied to cities with similar demographics and traffic-related infrastructure.
△ Less
Submitted 31 March, 2023;
originally announced March 2023.
-
Bayesian Pseudo-Coresets via Contrastive Divergence
Authors:
Piyush Tiwary,
Kumar Shubham,
Vivek V. Kashyap,
Prathosh A. P
Abstract:
Bayesian methods provide an elegant framework for estimating parameter posteriors and quantification of uncertainty associated with probabilistic models. However, they often suffer from slow inference times. To address this challenge, Bayesian Pseudo-Coresets (BPC) have emerged as a promising solution. BPC methods aim to create a small synthetic dataset, known as pseudo-coresets, that approximates…
▽ More
Bayesian methods provide an elegant framework for estimating parameter posteriors and quantification of uncertainty associated with probabilistic models. However, they often suffer from slow inference times. To address this challenge, Bayesian Pseudo-Coresets (BPC) have emerged as a promising solution. BPC methods aim to create a small synthetic dataset, known as pseudo-coresets, that approximates the posterior inference achieved with the original dataset. This approximation is achieved by optimizing a divergence measure between the true posterior and the pseudo-coreset posterior. Various divergence measures have been proposed for constructing pseudo-coresets, with forward Kullback-Leibler (KL) divergence being the most successful. However, using forward KL divergence necessitates sampling from the pseudo-coreset posterior, often accomplished through approximate Gaussian variational distributions. Alternatively, one could employ Markov Chain Monte Carlo (MCMC) methods for sampling, but this becomes challenging in high-dimensional parameter spaces due to slow mixing. In this study, we introduce a novel approach for constructing pseudo-coresets by utilizing contrastive divergence. Importantly, optimizing contrastive divergence eliminates the need for approximations in the pseudo-coreset construction process. Furthermore, it enables the use of finite-step MCMC methods, alleviating the requirement for extensive mixing to reach a stationary distribution. To validate our method's effectiveness, we conduct extensive experiments on multiple datasets, demonstrating its superiority over existing BPC techniques.
△ Less
Submitted 8 May, 2024; v1 submitted 20 March, 2023;
originally announced March 2023.
-
Some Coupled Fixed Point Theorems for (ψ, φ)- contraction with Applications to Fractals
Authors:
Athul P,
D. Ramesh Kumar
Abstract:
In this paper, we obtain coupled fixed point theorem for (ψ, φ)-contractions under some generalized conditions on the real valued functions ψand φdefined on (0,\infinity). Also, we present a generalized version of coupled fixed point theorem for the same (ψ, φ)- contractions. A new approach to fractal generation using the relation between fractals and fixed points is given in light of these fixed…
▽ More
In this paper, we obtain coupled fixed point theorem for (ψ, φ)-contractions under some generalized conditions on the real valued functions ψand φdefined on (0,\infinity). Also, we present a generalized version of coupled fixed point theorem for the same (ψ, φ)- contractions. A new approach to fractal generation using the relation between fractals and fixed points is given in light of these fixed point theorems. We establish a new type of iterated function system consisting of generalized (ψ, φ)-contractions. We also extend those results to coupled fractals.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
On the stability of inhomogeneous fluids under acoustic fields
Authors:
Varun Kumar Rajendran,
Aravind Ram S P,
Karthick Subramani
Abstract:
In this work, we present the stability theory for inhomogeneous fluids subjected to standing acoustic fields. Starting from the first principles, the stability criterion is established for two fluids of different acoustic impedance separated by a plane interface. Through stability theory and numerical simulations we show that, in the presence of interfacial tension, the relocation of high-impedanc…
▽ More
In this work, we present the stability theory for inhomogeneous fluids subjected to standing acoustic fields. Starting from the first principles, the stability criterion is established for two fluids of different acoustic impedance separated by a plane interface. Through stability theory and numerical simulations we show that, in the presence of interfacial tension, the relocation of high-impedance fluid from anti-node to node occurs when the acoustic force overcomes interfacial tension force, which is in agreement with recent microchannel experiments. Furthermore, we establish an acoustic Bond number that characterizes stable (Bo_a < 1) and relocation (Bo_a > 1) regimes. Remarkably, it is found that the critical acoustic energy density required for relocation can be significantly reduced by increasing the channel height which could help design acoustofluidic microchannel devices that handle immiscible fluids.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
Experimental investigation on the performance of thermosyphon charging of a single-medium stratified storage system for concentrated solar power applications
Authors:
Dipti Ranjan Parida,
Saptarshi Basu,
Dhanush A P
Abstract:
Concentrated solar power (CSP) plants utilize two-tank, sensible-heat thermal energy storage (TES) for uninterrupted electricity generation. However, the cost for the design and operation of TES is expensive. Therefore, researchers are focusing on implementing single-tank storage. Additional cutbacks can be made by utilizing pump-less thermosyphon charging for the TES. But prior thermosyphon resea…
▽ More
Concentrated solar power (CSP) plants utilize two-tank, sensible-heat thermal energy storage (TES) for uninterrupted electricity generation. However, the cost for the design and operation of TES is expensive. Therefore, researchers are focusing on implementing single-tank storage. Additional cutbacks can be made by utilizing pump-less thermosyphon charging for the TES. But prior thermosyphon researches for TES are related to domestic water-heating systems of small-capacity (<100 liters) and low-temperature (<100 °C). Thus, investigations into thermosyphon charging for high-temperature storage are desired. This study focuses on thermosyphon-charging and storing of a single-medium stratified TES. The experiments were conducted on a 370 liters cylindrical storage (aspect ratio 4:1) with a heat-pipe system (3-liter volume) acting as a collector. Dowtherm-A oil was used as the heat transfer fluid (HTF), and the thermal expansion of HTF was accommodated in an expansion tank via two different designs (top and bottom connections from storage tank to expansion tank). Moreover, continuous and pulsatile charging are investigated for low (150 °C) and high (250 and 300 °C) temperatures. The results indicate that the maximum HTF temperature coming out of the heating pipes is ~25 °C more for the bottom-expansion design. Furthermore, it results in higher charging efficiency than the top-expansion setup for high-temperature studies. Finally, it is revealed that under design conditions, there are limits on the degree of thermal stratification achieved in the charging cycle and the maximum layover time allowable for interrupted charging. These results provide insights into the operational strategy of thermosyphon-charging stratified storage for CSP applications.
△ Less
Submitted 30 November, 2022;
originally announced November 2022.
-
Discrete Control in Real-World Driving Environments using Deep Reinforcement Learning
Authors:
Avinash Amballa,
Advaith P.,
Pradip Sasmal,
Sumohana Channappayya
Abstract:
Training self-driving cars is often challenging since they require a vast amount of labeled data in multiple real-world contexts, which is computationally and memory intensive. Researchers often resort to driving simulators to train the agent and transfer the knowledge to a real-world setting. Since simulators lack realistic behavior, these methods are quite inefficient. To address this issue, we…
▽ More
Training self-driving cars is often challenging since they require a vast amount of labeled data in multiple real-world contexts, which is computationally and memory intensive. Researchers often resort to driving simulators to train the agent and transfer the knowledge to a real-world setting. Since simulators lack realistic behavior, these methods are quite inefficient. To address this issue, we introduce a framework (perception, planning, and control) in a real-world driving environment that transfers the real-world environments into gaming environments by setting up a reliable Markov Decision Process (MDP). We propose variations of existing Reinforcement Learning (RL) algorithms in a multi-agent setting to learn and execute the discrete control in real-world environments. Experiments show that the multi-agent setting outperforms the single-agent setting in all the scenarios. We also propose reliable initialization, data augmentation, and training techniques that enable the agents to learn and generalize to navigate in a real-world environment with minimal input video data, and with minimal training. Additionally, to show the efficacy of our proposed algorithm, we deploy our method in the virtual driving environment TORCS.
△ Less
Submitted 30 November, 2022; v1 submitted 28 November, 2022;
originally announced November 2022.
-
BioJam Camp: toward justice through bioengineering and biodesign co-learning with youth
Authors:
Callie Chappell,
Henry A. -A.,
Elvia B. O.,
Emily B.,
Bailey B.,
Jacqueline C. -M.,
Caroline Daws,
Cristian F.,
Emiliano G.,
Page Goddard,
Xavier G.,
Anne Hu,
Gabriela J.,
Kelley Langhans,
Briana Martin-Villa,
Penny M. -S.,
Jennifer M.,
Soyang N.,
Melissa Ortiz,
Aryana P.,
Trisha S,
Corinne Takara,
Emily T.,
Paloma Vazquez,
Rolando Perez
, et al. (1 additional authors not shown)
Abstract:
BioJam is a political, artistic, and educational project in which Bay Area artists, scientists, and educators collaborate with youth and communities of color to address historical exclusion of their communities in STEM fields and reframe what science can be. As an intergenerational collective, we co-learn on topics of culture (social and biological), community (cultural and ecological), and creati…
▽ More
BioJam is a political, artistic, and educational project in which Bay Area artists, scientists, and educators collaborate with youth and communities of color to address historical exclusion of their communities in STEM fields and reframe what science can be. As an intergenerational collective, we co-learn on topics of culture (social and biological), community (cultural and ecological), and creativity. We reject the notion that increasing the number of scientists of color requires inculcation in the ways of the dominant culture. Instead, we center cultural practices, traditional ways of knowing, storytelling, art, experiential learning, and community engagement to break down the framing that positions these practices as distinct from science. The goal of this work is to realize a future in which the practice of science is relatable, accessible, and liberatory.
△ Less
Submitted 1 November, 2022;
originally announced November 2022.
-
SurfMyoAiR: A surface Electromyography based framework for Airwriting Recognition
Authors:
Ayush Tripathi,
Lalan Kumar,
Prathosh A. P.,
Suriya Prakash Muthukrishnan
Abstract:
Airwriting Recognition is the task of identifying letters written in free space with finger movement. Electromyography (EMG) is a technique used to record electrical activity during muscle contraction and relaxation as a result of movement and is widely used for gesture recognition. Most of the current research in gesture recognition is focused on identifying static gestures. However, dynamic gest…
▽ More
Airwriting Recognition is the task of identifying letters written in free space with finger movement. Electromyography (EMG) is a technique used to record electrical activity during muscle contraction and relaxation as a result of movement and is widely used for gesture recognition. Most of the current research in gesture recognition is focused on identifying static gestures. However, dynamic gestures are natural and user-friendly for being used as alternate input methods in Human-Computer Interaction applications. Airwriting recognition using EMG signals recorded from forearm muscles is therefore a viable solution. Since the user does not need to learn any new gestures and a large range of words can be formed by concatenating these letters, it is generalizable to a wider population. There has been limited work in recognition of airwriting using EMG signals and forms the core idea of the current work. The SurfMyoAiR dataset comprising of EMG signals recorded during writing English uppercase alphabets is constructed. Several different time-domain features to construct EMG envelope and two different time-frequency image representations: Short-Time Fourier Transform and Continuous Wavelet Transform were explored to form the input to a deep learning model for airwriting recognition. Several different deep learning architectures were exploited for this task. Additionally, the effect of various parameters such as signal length, window length and interpolation techniques on the recognition performance is comprehensively explored. The best-achieved accuracy was 78.50% and 62.19% in user-dependent and independent scenarios respectively by using Short-Time Fourier Transform in conjunction with a 2D Convolutional Neural Network based classifier. Airwriting has great potential as a user-friendly modality to be used as an alternate input method in Human-Computer Interaction applications.
△ Less
Submitted 31 October, 2022;
originally announced October 2022.
-
Bivariate fractal interpolation functions on triangular domain for numerical integration and approximation
Authors:
Aparna M P,
P Paramanathan
Abstract:
The primary objectives of this paper are to present the construction of bivariate fractal interpolation functions over triangular interpolating domain using the concept of vertex coloring and to propose a double integration formula for the constructed interpolation functions. Unlike the conventional constructions, each vertex in the partition of the triangular region has been assigned a color such…
▽ More
The primary objectives of this paper are to present the construction of bivariate fractal interpolation functions over triangular interpolating domain using the concept of vertex coloring and to propose a double integration formula for the constructed interpolation functions. Unlike the conventional constructions, each vertex in the partition of the triangular region has been assigned a color such that the chromatic number of the partition is 3. A new method for the partitioning of the triangle is proposed with a result concerning the chromatic number of its graph. Following the construction, a formula determining the vertical scaling factor is provided. With the newly defined vertical scaling factor, it is clearly observed that the value of the double integral coincides with the integral value calculated using fractal theory. Further, a relation connecting the fractal interpolation function with the equation of the plane passing through the vertices of the triangle is established. Convergence of the proposed method to the actual integral value is proven with sufficient lemmas and theorems. Sufficient examples are also provided to illustrate the method of construction and to verify the formula of double integration.
△ Less
Submitted 8 August, 2022;
originally announced October 2022.
-
Multi Spectral Switchable Infra-Red Reflectance Resonances in Highly Subwavelength Partially Oxidized Vanadium Thin Films
Authors:
Ashok P,
Yogesh Singh Chauhan,
Amit Verma
Abstract:
Phase transition materials are promising for realization of switchable optics. In this work, we show reflectance resonances in the near-infrared and long-wave infrared wavelengths in highly subwavelength partially oxidized Vanadium thin films. These partially oxidized films consist of a multilayer of Vanadium dioxide and Vanadium as shown using Raman spectroscopy and four-probe measurements. As Va…
▽ More
Phase transition materials are promising for realization of switchable optics. In this work, we show reflectance resonances in the near-infrared and long-wave infrared wavelengths in highly subwavelength partially oxidized Vanadium thin films. These partially oxidized films consist of a multilayer of Vanadium dioxide and Vanadium as shown using Raman spectroscopy and four-probe measurements. As Vanadium dioxide is a phase transition material that shows insulator to metal phase transition at 68 C, the observed infra-red resonances can be switched with temperature into a high-reflectance state. The wavelength of these resonances are passively tunable as a function of the oxidation duration. The obtained reflectance resonance at near-infrared wavelength red shifts from 1.78 um to 2.68 um with increasing oxidation duration while the long-wavelength infrared resonance blue shifts from 12.68 um to 9.96 um. To find the origin of the reflectance resonances, we model the reflectance spectra as a function of the oxidation duration using the transfer matrix method. The presented model captures the dual reflectance resonances reasonably well. These passive wavelength-tunable and switchable resonances with easy to fabricate lithography-free multilayer structure will be useful for multispectral applications such as camouflage, spectral selective microbolometer, and thermal management.
△ Less
Submitted 29 August, 2022;
originally announced August 2022.
-
Intelligent analysis of EEG signals to assess consumer decisions: A Study on Neuromarketing
Authors:
Nikunj Phutela,
Abhilash P,
Kaushik Sreevathsan,
B N Krupa
Abstract:
Neuromarketing is an emerging field that combines neuroscience and marketing to understand the factors that influence consumer decisions better. The study proposes a method to understand consumers' positive and negative reactions to advertisements (ads) and products by analysing electroencephalogram (EEG) signals. These signals are recorded using a low-cost single electrode headset from volunteers…
▽ More
Neuromarketing is an emerging field that combines neuroscience and marketing to understand the factors that influence consumer decisions better. The study proposes a method to understand consumers' positive and negative reactions to advertisements (ads) and products by analysing electroencephalogram (EEG) signals. These signals are recorded using a low-cost single electrode headset from volunteers belonging to the ages 18-22. A detailed subject dependent (SD) and subject independent (SI) analysis was performed employing machine learning methods like Naive Bayes (NB), Support Vector Machine (SVM), k-nearest neighbour and Decision Tree and the proposed deep learning (DL) model. SVM and NB yielded an accuracy (Acc.) of 0.63 for the SD analysis. In SI analysis, SVM performed better for the advertisement, product and gender-based analysis. Furthermore, the performance of the DL model was on par with that of SVM, especially, in product and ads-based analysis.
△ Less
Submitted 29 May, 2022;
originally announced June 2022.
-
ImAiR: Airwriting Recognition framework using Image Representation of IMU Signals
Authors:
Ayush Tripathi,
Arnab Kumar Mondal,
Lalan Kumar,
Prathosh A. P
Abstract:
The problem of Airwriting Recognition is focused on identifying letters written by movement of finger in free space. It is a type of gesture recognition where the dictionary corresponds to letters in a specific language. In particular, airwriting recognition using sensor data from wrist-worn devices can be used as a medium of user input for applications in Human-Computer Interaction (HCI). Recogni…
▽ More
The problem of Airwriting Recognition is focused on identifying letters written by movement of finger in free space. It is a type of gesture recognition where the dictionary corresponds to letters in a specific language. In particular, airwriting recognition using sensor data from wrist-worn devices can be used as a medium of user input for applications in Human-Computer Interaction (HCI). Recognition of in-air trajectories using such wrist-worn devices is limited in literature and forms the basis of the current work. In this paper, we propose an airwriting recognition framework by first encoding the time-series data obtained from a wearable Inertial Measurement Unit (IMU) on the wrist as images and then utilizing deep learning-based models for identifying the written alphabets. The signals recorded from 3-axis accelerometer and gyroscope in IMU are encoded as images using different techniques such as Self Similarity Matrix (SSM), Gramian Angular Field (GAF) and Markov Transition Field (MTF) to form two sets of 3-channel images. These are then fed to two separate classification models and letter prediction is made based on an average of the class conditional probabilities obtained from the two models. Several standard model architectures for image classification such as variants of ResNet, DenseNet, VGGNet, AlexNet and GoogleNet have been utilized. Experiments performed on two publicly available datasets demonstrate the efficacy of the proposed strategy. The code for our implementation will be made available at https://github.com/ayushayt/ImAiR.
△ Less
Submitted 8 September, 2022; v1 submitted 4 May, 2022;
originally announced May 2022.
-
Accretion Scenario of MAXI J1820+070 during 2018 Outbursts with Multi-mission Observations
Authors:
Geethu Prabhakar,
Samir Mandal,
Athulya M. P,
Anuj Nandi
Abstract:
We present a comprehensive spectral and temporal study of the black hole X-ray transient MAXI J1820+070 during its outbursts in 2018 using Swift/XRT, NICER, NuSTAR and AstroSat observations. The Swift/XRT and NICER spectral study shows a plateau in the light curve with spectral softening (hardness changes from $\sim$ $2.5$ to $2$) followed by a gradual decline without spectral softening during the…
▽ More
We present a comprehensive spectral and temporal study of the black hole X-ray transient MAXI J1820+070 during its outbursts in 2018 using Swift/XRT, NICER, NuSTAR and AstroSat observations. The Swift/XRT and NICER spectral study shows a plateau in the light curve with spectral softening (hardness changes from $\sim$ $2.5$ to $2$) followed by a gradual decline without spectral softening during the first outburst. Also, spectral modelling suggests that the first outburst is in the low/hard state throughout with a truncated disk whereas the thermal disk emission dominates during the second outburst. During the entire outburst, strong reflection signature (reflection fraction varies between $\sim$ $0.38 - 3.8$) is observed in the simultaneous wideband (NICER-NuSTAR, XRT-NuSTAR, AstroSat) data due to the presence of a dynamically evolving corona. The NICER timing analysis shows Quasi-periodic Oscillation (QPO) signatures and the characteristic frequency increases (decreases) in the plateau (decline) phase with time during the first outburst. We understand that the reduction of the electron cooling timescale in the corona due to spectral softening and the resonance oscillation with the local dynamical timescale may explain the above behavior of the source during the outburst. Also, we propose a possible scenario of outburst triggering and the associated accretion geometry of the source.
△ Less
Submitted 28 April, 2022;
originally announced April 2022.
-
Exploring the pattern of Emotion in children with ASD as an early biomarker through Recurring-Convolution Neural Network (R-CNN)
Authors:
Abirami S P,
Kousalya G,
Karthick R
Abstract:
Autism Spectrum Disorder (ASD) is found to be a major concern among various occupational therapists. The foremost challenge of this neurodevelopmental disorder lies in the fact of analyzing and exploring various symptoms of the children at their early stage of development. Such early identification could prop up the therapists and clinicians to provide proper assistive support to make the children…
▽ More
Autism Spectrum Disorder (ASD) is found to be a major concern among various occupational therapists. The foremost challenge of this neurodevelopmental disorder lies in the fact of analyzing and exploring various symptoms of the children at their early stage of development. Such early identification could prop up the therapists and clinicians to provide proper assistive support to make the children lead an independent life. Facial expressions and emotions perceived by the children could contribute to such early intervention of autism. In this regard, the paper implements in identifying basic facial expression and exploring their emotions upon a time variant factor. The emotions are analyzed by incorporating the facial expression identified through CNN using 68 landmark points plotted on the frontal face with a prediction network formed by RNN known as RCNN-FER system. The paper adopts R-CNN to take the advantage of increased accuracy and performance with decreased time complexity in predicting emotion as a textual network analysis. The papers proves better accuracy in identifying the emotion in autistic children when compared over simple machine learning models built for such identifications contributing to autistic society.
△ Less
Submitted 30 December, 2021;
originally announced December 2021.
-
SCLAiR : Supervised Contrastive Learning for User and Device Independent Airwriting Recognition
Authors:
Ayush Tripathi,
Arnab Kumar Mondal,
Lalan Kumar,
Prathosh A. P
Abstract:
Airwriting Recognition is the problem of identifying letters written in free space with finger movement. It is essentially a specialized case of gesture recognition, wherein the vocabulary of gestures corresponds to letters as in a particular language. With the wide adoption of smart wearables in the general population, airwriting recognition using motion sensors from a smart-band can be used as a…
▽ More
Airwriting Recognition is the problem of identifying letters written in free space with finger movement. It is essentially a specialized case of gesture recognition, wherein the vocabulary of gestures corresponds to letters as in a particular language. With the wide adoption of smart wearables in the general population, airwriting recognition using motion sensors from a smart-band can be used as a medium of user input for applications in Human-Computer Interaction. There has been limited work in the recognition of in-air trajectories using motion sensors, and the performance of the techniques in the case when the device used to record signals is changed has not been explored hitherto. Motivated by these, a new paradigm for device and user-independent airwriting recognition based on supervised contrastive learning is proposed. A two stage classification strategy is employed, the first of which involves training an encoder network with supervised contrastive loss. In the subsequent stage, a classification head is trained with the encoder weights kept frozen. The efficacy of the proposed method is demonstrated through experiments on a publicly available dataset and also with a dataset recorded in our lab using a different device. Experiments have been performed in both supervised and unsupervised settings and compared against several state-of-the-art domain adaptation techniques. Data and the code for our implementation will be made available at https://github.com/ayushayt/SCLAiR.
△ Less
Submitted 29 December, 2021; v1 submitted 25 November, 2021;
originally announced November 2021.
-
Orthogonal Delay Scale Space Modulation: A New Technique for Wideband Time-Varying Channels
Authors:
Arunkumar K. P.,
Chandra R. Murthy
Abstract:
Orthogonal Time Frequency Space (OTFS) modulation is a recently proposed scheme for time-varying narrowband channels in terrestrial radio-frequency communications. Underwater acoustic (UWA) and ultra-wideband (UWB) communication systems, on the other hand, confront wideband time-varying channels. Unlike narrowband channels, for which time contractions or dilations due to Doppler effect can be appr…
▽ More
Orthogonal Time Frequency Space (OTFS) modulation is a recently proposed scheme for time-varying narrowband channels in terrestrial radio-frequency communications. Underwater acoustic (UWA) and ultra-wideband (UWB) communication systems, on the other hand, confront wideband time-varying channels. Unlike narrowband channels, for which time contractions or dilations due to Doppler effect can be approximated by frequency-shifts, the Doppler effect in wideband channels results in frequency-dependent non-uniform shift of signal frequencies across the band. In this paper, we develop an OTFS-like modulation scheme -- Orthogonal Delay Scale Space (ODSS) modulation -- for handling wideband time-varying channels. We derive the ODSS transmission and reception schemes from first principles. In the process, we introduce the notion of $ω$-convolution in the delay-scale space that parallels the twisted convolution used in the time-frequency space. The preprocessing 2D transformation from the Fourier-Mellin domain to the delay-scale space in ODSS, which plays the role of inverse symplectic Fourier transform (ISFFT) in OTFS, improves the bit error rate performance compared to OTFS and Orthogonal Frequency Division Multiplexing (OFDM) in wideband time-varying channels. Furthermore, since the channel matrix is rendered near-diagonal, ODSS retains the advantage of OFDM in terms of its low-complexity receiver structure.
△ Less
Submitted 8 May, 2022; v1 submitted 21 November, 2021;
originally announced November 2021.
-
Unraveling the foretime of GRS 1915+105 using AstroSat observations: Wide-band spectral and temporal characteristics
Authors:
Athulya M. P.,
Radhika D.,
V. K. Agrawal,
Ravishankar B. T.,
Sachindra Naik,
Samir Mandal,
Anuj Nandi
Abstract:
We present a comprehensive study of GRS 1915+105 in wide energy band ($0.5-60$ keV) using AstroSat observations during the period of $2016-2019$. The MAXI X-ray lightcurve of the source shows rise and decay profiles similar to canonical outbursting black holes. However, the source does not follow the exemplary 'q'-diagram in the Hardness-Intensity Diagram (HID). Model independent analysis of light…
▽ More
We present a comprehensive study of GRS 1915+105 in wide energy band ($0.5-60$ keV) using AstroSat observations during the period of $2016-2019$. The MAXI X-ray lightcurve of the source shows rise and decay profiles similar to canonical outbursting black holes. However, the source does not follow the exemplary 'q'-diagram in the Hardness-Intensity Diagram (HID). Model independent analysis of lightcurves suggests that GRS 1915+105 displays various types of variability classes ($δ,χ,ρ,κ,ω$ and $γ$). We also report possible transitions from one class to another ($χ\rightarrowρ,ρ\rightarrowκ$ via an 'unknown' class and $ω\rightarrowγ\rightarrowω+γ$) within a few hours duration. Broadband energy spectra are well modeled with multi-coloured disc blackbody and Comptonised components. We explore the 'spectro-temporal' features of the source in the different variability classes, transitions between classes, and evolution during $2016-2019$. Detailed analysis indicates a gradual increase in the photon index ($Γ$) from $1.83$ to $3.8$, disc temperature ($kT_{in}$) from $1.33$ to $2.67$ keV, and Quasi-periodic Oscillation (QPO) frequency ($ν$) from $4$ to $5.64$ Hz during the rise, while the parameters decrease to $Γ$ ~$1.18$, $kT_{in}$ ~$1.18$ keV, and $ν$ ~$1.38$ Hz respectively in the decline phase. The source shows maximum bolometric luminosity (L$_{bol}$) during the peak at ~$36$% of Eddington luminosity (L$_{EDD}$), and a minimum of ~$2.4$% L$_{EDD}$ during the decay phase. Further evolution of the source towards an obscured low-luminosity (L$_{bol}$ of ~ 1% L$_{EDD}$) phase, with a decrease in the intrinsic bolometric luminosity of the source due to obscuration, has also been indicated from our analysis. The implication of our results are discussed in the context of accretion disc dynamics around the black hole.
△ Less
Submitted 27 October, 2021;
originally announced October 2021.
-
Fronthaul Compression for Uplink Massive MIMO using Matrix Decomposition
Authors:
Aswathylakshmi P,
Radha Krishna Ganti
Abstract:
Massive MIMO opens up attractive possibilities for next generation wireless systems with its large number of antennas offering spatial diversity and multiplexing gain. However, the fronthaul link that connects a massive MIMO Remote Radio Head (RRH) and carries IQ samples to the Baseband Unit (BBU) of the base station can throttle the network capacity/speed if appropriate data compression technique…
▽ More
Massive MIMO opens up attractive possibilities for next generation wireless systems with its large number of antennas offering spatial diversity and multiplexing gain. However, the fronthaul link that connects a massive MIMO Remote Radio Head (RRH) and carries IQ samples to the Baseband Unit (BBU) of the base station can throttle the network capacity/speed if appropriate data compression techniques are not applied. In this paper, we propose an iterative technique for fronthaul load reduction in the uplink for massive MIMO systems that utilizes the convolution structure of the received signals. We use an alternating minimisation algorithm for blind deconvolution of the received data matrix that provides compression ratios of 30-50. In addition, the technique presented here can be used for blind decoding of OFDM signals in massive MIMO systems.
△ Less
Submitted 24 October, 2021;
originally announced October 2021.
-
Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages
Authors:
Anoop C S,
Prathosh A P,
A G Ramakrishnan
Abstract:
Building an automatic speech recognition (ASR) system from scratch requires a large amount of annotated speech data, which is difficult to collect in many languages. However, there are cases where the low-resource language shares a common acoustic space with a high-resource language having enough annotated data to build an ASR. In such cases, we show that the domain-independent acoustic models lea…
▽ More
Building an automatic speech recognition (ASR) system from scratch requires a large amount of annotated speech data, which is difficult to collect in many languages. However, there are cases where the low-resource language shares a common acoustic space with a high-resource language having enough annotated data to build an ASR. In such cases, we show that the domain-independent acoustic models learned from the high-resource language through unsupervised domain adaptation (UDA) schemes can enhance the performance of the ASR in the low-resource language. We use the specific example of Hindi in the source domain and Sanskrit in the target domain. We explore two architectures: i) domain adversarial training using gradient reversal layer (GRL) and ii) domain separation networks (DSN). The GRL and DSN architectures give absolute improvements of 6.71% and 7.32%, respectively, in word error rate over the baseline deep neural network model when trained on just 5.5 hours of data in the target domain. We also show that choosing a proper language (Telugu) in the source domain can bring further improvement. The results suggest that UDA schemes can be helpful in the development of ASR systems for low-resource languages, mitigating the hassle of collecting large amounts of annotated speech data.
△ Less
Submitted 16 September, 2021; v1 submitted 12 September, 2021;
originally announced September 2021.
-
Bootstrap** time correlation functions of molecular dynamics
Authors:
Desbiens N.,
Arnault P.,
Weens W.,
Perrin G.,
Dubois V
Abstract:
Molecular dynamics is often considered as a numerical experiment. The error bars on the results are therefore mandatory, but sometimes difficult to determine and computationally demanding. As a low-cost approach, we describe the application of the bootstrap (BS) method to the quantification of uncertainties pertaining to the time correlation functions. We chose the autocorrelation functions of vel…
▽ More
Molecular dynamics is often considered as a numerical experiment. The error bars on the results are therefore mandatory, but sometimes difficult to determine and computationally demanding. As a low-cost approach, we describe the application of the bootstrap (BS) method to the quantification of uncertainties pertaining to the time correlation functions. We chose the autocorrelation functions of velocity and interdiffusion current for a binary ionic mixture as a test bed, and we assessed the merit of the Darken approximation relating both of them. The intrinsic errors related to phase space sampling is investigated comparing the BS method with the reference method of replica. We also study how the BS method can assist in addressing the finite size effects.
△ Less
Submitted 20 July, 2021;
originally announced July 2021.