Search | arXiv e-print repository

Large-scale quantum reservoir learning with an analog quantum computer

Authors: Milan Kornjača, Hong-Ye Hu, Chen Zhao, Jonathan Wurtz, Phillip Weinberg, Majd Hamdan, Andrii Zhdanov, Sergio H. Cantu, Hengyun Zhou, Rodrigo Araiza Bravo, Kevin Bagnall, James I. Basham, Joseph Campo, Adam Choukri, Robert DeAngelo, Paige Frederick, David Haines, Julian Hammett, Ning Hsu, Ming-Guang Hu, Florian Huber, Paul Niklas Jepsen, Ningyuan Jia, Thomas Karolyshyn, Minho Kwon , et al. (28 additional authors not shown)

Abstract: Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lac… ▽ More Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lack potential for quantum advantage. To address this, we develop a general-purpose, gradient-free, and scalable quantum reservoir learning algorithm that harnesses the quantum dynamics of neutral-atom analog quantum computers to process data. We experimentally implement the algorithm, achieving competitive performance across various categories of machine learning tasks, including binary and multi-class classification, as well as timeseries prediction. Effective and improving learning is observed with increasing system sizes of up to 108 qubits, demonstrating the largest quantum machine learning experiment to date. We further observe comparative quantum kernel advantage in learning tasks by constructing synthetic datasets based on the geometric differences between generated quantum and classical data kernels. Our findings demonstrate the potential of utilizing classically intractable quantum correlations for effective machine learning. We expect these results to stimulate further extensions to different quantum hardware and machine learning paradigms, including early fault-tolerant hardware and generative machine learning tasks. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: 10 + 14 pages, 4 + 7 figures

arXiv:2406.11402 [pdf, other]

Evaluating Open Language Models Across Task Types, Application Domains, and Reasoning Types: An In-Depth Experimental Analysis

Authors: Neelabh Sinha, Vinija Jain, Aman Chadha

Abstract: The rapid rise of Language Models (LMs) has expanded their use in several applications. Yet, due to constraints of model size, associated cost, or proprietary restrictions, utilizing state-of-the-art (SOTA) LLMs is not always feasible. With open, smaller LMs emerging, more applications can leverage their capabilities, but selecting the right LM can be challenging. This work conducts an in-depth ex… ▽ More The rapid rise of Language Models (LMs) has expanded their use in several applications. Yet, due to constraints of model size, associated cost, or proprietary restrictions, utilizing state-of-the-art (SOTA) LLMs is not always feasible. With open, smaller LMs emerging, more applications can leverage their capabilities, but selecting the right LM can be challenging. This work conducts an in-depth experimental analysis of the semantic correctness of outputs of 10 smaller, open LMs across three aspects: task types, application domains and reasoning types, using diverse prompt styles. We demonstrate that most effective models and prompt styles vary depending on the specific requirements. Our analysis provides a comparative assessment of LMs and prompt styles using a proposed three-tier schema of aspects for their strategic selection based on use-case and other constraints. We also show that if utilized appropriately, these LMs can compete with, and sometimes outperform, SOTA LLMs like DeepSeek-v2, GPT-3.5-Turbo, and GPT-4o. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2404.12403 [pdf, other]

Multi-Objective Hardware Aware Neural Architecture Search using Hardware Cost Diversity

Authors: Nilotpal Sinha, Peyman Rostami, Abd El Rahman Shabayek, Anis Kacem, Djamila Aouada

Abstract: Hardware-aware Neural Architecture Search approaches (HW-NAS) automate the design of deep learning architectures, tailored specifically to a given target hardware platform. Yet, these techniques demand substantial computational resources, primarily due to the expensive process of assessing the performance of identified architectures. To alleviate this problem, a recent direction in the literature… ▽ More Hardware-aware Neural Architecture Search approaches (HW-NAS) automate the design of deep learning architectures, tailored specifically to a given target hardware platform. Yet, these techniques demand substantial computational resources, primarily due to the expensive process of assessing the performance of identified architectures. To alleviate this problem, a recent direction in the literature has employed representation similarity metric for efficiently evaluating architecture performance. Nonetheless, since it is inherently a single objective method, it requires multiple runs to identify the optimal architecture set satisfying the diverse hardware cost constraints, thereby increasing the search cost. Furthermore, simply converting the single objective into a multi-objective approach results in an under-explored architectural search space. In this study, we propose a Multi-Objective method to address the HW-NAS problem, called MO-HDNAS, to identify the trade-off set of architectures in a single run with low computational cost. This is achieved by optimizing three objectives: maximizing the representation similarity metric, minimizing hardware cost, and maximizing the hardware cost diversity. The third objective, i.e. hardware cost diversity, is used to facilitate a better exploration of the architecture search space. Experimental results demonstrate the effectiveness of our proposed method in efficiently addressing the HW-NAS problem across six edge devices for the image classification task. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: Accepted at the CVPR 2024 Workshop, called "Efficient Deep Learning for Computer Vision (ECV)"

arXiv:2404.08293 [pdf, other]

Overcoming Scene Context Constraints for Object Detection in wild using Defilters

Authors: Vamshi Krishna Kancharla, Neelam sinha

Abstract: This paper focuses on improving object detection performance by addressing the issue of image distortions, commonly encountered in uncontrolled acquisition environments. High-level computer vision tasks such as object detection, recognition, and segmentation are particularly sensitive to image distortion. To address this issue, we propose a novel approach employing an image defilter to rectify ima… ▽ More This paper focuses on improving object detection performance by addressing the issue of image distortions, commonly encountered in uncontrolled acquisition environments. High-level computer vision tasks such as object detection, recognition, and segmentation are particularly sensitive to image distortion. To address this issue, we propose a novel approach employing an image defilter to rectify image distortion prior to object detection. This method enhances object detection accuracy, as models perform optimally when trained on non-distorted images. Our experiments demonstrate that utilizing defiltered images significantly improves mean average precision compared to training object detection models on distorted images. Consequently, our proposed method offers considerable benefits for real-world applications plagued by image distortion. To our knowledge, the contribution lies in employing distortion-removal paradigm for object detection on images captured in natural settings. We achieved an improvement of 0.562 and 0.564 of mean Average precision on validation and test data. △ Less

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2403.11480 [pdf, other]

Towards understanding the nature of direct functional connectivity in visual brain network

Authors: Debanjali Bhattacharya, Neelam Sinha

Abstract: Recent advances in neuroimaging have enabled studies in functional connectivity (FC) of human brain, alongside investigation of the neuronal basis of cognition. One important FC study is the representation of vision in human brain. The release of publicly available dataset BOLD5000 has made it possible to study the brain dynamics during visual tasks in greater detail. In this paper, a comprehensiv… ▽ More Recent advances in neuroimaging have enabled studies in functional connectivity (FC) of human brain, alongside investigation of the neuronal basis of cognition. One important FC study is the representation of vision in human brain. The release of publicly available dataset BOLD5000 has made it possible to study the brain dynamics during visual tasks in greater detail. In this paper, a comprehensive analysis of fMRI time series (TS) has been performed to explore different types of visual brain networks (VBN). The novelty of this work lies in (1) constructing VBN with consistently significant direct connectivity using both marginal and partial correlation, which is further analyzed using graph theoretic measures, (2) classification of VBNs as formed by image complexity-specific TS, using graphical features. In image complexity-specific VBN classification, XGBoost yields average accuracy in the range of 86.5% to 91.5% for positively correlated VBN, which is 2% greater than that using negative correlation. This result not only reflects the distinguishing graphical characteristics of each image complexity-specific VBN, but also highlights the importance of studying both positively correlated and negatively correlated VBN to understand the how differently brain functions while viewing different complexities of real-world images. △ Less

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2402.03277 [pdf, other]

Event-based Product Carousel Recommendation with Query-Click Graph

Authors: Luyi Ma, Nimesh Sinha, Parth Vajge, Jason HD Cho, Sushant Kumar, Kannan Achan

Abstract: Many current recommender systems mainly focus on the product-to-product recommendations and user-to-product recommendations even during the time of events rather than modeling the typical recommendations for the target event (e.g., festivals, seasonal activities, or social activities) without addressing the multiple aspects of the shop** demands for the target event. Product recommendations for… ▽ More Many current recommender systems mainly focus on the product-to-product recommendations and user-to-product recommendations even during the time of events rather than modeling the typical recommendations for the target event (e.g., festivals, seasonal activities, or social activities) without addressing the multiple aspects of the shop** demands for the target event. Product recommendations for the multiple aspects of the target event are usually generated by human curators who manually identify the aspects and select a list of aspect-related products (i.e., product carousel) for each aspect as recommendations. However, building a recommender system with machine learning is non-trivial due to the lack of both the ground truth of event-related aspects and the aspect-related products. To fill this gap, we define the novel problem as the event-based product carousel recommendations in e-commerce and propose an effective recommender system based on the query-click bipartite graph. We apply the iterative clustering algorithm over the query-click bipartite graph and infer the event-related aspects by the clusters of queries. The aspect-related recommendations are powered by the click-through rate of products regarding each aspect. We show through experiments that this approach effectively mines product carousels for the target event. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: 7 pages, 2 figures, 2021 IEEE International Conference on Big Data (Big Data)

arXiv:2402.02811 [pdf, other]

Multi-scale fMRI time series analysis for understanding neurodegeneration in MCI

Authors: Ammu R., Debanjali Bhattacharya, Ameiy Acharya, Ninad Aithal, Neelam Sinha

Abstract: In this study, we present a technique that spans multi-scale views (global scale -- meaning brain network-level and local scale -- examining each individual ROI that constitutes the network) applied to resting-state fMRI volumes. Deep learning based classification is utilized in understanding neurodegeneration. The novelty of the proposed approach lies in utilizing two extreme scales of analysis.… ▽ More In this study, we present a technique that spans multi-scale views (global scale -- meaning brain network-level and local scale -- examining each individual ROI that constitutes the network) applied to resting-state fMRI volumes. Deep learning based classification is utilized in understanding neurodegeneration. The novelty of the proposed approach lies in utilizing two extreme scales of analysis. One branch considers the entire network within graph-analysis framework. Concurrently, the second branch scrutinizes each ROI within a network independently, focusing on evolution of dynamics. For each subject, graph-based approach employs partial correlation to profile the subject in a single graph where each ROI is a node, providing insights into differences in levels of participation. In contrast, non-linear analysis employs recurrence plots to profile a subject as a multichannel 2D image, revealing distinctions in underlying dynamics. The proposed approach is employed for classification of a cohort of 50 healthy control (HC) and 50 Mild Cognitive Impairment (MCI), sourced from ADNI dataset. Results point to: (1) reduced activity in ROIs such as PCC in MCI (2) greater activity in occipital in MCI, which is not seen in HC (3) when analysed for dynamics, all ROIs in MCI show greater predictability in time-series. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: 12 pages, 3 figures and 4 tables

arXiv:2401.18083 [pdf, other]

Improved Scene Landmark Detection for Camera Localization

Authors: Tien Do, Sudipta N. Sinha

Abstract: Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-spe… ▽ More Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-specific 3D points or landmarks and computing camera pose from the associated 2D-3D correspondences. Although SLD outperformed existing learning-based approaches, it was notably less accurate than 3D structure-based methods. In this paper, we show that the accuracy gap was due to insufficient model capacity and noisy labels during training. To mitigate the capacity issue, we propose to split the landmarks into subgroups and train a separate network for each subgroup. To generate better training labels, we propose using dense reconstructions to estimate visibility of scene landmarks. Finally, we present a compact architecture to improve memory efficiency. Accuracy wise, our approach is on par with state of the art structure based methods on the INDOOR-6 dataset but runs significantly faster and uses less storage. Code and models can be found at https://github.com/microsoft/SceneLandmarkLocalization. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Comments: To be presented at 3DV 2024

arXiv:2312.01768 [pdf, other]

Localizing and Assessing Node Significance in Default Mode Network using Sub-Community Detection in Mild Cognitive Impairment

Authors: Ameiy Acharya, Chakka Sai Pradeep, Neelam Sinha

Abstract: Our study aims to utilize fMRI to identify the affected brain regions within the Default Mode Network (DMN) in subjects with Mild Cognitive Impairment (MCI), using a novel Node Significance Score (NSS). We construct subject-specific DMN graphs by employing partial correlation of Regions of Interest (ROIs) that make-up the DMN. For the DMN graph, ROIs are the nodes and edges are determined based on… ▽ More Our study aims to utilize fMRI to identify the affected brain regions within the Default Mode Network (DMN) in subjects with Mild Cognitive Impairment (MCI), using a novel Node Significance Score (NSS). We construct subject-specific DMN graphs by employing partial correlation of Regions of Interest (ROIs) that make-up the DMN. For the DMN graph, ROIs are the nodes and edges are determined based on partial correlation. Four popular community detection algorithms (Clique Percolation Method (CPM), Louvain algorithm, Greedy Modularity and Leading Eigenvectors) are applied to determine the largest sub-community. NSS ratings are derived for each node, considering (I) frequency in the largest sub-community within a class across all subjects and (II) occurrence in the largest sub-community according to all four methods. After computing the NSS of each ROI in both healthy and MCI subjects, we quantify the score disparity to identify nodes most impacted by MCI. The results reveal a disparity exceeding 20% for 10 DMN nodes, maximally for PCC and Fusiform, showing 45.69% and 43.08% disparity. This aligns with existing medical literature, additionally providing a quantitative measure that enables the ordering of the affected ROIs. These findings offer valuable insights and could lead to treatment strategies aggressively targeting the affected nodes. △ Less

Submitted 4 December, 2023; originally announced December 2023.

Comments: 4 pages, 2 figures

arXiv:2311.18265 [pdf, other]

MCI Detection using fMRI time series embeddings of Recurrence plots

Authors: Ninad Aithal, Chakka Sai Pradeep, Neelam Sinha

Abstract: The human brain can be conceptualized as a dynamical system. Utilizing resting state fMRI time series imaging, we can study the underlying dynamics at ear-marked Regions of Interest (ROIs) to understand structure or lack thereof. This differential behavior could be key to understanding the neurodegeneration and also to classify between healthy and Mild Cognitive Impairment (MCI) subjects. In this… ▽ More The human brain can be conceptualized as a dynamical system. Utilizing resting state fMRI time series imaging, we can study the underlying dynamics at ear-marked Regions of Interest (ROIs) to understand structure or lack thereof. This differential behavior could be key to understanding the neurodegeneration and also to classify between healthy and Mild Cognitive Impairment (MCI) subjects. In this study, we consider 6 brain networks spanning over 160 ROIs derived from Dosenbach template, where each network consists of 25-30 ROIs. Recurrence plot, extensively used to understand evolution of time series, is employed. Representative time series at each ROI is converted to its corresponding recurrence plot visualization, which is subsequently condensed to low-dimensional feature embeddings through Autoencoders. The performance of the proposed method is shown on fMRI volumes of 100 subjects (balanced data), taken from publicly available ADNI dataset. Results obtained show peak classification accuracy of 93% among the 6 brain networks, mean accuracy of 89.3% thereby illustrating promise in the proposed approach. △ Less

Submitted 30 November, 2023; originally announced November 2023.

Comments: 4 pages, 5 figures

arXiv:2311.08417 [pdf, other]

Image complexity based fMRI-BOLD visual network categorization across visual datasets using topological descriptors and deep-hybrid learning

Authors: Debanjali Bhattacharya, Neelam Sinha, Yashwanth R., Amit Chattopadhyay

Abstract: This study proposes a new approach that investigates differences in topological characteristics of visual networks, which are constructed using fMRI BOLD time-series corresponding to visual datasets of COCO, ImageNet, and SUN. A publicly available BOLD5000 dataset is utilized that contains fMRI scans while viewing 5254 images of diverse complexities. The objective of this study is to examine how n… ▽ More This study proposes a new approach that investigates differences in topological characteristics of visual networks, which are constructed using fMRI BOLD time-series corresponding to visual datasets of COCO, ImageNet, and SUN. A publicly available BOLD5000 dataset is utilized that contains fMRI scans while viewing 5254 images of diverse complexities. The objective of this study is to examine how network topology differs in response to distinct visual stimuli from these visual datasets. To achieve this, 0- and 1-dimensional persistence diagrams are computed for each visual network representing COCO, ImageNet, and SUN. For extracting suitable features from topological persistence diagrams, K-means clustering is executed. The extracted K-means cluster features are fed to a novel deep-hybrid model that yields accuracy in the range of 90%-95% in classifying these visual networks. To understand vision, this type of visual network categorization across visual datasets is important as it captures differences in BOLD signals while perceiving images with different contexts and complexities. Furthermore, distinctive topological patterns of visual network associated with each dataset, as revealed from this study, could potentially lead to the development of future neuroimaging biomarkers for diagnosing visual processing disorders like visual agnosia or prosopagnosia, and tracking changes in visual cognition over time. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2311.03923 [pdf, other]

Hardware Aware Evolutionary Neural Architecture Search using Representation Similarity Metric

Authors: Nilotpal Sinha, Abd El Rahman Shabayek, Anis Kacem, Peyman Rostami, Carl Shneider, Djamila Aouada

Abstract: Hardware-aware Neural Architecture Search (HW-NAS) is a technique used to automatically design the architecture of a neural network for a specific task and target hardware. However, evaluating the performance of candidate architectures is a key challenge in HW-NAS, as it requires significant computational resources. To address this challenge, we propose an efficient hardware-aware evolution-based… ▽ More Hardware-aware Neural Architecture Search (HW-NAS) is a technique used to automatically design the architecture of a neural network for a specific task and target hardware. However, evaluating the performance of candidate architectures is a key challenge in HW-NAS, as it requires significant computational resources. To address this challenge, we propose an efficient hardware-aware evolution-based NAS approach called HW-EvRSNAS. Our approach re-frames the neural architecture search problem as finding an architecture with performance similar to that of a reference model for a target hardware, while adhering to a cost constraint for that hardware. This is achieved through a representation similarity metric known as Representation Mutual Information (RMI) employed as a proxy performance evaluator. It measures the mutual information between the hidden layer representations of a reference model and those of sampled architectures using a single training batch. We also use a penalty term that penalizes the search process in proportion to how far an architecture's hardware cost is from the desired hardware cost threshold. This resulted in a significantly reduced search time compared to the literature that reached up to 8000x speedups resulting in lower CO2 emissions. The proposed approach is evaluated on two different search spaces while using lower computational resources. Furthermore, our approach is thoroughly examined on six different edge devices under various hardware cost constraints. △ Less

Submitted 7 November, 2023; originally announced November 2023.

Comments: WACV 2024

arXiv:2311.00784 [pdf, other]

Rational design of acid stable oxide catalysts for OER with OC22

Authors: Richard Tran, Liqiang Huang, Yuan Zi, Shengguang Wang, Benjamin M. Comer, Xuqing Wu, Stefan J. Raaijman, Nishant K. Sinha, Shibin Thundiyil, Ganesh Iyer, Lars Grabow, Ligang Lu, Jiefu Chen

Abstract: The efficiency of $H_2$ production via water electrolysis is typically limited to the sluggish oxygen evolution reaction (OER). As such, significant emphasis has been placed upon improving the rate of OER through the anode catalyst. More recently, the Open Catalyst 2022 (OC22) has provided a large dataset of density functional theory (DFT) calculations for OER intermediates on the surfaces of oxid… ▽ More The efficiency of $H_2$ production via water electrolysis is typically limited to the sluggish oxygen evolution reaction (OER). As such, significant emphasis has been placed upon improving the rate of OER through the anode catalyst. More recently, the Open Catalyst 2022 (OC22) has provided a large dataset of density functional theory (DFT) calculations for OER intermediates on the surfaces of oxides. When coupled with state-of-the-art graph neural network models, total energy predictions can be achieved with a mean absolute error as low as of 0.22 eV. In this work, we interpolated a database of the total energy predictions for all slabs and OER surface intermediates for 4,119 oxide materialas in the original OC22 dataset using pre-trained models from the OC22 framework. This database includes all terminations of all facets up to a maximum Miller index of 1 with adsorption configurations for $O^*$ and $OH^*$. To demonstrate the full utility of this database, we constructed a flexible screening framework to identify viable candidate anode catalysts under a bulk and nanoscale regime for OER by assessing the price, thermodynamic stability, and resistance to corrosion, surface stability, and overpotential. Finally we verified the overpotentials and reaction energies of the final candidate catalysts using DFT. From our assessment, we were able to identify 48 and 69 viable candidates for OER under the bulk and nanoscale regime respectively. △ Less

Submitted 29 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

arXiv:2309.15495 [pdf, other]

Investigating the changes in BOLD responses during viewing of images with varied complexity: An fMRI time-series based analysis on human vision

Authors: Naveen Kanigiri, Manohar Suggula, Debanjali Bhattacharya, Neelam Sinha

Abstract: Functional MRI (fMRI) is widely used to examine brain functionality by detecting alteration in oxygenated blood flow that arises with brain activity. This work aims to investigate the neurological variation of human brain responses during viewing of images with varied complexity using fMRI time series (TS) analysis. Publicly available BOLD5000 dataset is used for this purpose which contains fMRI s… ▽ More Functional MRI (fMRI) is widely used to examine brain functionality by detecting alteration in oxygenated blood flow that arises with brain activity. This work aims to investigate the neurological variation of human brain responses during viewing of images with varied complexity using fMRI time series (TS) analysis. Publicly available BOLD5000 dataset is used for this purpose which contains fMRI scans while viewing 5254 distinct images of diverse categories, drawn from three standard computer vision datasets: COCO, Imagenet and SUN. To understand vision, it is important to study how brain functions while looking at images of diverse complexities. Our first study employs classical machine learning and deep learning strategies to classify image complexity-specific fMRI TS, represents instances when images from COCO, Imagenet and SUN datasets are seen. The implementation of this classification across visual datasets holds great significance, as it provides valuable insights into the fluctuations in BOLD signals when perceiving images of varying complexities. Subsequently, temporal semantic segmentation is also performed on whole fMRI TS to segment these time instances. The obtained result of this analysis has established a baseline in studying how differently human brain functions while looking into images of diverse complexities. Therefore, accurate identification and distinguishing of variations in BOLD signals from fMRI TS data serves as a critical initial step in vision studies, providing insightful explanations for how static images with diverse complexities are perceived. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Comments: The paper is accepted for publication in 3rd International Conference on AI-ML Systems (AIMLSystems 2023), to be held on 25-28 October 2023, Bengaluru, India. arXiv admin note: text overlap with arXiv:2309.03590

arXiv:2309.03590 [pdf, other]

Spatial encoding of BOLD fMRI time series for categorizing static images across visual datasets: A pilot study on human vision

Authors: Vamshi K. Kancharala, Debanjali Bhattacharya, Neelam Sinha

Abstract: Functional MRI (fMRI) is widely used to examine brain functionality by detecting alteration in oxygenated blood flow that arises with brain activity. In this study, complexity specific image categorization across different visual datasets is performed using fMRI time series (TS) to understand differences in neuronal activities related to vision. Publicly available BOLD5000 dataset is used for this… ▽ More Functional MRI (fMRI) is widely used to examine brain functionality by detecting alteration in oxygenated blood flow that arises with brain activity. In this study, complexity specific image categorization across different visual datasets is performed using fMRI time series (TS) to understand differences in neuronal activities related to vision. Publicly available BOLD5000 dataset is used for this purpose, containing fMRI scans while viewing 5254 images of diverse categories, drawn from three standard computer vision datasets: COCO, ImageNet and SUN. To understand vision, it is important to study how brain functions while looking at different images. To achieve this, spatial encoding of fMRI BOLD TS has been performed that uses classical Gramian Angular Field (GAF) and Markov Transition Field (MTF) to obtain 2D BOLD TS, representing images of COCO, Imagenet and SUN. For classification, individual GAF and MTF features are fed into regular CNN. Subsequently, parallel CNN model is employed that uses combined 2D features for classifying images across COCO, Imagenet and SUN. The result of 2D CNN models is also compared with 1D LSTM and Bi-LSTM that utilizes raw fMRI BOLD signal for classification. It is seen that parallel CNN model outperforms other network models with an improvement of 7% for multi-class classification. Clinical relevance- The obtained result of this analysis establishes a baseline in studying how differently human brain functions while looking at images of diverse complexities. △ Less

Submitted 7 September, 2023; originally announced September 2023.

Comments: This paper is accepted for publication in IEEE Region 10 Technical conference, TENCON 2023, to be held in Chiang Mai, Thailand from 31 October - 3 November, 2023

arXiv:2307.15170 [pdf]

Quantifying interictal intracranial EEG to predict focal epilepsy

Authors: Ryan S Gallagher, Nishant Sinha, Akash R Pattnaik, William K. S. Ojemann, Alfredo Lucas, Joshua J. LaRocque, John M Bernabei, Adam S Greenblatt, Elizabeth M Sweeney, H Isaac Chen, Kathryn A Davis, Erin C Conrad, Brian Litt

Abstract: Intracranial EEG (IEEG) is used for 2 main purposes, to determine: (1) if epileptic networks are amenable to focal treatment and (2) where to intervene. Currently these questions are answered qualitatively and sometimes differently across centers. There is a need for objective, standardized methods to guide surgical decision making and to enable large scale data analysis across centers and prospec… ▽ More Intracranial EEG (IEEG) is used for 2 main purposes, to determine: (1) if epileptic networks are amenable to focal treatment and (2) where to intervene. Currently these questions are answered qualitatively and sometimes differently across centers. There is a need for objective, standardized methods to guide surgical decision making and to enable large scale data analysis across centers and prospective clinical trials. We analyzed interictal data from 101 patients with drug resistant epilepsy who underwent presurgical evaluation with IEEG. We chose interictal data because of its potential to reduce the morbidity and cost associated with ictal recording. 65 patients had unifocal seizure onset on IEEG, and 36 were non-focal or multi-focal. We quantified the spatial dispersion of implanted electrodes and interictal IEEG abnormalities for each patient. We compared these measures against the 5 Sense Score (5SS), a pre-implant estimate of the likelihood of focal seizure onset, and assessed their ability to predict the clinicians choice of therapeutic intervention and the patient outcome. The spatial dispersion of IEEG electrodes predicted network focality with precision similar to the 5SS (AUC = 0.67), indicating that electrode placement accurately reflected pre-implant information. A cross-validated model combining the 5SS and the spatial dispersion of interictal IEEG abnormalities significantly improved this prediction (AUC = 0.79; p<0.05). The combined model predicted ultimate treatment strategy (surgery vs. device) with an AUC of 0.81 and post-surgical outcome at 2 years with an AUC of 0.70. The 5SS, interictal IEEG, and electrode placement were not correlated and provided complementary information. Quantitative, interictal IEEG significantly improved upon pre-implant estimates of network focality and predicted treatment with precision approaching that of clinical experts. △ Less

Submitted 27 July, 2023; originally announced July 2023.

Comments: 25 pages, 4 Figures, 1 table

arXiv:2307.09994 [pdf, other]

Impact of Disentanglement on Pruning Neural Networks

Authors: Carl Shneider, Peyman Rostami, Anis Kacem, Nilotpal Sinha, Abd El Rahman Shabayek, Djamila Aouada

Abstract: Deploying deep learning neural networks on edge devices, to accomplish task specific objectives in the real-world, requires a reduction in their memory footprint, power consumption, and latency. This can be realized via efficient model compression. Disentangled latent representations produced by variational autoencoder (VAE) networks are a promising approach for achieving model compression because… ▽ More Deploying deep learning neural networks on edge devices, to accomplish task specific objectives in the real-world, requires a reduction in their memory footprint, power consumption, and latency. This can be realized via efficient model compression. Disentangled latent representations produced by variational autoencoder (VAE) networks are a promising approach for achieving model compression because they mainly retain task-specific information, discarding useless information for the task at hand. We make use of the Beta-VAE framework combined with a standard criterion for pruning to investigate the impact of forcing the network to learn disentangled representations on the pruning process for the task of classification. In particular, we perform experiments on MNIST and CIFAR10 datasets, examine disentanglement challenges, and propose a path forward for future works. △ Less

Submitted 19 July, 2023; originally announced July 2023.

Comments: Presented in ISCS23

Report number: ISCS23-19

arXiv:2307.07703 [pdf, other]

Identification of Stochasticity by Matrix-decomposition: Applied on Black Hole Data

Authors: Sai Pradeep Chakka, Sunil Kumar Vengalil, Neelam Sinha

Abstract: Timeseries classification as stochastic (noise-like) or non-stochastic (structured), helps understand the underlying dynamics, in several domains. Here we propose a two-legged matrix decomposition-based algorithm utilizing two complementary techniques for classification. In Singular Value Decomposition (SVD) based analysis leg, we perform topological analysis (Betti numbers) on singular vectors co… ▽ More Timeseries classification as stochastic (noise-like) or non-stochastic (structured), helps understand the underlying dynamics, in several domains. Here we propose a two-legged matrix decomposition-based algorithm utilizing two complementary techniques for classification. In Singular Value Decomposition (SVD) based analysis leg, we perform topological analysis (Betti numbers) on singular vectors containing temporal information, leading to SVD-label. Parallely, temporal-ordering agnostic Principal Component Analysis (PCA) is performed, and the proposed PCA-derived features are computed. These features, extracted from synthetic timeseries of the two labels, are observed to map the timeseries to a linearly separable feature space. Support Vector Machine (SVM) is used to produce PCA-label. The proposed methods have been applied to synthetic data, comprising 41 realisations of white-noise, pink-noise (stochastic), Logistic-map at growth-rate 4 and Lorentz-system (non-stochastic), as proof-of-concept. Proposed algorithm is applied on astronomical data: 12 temporal-classes of timeseries of black hole GRS 1915+105, obtained from RXTE satellite with average length 25000. For a given timeseries, if SVD-label and PCA-label concur, then the label is retained; else deemed "Uncertain". Comparison of obtained results with those in literature are presented. It's found that out of 12 temporal classes of GRS 1915+105, concurrence between SVD-label and PCA-label is obtained on 11 of them. △ Less

Submitted 15 July, 2023; originally announced July 2023.

Comments: 10 pages, 7 figures

arXiv:2306.12331 [pdf, other]

Decentralized Aerial Transportation and Manipulation of a Cable-Slung Payload With Swarm of Agents

Authors: Aniket Sharma, Nandan K Sinha

Abstract: With the advent of Unmanned Aerial Vehicles (UAV) and Micro Aerial Vehicles (MAV) in commercial sectors, their application for transporting and manipulating payloads has attracted many research work. A swarm of agents, cooperatively working to transport and manipulate a payload can overcome the physical limitations of a single agent, adding redundancy and tolerance against failures. In this paper,… ▽ More With the advent of Unmanned Aerial Vehicles (UAV) and Micro Aerial Vehicles (MAV) in commercial sectors, their application for transporting and manipulating payloads has attracted many research work. A swarm of agents, cooperatively working to transport and manipulate a payload can overcome the physical limitations of a single agent, adding redundancy and tolerance against failures. In this paper, the dynamics of a swarm connected to a payload via flexible cables are modeled, and a decentralized control is designed using Artificial Potential Field (APF). The swarm is able to transport the payload through an unknown environment to a goal position while avoiding obstacles from the local information received from the onboard sensors. The key contributions are (a) the cables are modelled more accurately using lumped mass model instead of geometric constraints, (b) a decentralized swarm control is designed using potential field approach to ensure hover stability of system without payload state information, (c) the manipulation of payload elevation and azimuth angles are controlled by APF, and (d) the trajectory of the payload for transportation is governed by potential fields generated by goal point and obstacles. The efficacy of the method proposed in this work are evaluated through numerical simulations under the influence of external disturbances and failure of agents. △ Less

Submitted 21 June, 2023; originally announced June 2023.

arXiv:2306.00473 [pdf, other]

Interpretable simultaneous localization of MRI corpus callosum and classification of atypical Parkinsonian disorders using YOLOv5

Authors: Vamshi Krishna Kancharla, Debanjali Bhattacharya, Neelam Sinha, Jitender Saini, Pramod Kumar Pal, Sandhya M

Abstract: Structural MRI(S-MRI) is one of the most versatile imaging modality that revolutionized the anatomical study of brain in past decades. The corpus callosum (CC) is the principal white matter fibre tract, enabling all kinds of inter-hemispheric communication. Thus, subtle changes in CC might be associated with various neurological disorders. The present work proposes the potential of YOLOv5-based CC… ▽ More Structural MRI(S-MRI) is one of the most versatile imaging modality that revolutionized the anatomical study of brain in past decades. The corpus callosum (CC) is the principal white matter fibre tract, enabling all kinds of inter-hemispheric communication. Thus, subtle changes in CC might be associated with various neurological disorders. The present work proposes the potential of YOLOv5-based CC detection framework to differentiate atypical Parkinsonian disorders (PD) from healthy controls (HC). With 3 rounds of hold-out validation, mean classification accuracy of 92% is obtained using the proposed method on a proprietary dataset consisting of 20 healthy subjects and 20 cases of APDs, with an improvement of 5% over SOTA methods (CC morphometry and visual texture analysis) that used the same dataset. Subsequently, in order to incorporate the explainability of YOLO predictions, Eigen CAM based heatmap is generated for identifying the most important sub-region in CC that leads to the classification. The result of Eigen CAM showed CC mid-body as the most distinguishable sub-region in classifying APDs and HC, which is in-line with SOTA methodologies and the current prevalent understanding in medicine. △ Less

Submitted 1 June, 2023; originally announced June 2023.

arXiv:2304.11560 [pdf, other]

Identifying Stochasticity in Time-Series with Autoencoder-Based Content-aware 2D Representation: Application to Black Hole Data

Authors: Chakka Sai Pradeep, Neelam Sinha

Abstract: In this work, we report an autoencoder-based 2D representation to classify a time-series as stochastic or non-stochastic, to understand the underlying physical process. Content-aware conversion of 1D time-series to 2D representation, that simultaneously utilizes time- and frequency-domain characteristics, is proposed. An autoencoder is trained with a loss function to learn latent space (using both… ▽ More In this work, we report an autoencoder-based 2D representation to classify a time-series as stochastic or non-stochastic, to understand the underlying physical process. Content-aware conversion of 1D time-series to 2D representation, that simultaneously utilizes time- and frequency-domain characteristics, is proposed. An autoencoder is trained with a loss function to learn latent space (using both time- and frequency domains) representation, that is designed to be, time-invariant. Every element of the time-series is represented as a tuple with two components, one each, from latent space representation in time- and frequency-domains, forming a binary image. In this binary image, those tuples that represent the points in the time-series, together form the ``Latent Space Signature" (LSS) of the input time-series. The obtained binary LSS images are fed to a classification network. The EfficientNetv2-S classifier is trained using 421 synthetic time-series, with fair representation from both categories. The proposed methodology is evaluated on publicly available astronomical data which are 12 distinct temporal classes of time-series pertaining to the black hole GRS 1915 + 105, obtained from RXTE satellite. Results obtained using the proposed methodology are compared with existing techniques. Concurrence in labels obtained across the classes, illustrates the efficacy of the proposed 2D representation using the latent space co-ordinates. The proposed methodology also outputs the confidence in the classification label. △ Less

Submitted 23 April, 2023; originally announced April 2023.

arXiv:2212.08022 [pdf, other]

iCardo: A Machine Learning Based Smart Healthcare Framework for Cardiovascular Disease Prediction

Authors: Nidhi Sinha, Teena Jangid, Amit M. Joshi, Saraju P. Mohanty

Abstract: The point of care services and medication have become simpler with efficient consumer electronics devices in a smart healthcare system. Cardiovascular disease is a critical illness which causes heart failure, and early and prompt identification can lessen damage and prevent premature mortality. Machine learning has been used to predict cardiovascular disease (CVD) in the literature. The article ex… ▽ More The point of care services and medication have become simpler with efficient consumer electronics devices in a smart healthcare system. Cardiovascular disease is a critical illness which causes heart failure, and early and prompt identification can lessen damage and prevent premature mortality. Machine learning has been used to predict cardiovascular disease (CVD) in the literature. The article explains choosing the best classifier model for the selected feature sets and the distinct feature sets selected using four feature selection models. The paper compares seven classifiers using each of the sixteen feature sets. Originally, the data had 56 attributes and 303 occurrences, of which 87 were in good health, and the remainder had cardiovascular disease (CVD). Demographic data with several features make up the four groups of overall features. Lasso, Tree-based algorithms, Chi-Square and RFE have all been used to choose the four distinct feature sets, each containing five, ten, fifteen, and twenty features, respectively. Seven distinct classifiers have been trained and evaluated for each of the sixteen feature sets. To determine the most effective blend of feature set and model, a total of 112 models have been trained, tested, and their performance has been compared. SVM classifier with fifteen chosen features is shown to be the best in terms of overall accuracy. The healthcare data has been maintained in the cloud and would be accessible to patients, caretakers, and healthcare providers through integration with the Internet of Medical Things (IoMT) enabled smart healthcare. Subsequently, the feature selection model chooses the most appropriate feature for CVD prediction to calibrate the system, and the proposed framework can be utilised to anticipate CVD. △ Less

Submitted 7 December, 2022; originally announced December 2022.

Comments: 19 Pages, 9 Figures, 5 Tables

arXiv:2211.01513 [pdf, other]

Optimizing Fiducial Marker Placement for Improved Visual Localization

Authors: Qiangqiang Huang, Joseph DeGol, Victor Fragoso, Sudipta N. Sinha, John J. Leonard

Abstract: Adding fiducial markers to a scene is a well-known strategy for making visual localization algorithms more robust. Traditionally, these marker locations are selected by humans who are familiar with visual localization techniques. This paper explores the problem of automatic marker placement within a scene. Specifically, given a predetermined set of markers and a scene model, we compute optimized m… ▽ More Adding fiducial markers to a scene is a well-known strategy for making visual localization algorithms more robust. Traditionally, these marker locations are selected by humans who are familiar with visual localization techniques. This paper explores the problem of automatic marker placement within a scene. Specifically, given a predetermined set of markers and a scene model, we compute optimized marker positions within the scene that can improve accuracy in visual localization. Our main contribution is a novel framework for modeling camera localizability that incorporates both natural scene features and artificial fiducial markers added to the scene. We present optimized marker placement (OMP), a greedy algorithm that is based on the camera localizability framework. We have also designed a simulation framework for testing marker placement algorithms on 3D models and images generated from synthetic scenes. We have evaluated OMP within this testbed and demonstrate an improvement in the localization rate by up to 20 percent on four different scenes. △ Less

Submitted 16 March, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

Comments: Extended technical report for publication in IEEE Robotics and Automation Letters (RA-L)

arXiv:2206.01080 [pdf]

Longitudinal abnormalities in white matter extracellular free water volume fraction and neuropsychological functioning in patients with traumatic brain injury

Authors: James J Gugger, Alexa E Walter, Drew Parker, Nishant Sinha, Justin Morrison, Jeffrey Ware, Andrea LC Schneider, Dmitriy Petrov, Danielle K Sandsmark, Ragini Verma, Ramon Diaz-Arrastia

Abstract: Traumatic brain injury is a global public health problem associated with chronic neurological complications and long-term disability. Biomarkers that map onto the underlying brain pathology driving these complications are urgently needed to identify individuals at risk for poor recovery and to inform design of clinical trials of neuroprotective therapies. Neuroinflammation and neurodegeneration ar… ▽ More Traumatic brain injury is a global public health problem associated with chronic neurological complications and long-term disability. Biomarkers that map onto the underlying brain pathology driving these complications are urgently needed to identify individuals at risk for poor recovery and to inform design of clinical trials of neuroprotective therapies. Neuroinflammation and neurodegeneration are two endophenotypes associated with increases in brain extracellular water content after trauma. The objective of this study was to describe the relationship between a neuroimaging biomarker of extracellular free water content and the clinical features of patients with traumatic brain injury. We analyzed a cohort of 64 adult patients requiring hospitalization for non-penetrating traumatic brain injury of all severities as well as 32 healthy controls. Patients underwent brain MRI and clinical neuropsychological assessment in the subacute (2-weeks) and chronic (6-months) post-injury period, and controls underwent a single MRI. For each subject, we derived a summary score representing deviations in whole brain white matter (1) extracellular free water volume fraction (VF) and (2) free water-corrected fractional anisotropy (fw-FA). The summary specific anomaly score (SAS) for VF was significantly higher in TBI patients in the subacute and chronic post-injury period relative to controls. SAS for VF significantly correlated with neuropsychological functioning in the subacute, but not chronic post-injury period. These findings indicate abnormalities in whole brain white matter extracellular water fraction in patients with TBI and are an important step toward identifying and validating noninvasive biomarkers that map onto the pathology driving disability after TBI. △ Less

Submitted 2 June, 2022; originally announced June 2022.

arXiv:2205.14663 [pdf]

Change in structural brain network abnormalities after traumatic brain injury determines post-injury recovery

Authors: James J Gugger, Nishant Sinha, Yiming Huang, Alexa Walter, Cillian Lynch, Justin Morrison, Nathan Smyk, Danielle Sandsmark, Ramon Diaz-Arrastia, Kathryn A Davis

Abstract: The trajectory of an individual's recovery after traumatic brain injury (TBI) is heterogeneous, with complete recovery in some cases but persistent disability in others. We hypothesized that changes in structural brain network abnormalities guide the trajectory of an individual's recovery post-injury. Our objective was to characterize the variability in recovery post-TBI by identifying a putative… ▽ More The trajectory of an individual's recovery after traumatic brain injury (TBI) is heterogeneous, with complete recovery in some cases but persistent disability in others. We hypothesized that changes in structural brain network abnormalities guide the trajectory of an individual's recovery post-injury. Our objective was to characterize the variability in recovery post-TBI by identifying a putative neuroimaging biomarker of traumatic axonal injury (TAI) in individuals with mild TBI. We analyzed 70 T1-weighted and diffusion MRIs longitudinally collected from 35 individuals during the subacute and chronic post-injury periods. Each individual underwent longitudinal blood work to characterize blood protein biomarkers of axonal and glial injury and assessment of post-injury recovery in the subacute and chronic periods. By comparing the MRI data of individual cases with 35 controls, we estimated the longitudinal change in structural brain network abnormalities. We validated this proxy measure of TAI with independent measures of acute intracranial injury estimated from head CT and blood protein biomarkers. Post-injury structural network abnormality was significantly higher than controls in both subacute and chronic periods, associated with an acute CT lesion and subacute blood levels of glial fibrillary acid protein (r=0.5, p=0.008) and neurofilament light (r=0.41, p=0.02). Longitudinal change in abnormality associated with change in functional outcome status (r=-0.51, p=0.003) and post-concussive symptoms (BSI: r=0.46, p=0.03; RPQ:r = 0.46, p=0.02). Brain regions that most closely mapped onto symptom change over time corresponded to structural network hubs or areas susceptible to neurotrauma. Structural network abnormalities might be a biomarker of TAI. Assessing changes in brain network abnormality might enable better patient stratification for monitoring recovery after neurotrauma. △ Less

Submitted 29 May, 2022; originally announced May 2022.

Comments: 34 pages, 8 figures, 2 tables

arXiv:2204.08086 [pdf]

Intracranial EEG structure-function coupling predicts surgical outcomes in focal epilepsy

Authors: Nishant Sinha, John S. Duncan, Beate Diehl, Fahmida A. Chowdhury, Jane de Tisi, Anna Miserocchi, Andrew W. McEvoy, Kathryn A. Davis, Sjoerd B. Vos, Gavin P. Winston, Yujiang Wang, Peter N. Taylor

Abstract: Alterations to structural and functional brain networks have been reported across many neurological conditions. However, the relationship between structure and function -- their coupling -- is relatively unexplored, particularly in the context of an intervention. Epilepsy surgery alters the brain structure and networks to control the functional abnormality of seizures. Given that surgery is a stru… ▽ More Alterations to structural and functional brain networks have been reported across many neurological conditions. However, the relationship between structure and function -- their coupling -- is relatively unexplored, particularly in the context of an intervention. Epilepsy surgery alters the brain structure and networks to control the functional abnormality of seizures. Given that surgery is a structural modification aiming to alter the function, we hypothesized that stronger structure-function coupling preoperatively is associated with a greater chance of post-operative seizure control. We constructed structural and functional brain networks in 39 subjects with medication-resistant focal epilepsy using data from intracranial EEG (pre-surgery), structural MRI (pre-and post-surgery), and diffusion MRI (pre-surgery). We investigated pre-operative structure-function coupling at two spatial scales a) at the global iEEG network level and b) at the resolution of individual iEEG electrode contacts using virtual surgeries. At global network level, seizure-free individuals had stronger structure-function coupling pre-operatively than those that were not seizure-free regardless of the choice of interictal segment or frequency band. At the resolution of individual iEEG contacts, the virtual surgery approach provided complementary information to localize epileptogenic tissues. In predicting seizure outcomes, structure-function coupling measures were more important than clinical attributes, and together they predicted seizure outcomes with an accuracy of 85% and sensitivity of 87%. The underlying assumption that the structural changes induced by surgery translate to the functional level to control seizures is valid when the structure-functional coupling is strong. Map** the regions that contribute to structure-functional coupling using virtual surgeries may help aid surgical planning. △ Less

Submitted 17 April, 2022; originally announced April 2022.

arXiv:2204.00188 [pdf, other]

Novelty Driven Evolutionary Neural Architecture Search

Authors: Nilotpal Sinha, Kuan-Wen Chen

Abstract: Evolutionary algorithms (EA) based neural architecture search (NAS) involves evaluating each architecture by training it from scratch, which is extremely time-consuming. This can be reduced by using a supernet for estimating the fitness of an architecture due to weight sharing among all architectures in the search space. However, the estimated fitness is very noisy due to the co-adaptation of the… ▽ More Evolutionary algorithms (EA) based neural architecture search (NAS) involves evaluating each architecture by training it from scratch, which is extremely time-consuming. This can be reduced by using a supernet for estimating the fitness of an architecture due to weight sharing among all architectures in the search space. However, the estimated fitness is very noisy due to the co-adaptation of the operations in the supernet which results in NAS methods getting trapped in local optimum. In this paper, we propose a method called NEvoNAS wherein the NAS problem is posed as a multi-objective problem with 2 objectives: (i) maximize architecture novelty, (ii) maximize architecture fitness/accuracy. The novelty search is used for maintaining a diverse set of solutions at each generation which helps avoiding local optimum traps while the architecture fitness is calculated using supernet. NSGA-II is used for finding the \textit{pareto optimal front} for the NAS problem and the best architecture in the pareto front is returned as the searched architecture. Exerimentally, NEvoNAS gives better results on 2 different search spaces while using significantly less computational resources as compared to previous EA-based methods. The code for our paper can be found in https://github.com/nightstorm0909/NEvoNAS. △ Less

Submitted 31 March, 2022; originally announced April 2022.

Comments: Accepted as poster in GECCO 2022. arXiv admin note: substantial text overlap with arXiv:2107.07266, arXiv:2203.01559

arXiv:2203.01559 [pdf, other]

Neural Architecture Search using Progressive Evolution

Authors: Nilotpal Sinha, Kuan-Wen Chen

Abstract: Vanilla neural architecture search using evolutionary algorithms (EA) involves evaluating each architecture by training it from scratch, which is extremely time-consuming. This can be reduced by using a supernet to estimate the fitness of every architecture in the search space due to its weight sharing nature. However, the estimated fitness is very noisy due to the co-adaptation of the operations… ▽ More Vanilla neural architecture search using evolutionary algorithms (EA) involves evaluating each architecture by training it from scratch, which is extremely time-consuming. This can be reduced by using a supernet to estimate the fitness of every architecture in the search space due to its weight sharing nature. However, the estimated fitness is very noisy due to the co-adaptation of the operations in the supernet. In this work, we propose a method called pEvoNAS wherein the whole neural architecture search space is progressively reduced to smaller search space regions with good architectures. This is achieved by using a trained supernet for architecture evaluation during the architecture search using genetic algorithm to find search space regions with good architectures. Upon reaching the final reduced search space, the supernet is then used to search for the best architecture in that search space using evolution. The search is also enhanced by using weight inheritance wherein the supernet for the smaller search space inherits its weights from previous trained supernet for the bigger search space. Exerimentally, pEvoNAS gives better results on CIFAR-10 and CIFAR-100 while using significantly less computational resources as compared to previous EA-based methods. The code for our paper can be found in https://github.com/nightstorm0909/pEvoNAS △ Less

Submitted 3 March, 2022; originally announced March 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2107.07266

arXiv:2202.01443 [pdf, other]

doi 10.1016/j.physletb.2022.137117

Two-component scalar and fermionic dark matter candidates in a generic U$(1)_X$ model

Authors: Arindam Das, Shivam Gola, Sanjoy Mandal, Nita Sinha

Abstract: We consider a $U(1)_X\otimes \mathbb{Z}_2\otimes \mathbb{Z}'_2$ extension of the Standard Model (SM), where the $U(1)_X$ charge of an SM field is given by a linear combination of its hypercharge and B$-$L number. Apart from the SM particle content, the model contains three right-handed neutrinos (RHNs) $N_R^i$ and two scalars $Φ$, $χ$, all singlets under the SM gauge group but charged under… ▽ More We consider a $U(1)_X\otimes \mathbb{Z}_2\otimes \mathbb{Z}'_2$ extension of the Standard Model (SM), where the $U(1)_X$ charge of an SM field is given by a linear combination of its hypercharge and B$-$L number. Apart from the SM particle content, the model contains three right-handed neutrinos (RHNs) $N_R^i$ and two scalars $Φ$, $χ$, all singlets under the SM gauge group but charged under $U(1)_X$ gauge group. Two of these additional fields, fermion $N_R^3$ is odd under $\mathbb{Z}_2$ and scalar $χ$ is odd under $\mathbb{Z}'_2$ symmetry. Thus both $χ$ and $N_R^3$ contribute to the observed dark matter relic density, leading to two-component dark matter candidates. We study in detail its dark matter properties such as relic density and direct detection taking into account the constraints coming from collider studies. We find that in our model, there can be possible annihilation of one Dark Matter (DM) into the other, which may potentially alter the relic density in a significant way. △ Less

Submitted 27 April, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

Comments: One Table changed

arXiv:2201.12055 [pdf, other]

doi 10.3390/s22062346

Automated Feature Extraction on AsMap for Emotion Classification using EEG

Authors: Md. Zaved Iqubal Ahmed, Nidul Sinha, Souvik Phadikar, Ebrahim Ghaderpour

Abstract: Emotion recognition using EEG has been widely studied to address the challenges associated with affective computing. Using manual feature extraction methods on EEG signals results in sub-optimal performance by the learning models. With the advancements in deep learning as a tool for automated feature engineering, in this work, a hybrid of manual and automatic feature extraction methods has been pr… ▽ More Emotion recognition using EEG has been widely studied to address the challenges associated with affective computing. Using manual feature extraction methods on EEG signals results in sub-optimal performance by the learning models. With the advancements in deep learning as a tool for automated feature engineering, in this work, a hybrid of manual and automatic feature extraction methods has been proposed. The asymmetry in different brain regions is captured in a 2D vector, termed the AsMap, from the differential entropy features of EEG signals. These AsMaps are then used to extract features automatically using a convolutional neural network model. The proposed feature extraction method has been compared with differential entropy and other feature extraction methods such as relative asymmetry, differential asymmetry and differential caudality. Experiments are conducted using the SJTU emotion EEG dataset and the DEAP dataset on different classification problems based on the number of classes. Results obtained indicate that the proposed method of feature extraction results in higher classification accuracy, outperforming the other feature extraction methods. The highest classification accuracy of 97.10% is achieved on a three-class classification problem using the SJTU emotion EEG dataset. Further, this work has also assessed the impact of window size on classification accuracy. △ Less

Submitted 20 March, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

Comments: 18 pages, 7 figures, published in Sensors (MDPI)

ACM Class: I.2; J.7

Journal ref: Sensors 2022, 22(6), 2346

arXiv:2201.08973 [pdf, other]

Simulated Thick, Fully-Depleted CCD Exposures Analyzed with Deep Learning Techniques

Authors: C. Britt, E. Church, T. Hossbach, B. Loer, R. Saldanha, N. Sinha, K. Woodruff

Abstract: Thick, Charge Coupled Devices (CCDs) have recently been explored for applied physics, such as nuclear explosion monitoring, and dark matter detection purposes. When run in fully-depleted mode, these devices are sensitive detectors for energy depositions by a variety of primary particles. In this study we are interested in applying the Deep Learning (DL) technique known as panoptic segmentation to… ▽ More Thick, Charge Coupled Devices (CCDs) have recently been explored for applied physics, such as nuclear explosion monitoring, and dark matter detection purposes. When run in fully-depleted mode, these devices are sensitive detectors for energy depositions by a variety of primary particles. In this study we are interested in applying the Deep Learning (DL) technique known as panoptic segmentation to simulated CCD images to identify, attribute and measure energy depositions from radioisotopes of interest. We simulate CCD exposures of a chosen radioxenon isotope, $^{135}$Xe, and overlay a simulated cosmic muon background appropriate for a surface-lab. We show that with this DL technique we can reproduce the beta spectrum to good accuracy, while suffering expected confusion with same-topology gammas and conversion electrons and identifying cosmic muons less than optimally. △ Less

Submitted 22 January, 2022; originally announced January 2022.

Report number: PNNL-SA-169763

arXiv:2201.01703 [pdf, other]

Probing TryOnGAN

Authors: Saurabh Kumar, Nishant Sinha

Abstract: TryOnGAN is a recent virtual try-on approach, which generates highly realistic images and outperforms most previous approaches. In this article, we reproduce the TryOnGAN implementation and probe it along diverse angles: impact of transfer learning, variants of conditioning image generation with poses and properties of latent space interpolation. Some of these facets have never been explored in li… ▽ More TryOnGAN is a recent virtual try-on approach, which generates highly realistic images and outperforms most previous approaches. In this article, we reproduce the TryOnGAN implementation and probe it along diverse angles: impact of transfer learning, variants of conditioning image generation with poses and properties of latent space interpolation. Some of these facets have never been explored in literature earlier. We find that transfer helps training initially but gains are lost as models train longer and pose conditioning via concatenation performs better. The latent space self-disentangles the pose and the style features and enables style transfer across poses. Our code and models are available in open source. △ Less

Submitted 5 January, 2022; originally announced January 2022.

Comments: 5 pages, to appear in the proceedings of the 9th ACM IKDD CODS and 27th COMAD (CODS-COMAD '22)

arXiv:2201.01468 [pdf]

doi 10.1016/j.eswa.2022.118901

Neural Network-Based Feature Extraction for Multi-Class Motor Imagery Classification

Authors: Souvik Phadikar, Nidul Sinha, Rajdeep Ghosh

Abstract: Decoding of motor imagery (MI) from Electroencephalogram (EEG) is an important component of the Brain-Computer Interface (BCI) system that helps motor-disabled people interact with the outside world via external devices. The main issue in develo** the EEG based BCI is the informative confusion due to the non-stationary characteristics of EEG data. In this work, an innovative idea of transforming… ▽ More Decoding of motor imagery (MI) from Electroencephalogram (EEG) is an important component of the Brain-Computer Interface (BCI) system that helps motor-disabled people interact with the outside world via external devices. The main issue in develo** the EEG based BCI is the informative confusion due to the non-stationary characteristics of EEG data. In this work, an innovative idea of transforming an EEG signal into the weight vector of an unsupervised neural network called the autoencoder is proposed for the first time to solve that problem. Separate autoencoders are trained for the individual EEG data. The weight vectors are then optimized for the individual EEG signals. The EEG signals are thus represented in a new domain that is in the form of weight vectors of the individual autoencoder. The weight vectors are then used to extract features such as autoregressive coefficients (ARs), Shannon entropy (SE), and wavelet leader. A window-based feature extraction technique is implemented to capture the local features of the EEG data. Finally, extracted features are classified using a classifier network. The proposed approach is tested on two publicly accessible EEG datasets (BCI competition-III and Competition-IV) to ensure that it is as successful as and superior to the previously published methods. The proposed technique achieves a mean accuracy of 95.33 % for dataset-IIIa from BCI-III and a mean accuracy of 97% for dataset-IIa from BCI-IV for four-class EEG-based MI classification. The experimental outcomes show that the proposed approach is a promising way to increase BCI performance. △ Less

Submitted 5 January, 2022; originally announced January 2022.

arXiv:2201.01462 [pdf, other]

doi 10.3390/s22082948

Automatic Muscle Artifacts Identification and Removal from Single-Channel EEG Using Wavelet Transform with Meta-heuristically Optimized Non-local Means Filter

Authors: Souvik Phadikar, Nidul Sinha, Rajdeep Ghosh, Ebrahim Ghaderpour

Abstract: Electroencephalogram (EEG) signals may get easily contaminated by muscle artifacts, which may lead to wrong interpretation in the brain--computer interface (BCI) system as well as in various medical diagnoses. The main objective of this paper is to remove muscle artifacts without distorting the information contained in the EEG. A novel multi-stage EEG denoising method is proposed for the first tim… ▽ More Electroencephalogram (EEG) signals may get easily contaminated by muscle artifacts, which may lead to wrong interpretation in the brain--computer interface (BCI) system as well as in various medical diagnoses. The main objective of this paper is to remove muscle artifacts without distorting the information contained in the EEG. A novel multi-stage EEG denoising method is proposed for the first time in which wavelet packet decomposition (WPD) is combined with a modified non-local means (NLM) algorithm. At first, the artifact EEG signal is identified through a pre-trained classifier. Next, the identified EEG signal is decomposed into wavelet coefficients and corrected through a modified NLM filter. Finally, the artifact-free EEG is reconstructed from corrected wavelet coefficients through inverse WPD. To optimize the filter parameters, two meta-heuristic algorithms are used in this paper for the first time. The proposed system is first validated on simulated EEG data and then tested on real EEG data. The proposed approach achieved average mutual information (MI) as 2.9684 $\pm$ 0.7045 on real EEG data. The result reveals that the proposed system outperforms recently developed denoising techniques with higher average MI, which indicates that the proposed approach is better in terms of quality of reconstruction and is fully automatic. △ Less

Submitted 13 April, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

Comments: 21 pages, 9 figures, 9 tables

Report number: Sensors 2022, 22(8), 2948

Journal ref: Sensors 2022, 22, 2948

arXiv:2112.12180 [pdf, other]

doi 10.5220/0010841400003124

Multimodal Personality Recognition using Cross-Attention Transformer and Behaviour Encoding

Authors: Tanay Agrawal, Dhruv Agarwal, Michal Balazia, Neelabh Sinha, Francois Bremond

Abstract: Personality computing and affective computing have gained recent interest in many research areas. The datasets for the task generally have multiple modalities like video, audio, language and bio-signals. In this paper, we propose a flexible model for the task which exploits all available data. The task involves complex relations and to avoid using a large model for video processing specifically, w… ▽ More Personality computing and affective computing have gained recent interest in many research areas. The datasets for the task generally have multiple modalities like video, audio, language and bio-signals. In this paper, we propose a flexible model for the task which exploits all available data. The task involves complex relations and to avoid using a large model for video processing specifically, we propose the use of behaviour encoding which boosts performance with minimal change to the model. Cross-attention using transformers has become popular in recent times and is utilised for fusion of different modalities. Since long term relations may exist, breaking the input into chunks is not desirable, thus the proposed model processes the entire input together. Our experiments show the importance of each of the above contributions △ Less

Submitted 12 January, 2023; v1 submitted 22 December, 2021; originally announced December 2021.

Comments: Preprint. Final paper accepted at the 17th International Conference on Computer Vision Theory and Applications (VISAPP), virtual, February, 2022. 8 pages

MSC Class: 68T05; 68T10 ACM Class: I.5

arXiv:2111.02720 [pdf]

A Theoretical and Computational Study of H$_2$ Physisorption on Covalent Organic Framework Linkers and Metalated Linkers: A Strategy to Enhance Binding Strength

Authors: Nilima Sinha, Srimanta Pakhira

Abstract: Hydrogen is deemed as an attractive energy carrier alternative to fossil fuels, and it is required to store for many applications. Physisorption is one of the promising ways to store H$_2$ for its practical applications. Covalent Organic Frameworks (COFs) are promising candidates for H$_2$-storage due to high porosity, surface area and tunable characteristics. To improve the hydrogen physisorption… ▽ More Hydrogen is deemed as an attractive energy carrier alternative to fossil fuels, and it is required to store for many applications. Physisorption is one of the promising ways to store H$_2$ for its practical applications. Covalent Organic Frameworks (COFs) are promising candidates for H$_2$-storage due to high porosity, surface area and tunable characteristics. To improve the hydrogen physisorption in the COFs, the chelation of transition metals (TM) in the building blocks of the framework has been studied by using first principle-based density functional theory (DFT) method. Here, we report total 96 H$_2$ complexes made of six different COF linkers and chelated with the Sc, Ti and V atoms interacting with up to H$_2$ molecules. The molecular interactions between physisorption H$_2$ and these Sc-, Ti- and V-chelated linkers have been explored in detail. The binding enthalpy of the most complexes is higher than ~10 kJ/mol, which is the basic requirement for practical H$_2$-storage. In the total interaction energy (between physisorption H$_2$ and chelated linkers), the dispersion and electrostatic interactions are dominant. This study is essential in finding out the more efficient COF linkers for practical H$_2$ storage. It can also help to improve the uptake of existing porous materials for H$_2$ storage. The present study paves a way to design transition metal chelated COFs for an effective H$_2$-storage and the knowledge gained from this study is expected to provide some inspiration for develo** the corresponding experiments. △ Less

Submitted 4 November, 2021; originally announced November 2021.

arXiv:2110.04828 [pdf, other]

doi 10.1109/AVSS52988.2021.9663816

FLAME: Facial Landmark Heatmap Activated Multimodal Gaze Estimation

Authors: Neelabh Sinha, Michal Balazia, Francois Bremond

Abstract: 3D gaze estimation is about predicting the line of sight of a person in 3D space. Person-independent models for the same lack precision due to anatomical differences of subjects, whereas person-specific calibrated techniques add strict constraints on scalability. To overcome these issues, we propose a novel technique, Facial Landmark Heatmap Activated Multimodal Gaze Estimation (FLAME), as a way o… ▽ More 3D gaze estimation is about predicting the line of sight of a person in 3D space. Person-independent models for the same lack precision due to anatomical differences of subjects, whereas person-specific calibrated techniques add strict constraints on scalability. To overcome these issues, we propose a novel technique, Facial Landmark Heatmap Activated Multimodal Gaze Estimation (FLAME), as a way of combining eye anatomical information using eye landmark heatmaps to obtain precise gaze estimation without any person-specific calibration. Our evaluation demonstrates a competitive performance of about 10% improvement on benchmark datasets ColumbiaGaze and EYEDIAP. We also conduct an ablation study to validate our method. △ Less

Submitted 7 December, 2022; v1 submitted 10 October, 2021; originally announced October 2021.

Comments: Preprint. Final paper accepted at the 17th IEEE International Conference on Advanced Video and Signal-based Surveillance (AVSS), virtual, November 2021. 8 pages

MSC Class: 68T05; 68T10 ACM Class: I.5

arXiv:2109.03435 [pdf, other]

SSEGEP: Small SEGment Emphasized Performance evaluation metric for medical image segmentation

Authors: Ammu R, Neelam Sinha

Abstract: Automatic image segmentation is a critical component of medical image analysis, and hence quantifying segmentation performance is crucial. Challenges in medical image segmentation are mainly due to spatial variations of regions to be segmented and imbalance in distribution of classes. Commonly used metrics treat all detected pixels, indiscriminately. However, pixels in smaller segments must be tre… ▽ More Automatic image segmentation is a critical component of medical image analysis, and hence quantifying segmentation performance is crucial. Challenges in medical image segmentation are mainly due to spatial variations of regions to be segmented and imbalance in distribution of classes. Commonly used metrics treat all detected pixels, indiscriminately. However, pixels in smaller segments must be treated differently from pixels in larger segments, as detection of smaller ones aid in early treatment of associated disease and are also easier to miss. To address this, we propose a novel evaluation metric for segmentation performance, emphasizing smaller segments, by assigning higher weightage to smaller segment pixels. Weighted false positives are also considered in deriving the new metric named, "SSEGEP"(Small SEGment Emphasized Performance evaluation metric), (range : 0(Bad) to 1(Good)). The experiments were performed on diverse anatomies(eye, liver, pancreas and breast) from publicly available datasets to show applicability of the proposed metric across different imaging techniques. Mean opinion score (MOS) and statistical significance testing is used to quantify the relevance of proposed approach. Across 33 fundus images, where the largest exudate is 1.41%, and the smallest is 0.0002% of the image, the proposed metric is 30% closer to MOS, as compared to Dice Similarity Coefficient (DSC). Statistical significance testing resulted in promising p-value of order 10^{-18} with SSEGEP for hepatic tumor compared to DSC. The proposed metric is found to perform better for the images having multiple segments for a single label. △ Less

Submitted 8 September, 2021; originally announced September 2021.

arXiv:2109.01572 [pdf, other]

Using Topological Framework for the Design of Activation Function and Model Pruning in Deep Neural Networks

Authors: Yogesh Kochar, Sunil Kumar Vengalil, Neelam Sinha

Abstract: Success of deep neural networks in diverse tasks across domains of computer vision, speech recognition and natural language processing, has necessitated understanding the dynamics of training process and also working of trained models. Two independent contributions of this paper are 1) Novel activation function for faster training convergence 2) Systematic pruning of filters of models trained irre… ▽ More Success of deep neural networks in diverse tasks across domains of computer vision, speech recognition and natural language processing, has necessitated understanding the dynamics of training process and also working of trained models. Two independent contributions of this paper are 1) Novel activation function for faster training convergence 2) Systematic pruning of filters of models trained irrespective of activation function. We analyze the topological transformation of the space of training samples as it gets transformed by each successive layer during training, by changing the activation function. The impact of changing activation function on the convergence during training is reported for the task of binary classification. A novel activation function aimed at faster convergence for classification tasks is proposed. Here, Betti numbers are used to quantify topological complexity of data. Results of experiments on popular synthetic binary classification datasets with large Betti numbers(>150) using MLPs are reported. Results show that the proposed activation function results in faster convergence requiring fewer epochs by a factor of 1.5 to 2, since Betti numbers reduce faster across layers with the proposed activation function. The proposed methodology was verified on benchmark image datasets: fashion MNIST, CIFAR-10 and cat-vs-dog images, using CNNs. Based on empirical results, we propose a novel method for pruning a trained model. The trained model was pruned by eliminating filters that transform data to a topological space with large Betti numbers. All filters with Betti numbers greater than 300 were removed from each layer without significant reduction in accuracy. This resulted in faster prediction time and reduced memory size of the model. △ Less

Submitted 3 September, 2021; originally announced September 2021.

arXiv:2109.00479 [pdf, other]

Towards Learning a Vocabulary of Visual Concepts and Operators using Deep Neural Networks

Authors: Sunil Kumar Vengalil, Neelam Sinha

Abstract: Deep neural networks have become the default choice for many applications like image and video recognition, segmentation and other image and video related tasks.However, a critical challenge with these models is the lack of explainability.This requirement of generating explainable predictions has motivated the research community to perform various analysis on trained models.In this study, we analy… ▽ More Deep neural networks have become the default choice for many applications like image and video recognition, segmentation and other image and video related tasks.However, a critical challenge with these models is the lack of explainability.This requirement of generating explainable predictions has motivated the research community to perform various analysis on trained models.In this study, we analyze the learned feature maps of trained models using MNIST images for achieving more explainable predictions.Our study is focused on deriving a set of primitive elements, here called visual concepts, that can be used to generate any arbitrary sample from the data generating distribution.We derive the primitive elements from the feature maps learned by the model.We illustrate the idea by generating visual concepts from a Variational Autoencoder trained using MNIST images.We augment the training data of MNIST dataset by adding about 60,000 new images generated with visual concepts chosen at random.With this we were able to reduce the reconstruction loss (mean square error) from an initial value of 120 without augmentation to 60 with augmentation.Our approach is a first step towards the final goal of achieving trained deep neural network models whose predictions, features in hidden layers and the learned filters can be well explained.Such a model when deployed in production can easily be modified to adapt to new data, whereas existing deep learning models need a re training or fine tuning. This process again needs a huge number of data samples that are not easy to generate unless the model has good explainability. △ Less

Submitted 1 September, 2021; originally announced September 2021.

arXiv:2107.07266 [pdf, other]

Neural Architecture Search using Covariance Matrix Adaptation Evolution Strategy

Authors: Nilotpal Sinha, Kuan-Wen Chen

Abstract: Evolution-based neural architecture search requires high computational resources, resulting in long search time. In this work, we propose a framework of applying the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) to the neural architecture search problem called CMANAS, which achieves better results than previous evolution-based methods while reducing the search time significantly. The ar… ▽ More Evolution-based neural architecture search requires high computational resources, resulting in long search time. In this work, we propose a framework of applying the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) to the neural architecture search problem called CMANAS, which achieves better results than previous evolution-based methods while reducing the search time significantly. The architectures are modelled using a normal distribution, which is updated using CMA-ES based on the fitness of the sampled population. We used the accuracy of a trained one shot model (OSM) on the validation data as a prediction of the fitness of an individual architecture to reduce the search time. We also used an architecture-fitness table (AF table) for kee** record of the already evaluated architecture, thus further reducing the search time. CMANAS finished the architecture search on CIFAR-10 with the top-1 test accuracy of 97.44% in 0.45 GPU day and on CIFAR-100 with the top-1 test accuracy of 83.24% for 0.6 GPU day on a single GPU. The top architectures from the searches on CIFAR-10 and CIFAR-100 were then transferred to ImageNet, achieving the top-5 accuracy of 92.6% and 92.1%, respectively. △ Less

Submitted 15 July, 2021; originally announced July 2021.

Comments: Under review (Submitted to IEEE Transactions on Evolutionary Computation)

arXiv:2106.00547 [pdf, other]

doi 10.1142/S0217751X22501317

ALP-portal majorana dark matter

Authors: Shivam Gola, Sanjoy Mandal, Nita Sinha

Abstract: Axion like particles(ALPs) and right handed neutrinos~(RHNs) are two well-motivated dark matter(DM) candidates. However, these two particles have a completely different origin. Axion was proposed to solve the Strong CP problem, whereas RHNs were introduced to explain light neutrino masses through seesaw mechanisms. We study the case of ALP portal RHN DM taking into account existing constraints on… ▽ More Axion like particles(ALPs) and right handed neutrinos~(RHNs) are two well-motivated dark matter(DM) candidates. However, these two particles have a completely different origin. Axion was proposed to solve the Strong CP problem, whereas RHNs were introduced to explain light neutrino masses through seesaw mechanisms. We study the case of ALP portal RHN DM taking into account existing constraints on ALPs. We consider the leading effective operators mediating interactions between the ALP and SM particles and three RHNs to generate light neutrino masses through type-I seesaw. Further, ALP-RHN neutrino coupling is introduced to generalize the model which is restricted by the relic density and indirect detection constraint. △ Less

Submitted 1 August, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

Comments: 18 pages, 6 figures, Matched with the published version

arXiv:2012.12540 [pdf, other]

Evolving Neural Architecture Using One Shot Model

Authors: Nilotpal Sinha, Kuan-Wen Chen

Abstract: Neural Architecture Search (NAS) is emerging as a new research direction which has the potential to replace the hand-crafted neural architectures designed for specific tasks. Previous evolution based architecture search requires high computational resources resulting in high search time. In this work, we propose a novel way of applying a simple genetic algorithm to the NAS problem called EvNAS (Ev… ▽ More Neural Architecture Search (NAS) is emerging as a new research direction which has the potential to replace the hand-crafted neural architectures designed for specific tasks. Previous evolution based architecture search requires high computational resources resulting in high search time. In this work, we propose a novel way of applying a simple genetic algorithm to the NAS problem called EvNAS (Evolving Neural Architecture using One Shot Model) which reduces the search time significantly while still achieving better result than previous evolution based methods. The architectures are represented by using the architecture parameter of the one shot model which results in the weight sharing among the architectures for a given population of architectures and also weight inheritance from one generation to the next generation of architectures. We propose a decoding technique for the architecture parameter which is used to divert majority of the gradient information towards the given architecture and is also used for improving the performance prediction of the given architecture from the one shot model during the search process. Furthermore, we use the accuracy of the partially trained architecture on the validation data as a prediction of its fitness in order to reduce the search time. EvNAS searches for the architecture on the proxy dataset i.e. CIFAR-10 for 4.4 GPU day on a single GPU and achieves top-1 test error of 2.47% with 3.63M parameters which is then transferred to CIFAR-100 and ImageNet achieving top-1 error of 16.37% and top-5 error of 7.4% respectively. All of these results show the potential of evolutionary methods in solving the architecture search problem. △ Less

Submitted 23 December, 2020; originally announced December 2020.

arXiv:2011.10568 [pdf, other]

Learn to Bind and Grow Neural Structures

Authors: Azhar Shaikh, Nishant Sinha

Abstract: Task-incremental learning involves the challenging problem of learning new tasks continually, without forgetting past knowledge. Many approaches address the problem by expanding the structure of a shared neural network as tasks arrive, but struggle to grow optimally, without losing past knowledge. We present a new framework, Learn to Bind and Grow, which learns a neural architecture for a new task… ▽ More Task-incremental learning involves the challenging problem of learning new tasks continually, without forgetting past knowledge. Many approaches address the problem by expanding the structure of a shared neural network as tasks arrive, but struggle to grow optimally, without losing past knowledge. We present a new framework, Learn to Bind and Grow, which learns a neural architecture for a new task incrementally, either by binding with layers of a similar task or by expanding layers which are more likely to conflict between tasks. Central to our approach is a novel, interpretable, parameterization of the shared, multi-task architecture space, which then enables computing globally optimal architectures using Bayesian optimization. Experiments on continual learning benchmarks show that our framework performs comparably with earlier expansion based approaches and is able to flexibly compute multiple optimal solutions with performance-size trade-offs. △ Less

Submitted 21 November, 2020; originally announced November 2020.

Comments: Accepted to 8th ACM IKDD CODS and 26th COMAD (CODS-COMAD '21) conference

arXiv:2010.13197 [pdf, other]

doi 10.1145/3430984.3430993

Gestop : Customizable Gesture Control of Computer Systems

Authors: Sriram Krishna, Nishant Sinha

Abstract: The established way of interfacing with most computer systems is a mouse and keyboard. Hand gestures are an intuitive and effective touchless way to interact with computer systems. However, hand gesture based systems have seen low adoption among end-users primarily due to numerous technical hurdles in detecting in-air gestures accurately. This paper presents Gestop, a framework developed to bridge… ▽ More The established way of interfacing with most computer systems is a mouse and keyboard. Hand gestures are an intuitive and effective touchless way to interact with computer systems. However, hand gesture based systems have seen low adoption among end-users primarily due to numerous technical hurdles in detecting in-air gestures accurately. This paper presents Gestop, a framework developed to bridge this gap. The framework learns to detect gestures from demonstrations, is customizable by end-users and enables users to interact in real-time with computers having only RGB cameras, using gestures. △ Less

Submitted 25 October, 2020; originally announced October 2020.

Comments: 5 pages, 5 figures, to appear in the proceedings of the 8th ACM IKDD CODS and 26th COMAD (CODS-COMAD '21)

arXiv:2009.13567 [pdf]

doi 10.1111/epi.16819

Focal to bilateral tonic-clonic seizures are associated with widespread network abnormality in temporal lobe epilepsy

Authors: Nishant Sinha, Natalie Peternell, Gabrielle M. Schroeder, Jane de Tisi, Sjoerd B. Vos, Gavin P. Winston, John S. Duncan, Yujiang Wang, Peter N. Taylor

Abstract: Objective: To identify if whole-brain structural network alterations in patients with temporal lobe epilepsy (TLE) and focal to bilateral tonic-clonic seizures (FBTCS) differ from alterations in patients without FBTCS. Methods: We dichotomized a cohort of 83 drug-resistant patients with TLE into those with and without FBTCS and compared each group to 29 healthy controls. For each subject, we use… ▽ More Objective: To identify if whole-brain structural network alterations in patients with temporal lobe epilepsy (TLE) and focal to bilateral tonic-clonic seizures (FBTCS) differ from alterations in patients without FBTCS. Methods: We dichotomized a cohort of 83 drug-resistant patients with TLE into those with and without FBTCS and compared each group to 29 healthy controls. For each subject, we used diffusion MRI to construct whole-brain structural networks. First, we measured the extent of alterations by performing FBTCS-negative (FBTCS-) versus control and FBTCS-positive (FBTCS+) versus control comparisons, thereby delineating altered sub-networks of the whole-brain structural network. Second, by standardising networks of each patient using control networks, we measured the subject-specific abnormality at every brain region in the network, thereby quantifying the spatial localisation and the amount of abnormality in every patient. Results: Both FBTCS+ and FBTCS- patient groups had altered sub-networks with reduced fractional anisotropy (FA) and increased mean diffusivity (MD) compared to controls. The altered subnetwork in FBTCS+ patients was more widespread than in FBTCS- patients (441 connections altered at t>3, p<0.001 in FBTCS+ compared to 21 connections altered at t>3, p=0.01 in FBTCS-). Significantly greater abnormalities-aggregated over the entire brain network as well as assessed at the resolution of individual brain areas-were present in FBTCS+ patients (p<0.001, d=0.82). In contrast, the fewer abnormalities present in FBTCS- patients were mainly localised to the temporal and frontal areas. Significance: The whole-brain structural network is altered to a greater and more widespread extent in patients with TLE and FBTCS. We suggest that these abnormal networks may serve as an underlying structural basis or consequence of the greater seizure spread observed in FBTCS. △ Less

Submitted 28 September, 2020; originally announced September 2020.

Journal ref: Epilepsia. 2021 62(3):729-741

arXiv:2009.02362 [pdf]

Bootstrap p-values reduce type 1 error of the robust rank-order test of difference in medians

Authors: Nirvik Sinha

Abstract: The robust rank-order test (Fligner and Policello, 1981) was designed as an improvement of the non-parametric Wilcoxon-Mann-Whitney U-test to be more appropriate when the samples being compared have unequal variance. However, it tends to be excessively liberal when the samples are asymmetric. This is likely because the test statistic is assumed to have a standard normal distribution for sample siz… ▽ More The robust rank-order test (Fligner and Policello, 1981) was designed as an improvement of the non-parametric Wilcoxon-Mann-Whitney U-test to be more appropriate when the samples being compared have unequal variance. However, it tends to be excessively liberal when the samples are asymmetric. This is likely because the test statistic is assumed to have a standard normal distribution for sample sizes > 12. This work proposes an on-the-fly method to obtain the distribution of the test statistic from which the critical/p-value may be computed directly. The method of likelihood maximization is used to estimate the parameters of the parent distributions of the samples being compared. Using these estimated populations, the null distribution of the test statistic is obtained by the Monte-Carlo method. Simulations are performed to compare the proposed method with that of standard normal approximation of the test statistic. For small sample sizes (<= 20), the Monte-Carlo method outperforms the normal approximation method. This is especially true for low values of significance levels (< 5%). Additionally, when the smaller sample has the larger standard deviation, the Monte-Carlo method outperforms the normal approximation method even for large sample sizes (= 40/60). The two methods do not differ in power. Finally, a Monte-Carlo sample size of 10^4 is found to be sufficient to obtain the aforementioned relative improvements in performance. Thus, the results of this study pave the way for development of a toolbox to perform the robust rank-order test in a distribution-free manner. △ Less

Submitted 4 September, 2020; originally announced September 2020.

Comments: 22 pages, 1 table, 8 figures

MSC Class: 62G10

arXiv:2008.05467 [pdf, other]

doi 10.1103/PhysRevD.104.095009

Interference effect in lepton number violating and conserving meson decays for a left-right symmetric model

Authors: Rohini M. Godbole, Siddharth P. Maharathy, Sanjoy Mandal, Manimala Mitra, Nita Sinha

Abstract: We study the effect of interference on the lepton number violating~(LNV) and lepton number conserving~(LNC) three-body meson decays $M_1^{+}\to l_i^{+} l_j^{\pm}π^{\mp}$, that arise in a TeV scale Left Right Symmetric model~(LRSM) with degenerate or nearly degenerate right handed~(RH) neutrinos. LRSM contains three RH neutrinos and a RH gauge boson. The RH neutrinos with masses in the range of… ▽ More We study the effect of interference on the lepton number violating~(LNV) and lepton number conserving~(LNC) three-body meson decays $M_1^{+}\to l_i^{+} l_j^{\pm}π^{\mp}$, that arise in a TeV scale Left Right Symmetric model~(LRSM) with degenerate or nearly degenerate right handed~(RH) neutrinos. LRSM contains three RH neutrinos and a RH gauge boson. The RH neutrinos with masses in the range of $M_N \sim$ (MeV - few GeV) can give resonant enhancement in the semi-leptonic LNV and LNC meson decays. In the case, where only one RH neutrino contributes to these decays, the predicted new physics branching ratio of semi-leptonic LNV and LNC meson decays $M_1^{+}\to l_i^{+} l_j^{+}π^{-}$ and $M_1^{+}\to l_i^{+} l_j^{-}π^{+}$ are equal. We find that with at least two RH neutrinos contributing to the process, the LNV and LNC decay rates can differ. Depending on the neutrino mixing angles and $CP$ violating phases, the branching ratios of LNV and LNC decay channels mediated by the heavy neutrinos can be either enhanced or suppressed, and the ratio of these two rates can differ from unity. △ Less

Submitted 15 November, 2021; v1 submitted 12 August, 2020; originally announced August 2020.

Comments: 40 pages, 12 figures

Journal ref: Phys.Rev.D 104 (2021) 9, 095009

arXiv:2007.14491 [pdf, other]

doi 10.1088/1361-6471/abf3ba

The Large Hadron-Electron Collider at the HL-LHC

Authors: P. Agostini, H. Aksakal, S. Alekhin, P. P. Allport, N. Andari, K. D. J. Andre, D. Angal-Kalinin, S. Antusch, L. Aperio Bella, L. Apolinario, R. Apsimon, A. Apyan, G. Arduini, V. Ari, A. Armbruster, N. Armesto, B. Auchmann, K. Aulenbacher, G. Azuelos, S. Backovic, I. Bailey, S. Bailey, F. Balli, S. Behera, O. Behnke , et al. (312 additional authors not shown)

Abstract: The Large Hadron electron Collider (LHeC) is designed to move the field of deep inelastic scattering (DIS) to the energy and intensity frontier of particle physics. Exploiting energy recovery technology, it collides a novel, intense electron beam with a proton or ion beam from the High Luminosity--Large Hadron Collider (HL-LHC). The accelerator and interaction region are designed for concurrent el… ▽ More The Large Hadron electron Collider (LHeC) is designed to move the field of deep inelastic scattering (DIS) to the energy and intensity frontier of particle physics. Exploiting energy recovery technology, it collides a novel, intense electron beam with a proton or ion beam from the High Luminosity--Large Hadron Collider (HL-LHC). The accelerator and interaction region are designed for concurrent electron-proton and proton-proton operation. This report represents an update of the Conceptual Design Report (CDR) of the LHeC, published in 2012. It comprises new results on parton structure of the proton and heavier nuclei, QCD dynamics, electroweak and top-quark physics. It is shown how the LHeC will open a new chapter of nuclear particle physics in extending the accessible kinematic range in lepton-nucleus scattering by several orders of magnitude. Due to enhanced luminosity, large energy and the cleanliness of the hadronic final states, the LHeC has a strong Higgs physics programme and its own discovery potential for new physics. Building on the 2012 CDR, the report represents a detailed updated design of the energy recovery electron linac (ERL) including new lattice, magnet, superconducting radio frequency technology and further components. Challenges of energy recovery are described and the lower energy, high current, 3-turn ERL facility, PERLE at Orsay, is presented which uses the LHeC characteristics serving as a development facility for the design and operation of the LHeC. An updated detector design is presented corresponding to the acceptance, resolution and calibration goals which arise from the Higgs and parton density function physics programmes. The paper also presents novel results on the Future Circular Collider in electron-hadron mode, FCC-eh, which utilises the same ERL technology to further extend the reach of DIS to even higher centre-of-mass energies. △ Less

Submitted 12 April, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

Comments: 373 pages, many figures, to be published by J. Phys. G

Report number: CERN-ACC-Note-2020-0002

Journal ref: J.Phys.G 48 (2021) 11, 110501

arXiv:2006.06634 [pdf, other]

Privacy-Preserving Image Features via Adversarial Affine Subspace Embeddings

Authors: Mihai Dusmanu, Johannes L. Schönberger, Sudipta N. Sinha, Marc Pollefeys

Abstract: Many computer vision systems require users to upload image features to the cloud for processing and storage. These features can be exploited to recover sensitive information about the scene or subjects, e.g., by reconstructing the appearance of the original image. To address this privacy concern, we propose a new privacy-preserving feature representation. The core idea of our work is to drop const… ▽ More Many computer vision systems require users to upload image features to the cloud for processing and storage. These features can be exploited to recover sensitive information about the scene or subjects, e.g., by reconstructing the appearance of the original image. To address this privacy concern, we propose a new privacy-preserving feature representation. The core idea of our work is to drop constraints from each feature descriptor by embedding it within an affine subspace containing the original feature as well as adversarial feature samples. Feature matching on the privacy-preserving representation is enabled based on the notion of subspace-to-subspace distance. We experimentally demonstrate the effectiveness of our method and its high practical relevance for the applications of visual localization and map** as well as face authentication. Compared to the original features, our approach makes it significantly more difficult for an adversary to recover private information. △ Less

Submitted 30 March, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

Comments: Accepted at CVPR 2021. 16 pages, 10 figures, 4 tables

Showing 1–50 of 138 results for author: Sinha, N