
Weakly supervised information extraction from inscrutable handwritten document images

Authors: Sujoy Paul, Gagan Madan, Akankshya Mishra, Narayan Hegde, Pradeep Kumar, Gaurav Aggarwal

Abstract: State-of-the-art information extraction methods are limited by OCR errors. They work well for printed text in form-like documents, but unstructured, handwritten documents still remain a challenge. Adapting existing models to domain-specific training data is quite expensive, because of two factors, 1) limited availability of the domain-specific documents (such as handwritten prescriptions, lab notes, etc.), and 2) annotations become even more challenging as one needs domain-specific knowledge to decode inscrutable handwritten document images. In this work, we focus on the complex problem of extracting medicine names from handwritten prescriptions using only weakly labeled data. The data consists of images along with the list of medicine names in it, but not their location in the image. We solve the problem by first identifying the regions of interest, i.e., medicine lines from just weak labels and then injecting a domain-specific medicine language model learned using only synthetically generated data. Compared to off-the-shelf state-of-the-art methods, our approach performs >2.5x better in medicine names extraction from prescriptions.

Submitted 11 June, 2023; originally announced June 2023.

Comments: Accepted at ICDAR 2023

arXiv:2306.03827 [pdf, other]

doi 10.1093/mnras/stad2909

Follow-up Analyses to the O3 LIGO-Virgo-KAGRA Lensing Searches

Authors: Justin Janquart, Mick Wright, Srashti Goyal, Juno C. L. Chan, Apratim Ganguly, Ángel Garrón, David Keitel, Alvin K. Y. Li, Anna Liu, Rico K. L. Lo, Anuj Mishra, Anupreeta More, Hemantakumar Phurailatpam, Prasia Pankunni, Sylvia Biscoveanu, Paolo Cremonese, Jean-René Cudell, José M. Ezquiaga, Juan Garcia-Bellido, Otto A. Hannuksela, K. Haris, Ian Harry, Martin Hendry, Sascha Husa, Shasvath Kapadia, et al. (6 additional authors not shown)

Abstract: Along their path from source to observer, gravitational waves may be gravitationally lensed by massive objects. This results in distortions of the observed signal which can be used to extract new information about fundamental physics, astrophysics, and cosmology. Searches for these distortions amongst the observed signals from the current detector network have already been carried out, though there have as yet been no confident detections. However, predictions of the observation rate of lensing suggest detection in the future is a realistic possibility. Therefore, preparations need to be made to thoroughly investigate the candidate lensed signals. In this work, we present some of the follow-up analyses and strategies that could be applied to assess the significance of such events and ascertain what information may be extracted about the lens-source system from such candidate signals by applying them to a number of O3 candidate events, even if these signals did not yield a high significance for any of the lensing hypotheses. For strongly-lensed candidates, we verify their significance using a background of simulated unlensed events and statistics computed from lensing catalogs. We also look for potential electromagnetic counterparts. In addition, we analyse in detail a candidate for a strongly-lensed sub-threshold counterpart that is identified by a new method. For microlensing candidates, we perform model selection using a number of lens models to investigate our ability to determine the mass density profile of the lens and constrain the lens parameters. We also look for millilensing signatures in one of the lensed candidates. Applying these additional analyses does not lead to any additional evidence for lensing in the candidates that have been examined. However, it does provide important insight into potential avenues to deal with high-significance candidates in future observations.

Submitted 15 August, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

Comments: 29 pages, 27 figures

Journal ref: Monthly Notices of the Royal Astronomical Society, 526, 3, 2023

arXiv:2305.18426 [pdf]

Employing Explainable Artificial Intelligence (XAI) Methodologies to Analyze the Correlation between Input Variables and Tensile Strength in Additively Manufactured Samples

Authors: Akshansh Mishra, Vijaykumar S Jatti

Abstract: This research paper explores the impact of various input parameters, including Infill percentage, Layer Height, Extrusion Temperature, and Print Speed, on the resulting Tensile Strength in objects produced through additive manufacturing. The main objective of this study is to enhance our understanding of the correlation between the input parameters and Tensile Strength, as well as to identify the key factors influencing the performance of the additive manufacturing process. To achieve this objective, we introduced the utilization of Explainable Artificial Intelligence (XAI) techniques for the first time, which allowed us to analyze the data and gain valuable insights into the system's behavior. Specifically, we employed SHAP (SHapley Additive exPlanations), a widely adopted framework for interpreting machine learning model predictions, to provide explanations for the behavior of a machine learning model trained on the data. Our findings reveal that the Infill percentage and Extrusion Temperature have the most significant influence on Tensile Strength, while the impact of Layer Height and Print Speed is relatively minor. Furthermore, we discovered that the relationship between the input parameters and Tensile Strength is highly intricate and nonlinear, making it difficult to accurately describe using simple linear models.

Submitted 28 May, 2023; originally announced May 2023.

arXiv:2305.07901 [pdf, other]

Morpheus: Automated Safety Verification of Data-dependent Parser Combinator Programs

Authors: Ashish Mishra, Suresh Jagannathan

Abstract: Parser combinators are a well-known mechanism used for the compositional construction of parsers, and have shown to be particularly useful in writing parsers for rich grammars with data-dependencies and global state. Verifying applications written using them, however, has proven to be challenging in large part because of the inherently effectful nature of the parsers being composed and the difficulty in reasoning about the arbitrarily rich data-dependent semantic actions that can be associated with parsing actions. In this paper, we address these challenges by defining a parser combinator framework called Morpheus equipped with abstractions for defining composable effects tailored for parsing and semantic actions and a rich specification language used to define safety properties over the constituent parsers comprising a program. Even though its abstractions yield many of the same expressivity benefits as other parser combinator systems, Morpheus is carefully engineered to yield a substantially more tractable automated verification pathway. We demonstrate its utility in verifying a number of realistic, challenging parsing applications, including several cases that involve non-trivial data-dependent relations.

Submitted 13 May, 2023; originally announced May 2023.

Comments: 41 pages

arXiv:2305.05668 [pdf]

Neurosymbolic Artificial Intelligence (NSAI) based Algorithm for predicting the Impact Strength of Additive Manufactured Polylactic Acid (PLA) Specimens

Authors: Akshansh Mishra, Vijaykumar S Jatti

Abstract: In this study, we introduce application of Neurosymbolic Artificial Intelligence (NSAI) for predicting the impact strength of additive manufactured polylactic acid (PLA) components, representing the first-ever use of NSAI in the domain of additive manufacturing. The NSAI model amalgamates the advantages of neural networks and symbolic AI, offering a more robust and accurate prediction than traditional machine learning techniques. Experimental data was collected and synthetically augmented to 1000 data points, enhancing the model's precision. The Neurosymbolic model was developed using a neural network architecture comprising input, two hidden layers, and an output layer, followed by a decision tree regressor representing the symbolic component. The model's performance was benchmarked against a Simple Artificial Neural Network (ANN) model by assessing mean squared error (MSE) and R-squared (R2) values for both training and validation datasets. The results reveal that the Neurosymbolic model surpasses the Simple ANN model, attaining lower MSE and higher R2 values for both training and validation sets. This innovative application of the Neurosymbolic approach in estimating the impact strength of additive manufactured PLA components underscores its potential for optimizing the additive manufacturing process. Future research could investigate further refinements to the Neurosymbolic model, extend its application to other materials and additive manufacturing processes, and incorporate real-time monitoring and control for enhanced process optimization.

Submitted 7 May, 2023; originally announced May 2023.

arXiv:2305.04456 [pdf, ps, other]

doi 10.1109/JSYST.2024.3404600

Distributed Coordination of Multi-Microgrids in Active Distribution Networks for Provisioning Ancillary Services

Authors: Arghya Mallick, Abhishek Mishra, Ashish R. Hota, Prabodh Bajpai

Abstract: With the phenomenal growth in renewable energy generation, the conventional synchronous generator-based power plants are gradually getting replaced by renewable energy sources-based microgrids. Such transition gives rise to the challenges of procuring various ancillary services from microgrids. We propose a distributed optimization framework that coordinates multiple microgrids in an active distribution network for provisioning passive voltage support-based ancillary services while satisfying operational constraints. Specifically, we exploit the reactive power support capability of the inverters and the flexibility offered by storage systems available with microgrids for provisioning ancillary service support to the transmission grid. We develop novel mixed-integer inequalities to represent the set of feasible active and reactive power exchange with the transmission grid that ensures passive voltage support. The proposed alternating direction method of multipliers-based algorithm is fully distributed, and does not require the presence of a centralized entity to achieve coordination among the microgrids. We present detailed numerical results on the IEEE 33-bus distribution test system to demonstrate the effectiveness of the proposed approach and examine the scalability and convergence behavior of the distributed algorithm for different choice of hyperparameters and network sizes.

Submitted 2 July, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

Journal ref: IEEE Systems Journal, 2024

arXiv:2304.13142 [pdf]

Quantum Machine Learning Approach for the Prediction of Surface Roughness in Additive Manufactured Specimens

Authors: Akshansh Mishra, Vijaykumar S. Jatti

Abstract: Surface roughness is a crucial factor influencing the performance and functionality of additive manufactured components. Accurate prediction of surface roughness is vital for optimizing manufacturing processes and ensuring the quality of the final product. Quantum computing has recently gained attention as a potential solution for tackling complex problems and creating precise predictive models. In this research paper, we conduct an in-depth comparison of three quantum algorithms, i.e., the Quantum Neural Network (QNN), Quantum Forest (Q-Forest), and Variational Quantum Classifier (VQC) adapted for regression, for predicting surface roughness in additive manufactured specimens for the first time. We assess the algorithms' performance using Mean Squared Error (MSE), Mean Absolute Error (MAE), and Explained Variance Score (EVS) as evaluation metrics. Our findings show that the Q-Forest algorithm surpasses the other algorithms, achieving an MSE of 56.905, MAE of 7.479, and an EVS of 0.2957. In contrast, the QNN algorithm displays a higher MSE of 60.840 and MAE of 7.671, coupled with a negative EVS of -0.444, indicating that it may not be appropriate for predicting surface roughness in this application. The VQC adapted for regression exhibits an MSE of 59.121, MAE of 7.597, and an EVS of -0.0106, suggesting its performance is also inferior to the Q-Forest algorithm.

Submitted 24 April, 2023; originally announced April 2023.

arXiv:2304.07789 [pdf]

Smart Watch Supported System for Health Care Monitoring

Authors: Anshuman Mishra, Richards Joe Stanislaus

Abstract: This work presents a smartwatch attached to patients at remote locations, which would help in the navigation of a wheelchair, monitor the vitals of patients, and relay them through IoT. This wearable smartwatch is equipped with sensors to measure health parameters, namely, heartbeat, blood pressure, body temperature, and step count. An ESP8266 Wi-Fi module uploads the health parameters into the ThingSpeak cloud platform with a time stamp. This smartwatch is equipped with a joystick for cruise and navigation control of the motor driver-enabled wheelchair. Additionally, an ultrasonic sensor mounted in front of the wheelchair continuously scans for any obstacles ahead and stops the motion of the wheelchair upon detection of an obstacle. The primary controller of the system is an Arduino UNO microcontroller, which interfaces the input and output modules.

Submitted 16 April, 2023; originally announced April 2023.

Comments: 5 pages and 9 figures

ACM Class: B.1.4

arXiv:2304.06966 [pdf]

Self-Supervised Learning based Depth Estimation from Monocular Images

Authors: Mayank Poddar, Akash Mishra, Mohit Kewlani, Haoyang Pei

Abstract: Depth Estimation has wide reaching applications in the field of Computer vision such as target tracking, augmented reality, and self-driving cars. The goal of Monocular Depth Estimation is to predict the depth map, given a 2D monocular RGB image as input. The traditional depth estimation methods are based on depth cues and used concepts like epipolar geometry. With the evolution of Convolutional Neural Networks, depth estimation has undergone tremendous strides. In this project, our aim is to explore possible extensions to existing SoTA Deep Learning based Depth Estimation Models and to see whether performance metrics could be further improved. In a broader sense, we are looking at the possibility of implementing Pose Estimation, Efficient Sub-Pixel Convolution Interpolation, Semantic Segmentation Estimation techniques to further enhance our proposed architecture and to provide fine-grained and more globally coherent depth map predictions. We also plan to do away with camera intrinsic parameters during training and apply weather augmentations to further generalize our model.

Submitted 14 April, 2023; originally announced April 2023.

arXiv:2304.06543 [pdf, other]

Load Balanced Demand Distribution under Overload Penalties

Authors: Sarnath Ramnath, Venkata M. V. Gunturi, Subi Dangol, Abhishek Mishra, Pradeep Kumar

Abstract: Input to the Load Balanced Demand Distribution (LBDD) consists of the following: (a) a set of public service centers (e.g., schools); (b) a set of demand (people) units and; (c) a cost matrix containing the cost of assignment for all demand unit-service center pairs. In addition, each service center is also associated with a notion of capacity and a penalty which is incurred if it gets overloaded. Given the input, the LBDD problem determines a mapping from the set of demand units to the set of service centers. The objective is to determine a mapping that minimizes the sum of the following two terms: (i) the total assignment cost between demand units and their allotted service centers and, (ii) total of penalties incurred. The problem of LBDD finds its application in the domain of urban planning. An instance of the LBDD problem can be reduced to an instance of the min-cost bi-partite matching problem. However, this approach cannot scale up to the real world large problem instances. The current state of the art related to LBDD makes simplifying assumptions such as infinite capacity or total capacity being equal to the total demand. This paper proposes a novel allotment subspace re-adjustment based approach (ASRAL) for the LBDD problem. We analyze ASRAL theoretically and present its asymptotic time complexity. We also evaluate ASRAL experimentally on large problem instances and compare with alternative approaches. Our results indicate that ASRAL is able to scale-up while maintaining significantly better solution quality over the alternative approaches. In addition, we also extend ASRAL to para-ASRAL which uses the GPU and CPU cores to speed-up the execution while maintaining the same solution quality as ASRAL.

Submitted 13 April, 2023; originally announced April 2023.

Comments: arXiv admin note: text overlap with arXiv:2009.01765

arXiv:2304.04640 [pdf, other]

NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems

Authors: Jason Yik, Korneel Van den Berghe, Douwe den Blanken, Younes Bouhadjar, Maxime Fabre, Paul Hueber, Denis Kleyko, Noah Pacik-Nelson, Pao-Sheng Vincent Sun, Guangzhi Tang, Shenqi Wang, Biyan Zhou, Soikat Hasan Ahmed, George Vathakkattil Joseph, Benedetto Leto, Aurora Micheli, Anurag Kumar Mishra, Gregor Lenz, Tao Sun, Zergham Ahmed, Mahmoud Akl, Brian Anderson, Andreas G. Andreou, Chiara Bartolozzi, Arindam Basu, et al. (73 additional authors not shown)

Abstract: Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neuromorphic computing benchmark efforts have not seen widespread adoption due to a lack of inclusive, actionable, and iterative benchmark design and guidelines. To address these shortcomings, we present NeuroBench: a benchmark framework for neuromorphic computing algorithms and systems. NeuroBench is a collaboratively-designed effort from an open community of nearly 100 co-authors across over 50 institutions in industry and academia, aiming to provide a representative structure for standardizing the evaluation of neuromorphic approaches. The NeuroBench framework introduces a common set of tools and systematic methodology for inclusive benchmark measurement, delivering an objective reference framework for quantifying neuromorphic approaches in both hardware-independent (algorithm track) and hardware-dependent (system track) settings. In this article, we present initial performance baselines across various model architectures on the algorithm track and outline the system track benchmark tasks and guidelines. NeuroBench is intended to continually expand its benchmarks and features to foster and track the progress made by the research community.

Submitted 17 January, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

Comments: Updated from whitepaper to full perspective article preprint

arXiv:2304.03745 [pdf, ps, other]

Assessing Perceived Fairness from Machine Learning Developer's Perspective

Authors: Anoop Mishra, Deepak Khazanchi

Abstract: Fairness in machine learning (ML) applications is an important practice for developers in research and industry. In ML applications, unfairness is triggered due to bias in the data, curation process, erroneous assumptions, and implicit bias rendered within the algorithmic development process. As ML applications come into broader use, developing fair ML applications is critical. Literature suggests multiple views on how fairness in ML is described from the users' perspective and students as future developers. In particular, ML developers have not been the focus of research relating to perceived fairness. This paper reports on a pilot investigation of ML developers' perception of fairness. In describing the perception of fairness, the paper performs an exploratory pilot study to assess the attributes of this construct using a systematic focus group of developers. In the focus group, we asked participants to discuss three questions: 1) What are the characteristics of fairness in ML? 2) What factors influence developers' belief about the fairness of ML? and 3) What practices and tools are utilized for fairness in ML development? The findings of this exploratory work from the focus group show that to assess fairness developers generally focus on the overall ML application design and development, i.e., business-specific requirements, data collection, pre-processing, in-processing, and post-processing. Thus, we conclude that the procedural aspects of organizational justice theory can explain developers' perception of fairness. The findings of this study can be utilized further to assist development teams in integrating fairness in the ML application development lifecycle. It will also motivate ML developers and organizations to develop best practices for assessing the fairness of ML-based applications.

Submitted 7 April, 2023; originally announced April 2023.

Comments: 12 pages, 5 tables, pilot study

ACM Class: J.4; J.1; K.4; K.6; I.2; E.m

arXiv:2304.03718 [pdf, other]

Integrating Edge-AI in Structural Health Monitoring domain

Authors: Anoop Mishra, Gopinath Gangisetti, Deepak Khazanchi

Abstract: Structural health monitoring (SHM) tasks like damage detection are crucial for decision-making regarding maintenance and deterioration. For example, crack detection in SHM is crucial for bridge maintenance as crack progression can lead to structural instability. However, most AI/ML models in the literature have low latency and late inference time issues while performing in real-time environments. This study aims to explore the integration of edge-AI in the SHM domain for real-time bridge inspections. Based on edge-AI literature, its capabilities will be valuable integration for a real-time decision support system in SHM tasks such that real-time inferences can be performed on physical sites. This study will utilize commercial edge-AI platforms, such as Google Coral Dev Board or Kneron KL520, to develop and analyze the effectiveness of edge-AI devices. Thus, this study proposes an edge AI framework for the structural health monitoring domain. An edge-AI-compatible deep learning model is developed to validate the framework to perform real-time crack classification. The effectiveness of this model will be evaluated based on its accuracy, the confusion matrix generated, and the inference time observed in a real-time setting.

Submitted 7 April, 2023; originally announced April 2023.

Comments: 7 pages, 8 figures

ACM Class: I.2.m; I.4; J.m

arXiv:2304.03487 [pdf, other]

ParaGraph: Weighted Graph Representation for Performance Optimization of HPC Kernels

Authors: Ali TehraniJamsaz, Alok Mishra, Akash Dutta, Abid M. Malik, Barbara Chapman, Ali Jannesari

Abstract: GPU-based HPC clusters are attracting more scientific application developers due to their extensive parallelism and energy efficiency. In order to achieve portability among a variety of multi/many core architectures, a popular choice for an application developer is to utilize directive-based parallel programming models, such as OpenMP. However, even with OpenMP, the developer must choose from among many strategies for exploiting a GPU or a CPU. Recently, Machine Learning (ML) approaches have brought significant advances in the optimizations of HPC applications. To this end, several ways have been proposed to represent application characteristics for ML models. However, the available techniques fail to capture features that are crucial for exposing parallelism. In this paper, we introduce a new graph-based program representation for parallel applications that extends the Abstract Syntax Tree to represent control and data flow information. The originality of this work lies in the addition of new edges exploiting the implicit ordering and parent-child relationships in ASTs, as well as the introduction of edge weights to account for loop and condition information. We evaluate our proposed representation by training a Graph Neural Network (GNN) to predict the runtime of an OpenMP code region across CPUs and GPUs. Various transformations utilizing collapse and data transfer between the CPU and GPU are used to construct the dataset. The predicted runtime of the model is used to determine which transformation provides the best performance. Results show that our approach is indeed effective and has normalized RMSE as low as 0.004 to at most 0.01 in its runtime predictions.

Submitted 7 April, 2023; originally announced April 2023.

arXiv:2304.03393 [pdf, ps, other]

Covering All the Bases: Type-Based Verification of Test Input Generators

Authors: Zhe Zhou, Ashish Mishra, Benjamin Delaware, Suresh Jagannathan

Abstract: Test input generators are an important part of property-based testing (PBT) frameworks. Because PBT is intended to test deep semantic and structural properties of a program, the outputs produced by these generators can be complex data structures, constrained to satisfy properties the developer believes are most relevant to testing the function of interest. An important feature expected of these generators is that they be capable of producing all acceptable elements that satisfy the function's input type and generator-provided constraints. However, it is not readily apparent how we might validate whether a particular generator's output satisfies this coverage requirement. Typically, developers must rely on manual inspection and post-mortem analysis of test runs to determine if the generator is providing sufficient coverage; these approaches are error-prone and difficult to scale as generators become more complex. To address this important concern, we present a new refinement type-based verification procedure for validating the coverage provided by input test generators, based on a novel interpretation of types that embeds "must-style" underapproximate reasoning principles as a fundamental part of the type system. The types associated with expressions now capture the set of values guaranteed to be produced by the expression, rather than the typical formulation that uses types to represent the set of values an expression may produce. Beyond formalizing the notion of coverage types in the context of a rich core language with higher-order procedures and inductive datatypes, we also present a detailed evaluation study to justify the utility of our ideas.

Submitted 9 April, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

arXiv:2304.01964 [pdf, other]

PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models

Authors: Aditi Mishra, Utkarsh Soni, Anjana Arunkumar, Jinbin Huang, Bum Chul Kwon, Chris Bryan

Abstract: Large Language Models (LLMs) have gained widespread popularity due to their ability to perform ad-hoc Natural Language Processing (NLP) tasks with a simple natural language prompt. Part of the appeal for LLMs is their approachability to the general public, including individuals with no prior technical experience in NLP techniques. However, natural language prompts can vary significantly in terms of their linguistic structure, context, and other semantics. Modifying one or more of these aspects can result in significant differences in task performance. Non-expert users may find it challenging to identify the changes needed to improve a prompt, especially when they lack domain-specific knowledge and lack appropriate feedback. To address this challenge, we present PromptAid, a visual analytics system designed to interactively create, refine, and test prompts through exploration, perturbation, testing, and iteration. PromptAid uses multiple, coordinated visualizations which allow users to improve prompts using three strategies: keyword perturbations, paraphrasing perturbations, and obtaining the best set of in-context few-shot examples. PromptAid was designed through an iterative prototyping process involving NLP experts and was evaluated through quantitative and qualitative assessments for LLMs. Our findings indicate that PromptAid helps users to iterate over prompt template alterations with less cognitive overhead, generate diverse prompts with the help of recommendations, and analyze the performance of the generated prompts while surpassing existing state-of-the-art prompting interfaces in performance.

Submitted 8 April, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

arXiv:2304.00890 [pdf, other]

MIMO Radars and Massive MIMO Communication Systems can Coexist

Authors: Aparna Mishra, Ribhu Chopra

Abstract: In this paper, we investigate the coexistence of a single cell massive MIMO communication system with a MIMO radar. We consider the case where the massive MIMO BS is aware of the radar's existence and treats it as a non-serviced user, but the radar is unaware of the communication system's existence and treats the signals transmitted by both the BS and the communication users as noise. Using results from random matrix theory, we derive the rates achievable by the communication system and the radar. We then use these expressions to obtain the achievable rate regions for the proposed joint radar and communications system. We observe that the availability of a large number of degrees of freedom at the mMIMO BS results in minimal interference even without co-design. Finally, we corroborate our findings via detailed numerical simulations and verify the validity of the results derived previously under different settings.

Submitted 22 July, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

Comments: 15 pages, 11 figures

arXiv:2303.12700 [pdf, other]

ReorientDiff: Diffusion Model based Reorientation for Object Manipulation

Authors: Utkarsh A. Mishra, Yongxin Chen

Abstract: The ability to manipulate objects into desired configurations is a fundamental requirement for robots to complete various practical applications. While certain goals can be achieved by picking and placing the objects of interest directly, object reorientation is needed for precise placement in most of the tasks. In such scenarios, the object must be reoriented and re-positioned into intermediate poses that facilitate accurate placement at the target pose. To this end, we propose a reorientation planning method, ReorientDiff, that utilizes a diffusion model-based approach. The proposed method employs both visual inputs from the scene and goal-specific language prompts to plan intermediate reorientation poses. Specifically, the scene and language-task information are mapped into a joint scene-task representation feature space, which is subsequently leveraged to condition the diffusion model. The diffusion model samples intermediate poses based on the representation using classifier-free guidance and then uses gradients of learned feasibility-score models for implicit iterative pose-refinement. The proposed method is evaluated using a set of YCB-objects and a suction gripper, demonstrating a success rate of 95.2% in simulation. Overall, our study presents a promising approach to address the reorientation challenge in manipulation by learning a conditional distribution, which is an effective way to move towards more generalizable object manipulation. For more results, check out our website: https://utkarshmishra04.github.io/ReorientDiff.

Submitted 14 September, 2023; v1 submitted 27 February, 2023; originally announced March 2023.

Comments: 7 pages, 5 figures; More details here: https://utkarshmishra04.github.io/ReorientDiff

arXiv:2303.08784 [pdf, other]

Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch

Authors: Aditay Tripathi, Anand Mishra, Anirban Chakraborty

Abstract: In this work, we investigate the problem of sketch-based object localization on natural images, where given a crude hand-drawn sketch of an object, the goal is to localize all the instances of the same object on the target image. This problem proves difficult due to the abstract nature of hand-drawn sketches, variations in the style and quality of sketches, and the large domain gap existing between the sketches and the natural images. To mitigate these challenges, existing works proposed attention-based frameworks to incorporate query information into the image features. However, in these works, the query features are incorporated after the image features have already been independently learned, leading to inadequate alignment. In contrast, we propose a sketch-guided vision transformer encoder that uses cross-attention after each block of the transformer-based image encoder to learn query-conditioned image features leading to stronger alignment with the query sketch. Further, at the output of the decoder, the object and the sketch features are refined to bring the representation of relevant objects closer to the sketch query and thereby improve the localization. The proposed model also generalizes to the object categories not seen during training, as the target image features learned by our method are query-aware. Our localization framework can also utilize multiple sketch queries via a trainable novel sketch fusion strategy. The model is evaluated on the images from the public object detection benchmark, namely MS-COCO, using the sketch queries from QuickDraw! and Sketchy datasets. Compared with existing localization methods, the proposed approach gives a $6.6\%$ and $8.0\%$ improvement in mAP for seen objects using sketch queries from QuickDraw! and Sketchy datasets, respectively, and a $12.2\%$ improvement in AP@50 for large objects that are `unseen' during training.

Submitted 15 March, 2023; originally announced March 2023.

arXiv:2303.04641 [pdf, ps, other]

Yukawa-Casimir wormholes in f(Q) gravity

Authors: Ambuj Kumar Mishra, Shweta, Umesh Kumar Sharma

Abstract: Casimir energy is often suggested as a possible source to create a traversable wormhole. It is also used to demonstrate the existence of negative energy, which can be created in a lab. To generalize this idea, a Yukawa modification of the Casimir source was considered by Remo Garattini (Eur. Phys. J. C 81 no.9, 824, 2021). In this work, we explore Yukawa-Casimir wormholes in symmetric teleparallel gravity. We have taken four different forms of $f(Q)$ to obtain wormhole solutions powered by the original Casimir energy source and the Yukawa modification of the Casimir energy source. In power law form $f(Q)= αQ^2 + β$ and quadratic form $f(Q)= αQ^2 + βQ + γ$, where $α, β, γ$ are constants and $Q$ is the non-metricity scalar, we find that the wormhole throat is filled with non-exotic matter. We find self-sustained traversable wormholes in the Casimir source where null energy conditions are violated in all specific forms of $f(Q)$, while after Yukawa modification it is observed that violation of null energy conditions is restricted to some regions in the vicinity of the throat.

Submitted 23 February, 2023; originally announced March 2023.

Comments: 21 pages, 12 figures

arXiv:2303.01501 [pdf, ps, other]

Stability and Machine Learning Applications of Persistent Homology Using the Delaunay-Rips Complex

Authors: Amish Mishra, Francis C. Motta

Abstract: In this paper we define, implement, and investigate a simplicial complex construction for computing persistent homology of Euclidean point cloud data, which we call the Delaunay-Rips complex (DR). Assigning the Vietoris-Rips weights to simplices, DR experiences speed-up in the persistence calculations by only considering simplices that appear in the Delaunay triangulation of the point cloud. We document and compare a Python implementation of DR with other simplicial complex constructions for generating persistence diagrams. By imposing sufficient conditions on point cloud data, we are able to theoretically justify the stability of the persistence diagrams produced using DR. When the Delaunay triangulation of the point cloud changes under perturbations of the points, we prove that DR-produced persistence diagrams exhibit instability. Since we cannot guarantee that real-world data will satisfy our stability conditions, we demonstrate the practical robustness of DR for persistent homology in comparison with other simplicial complexes in machine learning applications. We find in our experiments that using DR in an ML-TDA pipeline performs comparably to using other simplicial complex constructions.

Submitted 2 March, 2023; originally announced March 2023.

Comments: 23 pages, 10 figures and tables

arXiv:2303.01180 [pdf, ps, other]

On associated graded modules of maximal Cohen-Macaulay modules over hypersurface rings-II

Authors: Ankit Mishra, Tony J. Puthenpurakal

Abstract: Let $(A,\mathfrak{m})$ be a hypersurface ring of dimension $d$ with $e(A)=3$, and let $M$ be an MCM $A$-module with $μ(M)=4$. We prove that $\operatorname{depth} G(M)\geq d-3$.

Submitted 2 March, 2023; originally announced March 2023.

Comments: This paper consists of part of our paper arXiv:2106.13758. This was done due to the advice of some of our colleagues. arXiv admin note: substantial text overlap with arXiv:2208.02667

arXiv:2302.14777 [pdf, other]

VQA with Cascade of Self- and Co-Attention Blocks

Authors: Aakansha Mishra, Ashish Anand, Prithwijit Guha

Abstract: The use of complex attention modules has improved the performance of the Visual Question Answering (VQA) task. This work aims to learn an improved multi-modal representation through dense interaction of visual and textual modalities. The proposed model has an attention block containing both self-attention and co-attention on image and text. The self-attention modules provide the contextual information of objects (for an image) and words (for a question) that are crucial for inferring an answer. On the other hand, co-attention aids the interaction of image and text. Further, fine-grained information is obtained from two modalities by using a Cascade of Self- and Co-Attention blocks (CSCA). This proposal is benchmarked on the widely used VQA2.0 and TDIUC datasets. The efficacy of key components of the model and cascading of attention modules are demonstrated by experiments involving ablation analysis.

Submitted 28 February, 2023; originally announced February 2023.

arXiv:2302.14493 [pdf, other]

Open Strange Mesons in (magnetized) nuclear matter

Authors: Ankit Kumar, Amruta Mishra

Abstract: We investigate the mass modifications of open strange mesons (vector $K^*$ and axial vector $K_1$) in (magnetized) isospin asymmetric nuclear matter using the quantum chromodynamics sum rule (QCDSR) approach. The in-medium decay widths of $K^* \rightarrow Kπ$ and $K_1 \rightarrow K^*π$ are studied from the mass modifications of $K_1$, $K^*$ and $K$ mesons, using a light quark-antiquark pair creation model, namely the ${}^3 P_0$ model. The in-medium decay width for $K_1 \rightarrow K^*π$ is compared with the decay widths calculated using a phenomenological Lagrangian. The effects of magnetic fields are also studied on the mass and the partial decay width of the vector $K^*$ meson decaying to $Kπ$. Within the QCD sum rule approach, the medium effects on the masses of the open strange mesons are calculated from the light quark condensates and the gluon condensates in the hadronic medium. The quark condensates are calculated from the medium modifications of the scalar fields ($σ$, $ζ$, and $δ$) in the mean field approximation within a chiral $SU(3)$ model, while the scalar gluon condensate is obtained from the medium modification of a scalar dilaton field ($χ$), which is introduced within the model to imitate the scale invariance breaking of QCD.

Submitted 8 November, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

Comments: 42 pages, 11 figures, minor revisions

arXiv:2302.08940 [pdf, other]

doi 10.1088/1475-7516/2024/02/034

Primordial Black Holes Dark Matter and Secondary Gravitational Waves from Warm Higgs-G Inflation

Authors: Richa Arya, Rajeev Kumar Jain, Arvind Kumar Mishra

Abstract: We explore the role of dissipative effects during warm inflation leading to the small-scale enhancement of the power spectrum of curvature perturbations. In this paper, we specifically focus on non-canonical warm inflationary scenarios and study a model of warm Higgs-G inflation, in which the Standard Model Higgs boson drives inflation, with a Galileon-like non-linear kinetic term. We show that in the Galileon-dominated regime, the primordial power spectrum is strongly enhanced, leading to the formation of primordial black holes (PBH) with a wide range of the mass spectrum. Interestingly, PBHs in the asteroid mass window $\sim (10^{17}\text{--}10^{23})$ g are generated in this model, which can explain the total abundance of the dark matter in the Universe. In our analysis, we also calculate the secondary gravitational waves (GW) sourced by these small-scale overdense fluctuations and find that the induced GW spectrum can be detected by future GW detectors, such as LISA, BBO, DECIGO, etc. Our scenario thus provides a novel way of generating PBHs as dark matter and a detectable stochastic GW background from warm inflation. We also show that our scenario is consistent with the swampland and the trans-Planckian censorship conjectures and, thus, remains in the viable landscape of UV complete theories.

Submitted 21 March, 2024; v1 submitted 17 February, 2023; originally announced February 2023.

Comments: 39 pages, 11 figures

Journal ref: Journal of Cosmology and Astroparticle Physics 02, 034 (2024)

arXiv:2302.03787 [pdf, other]

Deep Neural Network Uncertainty Quantification for LArTPC Reconstruction

Authors: Dae Heun Koh, Aashwin Mishra, Kazuhiro Terao

Abstract: We evaluate uncertainty quantification (UQ) methods for deep learning applied to liquid argon time projection chamber (LArTPC) physics analysis tasks. As deep learning applications enter widespread usage among physics data analysis, neural networks with reliable estimates of prediction uncertainty and robust performance against overconfidence and out-of-distribution (OOD) samples are critical for their full deployment in analyzing experimental data. While numerous UQ methods have been tested on simple datasets, performance evaluations for more complex tasks and datasets are scarce. We assess the application of selected deep learning UQ methods on the task of particle classification using the PiLArNet [1] Monte Carlo 3D LArTPC point cloud dataset. We observe that UQ methods not only allow for better rejection of prediction mistakes and OOD detection, but also generally achieve higher overall accuracy across different task settings. We assess the precision of uncertainty quantification using different evaluation metrics, such as distributional separation of prediction entropy across correctly and incorrectly identified samples, receiver operating characteristic curves (ROCs), and expected calibration error from observed empirical accuracy. We conclude that ensembling methods can obtain well calibrated classification probabilities and generally perform better than other existing methods in deep learning UQ literature.

Submitted 31 October, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

arXiv:2302.03676 [pdf, other]

doi 10.3847/1538-4365/acdc9f

Open data from the third observing run of LIGO, Virgo, KAGRA and GEO

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, A. Allocca , et al. (1719 additional authors not shown)

Abstract: The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages.

Submitted 7 February, 2023; originally announced February 2023.

Comments: 27 pages, 3 figures

Report number: LIGO-P2200316

arXiv:2301.10426 [pdf, other]

doi 10.1103/PhysRevD.107.094001

Deep learning predicted elliptic flow of identified particles in heavy-ion collisions at the RHIC and LHC energies

Authors: Neelkamal Mallick, Suraj Prasad, Aditya Nath Mishra, Raghunath Sahoo, Gergely Gábor Barnaföldi

Abstract: Recent developments on a deep learning feed-forward network for estimating elliptic flow ($v_2$) coefficients in heavy-ion collisions have shown us the prediction power of this technique. The success of the model is mainly the estimation of $v_2$ from final state particle kinematic information and learning the centrality and the transverse momentum ($p_{\rm T}$) dependence of $v_2$. The deep learning model is trained with Pb-Pb collisions at $\sqrt{s_{\rm NN}} = 5.02$ TeV minimum bias events simulated with a multiphase transport model (AMPT). We extend this work to estimate $v_2$ for light-flavor identified particles such as $π^{\pm}$, $\rm K^{\pm}$, and $\rm p+\bar{p}$ in heavy-ion collisions at RHIC and LHC energies. The number of constituent quark (NCQ) scaling is also shown. The evolution of $p_{\rm T}$-crossing point of $v_2(p_{\rm T})$, depicting a change in meson-baryon elliptic flow at intermediate-$p_{\rm T}$, is studied for various collision systems and energies. The model is further evaluated by training it for different $p_{\rm T}$ regions. These results are compared with the available experimental data wherever possible.

Submitted 4 May, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

Comments: Same as the published version

Journal ref: Phys. Rev. D 107, 094001 (2023)

arXiv:2301.06302 [pdf]

doi 10.1016/j.sna.2023.114786

Eigenfrequency splitting with EPs order tunability in a coupled triple cavity system

Authors: Priyanka Chaudhary, Akhilesh Kumar Mishra

Abstract: Degeneracies of non-Hermitian Hamiltonians, i.e., exceptional points (EPs) of parity-time (PT)-symmetric systems, have received considerable research attention due to their various possible applications in optical devices. At EPs, at least two eigenvalues as well as their eigenvectors coalesce. Recently, the effect of the eigenfrequency splitting on the transfer function near an EP was studied for an optical system consisting of two micro ring resonators, which led to complex splitting in PT-symmetric and anti-PT-symmetric sensors. In the present work, we propose a simple system of three coupled ring resonators to show real splitting in both PT-symmetric and anti-PT-symmetric parameter domains by exploiting higher-order EPs. We indirectly couple two rings with equal amounts of gain and loss via an intermediate neutral ring. This system is then tested for refractive index (RI) sensing by modulating the cladding index and we numerically show a huge enhancement in sensitivity as compared to those reported in previous studies of micro ring resonators. Importantly, the enhancement is found to be of the order of 10^8. Further, we have found that the order of the EP can be tuned by perturbing the cladding. The outcomes of this study may set up a wide range of applications in non-Hermitian triplet cavity systems.

Submitted 2 February, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

Comments: 11 pages, 11 figures

Journal ref: Sensors and Actuators: A. Physical 364 (2023) 114786

arXiv:2301.04115 [pdf, other]

Sensing the Environment with 5G Scattered Signals (5G-CommSense): A Feasibility Analysis

Authors: Sandip Jana, Amit Kumar Mishra, Mohammed Zafar Ali Khan

Abstract: By making use of sensors and AI (SensAI) algorithms for a specialized task, the Application Specific INstrumentation (ASIN) framework incurs less computational overhead and gives good performance. This work evaluates the feasibility of an ASIN framework-dependent Communication based Sensing (CommSense) system using 5th Generation New Radio (5G NR) infrastructure. Since our proposed system is backed by 5G NR infrastructure, it is termed 5G-CommSense. In this paper, we have used NR channel models specified by the 3rd Generation Partnership Project (3GPP) and additive white Gaussian noise (AWGN) to vary the signal to noise ratio at the receiver. Finally, from our simulation results, we conclude that the proposed system is practically feasible.

Submitted 10 January, 2023; originally announced January 2023.

Comments: 3 pages, Accepted in conference

arXiv:2301.03636 [pdf, other]

OpenMP Advisor

Authors: Alok Mishra, Abid M. Malik, Meifeng Lin, Barbara Chapman

Abstract: With the increasing diversity of heterogeneous architecture in the HPC industry, porting a legacy application to run on different architectures is a tough challenge. In this paper, we present OpenMP Advisor, a first-of-its-kind compiler tool that enables code offloading to a GPU with OpenMP using Machine Learning. Although the tool is currently limited to GPUs, it can be extended to support other OpenMP-capable devices. The tool has two modes: Training mode and Prediction mode. The training mode must be executed on the target hardware. It takes benchmark codes as input, generates and executes every variant of the code that could possibly run on the target device, and then collects data from all of the executed codes to train an ML-based cost model for use in prediction mode. However, in prediction mode the tool does not need any interaction with the target device. It accepts a C code as input and returns the best code variant that can be used to offload the code to the specified device. The tool can determine the kernels that are best suited for offloading by predicting their runtime using a machine learning-based cost model. The main objective behind this tool is to maintain the portability aspect of OpenMP. Using our Advisor, we were able to generate code of multiple applications for seven different architectures, and correctly predict the top ten best variants for each application on every architecture. Preliminary findings indicate that this tool can assist compiler developers and HPC application researchers in porting their legacy HPC codes to the upcoming heterogeneous computing environment.

Submitted 9 January, 2023; originally announced January 2023.

arXiv:2301.02050 [pdf, other]

Open Charm Mesons and Charmonium states in Magnetized Strange Hadronic Medium at Finite Temperature

Authors: Amal Jahan C. S., Amruta Mishra

Abstract: We investigate the masses of the pseudoscalar ($D$($D^0$, $D^+$), $\bar{D}$($\bar{D^0}$, $D^-$)) and vector open charm mesons ($D^*$($D^{*0}$, $D^{*+}$), ${\bar{D}}^*$(${\bar{D}}^{*0}$, $D^{*-}$)) as well as the pseudoscalar ($η_c(1S)$, $η_c(2S)$) and the vector charmonium states ($J/ψ$, $ψ(2S)$, $ψ(1D)$) in the asymmetric hot strange hadronic medium in the presence of strong magnetic fields. In the magnetized medium, the mass modifications of open charm mesons due to their interactions with baryons and the scalar fields ($σ$, $ζ$, and $δ$) are investigated in a chiral effective model. Moreover, the charged pseudoscalar meson ($D^\pm$), as well as the longitudinal component of the charged vector meson ($D^{*\pm \parallel}$), experience additional positive mass modifications in the magnetic field due to Landau quantization. The effect of the modification of gluon condensates simulated by the medium change of the dilaton field $χ$ on the masses of the charmonia is also calculated in the chiral effective model. At high temperatures, the magnetically induced modifications of scalar fields significantly reduce the in-medium masses of mesons. The effects of magnetically induced spin mixing between the pseudoscalar and the vector mesons are incorporated in our study. The spin mixing results in a positive mass shift for the longitudinal component of the vector mesons and a negative mass shift for the pseudoscalar mesons in the presence of the magnetic field. From the obtained in-medium mass shifts of charmonia and open charm mesons, we have also calculated the partial decay widths of $ψ(1D)$ to $D\bar{D}$, using a light quark pair creation model, namely the $^3P_0$ model. Spin mixing and strangeness fraction enhance the partial decay width at small magnetic fields.

Submitted 5 January, 2023; originally announced January 2023.

Comments: 32 pages, 8 figures

arXiv:2212.12239 [pdf, other]

Medium modifications of Heavy Quarkonia masses in a generalized Linear Sigma Model

Authors: Arpita Mondal, Pallabi Parui, Amruta Mishra

Abstract: We study the mass shifts of the charmonium ($\bar{c}c$) states ($J/ψ$, $ψ(2S)$, $ψ(1D)$, $χ_{c0}$, $χ_{c1}$ and $χ_{c2}$) as well as the bottomonium ($\bar{b}b$) states ($Υ(1S)$, $Υ(2S)$, $Υ_2(1D)$, $χ_{b0}$, $χ_{b1}$ and $χ_{b2}$) in isospin asymmetric nuclear matter. These are investigated using a generalized linear sigma model. The broken scale invariance of QCD is incorporated in the chiral $SU(2)\times SU(2)$ Lagrangian through an effective potential involving logarithmic terms of a scalar (glueball) dilaton field $χ$. The mass shifts of the quarkonium states are obtained through the medium modifications of the dilaton field which simulates the scalar gluon condensate of QCD. We observe an appreciable mass drop in the states of heavy quarkonia under this study. The in-medium masses at finite densities thus obtained should modify the in-medium partial decay widths of heavy quarkonia to open heavy flavor mesons. These density effects can be probed in the high energy nuclear collisions at the future facilities at GSI (Germany) and JINR (Russia) in the experiments producing highly dense baryonic matter.

Submitted 23 December, 2022; originally announced December 2022.

Comments: 28 pages, 7 figures

arXiv:2212.11690 [pdf, other]

Geometric genuine multipartite entanglement for four-qubit systems

Authors: Ansh Mishra, Soumik Mahanti, Abhinash Kumar Roy, Prasanta K. Panigrahi

Abstract: Xie and Eberly introduced a genuine multipartite entanglement (GME) measure `concurrence fill' (\textit{Phys. Rev. Lett., \textbf{127}, 040403} (2021)) for three-party systems. It is defined as the area of a triangle whose side lengths represent squared concurrence in each bi-partition. However, it has been recently shown that concurrence fill is not monotonic under LOCC, hence not a faithful measure of entanglement. Though it is not a faithful entanglement measure, it encapsulates an elegant geometric interpretation of bipartite squared concurrences. There have been a few attempts to generalize the GME measure to four-party settings and beyond. However, some of them are not faithful, and others simply lack an elegant geometric interpretation. The recent proposal from Xie et al. constructs a concurrence tetrahedron, whose volume gives the amount of GME for four-party systems; with generalization to more than four parties being the hypervolume of the simplex structure in that dimension. Here, we show by construction that to capture all aspects of multipartite entanglement, one does not need a more complex structure, and the four-party entanglement can be demonstrated using \textit{2D geometry only}. The subadditivity together with the Araki-Lieb inequality of linear entropy is used to construct a direct extension of the geometric GME to four-party systems resulting in quadrilateral geometry. Our measure can be geometrically interpreted as a combination of three quadrilaterals whose sides result from the concurrence in one-to-three bi-partition, and diagonal as concurrence in two-to-two bipartition.

Submitted 11 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

Comments: 14 pages, 3 figures

arXiv:2212.08568 [pdf, other]

Biomedical image analysis competitions: The state of current participation practice

Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, et al. (331 additional authors not shown)

Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% of challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

arXiv:2212.01345 [pdf, other]

doi 10.1063/5.0128743

Extreme events in a complex network: interplay between degree distribution and repulsive interaction

Authors: Arnob Ray, Timo Bröhl, Arindam Mishra, Subrata Ghosh, Dibakar Ghosh, Tomasz Kapitaniak, Syamal K. Dana, Chittaranjan Hens

Abstract: The role of topological heterogeneity in the origin of extreme events in a network is investigated here. The dynamics of the oscillators associated with the nodes are assumed to be identical and influenced by mean-field repulsive interactions. An interplay of topological heterogeneity and the repulsive interaction between the dynamical units of the network triggers extreme events in the nodes when each node succumbs to such events for discretely different ranges of repulsive coupling. A high degree node is vulnerable to weaker repulsive interactions, while a low degree node is susceptible to stronger interactions. As a result, the formation of extreme events changes position with increasing strength of repulsive interaction from high to low degree nodes. Extreme events at any node are identified with the appearance of occasional large-amplitude events (amplitude of the temporal dynamics) that are larger than a threshold height and rare in occurrence, which we confirm by estimating the probability distribution of all events. Extreme events appear at any oscillator near the boundary of transition from rotation to libration at a critical value of the repulsive coupling strength. To explore the phenomenon, a paradigmatic second-order phase model is used to represent the dynamics of the oscillator associated with each node. We make an annealed network approximation to reduce our original model and thereby confirm the dual role of the repulsive interaction and the degree of a node in the origin of extreme events in any oscillator.

Submitted 19 November, 2022; originally announced December 2022.

Comments: 10 pages, 5 figures, Accepted for publication in Chaos: An Interdisciplinary Journal of Nonlinear Science

arXiv:2212.00749 [pdf, other]

Multimodal Query-guided Object Localization

Authors: Aditay Tripathi, Rajath R Dani, Anand Mishra, Anirban Chakraborty

Abstract: Consider a scenario in one-shot query-guided object localization where neither an image of the object nor the object category name is available as a query. In such a scenario, a hand-drawn sketch of the object could be a choice for a query. However, hand-drawn crude sketches alone, when used as queries, might be ambiguous for object localization, e.g., a sketch of a laptop could be confused for a sofa. On the other hand, a linguistic definition of the category, e.g., "a small portable computer small enough to use in your lap" along with the sketch query, gives better visual and semantic cues for object localization. In this work, we present a multimodal query-guided object localization approach under the challenging open-set setting. In particular, we use queries from two modalities, namely, hand-drawn sketch and description of the object (also known as gloss), to perform object localization. Multimodal query-guided object localization is a challenging task, especially when a large domain gap exists between the queries and the natural images, as well as due to the challenge of combining the complementary and minimal information present across the queries. For example, hand-drawn crude sketches contain abstract shape information of an object, while the text descriptions often capture partial semantic information about a given object category. To address the aforementioned challenges, we present a novel cross-modal attention scheme that guides the region proposal network to generate object proposals relevant to the input queries and a novel orthogonal projection-based proposal scoring technique that scores each proposal with respect to the queries, thereby yielding the final localization results. ...

Submitted 1 December, 2022; originally announced December 2022.

Comments: Under Review

arXiv:2212.00051 [pdf, other]

doi 10.3847/1538-4357/acd546

Morphological Parameters and Associated Uncertainties for 8 Million Galaxies in the Hyper Suprime-Cam Wide Survey

Authors: Aritra Ghosh, C. Megan Urry, Aayush Mishra, Laurence Perreault-Levasseur, Priyamvada Natarajan, David B. Sanders, Daisuke Nagai, Chuan Tian, Nico Cappelluti, Jeyhan S. Kartaltepe, Meredith C. Powell, Amrit Rau, Ezequiel Treister

Abstract: We use the Galaxy Morphology Posterior Estimation Network (GaMPEN) to estimate morphological parameters and associated uncertainties for $\sim 8$ million galaxies in the Hyper Suprime-Cam (HSC) Wide survey with $z \leq 0.75$ and $m \leq 23$. GaMPEN is a machine learning framework that estimates Bayesian posteriors for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and flux ($F$). By first training on simulations of galaxies and then applying transfer learning using real data, we trained GaMPEN with $<1\%$ of our dataset. This two-step process will be critical for applying machine learning algorithms to future large imaging surveys, such as the Rubin-Legacy Survey of Space and Time (LSST), the Nancy Grace Roman Space Telescope (NGRST), and Euclid. By comparing our results to those obtained using light-profile fitting, we demonstrate that GaMPEN's predicted posterior distributions are well-calibrated ($\lesssim 5\%$ deviation) and accurate. This represents a significant improvement over light profile fitting algorithms which underestimate uncertainties by as much as $\sim60\%$. For an overlapping sub-sample, we also compare the derived morphological parameters with values in two external catalogs and find that the results agree within the limits of uncertainties predicted by GaMPEN. This step also permits us to define an empirical relationship between the Sérsic index and $L_B/L_T$ that can be used to convert between these two parameters. The catalog presented here represents a significant improvement in size ($\sim10 \times $), depth ($\sim4$ magnitudes), and uncertainty quantification over previous state-of-the-art bulge+disk decomposition catalogs. With this work, we also release GaMPEN's source code and trained models, which can be adapted to other datasets.

Submitted 1 March, 2024; v1 submitted 30 November, 2022; originally announced December 2022.

Comments: 39 pages, 31 figures. Published in The Astrophysical Journal. We welcome comments and constructive criticism. Public Data Release at http://gampen.ghosharitra.com/

Journal ref: The Astrophysical Journal 953.2 (2023): 134

arXiv:2211.12950 [pdf, other]

Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

Authors: Soumya Jahagirdar, Shankar Gangisetty, Anand Mishra

Abstract: We present a novel problem of text-based visual question generation or TextVQG in short. Given the recent growing interest of the document image analysis community in combining text understanding with conversational artificial intelligence, e.g., text-based visual question answering, TextVQG becomes an important task. TextVQG aims to generate a natural language question for a given input image and an automatically extracted text also known as OCR token from it such that the OCR token is an answer to the generated question. TextVQG is an essential ability for a conversational agent. However, it is challenging as it requires an in-depth understanding of the scene and the ability to semantically bridge the visual content with the text present in the image. To address TextVQG, we present an OCR consistent visual question generation model that Looks into the visual content, Reads the scene text, and Asks a relevant and meaningful natural language question. We refer to our proposed model as OLRA. We perform an extensive evaluation of OLRA on two public benchmarks and compare them against baselines. Our model OLRA automatically generates questions similar to the public text-based visual question answering datasets that were curated manually. Moreover, we significantly outperform baseline approaches on the performance measures popularly used in text generation literature.

Submitted 23 November, 2022; originally announced November 2022.

arXiv:2211.12926 [pdf, other]

doi 10.1145/3571600.3571625

Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification

Authors: Nakul Sharma, Abhirama S. Penamakuri, Anand Mishra

Abstract: In this paper, we study the problem of identifying logos of business brands in natural scenes in an open-set one-shot setting. This problem setup is significantly more challenging than traditionally-studied 'closed-set' and 'large-scale training samples per category' logo recognition settings. We propose a novel multi-view textual-visual encoding framework that encodes text appearing in the logos as well as the graphical design of the logos to learn robust contrastive representations. These representations are jointly learned for multiple views of logos over a batch and thereby they generalize well to unseen logos. We evaluate our proposed framework for cropped logo verification, cropped logo identification, and end-to-end logo identification in natural scene tasks; and compare it against state-of-the-art methods. Further, the literature lacks a 'very-large-scale' collection of reference logo images that can facilitate the study of one-hundred thousand-scale logo identification. To fill this gap in the literature, we introduce Wikidata Reference Logo Dataset (WiRLD), containing logos for 100K business brands harvested from Wikidata. Our proposed framework achieves an area under the ROC curve of 91.3% on the QMUL-OpenLogo dataset for the verification task, and outperforms state-of-the-art methods by 9.1% and 2.6% on the one-shot logo identification task on the Toplogos-10 and the FlickrLogos32 datasets, respectively. Further, we show that our method is more stable compared to other baselines even when the number of candidate logos is on a 100K scale.

Submitted 23 November, 2022; originally announced November 2022.

Comments: Accepted to ICVGIP 2022

arXiv:2211.10811 [pdf, other]

Nonlinear evolution of magnetorotational instability in a magnetized Taylor-Couette flow: scaling properties and relation to upcoming DRESDYN-MRI experiment

Authors: A. Mishra, G. Mamatsashvili, F. Stefani

Abstract: Magnetorotational instability (MRI) is the most likely mechanism driving angular momentum transport in astrophysical disks. However, despite many efforts, conclusive experimental evidence of MRI is still missing. Recently, performing 1D linear analysis of the standard MRI (SMRI) in a cylindrical Taylor-Couette (TC) flow with an axial magnetic field, we showed that SMRI can be detected in the upcoming DRESDYN-MRI experiment based on a magnetized TC flow of liquid sodium. In this study, also related to DRESDYN-MRI experiments, we focused on the nonlinear evolution and saturation properties of SMRI and analyzed its scaling behavior with respect to the main parameters of the TC flow. We did a detailed analysis over the extensive ranges of magnetic Reynolds number $Rm\in [8.5, 37.1]$, Lundquist number $Lu\in[1.5, 15.5]$ and Reynolds number, $Re\in[10^3, 10^5]$. We considered small magnetic Prandtl numbers, $Pm \ll 1$, down to $Pm\sim 10^{-4}$, aiming at values typical of liquid sodium in the experiments. In the saturated state, the magnetic energy of SMRI and torque due to perturbations on the cylinders, which characterizes angular momentum transport, both increase with $Rm$ for fixed $(Lu, Re)$, while for fixed $(Lu, Rm)$, the magnetic energy decreases and torque increases with increasing $Re$. We studied the scaling of the magnetic energy and torque in the saturated state as a function of $Re$ and find a power law dependence $Re^{-0.6...-0.5}$ for the magnetic energy and $Re^{0.4...0.5}$ for the torque at all $(Lu, Rm)$ and high $Re\geq 4000$. We also explored the dependence on Lundquist number and angular velocity of the cylinders. These scaling laws will be instrumental in the subsequent analysis of more realistic finite-length TC flows and comparison of numerical results with those obtained from the DRESDYN-MRI experiments to unambiguously identify SMRI in the laboratory.

Submitted 27 July, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

Comments: 22 pages, 18 figures, 2 Tables, accepted for publication in Physical Review Fluids

arXiv:2211.05456 [pdf]

Review of Methods for Handling Class-Imbalanced in Classification Problems

Authors: Satyendra Singh Rawat, Amit Kumar Mishra

Abstract: Learning classifiers using skewed or imbalanced datasets can occasionally lead to serious classification issues. In some cases, one class contains the majority of examples while the other, which is frequently the more important class, is nevertheless represented by a smaller proportion of examples. Using this kind of data could make many carefully designed machine-learning systems ineffective. High training fidelity was a term used to describe biases vs. all other instances of the class. The best approach to all possible remedies to this issue is typically to gain from the minority class. The article examines the most widely used methods for addressing the problem of learning with a class imbalance, including data-level, algorithm-level, hybrid, cost-sensitive learning, and deep learning methods, along with their advantages and limitations. The efficiency and performance of the classifier are assessed using a myriad of evaluation metrics.

Submitted 10 November, 2022; originally announced November 2022.

arXiv:2211.01969 [pdf, other]

Grounding Scene Graphs on Natural Images via Visio-Lingual Message Passing

Authors: Aditay Tripathi, Anand Mishra, Anirban Chakraborty

Abstract: This paper presents a framework for jointly grounding objects that follow certain semantic relationship constraints given in a scene graph. A typical natural scene contains several objects, often exhibiting visual relationships of varied complexities between them. These inter-object relationships provide strong contextual cues toward improving grounding performance compared to a traditional object query-only-based localization task. A scene graph is an efficient and structured way to represent all the objects and their semantic relationships in the image. In an attempt towards bridging these two modalities representing scenes and utilizing contextual information for improving object localization, we rigorously study the problem of grounding scene graphs on natural images. To this end, we propose a novel graph neural network-based approach referred to as Visio-Lingual Message PAssing Graph Neural Network (VL-MPAG Net). In VL-MPAG Net, we first construct a directed graph with object proposals as nodes and an edge between a pair of nodes representing a plausible relation between them. Then a three-step inter-graph and intra-graph message passing is performed to learn the context-dependent representation of the proposals and query objects. These object representations are used to score the proposals to generate object localization. The proposed method significantly outperforms the baselines on four public datasets.

Submitted 3 November, 2022; originally announced November 2022.

Journal ref: WACV 2023

arXiv:2210.11047 [pdf, other]

Thwarting Piracy: Anti-debugging Using GPU-assisted Self-healing Codes

Authors: Adhokshaj Mishra, Manjesh Kumar Hanawal

Abstract: Software piracy is one of the concerns in the IT sector. Pirates leverage debugger tools to reverse engineer the logic that verifies the license keys or bypass the entire verification process. Anti-debugging techniques are used to defeat piracy using self-healing codes. However, anti-debugging methods can be defeated when the licensing protections are limited to CPU-based implementation by writing custom codes to deactivate the anti-debugging methods. In the paper, we demonstrate how GPU implementation can prevent pirates from deactivating the anti-debugging methods by using the limitations of debugging on GPU. Generally, GPUs do not support debugging directly on the hardware, and therefore all the debugging is limited to CPU-based emulation. Also, a process running on CPU generally does not have any visibility on codes running on GPU, which comes as an added benefit for our work. We provide an implementation on GPU to show the feasibility of our method. As GPUs are getting widespread with the rise in popularity of gaming software, our technique provides a method to protect against piracy. Our method thwarts any attempts to bypass the license verification step thus offering a better anti-piracy mechanism.

Submitted 20 October, 2022; originally announced October 2022.

arXiv:2210.10137 [pdf, other]

doi 10.1109/XLOOP56614.2022.00006

Testing the data framework for an AI algorithm in preparation for high data rate X-ray facilities

Authors: Hongwei Chen, Sathya R. Chitturi, Rajan Plumley, Lingjia Shen, Nathan C. Drucker, Nicolas Burdet, Cheng Peng, Sougata Mardanya, Daniel Ratner, Aashwin Mishra, Chun Hong Yoon, Sanghoon Song, Matthieu Chollet, Gilberto Fabbris, Mike Dunne, Silke Nelson, Mingda Li, Aaron Lindenberg, Chunjing Jia, Youssef Nashed, Arun Bansil, Sugata Chowdhury, Adrian E. Feiguin, Joshua J. Turner, Jana B. Thayer

Abstract: Next-generation X-ray free electron lasers will be capable of delivering X-rays at a repetition rate approaching 1 MHz continuously. This will require the development of data systems to handle experiments at these types of facilities, especially for high throughput applications, such as femtosecond X-ray crystallography and X-ray photon fluctuation spectroscopy. Here, we demonstrate a framework which captures single shot X-ray data at the LCLS and implements a machine-learning algorithm to automatically extract the contrast parameter from the collected data. We measure the time required to return the results and assess the feasibility of using this framework at high data volume. We use this experiment to determine the feasibility of solutions for `live' data analysis at the MHz repetition rate.

Submitted 18 October, 2022; originally announced October 2022.

Journal ref: 2022 4th Annual Workshop on Extreme-scale Experiment-in-the-Loop Computing (XLOOP) (2022) 1-9

arXiv:2210.09192 [pdf, other]

doi 10.1103/PhysRevD.107.074003

Dirac sea effects on Heavy Quarkonia decay widths in magnetized matter -- a field theoretic model of composite hadrons

Authors: Amruta Mishra, S. P. Misra

Abstract: We study the partial decay widths of charmonium (bottomonium) states to ${\rm D\bar D \; (B\bar B)}$ mesons in magnetized (nuclear) matter using a field theoretical model of composite hadrons with quark (and antiquark) constituents. These are computed from the mass modifications of the decaying and produced mesons within a chiral effective model, including the nucleon Dirac sea effects. The mass modifications of the open charm (bottom) mesons are calculated from their interactions with the nucleons and the scalar mesons, whereas the mass shift of the heavy quarkonium state is obtained from the medium change of a scalar dilaton field, $χ$, which mimics the gluon condensates of QCD. The Dirac sea contributions are observed to lead to a rise (drop) in the quark condensates as the magnetic field is increased, an effect called the (inverse) magnetic catalysis. These effects are observed to be significant and the anomalous magnetic moments (AMMs) of the nucleons are observed to play an important role. For $ρ_B$=0, there is observed to be magnetic catalysis (MC) without and with AMMs, whereas, for $ρ_B=ρ_0$, the inverse magnetic catalysis (IMC) is observed when the AMMs are taken into account, contrary to MC, when the AMMs are ignored. In the presence of a magnetic field, there are also mixings of spin 0 (pseudoscalar) and spin 1 (vector) states (PV mixing) which modify the masses of these mesons. The magnetic field effects on the heavy quarkonium decay widths should have observable consequences on the production of the heavy flavour mesons, which are created in the early stage of ultra-relativistic peripheral heavy ion collisions, at RHIC and LHC, when the produced magnetic fields can still be extremely large.

Submitted 23 August, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

Comments: 42 pages, 7 figures, version published in Phys. Rev. D 107 (2023) 074003

arXiv:2210.08554 [pdf, other]

COFAR: Commonsense and Factual Reasoning in Image Search

Authors: Prajwal Gatti, Abhirama Subramanyam Penamakuri, Revant Teotia, Anand Mishra, Shubhashis Sengupta, Roshni Ramnani

Abstract: One characteristic that makes humans superior to modern artificially intelligent models is the ability to interpret images beyond what is visually apparent. Consider the following two natural language search queries - (i) "a queue of customers patiently waiting to buy ice cream" and (ii) "a queue of tourists going to see a famous Mughal architecture in India." Interpreting these queries requires one to reason with (i) Commonsense such as interpreting people as customers or tourists, actions as waiting to buy or going to see; and (ii) Fact or world knowledge associated with named visual entities, for example, whether the store in the image sells ice cream or whether the landmark in the image is a Mughal architecture located in India. Such reasoning goes beyond just visual recognition. To enable both commonsense and factual reasoning in the image search, we present a unified framework, namely Knowledge Retrieval-Augmented Multimodal Transformer (KRAMT), that treats the named visual entities in an image as a gateway to encyclopedic knowledge and leverages them along with natural language query to ground relevant knowledge. Further, KRAMT seamlessly integrates visual content and grounded knowledge to learn alignment between images and search queries. This unified framework is then used to perform image search requiring commonsense and factual reasoning. The retrieval performance of KRAMT is evaluated and compared with related approaches on a new dataset we introduce - namely COFAR. We make our code and dataset available at https://vl2g.github.io/projects/cofar

Submitted 16 October, 2022; originally announced October 2022.

Comments: Accepted in AACL-IJCNLP 2022

arXiv:2209.13090 [pdf, other]

EEG-based Image Feature Extraction for Visual Classification using Deep Learning

Authors: Alankrit Mishra, Nikhil Raj, Garima Bajwa

Abstract: While capable of segregating visual data, humans take time to examine a single piece, let alone thousands or millions of samples. The deep learning models efficiently process sizeable information with the help of modern-day computing. However, their questionable decision-making process has raised considerable concerns. Recent studies have identified a new approach to extract image features from EEG signals and combine them with standard image features. These approaches make deep learning models more interpretable and also enable faster convergence of models with fewer samples. Inspired by recent studies, we developed an efficient way of encoding EEG signals as images to facilitate a more subtle understanding of brain signals with deep learning models. Using two variations in such encoding methods, we classified the encoded EEG signals corresponding to 39 image classes with a benchmark accuracy of 70% on the layered dataset of six subjects, which is significantly higher than the existing work. Our image classification approach with combined EEG features achieved an accuracy of 82% compared to the slightly better accuracy of a pure deep learning approach; nevertheless, it demonstrates the viability of the theory.

Submitted 26 September, 2022; originally announced September 2022.

Comments: 8 pages, 4 figures, to be published in 2022 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)

arXiv:2209.12291 [pdf, other]

doi 10.1140/epjc/s10052-023-11252-0

Regularized Stable Kerr Black Hole: Cosmic Censorships, Shadow and Quasi-Normal Modes

Authors: Rajes Ghosh, Mostafizur Rahman, Akash K Mishra

Abstract: Black hole solutions in general relativity come with pathologies such as singularity and mass inflation instability, which are believed to be cured by a yet-to-be-found quantum theory of gravity. Without such a consistent description, one may model theory-agnostic phenomenological black holes that bypass the aforesaid issues. These so-called regular black holes are extensively studied in the literature using parameterized modifications over the black hole solutions of general relativity. However, since there exist several ways to model such black holes, it is important to study the consistency and viability of these solutions from both theoretical and observational perspectives. In this work, we consider a recently proposed model of regularized stable rotating black holes having two extra parameters in addition to the mass and spin of a Kerr solution. We start by computing their quasi-normal modes under scalar perturbation and investigate the impact of those additional parameters on black hole stability. In the second part, we study the shadow structures of these regularized black holes and obtain stringent bounds on the parameter space requiring consistency with Event Horizon Telescope observations of $M87^*$ and $Sgr\, A^*$ shadows.

Submitted 11 February, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

Comments: 11 pages, 5 figures, 1 table, Journal Version

Journal ref: Eur. Phys. J. C 83, 91 (2023)

arXiv:2209.09139 [pdf, other]

Machine Learning based Extraction of Boundary Conditions from Doppler Echo Images for Patient Specific Coarctation of the Aorta: Computational Fluid Dynamics Study

Authors: Vincent Milimo Masilokwa Punabantu, Malebogo Ngoepe, Amit Kumar Mishra, Thomas Aldersley, John Lawrenson, Liesl Zuhlke

Abstract: Purpose- Coarctation of the Aorta (CoA) patient-specific computational fluid dynamics (CFD) studies in resource constrained settings are limited by the available imaging modalities for geometry and velocity data acquisition. Doppler echocardiography has been seen as a suitable velocity acquisition modality due to its higher availability and safety. This study aimed to investigate the application of classical machine learning (ML) methods to create an adequate and robust approach for obtaining boundary conditions (BCs) from Doppler Echocardiography images, for haemodynamic modeling using CFD. Methods- Our proposed approach combines ML and CFD to model haemodynamic flow within the region of interest. The key feature of the approach is the use of ML models to calibrate the inlet and outlet boundary conditions (BCs) of the CFD model. The key input variable for the ML model was the patient's heart rate, as this was the parameter that varied in time across the measured vessels within the study. ANSYS Fluent was used for the CFD component of the study whilst the scikit-learn python library was used for the ML component. Results- We validated our approach against a real clinical case of severe CoA before intervention. The maximum coarctation velocity of our simulations was compared to the measured maximum coarctation velocity obtained from the patient whose geometry is used within the study. Of the 5 ML models used to obtain BCs, the top model was within 5% of the measured maximum coarctation velocity. Conclusion- The framework demonstrated that it was capable of taking variations of the patient's heart rate between measurements into account, thus enabling the calculation of BCs that were physiologically realistic when the heart rate was scaled across each vessel whilst providing a reasonably accurate solution.

Submitted 25 November, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

Comments: Article to be submitted to Springer Nature Cardiovascular Engineering and Technology Journal

Showing 101–150 of 641 results for author: Mishra, A