Search | arXiv e-print repository

Machine learning driven high-resolution Raman spectral generation for accurate molecular feature recognition

Authors: Vikas Yadav, Abhay Kumar Tiwari, Soumik Siddhanta

Abstract: Through the probing of light-matter interactions, Raman spectroscopy provides invaluable insights into the composition, structure, and dynamics of materials, and obtaining such data from portable and cheap instruments is of immense practical relevance. Here, we propose the integration of a Generative Adversarial Network (GAN) with low-resolution Raman spectroscopy with a portable hand-held spectro… ▽ More Through the probing of light-matter interactions, Raman spectroscopy provides invaluable insights into the composition, structure, and dynamics of materials, and obtaining such data from portable and cheap instruments is of immense practical relevance. Here, we propose the integration of a Generative Adversarial Network (GAN) with low-resolution Raman spectroscopy with a portable hand-held spectrometer to facilitate concurrent spectral analysis and compound classification. Portable spectrometers generally have a lower resolution, and the Raman signal is usually buried under the background noise. The GAN-based model could not only generate high-resolution data but also reduced the spectral noise significantly. The generated data was used further to train an Artificial Neural Network (ANN)-based model for the classification of organic and pharmaceutical drug molecules. The high-resolution generated Raman data was subsequently used for spectral barcoding for identification of the pharmaceutical drugs. GAN also demonstrated enhanced robustness in extracting weak signals compared to conventional noise removal methods. This integrated system holds the potential for achieving accurate and real-time monitoring of noisy inputs to obtain high throughput output, thereby opening new avenues for applications in different domains. This synergy between spectroscopy and machine learning (ML) facilitates improved data processing, noise reduction, and feature extraction and opens avenues for predictive modeling and automated decision-making using cost-effective portable devices. △ Less

Submitted 25 June, 2024; originally announced July 2024.

Comments: 37 Pages

arXiv:2406.06308 [pdf, other]

Topological structures, dark matter and gravitational waves in $E_6$

Authors: Rinku Maji, Qaisar Shafi, Amit Tiwari

Abstract: We discuss the appearance of topological structures from the spontaneous breaking of $E_6$ to the Standard Model via its maximal subgroup $SO(10) \times U(1)_ψ$. They include dumbbells, metastable strings, as well as domain walls bounded by necklaces. We provide a novel scenario for producing metastable strings based on the symmetry breaking $U(1)_ψ\longrightarrow Z_8 \longrightarrow Z_4$. The met… ▽ More We discuss the appearance of topological structures from the spontaneous breaking of $E_6$ to the Standard Model via its maximal subgroup $SO(10) \times U(1)_ψ$. They include dumbbells, metastable strings, as well as domain walls bounded by necklaces. We provide a novel scenario for producing metastable strings based on the symmetry breaking $U(1)_ψ\longrightarrow Z_8 \longrightarrow Z_4$. The metastable string arises from the merger of $Z_8$ strings that bound a domain wall. An unbroken gauge $Z_2$ symmetry from $SO(10)$ breaking yields viable stable dark matter candidates as well as topologically stable strings. We discuss the gravitational wave emission from two varieties of cosmic strings, namely the superheavy metastable ones and the intermediate scale topologically stable cosmic strings. △ Less

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2406.06088 [pdf]

Stabilizing Solution-Substrate Interaction of Perovskite Ink on PEDOT:PSS for Scalable Blade Coated Narrow Bandgap Perovskite Solar Modules by Gas Quenching

Authors: Severin Siegrist, Johnpaul K. Pious, Huagui Lai, Radha K. Kothandaraman, **cheng Luo, Vitor Vlnieska, Ayodhya N. Tiwari, Fan Fu

Abstract: The development of scalable 1.25 eV mixed Pb-Sn perovskite solar modules by blade coating lags behind Pb-based perovskites due to limited understanding of solution-substrate interaction of the perovskite ink on PEDOT:PSS and subsequent gas quenching. To address this challenge, we systematically studied the wet film deposition and quenching process to better understand narrow bandgap perovskite fil… ▽ More The development of scalable 1.25 eV mixed Pb-Sn perovskite solar modules by blade coating lags behind Pb-based perovskites due to limited understanding of solution-substrate interaction of the perovskite ink on PEDOT:PSS and subsequent gas quenching. To address this challenge, we systematically studied the wet film deposition and quenching process to better understand narrow bandgap perovskite film formation on PEDOT:PSS. We found, the wetting of Pb-Sn perovskite ink on PEDOT:PSS is highly unstable over relevant coating time scales, causing the contact angles to decrease rapidly from 42° to 16° within seconds. This instability leads to localized irregularities in the wet film, resulting in uneven solvent extraction and inhomogeneous nuclei density. As a result, rough perovskite films with voids at the buried interface are obtained. To overcome this problem, we developed a quasi-static wetting process by reducing the blade coating speed, thereby stabilizing the wetting behavior of Pb-Sn perovskite precursor ink on PEDOT:PSS. This optimized process facilitates the deposition of high-quality, void-free Pb-Sn perovskite films with uniform thickness over 8 cm of coating length using moderate (1.4 bar) N2 quenching. We achieved 20 % efficient narrow bandgap perovskite solar cells and mini-modules with 15.8 % active area efficiency on 15.9 cm2. △ Less

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2405.11181 [pdf, other]

Towards Knowledge-Infused Automated Disease Diagnosis Assistant

Authors: Mohit Tomar, Abhisek Tiwari, Sriparna Saha

Abstract: With the advancement of internet communication and telemedicine, people are increasingly turning to the web for various healthcare activities. With an ever-increasing number of diseases and symptoms, diagnosing patients becomes challenging. In this work, we build a diagnosis assistant to assist doctors, which identifies diseases based on patient-doctor interaction. During diagnosis, doctors utiliz… ▽ More With the advancement of internet communication and telemedicine, people are increasingly turning to the web for various healthcare activities. With an ever-increasing number of diseases and symptoms, diagnosing patients becomes challenging. In this work, we build a diagnosis assistant to assist doctors, which identifies diseases based on patient-doctor interaction. During diagnosis, doctors utilize both symptomatology knowledge and diagnostic experience to identify diseases accurately and efficiently. Inspired by this, we investigate the role of medical knowledge in disease diagnosis through doctor-patient interaction. We propose a two-channel, knowledge-infused, discourse-aware disease diagnosis model (KI-DDI), where the first channel encodes patient-doctor communication using a transformer-based encoder, while the other creates an embedding of symptom-disease using a graph attention network (GAT). In the next stage, the conversation and knowledge graph embeddings are infused together and fed to a deep neural network for disease identification. Furthermore, we first develop an empathetic conversational medical corpus comprising conversations between patients and doctors, annotated with intent and symptoms information. The proposed model demonstrates a significant improvement over the existing state-of-the-art models, establishing the crucial roles of (a) a doctor's effort for additional symptom extraction (in addition to patient self-report) and (b) infusing medical knowledge in identifying diseases effectively. Many times, patients also show their medical conditions, which acts as crucial evidence in diagnosis. Therefore, integrating visual sensory information would represent an effective avenue for enhancing the capabilities of diagnostic assistants. △ Less

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.09754 [pdf, other]

Fermionic Non-Invertible Symmetries in (1+1)d: Gapped and Gapless Phases, Transitions, and Symmetry TFTs

Authors: Lakshya Bhardwaj, Kansei Inamura, Apoorv Tiwari

Abstract: We study fermionic non-invertible symmetries in (1+1)d, which are generalized global symmetries that mix fermion parity symmetry with other invertible and non-invertible internal symmetries. Such symmetries are described by fermionic fusion supercategories, which are fusion $π$-supercategories with a choice of fermion parity. The aim of this paper is to flesh out the categorical Landau paradigm fo… ▽ More We study fermionic non-invertible symmetries in (1+1)d, which are generalized global symmetries that mix fermion parity symmetry with other invertible and non-invertible internal symmetries. Such symmetries are described by fermionic fusion supercategories, which are fusion $π$-supercategories with a choice of fermion parity. The aim of this paper is to flesh out the categorical Landau paradigm for fermionic symmetries. We use the formalism of Symmetry Topological Field Theory (SymTFT) to study possible gapped and gapless phases for such symmetries, along with possible deformations between these phases, which are organized into a Hasse phase diagram. The phases can be characterized in terms of sets of condensed, confined and deconfined generalized symmetry charges, reminiscent of notions familiar from superconductivity. Many of the gapless phases also serve as phase transitions between gapped phases. The associated fermionic conformal field theories (CFTs) can be obtained by performing generalized fermionic Kennedy-Tasaki (KT) transformations on bosonic CFTs describing simpler transitions. The fermionic non-invertible symmetries along with their charges and phases discussed here can be obtained from those of bosonic non-invertible symmetries via fermionization or Jordan-Wigner transformation, which is discussed in detail. △ Less

Submitted 15 May, 2024; originally announced May 2024.

Comments: 49 pages

arXiv:2405.05964 [pdf, other]

Lattice Models for Phases and Transitions with Non-Invertible Symmetries

Authors: Lakshya Bhardwaj, Lea E. Bottini, Sakura Schafer-Nameki, Apoorv Tiwari

Abstract: Non-invertible categorical symmetries have emerged as a powerful tool to uncover new beyond-Landau phases of matter, both gapped and gapless, along with second order phase transitions between them. The general theory of such phases in (1+1)d has been studied using the Symmetry Topological Field Theory (SymTFT), also known as topological holography. This has unearthed the infrared (IR) structure of… ▽ More Non-invertible categorical symmetries have emerged as a powerful tool to uncover new beyond-Landau phases of matter, both gapped and gapless, along with second order phase transitions between them. The general theory of such phases in (1+1)d has been studied using the Symmetry Topological Field Theory (SymTFT), also known as topological holography. This has unearthed the infrared (IR) structure of these phases and transitions. In this paper, we describe how the SymTFT information can be converted into an ultraviolet (UV) anyonic chain lattice model realizing in the IR limit these phases and transitions. In many cases, the Hilbert space of the anyonic chain is tensor product decomposable and the model can be realized as a quantum spin-chain Hamiltonian. We also describe operators acting on the lattice models that are charged under non-invertible symmetries and act as order parameters for the phases and transitions. In order to fully describe the action of non-invertible symmetries, it is crucial to understand the symmetry twisted sectors of the lattice models, which we describe in detail. Throughout the paper, we illustrate the general concepts using the symmetry category $\mathsf{Rep}(S_3)$ formed by representations of the permutation group $S_3$, but our procedure can be applied to any fusion category symmetry. △ Less

Submitted 16 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

Comments: 76 pages + appendices; v2: references added

arXiv:2405.05302 [pdf, other]

Illustrating the Categorical Landau Paradigm in Lattice Models

Authors: Lakshya Bhardwaj, Lea E. Bottini, Sakura Schafer-Nameki, Apoorv Tiwari

Abstract: Recent years have seen the concept of global symmetry extended to non-invertible (or categorical) symmetries, for which composition of symmetry generators is not necessarily invertible. Such non-invertible symmetries lead to a generalization of the standard Landau paradigm. In this work we substantiate this framework by providing a (1+1)d lattice model, whose gapped phases and phase transitions ca… ▽ More Recent years have seen the concept of global symmetry extended to non-invertible (or categorical) symmetries, for which composition of symmetry generators is not necessarily invertible. Such non-invertible symmetries lead to a generalization of the standard Landau paradigm. In this work we substantiate this framework by providing a (1+1)d lattice model, whose gapped phases and phase transitions can only be explained by symmetry breaking of non-invertible symmetries. △ Less

Submitted 16 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

Comments: 4.5 pages + appendices, v2: references added

arXiv:2405.03099 [pdf, other]

SketchGPT: Autoregressive Modeling for Sketch Generation and Recognition

Authors: Adarsh Tiwari, Sanket Biswas, Josep Lladós

Abstract: We present SketchGPT, a flexible framework that employs a sequence-to-sequence autoregressive model for sketch generation, and completion, and an interpretation case study for sketch recognition. By map** complex sketches into simplified sequences of abstract primitives, our approach significantly streamlines the input for autoregressive modeling. SketchGPT leverages the next token prediction ob… ▽ More We present SketchGPT, a flexible framework that employs a sequence-to-sequence autoregressive model for sketch generation, and completion, and an interpretation case study for sketch recognition. By map** complex sketches into simplified sequences of abstract primitives, our approach significantly streamlines the input for autoregressive modeling. SketchGPT leverages the next token prediction objective strategy to understand sketch patterns, facilitating the creation and completion of drawings and also categorizing them accurately. This proposed sketch representation strategy aids in overcoming existing challenges of autoregressive modeling for continuous stroke data, enabling smoother model training and competitive performance. Our findings exhibit SketchGPT's capability to generate a diverse variety of drawings by adding both qualitative and quantitative comparisons with existing state-of-the-art, along with a comprehensive human evaluation study. The code and pretrained models will be released on our official GitHub. △ Less

Submitted 5 May, 2024; originally announced May 2024.

Comments: Accepted in ICDAR 2024

arXiv:2403.03276 [pdf, other]

ARNN: Attentive Recurrent Neural Network for Multi-channel EEG Signals to Identify Epileptic Seizures

Authors: Salim Rukhsar, Anil Kumar Tiwari

Abstract: We proposed an Attentive Recurrent Neural Network (ARNN), which recurrently applies attention layers along a sequence and has linear complexity with respect to the sequence length. The proposed model operates on multi-channel EEG signals rather than single channel signals and leverages parallel computation. In this cell, the attention layer is a computational unit that efficiently applies self-att… ▽ More We proposed an Attentive Recurrent Neural Network (ARNN), which recurrently applies attention layers along a sequence and has linear complexity with respect to the sequence length. The proposed model operates on multi-channel EEG signals rather than single channel signals and leverages parallel computation. In this cell, the attention layer is a computational unit that efficiently applies self-attention and cross-attention mechanisms to compute a recurrent function over a wide number of state vectors and input signals. Our architecture is inspired in part by the attention layer and long short-term memory (LSTM) cells, and it uses long-short style gates, but it scales this typical cell up by several orders to parallelize for multi-channel EEG signals. It inherits the advantages of attention layers and LSTM gate while avoiding their respective drawbacks. We evaluated the model effectiveness through extensive experiments with heterogeneous datasets, including the CHB-MIT and UPenn and Mayos Clinic, CHB-MIT datasets. The empirical findings suggest that the ARNN model outperforms baseline methods such as LSTM, Vision Transformer (ViT), Compact Convolution Transformer (CCT), and R-Transformer (RT), showcasing superior performance and faster processing capabilities across a wide range of tasks. The code has been made publicly accessible at \url{https://github.com/Salim-Lysiun/ARNN}. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 9 pages, 7 figures, Journal Paper

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2403.02556 [pdf]

Revealing the EuCd_{2}As_{2} Semiconducting Band Gap via n-type La-Do**

Authors: Ryan A. Nelson, Jesaiah King, Shuyu Cheng, Archibald J. Williams, Christopher Jozwiak, Aaron Bostwick, Eli Rotenberg, Souvik Sasmal, I-Hsuan Kao, Aalok Tiwari, Natalie R. Jones, Chuting Cai, Emma Martin, Andrei Dolocan, Li Shi, Roland Kawakami, Joseph P. Heremans, Jyoti Katoch, Joshua E. Goldberger

Abstract: EuCd_{2}As_{2} has attracted considerable interest as one of the few magnetic Weyl semimetal candidate materials, although recently there have been emerging reports that claim it to have a semiconducting electronic structure. To resolve this debate, we established the growth of n-type EuCd_{2}As_{2} crystals, to directly visualize the nature of the conduction band using angle resolve photoemission… ▽ More EuCd_{2}As_{2} has attracted considerable interest as one of the few magnetic Weyl semimetal candidate materials, although recently there have been emerging reports that claim it to have a semiconducting electronic structure. To resolve this debate, we established the growth of n-type EuCd_{2}As_{2} crystals, to directly visualize the nature of the conduction band using angle resolve photoemission spectroscopy (ARPES). We show that La-do** leads to n-type transport signatures in both the thermopower and Hall effect measurements, in crystals with do** levels at 2 - 6 x 10^{17} e^{-} cm^{-3}. Both p-type and n-type doped samples exhibit antiferromagnetic ordering at 9 K. ARPES experiments at 6 K clearly show the presence of the conduction band minimum at 0.8 eV above the valence band maximum, which is further corroborated by the observation of a 0.71 - 0.72 eV band gap in room temperature diffuse reflectance absorbance measurements. Together these findings unambiguously show that EuCd_{2}As_{2} is indeed a semiconductor with a substantial band gap and not a topological semimetal. △ Less

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2402.17069 [pdf]

Leveraging power of deep learning for fast and efficient elite pixel selection in time series SAR interferometry

Authors: Ashutosh Tiwari, Nitheshnirmal Sadhashivam, Leonard O. Ohenhen, Manoochehr Shirzaei

Abstract: This work proposes an improved convolutional long short-term memory (ConvLSTM) based architecture for selection of elite pixels (i.e., less noisy) in time series interferometric synthetic aperture radar (TS-InSAR). Compared to previous version, the model can process InSAR stacks of variable time steps and select both persistent (PS) and distributed scatterers (DS). We trained the model on ~20,000… ▽ More This work proposes an improved convolutional long short-term memory (ConvLSTM) based architecture for selection of elite pixels (i.e., less noisy) in time series interferometric synthetic aperture radar (TS-InSAR). Compared to previous version, the model can process InSAR stacks of variable time steps and select both persistent (PS) and distributed scatterers (DS). We trained the model on ~20,000 training images (interferograms), each of size 100 by 100 pixels, extracted from InSAR time series interferograms containing both artificial features (buildings and infrastructure) and objects of natural environment (vegetation, forests, barren or agricultural land, water bodies). Based on such categorization, we developed two deep learning models, primarily focusing on urban and coastal sites. Training labels were generated from elite pixel selection outputs generated from the wavelet-based InSAR (WabInSAR) software developed by Shirzaei (2013) and improved in Lee and Shirzaei (2023). With 4 urban and 7 coastal sites used for training and validation, the predicted elite pixel selection maps reveal that the proposed models efficiently learn from WabInSAR-generated labels, reaching a validation accuracy of 94%. The models accurately discard pixels affected by geometric and temporal decorrelation while selecting pixels corresponding to urban objects and those with stable phase history unaffected by temporal and geometric decorrelation. The density of pixels in urban areas is comparable to and higher for coastal areas compared to WabInSAR outputs. With significantly reduced time computation (order of minutes) and improved selection of elite pixels, the proposed models can efficiently process long InSAR time series stacks and generate rapid deformation maps. △ Less

Submitted 26 February, 2024; originally announced February 2024.

arXiv:2402.07627 [pdf]

Unveiling the GeI2-Assisted Oriented Growth of Perovskite Crystallite for High-Performance Flexible Sn Perovskite Solar Cells

Authors: Huagui Lai, Selina Olthof, Shengqiang Ren, Radha K. Kothandaraman, Matthias Diethelm, Quentin Jeangros, Roland Hany, Ayodhya N. Tiwari, Dewei Zhao, Fan Fu

Abstract: Tin perovskites are emerging as promising alternatives to their lead-based counterparts for high-performance and flexible perovskite solar cells (PSCs). However, their rapid crystallization often leads to inadequate film quality and poor device performance. In this study, the role of GeI2 as an additive is investigated for controlling the nucleation and crystallization processes of formamidium tin… ▽ More Tin perovskites are emerging as promising alternatives to their lead-based counterparts for high-performance and flexible perovskite solar cells (PSCs). However, their rapid crystallization often leads to inadequate film quality and poor device performance. In this study, the role of GeI2 as an additive is investigated for controlling the nucleation and crystallization processes of formamidium tin triiodide (FASnI3). The findings reveal the preferential formation of a Ge-rich layer at the bottom of the perovskite film upon the introduction of GeI2. It is proposed that the initial formation of the Ge-complex acts as a crystallization regulator, promoting oriented growth of subsequent FASnI3 crystals and enhancing overall crystallinity. Through the incorporation of an optimal amount of GeI2, flexible Sn PSCs with an efficiency of 10.8% were achieved. Furthermore, it was observed that the GeI2 additive ensures a remarkable shelf-life for the devices, with the rigid cells retaining 91% of their initial performance after more than 13,800 hours of storage in an N2 gas environment. This study elucidates the mechanistic role of GeI2 in regulating the nucleation and crystallization process of tin perovskites, providing valuable insights into the significance of additive engineering for the development of high-performance flexible tin PSCs. △ Less

Submitted 12 February, 2024; originally announced February 2024.

arXiv:2402.01758 [pdf, other]

Aalap: AI Assistant for Legal & Paralegal Functions in India

Authors: Aman Tiwari, Prathamesh Kalamkar, Atreyo Banerjee, Saurabh Karn, Varun Hemachandran, Smita Gupta

Abstract: Using proprietary Large Language Models on legal tasks poses challenges due to data privacy issues, domain data heterogeneity, domain knowledge sophistication, and domain objectives uniqueness. We created Aalalp, a fine-tuned Mistral 7B model on instructions data related to specific Indian legal tasks. The performance of Aalap is better than gpt-3.5-turbo in 31\% of our test data and obtains an eq… ▽ More Using proprietary Large Language Models on legal tasks poses challenges due to data privacy issues, domain data heterogeneity, domain knowledge sophistication, and domain objectives uniqueness. We created Aalalp, a fine-tuned Mistral 7B model on instructions data related to specific Indian legal tasks. The performance of Aalap is better than gpt-3.5-turbo in 31\% of our test data and obtains an equivalent score in 34\% of the test data as evaluated by GPT4. Training Aalap mainly focuses on teaching legal reasoning rather than legal recall. Aalap is definitely helpful for the day-to-day activities of lawyers, judges, or anyone working in legal systems. △ Less

Submitted 30 January, 2024; originally announced February 2024.

arXiv:2401.06807 [pdf, other]

An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant

Authors: Mohit Tomar, Abhisek Tiwari, Tulika Saha, Prince Jha, Sriparna Saha

Abstract: In recent times, there has been an increasing awareness about imminent environmental challenges, resulting in people showing a stronger dedication to taking care of the environment and nurturing green life. The current $19.6 billion indoor gardening industry, reflective of this growing sentiment, not only signifies a monetary value but also speaks of a profound human desire to reconnect with the n… ▽ More In recent times, there has been an increasing awareness about imminent environmental challenges, resulting in people showing a stronger dedication to taking care of the environment and nurturing green life. The current $19.6 billion indoor gardening industry, reflective of this growing sentiment, not only signifies a monetary value but also speaks of a profound human desire to reconnect with the natural world. However, several recent surveys cast a revealing light on the fate of plants within our care, with more than half succumbing primarily due to the silent menace of improper care. Thus, the need for accessible expertise capable of assisting and guiding individuals through the intricacies of plant care has become paramount more than ever. In this work, we make the very first attempt at building a plant care assistant, which aims to assist people with plant(-ing) concerns through conversations. We propose a plant care conversational dataset named Plantational, which contains around 1K dialogues between users and plant care experts. Our end-to-end proposed approach is two-fold : (i) We first benchmark the dataset with the help of various large language models (LLMs) and visual language model (VLM) by studying the impact of instruction tuning (zero-shot and few-shot prompting) and fine-tuning techniques on this task; (ii) finally, we build EcoSage, a multi-modal plant care assisting dialogue generation framework, incorporating an adapter-based modality infusion using a gated mechanism. We performed an extensive examination (both automated and manual evaluation) of the performance exhibited by various LLMs and VLM in the generation of the domain-specific dialogue responses to underscore the respective strengths and weaknesses of these diverse models. △ Less

Submitted 10 January, 2024; originally announced January 2024.

arXiv:2401.05134 [pdf, other]

Yes, this is what I was looking for! Towards Multi-modal Medical Consultation Concern Summary Generation

Authors: Abhisek Tiwari, Shreyangshu Bera, Sriparna Saha, Pushpak Bhattacharyya, Samrat Ghosh

Abstract: Over the past few years, the use of the Internet for healthcare-related tasks has grown by leaps and bounds, posing a challenge in effectively managing and processing information to ensure its efficient utilization. During moments of emotional turmoil and psychological challenges, we frequently turn to the internet as our initial source of support, choosing this over discussing our feelings with o… ▽ More Over the past few years, the use of the Internet for healthcare-related tasks has grown by leaps and bounds, posing a challenge in effectively managing and processing information to ensure its efficient utilization. During moments of emotional turmoil and psychological challenges, we frequently turn to the internet as our initial source of support, choosing this over discussing our feelings with others due to the associated social stigma. In this paper, we propose a new task of multi-modal medical concern summary (MMCS) generation, which provides a short and precise summary of patients' major concerns brought up during the consultation. Nonverbal cues, such as patients' gestures and facial expressions, aid in accurately identifying patients' concerns. Doctors also consider patients' personal information, such as age and gender, in order to describe the medical condition appropriately. Motivated by the potential efficacy of patients' personal context and visual gestures, we propose a transformer-based multi-task, multi-modal intent-recognition, and medical concern summary generation (IR-MMCSG) system. Furthermore, we propose a multitasking framework for intent recognition and medical concern summary generation for doctor-patient consultations. We construct the first multi-modal medical concern summary generation (MM-MediConSummation) corpus, which includes patient-doctor consultations annotated with medical concern summaries, intents, patient personal information, doctor's recommendations, and keywords. Our experiments and analysis demonstrate (a) the significant role of patients' expressions/gestures and their personal information in intent identification and medical concern summary generation, and (b) the strong correlation between intent recognition and patients' medical concern summary generation The dataset and source code are available at https://github.com/NLP-RL/MMCSG. △ Less

Submitted 10 January, 2024; originally announced January 2024.

arXiv:2312.12020 [pdf, ps, other]

Spatial Metric Space for Pattern Recognition Problems

Authors: Q. M. Danish Lohani, Ashutosh Tiwari, Mohd Shoaib Khan

Abstract: The definition of weighted distance measure involves weights. The paper proposes a weighted distance measure without the help of weights. Here, weights are intrinsically added to the measure, and for this, the concept of metric space is generalized based on a novel divided difference operator. The proposed operator is used over a two-dimensional sequence of bounded variation, and it generalizes me… ▽ More The definition of weighted distance measure involves weights. The paper proposes a weighted distance measure without the help of weights. Here, weights are intrinsically added to the measure, and for this, the concept of metric space is generalized based on a novel divided difference operator. The proposed operator is used over a two-dimensional sequence of bounded variation, and it generalizes metric space with the introduction of a multivalued metric space called spatial metric space. The environment considered for the study is a two-dimensional Atanassov intuitionistic fuzzy set (AIFS) under the assumption that membership and non-membership components are its independent variables. The weighted distance measure is proposed as a spatial distance measure in the spatial metric space. The spatial distance measure consists of three branches. In the first branch, there is a domination of membership values, non-membership values dominate the second branch, and the third branch is equidominant. The domination of membership and non-membership values are not in the form of weights in the proposed spatial distance measure, and hence it is a measure independent of weights. The proposed spatial metric space is mathematically studied, and as an implication, the spatial similarity measure is multivalued in nature. The spatial similarity measure can recognize a maximum of three patterns simultaneously. The spatial similarity measure is tested for the pattern recognition problems and the obtained classification results are compared with some other existing similarity measures to show its potency. This study connects the double sequence to the application domain via a divided difference operator for the first time while proposing a novel divided difference operator-based spatial metric space. △ Less

Submitted 19 December, 2023; originally announced December 2023.

Comments: 27

arXiv:2312.10553 [pdf, other]

Machine Learning-Enhanced Prediction of Surface Smoothness for Inertial Confinement Fusion Target Polishing Using Limited Data

Authors: Antonios Alexos, Junze Liu, Akash Tiwari, Kshitij Bhardwaj, Sean Hayes, Pierre Baldi, Satish Bukkapatnam, Suhas Bhandarkar

Abstract: In Inertial Confinement Fusion (ICF) process, roughly a 2mm spherical shell made of high density carbon is used as target for laser beams, which compress and heat it to energy levels needed for high fusion yield. These shells are polished meticulously to meet the standards for a fusion shot. However, the polishing of these shells involves multiple stages, with each stage taking several hours. To m… ▽ More In Inertial Confinement Fusion (ICF) process, roughly a 2mm spherical shell made of high density carbon is used as target for laser beams, which compress and heat it to energy levels needed for high fusion yield. These shells are polished meticulously to meet the standards for a fusion shot. However, the polishing of these shells involves multiple stages, with each stage taking several hours. To make sure that the polishing process is advancing in the right direction, we are able to measure the shell surface roughness. This measurement, however, is very labor-intensive, time-consuming, and requires a human operator. We propose to use machine learning models that can predict surface roughness based on the data collected from a vibration sensor that is connected to the polisher. Such models can generate surface roughness of the shells in real-time, allowing the operator to make any necessary changes to the polishing for optimal result. △ Less

Submitted 16 December, 2023; originally announced December 2023.

Comments: Accepted as Extended Abstract in AIM 2024

arXiv:2311.11662 [pdf, other]

Enhanced Spatio-Temporal Context for Temporally Consistent Robust 3D Human Motion Recovery from Monocular Videos

Authors: Sushovan Chanda, Amogh Tiwari, Lokender Tiwari, Brojeshwar Bhowmick, Avinash Sharma, Hrishav Barua

Abstract: Recovering temporally consistent 3D human body pose, shape and motion from a monocular video is a challenging task due to (self-)occlusions, poor lighting conditions, complex articulated body poses, depth ambiguity, and limited availability of annotated data. Further, doing a simple perframe estimation is insufficient as it leads to jittery and implausible results. In this paper, we propose a nove… ▽ More Recovering temporally consistent 3D human body pose, shape and motion from a monocular video is a challenging task due to (self-)occlusions, poor lighting conditions, complex articulated body poses, depth ambiguity, and limited availability of annotated data. Further, doing a simple perframe estimation is insufficient as it leads to jittery and implausible results. In this paper, we propose a novel method for temporally consistent motion estimation from a monocular video. Instead of using generic ResNet-like features, our method uses a body-aware feature representation and an independent per-frame pose and camera initialization over a temporal window followed by a novel spatio-temporal feature aggregation by using a combination of self-similarity and self-attention over the body-aware features and the perframe initialization. Together, they yield enhanced spatiotemporal context for every frame by considering remaining past and future frames. These features are used to predict the pose and shape parameters of the human body model, which are further refined using an LSTM. Experimental results on the publicly available benchmark data show that our method attains significantly lower acceleration error and outperforms the existing state-of-the-art methods over all key quantitative evaluation metrics, including complex scenarios like partial occlusion, complex poses and even relatively low illumination. △ Less

Submitted 20 November, 2023; originally announced November 2023.

arXiv:2311.05564 [pdf, other]

doi 10.1016/j.physletb.2024.138516

Gravitational wave emission from metastable current-carrying strings in $E_6$

Authors: Adeela Afzal, Qaisar Shafi, Amit Tiwari

Abstract: We discuss $E_6$ based extensions of the Standard Model (SM) containing two varieties of superheavy metastable cosmic strings (CSs) that respectively have neutral and electrically charged current carriers. We employ an extended version of the velocity-dependent one-scale (VOS) model, recently discussed by some authors, to estimate the gravitational wave (GW) spectrum emitted by metastable strings… ▽ More We discuss $E_6$ based extensions of the Standard Model (SM) containing two varieties of superheavy metastable cosmic strings (CSs) that respectively have neutral and electrically charged current carriers. We employ an extended version of the velocity-dependent one-scale (VOS) model, recently discussed by some authors, to estimate the gravitational wave (GW) spectrum emitted by metastable strings with a dimensionless string tension $G μ\approx 10^{-6}$ that carry a right-handed neutrino (RHN) current. We find that with a low to moderate amount of current, the spectrum is compatible with the LIGO O3 run and also consistent at the 1$σ$ level with the recent PTA signals. △ Less

Submitted 9 February, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: We added a paragraph summarizing Section 2 at the end of Section 2 for better clarity

Journal ref: Phys.Lett.B 850 (2024) 138516

arXiv:2310.18115 [pdf, ps, other]

First-order electronic phase transition in $α$-(BEDT-TTF)$_2$I$_3$ revealed by temperature-dependent generalized ellipsometry

Authors: Achyut Tiwari, Bruno Gompf, Dieter Schweitzer, Martin Dressel

Abstract: The nature of correlation-driven metal-insulator transitions remains a longstanding puzzle in solid-state physics. While some theories suggest a second-order character, various experimental observations in these materials indicate first-order phase transitions. Despite considerable progress over the last decades in understanding the underlying driving mechanisms of metal-insulator transitions, in… ▽ More The nature of correlation-driven metal-insulator transitions remains a longstanding puzzle in solid-state physics. While some theories suggest a second-order character, various experimental observations in these materials indicate first-order phase transitions. Despite considerable progress over the last decades in understanding the underlying driving mechanisms of metal-insulator transitions, in particular the phase coexistence remains poorly understood on a microscopic scale. Here, we employ Mueller matrix spectroscopic and temperature-dependent ellipsometry to determine the anisotropic dielectric functions of the two-dimensional charge-transfer salt $α$-(BEDT-TTF)$_2$I$_3$ across its charge-order metal-insulator transition. Our results offer valuable insights into temperature-dependent changes of the dielectric functions along the different crystallographic axes. Furthermore, we apply an effective-medium approximation to quantify the correlation between the metal-to-insulator transition and the volume fraction of the metallic phase embedded within the insulating phase. Through this comprehensive approach, generalized ellipsometry unravels the nature of the correlation-driven metal-insulator transition. △ Less

Submitted 27 October, 2023; originally announced October 2023.

arXiv:2310.16164 [pdf, other]

Conversational Challenges in AI-Powered Data Science: Obstacles, Needs, and Design Opportunities

Authors: Bhavya Chopra, Ananya Singha, Anna Fariha, Sumit Gulwani, Chris Parnin, Ashish Tiwari, Austin Z. Henley

Abstract: Large Language Models (LLMs) are being increasingly employed in data science for tasks like data preprocessing and analytics. However, data scientists encounter substantial obstacles when conversing with LLM-powered chatbots and acting on their suggestions and answers. We conducted a mixed-methods study, including contextual observations, semi-structured interviews (n=14), and a survey (n=114), to… ▽ More Large Language Models (LLMs) are being increasingly employed in data science for tasks like data preprocessing and analytics. However, data scientists encounter substantial obstacles when conversing with LLM-powered chatbots and acting on their suggestions and answers. We conducted a mixed-methods study, including contextual observations, semi-structured interviews (n=14), and a survey (n=114), to identify these challenges. Our findings highlight key issues faced by data scientists, including contextual data retrieval, formulating prompts for complex tasks, adapting generated code to local environments, and refining prompts iteratively. Based on these insights, we propose actionable design recommendations, such as data brushing to support context selection, and inquisitive feedback loops to improve communications with AI-based assistants in data-science tools. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Comments: 24 pages, 8 figures

arXiv:2310.05380 [pdf, other]

Augmented Embeddings for Custom Retrievals

Authors: Anirudh Khatry, Yasharth Bajpai, Priyanshu Gupta, Sumit Gulwani, Ashish Tiwari

Abstract: Information retrieval involves selecting artifacts from a corpus that are most relevant to a given search query. The flavor of retrieval typically used in classical applications can be termed as homogeneous and relaxed, where queries and corpus elements are both natural language (NL) utterances (homogeneous) and the goal is to pick most relevant elements from the corpus in the Top-K, where K is la… ▽ More Information retrieval involves selecting artifacts from a corpus that are most relevant to a given search query. The flavor of retrieval typically used in classical applications can be termed as homogeneous and relaxed, where queries and corpus elements are both natural language (NL) utterances (homogeneous) and the goal is to pick most relevant elements from the corpus in the Top-K, where K is large, such as 10, 25, 50 or even 100 (relaxed). Recently, retrieval is being used extensively in preparing prompts for large language models (LLMs) to enable LLMs to perform targeted tasks. These new applications of retrieval are often heterogeneous and strict -- the queries and the corpus contain different kinds of entities, such as NL and code, and there is a need for improving retrieval at Top-K for small values of K, such as K=1 or 3 or 5. Current dense retrieval techniques based on pretrained embeddings provide a general-purpose and powerful approach for retrieval, but they are oblivious to task-specific notions of similarity of heterogeneous artifacts. We introduce Adapted Dense Retrieval, a mechanism to transform embeddings to enable improved task-specific, heterogeneous and strict retrieval. Adapted Dense Retrieval works by learning a low-rank residual adaptation of the pretrained black-box embedding. We empirically validate our approach by showing improvements over the state-of-the-art general-purpose embeddings-based baseline. △ Less

Submitted 8 October, 2023; originally announced October 2023.

Comments: 14 pages

ACM Class: I.2.6

arXiv:2309.15739 [pdf, other]

doi 10.1145/3583780.3614870

Experience and Evidence are the eyes of an excellent summarizer! Towards Knowledge Infused Multi-modal Clinical Conversation Summarization

Authors: Abhisek Tiwari, Anisha Saha, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar

Abstract: With the advancement of telemedicine, both researchers and medical practitioners are working hand-in-hand to develop various techniques to automate various medical operations, such as diagnosis report generation. In this paper, we first present a multi-modal clinical conversation summary generation task that takes a clinician-patient interaction (both textual and visual information) and generates… ▽ More With the advancement of telemedicine, both researchers and medical practitioners are working hand-in-hand to develop various techniques to automate various medical operations, such as diagnosis report generation. In this paper, we first present a multi-modal clinical conversation summary generation task that takes a clinician-patient interaction (both textual and visual information) and generates a succinct synopsis of the conversation. We propose a knowledge-infused, multi-modal, multi-tasking medical domain identification and clinical conversation summary generation (MM-CliConSummation) framework. It leverages an adapter to infuse knowledge and visual features and unify the fused feature vector using a gated mechanism. Furthermore, we developed a multi-modal, multi-intent clinical conversation summarization corpus annotated with intent, symptom, and summary. The extensive set of experiments, both quantitatively and qualitatively, led to the following findings: (a) critical significance of visuals, (b) more precise and medical entity preserving summary with additional knowledge infusion, and (c) a correlation between medical department identification and clinical synopsis generation. Furthermore, the dataset and source code are available at https://github.com/NLP-RL/MM-CliConSummation. △ Less

Submitted 27 September, 2023; originally announced September 2023.

arXiv:2309.12436 [pdf, other]

Rapidash: Efficient Constraint Discovery via Rapid Verification

Authors: Zifan Liu, Shaleen Deep, Anna Fariha, Fotis Psallidas, Ashish Tiwari, Avrilia Floratou

Abstract: Denial Constraint (DC) is a well-established formalism that captures a wide range of integrity constraints commonly encountered, including candidate keys, functional dependencies, and ordering constraints, among others. Given their significance, there has been considerable research interest in achieving fast verification and discovery of exact DCs within the database community. Despite the signifi… ▽ More Denial Constraint (DC) is a well-established formalism that captures a wide range of integrity constraints commonly encountered, including candidate keys, functional dependencies, and ordering constraints, among others. Given their significance, there has been considerable research interest in achieving fast verification and discovery of exact DCs within the database community. Despite the significant advancements in the field, prior work exhibits notable limitations when confronted with large-scale datasets. The current state-of-the-art exact DC verification algorithm demonstrates a quadratic (worst-case) time complexity relative to the dataset's number of rows. In the context of DC discovery, existing methodologies rely on a two-step algorithm that commences with an expensive data structure-building phase, often requiring hours to complete even for datasets containing only a few million rows. Consequently, users are left without any insights into the DCs that hold on their dataset until this lengthy building phase concludes. In this paper, we introduce Rapidash, a comprehensive framework for DC verification and discovery. Our work makes a dual contribution. First, we establish a connection between orthogonal range search and DC verification. We introduce a novel exact DC verification algorithm that demonstrates near-linear time complexity, representing a theoretical improvement over prior work. Second, we propose an anytime DC discovery algorithm that leverages our novel verification algorithm to gradually provide DCs to users, eliminating the need for the time-intensive building phase observed in prior work. To validate the effectiveness of our algorithms, we conduct extensive evaluations on four large-scale production datasets. Our results reveal that our DC verification algorithm achieves up to 40 times faster performance compared to state-of-the-art approaches. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: comments and suggestions are welcome!

arXiv:2309.05804 [pdf, other]

Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation Metric

Authors: Abhisek Tiwari, Muhammed Sinan, Kaushik Roy, Amit Sheth, Sriparna Saha, Pushpak Bhattacharyya

Abstract: Over the past two decades, dialogue modeling has made significant strides, moving from simple rule-based responses to personalized and persuasive response generation. However, despite these advancements, the objective functions and evaluation metrics for dialogue generation have remained stagnant. These lexical-based metrics, e.g., cross-entropy and BLEU, have two key limitations: (a) word-to-word… ▽ More Over the past two decades, dialogue modeling has made significant strides, moving from simple rule-based responses to personalized and persuasive response generation. However, despite these advancements, the objective functions and evaluation metrics for dialogue generation have remained stagnant. These lexical-based metrics, e.g., cross-entropy and BLEU, have two key limitations: (a) word-to-word matching without semantic consideration: It assigns the same credit for failure to generate "nice" and "rice" for "good", (b) missing context attribute for evaluating the generated response: Even if a generated response is relevant to the ongoing dialogue context, it may still be penalized for not matching the gold utterance provided in the corpus. In this paper, we first investigate these limitations comprehensively and propose a new loss function called Semantic Infused Contextualized diaLogue (SemTextualLogue) loss function. We also formulate an evaluation metric called Dialuation, incorporating both context and semantic relevance. We experimented with both non-pretrained and pre-trained models on two dialogue corpora, encompassing task-oriented and open-domain scenarios. We found that the dialogue generation models trained with SemTextualLogueloss attained superior performance compared to the traditional cross-entropy loss function. The findings establish that the effective training of a dialogue generation model hinges significantly on incorporating semantics and context. This pattern is also mirrored in the introduced Dialuation metric, where the consideration of both context and semantics correlates more strongly with human evaluation compared to traditional metrics. △ Less

Submitted 29 May, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

arXiv:2308.14682 [pdf, other]

Muon $g-2$ and dark matter in Supersymmetric $SU(4)_c \times SU(2)_L \times SU(2)_R$

Authors: Qaisar Shafi, Amit Tiwari, Cem Salih Un

Abstract: The latest FermiLab muon $g-2$ result shows a $5σ$ discrepancy with a ``widely advertised" Standard Model prediction. We consider a supersymmetric $SU(4)_c \times SU(2)_L \times SU(2)_R$ model in which this discrepancy is resolved by including contributions to muon $g-2$ from a relatively light SUSY sector. A variety of realistic coannihilation scenarios can reproduce the observed dark matter reli… ▽ More The latest FermiLab muon $g-2$ result shows a $5σ$ discrepancy with a ``widely advertised" Standard Model prediction. We consider a supersymmetric $SU(4)_c \times SU(2)_L \times SU(2)_R$ model in which this discrepancy is resolved by including contributions to muon $g-2$ from a relatively light SUSY sector. A variety of realistic coannihilation scenarios can reproduce the observed dark matter relic abundance. With a significantly reduced discrepancy, of order $1 σ$ or less, the Higgsino-like dark matter solutions are also viable. We provide benchmark points for these solutions that will be probed in the direct detection dark matter experiments and collider searches. △ Less

Submitted 28 August, 2023; originally announced August 2023.

arXiv:2308.14073 [pdf, other]

Direct Determination of Photonic Stopband Topological Character: A Framework based on Dispersion Measurements

Authors: Nitish Kumar Gupta, Sapireddy Srinivasu, Mukesh Kumar, Anjani Kumar Tiwari, Sudipta Sarkar Pal, Harshawardhan Wanare, S. Anantha Ramakrishna

Abstract: Ascertainment of photonic stopband absolute topological character requires information regarding the Bloch eigenfunction spatial distribution. Consequently, the experimental investigations predominantly restrict themselves to the bulk-boundary correspondence principle and the ensuing emergence of topological surface state. Although capable of establishing the equivalence or inequivalence of bandga… ▽ More Ascertainment of photonic stopband absolute topological character requires information regarding the Bloch eigenfunction spatial distribution. Consequently, the experimental investigations predominantly restrict themselves to the bulk-boundary correspondence principle and the ensuing emergence of topological surface state. Although capable of establishing the equivalence or inequivalence of bandgaps, the determination of their absolute topological identity remains out of its purview. The alternate method of reflection phase-based identification also provides only contentious improvements owing to the measurement complexities pertaining to the interferometric setups. To circumvent these limitations, we resort to the Kramers-Kronig amplitude-phase causality considerations and propose an experimentally conducive method for bandgap topological character determination directly from the parametric reflectance measurements. Particularly, it has been demonstrated that in case of one-dimensional photonic crystals, polarization-resolved dispersion measurements suffice in qualitatively determining bandgap absolute topological identities. By invoking the translational invariance of the investigated samples, we also define a parameter Differential Effective Mass that encapsulates bandgap topological identities and engenders an experimentally discernible bandgap classifier. △ Less

Submitted 27 August, 2023; originally announced August 2023.

arXiv:2308.10995 [pdf, ps, other]

Deep Learning Techniques in Extreme Weather Events: A Review

Authors: Shikha Verma, Kuldeep Srivastava, Akhilesh Tiwari, Shekhar Verma

Abstract: Extreme weather events pose significant challenges, thereby demanding techniques for accurate analysis and precise forecasting to mitigate its impact. In recent years, deep learning techniques have emerged as a promising approach for weather forecasting and understanding the dynamics of extreme weather events. This review aims to provide a comprehensive overview of the state-of-the-art deep learni… ▽ More Extreme weather events pose significant challenges, thereby demanding techniques for accurate analysis and precise forecasting to mitigate its impact. In recent years, deep learning techniques have emerged as a promising approach for weather forecasting and understanding the dynamics of extreme weather events. This review aims to provide a comprehensive overview of the state-of-the-art deep learning in the field. We explore the utilization of deep learning architectures, across various aspects of weather prediction such as thunderstorm, lightning, precipitation, drought, heatwave, cold waves and tropical cyclones. We highlight the potential of deep learning, such as its ability to capture complex patterns and non-linear relationships. Additionally, we discuss the limitations of current approaches and highlight future directions for advancements in the field of meteorology. The insights gained from this systematic review are crucial for the scientific community to make informed decisions and mitigate the impacts of extreme weather events. △ Less

Submitted 18 August, 2023; originally announced August 2023.

arXiv:2308.02225 [pdf, other]

Deep Semantic Model Fusion for Ancient Agricultural Terrace Detection

Authors: Yi Wang, Chenying Liu, Arti Tiwari, Micha Silver, Arnon Karnieli, Xiao Xiang Zhu, Conrad M Albrecht

Abstract: Discovering ancient agricultural terraces in desert regions is important for the monitoring of long-term climate changes on the Earth's surface. However, traditional ground surveys are both costly and limited in scale. With the increasing accessibility of aerial and satellite data, machine learning techniques bear large potential for the automatic detection and recognition of archaeological landsc… ▽ More Discovering ancient agricultural terraces in desert regions is important for the monitoring of long-term climate changes on the Earth's surface. However, traditional ground surveys are both costly and limited in scale. With the increasing accessibility of aerial and satellite data, machine learning techniques bear large potential for the automatic detection and recognition of archaeological landscapes. In this paper, we propose a deep semantic model fusion method for ancient agricultural terrace detection. The input data includes aerial images and LiDAR generated terrain features in the Negev desert. Two deep semantic segmentation models, namely DeepLabv3+ and UNet, with EfficientNet backbone, are trained and fused to provide segmentation maps of ancient terraces and walls. The proposed method won the first prize in the International AI Archaeology Challenge. Codes are available at https://github.com/wangyi111/international-archaeology-ai-challenge. △ Less

Submitted 4 August, 2023; originally announced August 2023.

Comments: IEEE Big Data 2022 workshop on Digital Twins for Accelerated Discovery of Climate & Sustainability Solutions (ADoCS)

arXiv:2308.00743 [pdf, other]

doi 10.21468/SciPostPhys.16.1.022

Lieb-Schultz-Mattis anomalies and web of dualities induced by gauging in quantum spin chains

Authors: Ömer M. Aksoy, Christopher Mudry, Akira Furusaki, Apoorv Tiwari

Abstract: Lieb-Schultz-Mattis (LSM) theorems impose non-perturbative constraints on the zero-temperature phase diagrams of quantum lattice Hamiltonians (always assumed to be local in this paper). LSM theorems have recently been interpreted as the lattice counterparts to mixed 't Hooft anomalies in quantum field theories that arise from a combination of crystalline and global internal symmetry groups. Accord… ▽ More Lieb-Schultz-Mattis (LSM) theorems impose non-perturbative constraints on the zero-temperature phase diagrams of quantum lattice Hamiltonians (always assumed to be local in this paper). LSM theorems have recently been interpreted as the lattice counterparts to mixed 't Hooft anomalies in quantum field theories that arise from a combination of crystalline and global internal symmetry groups. Accordingly, LSM theorems have been reinterpreted as LSM anomalies. In this work, we provide a systematic diagnostic for LSM anomalies in one spatial dimension. We show that gauging subgroups of the global internal symmetry group of a quantum lattice model obeying an LSM anomaly delivers a dual quantum lattice Hamiltonian such that its internal and crystalline symmetries mix non-trivially through a group extension. This mixing of crystalline and internal symmetries after gauging is a direct consequence of the LSM anomaly, i.e., it can be used as a diagnostic of an LSM anomaly. We exemplify this procedure for a quantum spin-1/2 chain obeying an LSM anomaly resulting from combining a global internal $\mathbb{Z}^{\,}_{2}\times\mathbb{Z}^{\,}_{2}$ symmetry with translation or reflection symmetry. We establish a triality of models by gauging a $\mathbb{Z}^{\,}_{2}\subset\mathbb{Z}^{\,}_{2}\times\mathbb{Z}^{\,}_{2}$ symmetry in two ways, one of which amounts to performing a Kramers-Wannier duality, while the other implements a Jordan-Wigner duality. We discuss the map** of the phase diagram of the quantum spin-1/2 $XYZ$ chains under such a triality. We show that the deconfined quantum critical transitions between Neel and dimer orders are mapped to either topological or conventional Landau-Ginzburg transitions. Finally, we extend our results to $\mathbb{Z}^{\,}_{n}$ clock models and provide a reinterpretation of the dual internal symmetries in terms of $\mathbb{Z}^{\,}_{n}$ charge and dipole symmetries. △ Less

Submitted 26 January, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: 88 pages, 6 figures

Journal ref: SciPost Phys. 16, 022 (2024)

arXiv:2308.00705 [pdf]

A Bibliographic Study on Artificial Intelligence Research: Global Panorama and Indian Appearance

Authors: Amit Tiwari, Susmita Bardhan, Vikas Kumar

Abstract: The present study identifies and assesses the bibliographic trend in Artificial Intelligence (AI) research for the years 2015-2020 using the science map** method of bibliometric study. The required data has been collected from the Scopus database. To make the collected data analysis-ready, essential data transformation was performed manually and with the help of a tool viz. OpenRefine. For deter… ▽ More The present study identifies and assesses the bibliographic trend in Artificial Intelligence (AI) research for the years 2015-2020 using the science map** method of bibliometric study. The required data has been collected from the Scopus database. To make the collected data analysis-ready, essential data transformation was performed manually and with the help of a tool viz. OpenRefine. For determining the trend and performing the map** techniques, top five open access and commercial journals of AI have been chosen based on their citescore driven ranking. The work includes 6880 articles published in the specified period for analysis. The trend is based on Country-wise publications, year-wise publications, topical terms in AI, top-cited articles, prominent authors, major institutions, involvement of industries in AI and Indian appearance. The results show that compared to open access journals; commercial journals have a higher citescore and number of articles published over the years. Additionally, IEEE is the prominent publisher which publishes 84% of the top-cited publications. Further, China and the United States are the major contributors to literature in the AI domain. The study reveals that neural networks and deep learning are the major topics included in top AI research publications. Recently, not only public institutions but also private bodies are investing their resources in AI research. The study also investigates the relative position of Indian researchers in terms of AI research. Present work helps in understanding the initial development, current stand and future direction of AI. △ Less

Submitted 4 July, 2023; originally announced August 2023.

Comments: 21 pages, 9 figures, 6 tables

arXiv:2307.01266 [pdf, other]

Symmetry fractionalization, mixed-anomalies and dualities in quantum spin models with generalized symmetries

Authors: Heidar Moradi, Ömer M. Aksoy, Jens H. Bardarson, Apoorv Tiwari

Abstract: We investigate the gauging of higher-form finite Abelian symmetries and their sub-groups in quantum spin models in spatial dimensions $d=2$ and 3. Doing so, we naturally uncover gauged models with dual higher-group symmetries and potential mixed 't Hooft anomalies. We demonstrate that the mixed anomalies manifest as the symmetry fractionalization of higher-form symmetries participating in the mixe… ▽ More We investigate the gauging of higher-form finite Abelian symmetries and their sub-groups in quantum spin models in spatial dimensions $d=2$ and 3. Doing so, we naturally uncover gauged models with dual higher-group symmetries and potential mixed 't Hooft anomalies. We demonstrate that the mixed anomalies manifest as the symmetry fractionalization of higher-form symmetries participating in the mixed anomaly. Gauging is realized as an isomorphism or duality between the bond algebras that generate the space of quantum spin models with the dual generalized symmetry structures. We explore the map** of gapped phases under such gauging related dualities for 0-form and 1-form symmetries in spatial dimension $d=2$ and 3. In $d=2$, these include several non-trivial dualities between short-range entangled gapped phases with 0-form symmetries and 0-form symmetry enriched Higgs and (twisted) deconfined phases of the gauged theory with possible symmetry fractionalizations. Such dualities also imply strong constraints on several unconventional, i.e., deconfined or topological transitions. In $d=3$, among others, we find, dualities between topological orders via gauging of 1-form symmetries. Hamiltonians self-dual under gauging of 1-form symmetries host emergent non-invertible symmetries, realizing higher-categorical generalizations of the Tambara-Yamagami fusion category. △ Less

Submitted 14 September, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: 90 pages, 19 figures

arXiv:2307.00930 [pdf, other]

doi 10.1093/mnras/stad3749

Accelerated binary black holes in globular clusters: forecasts and detectability in the era of space-based gravitational-wave detectors

Authors: Avinash Tiwari, Aditya Vijaykumar, Shasvath J. Kapadia, Giacomo Fragione, Sourav Chatterjee

Abstract: The motion of the center of mass of a coalescing binary black hole (BBH) in a gravitational potential imprints a line-of-sight acceleration (LOSA) onto the emitted gravitational wave (GW) signal. The acceleration could be sufficiently large in dense stellar environments, such as globular clusters (GCs), to be detectable with next-generation space-based detectors. In this work, we use outputs of th… ▽ More The motion of the center of mass of a coalescing binary black hole (BBH) in a gravitational potential imprints a line-of-sight acceleration (LOSA) onto the emitted gravitational wave (GW) signal. The acceleration could be sufficiently large in dense stellar environments, such as globular clusters (GCs), to be detectable with next-generation space-based detectors. In this work, we use outputs of the \textsc{cluster monte carlo (cmc)} simulations of dense star clusters to forecast the distribution of detectable LOSAs in DECIGO and LISA eras. We study the effect of cluster properties -- metallicity, virial and galactocentric radii -- on the distribution of detectable accelerations, account for cosmologically-motivated distributions of cluster formation times, masses, and metallicities, and also incorporate the delay time between the formation of BBHs and their merger in our analysis. We find that larger metallicities provide a larger fraction of detectable accelerations by virtue of a greater abundance of relatively lighter BBHs, which allow a higher number of GW cycles in the detectable frequency band. Conversely, smaller metallicities result in fewer detections, most of which come from relatively more massive BBHs with fewer cycles but larger LOSAs. We similarly find correlations between the virial radii of the clusters and the fractions of detectable accelerations. Our work, therefore, provides an important science case for space-based GW detectors in the context of probing GC properties via the detection of LOSAs of merging BBHs. △ Less

Submitted 31 January, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 527, Issue 3, January 2024, Pages 8586-8597

arXiv:2306.17198 [pdf]

Labour Monitoring in Pregnant Women Using Phonocardiography, Electrocardiography and Electromyography Technique

Authors: Anushka Tiwari

Abstract: Continuous monitoring of fetal and maternal vital signs, particularly during labor, can be critical for the child and mother's health. We present a novel wearable electronic system that measures, in real-time, maternal heart rate using phonocardiography (PCG) and Electrocardiography (ECG). Uterine contractions using electromyography (EMG). When in later stages we employed ECG technique for materna… ▽ More Continuous monitoring of fetal and maternal vital signs, particularly during labor, can be critical for the child and mother's health. We present a novel wearable electronic system that measures, in real-time, maternal heart rate using phonocardiography (PCG) and Electrocardiography (ECG). Uterine contractions using electromyography (EMG). When in later stages we employed ECG technique for maternal heart rate monitoring. The heart rate is determined using moving average filters to remove noises in the signal and ACF(Autocorrelation Function) for determining periodicity. For UC monitoring we stick to the same EMG technique. We also tried employing EMG technique to monitor the Fetal Heart Rate(FHR). But, in later stages of this design, this idea was aborted as we concluded that it needs further research on pregnancy stages and would require more intricate sensor integration that might not be in our reach at the moment. The system is accurate, low-cost, and portable, so it can be deployed at primary healthcare centers in low-income countries. The system can also be used by women in the comfort of their homes. At the same time, the data collected is transferred to their doctor for analysis and diagnosis, which can bring a revolutionary change in the continuous monitoring of fetal wellbeing during labor. △ Less

Submitted 29 June, 2023; originally announced June 2023.

Comments: masters thesis

arXiv:2306.09989 [pdf]

doi 10.1016/j.compbiomed.2022.105624

Ensemble Framework for Cardiovascular Disease Prediction

Authors: Achyut Tiwari, Aryan Chugh, Aman Sharma

Abstract: Heart disease is the major cause of non-communicable and silent death worldwide. Heart diseases or cardiovascular diseases are classified into four types: coronary heart disease, heart failure, congenital heart disease, and cardiomyopathy. It is vital to diagnose heart disease early and accurately in order to avoid further injury and save patients' lives. As a result, we need a system that can pre… ▽ More Heart disease is the major cause of non-communicable and silent death worldwide. Heart diseases or cardiovascular diseases are classified into four types: coronary heart disease, heart failure, congenital heart disease, and cardiomyopathy. It is vital to diagnose heart disease early and accurately in order to avoid further injury and save patients' lives. As a result, we need a system that can predict cardiovascular disease before it becomes a critical situation. Machine learning has piqued the interest of researchers in the field of medical sciences. For heart disease prediction, researchers implement a variety of machine learning methods and approaches. In this work, to the best of our knowledge, we have used the dataset from IEEE Data Port which is one of the online available largest datasets for cardiovascular diseases individuals. The dataset isa combination of Hungarian, Cleveland, Long Beach VA, Switzerland & Statlog datasets with important features such as Maximum Heart Rate Achieved, Serum Cholesterol, Chest Pain Type, Fasting blood sugar, and so on. To assess the efficacy and strength of the developed model, several performance measures are used, such as ROC, AUC curve, specificity, F1-score, sensitivity, MCC, and accuracy. In this study, we have proposed a framework with a stacked ensemble classifier using several machine learning algorithms including ExtraTrees Classifier, Random Forest, XGBoost, and so on. Our proposed framework attained an accuracy of 92.34% which is higher than the existing literature. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Journal ref: Computers in Biology and Medicine Volume 146, July 2022, 105624

arXiv:2305.16493 [pdf, other]

Origin of magnetic anisotropy in $La_{(1\-x)}Sr_{x}MnO_{3}$

Authors: Birendra Kumar, Harish Chandr Chauhan, Ajay Baro, Jyoti Saini, Ankita Tiwari, Mukesh Verma, Yugandhar Bitla, Subhasis Ghosh

Abstract: Here, we report the origin of magnetic anisotropy in Sr-doped infinite layer manganites $La_{(1\-x)}Sr_{x}MnO_{3}$ (0.125 \leq x \leq 0.400). Magnetic anisotropy is responsible for the large difference in the temperature dependence of field-cooled and zero-field-cooled magnetization. Translational symmetry breaking in the context of spins around the boundary between the ferromagnetic (FM) antiferr… ▽ More Here, we report the origin of magnetic anisotropy in Sr-doped infinite layer manganites $La_{(1\-x)}Sr_{x}MnO_{3}$ (0.125 \leq x \leq 0.400). Magnetic anisotropy is responsible for the large difference in the temperature dependence of field-cooled and zero-field-cooled magnetization. Translational symmetry breaking in the context of spins around the boundary between the ferromagnetic (FM) antiferromagnetic (AFM) region leads to FM-AFM interaction and results in magnetic anisotropy (exchange anisotropy). Here, we propose that FM-AFM interaction around the boundary between FM clusters or domains in the AFM background or between AFM clusters or domains in the ferromagnetic background is responsible for do**-dependent nonmonotonic behavior and the origin of magnetic anisotropy. △ Less

Submitted 25 May, 2023; originally announced May 2023.

Comments: 9 pages, 7 figures, and 7 supplementary pages(4 figures)

arXiv:2305.14129 [pdf, other]

GrACE: Generation using Associated Code Edits

Authors: Priyanshu Gupta, Avishree Khare, Yasharth Bajpai, Saikat Chakraborty, Sumit Gulwani, Aditya Kanade, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

Abstract: Developers expend a significant amount of time in editing code for a variety of reasons such as bug fixing or adding new features. Designing effective methods to predict code edits has been an active yet challenging area of research due to the diversity of code edits and the difficulty of capturing the developer intent. In this work, we address these challenges by endowing pre-trained large langua… ▽ More Developers expend a significant amount of time in editing code for a variety of reasons such as bug fixing or adding new features. Designing effective methods to predict code edits has been an active yet challenging area of research due to the diversity of code edits and the difficulty of capturing the developer intent. In this work, we address these challenges by endowing pre-trained large language models (LLMs) of code with the knowledge of prior, relevant edits. The generative capability of the LLMs helps address the diversity in code changes and conditioning code generation on prior edits helps capture the latent developer intent. We evaluate two well-known LLMs, Codex and CodeT5, in zero-shot and fine-tuning settings respectively. In our experiments with two datasets, the knowledge of prior edits boosts the performance of the LLMs significantly and enables them to generate 29% and 54% more correctly edited code in top-1 suggestions relative to the current state-of-the-art symbolic and neural approaches, respectively. △ Less

Submitted 20 September, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.11581 [pdf]

Trustworthy, responsible, ethical AI in manufacturing and supply chains: synthesis and emerging research questions

Authors: Alexandra Brintrup, George Baryannis, Ashutosh Tiwari, Svetan Ratchev, Giovanna Martinez-Arellano, Jatinder Singh

Abstract: While the increased use of AI in the manufacturing sector has been widely noted, there is little understanding on the risks that it may raise in a manufacturing organisation. Although various high level frameworks and definitions have been proposed to consolidate potential risks, practitioners struggle with understanding and implementing them. This lack of understanding exposes manufacturing to… ▽ More While the increased use of AI in the manufacturing sector has been widely noted, there is little understanding on the risks that it may raise in a manufacturing organisation. Although various high level frameworks and definitions have been proposed to consolidate potential risks, practitioners struggle with understanding and implementing them. This lack of understanding exposes manufacturing to a multitude of risks, including the organisation, its workers, as well as suppliers and clients. In this paper, we explore and interpret the applicability of responsible, ethical, and trustworthy AI within the context of manufacturing. We then use a broadened adaptation of a machine learning lifecycle to discuss, through the use of illustrative examples, how each step may result in a given AI trustworthiness concern. We additionally propose a number of research questions to the manufacturing research community, in order to help guide future research so that the economic and societal benefits envisaged by AI in manufacturing are delivered safely and responsibly. △ Less

Submitted 19 May, 2023; originally announced May 2023.

Comments: Pre-print under peer-review

arXiv:2305.07552 [pdf, other]

Dish detection in food platters: A framework for automated diet logging and nutrition management

Authors: Mansi Goel, Shashank Dargar, Shounak Ghatak, Nidhi Verma, Pratik Chauhan, Anushka Gupta, Nikhila Vishnumolakala, Hareesh Amuru, Ekta Gambhir, Ronak Chhajed, Meenal Jain, Astha Jain, Samiksha Garg, Nitesh Narwade, Nikhilesh Verhwani, Abhuday Tiwari, Kirti Vashishtha, Ganesh Bagler

Abstract: Diet is central to the epidemic of lifestyle disorders. Accurate and effortless diet logging is one of the significant bottlenecks for effective diet management and calorie restriction. Dish detection from food platters is a challenging problem due to a visually complex food layout. We present an end-to-end computational framework for diet management, from data compilation, annotation, and state-o… ▽ More Diet is central to the epidemic of lifestyle disorders. Accurate and effortless diet logging is one of the significant bottlenecks for effective diet management and calorie restriction. Dish detection from food platters is a challenging problem due to a visually complex food layout. We present an end-to-end computational framework for diet management, from data compilation, annotation, and state-of-the-art model identification to its mobile app implementation. As a case study, we implement the framework in the context of Indian food platters known for their complex presentation that poses a challenge for the automated detection of dishes. Starting with the 61 most popular Indian dishes, we identify the state-of-the-art model through a comparative analysis of deep-learning-based object detection architectures. Rooted in a meticulous compilation of 68,005 platter images with 134,814 manual dish annotations, we first compare ten architectures for multi-label classification to identify ResNet152 (mAP=84.51%) as the best model. YOLOv8x (mAP=87.70%) emerged as the best model architecture for dish detection among the eight deep-learning models implemented after a thorough performance evaluation. By comparing with the state-of-the-art model for the IndianFood10 dataset, we demonstrate the superior object detection performance of YOLOv8x for this subset and establish Resnet152 as the best architecture for multi-label classification. The models thus trained on richly annotated data can be extended to include dishes from across global cuisines. The proposed framework is demonstrated through a proof-of-concept mobile application with diverse applications for diet logging, food recommendation systems, nutritional interventions, and mitigation of lifestyle disorders. △ Less

Submitted 12 May, 2023; originally announced May 2023.

Comments: 11 pages, 5 figures, 5 tables. Submitted to the 8th International Conference on Computer Vision & Image Processing (CVIP-2023)

ACM Class: I.4.9; I.5.4; J.3

arXiv:2305.04325 [pdf, other]

Lightweight Convolution Transformer for Cross-patient Seizure Detection in Multi-channel EEG Signals

Authors: Salim Rukhsar, Anil K. Tiwari

Abstract: Background: Epilepsy is a neurological illness affecting the brain that makes people more likely to experience frequent, spontaneous seizures. There has to be an accurate automated method for measuring seizure frequency and severity in order to assess the efficacy of pharmacological therapy for epilepsy. The drug quantities are often derived from patient reports which may cause significant issues… ▽ More Background: Epilepsy is a neurological illness affecting the brain that makes people more likely to experience frequent, spontaneous seizures. There has to be an accurate automated method for measuring seizure frequency and severity in order to assess the efficacy of pharmacological therapy for epilepsy. The drug quantities are often derived from patient reports which may cause significant issues owing to inadequate or inaccurate descriptions of seizures and their frequencies. Methods and materials: This study proposes a novel deep learning architecture based lightweight convolution transformer (LCT). The transformer is able to learn spatial and temporal correlated information simultaneously from the multi-channel electroencephalogram (EEG) signal to detect seizures at smaller segment lengths. In the proposed model, the lack of translation equivariance and localization of ViT is reduced using convolution tokenization, and rich information from the transformer encoder is extracted by sequence pooling instead of the learnable class token. Results: Extensive experimental results demonstrate that the proposed model of cross-patient learning can effectively detect seizures from the raw EEG signals. The accuracy and F1-score of seizure detection in the cross-patient case on the CHB-MIT dataset are shown to be 96.31% and 96.32%, respectively, at 0.5 sec segment length. In addition, the performance metrics show that the inclusion of inductive biases and attention-based pooling in the model enhances the performance and reduces the number of transformer encoder layers, which significantly reduces the computational complexity. In this research work, we provided a novel approach to enhance efficiency and simplify the architecture for multi-channel automated seizure detection. △ Less

Submitted 7 May, 2023; originally announced May 2023.

Comments: The paper is under review in Neural Network, Elsevier

arXiv:2305.03916 [pdf, other]

Unifying Pointer Analyses for Polyglot Inter-operations through Summary Specialization

Authors: Jyoti Prakash, Abhishek Tiwari, Christian Hammer

Abstract: Modular analysis of polyglot applications is challenging because heap object flows across language boundaries must be resolved. The state-of-the-art analyses for polyglot applications have two fundamental limitations. First, they assume explicit boundaries between the host and the guest language to determine inter-language dataflows. Second, they rely on specific analyses of the host and guest lan… ▽ More Modular analysis of polyglot applications is challenging because heap object flows across language boundaries must be resolved. The state-of-the-art analyses for polyglot applications have two fundamental limitations. First, they assume explicit boundaries between the host and the guest language to determine inter-language dataflows. Second, they rely on specific analyses of the host and guest languages. The former assumption is impractical concerning recent advancements in polyglot programming techniques, while the latter disregards advances in pointer analysis of the underlying languages. In this work, we propose to extend existing pointer analyses with a novel summary specialization technique so that points-to set across language boundaries can be unified. Our novel technique leverages various combinations of host and guest analyses with minor modifications. We demonstrate the efficacy and generalizability of our approach by evaluating it with two polyglot language models: Java-C communication via Android's NDK and Java-Python communication in GraalVM. △ Less

Submitted 5 May, 2023; originally announced May 2023.

arXiv:2305.01598 [pdf, other]

From Words to Code: Harnessing Data for Program Synthesis from Natural Language

Authors: Anirudh Khatry, Joyce Cahoon, Jordan Henkel, Shaleen Deep, Venkatesh Emani, Avrilia Floratou, Sumit Gulwani, Vu Le, Mohammad Raza, Sherry Shi, Mukul Singh, Ashish Tiwari

Abstract: Creating programs to correctly manipulate data is a difficult task, as the underlying programming languages and APIs can be challenging to learn for many users who are not skilled programmers. Large language models (LLMs) demonstrate remarkable potential for generating code from natural language, but in the data manipulation domain, apart from the natural language (NL) description of the intended… ▽ More Creating programs to correctly manipulate data is a difficult task, as the underlying programming languages and APIs can be challenging to learn for many users who are not skilled programmers. Large language models (LLMs) demonstrate remarkable potential for generating code from natural language, but in the data manipulation domain, apart from the natural language (NL) description of the intended task, we also have the dataset on which the task is to be performed, or the "data context". Existing approaches have utilized data context in a limited way by simply adding relevant information from the input data into the prompts sent to the LLM. In this work, we utilize the available input data to execute the candidate programs generated by the LLMs and gather their outputs. We introduce semantic reranking, a technique to rerank the programs generated by LLMs based on three signals coming the program outputs: (a) semantic filtering and well-formedness based score tuning: do programs even generate well-formed outputs, (b) semantic interleaving: how do the outputs from different candidates compare to each other, and (c) output-based score tuning: how do the outputs compare to outputs predicted for the same task. We provide theoretical justification for semantic interleaving. We also introduce temperature mixing, where we combine samples generated by LLMs using both high and low temperatures. We extensively evaluate our approach in three domains, namely databases (SQL), data science (Pandas) and business intelligence (Excel's Power Query M) on a variety of new and existing benchmarks. We observe substantial gains across domains, with improvements of up to 45% in top-1 accuracy and 34% in top-3 accuracy. △ Less

Submitted 3 May, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

Comments: 14 pages

arXiv:2304.11611 [pdf, other]

doi 10.1109/TPWRS.2023.3283472

Robust Short-term Operation of AC Power Network with Injection Uncertainties

Authors: Anamika Tiwari, Abheejeet Mohapatra, Soumya Ranjan Sahoo

Abstract: With uncertain injections from Renewable Energy Sources (RESs) and loads, deterministic AC Optimal Power Flow (OPF) often fails to provide optimal setpoints of conventional generators. A computationally time-efficient, economical, and robust solution is essential for ACOPF with short-term injection uncertainties. Usually, applying Robust Optimization (RO) for conventional non-linear ACOPF results… ▽ More With uncertain injections from Renewable Energy Sources (RESs) and loads, deterministic AC Optimal Power Flow (OPF) often fails to provide optimal setpoints of conventional generators. A computationally time-efficient, economical, and robust solution is essential for ACOPF with short-term injection uncertainties. Usually, applying Robust Optimization (RO) for conventional non-linear ACOPF results in computationally intractable Robust Counterpart (RC), which is undesirable as ACOPF is an operational problem. Hence, this paper proposes a single-stage non-integer non-recursive RC of ACOPF, using a dual transformation, for short-term injection uncertainties. The proposed RC is convex, tractable, and provides base-point active power generations and terminal voltage magnitudes (setpoints) of conventional generators that satisfy all constraints for all realizations of defined injection uncertainties. The non-linear impact of uncertainties on other variables is inherently modeled without using any affine policy. The proposed approach also includes the budget of uncertainty constraints for low conservatism of the obtained setpoints. Monte-Carlo Simulation (MCS) based participation factored AC power flows validate the robustness of the obtained setpoints on NESTA and case9241pegase systems for different injection uncertainties. Comparison with previous approaches indicates the efficacy of the proposed approach in terms of low operational cost and computation time. △ Less

Submitted 23 April, 2023; originally announced April 2023.

Comments: 16 pages, 5 figures, 5 tables

arXiv:2304.09548 [pdf, other]

SemEval 2023 Task 6: LegalEval - Understanding Legal Texts

Authors: Ashutosh Modi, Prathamesh Kalamkar, Saurabh Karn, Aman Tiwari, Abhinav Joshi, Sai Kiran Tanikella, Shouvik Kumar Guha, Sachin Malhan, Vivek Raghavan

Abstract: In populous countries, pending legal cases have been growing exponentially. There is a need for develo** NLP-based techniques for processing and automatically understanding legal documents. To promote research in the area of Legal NLP we organized the shared task LegalEval - Understanding Legal Texts at SemEval 2023. LegalEval task has three sub-tasks: Task-A (Rhetorical Roles Labeling) is about… ▽ More In populous countries, pending legal cases have been growing exponentially. There is a need for develo** NLP-based techniques for processing and automatically understanding legal documents. To promote research in the area of Legal NLP we organized the shared task LegalEval - Understanding Legal Texts at SemEval 2023. LegalEval task has three sub-tasks: Task-A (Rhetorical Roles Labeling) is about automatically structuring legal documents into semantically coherent units, Task-B (Legal Named Entity Recognition) deals with identifying relevant entities in a legal document and Task-C (Court Judgement Prediction with Explanation) explores the possibility of automatically predicting the outcome of a legal case along with providing an explanation for the prediction. In total 26 teams (approx. 100 participants spread across the world) submitted systems paper. In each of the sub-tasks, the proposed systems outperformed the baselines; however, there is a lot of scope for improvement. This paper describes the tasks, and analyzes techniques proposed by various teams. △ Less

Submitted 1 May, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

Comments: 13 Pages (9 Pages + References), Accepted at SemEval 2023 at ACL 2023

arXiv:2304.01908 [pdf]

Leveraging Deep Learning Approaches for Deepfake Detection: A Review

Authors: Aniruddha Tiwari, Rushit Dave, Mounika Vanamala

Abstract: Conspicuous progression in the field of machine learning and deep learning have led the jump of highly realistic fake media, these media oftentimes referred as deepfakes. Deepfakes are fabricated media which are generated by sophisticated AI that are at times very difficult to set apart from the real media. So far, this media can be uploaded to the various social media platforms, hence advertising… ▽ More Conspicuous progression in the field of machine learning and deep learning have led the jump of highly realistic fake media, these media oftentimes referred as deepfakes. Deepfakes are fabricated media which are generated by sophisticated AI that are at times very difficult to set apart from the real media. So far, this media can be uploaded to the various social media platforms, hence advertising it to the world got easy, calling for an efficacious countermeasure. Thus, one of the optimistic counter steps against deepfake would be deepfake detection. To undertake this threat, researchers in the past have created models to detect deepfakes based on ML/DL techniques like Convolutional Neural Networks. This paper aims to explore different methodologies with an intention to achieve a cost-effective model with a higher accuracy with different types of the datasets, which is to address the generalizability of the dataset. △ Less

Submitted 4 April, 2023; originally announced April 2023.

arXiv:2303.15159 [pdf, other]

doi 10.1007/JHEP05(2023)119

Composite Topological Structures in SO(10)

Authors: George Lazarides, Qaisar Shafi, Amit Tiwari

Abstract: We explore a variety of composite topological structures that arise from the spontaneous breaking of $SO(10)$ to $SU(3)_c \times U(1)_{em}$ via one of its maximal subgroups $SU(5) \times U(1)_χ$, $SU(4)_c \times SU(2)_L \times SU(2)_R$, and $SU(5) \times U(1)_X$ (also known as flipped $SU(5)$). They include i) a network of $\mathbb{Z}$ strings which develop monopoles and turn into necklaces with t… ▽ More We explore a variety of composite topological structures that arise from the spontaneous breaking of $SO(10)$ to $SU(3)_c \times U(1)_{em}$ via one of its maximal subgroups $SU(5) \times U(1)_χ$, $SU(4)_c \times SU(2)_L \times SU(2)_R$, and $SU(5) \times U(1)_X$ (also known as flipped $SU(5)$). They include i) a network of $\mathbb{Z}$ strings which develop monopoles and turn into necklaces with the structure of $\mathbb{Z}_2$ strings, ii) dumbbells connecting two different types of monopoles, or monopoles and antimonpoles, iii) starfish-like configurations, iv) polypole configurations, and v) walls bounded by a necklace. We display these structures both before and after the electroweak breaking. The appearance of these composite structures in the early universe and their astrophysical implications including gravitational wave emission would depend on the symmetry breaking patterns and scales, and the nature of the associated phase transitions. △ Less

Submitted 13 May, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

arXiv:2303.13894 [pdf, ps, other]

Polynomial correspondences expressible as maps of $d$-tuples

Authors: Shrihari Sridharan, Subith G., Atma Ram Tiwari

Abstract: In this paper, we consider polynomial correspondences $f (x, y)$ in $\mathbb{C}[x, y]$ of degree $d \ge 2$ in both the variables and obtain necessary and sufficient conditions in order that the equation $f (x, y) = 0$ can be expressed as $φ(x) = ψ(y)$, where $φ$ and $ψ$ are fractional degree $d$ rational maps in the Riemann sphere. In the absence of involutions that played a vital role towards cha… ▽ More In this paper, we consider polynomial correspondences $f (x, y)$ in $\mathbb{C}[x, y]$ of degree $d \ge 2$ in both the variables and obtain necessary and sufficient conditions in order that the equation $f (x, y) = 0$ can be expressed as $φ(x) = ψ(y)$, where $φ$ and $ψ$ are fractional degree $d$ rational maps in the Riemann sphere. In the absence of involutions that played a vital role towards characterising quadratic correspondences ($d = 2$), we employ certain elementary ideas from theory of equations and matrices to achieve our results. We further explore certain symmetry conditions on the matrix of coefficients of correspondences that satisfy the above factorisation. We conclude this short note with a few examples. △ Less

Submitted 24 March, 2023; originally announced March 2023.

arXiv:2302.11826 [pdf, other]

doi 10.1103/PhysRevD.108.L011502

Regeneration of bottomonia in an open quantum systems approach

Authors: Nora Brambilla, Miguel Ángel Escobedo, Ajaharul Islam, Michael Strickland, Anurag Tiwari, Antonio Vairo, Peter Vander Griend

Abstract: We demonstrate the importance of quantum jumps in the nonequilibrium evolution of bottomonium states in the quark-gluon plasma. Based on nonrelativistic effective field theory and the open quantum system framework, we evolve the density matrix of color singlet and octet pairs. We show that quantum regeneration of singlet states from octet configurations is necessary to understand experimental resu… ▽ More We demonstrate the importance of quantum jumps in the nonequilibrium evolution of bottomonium states in the quark-gluon plasma. Based on nonrelativistic effective field theory and the open quantum system framework, we evolve the density matrix of color singlet and octet pairs. We show that quantum regeneration of singlet states from octet configurations is necessary to understand experimental results for the suppression of both bottomonium ground and excited states. The values of the heavy-quarkonium transport coefficients used are consistent with recent lattice QCD determinations. △ Less

Submitted 8 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

Comments: 5 pages, 7 figures, 1 page supplement; v2 - minor changes, updated references, published version

Report number: TUM-EFT 178/23; FERMILAB-PUB-23-060-V

Journal ref: Phys. Rev. D 108, L011502 (2023)

arXiv:2302.09651 [pdf, other]

doi 10.3847/1538-4357/acd77d

Waltzing binaries: Probing line-of-sight acceleration of merging compact objects with gravitational waves

Authors: Aditya Vijaykumar, Avinash Tiwari, Shasvath J. Kapadia, K. G. Arun, Parameswaran Ajith

Abstract: Line-of-sight acceleration of a compact binary coalescence (CBC) event would modulate the shape of the gravitational waves (GWs) it produces with respect to the corresponding non-accelerated CBC. Such modulations could be indicative of its astrophysical environment. We investigate the prospects of detecting this acceleration in future observing runs of the LIGO-Virgo-KAGRA network, as well as in n… ▽ More Line-of-sight acceleration of a compact binary coalescence (CBC) event would modulate the shape of the gravitational waves (GWs) it produces with respect to the corresponding non-accelerated CBC. Such modulations could be indicative of its astrophysical environment. We investigate the prospects of detecting this acceleration in future observing runs of the LIGO-Virgo-KAGRA network, as well as in next-generation (XG) detectors and the proposed DECIGO. We place the first observational constraints on this acceleration, for putative binary neutron star mergers GW170817 and GW190425. We find no evidence of line-of-sight acceleration in these events at $90\%$ confidence. Prospective constraints for the fifth observing run of the LIGO at A+ sensitivity suggest that accelerations for typical BNSs could be constrained with a precision of $a/c \sim 10^{-7}~[\mathrm{s}^{-1}]$, assuming a signal-to-noise ratio of $10$. These improve to $a/c \sim 10^{-9}~[\mathrm{s}^{-1}]$ in XG detectors, and $a/c \sim 10^{-16}~[\mathrm{s}^{-1}]$ in DECIGO. We also interpret these constraints in the context of mergers around supermassive black holes. △ Less

Submitted 13 July, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

Comments: Accepted to ApJ

Showing 1–50 of 265 results for author: Tiwari, A