-
Dark matter-electron scattering and freeze-in scenarios in the light of $Z^\prime$ mediation
Authors:
Basabendu Barman,
Arindam Das,
Sanjoy Mandal
Abstract:
We investigate dark matter (DM)-electron scattering in a minimal $U(1)_X$ extension of the Standard Model (SM), where the DM can appear as a Majorana fermion, a complex singlet scalar or a Dirac fermion. To study bounds on the $U(1)_X$ gauge coupling $(g_X)$ and new gauge boson mass $(M_{Z^\prime})$, from DM-electron scattering, we consider several direct search experiments like CDMS, DAMIC, SENSE…
▽ More
We investigate dark matter (DM)-electron scattering in a minimal $U(1)_X$ extension of the Standard Model (SM), where the DM can appear as a Majorana fermion, a complex singlet scalar or a Dirac fermion. To study bounds on the $U(1)_X$ gauge coupling $(g_X)$ and new gauge boson mass $(M_{Z^\prime})$, from DM-electron scattering, we consider several direct search experiments like CDMS, DAMIC, SENSEI, PandaX-II, DarkSide-50 and XENON1T-S2 for different $U(1)_X$ charges. In this set-up we consider DM production via freeze-in both in radiation dominated and modified cosmological background to project sensitivities on $g_X-M_{Z^\prime}$ plane satisfying observed relic abundance. DM-electron scattering could provide comparable, or even stronger bounds than those obtained from the electron/ muon $(g-2)$, low energy scattering and intensity frontier experiments within 0.01 GeV $\lesssim M_{Z^\prime} \lesssim$ 0.1 GeV. Constrains from freeze-in could provide stronger sensitivities for $M_{Z^\prime}\gtrsim \mathcal{O}(1)$ GeV, however, these limits are comparable to those obtained from LHCb, LEP experiments for $\mathcal{O}(10)$ GeV $\lesssim M_{Z^\prime} \lesssim 150$ GeV. In future, electron-muon scattering (MUonE), proton (FASER, DUNE) and electron/positron (ILC) beam dump experiments could probe these parameters.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Constant Modulus Waveform Design with Interference Exploitation for DFRC Systems: A Block-Level Optimization Approach
Authors:
Byunghyun Lee,
Anindya Bijoy Das,
David Love,
Christopher Brinton,
James Krogmeier
Abstract:
Dual-function radar-communication (DFRC) is a key enabler of location-based services for next-generation communication systems. In this paper, we investigate the problem of designing constant modulus waveforms for DFRC systems. For high-precision radar sensing, we consider joint optimization of the correlation properties and spatial beam pattern. For communication, we employ constructive interfere…
▽ More
Dual-function radar-communication (DFRC) is a key enabler of location-based services for next-generation communication systems. In this paper, we investigate the problem of designing constant modulus waveforms for DFRC systems. For high-precision radar sensing, we consider joint optimization of the correlation properties and spatial beam pattern. For communication, we employ constructive interference-based block-level precoding (CI-BLP) to leverage distortion induced by multiuser multiple-input multiple-output (MU-MIMO) and radar transmission on a block level. We propose two solution algorithms based on the alternating direction method of multipliers (ADMM) and majorization-minimization (MM) principles, which are effective for small and large block sizes, respectively. The proposed ADMM-based solution decomposes the nonconvex formulated problem into multiple tractable subproblems, each of which admits a closed-form solution. To accelerate convergence of the MM-based solution, we propose an improved majorizing function that leverages a novel diagonal matrix structure. After majorization, we decompose the approximated problem into independent subproblems for parallelization, mitigating the complexity that increases with block size. We then evaluate the performance of the proposed algorithms through a series of numerical experiments. Simulation results demonstrate that the proposed methods can substantially enhance spatial/temporal sidelobe suppression through block-level optimization.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Machine Learning Techniques in Automatic Music Transcription: A Systematic Survey
Authors:
Fatemeh Jamshidi,
Gary Pike,
Amit Das,
Richard Chapman
Abstract:
In the domain of Music Information Retrieval (MIR), Automatic Music Transcription (AMT) emerges as a central challenge, aiming to convert audio signals into symbolic notations like musical notes or sheet music. This systematic review accentuates the pivotal role of AMT in music signal analysis, emphasizing its importance due to the intricate and overlap** spectral structure of musical harmonies.…
▽ More
In the domain of Music Information Retrieval (MIR), Automatic Music Transcription (AMT) emerges as a central challenge, aiming to convert audio signals into symbolic notations like musical notes or sheet music. This systematic review accentuates the pivotal role of AMT in music signal analysis, emphasizing its importance due to the intricate and overlap** spectral structure of musical harmonies. Through a thorough examination of existing machine learning techniques utilized in AMT, we explore the progress and constraints of current models and methodologies. Despite notable advancements, AMT systems have yet to match the accuracy of human experts, largely due to the complexities of musical harmonies and the need for nuanced interpretation. This review critically evaluates both fully automatic and semi-automatic AMT systems, emphasizing the importance of minimal user intervention and examining various methodologies proposed to date. By addressing the limitations of prior techniques and suggesting avenues for improvement, our objective is to steer future research towards fully automated AMT systems capable of accurately and efficiently translating intricate audio signals into precise symbolic representations. This study not only synthesizes the latest advancements but also lays out a road-map for overcoming existing challenges in AMT, providing valuable insights for researchers aiming to narrow the gap between current systems and human-level transcription accuracy.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Effective Generation of Feasible Solutions for Integer Programming via Guided Diffusion
Authors:
Hao Zeng,
Jiaqi Wang,
Avirup Das,
Junying He,
Kunpeng Han,
Haoyuan Hu,
Mingfei Sun
Abstract:
Feasible solutions are crucial for Integer Programming (IP) since they can substantially speed up the solving process. In many applications, similar IP instances often exhibit similar structures and shared solution distributions, which can be potentially modeled by deep learning methods. Unfortunately, existing deep-learning-based algorithms, such as Neural Diving and Predict-and-search framework,…
▽ More
Feasible solutions are crucial for Integer Programming (IP) since they can substantially speed up the solving process. In many applications, similar IP instances often exhibit similar structures and shared solution distributions, which can be potentially modeled by deep learning methods. Unfortunately, existing deep-learning-based algorithms, such as Neural Diving and Predict-and-search framework, are limited to generating only partial feasible solutions, and they must rely on solvers like SCIP and Gurobi to complete the solutions for a given IP problem. In this paper, we propose a novel framework that generates complete feasible solutions end-to-end. Our framework leverages contrastive learning to characterize the relationship between IP instances and solutions, and learns latent embeddings for both IP instances and their solutions. Further, the framework employs diffusion models to learn the distribution of solution embeddings conditioned on IP representations, with a dedicated guided sampling strategy that accounts for both constraints and objectives. We empirically evaluate our framework on four typical datasets of IP problems, and show that it effectively generates complete feasible solutions with a high probability (> 89.7 \%) without the reliance of Solvers and the quality of solutions is comparable to the best heuristic solutions from Gurobi. Furthermore, by integrating our method's sampled partial solutions with the CompleteSol heuristic from SCIP, the resulting feasible solutions outperform those from state-of-the-art methods across all datasets, exhibiting a 3.7 to 33.7\% improvement in the gap to optimal values, and maintaining a feasible ratio of over 99.7\% for all datasets.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Ethical Framework for Responsible Foundational Models in Medical Imaging
Authors:
Abhijit Das,
Debesh Jha,
Jasmer Sanjotra,
Onkar Susladkar,
Suramyaa Sarkar,
Ashish Rauniyar,
Nikhil Tomar,
Vanshali Sharma,
Ulas Bagci
Abstract:
Foundational models (FMs) have tremendous potential to revolutionize medical imaging. However, their deployment in real-world clinical settings demands extensive ethical considerations. This paper aims to highlight the ethical concerns related to FMs and propose a framework to guide their responsible development and implementation within medicine. We meticulously examine ethical issues such as pri…
▽ More
Foundational models (FMs) have tremendous potential to revolutionize medical imaging. However, their deployment in real-world clinical settings demands extensive ethical considerations. This paper aims to highlight the ethical concerns related to FMs and propose a framework to guide their responsible development and implementation within medicine. We meticulously examine ethical issues such as privacy of patient data, bias mitigation, algorithmic transparency, explainability and accountability. The proposed framework is designed to prioritize patient welfare, mitigate potential risks, and foster trust in AI-assisted healthcare.
△ Less
Submitted 13 April, 2024;
originally announced June 2024.
-
Investigating Annotator Bias in Large Language Models for Hate Speech Detection
Authors:
Amit Das,
Zheng Zhang,
Fatemeh Jamshidi,
Vinija Jain,
Aman Chadha,
Nilanjana Raychawdhary,
Mary Sandage,
Lauramarie Pope,
Gerry Dozier,
Cheryl Seals
Abstract:
Data annotation, the practice of assigning descriptive labels to raw data, is pivotal in optimizing the performance of machine learning models. However, it is a resource-intensive process susceptible to biases introduced by annotators. The emergence of sophisticated Large Language Models (LLMs), like ChatGPT presents a unique opportunity to modernize and streamline this complex procedure. While ex…
▽ More
Data annotation, the practice of assigning descriptive labels to raw data, is pivotal in optimizing the performance of machine learning models. However, it is a resource-intensive process susceptible to biases introduced by annotators. The emergence of sophisticated Large Language Models (LLMs), like ChatGPT presents a unique opportunity to modernize and streamline this complex procedure. While existing research extensively evaluates the efficacy of LLMs, as annotators, this paper delves into the biases present in LLMs, specifically GPT 3.5 and GPT 4o when annotating hate speech data. Our research contributes to understanding biases in four key categories: gender, race, religion, and disability. Specifically targeting highly vulnerable groups within these categories, we analyze annotator biases. Furthermore, we conduct a comprehensive examination of potential factors contributing to these biases by scrutinizing the annotated data. We introduce our custom hate speech detection dataset, HateSpeechCorpus, to conduct this research. Additionally, we perform the same experiments on the ETHOS (Mollas et al., 2022) dataset also for comparative analysis. This paper serves as a crucial resource, guiding researchers and practitioners in harnessing the potential of LLMs for dataannotation, thereby fostering advancements in this critical field. The HateSpeechCorpus dataset is available here: https://github.com/AmitDasRup123/HateSpeechCorpus
△ Less
Submitted 18 June, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
Optical Investigations of Coherence and Relaxation Dynamics of a Thulium-doped Yttrium Gallium Garnet Crystal at sub-Kelvin Temperatures for Optical Quantum Memory
Authors:
Antariksha Das,
Mohsen Falamarzi Askarani,
Jacob H. Davidson,
Neil Sinclair,
Joshua A. Slater,
Sara Marzban,
Daniel Oblak,
Charles W. Thiel,
Rufus L. Cone,
Wolfgang Tittel
Abstract:
Rare-earth ion-doped crystals are of great interest for quantum memories, a central component in future quantum repeaters. To assess the promise of 1$\%$ Tm$^{3+}$-doped yttrium gallium garnet (Tm:YGG), we report measurements of optical coherence and energy-level lifetimes of its $^3$H$_6$ $\leftrightarrow$ $^3$H$_4$ transition at a temperature of around 500 mK and various magnetic fields. Using s…
▽ More
Rare-earth ion-doped crystals are of great interest for quantum memories, a central component in future quantum repeaters. To assess the promise of 1$\%$ Tm$^{3+}$-doped yttrium gallium garnet (Tm:YGG), we report measurements of optical coherence and energy-level lifetimes of its $^3$H$_6$ $\leftrightarrow$ $^3$H$_4$ transition at a temperature of around 500 mK and various magnetic fields. Using spectral hole burning, we find hyperfine ground-level (Zeeman level) lifetimes of several minutes at magnetic fields of less than 1000 G. We also measure coherence time exceeding one millisecond using two-pulse photon echoes. Three-pulse photon echo and spectral hole burning measurements reveal that due to spectral diffusion, the effective coherence time reduces to a few $μ$s over a timescale of around two hundred seconds. Finally, temporal and frequency-multiplexed storage of optical pulses using the atomic frequency comb protocol is demonstrated. Our results suggest Tm:YGG to be promising for multiplexed photonic quantum memory for quantum repeaters.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Design,fabrication and characterization of 8x9 n-type silicon pad array for sampling calorimetry
Authors:
Sawan,
G. Tambave,
J. L. Bouly,
O. Bourrion,
T. Chujo,
A. Das,
M. Inaba,
V. K. S. Kashyap,
C. Krug,
R. Laha,
C. Loizides,
B. Mohanty,
M. M. Mondal N. Ponchant,
K. P. Sharma,
R. Singh,
D. Tourres
Abstract:
This paper reports the development and testing of n-type silicon pad array detectors targeted for the Forward Calorimeter (FoCal) detector, which is an upgrade of the ALICE detector at CERN, scheduled for data taking in Run~4~(2029-2034). The FoCal detector includes hadronic and electromagnetic calorimeters, with the latter made of tungsten absorber layers and granular silicon pad arrays read out…
▽ More
This paper reports the development and testing of n-type silicon pad array detectors targeted for the Forward Calorimeter (FoCal) detector, which is an upgrade of the ALICE detector at CERN, scheduled for data taking in Run~4~(2029-2034). The FoCal detector includes hadronic and electromagnetic calorimeters, with the latter made of tungsten absorber layers and granular silicon pad arrays read out using the High Granularity Calorimeter Readout Chip~(HGCROC). This paper covers the Technology Computer-Aided Design (TCAD) simulations, the fabrication process, current versus voltage (IV) and capacitance versus voltage (CV) measurements, test results with a blue LED and $^{90}$Sr beta source, and neutron radiation hardness tests. IV measurements for the detector showed that 90\% of the pads had leakage current below 10~nA at full depletion voltage. Simulations predicted a breakdown voltage of 1000~V and practical tests confirmed stable operation up to 500~V without breakdown. CV measurements in the data and the simulations gave a full depletion voltage of around 50~V at a capacitance of 35~pF. LED tests verified that all detector pads responded correctly. Additionally, the 1$\times$1 cm$^2$ pads were also tested with the neutron radiations at a fluence of $5\times10^{13}$ 1~MeV~n$_{eq}$/cm$^2$.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Apparate: Evading Memory Hierarchy with GodSpeed Wireless-on-Chip
Authors:
Nitesh Narayana GS,
Abhijit Das
Abstract:
The rapid advancements in memory systems, CPU technology, and emerging technologies herald a transformative potential in computing, promising to revolutionize memory hierarchies. Innovations in DDR memory are delivering unprecedented bandwidth, while advancements in on-chip wireless technology are reducing size and increasing speed. The introduction of godspeed wireless transceivers on chip, along…
▽ More
The rapid advancements in memory systems, CPU technology, and emerging technologies herald a transformative potential in computing, promising to revolutionize memory hierarchies. Innovations in DDR memory are delivering unprecedented bandwidth, while advancements in on-chip wireless technology are reducing size and increasing speed. The introduction of godspeed wireless transceivers on chip, alongside near high-speed DRAM, is poised to directly facilitate memory requests. This integration suggests the potential for eliminating traditional memory hierarchies, offering a new paradigm in computing efficiency and speed. These developments indicate a near-future where computing systems are significantly more responsive and powerful, leveraging direct, high-speed memory access mechanisms.
△ Less
Submitted 23 April, 2024;
originally announced June 2024.
-
Grokking Modular Polynomials
Authors:
Darshil Doshi,
Tianyu He,
Aritra Das,
Andrey Gromov
Abstract:
Neural networks readily learn a subset of the modular arithmetic tasks, while failing to generalize on the rest. This limitation remains unmoved by the choice of architecture and training strategies. On the other hand, an analytical solution for the weights of Multi-layer Perceptron (MLP) networks that generalize on the modular addition task is known in the literature. In this work, we (i) extend…
▽ More
Neural networks readily learn a subset of the modular arithmetic tasks, while failing to generalize on the rest. This limitation remains unmoved by the choice of architecture and training strategies. On the other hand, an analytical solution for the weights of Multi-layer Perceptron (MLP) networks that generalize on the modular addition task is known in the literature. In this work, we (i) extend the class of analytical solutions to include modular multiplication as well as modular addition with many terms. Additionally, we show that real networks trained on these datasets learn similar solutions upon generalization (grokking). (ii) We combine these "expert" solutions to construct networks that generalize on arbitrary modular polynomials. (iii) We hypothesize a classification of modular polynomials into learnable and non-learnable via neural networks training; and provide experimental evidence supporting our claims.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks
Authors:
Tianyu He,
Darshil Doshi,
Aritra Das,
Andrey Gromov
Abstract:
Large language models can solve tasks that were not present in the training set. This capability is believed to be due to in-context learning and skill composition. In this work, we study the emergence of in-context learning and skill composition in a collection of modular arithmetic tasks. Specifically, we consider a finite collection of linear modular functions…
▽ More
Large language models can solve tasks that were not present in the training set. This capability is believed to be due to in-context learning and skill composition. In this work, we study the emergence of in-context learning and skill composition in a collection of modular arithmetic tasks. Specifically, we consider a finite collection of linear modular functions $z = a \, x + b \, y \;\mathrm{mod}\; p$ labeled by the vector $(a, b) \in \mathbb{Z}_p^2$. We use some of these tasks for pre-training and the rest for out-of-distribution testing. We empirically show that a GPT-style transformer exhibits a transition from in-distribution to out-of-distribution generalization as the number of pre-training tasks increases. We find that the smallest model capable of out-of-distribution generalization requires two transformer blocks, while for deeper models, the out-of-distribution generalization phase is \emph{transient}, necessitating early stop**. Finally, we perform an interpretability study of the pre-trained models, revealing the highly structured representations in both phases; and discuss the learnt algorithm.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Modal Analysis of Cellular Dynamics in the Morphospace in Epithelial-Mesenchymal Transition
Authors:
Akash Chandra Das,
Debanga Raj Neog,
Biplab Bose
Abstract:
During epithelial-mesenchymal transition (EMT), epithelial cells change their morphology, disperse, and gain mesenchymal-like characteristics. Usually, cells are categorized into discrete cell types or states based on gene expression and other cellular features. Subsequently, EMT is investigated as a dynamical process where cells jump from one discrete state to another. In the current work, we mov…
▽ More
During epithelial-mesenchymal transition (EMT), epithelial cells change their morphology, disperse, and gain mesenchymal-like characteristics. Usually, cells are categorized into discrete cell types or states based on gene expression and other cellular features. Subsequently, EMT is investigated as a dynamical process where cells jump from one discrete state to another. In the current work, we moved away from this idea of discrete state transition and investigated EMT dynamics in a continuous phenotypic space. We used morphology to define the phenotype of a cell. We used the data from quantitative image analysis of MDA-MB-468 cells undergoing EGF-induced EMT. We defined the morphological state space or 'morphospace' using the morphological features extracted through image analysis. During EMT, as the morphology changed, the distribution of cells in the morphospace also changed. However, this morphospace had a very high dimension. We reduced it to a 2-dimensional "reduced morphospace" and investigated the temporal change in the spatial distribution of cells in this reduced space. We used proper orthogonal decomposition to find dominant dynamical features of this spatio-temporal data. The modal analysis detected key features of EMT in this experimental system - reversible transition, distinct paths of phenotypic transition during induction and reversal of EMT, and enhanced diversity of cells during reversal of EMT. We also provide some intuitive physical meaning of the spatial modes and connect them to the key molecular event during EMT.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
MDIW-13: a New Multi-Lingual and Multi-Script Database and Benchmark for Script Identification
Authors:
Miguel A. Ferrer,
Abhijit Das,
Moises Diaz,
Aythami Morales,
Cristina Carmona-Duarte,
Umapada Pal
Abstract:
Script identification plays a vital role in applications that involve handwriting and document analysis within a multi-script and multi-lingual environment. Moreover, it exhibits a profound connection with human cognition. This paper provides a new database for benchmarking script identification algorithms, which contains both printed and handwritten documents collected from a wide variety of scri…
▽ More
Script identification plays a vital role in applications that involve handwriting and document analysis within a multi-script and multi-lingual environment. Moreover, it exhibits a profound connection with human cognition. This paper provides a new database for benchmarking script identification algorithms, which contains both printed and handwritten documents collected from a wide variety of scripts, such as Arabic, Bengali (Bangla), Gujarati, Gurmukhi, Devanagari, Japanese, Kannada, Malayalam, Oriya, Roman, Tamil, Telugu, and Thai. The dataset consists of 1,135 documents scanned from local newspaper and handwritten letters as well as notes from different native writers. Further, these documents are segmented into lines and words, comprising a total of 13,979 and 86,655 lines and words, respectively, in the dataset. Easy-to-go benchmarks are proposed with handcrafted and deep learning methods. The benchmark includes results at the document, line, and word levels with printed and handwritten documents. Results of script identification independent of the document/line/word level and independent of the printed/handwritten letters are also given. The new multi-lingual database is expected to create new script identifiers, present various challenges, including identifying handwritten and printed samples and serve as a foundation for future research in script identification based on the reported results of the three benchmarks.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Effect of Ni substitution on the fragile magnetic system ${\text{La}_{5}\text{Co}_{2}\text {Ge}_{3}}$
Authors:
Atreyee Das,
Tyler J. Slade,
Rustem Khasanov,
Sergey L. Bud'ko,
Paul C. Canfield
Abstract:
$\text{La}_{5}\text{Co}_{2}\text{Ge}_{3}$ is an itinerant ferromagnet with a Curie temperature, $T_C$, of $\sim$ 3.8 K and a remarkably small saturated moment of 0.1 $μ_{B}/\text{Co}$. Here we present the growth and characterization of single crystals of the ${\text{La}_{5}\text{(Co}_{1-x}\text {Ni}_{x})_2\text {Ge}_{3}}$ series for 0.00 $\leq x \leq…
▽ More
$\text{La}_{5}\text{Co}_{2}\text{Ge}_{3}$ is an itinerant ferromagnet with a Curie temperature, $T_C$, of $\sim$ 3.8 K and a remarkably small saturated moment of 0.1 $μ_{B}/\text{Co}$. Here we present the growth and characterization of single crystals of the ${\text{La}_{5}\text{(Co}_{1-x}\text {Ni}_{x})_2\text {Ge}_{3}}$ series for 0.00 $\leq x \leq$ 0.186. We measured powder X-ray diffraction, composition as well as anisotropic temperature dependent resistivity, temperature and field dependent magnetization along with heat capacity on these single crystals. We also measured muon-spin rotation/relaxation ($μ\text{SR}$) for some Ni substitutions ($x$ = 0.027, 0.036, 0.074) to study the evolution of internal field with Ni substitution. Using the measured data we infer a low temperature, transition temperature-composition phase diagram for ${\text{La}_{5}\text{(Co}_{1-x}\text {Ni}_{x})_2\text {Ge}_{3}}$. We find that $T_{C}$ is suppressed for low do**s, $x \leq 0.014 $; whereas for $0.036 \leq {x} \leq 0.186 $, the samples are antiferromagnetic with a Neel temperature, $T_{N}$, that goes through a weak and shallow maximum ($T_N \sim$ 3.4 K for $ x \sim$ 0.07) and then gradually decreases to 2.4 K by $x$ = 0.186. For intermediate Ni substitutions, $0.016 \leq {x} \leq 0.027 $, two transition temperatures are inferred with $T_N > T_C$. Whereas the $T-x$ phase diagram for ${\text{La}_{5}\text{(Co}_{1-x}\text {Ni}_{x})_2\text {Ge}_{3}}$ and the $T-p$ phase diagram determined for the parent $\text{La}_{5}\text{Co}_{2}\text{Ge}_{3}$ under hydrostatic pressure are grossly similar, changing from a low do** or low pressure ferromagnetic (FM) ground state to a high doped or pressure antiferromagnetic (AFM) state, perturbation by Ni substitution enabled us to identify an intermediate do** regime where both FM and AFM transitions occur.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Generation of mega-gauss axial and azimuthal magnetic fields in a solid plasma by ultrahigh intensity, circularly polarised femtosecond laser pulses
Authors:
Anandam Choudhary,
Laxman Prasad Goswami,
C. Aparajit,
Amit D. Lad,
Ameya Parab,
Yash M. Ved,
Amita Das,
G. Ravindra Kumar
Abstract:
The interaction of intense linearly polarized femtosecond laser pulses with solids is known to generate azimuthal magnetic fields, while circularly polarized light has been shown to create axial fields. We demonstrate through experiments and particle-in-cell simulations that circularly polarized light can generate both axial and azimuthal fields of comparable magnitude in a plasma created in a sol…
▽ More
The interaction of intense linearly polarized femtosecond laser pulses with solids is known to generate azimuthal magnetic fields, while circularly polarized light has been shown to create axial fields. We demonstrate through experiments and particle-in-cell simulations that circularly polarized light can generate both axial and azimuthal fields of comparable magnitude in a plasma created in a solid. Angular distributions of the generated fast electrons at target front and rear show significant differences between the results for the two polarization states, with circular polarization enforcing more axial confinement. The measurement of the spatial distribution of both types of magnetic fields captures their turbulent evolution.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Symmetry and symmetry-breaking in soil pores and climate change mitigation: What fractal geometry can tell us?
Authors:
Abhijeet Das
Abstract:
Soil is a critical component of terrestrial ecosystems, directly influencing global biogeochemical cycles. Despite its importance, the complex architecture of soil pores and their impact on greenhouse gas emissions remain poorly understood. This perspective aims to address this gap by applying symmetry and symmetry-breaking concepts through fractal geometry to elucidate the structural and function…
▽ More
Soil is a critical component of terrestrial ecosystems, directly influencing global biogeochemical cycles. Despite its importance, the complex architecture of soil pores and their impact on greenhouse gas emissions remain poorly understood. This perspective aims to address this gap by applying symmetry and symmetry-breaking concepts through fractal geometry to elucidate the structural and functional complexities of soil pores. We highlight how fractal parameters can quantify the self-similar nature of soil pore structures, revealing their size, shape, and connectivity. These geometric attributes influence soil properties such as permeability and diffusivity, which are essential for understanding gas exchange and microbial activity within the soil matrix. Furthermore, we emphasize the effects of various land management practices, including tillage and wetting-drying cycles, on soil pore complexity using three-dimensional multi-fractal analysis. Literature indicates that different agricultural practices significantly alter pore heterogeneity and connectivity, affecting greenhouse gas emissions. Conventional tillage decreases pore connectivity and increases randomness, whereas no-tillage preserves larger, more complex pore structures. We propose that integrating combinatorial, geometric, and functional symmetry concepts offers a comprehensive framework for examining the structure-property-function relationships in soil. This novel approach could enhance our understanding of soil's role in the global cycle of greenhouse gases and provide insights into sustainable land management practices aimed at mitigating climate change.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery
Authors:
Runlong He,
Mengya Xu,
Adrito Das,
Danyal Z. Khan,
Sophia Bano,
Hani J. Marcus,
Danail Stoyanov,
Matthew J. Clarkson,
Mobarakol Islam
Abstract:
Visual Question Answering (VQA) within the surgical domain, utilizing Large Language Models (LLMs), offers a distinct opportunity to improve intra-operative decision-making and facilitate intuitive surgeon-AI interaction. However, the development of LLMs for surgical VQA is hindered by the scarcity of diverse and extensive datasets with complex reasoning tasks. Moreover, contextual fusion of the i…
▽ More
Visual Question Answering (VQA) within the surgical domain, utilizing Large Language Models (LLMs), offers a distinct opportunity to improve intra-operative decision-making and facilitate intuitive surgeon-AI interaction. However, the development of LLMs for surgical VQA is hindered by the scarcity of diverse and extensive datasets with complex reasoning tasks. Moreover, contextual fusion of the image and text modalities remains an open research challenge due to the inherent differences between these two types of information and the complexity involved in aligning them. This paper introduces PitVQA, a novel dataset specifically designed for VQA in endonasal pituitary surgery and PitVQA-Net, an adaptation of the GPT2 with a novel image-grounded text embedding for surgical VQA. PitVQA comprises 25 procedural videos and a rich collection of question-answer pairs spanning crucial surgical aspects such as phase and step recognition, context understanding, tool detection and localization, and tool-tissue interactions. PitVQA-Net consists of a novel image-grounded text embedding that projects image and text features into a shared embedding space and GPT2 Backbone with an excitation block classification head to generate contextually relevant answers within the complex domain of endonasal pituitary surgery. Our image-grounded text embedding leverages joint embedding, cross-attention and contextual representation to understand the contextual relationship between questions and surgical images. We demonstrate the effectiveness of PitVQA-Net on both the PitVQA and the publicly available EndoVis18-VQA dataset, achieving improvements in balanced accuracy of 8% and 9% over the most recent baselines, respectively. Our code and dataset is available at https://github.com/mobarakol/PitVQA.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
The Narrow Depth and Breadth of Corporate Responsible AI Research
Authors:
Nur Ahmed,
Amit Das,
Kirsten Martin,
Kawshik Banerjee
Abstract:
The transformative potential of AI presents remarkable opportunities, but also significant risks, underscoring the importance of responsible AI development and deployment. Despite a growing emphasis on this area, there is limited understanding of industry's engagement in responsible AI research, i.e., the critical examination of AI's ethical, social, and legal dimensions. To address this gap, we a…
▽ More
The transformative potential of AI presents remarkable opportunities, but also significant risks, underscoring the importance of responsible AI development and deployment. Despite a growing emphasis on this area, there is limited understanding of industry's engagement in responsible AI research, i.e., the critical examination of AI's ethical, social, and legal dimensions. To address this gap, we analyzed over 6 million peer-reviewed articles and 32 million patent citations using multiple methods across five distinct datasets to quantify industry's engagement. Our findings reveal that the majority of AI firms show limited or no engagement in this critical subfield of AI. We show a stark disparity between industry's dominant presence in conventional AI research and its limited engagement in responsible AI. Leading AI firms exhibit significantly lower output in responsible AI research compared to their conventional AI research and the contributions of leading academic institutions. Our linguistic analysis documents a narrower scope of responsible AI research within industry, with a lack of diversity in key topics addressed. Our large-scale patent citation analysis uncovers a pronounced disconnect between responsible AI research and the commercialization of AI technologies, suggesting that industry patents rarely build upon insights generated by the responsible AI literature. This gap highlights the potential for AI development to diverge from a socially optimal path, risking unintended consequences due to insufficient consideration of ethical and societal implications. Our results highlight the urgent need for industry to publicly engage in responsible AI research to absorb academic knowledge, cultivate public trust, and proactively mitigate AI-induced societal harms.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Testing neutrino mass hierarchy under type-II seesaw scenario in $U(1)_X$ at colliders
Authors:
Arindam Das,
Puja Das,
Nobuchika Okada
Abstract:
The origin of tiny neutrino mass is a long standing unsolved puzzle of the Standard Model (SM) which allows us to consider scenarios beyond the Standard Model (BSM) in a variety of ways. One of them being the gauge extension of the SM could be realized as in the form of an anomaly free, general $U(1)_X$ extension of the SM where an $SU(2)_L$ triplet scalar being charged under $U(1)_X$ gauge group…
▽ More
The origin of tiny neutrino mass is a long standing unsolved puzzle of the Standard Model (SM) which allows us to consider scenarios beyond the Standard Model (BSM) in a variety of ways. One of them being the gauge extension of the SM could be realized as in the form of an anomaly free, general $U(1)_X$ extension of the SM where an $SU(2)_L$ triplet scalar being charged under $U(1)_X$ gauge group is introduced through a Dirac Yukawa coupling with the SM lepton doublet. Once the triplet scalar generates VEV, light neutrinos could acquire tiny Majorana mass and hence affecting the decay modes of the triplet scalar involving the neutrino oscillation data for different neutrino mass hierarchies. After the breaking of $U(1)_X$ scenarios, a neutral BSM, neutral gauge boson $(Z^\prime)$ acquires mass which interact differently with the left and right handed fermions. Satisfying the recent LHC bounds on the triplet scalar and $Z^\prime$ production, we study the pair production of the triplet scalar at LHC, $e^-e^+$ and $μ^-μ^+$ colliders followed by its decay into dominant mode depending on the neutrino mass hierarchy. Generating the SM generic backgrounds, we study the possible signal significance of four lepton final states. We also compare our results with the purely SM gauge mediated triplet scalar pair production followed by four lepton final states which could be significant only in $μ^- μ^+$ collider.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Real Time Monitoring and Forecasting of COVID 19 Cases using an Adjusted Holt based Hybrid Model embedded with Wavelet based ANN
Authors:
Agniva Das,
Kunnummal Muralidharan
Abstract:
Since the inception of the SARS - CoV - 2 (COVID - 19) novel coronavirus, a lot of time and effort is being allocated to estimate the trajectory and possibly, forecast with a reasonable degree of accuracy, the number of cases, recoveries, and deaths due to the same. The model proposed in this paper is a mindful step in the same direction. The primary model in question is a Hybrid Holt's Model embe…
▽ More
Since the inception of the SARS - CoV - 2 (COVID - 19) novel coronavirus, a lot of time and effort is being allocated to estimate the trajectory and possibly, forecast with a reasonable degree of accuracy, the number of cases, recoveries, and deaths due to the same. The model proposed in this paper is a mindful step in the same direction. The primary model in question is a Hybrid Holt's Model embedded with a Wavelet-based ANN. To test its forecasting ability, we have compared three separate models, the first, being a simple ARIMA model, the second, also an ARIMA model with a wavelet-based function, and the third, being the proposed model. We have also compared the forecast accuracy of this model with that of a modern day Vanilla LSTM recurrent neural network model. We have tested the proposed model on the number of confirmed cases (daily) for the entire country as well as 6 hotspot states. We have also proposed a simple adjustment algorithm in addition to the hybrid model so that daily and/or weekly forecasts can be meted out, with respect to the entirety of the country, as well as a moving window performance metric based on out-of-sample forecasts. In order to have a more rounded approach to the analysis of COVID-19 dynamics, focus has also been given to the estimation of the Basic Reproduction Number, $R_0$ using a compartmental epidemiological model (SIR). Lastly, we have also given substantial attention to estimating the shelf-life of the proposed model. It is obvious yet noteworthy how an accurate model, in this regard, can ensure better allocation of healthcare resources, as well as, enable the government to take necessary measures ahead of time.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
A Systematic Review and Meta-Analysis on Sleep Stage Classification and Sleep Disorder Detection Using Artificial Intelligence
Authors:
Tayab Uddin Wara,
Ababil Hossain Fahad,
Adri Shankar Das,
Md. Mehedi Hasan Shawon
Abstract:
Sleep is vital for people's physical and mental health, and sound sleep can help them focus on daily activities. Therefore, a sleep study that includes sleep patterns and disorders is crucial to enhancing our knowledge about individuals' health status. The findings on sleep stages and sleep disorders relied on polysomnography and self-report measures, and then the study went through clinical asses…
▽ More
Sleep is vital for people's physical and mental health, and sound sleep can help them focus on daily activities. Therefore, a sleep study that includes sleep patterns and disorders is crucial to enhancing our knowledge about individuals' health status. The findings on sleep stages and sleep disorders relied on polysomnography and self-report measures, and then the study went through clinical assessments by expert physicians. However, the evaluation process of sleep stage classification and sleep disorder has become more convenient with artificial intelligence applications and numerous investigations focusing on various datasets with advanced algorithms and techniques that offer improved computational ease and accuracy. This study aims to provide a comprehensive, systematic review and meta-analysis of the recent literature to analyze the different approaches and their outcomes in sleep studies, which includes works on sleep stages classification and sleep disorder detection using AI. In this review, 183 articles were initially selected from different journals, among which 80 records were enlisted for explicit review, ranging from 2016 to 2023. Brain waves were the most commonly employed body parameters for sleep staging and disorder studies. The convolutional neural network, the most widely used of the 34 distinct artificial intelligence models, comprised 27%. The other models included the long short-term memory, support vector machine, random forest, and recurrent neural network, which consisted of 11%, 6%, 6%, and 5% sequentially. For performance metrics, accuracy was widely used for a maximum of 83.75% of the cases, the F1 score of 45%, Kappa of 36.25%, Sensitivity of 31.25%, and Specificity of 30% of cases, along with the other metrics. This article would help physicians and researchers get the gist of AI's contribution to sleep studies and the feasibility of their intended work.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Holevo Cramér-Rao bound: How close can we get without entangling measurements?
Authors:
Aritra Das,
Lorcán O. Conlon,
Jun Suzuki,
Simon K. Yung,
** K. Lam,
Syed M. Assad
Abstract:
In multi-parameter quantum metrology, the resource of entanglement can lead to an increase in efficiency of the estimation process. Entanglement can be used in the state preparation stage, or the measurement stage, or both, to harness this advantage; here we focus on the role of entangling measurements. Specifically, entangling or collective measurements over multiple identical copies of a probe s…
▽ More
In multi-parameter quantum metrology, the resource of entanglement can lead to an increase in efficiency of the estimation process. Entanglement can be used in the state preparation stage, or the measurement stage, or both, to harness this advantage; here we focus on the role of entangling measurements. Specifically, entangling or collective measurements over multiple identical copies of a probe state are known to be superior to measuring each probe individually, but the extent of this improvement is an open problem. It is also known that such entangling measurements, though resource-intensive, are required to attain the ultimate limits in multi-parameter quantum metrology and quantum information processing tasks. In this work we investigate the maximum precision improvement that collective quantum measurements can offer over individual measurements for estimating parameters of qudit states, calling this the 'collective quantum enhancement'. We show that, whereas the maximum enhancement can, in principle, be a factor of $n$ for estimating $n$ parameters, this bound is not tight for large $n$. Instead, our results prove an enhancement linear in dimension of the qudit is possible using collective measurements and lead us to conjecture that this is the maximum collective quantum enhancement in any local estimation scenario.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Revealing the Production Mechanism of High-Energy Neutrinos from NGC 1068
Authors:
Abhishek Das,
B. Theodore Zhang,
Kohta Murase
Abstract:
The detection of high-energy neutrino signals from the nearby Seyfert galaxy NGC 1068 provides us with an opportunity to study nonthermal processes near the center of supermassive black holes. Using the IceCube and latest Fermi-LAT data, we present general multimessenger constraints on the energetics of cosmic rays and the size of neutrino emission regions. In the photohadronic scenario, the requi…
▽ More
The detection of high-energy neutrino signals from the nearby Seyfert galaxy NGC 1068 provides us with an opportunity to study nonthermal processes near the center of supermassive black holes. Using the IceCube and latest Fermi-LAT data, we present general multimessenger constraints on the energetics of cosmic rays and the size of neutrino emission regions. In the photohadronic scenario, the required cosmic-ray luminosity should be larger than about 1-10 percent of the Eddington luminosity, and the emission radius should be less than about 15 Schwarzschild radii in low-beta plasma and less than about 3 Schwarzschild radii in high-beta plasma. The leptonic scenario overshoots the NuSTAR or Fermi-LAT data for any emission radii we consider, and the required gamma-ray luminosity is much larger than the Eddington luminosity. The beta decay scenario also violates not only the energetics requirement but also gamma-ray constraints especially when the Bethe-Heitler and photomeson production processes are consistently considered. Our results rule out the leptonic and beta decay scenarios in a nearly model-independent manner, and support hadronic mechanisms in magnetically-powered coronae if NGC 1068 is a source of high-energy neutrinos.
△ Less
Submitted 18 June, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
Cosmological constraints on mass-varying dark matter
Authors:
Amlan Chakraborty,
Anirban Das,
Subinoy Das,
Shiv K. Sethi
Abstract:
Light mass warm dark matter is an interesting and viable alternative to the cold dark matter paradigm. An intriguing variation of this scenario is the mass-varying dark matter model where the dark matter mass varies with time during its cosmic history. This is realized in multiple particle physics models. In this work, we study the cosmological constraints on such a model where the dark matter mas…
▽ More
Light mass warm dark matter is an interesting and viable alternative to the cold dark matter paradigm. An intriguing variation of this scenario is the mass-varying dark matter model where the dark matter mass varies with time during its cosmic history. This is realized in multiple particle physics models. In this work, we study the cosmological constraints on such a model where the dark matter mass transitions from zero to a finite value in the early Universe. In this model, the matter power spectrum exhibits power suppression below a scale that depends on the epoch of transition, and the angular power spectrum of the cosmic microwave background show a distinctive phase shift. We use the latest cosmic microwave background and the weak lensing data to place lower limit on the transition redshift and ease the $S_8$ tension, unlike the warm dark matter model. This analysis also facilitates a marginal detection of the dark matter (DM) mass. Our findings reveal that while Planck data alone reduces the $S_8$ tension to approximately $2σ$, it does not sufficiently constrain the DM mass. However, when combined with the $S_8$ measurement from KIDS1000+BOSS+2dfLenS, the tension significantly decreases to roughly $1.3σ$, and we observe the detection of a DM mass at $41.7^{+7.81}_{-27.5}\,\mathrm{eV}$. Further analysis incorporating a combined data set from ACT and weak lensing results in an even more pronounced reduction in the tension to approximately $0.4σ$, alongside a higher detected mass of $51.2^{+16}_{-33.5}\,\mathrm{eV}$. We also find a better fit to the combined data compared to the $Λ$CDM model.
△ Less
Submitted 7 June, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
Towards Adaptive IMFs -- Generalization of utility functions in Multi-Agent Frameworks
Authors:
Kaushik Dey,
Satheesh K. Perepu,
Abir Das,
Pallab Dasgupta
Abstract:
Intent Management Function (IMF) is an integral part of future-generation networks. In recent years, there has been some work on AI-based IMFs that can handle conflicting intents and prioritize the global objective based on apriori definition of the utility function and accorded priorities for competing intents. Some of the earlier works use Multi-Agent Reinforcement Learning (MARL) techniques wit…
▽ More
Intent Management Function (IMF) is an integral part of future-generation networks. In recent years, there has been some work on AI-based IMFs that can handle conflicting intents and prioritize the global objective based on apriori definition of the utility function and accorded priorities for competing intents. Some of the earlier works use Multi-Agent Reinforcement Learning (MARL) techniques with AdHoc Teaming (AHT) approaches for efficient conflict handling in IMF. However, the success of such frameworks in real-life scenarios requires them to be flexible to business situations. The intent priorities can change and the utility function, which measures the extent of intent fulfilment, may also vary in definition. This paper proposes a novel mechanism whereby the IMF can generalize to different forms of utility functions and change of intent priorities at run-time without additional training. Such generalization ability, without additional training requirements, would help to deploy IMF in live networks where customer intents and priorities change frequently. Results on the network emulator demonstrate the efficacy of the approach, scalability for new intents, outperforming existing techniques that require additional training to achieve the same degree of flexibility thereby saving cost, and increasing efficiency and adaptability.
△ Less
Submitted 14 May, 2024; v1 submitted 13 May, 2024;
originally announced May 2024.
-
HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models
Authors:
Tanmay Sen,
Ansuman Das,
Mrinmay Sen
Abstract:
Hate speech encompasses verbal, written, or behavioral communication that targets derogatory or discriminatory language against individuals or groups based on sensitive characteristics. Automated hate speech detection plays a crucial role in curbing its propagation, especially across social media platforms. Various methods, including recent advancements in deep learning, have been devised to addre…
▽ More
Hate speech encompasses verbal, written, or behavioral communication that targets derogatory or discriminatory language against individuals or groups based on sensitive characteristics. Automated hate speech detection plays a crucial role in curbing its propagation, especially across social media platforms. Various methods, including recent advancements in deep learning, have been devised to address this challenge. In this study, we introduce HateTinyLLM, a novel framework based on fine-tuned decoder-only tiny large language models (tinyLLMs) for efficient hate speech detection. Our experimental findings demonstrate that the fine-tuned HateTinyLLM outperforms the pretrained mixtral-7b model by a significant margin. We explored various tiny LLMs, including PY007/TinyLlama-1.1B-step-50K-105b, Microsoft/phi-2, and facebook/opt-1.3b, and fine-tuned them using LoRA and adapter methods. Our observations indicate that all LoRA-based fine-tuned models achieved over 80\% accuracy.
△ Less
Submitted 26 April, 2024;
originally announced May 2024.
-
PAM-UNet: Shifting Attention on Region of Interest in Medical Images
Authors:
Abhijit Das,
Debesh Jha,
Vandan Gorade,
Koushik Biswas,
Hongyi Pan,
Zheyuan Zhang,
Daniela P. Ladner,
Yury Velichko,
Amir Borhani,
Ulas Bagci
Abstract:
Computer-aided segmentation methods can assist medical personnel in improving diagnostic outcomes. While recent advancements like UNet and its variants have shown promise, they face a critical challenge: balancing accuracy with computational efficiency. Shallow encoder architectures in UNets often struggle to capture crucial spatial features, leading in inaccurate and sparse segmentation. To addre…
▽ More
Computer-aided segmentation methods can assist medical personnel in improving diagnostic outcomes. While recent advancements like UNet and its variants have shown promise, they face a critical challenge: balancing accuracy with computational efficiency. Shallow encoder architectures in UNets often struggle to capture crucial spatial features, leading in inaccurate and sparse segmentation. To address this limitation, we propose a novel \underline{P}rogressive \underline{A}ttention based \underline{M}obile \underline{UNet} (\underline{PAM-UNet}) architecture. The inverted residual (IR) blocks in PAM-UNet help maintain a lightweight framework, while layerwise \textit{Progressive Luong Attention} ($\mathcal{PLA}$) promotes precise segmentation by directing attention toward regions of interest during synthesis. Our approach prioritizes both accuracy and speed, achieving a commendable balance with a mean IoU of 74.65 and a dice score of 82.87, while requiring only 1.32 floating-point operations per second (FLOPS) on the Liver Tumor Segmentation Benchmark (LiTS) 2017 dataset. These results highlight the importance of develo** efficient segmentation models to accelerate the adoption of AI in clinical practice.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Modeling Linear and Non-linear Layers: An MILP Approach Towards Finding Differential and Impossible Differential Propagations
Authors:
Debranjan Pal,
Vishal Pankaj Chandratreya,
Abhijit Das,
Dipanwita Roy Chowdhury
Abstract:
Symmetric key cryptography stands as a fundamental cornerstone in ensuring security within contemporary electronic communication frameworks. The cryptanalysis of classical symmetric key ciphers involves traditional methods and techniques aimed at breaking or analyzing these cryptographic systems. In the evaluation of new ciphers, the resistance against linear and differential cryptanalysis is comm…
▽ More
Symmetric key cryptography stands as a fundamental cornerstone in ensuring security within contemporary electronic communication frameworks. The cryptanalysis of classical symmetric key ciphers involves traditional methods and techniques aimed at breaking or analyzing these cryptographic systems. In the evaluation of new ciphers, the resistance against linear and differential cryptanalysis is commonly a key design criterion. The wide trail design technique for block ciphers facilitates the demonstration of security against linear and differential cryptanalysis. Assessing the scheme's security against differential attacks often involves determining the minimum number of active SBoxes for all rounds of a cipher. The propagation characteristics of a cryptographic component, such as an SBox, can be expressed using Boolean functions. Mixed Integer Linear Programming (MILP) proves to be a valuable technique for solving Boolean functions. We formulate a set of inequalities to model a Boolean function, which is subsequently solved by an MILP solver. To efficiently model a Boolean function and select a minimal set of inequalities, two key challenges must be addressed. We propose algorithms to address the second challenge, aiming to find more optimized linear and non-linear components. Our approaches are applied to modeling SBoxes (up to six bits) and EXOR operations with any number of inputs. Additionally, we introduce an MILP-based automatic tool for exploring differential and impossible differential propagations within a cipher. The tool is successfully applied to five lightweight block ciphers: Lilliput, GIFT64, SKINNY64, Klein, and MIBS.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Transmon Qubit Constraints on Dark Matter-Nucleon Scattering
Authors:
Anirban Das,
Noah Kurinsky,
Rebecca K. Leane
Abstract:
We recently pointed out that power measurements of single quasiparticle devices can be used to detect dark matter. These devices have the lowest known energy thresholds, far surpassing standard direct detection experiments, requiring energy deposition above only about an meV. We calculate dark matter induced quasiparticle densities in transmon qubits, and use the latest transmon qubit measurements…
▽ More
We recently pointed out that power measurements of single quasiparticle devices can be used to detect dark matter. These devices have the lowest known energy thresholds, far surpassing standard direct detection experiments, requiring energy deposition above only about an meV. We calculate dark matter induced quasiparticle densities in transmon qubits, and use the latest transmon qubit measurements that provide one of the strongest existing lab-based bounds on dark matter-nucleon scattering below about 100 MeV. We strongly constrain sub-component dark matter, using both a dark matter population thermalized in the Earth as well as the dark matter wind from the Galactic halo. We demonstrate future potential sensitivities using devices with low quasiparticle densities.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
Espresso: Robust Concept Filtering in Text-to-Image Models
Authors:
Anudeep Das,
Vasisht Duddu,
Rui Zhang,
N. Asokan
Abstract:
Diffusion-based text-to-image (T2I) models generate high-fidelity images for given textual prompts. They are trained on large datasets scraped from the Internet, potentially containing unacceptable concepts (e.g., copyright infringing or unsafe). Retraining T2I models after filtering out unacceptable concepts in the training data is inefficient and degrades utility. Hence, there is a need for conc…
▽ More
Diffusion-based text-to-image (T2I) models generate high-fidelity images for given textual prompts. They are trained on large datasets scraped from the Internet, potentially containing unacceptable concepts (e.g., copyright infringing or unsafe). Retraining T2I models after filtering out unacceptable concepts in the training data is inefficient and degrades utility. Hence, there is a need for concept removal techniques (CRTs) which are effective in removing unacceptable concepts, utility-preserving on acceptable concepts, and robust against evasion with adversarial prompts. None of the prior filtering and fine-tuning CRTs satisfy all these requirements simultaneously.
We introduce Espresso, the first robust concept filter based on Contrastive Language-Image Pre-Training (CLIP). It identifies unacceptable concepts by projecting the generated image's embedding onto the vector connecting unacceptable and acceptable concepts in the joint text-image embedding space. This ensures robustness by restricting the adversary to adding noise only along this vector, in the direction of the acceptable concept. Further fine-tuning Espresso to separate embeddings of acceptable and unacceptable concepts, while preserving their pairing with image embeddings, ensures both effectiveness and utility. We evaluate Espresso on eleven concepts to show that it is effective (~5% CLIP accuracy on unacceptable concepts), utility-preserving (~93% normalized CLIP score on acceptable concepts), and robust (~4% CLIP accuracy on adversarial prompts for unacceptable concepts). Finally, we present theoretical bounds for the certified robustness of Espresso against adversarial prompts, and an empirical analysis.
△ Less
Submitted 7 June, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
Bipartite powers of some classes of bipartite graphs
Authors:
Indrajit Paul,
Ashok Kumar Das
Abstract:
Graph powers are a well-studied concept in graph theory. Analogous to graph powers, Chandran et al.[3] introduced the concept of bipartite powers for bipartite graphs. In this paper, we will demonstrate that some well-known classes of bipartite graphs, namely the interval bigraphs, proper interval bigraphs, and bigraphs of Ferrers dimension 2, are closed under the operation of taking bipartite pow…
▽ More
Graph powers are a well-studied concept in graph theory. Analogous to graph powers, Chandran et al.[3] introduced the concept of bipartite powers for bipartite graphs. In this paper, we will demonstrate that some well-known classes of bipartite graphs, namely the interval bigraphs, proper interval bigraphs, and bigraphs of Ferrers dimension 2, are closed under the operation of taking bipartite powers. Finally, we define strongly closed property for bipartite graphs under powers and have shown that the class of chordal bipartite graphs is strongly closed under powers.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Enhancing nanocrystal superlattice self-assembly near a metastable liquid binodal
Authors:
Christian P. N. Tanner,
Vivian R. K. Wall,
Joshua Portner,
Ahhyun Jeong,
Avishek Das,
James K. Utterback,
Leo M. Hamerlynck,
Jonathan G. Raybin,
Matthew J. Hurley,
Nicholas Leonard,
Rebecca B. Wai,
Jenna A. Tan,
Mumtaz Gababa,
Chenhui Zhu,
Eric Schaible,
Christopher J. Tassone,
David T. Limmer,
Samuel W. Teitelbaum,
Dmitri V. Talapin,
Naomi S. Ginsberg
Abstract:
Bottom-up assembly of nanocrystals (NCs) into ordered arrays, or superlattices (SLs), is a promising route to design materials with new functionalities, but the degree of control over assembly into functional structures remains challenging. Using electrostatics, rather than density, to tune the interactions between semiconductor NCs, we watch self-assembly proceeding through a metastable liquid ph…
▽ More
Bottom-up assembly of nanocrystals (NCs) into ordered arrays, or superlattices (SLs), is a promising route to design materials with new functionalities, but the degree of control over assembly into functional structures remains challenging. Using electrostatics, rather than density, to tune the interactions between semiconductor NCs, we watch self-assembly proceeding through a metastable liquid phase. We systematically investigate the phase behavior as a function of quench conditions in situ and in real time using small angle X-ray scattering (SAXS). Through quantitative fitting to colloid, liquid, and SL models, we extract the time evolution of each phase and the system phase diagram, which we find to be consistent with short-range attractive interactions. Using the phase diagram's predictive power, we establish control of the self-assembly rate over three orders of magnitude, and identify one- and two-step self-assembly regimes, with only the latter implicating the metastable liquid as an intermediate. Importantly, the presence of the metastable liquid increases SL formation rates relative to the equivalent one-step pathway, and SL order counterintuitively increases with the rate, revealing a highly desirable and generalizable kinetic strategy to promote and enhance ordered assembly.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
A proof theory of (omega-)context-free languages, via non-wellfounded proofs
Authors:
Anupam Das,
Abhishek De
Abstract:
We investigate the proof theory of regular expressions with fixed points, construed as a notation for (omega-)context-free grammars. Starting with a hypersequential system for regular expressions due to Das and Pous, we define its extension by least fixed points and prove soundness and completeness of its non-wellfounded proofs for the standard language model. From here we apply proof-theoretic te…
▽ More
We investigate the proof theory of regular expressions with fixed points, construed as a notation for (omega-)context-free grammars. Starting with a hypersequential system for regular expressions due to Das and Pous, we define its extension by least fixed points and prove soundness and completeness of its non-wellfounded proofs for the standard language model. From here we apply proof-theoretic techniques to recover an infinitary axiomatisation of the resulting equational theory, complete for inclusions of context-free languages. Finally, we extend our syntax by greatest fixed points, now computing omega-context-free languages. We show the soundness and completeness of the corresponding system using a mixture of proof-theoretic and game-theoretic techniques.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Minimum Description Feature Selection for Complexity Reduction in Machine Learning-based Wireless Positioning
Authors:
Myeung Suk Oh,
Anindya Bijoy Das,
Taejoon Kim,
David J. Love,
Christopher G. Brinton
Abstract:
Recently, deep learning approaches have provided solutions to difficult problems in wireless positioning (WP). Although these WP algorithms have attained excellent and consistent performance against complex channel environments, the computational complexity coming from processing high-dimensional features can be prohibitive for mobile applications. In this work, we design a novel positioning neura…
▽ More
Recently, deep learning approaches have provided solutions to difficult problems in wireless positioning (WP). Although these WP algorithms have attained excellent and consistent performance against complex channel environments, the computational complexity coming from processing high-dimensional features can be prohibitive for mobile applications. In this work, we design a novel positioning neural network (P-NN) that utilizes the minimum description features to substantially reduce the complexity of deep learning-based WP. P-NN's feature selection strategy is based on maximum power measurements and their temporal locations to convey information needed to conduct WP. We improve P-NN's learning ability by intelligently processing two different types of inputs: sparse image and measurement matrices. Specifically, we implement a self-attention layer to reinforce the training ability of our network. We also develop a technique to adapt feature space size, optimizing over the expected information gain and the classification capability quantified with information-theoretic measures on signal bin selection. Numerical results show that P-NN achieves a significant advantage in performance-complexity tradeoff over deep learning baselines that leverage the full power delay profile (PDP). In particular, we find that P-NN achieves a large improvement in performance for low SNR, as unnecessary measurements are discarded in our minimum description features.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Evolution of Shielding Cloud Under Oscillatory External Forcing in Strongly Coupled Ultracold Neutral Plasma
Authors:
Mamta Yadav,
Aman Singh Katariya,
Animesh Sharma,
Amita Das
Abstract:
This paper investigates the dynamics of crystalline clusters observed in Molecular Dynamics (MD) studies conducted earlier [Yadav, M., et al. Physical Review E, 107(5), 055214(2023)] for ultra-cold neutral plasmas. An external oscillatory forcing is applied for this purpose and the evolution is tracked with the help of MD simulations using the open source LAMMPS software. Interesting observations…
▽ More
This paper investigates the dynamics of crystalline clusters observed in Molecular Dynamics (MD) studies conducted earlier [Yadav, M., et al. Physical Review E, 107(5), 055214(2023)] for ultra-cold neutral plasmas. An external oscillatory forcing is applied for this purpose and the evolution is tracked with the help of MD simulations using the open source LAMMPS software. Interesting observations relating to cluster dynamics are presented. The formation of a pentagonal arrangement of particles is also reported.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Multi-Agent Hybrid SAC for Joint SS-DSA in CRNs
Authors:
David R. Nickel,
Anindya Bijoy Das,
David J. Love,
Christopher G. Brinton
Abstract:
Opportunistic spectrum access has the potential to increase the efficiency of spectrum utilization in cognitive radio networks (CRNs). In CRNs, both spectrum sensing and resource allocation (SSRA) are critical to maximizing system throughput while minimizing collisions of secondary users with the primary network. However, many works in dynamic spectrum access do not consider the impact of imperfec…
▽ More
Opportunistic spectrum access has the potential to increase the efficiency of spectrum utilization in cognitive radio networks (CRNs). In CRNs, both spectrum sensing and resource allocation (SSRA) are critical to maximizing system throughput while minimizing collisions of secondary users with the primary network. However, many works in dynamic spectrum access do not consider the impact of imperfect sensing information such as mis-detected channels, which the additional information available in joint SSRA can help remediate. In this work, we examine joint SSRA as an optimization which seeks to maximize a CRN's net communication rate subject to constraints on channel sensing, channel access, and transmit power. Given the non-trivial nature of the problem, we leverage multi-agent reinforcement learning to enable a network of secondary users to dynamically access unoccupied spectrum via only local test statistics, formulated under the energy detection paradigm of spectrum sensing. In doing so, we develop a novel multi-agent implementation of hybrid soft actor critic, MHSAC, based on the QMIX mixing scheme. Through experiments, we find that our SSRA algorithm, HySSRA, is successful in maximizing the CRN's utilization of spectrum resources while also limiting its interference with the primary network, and outperforms the current state-of-the-art by a wide margin. We also explore the impact of wireless variations such as coherence time on the efficacy of the system.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Shifting Focus with HCEye: Exploring the Dynamics of Visual Highlighting and Cognitive Load on User Attention and Saliency Prediction
Authors:
Anwesha Das,
Zekun Wu,
Iza Škrjanec,
Anna Maria Feit
Abstract:
Visual highlighting can guide user attention in complex interfaces. However, its effectiveness under limited attentional capacities is underexplored. This paper examines the joint impact of visual highlighting (permanent and dynamic) and dual-task-induced cognitive load on gaze behaviour. Our analysis, using eye-movement data from 27 participants viewing 150 unique webpages reveals that while part…
▽ More
Visual highlighting can guide user attention in complex interfaces. However, its effectiveness under limited attentional capacities is underexplored. This paper examines the joint impact of visual highlighting (permanent and dynamic) and dual-task-induced cognitive load on gaze behaviour. Our analysis, using eye-movement data from 27 participants viewing 150 unique webpages reveals that while participants' ability to attend to UI elements decreases with increasing cognitive load, dynamic adaptations (i.e., highlighting) remain attention-grabbing. The presence of these factors significantly alters what people attend to and thus what is salient. Accordingly, we show that state-of-the-art saliency models increase their performance when accounting for different cognitive loads. Our empirical insights, along with our openly available dataset, enhance our understanding of attentional processes in UIs under varying cognitive (and perceptual) loads and open the door for new models that can predict user attention while multitasking.
△ Less
Submitted 2 May, 2024; v1 submitted 22 April, 2024;
originally announced April 2024.
-
Enhanced plasma ion heating by lasers in inhomogeneous external magnetic field
Authors:
Rohit Juneja,
Trishul Dhalia,
Amita Das
Abstract:
Recent studies have shown direct ion heating (vashistha2020new,Juneja_2023) by lasers EM (Electromagnetic) wave interacting with a plasma threaded by an external uniform magnetic field. The EM wave frequency was near the lower hybrid (LH) resonance frequency. The LH resonance occurs at the edge of the pass band of the magnetized dispersion relation. The group speed of the wave is negligible at res…
▽ More
Recent studies have shown direct ion heating (vashistha2020new,Juneja_2023) by lasers EM (Electromagnetic) wave interacting with a plasma threaded by an external uniform magnetic field. The EM wave frequency was near the lower hybrid (LH) resonance frequency. The LH resonance occurs at the edge of the pass band of the magnetized dispersion relation. The group speed of the wave is negligible at resonance. In these studies, the energy absorption remains essentially confined at the plasma surface. However, to heat the ions in the bulk plasma and at a desired location, a tailored inhomogeneous external magnetic field profile has been chosen here. The strength of the magnetic field at the plasma edge is such that the EM wave frequency lies inside the pass band, where the group velocity has a significant value. It enables the wave to enter the bulk plasma. The external magnetic field is then spatially tailored appropriately to have the LH resonance at a desired spatial location inside the plasma. The Particle-In-Cell (PIC) simulations using the OSIRIS4.0 platform have been carried out, which demonstrates that the EM wave pulse comes to a standstill at the location of the resonance. The wave pulse is observed to break down subsequently, and the energy consequently goes dominantly to the local plasma ions. The absorption is significantly enhanced compared to the case in which the magnetic field profile was homogeneous. The dependence of absorption on the choice of magnetic field profile, the laser intensity, etc., has also been carried out.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Soil Fertility Prediction Using Combined USB-microscope Based Soil Image, Auxiliary Variables, and Portable X-Ray Fluorescence Spectrometry
Authors:
Shubhadip Dasgupta,
Satwik Pate,
Divya Rathore,
L. G. Divyanth,
Ayan Das,
Anshuman Nayak,
Subhadip Dey,
Asim Biswas,
David C. Weindorf,
Bin Li,
Sergio Henrique Godinho Silva,
Bruno Teixeira Ribeiro,
Sanjay Srivastava,
Somsubhra Chakraborty
Abstract:
This study explored the application of portable X-ray fluorescence (PXRF) spectrometry and soil image analysis to rapidly assess soil fertility, focusing on critical parameters such as available B, organic carbon (OC), available Mn, available S, and the sulfur availability index (SAI). Analyzing 1,133 soil samples from various agro-climatic zones in Eastern India, the research combined color and t…
▽ More
This study explored the application of portable X-ray fluorescence (PXRF) spectrometry and soil image analysis to rapidly assess soil fertility, focusing on critical parameters such as available B, organic carbon (OC), available Mn, available S, and the sulfur availability index (SAI). Analyzing 1,133 soil samples from various agro-climatic zones in Eastern India, the research combined color and texture features from microscopic soil images, PXRF data, and auxiliary soil variables (AVs) using a Random Forest model. Results indicated that integrating image features (IFs) with auxiliary variables (AVs) significantly enhanced prediction accuracy for available B (R^2 = 0.80) and OC (R^2 = 0.88). A data fusion approach, incorporating IFs, AVs, and PXRF data, further improved predictions for available Mn and SAI with R^2 values of 0.72 and 0.70, respectively. The study demonstrated how these integrated technologies have the potential to provide quick and affordable options for soil testing, opening up access to more sophisticated prediction models and a better comprehension of the fertility and health of the soil. Future research should focus on the application of deep learning models on a larger dataset of soil images, developed using soils from a broader range of agro-climatic zones under field condition.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Fluid Simulation for a Finite Size Plasma
Authors:
Subhasish Bag,
Vikrant Saxena,
Amita Das
Abstract:
Studies on finite-size plasma have attracted a lot of attention lately. They can form by ionizing liquid droplets by lasers. The dynamical behavior of such plasma droplets is, therefore, a topic of significant interest. In particular, questions related to the linear and nonlinear characteristics (associated with the inhomogeneous density typically at the edge of the droplet), the behavior of plasm…
▽ More
Studies on finite-size plasma have attracted a lot of attention lately. They can form by ionizing liquid droplets by lasers. The dynamical behavior of such plasma droplets is, therefore, a topic of significant interest. In particular, questions related to the linear and nonlinear characteristics (associated with the inhomogeneous density typically at the edge of the droplet), the behavior of plasma expansion, etc., are of interest. A one-dimensional fluid simulation study has been carried out to investigate this behavior. It is observed that a slight imbalance in the charge density leads to oscillations that are concentrated and keep acquiring higher amplitude and sharper profile at the inhomogeneous edge region. Such oscillations lead to the expansion of the droplet. Though the fluid description breaks when the sharpness of these structures becomes comparable to the grid size, it provides a reasonable estimate of wave-breaking time. The presence of dissipative effects like diffusion is shown to arrest the sharpness of these structures. The dynamics of these structures in the presence of an externally applied oscillating electric field corresponding to a long wavelength radiation has also been studied.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
On the complexity of normalization for the planar $λ$-calculus
Authors:
Anupam Das,
Damiano Mazza,
Lê Thành Dũng Nguyên,
Noam Zeilberger
Abstract:
We sketch a tentative proof of P-completeness for the $β$-convertibility problem on untyped planar (a.k.a. ordered or non-commutative) $λ$-terms.
We sketch a tentative proof of P-completeness for the $β$-convertibility problem on untyped planar (a.k.a. ordered or non-commutative) $λ$-terms.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
New methods for computing the generalized chi-square distribution
Authors:
Abhranil Das
Abstract:
We present several exact and approximate mathematical methods and open-source software to compute the cdf, pdf and inverse cdf of the generalized chi-square distribution, which appears in Bayesian classification problems. Some methods are geared for speed, while others are designed to be accurate far into the tails, using which we can also measure large values of the discriminability index $d'$ be…
▽ More
We present several exact and approximate mathematical methods and open-source software to compute the cdf, pdf and inverse cdf of the generalized chi-square distribution, which appears in Bayesian classification problems. Some methods are geared for speed, while others are designed to be accurate far into the tails, using which we can also measure large values of the discriminability index $d'$ between multinormals. We compare the accuracy and speed of these methods against the best existing methods.
△ Less
Submitted 7 April, 2024;
originally announced April 2024.
-
Online Learning under Haphazard Input Conditions: A Comprehensive Review and Analysis
Authors:
Rohit Agarwal,
Arijit Das,
Alexander Horsch,
Krishna Agarwal,
Dilip K. Prasad
Abstract:
The domain of online learning has experienced multifaceted expansion owing to its prevalence in real-life applications. Nonetheless, this progression operates under the assumption that the input feature space of the streaming data remains constant. In this survey paper, we address the topic of online learning in the context of haphazard inputs, explicitly foregoing such an assumption. We discuss,…
▽ More
The domain of online learning has experienced multifaceted expansion owing to its prevalence in real-life applications. Nonetheless, this progression operates under the assumption that the input feature space of the streaming data remains constant. In this survey paper, we address the topic of online learning in the context of haphazard inputs, explicitly foregoing such an assumption. We discuss, classify, evaluate, and compare the methodologies that are adept at modeling haphazard inputs, additionally providing the corresponding code implementations and their carbon footprint. Moreover, we classify the datasets related to the field of haphazard inputs and introduce evaluation metrics specifically designed for datasets exhibiting imbalance. The code of each methodology can be found at https://github.com/Rohit102497/HaphazardInputsReview
△ Less
Submitted 7 April, 2024;
originally announced April 2024.
-
Universal time scalings of sensitivity in Markovian quantum metrology
Authors:
Arpan Das,
Wojciech Gorecki,
Rafal Demkowicz-Dobrzanski
Abstract:
Assuming a Markovian time evolution of a quantum sensing system, we provide a general characterization of the optimal sensitivity scalings with time, under the most general quantum control protocols. We allow the estimated parameter to influence both the Hamiltonian as well as the dissipative part of the quantum master equation. We focus on the asymptotic-time as well as the short-time sensitivity…
▽ More
Assuming a Markovian time evolution of a quantum sensing system, we provide a general characterization of the optimal sensitivity scalings with time, under the most general quantum control protocols. We allow the estimated parameter to influence both the Hamiltonian as well as the dissipative part of the quantum master equation. We focus on the asymptotic-time as well as the short-time sensitivity scalings, and investigate the relevant time scales on which the transition between the two regimes appears. This allows us to characterize, via simple algebraic conditions (in terms of the Hamiltonian, the jump operators as well as their parameter derivatives), the four classes of metrological models that represent: quadratic-linear, quadratic-quadratic, linear-linear and linear-quadratic time scalings. We also provide universal numerical methods to obtain quantitative bounds on sensitivity that are the tightest that exist in the literature.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Securing Social Spaces: Harnessing Deep Learning to Eradicate Cyberbullying
Authors:
Rohan Biswas,
Kasturi Ganguly,
Arijit Das,
Diganta Saha
Abstract:
In today's digital world, cyberbullying is a serious problem that can harm the mental and physical health of people who use social media. This paper explains just how serious cyberbullying is and how it really affects indi-viduals exposed to it. It also stresses how important it is to find better ways to detect cyberbullying so that online spaces can be safer. Plus, it talks about how making more…
▽ More
In today's digital world, cyberbullying is a serious problem that can harm the mental and physical health of people who use social media. This paper explains just how serious cyberbullying is and how it really affects indi-viduals exposed to it. It also stresses how important it is to find better ways to detect cyberbullying so that online spaces can be safer. Plus, it talks about how making more accurate tools to spot cyberbullying will be really helpful in the future. Our paper introduces a deep learning-based ap-proach, primarily employing BERT and BiLSTM architectures, to effective-ly address cyberbullying. This approach is designed to analyse large vol-umes of posts and predict potential instances of cyberbullying in online spaces. Our results demonstrate the superiority of the hateBERT model, an extension of BERT focused on hate speech detection, among the five mod-els, achieving an accuracy rate of 89.16%. This research is a significant con-tribution to "Computational Intelligence for Social Transformation," prom-ising a safer and more inclusive digital landscape.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
A Rolling Horizon Restoration Framework for Post-disaster Restoration of Electrical Distribution Networks
Authors:
Ran Wei,
Arindam K. Das,
Payman Arabshahi,
Daniel S. Kirschen
Abstract:
Severe weather events such as floods, hurricanes, earthquakes, and large wind or ice storms can cause extensive damage to electrical distribution networks, requiring a multi-day restoration effort. Complicating the recovery process is the lack of complete and accurate information regarding the extent and locations of damages, at least during the initial part of the recovery process. These factors…
▽ More
Severe weather events such as floods, hurricanes, earthquakes, and large wind or ice storms can cause extensive damage to electrical distribution networks, requiring a multi-day restoration effort. Complicating the recovery process is the lack of complete and accurate information regarding the extent and locations of damages, at least during the initial part of the recovery process. These factors make workforce planning challenging. In this paper, we adopt a rolling horizon restoration framework whereby repairs are planned for adjustable finite length restoration windows. Considering both repair times as well as travel times, we show that the optimal scheduling problem with multiple crews, each with their own time budget, can be recast in terms of a cost constrained reward maximizing mTSP (traveling salesman problem) on doubly weighted graphs, where the objective is to maximize the aggregate reward earned during the upcoming restoration window, provided no crew violates its time budget and certain electrical continuity constraints are met. We propose a mixed integer linear programming (MILP) model for solving the above problem which is validated on standard IEEE PES test feeder networks.
△ Less
Submitted 30 May, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
A Fully-Configurable Open-Source Software-Defined Digital Quantized Spiking Neural Core Architecture
Authors:
Shadi Matinizadeh,
Noah Pacik-Nelson,
Ioannis Polykretis,
Krupa Tishbi,
Suman Kumar,
M. L. Varshika,
Arghavan Mohammadhassani,
Abhishek Mishra,
Nagarajan Kandasamy,
James Shackleford,
Eric Gallo,
Anup Das
Abstract:
We introduce QUANTISENC, a fully configurable open-source software-defined digital quantized spiking neural core architecture to advance research in neuromorphic computing. QUANTISENC is designed hierarchically using a bottom-up methodology with multiple neurons in each layer and multiple layers in each core. The number of layers and neurons per layer can be configured via software in a top-down m…
▽ More
We introduce QUANTISENC, a fully configurable open-source software-defined digital quantized spiking neural core architecture to advance research in neuromorphic computing. QUANTISENC is designed hierarchically using a bottom-up methodology with multiple neurons in each layer and multiple layers in each core. The number of layers and neurons per layer can be configured via software in a top-down methodology to generate the hardware for a target spiking neural network (SNN) model. QUANTISENC uses leaky integrate and fire neurons (LIF) and current-based excitatory and inhibitory synapses (CUBA). The nonlinear dynamics of a neuron can be configured at run-time via programming its internal control registers. Each neuron performs signed fixed-point arithmetic with user-defined quantization and decimal precision. QUANTISENC supports all-to-all, one-to-one, and Gaussian connections between layers. Its hardware-software interface is integrated with a PyTorch-based SNN simulator. This integration allows to define and train an SNN model in PyTorch and evaluate the hardware performance (e.g., area, power, latency, and throughput) through FPGA prototy** and ASIC design. The hardware-software interface also takes advantage of the layer-based architecture and distributed memory organization of QUANTISENC to enable pipelining by overlap** computations on streaming data. Overall, the proposed software-defined hardware design methodology offers flexibility similar to that of high-level synthesis (HLS), but provides better hardware performance with zero hardware development effort. We evaluate QUANTISENC using three spiking datasets and show its superior performance against state-of the-art designs.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
On the reduction of Linear Parameter-Varying State-Space models
Authors:
E. Javier Olucha,
Bogoljub Terzin,
Amritam Das,
Roland Tóth
Abstract:
This paper presents an overview and comparative study of the state of the art in State-Order Reduction (SOR) and Scheduling Dimension Reduction (SDR) for Linear Parameter-Varying (LPV) State-Space (SS) models, comparing and benchmarking their capabilities, limitations and performance. The use case chosen for these studies is an interconnected network of nonlinear coupled mass spring damper systems…
▽ More
This paper presents an overview and comparative study of the state of the art in State-Order Reduction (SOR) and Scheduling Dimension Reduction (SDR) for Linear Parameter-Varying (LPV) State-Space (SS) models, comparing and benchmarking their capabilities, limitations and performance. The use case chosen for these studies is an interconnected network of nonlinear coupled mass spring damper systems with three different configurations, where some spring coefficients are described by arbitrary user-defined static nonlinear functions. For SOR, the following methods are compared: Linear Time-Invariant (LTI), LPV and LFR-based balanced reductions, moment matching and parameter-varying oblique projection. For SDR, the following methods are compared: Principal Component Analysis (PCA), trajectory PCA, Kernel PCA and LTI balanced truncation, autoencoders and deep neural network. The comparison reveals the most suitable reduction methods for the different benchmark configurations, from which we provide use case SOR and SDR guidelines that can be used to choose the best reduction method for a given LPV-SS model.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
SONIC: Synergizing VisiON Foundation Models for Stress RecogNItion from ECG signals
Authors:
Orchid Chetia Phukan,
Ankita Das,
Arun Balaji Buduru,
Rajesh Sharma
Abstract:
Stress recognition through physiological signals such as Electrocardiogram (ECG) signals has garnered significant attention. Traditionally, research in this field predominantly focused on utilizing handcrafted features or raw signals as inputs for learning algorithms. However, there is now a burgeoning interest within the community in leveraging large-scale vision foundation models (VFMs) like Res…
▽ More
Stress recognition through physiological signals such as Electrocardiogram (ECG) signals has garnered significant attention. Traditionally, research in this field predominantly focused on utilizing handcrafted features or raw signals as inputs for learning algorithms. However, there is now a burgeoning interest within the community in leveraging large-scale vision foundation models (VFMs) like ResNet50, VGG19, and others. These VFMs are increasingly preferred due to their ability to capture complex features, enhancing the accuracy and effectiveness of stress recognition systems. However, no particular focus has been given on combining these VFMs. The combination of VFMs offers promising benefits by harnessing their collective knowledge to extract richer representations for improved stress recognition. So, to mitigate this research gap, we focus on combining different VFMs for stress recognition from ECG and propose SONIC, a novel framework that combines VFMs through their logits and training a fully connected network on the combined logits. Through extensive experimentation, SONIC showed the top performance against individual VFMs performance on the WESAD benchmark. With SONIC, we report state-of-the-art (SOTA) performance in WESAD with 99.36% and 99.24% (stress vs non-stress) and 97.66% and 97.10% (amusement vs stress vs baseline) in accuracy and F1 respectively.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
On tensor products of representations of Lie superalgebras
Authors:
Abhishek Das,
Santosha Pattanayak
Abstract:
We consider typical finite dimensional complex irreducible representations of a basic classical simple Lie superalgebra, and give a sufficient condition on when unique factorization of finite tensor products of such representations hold. We also prove unique factorization of tensor products of singly atypical finite dimensional irreducible modules for $\mathfrak{sl}(m+1,n+1)$,…
▽ More
We consider typical finite dimensional complex irreducible representations of a basic classical simple Lie superalgebra, and give a sufficient condition on when unique factorization of finite tensor products of such representations hold. We also prove unique factorization of tensor products of singly atypical finite dimensional irreducible modules for $\mathfrak{sl}(m+1,n+1)$, $\mathfrak{osp}(2,2n)$, $G(3)$ and $F(4)$ under an additional assumption. This result is a Lie superalgebra analogue of Rajan's fundamental result \cite{MR2123935} on unique factorization of tensor products for finite dimensional complex simple Lie algebras.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.