Search | arXiv e-print repository

arXiv:2406.19674 [pdf, other]

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

Authors: Krishna C. Puvvada, Piotr Żelasko, He Huang, Oleksii Hrinchuk, Nithin Rao Koluguri, Kunal Dhawan, Somshubra Majumdar, Elena Rastorgueva, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Boris Ginsburg

Abstract: Recent advances in speech recognition and translation rely on hundreds of thousands of hours of Internet speech data. We argue that state-of-the art accuracy can be reached without relying on web-scale data. Canary - multilingual ASR and speech translation model, outperforms current state-of-the-art models - Whisper, OWSM, and Seamless-M4T on English, French, Spanish, and German languages, while b… ▽ More Recent advances in speech recognition and translation rely on hundreds of thousands of hours of Internet speech data. We argue that state-of-the art accuracy can be reached without relying on web-scale data. Canary - multilingual ASR and speech translation model, outperforms current state-of-the-art models - Whisper, OWSM, and Seamless-M4T on English, French, Spanish, and German languages, while being trained on an order of magnitude less data than these models. Three key factors enables such data-efficient model: (1) a FastConformer-based attention encoder-decoder architecture (2) training on synthetic data generated with machine translation and (3) advanced training techniques: data-balancing, dynamic data blending, dynamic bucketing and noise-robust fine-tuning. The model, weights, and training code will be open-sourced. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Comments: Accepted at Interspeech-2024

arXiv:2406.15832 [pdf, other]

Comparing sampling techniques to chart parameter space of 21 cm Global signal with Artificial Neural Networks

Authors: Anshuman Tripathi, Gursharanjit Kaur, Abhirup Datta, Suman Majumdar

Abstract: Understanding the first billion years of the universe requires studying two critical epochs: the Epoch of Reionization (EoR) and Cosmic Dawn (CD). However, due to limited data, the properties of the Intergalactic Medium (IGM) during these periods remain poorly understood, leading to a vast parameter space for the global 21cm signal. Training an Artificial Neural Network (ANN) with a narrowly defin… ▽ More Understanding the first billion years of the universe requires studying two critical epochs: the Epoch of Reionization (EoR) and Cosmic Dawn (CD). However, due to limited data, the properties of the Intergalactic Medium (IGM) during these periods remain poorly understood, leading to a vast parameter space for the global 21cm signal. Training an Artificial Neural Network (ANN) with a narrowly defined parameter space can result in biased inferences. To mitigate this, the training dataset must be uniformly drawn from the entire parameter space to cover all possible signal realizations. However, drawing all possible realizations is computationally challenging, necessitating the sampling of a representative subset of this space. This study aims to identify optimal sampling techniques for the extensive dimensionality and volume of the 21cm signal parameter space. The optimally sampled training set will be used to train the ANN to infer from the global signal experiment. We investigate three sampling techniques: random, Latin Hypercube (stratified), and Hammersley Sequence (quasi-Monte Carlo) sampling, and compare their outcomes. Our findings reveal that sufficient samples must be drawn for robust and accurate ANN model training, regardless of the sampling technique employed. The required sample size depends primarily on two factors: the complexity of the data and the number of free parameters. More free parameters necessitate drawing more realizations. Among the sampling techniques utilized, we find that ANN models trained with Hammersley Sequence sampling demonstrate greater robustness compared to those trained with Latin Hypercube and Random sampling. △ Less

Submitted 22 June, 2024; originally announced June 2024.

Comments: 30 pages, 17 figures, comments are welcome, prepared for submission to JCAP

arXiv:2406.12946 [pdf]

Instruction Data Generation and Unsupervised Adaptation for Speech Language Models

Authors: Vahid Noroozi, Zhehuai Chen, Somshubra Majumdar, Steve Huang, Jagadeesh Balam, Boris Ginsburg

Abstract: In this paper, we propose three methods for generating synthetic samples to train and evaluate multimodal large language models capable of processing both text and speech inputs. Addressing the scarcity of samples containing both modalities, synthetic data generation emerges as a crucial strategy to enhance the performance of such systems and facilitate the modeling of cross-modal relationships be… ▽ More In this paper, we propose three methods for generating synthetic samples to train and evaluate multimodal large language models capable of processing both text and speech inputs. Addressing the scarcity of samples containing both modalities, synthetic data generation emerges as a crucial strategy to enhance the performance of such systems and facilitate the modeling of cross-modal relationships between the speech and text domains. Our process employs large language models to generate textual components and text-to-speech systems to generate speech components. The proposed methods offer a practical and effective means to expand the training dataset for these models. Experimental results show progress in achieving an integrated understanding of text and speech. We also highlight the potential of using unlabeled speech data to generate synthetic samples comparable in quality to those with available transcriptions, enabling the expansion of these models to more languages. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: Accepted for Interspeech 2024

arXiv:2406.11871 [pdf, other]

Generative AI Voting: Fair Collective Choice is Resilient to LLM Biases and Inconsistencies

Authors: Srijoni Majumdar, Edith Elkind, Evangelos Pournaras

Abstract: Scaling up deliberative and voting participation is a longstanding endeavor -- a cornerstone for direct democracy and legitimate collective choice. Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) provide unprecedented opportunities, but also alerting risks for digital democracy. AI personal assistants can overcome cognitive bandwidth limitations of… ▽ More Scaling up deliberative and voting participation is a longstanding endeavor -- a cornerstone for direct democracy and legitimate collective choice. Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) provide unprecedented opportunities, but also alerting risks for digital democracy. AI personal assistants can overcome cognitive bandwidth limitations of humans, providing decision support capabilities or even direct AI representation of human voters at large scale. However, the quality of this representation and what underlying biases manifest when delegating collective decision making to LLMs is an alarming and timely challenge to tackle. By rigorously emulating with high realism more than >50K LLM voting personas in 81 real-world voting elections, we show that different LLMs (GPT 3, GPT 3.5, and Llama2) come with biases and significant inconsistencies in complex preferential ballot formats, compared to simpler and more consistent majoritarian elections. Strikingly, fair voting aggregation methods, such as equal shares, prove to be a win-win: fairer voting outcomes for humans with fairer AI representation. This novel underlying relationship proves paramount for democratic resilience in progressives scenarios with low voters turnout and voter fatigue supported by AI representatives: abstained voters are mitigated by recovering highly representative voting outcomes that are fairer. These insights provide remarkable foundations for science, policymakers and citizens in explaining and mitigating AI risks in democratic innovations. △ Less

Submitted 30 May, 2024; originally announced June 2024.

Comments: 35 pages, 10 figures

arXiv:2406.11704 [pdf, other]

Nemotron-4 340B Technical Report

Authors: Nvidia, :, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek , et al. (58 additional authors not shown)

Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be… ▽ More We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation benchmarks, and were sized to fit on a single DGX H100 with 8 GPUs when deployed in FP8 precision. We believe that the community can benefit from these models in various research studies and commercial applications, especially for generating synthetic data to train smaller language models. Notably, over 98% of data used in our model alignment process is synthetically generated, showcasing the effectiveness of these models in generating synthetic data. To further support open research and facilitate model development, we are also open-sourcing the synthetic data generation pipeline used in our model alignment process. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.11036 [pdf, other]

garak: A Framework for Security Probing Large Language Models

Authors: Leon Derczynski, Erick Galinkin, Jeffrey Martin, Subho Majumdar, Nanna Inie

Abstract: As Large Language Models (LLMs) are deployed and integrated into thousands of applications, the need for scalable evaluation of how models respond to adversarial attacks grows rapidly. However, LLM security is a moving target: models produce unpredictable output, are constantly updated, and the potential adversary is highly diverse: anyone with access to the internet and a decent command of natura… ▽ More As Large Language Models (LLMs) are deployed and integrated into thousands of applications, the need for scalable evaluation of how models respond to adversarial attacks grows rapidly. However, LLM security is a moving target: models produce unpredictable output, are constantly updated, and the potential adversary is highly diverse: anyone with access to the internet and a decent command of natural language. Further, what constitutes a security weak in one context may not be an issue in a different context; one-fits-all guardrails remain theoretical. In this paper, we argue that it is time to rethink what constitutes ``LLM security'', and pursue a holistic approach to LLM security evaluation, where exploration and discovery of issues are central. To this end, this paper introduces garak (Generative AI Red-teaming and Assessment Kit), a framework which can be used to discover and identify vulnerabilities in a target LLM or dialog system. garak probes an LLM in a structured fashion to discover potential vulnerabilities. The outputs of the framework describe a target model's weaknesses, contribute to an informed discussion of what composes vulnerabilities in unique contexts, and can inform alignment and policy discussions for LLM deployment. △ Less

Submitted 16 June, 2024; originally announced June 2024.

Comments: https://garak.ai

arXiv:2406.10353 [pdf, other]

On the Carrollian Nature of the Light Front

Authors: Sucheta Majumdar

Abstract: Motivated by recent advances in non-Lorentzian physics, we revisit the light-cone formulation of quantum field theories. We discuss some interesting subalgebras within the light-cone Poincaré algebra, with a key emphasis on the Carroll, Bargmann, and Galilean kinds. We show that theories on the light front possess a Hamiltonian of the magnetic Carroll type, thereby proposing a straightforward meth… ▽ More Motivated by recent advances in non-Lorentzian physics, we revisit the light-cone formulation of quantum field theories. We discuss some interesting subalgebras within the light-cone Poincaré algebra, with a key emphasis on the Carroll, Bargmann, and Galilean kinds. We show that theories on the light front possess a Hamiltonian of the magnetic Carroll type, thereby proposing a straightforward method for deriving magnetic Carroll Hamiltonian actions from Lorentzian field theories. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Comments: To appear in the World Scientific volume dedicated to the memory of Lars Brink, 16 pages, 1 figure, 3 tables

arXiv:2406.09111 [pdf, other]

Optimal demonstration of generalized quantum contextuality

Authors: Soumyabrata Hazra, Debashis Saha, Anubhav Chaturvedi, Subhankar Bera, A. S. Majumdar

Abstract: Finding a set of empirical criteria fulfilled by any theory that satisfies the generalized notion of noncontextuality is a challenging task of both operational and foundational importance. The conventional approach of deriving facet inequalities from the relevant noncontextual polytope is computationally demanding. Specifically, the noncontextual polytope is a product of two polytopes, one for pre… ▽ More Finding a set of empirical criteria fulfilled by any theory that satisfies the generalized notion of noncontextuality is a challenging task of both operational and foundational importance. The conventional approach of deriving facet inequalities from the relevant noncontextual polytope is computationally demanding. Specifically, the noncontextual polytope is a product of two polytopes, one for preparations and the other for measurements, and the dimension of the former typically increases polynomially with the number of measurements. This work presents an alternative methodology for constructing a polytope that encompasses the actual noncontextual polytope while ensuring that the dimension of the polytope associated with the preparations remains constant regardless of the number of measurements and their outcome size. In particular, the facet inequalities of this polytope serve as necessary conditions for noncontextuality. To demonstrate the efficacy of our methodology, we apply it to nine distinct contextuality scenarios involving four to nine preparations and two to three measurements to obtain the respective sets of facet inequalities. Additionally, we retrieve the maximum quantum violations of these inequalities. Our investigation uncovers many novel non-trivial noncontextuality inequalities and reveals intriguing aspects and applications of quantum contextual correlations. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: 18 pages, 9 figures

arXiv:2406.08387 [pdf, other]

Resetting by rescaling: exact results for a diffusing particle in one-dimension

Authors: Marco Biroli, Yannick Feld, Alexander K. Hartmann, Satya N. Majumdar, Gregory Schehr

Abstract: In this paper, we study a simple model of a diffusive particle on a line, undergoing a stochastic resetting with rate $r$, via rescaling its current position by a factor $a$, which can be either positive or negative. For $|a|<1$, the position distribution becomes stationary at long times and we compute this limiting distribution exactly for all $|a|<1$. This symmetric distribution has a Gaussian s… ▽ More In this paper, we study a simple model of a diffusive particle on a line, undergoing a stochastic resetting with rate $r$, via rescaling its current position by a factor $a$, which can be either positive or negative. For $|a|<1$, the position distribution becomes stationary at long times and we compute this limiting distribution exactly for all $|a|<1$. This symmetric distribution has a Gaussian shape near its peak at $x=0$, but decays exponentially for large $|x|$. We also studied the mean first-passage time (MFPT) $T(0)$ to a target located at a distance $L$ from the initial position (the origin) of the particle. As a function of the initial position $x$, the MFPT $T(x)$ satisfies a nonlocal second order differential equation and we have solved it explicitly for $0 \leq a < 1$. For $-1<a\leq 0$, we also solved it analytically but up to a constant factor $κ$ whose value can be determined independently from numerical simulations. Our results show that, for all $-1<a<1$, the MFPT $T(0)$ (starting from the origin) shows a minimum at $r=r^*(a)$. However, the optimised MFPT $T_{\rm opt}(a)$ turns out to be a monotonically increasing function of $a$ for $-1<a<1$. This demonstrates that, compared to the standard resetting to the origin ($a=0$), while the positive rescaling is not beneficial for the search of a target, the negative rescaling is. Thus resetting via rescaling followed by a reflection around the origin expedites the search of a target in one dimension. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 19 pages, 8 figures

arXiv:2406.03118 [pdf, other]

Impact of the Epoch of Reionization sources on the 21-cm bispectrum

Authors: Leon Noble, Mohd Kamran, Suman Majumdar, Chandra Shekhar Murmu, Raghunath Ghara, Garrelt Mellema, Ilian T. Iliev, Jonathan R. Pritchard

Abstract: The morphology of the 21-cm signal emitted by the neutral hydrogen present in the intergalactic medium (IGM) during the Epoch of Reionization (EoR) depends both on the properties of the sources of ionizing radiation and on the underlying physical processes within the IGM. Variation in the morphology of the IGM 21-cm signal due to the different sources of the EoR is expected to have a significant i… ▽ More The morphology of the 21-cm signal emitted by the neutral hydrogen present in the intergalactic medium (IGM) during the Epoch of Reionization (EoR) depends both on the properties of the sources of ionizing radiation and on the underlying physical processes within the IGM. Variation in the morphology of the IGM 21-cm signal due to the different sources of the EoR is expected to have a significant impact on the 21-cm bispectrum, which is one of the crucial observable statistics that can evaluate the non-Gaussianity present in the signal and which can be estimated from radio interferometric observations of the EoR. Here we present the 21-cm bispectrum for different reionization scenarios assuming different simulated models for the sources of reionization. We also demonstrate how well the 21-cm bispectrum can distinguish between different IGM 21-cm signal morphologies, arising due to the differences in the reionization scenarios, which will help us shed light on the nature of the sources of ionizing photons. Our estimated large-scale bispectrum for all unique $k$-triangle shapes shows a significant difference in their magnitude and sign across different reionization scenarios. Additionally, our focused analysis of bispectrum for a few specific $k$-triangle shapes (e.g. squeezed-limit, linear, and shapes in the vicinity of the squeezed-limit) shows that the large scale 21-cm bispectrum can distinguish between reionization scenarios that show inside-out, outside-in and a combination of inside-out and outside-in morphologies. These results highlight the potential of using the 21-cm bispectrum for constraining different reionization scenarios. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: 28 pages, 7 figures, comments are welcome, prepared for submission to JCAP

arXiv:2405.20955 [pdf, other]

Number of distinct and common sites visited by $N$ independent random walkers

Authors: Satya N. Majumdar, Gregory Schehr

Abstract: In this Chapter, we consider a model of $N$ independent random walkers, each of duration $t$, and each starting from the origin, on a lattice in $d$ dimensions. We focus on two observables, namely $D_N(t)$ and $C_N(t)$ denoting respectively the number of distinct and common sites visited by the walkers. For large $t$, where the lattice random walkers converge to independent Brownian motions, we co… ▽ More In this Chapter, we consider a model of $N$ independent random walkers, each of duration $t$, and each starting from the origin, on a lattice in $d$ dimensions. We focus on two observables, namely $D_N(t)$ and $C_N(t)$ denoting respectively the number of distinct and common sites visited by the walkers. For large $t$, where the lattice random walkers converge to independent Brownian motions, we compute exactly the mean $\langle D_N(t) \rangle$ and $\langle C_N(t) \rangle$. Our main interest is on the $N$-dependence of these quantities. While for $\langle D_N(t) \rangle$ the $N$-dependence only appears in the prefactor of the power-law growth with time, a more interesting behavior emerges for $\langle C_N(t) \rangle$. For this latter case, we show that there is a ``phase transition'' in the $(N, d)$ plane where the two critical line $d=2$ and $d=d_c(N) = 2N/(N-1)$ separate three phases of the growth of $\langle C_N(t)\rangle$. The results are extended to the mean number of sites visited exactly by $K$ of the $N$ walkers. Furthermore in $d=1$, the full distribution of $D_N(t)$ and $C_N(t)$ are computed, exploiting a map** to the extreme value statistics. Extensions to two other models, namely $N$ independent Brownian bridges and $N$ independent resetting Brownian motions/bridges are also discussed. △ Less

Submitted 31 May, 2024; originally announced May 2024.

Comments: 21 pages, 5 figures. Contribution to the book "The Mathematics of Movement: an Interdisciplinary Approach to Mutual Challenges in Animal Ecology and Cell Biology" edited by Luca Giuggioli and Philip Maini

arXiv:2405.18195 [pdf, other]

Importance Sampling for counting statistics in one-dimensional systems

Authors: Ivan N. Burenev, Satya N. Majumdar, Alberto Rosso

Abstract: In this paper, we consider the problem of numerical investigation of the counting statistics for a class of one-dimensional systems. Importance sampling, the cornerstone technique usually implemented for such problems, critically hinges on selecting an appropriate biased distribution. While exponential tilt in the observable stands as the conventional choice for various problems, its efficiency in… ▽ More In this paper, we consider the problem of numerical investigation of the counting statistics for a class of one-dimensional systems. Importance sampling, the cornerstone technique usually implemented for such problems, critically hinges on selecting an appropriate biased distribution. While exponential tilt in the observable stands as the conventional choice for various problems, its efficiency in the context of counting statistics may be significantly hindered by the genuine discreteness of the observable. To address this challenge, we propose an alternative strategy which we call importance sampling with the local tilt. We demonstrate the efficiency of the proposed approach through the analysis of three prototypical examples: a set of independent Gaussian random variables, Dyson gas, and Symmetric Simple Exclusion Process (SSEP) with a steplike initial condition. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: 11 pages, 13 figures

arXiv:2405.14577 [pdf, other]

Representation noising effectively prevents harmful fine-tuning on LLMs

Authors: Domenic Rosati, Jan Wehner, Kai Williams, Łukasz Bartoszcze, David Atanasov, Robie Gonzales, Subhabrata Majumdar, Carsten Maple, Hassan Sajjad, Frank Rudzicz

Abstract: Releasing open-source large language models (LLMs) presents a dual-use risk since bad actors can easily fine-tune these models for harmful purposes. Even without the open release of weights, weight stealing and fine-tuning APIs make closed models vulnerable to harmful fine-tuning attacks (HFAs). While safety measures like preventing jailbreaks and improving safety guardrails are important, such me… ▽ More Releasing open-source large language models (LLMs) presents a dual-use risk since bad actors can easily fine-tune these models for harmful purposes. Even without the open release of weights, weight stealing and fine-tuning APIs make closed models vulnerable to harmful fine-tuning attacks (HFAs). While safety measures like preventing jailbreaks and improving safety guardrails are important, such measures can easily be reversed through fine-tuning. In this work, we propose Representation Noising (RepNoise), a defence mechanism that is effective even when attackers have access to the weights and the defender no longer has any control. RepNoise works by removing information about harmful representations such that it is difficult to recover them during fine-tuning. Importantly, our defence is also able to generalize across different subsets of harm that have not been seen during the defence process. Our method does not degrade the general capability of LLMs and retains the ability to train the model on harmless tasks. We provide empirical evidence that the effectiveness of our defence lies in its "depth": the degree to which information about harmful representations is removed across all layers of the LLM. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.10810 [pdf, other]

Flux rope modeling of the 2022 Sep 5 CME observed by Parker Solar Probe and Solar Orbiter from 0.07 to 0.69 au

Authors: Emma E. Davies, Hannah T. Rüdisser, Ute V. Amerstorfer, Christian Möstl, Maike Bauer, Eva Weiler, Tanja Amerstorfer, Satabdwa Majumdar, Phillip Hess, Andreas J. Weiss, Martin A. Reiss, Lucie M. Green, David M. Long, Teresa Nieves-Chinchilla, Domenico Trotta, Timothy S. Horbury, Helen O'Brien, Edward Fauchon-Jones, Jean Morris, Christopher J. Owen, Stuart D. Bale, Justin C. Kasper

Abstract: As both Parker Solar Probe (PSP) and Solar Orbiter (SolO) reach heliocentric distances closer to the Sun, they present an exciting opportunity to study the structure of CMEs in the inner heliosphere. We present an analysis of the global flux rope structure of the 2022 September 5 CME event that impacted PSP at a heliocentric distance of only 0.07 au and SolO at 0.69 au. We compare in situ measurem… ▽ More As both Parker Solar Probe (PSP) and Solar Orbiter (SolO) reach heliocentric distances closer to the Sun, they present an exciting opportunity to study the structure of CMEs in the inner heliosphere. We present an analysis of the global flux rope structure of the 2022 September 5 CME event that impacted PSP at a heliocentric distance of only 0.07 au and SolO at 0.69 au. We compare in situ measurements at PSP and SolO to determine global and local expansion measures, finding a good agreement between magnetic field relationships with heliocentric distance, but significant differences with respect to flux rope size. We use PSP/WISPR images as input to the ELEvoHI model, providing a direct link between remote and in situ observations; we find a large discrepancy between the resulting modeled arrival times, suggesting that the underlying model assumptions may not be suitable when using data obtained close to the Sun, where the drag regime is markedly different in comparison to larger heliocentric distances. Finally, we fit the SolO/MAG and PSP/FIELDS data independently with the 3DCORE model and find that many parameters are consistent between spacecraft, however, challenges are apparent when reconstructing a global 3D structure that aligns with arrival times at PSP and Solar Orbiter, likely due to the large radial and longitudinal separations between spacecraft. From our model results, it is clear the solar wind background speed and drag regime strongly affects the modeled expansion and propagation of CMEs and needs to be taken into consideration. △ Less

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2405.10283 [pdf, other]

Power-law relaxation of a confined diffusing particle subject to resetting with memory

Authors: Denis Boyer, Satya N. Majumdar

Abstract: We study the relaxation of a Brownian particle with long range memory under confinement in one dimension. The particle diffuses in an arbitrary confining potential and resets at random times to previously visited positions, chosen with a probability proportional to the local time spent there by the particle since the initial time. This model mimics an animal which moves erratically in its home ran… ▽ More We study the relaxation of a Brownian particle with long range memory under confinement in one dimension. The particle diffuses in an arbitrary confining potential and resets at random times to previously visited positions, chosen with a probability proportional to the local time spent there by the particle since the initial time. This model mimics an animal which moves erratically in its home range and returns preferentially to familiar places from time to time, as observed in nature. The steady state density of the position is given by the equilibrium Boltzmann-Gibbs distribution, as in standard diffusion, while the transient part of the density can be obtained through a map** of the Fokker-Planck equation of the process to a Schrödinger eigenvalue problem. Due to memory, the approach at large time toward the steady state is critically self-organised, in the sense that it always follows a sluggish power-law form, in contrast to the exponential decay that characterises Markov processes. The exponent of this power-law depends in a simple way on the resetting rate and on the leading relaxation rate of the Brownian particle in the absence of resetting. We apply these findings to several exactly solvable examples, such as the harmonic, V-shaped and box potentials. △ Less

Submitted 22 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: 19 pages, 3 figures

arXiv:2405.05495 [pdf, other]

PARSAC: Fast, Human-quality Floorplanning for Modern SoCs with Complex Design Constraints

Authors: Hesham Mostafa, Uday Mallappa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

Abstract: The floorplanning of Systems-on-a-Chip (SoCs) and of chip sub-systems is a crucial step in the physical design flow as it determines the optimal shapes and locations of the blocks that make up the system. Simulated Annealing (SA) has been the method of choice for tackling classical floorplanning problems where the objective is to minimize wire-length and the total placement area. The goal in indus… ▽ More The floorplanning of Systems-on-a-Chip (SoCs) and of chip sub-systems is a crucial step in the physical design flow as it determines the optimal shapes and locations of the blocks that make up the system. Simulated Annealing (SA) has been the method of choice for tackling classical floorplanning problems where the objective is to minimize wire-length and the total placement area. The goal in industry-relevant floorplanning problems, however, is not only to minimize area and wire-length, but to do that while respecting hard placement constraints that specify the general area and/or the specific locations for the placement of some blocks. We show that simply incorporating these constraints into the SA objective function leads to sub-optimal, and often illegal, solutions. We propose the Constraints-Aware Simulated Annealing (CA-SA) method and show that it strongly outperforms vanilla SA in floorplanning problems with hard placement constraints. We developed a new floorplanning tool on top of CA-SA: PARSAC (Parallel Simulated Annealing with Constraints). PARSAC is an efficient, easy-to-use, and massively parallel floorplanner. Unlike current SA-based or learning-based floorplanning tools that cannot effectively incorporate hard placement-constraints, PARSAC can quickly construct the Pareto-optimal legal solutions front for constrained floorplanning problems. PARSAC also outperforms traditional SA on legacy floorplanning benchmarks. PARSAC is available as an open-source repository for researchers to replicate and build on our result. △ Less

Submitted 8 May, 2024; originally announced May 2024.

Comments: 9 pages, 7 figures

arXiv:2405.05480 [pdf, other]

FloorSet -- a VLSI Floorplanning Dataset with Design Constraints of Real-World SoCs

Authors: Uday Mallappa, Hesham Mostafa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

Abstract: Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of nearly 10E250. As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark th… ▽ More Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of nearly 10E250. As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark that comprises a large training dataset and performance metrics that better reflect real-world constraints and objectives compared to existing benchmarks. To address this need, we present FloorSet -- two comprehensive datasets of synthetic fixed-outline floorplan layouts that reflect the distribution of real SoCs. Each dataset has 1M training samples and 100 test samples where each sample is a synthetic floor-plan. FloorSet-Prime comprises fully-abutted rectilinear partitions and near-optimal wire-length. A simplified dataset that reflects early design phases, FloorSet-Lite comprises rectangular partitions, with under 5 percent white-space and near-optimal wire-length. Both datasets define hard constraints seen in modern design flows such as shape constraints, edge-affinity, grou** constraints, and pre-placement constraints. FloorSet is intended to spur fundamental research on large-scale constrained optimization problems. Crucially, FloorSet alleviates the core issue of reproducibility in modern ML driven solutions to such problems. FloorSet is available as an open-source repository for the research community. △ Less

Submitted 27 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

Comments: 10 pages, 11 figures

arXiv:2405.05085 [pdf, other]

Fair Voting Outcomes with Impact and Novelty Compromises? Unraveling Biases of Equal Shares in Participatory Budgeting

Authors: Sajan Maharjan, Srijoni Majumdar, Evangelos Pournaras

Abstract: Participatory budgeting, as a paradigm for democratic innovations, engages citizens in the distribution of a public budget to projects, which they propose and vote for implementation. So far, voting algorithms have been devised and studied in social choice literature to elect projects that are popular, while others prioritize on a proportional representation of voters' preferences, for instance, e… ▽ More Participatory budgeting, as a paradigm for democratic innovations, engages citizens in the distribution of a public budget to projects, which they propose and vote for implementation. So far, voting algorithms have been devised and studied in social choice literature to elect projects that are popular, while others prioritize on a proportional representation of voters' preferences, for instance, equal shares. However, the anticipated impact and novelty in the broader society by the winning projects, as selected by different algorithms, remains totally under-explored, lacking both a universal theory of impact for voting and a rigorous framework for impact and novelty assessments. This papers tackles this grand challenge towards new axiomatic foundations for designing effective and fair voting methods. This is via new and striking insights derived from a large-scale analysis of biases over 345 real-world voting outcomes, characterized for the first time by a novel portfolio of impact and novelty metrics. We find strong causal evidence that equal shares comes with impact loss in several infrastructural projects of different cost levels that have been so far over-represented. However, it also comes with a novel, yet over-represented, impact gain in welfare, education and culture. We discuss broader implications of these results and how impact loss can be mitigated at the stage of campaign design and project ideation. △ Less

Submitted 9 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

Comments: 23 pages, 9 figures

arXiv:2405.02927 [pdf, other]

doi 10.1093/mnras/stae1186

Project Hephaistos - II. Dyson sphere candidates from Gaia DR3, 2MASS, and WISE

Authors: Matías Suazo, Erik Zackrisson, Priyatam K. Mahto, Fabian Lundell, Carl Nettelblad, Andreas J. Korn, Jason T. Wright, Suman Majumdar

Abstract: The search for extraterrestrial intelligence is currently being pursued using multiple techniques and in different wavelength bands. Dyson spheres, megastructures that could be constructed by advanced civilizations to harness the radiation energy of their host stars, represent a potential technosignature, that in principle may be hiding in public data already collected as part of large astronomica… ▽ More The search for extraterrestrial intelligence is currently being pursued using multiple techniques and in different wavelength bands. Dyson spheres, megastructures that could be constructed by advanced civilizations to harness the radiation energy of their host stars, represent a potential technosignature, that in principle may be hiding in public data already collected as part of large astronomical surveys. In this study, we present a comprehensive search for partial Dyson spheres by analyzing optical and infrared observations from Gaia, 2MASS, and WISE. We develop a pipeline that employs multiple filters to identify potential candidates and reject interlopers in a sample of five million objects, which incorporates a convolutional neural network to help identify confusion in WISE data. Finally, the pipeline identifies 7 candidates deserving of further analysis. All of these objects are M-dwarfs, for which astrophysical phenomena cannot easily account for the observed infrared excess emission. △ Less

Submitted 5 May, 2024; originally announced May 2024.

Comments: Accepted to be published in MNRAS

arXiv:2404.16368 [pdf, ps, other]

doi 10.1103/PhysRevB.109.134428

Does carrier localization affect the anomalous Hall effect?

Authors: Prasanta Chowdhury, Mohamad Numan, Shuvankar Gupta, Souvik Chatterjee, Saurav Giri, Subham Majumdar

Abstract: The effect of carrier localization due to electron-electron interaction in anomalous Hall effect is elusive and there are contradictory results in the literature. To address the issue, we report here the detailed transport study including the Hall measurements on $β$-Mn type cubic compound Co$_7$Zn$_7$Mn$_6$ with chiral crystal structure, which lacks global mirror symmetry. The alloy orders magnet… ▽ More The effect of carrier localization due to electron-electron interaction in anomalous Hall effect is elusive and there are contradictory results in the literature. To address the issue, we report here the detailed transport study including the Hall measurements on $β$-Mn type cubic compound Co$_7$Zn$_7$Mn$_6$ with chiral crystal structure, which lacks global mirror symmetry. The alloy orders magnetically below $T_c$ = 204 K, and reported to show spin glass state at low temperature. The longitudinal resistivity ($ρ_{xx}$) shows a pronounced upturn below $T_{min}$ = 75 K, which is found to be associated with carrier localization due to quantum interference effect. The upturn in $ρ_{xx}$ shows a $T^{1/2}$ dependence and it is practically insensitive to the externally applied magnetic field, which indicate that electron-electron interaction is primarily responsible for the low-$T$ upturn. The studied sample shows considerable value of anomalous Hall effect below $T_c$. We found that the localization effect is present in the ordinary Hall coefficient ($R_0$), but we failed to observe any signature of localization in the anomalous Hall resistivity or conductivity. The absence of localization effect in the anomalous Hall effect in Co$_7$Zn$_7$Mn$_6$ may be due to large carrier density, and it warrants further theoretical investigations, particularly with systems having broken mirror symmetry. △ Less

Submitted 25 April, 2024; originally announced April 2024.

Comments: 9 pages, 5 figures

Journal ref: Phys. Rev. B 109, 134428 (2024)

arXiv:2404.05482 [pdf, other]

WaveCatBoost for Probabilistic Forecasting of Regional Air Quality Data

Authors: **tu Borah, Tanujit Chakraborty, Md. Shahrul Md. Nadzir, Mylene G. Cayetano, Shubhankar Majumdar

Abstract: Accurate and reliable air quality forecasting is essential for protecting public health, sustainable development, pollution control, and enhanced urban planning. This letter presents a novel WaveCatBoost architecture designed to forecast the real-time concentrations of air pollutants by combining the maximal overlap** discrete wavelet transform (MODWT) with the CatBoost model. This hybrid approa… ▽ More Accurate and reliable air quality forecasting is essential for protecting public health, sustainable development, pollution control, and enhanced urban planning. This letter presents a novel WaveCatBoost architecture designed to forecast the real-time concentrations of air pollutants by combining the maximal overlap** discrete wavelet transform (MODWT) with the CatBoost model. This hybrid approach efficiently transforms time series into high-frequency and low-frequency components, thereby extracting signal from noise and improving prediction accuracy and robustness. Evaluation of two distinct regional datasets, from the Central Air Pollution Control Board (CPCB) sensor network and a low-cost air quality sensor system (LAQS), underscores the superior performance of our proposed methodology in real-time forecasting compared to the state-of-the-art statistical and deep learning architectures. Moreover, we employ a conformal prediction strategy to provide probabilistic bands with our forecasts. △ Less

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.02480 [pdf, other]

Noninteracting particles in a harmonic trap with a stochastically driven center

Authors: Sanjib Sabhapandit, Satya N. Majumdar

Abstract: We study a system of $N$ noninteracting particles on a line in the presence of a harmonic trap $U(x)=μ\bigl[x-z(t)\bigr]^2/2$, where the trap center $z(t)$ undergoes a bounded stochastic modulation. We show that this stochastic modulation drives the system into a nonequilibrium stationary state, where the joint distribution of the positions of the particles is not factorizable. This indicates stro… ▽ More We study a system of $N$ noninteracting particles on a line in the presence of a harmonic trap $U(x)=μ\bigl[x-z(t)\bigr]^2/2$, where the trap center $z(t)$ undergoes a bounded stochastic modulation. We show that this stochastic modulation drives the system into a nonequilibrium stationary state, where the joint distribution of the positions of the particles is not factorizable. This indicates strong correlations between the positions of the particles that are not inbuilt, but rather get generated by the dynamics itself. Moreover, we show that the stationary joint distribution can be fully characterized and has a special conditionally independent and identically distributed (CIID) structure. This special structure allows us to compute several observables analytically even in such a strongly correlated system, for an arbitrary bounded drive $z(t)$. These observables include the average density profile, the correlations between particle positions, the order and gap statistics, as well as the full counting statistics. We then apply our general results to two specific examples where (i) $z(t)$ represents a dichotomous telegraphic noise, and (ii) $z(t)$ represents an Ornstein-Uhlenbeck process. Our analytical predictions are verified in numerical simulations, finding excellent agreement. △ Less

Submitted 3 April, 2024; originally announced April 2024.

Comments: 29 pages, 8 figures

arXiv:2404.00215 [pdf, other]

Minimizing the Profligacy of Searches with Reset

Authors: John C. Sunil, Richard A. Blythe, Martin R. Evans, Satya N. Majumdar

Abstract: We introduce the profligacy of a search process as a competition between its expected cost and the probability of finding the target. The arbiter of the competition is a parameter $λ$ that represents how much a searcher invests into increasing the chance of success. Minimizing the profligacy with respect to the search strategy specifies the optimal search. We show that in the case of diffusion wit… ▽ More We introduce the profligacy of a search process as a competition between its expected cost and the probability of finding the target. The arbiter of the competition is a parameter $λ$ that represents how much a searcher invests into increasing the chance of success. Minimizing the profligacy with respect to the search strategy specifies the optimal search. We show that in the case of diffusion with stochastic resetting, the amount of resetting in the optimal strategy has a highly nontrivial dependence on model parameters resulting in classical continuous transitions, discontinuous transitions and tricritical points as well as non-standard discontinuous transitions exhibiting re-entrant behavior and overhangs. △ Less

Submitted 29 March, 2024; originally announced April 2024.

Comments: 15 pages (6 pages main text, 9 Pages supplemental material), 8 figures (3 figures in main text, 5 figures in supplemental material)

arXiv:2403.18750 [pdf, other]

Full counting statistics of 1d short-range Riesz gases in confinement

Authors: Jitendra Kethepalli, Manas Kulkarni, Anupam Kundu, Satya N. Majumdar, David Mukamel, Grégory Schehr

Abstract: We investigate the full counting statistics (FCS) of a harmonically confined 1d short-range Riesz gas consisting of $N$ particles in equilibrium at finite temperature. The particles interact with each other through a repulsive power-law interaction with an exponent $k>1$ which includes the Calogero-Moser model for $k=2$. We examine the probability distribution of the number of particles in a finit… ▽ More We investigate the full counting statistics (FCS) of a harmonically confined 1d short-range Riesz gas consisting of $N$ particles in equilibrium at finite temperature. The particles interact with each other through a repulsive power-law interaction with an exponent $k>1$ which includes the Calogero-Moser model for $k=2$. We examine the probability distribution of the number of particles in a finite domain $[-W, W]$ called number distribution, denoted by $\mathcal{N}(W, N)$. We analyze the probability distribution of $\mathcal{N}(W, N)$ and show that it exhibits a large deviation form for large $N$ characterised by a speed $N^{\frac{3k+2}{k+2}}$ and by a large deviation function of the fraction $c = \mathcal{N}(W, N)/N$ of the particles inside the domain and $W$. We show that the density profiles that create the large deviations display interesting shape transitions as one varies $c$ and $W$. This is manifested by a third-order phase transition exhibited by the large deviation function that has discontinuous third derivatives. Monte-Carlo (MC) simulations show good agreement with our analytical expressions for the corresponding density profiles. We find that the typical fluctuations of $\mathcal{N}(W, N)$, obtained from our field theoretic calculations are Gaussian distributed with a variance that scales as $N^{ν_k}$, with $ν_k = (2-k)/(2+k)$. We also present some numerical findings on the mean and the variance. Furthermore, we adapt our formalism to study the index distribution (where the domain is semi-infinite $(-\infty, W])$, linear statistics (the variance), thermodynamic pressure and bulk modulus. △ Less

Submitted 27 March, 2024; originally announced March 2024.

Comments: 36 pages, 7 figures

arXiv:2403.16152 [pdf, other]

Cost of excursions until first crossing of the origin for random walks and Lévy flights: an exact general formula

Authors: Francesco Mori, Satya N. Majumdar, Pierpaolo Vivo

Abstract: We consider a discrete-time random walk on a line starting at $x_0\geq 0$ where a cost is incurred at each jump. We obtain an exact analytical formula for the distribution of the total cost of a trajectory until the process crosses the origin for the first time. The formula is valid for arbitrary jump distribution and cost function (heavy- and light-tailed alike), provided they are symmetric and c… ▽ More We consider a discrete-time random walk on a line starting at $x_0\geq 0$ where a cost is incurred at each jump. We obtain an exact analytical formula for the distribution of the total cost of a trajectory until the process crosses the origin for the first time. The formula is valid for arbitrary jump distribution and cost function (heavy- and light-tailed alike), provided they are symmetric and continuous. We analyze the formula in different scaling regimes, and find a high degree of universality with respect to the details of the jump distribution and the cost function. Applications are given to the motion of an active run-and-tumble particle in one dimension and extensions to multiple cost variables are considered. The analytical results are in perfect agreement with numerical simulations. △ Less

Submitted 20 May, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

Comments: 27 pages, 9 figures. Extended version which includes a detailed analysis of the $x_0>0$ case

arXiv:2403.06964 [pdf, other]

Decorrelation of a leader by the increasing number of followers

Authors: Satya N. Majumdar, Gregory Schehr

Abstract: We compute the connected two-time correlator of the maximum $M_N(t)$ of $N$ independent Gaussian stochastic processes (GSP) characterised by a common correlation coefficient $ρ$ that depends on the two times $t_1$ and $t_2$. We show analytically that this correlator, for fixed times $t_1$ and $t_2$, decays for large $N$ as a power law $N^{-γ}$ (with logarithmic corrections) with a decorrelation ex… ▽ More We compute the connected two-time correlator of the maximum $M_N(t)$ of $N$ independent Gaussian stochastic processes (GSP) characterised by a common correlation coefficient $ρ$ that depends on the two times $t_1$ and $t_2$. We show analytically that this correlator, for fixed times $t_1$ and $t_2$, decays for large $N$ as a power law $N^{-γ}$ (with logarithmic corrections) with a decorrelation exponent $γ= (1-ρ)/(1+ ρ)$ that depends only on $ρ$, but otherwise is universal for any GSP. We study several examples of physical processes including the fractional Brownian motion (fBm) with Hurst exponent $H$ and the Ornstein-Uhlenbeck (OU) process. For the fBm, $ρ$ is only a function of $τ= \sqrt{t_1/t_2}$ and we find an interesting ``freezing'' transition at a critical value $τ= τ_c=(3-\sqrt{5})/2$. For $τ< τ_c$, there is an optimal $H^*(τ) > 0$ that maximises the exponent $γ$ and this maximal value freezes to $γ= 1/3$ for $τ>τ_c$. For the OU process, we show that $γ= {\rm tanh}(μ\,|t_1-t_2|/2)$ where $μ$ is the stiffness of the harmonic trap. Numerical simulations confirm our analytical predictions. △ Less

Submitted 11 March, 2024; originally announced March 2024.

Comments: Main text: 6 pages, 3 figures. Supplementary material: 10 pages

arXiv:2402.07961 [pdf, other]

doi 10.1029/2023SW003805

Correcting Projection Effects in CMEs using GCS-based Large Statistics of Multi-viewpoint Observations

Authors: Harshita Gandhi, Ritesh Patel, Vaibhav Pant, Satabdwa Majumdar, Sanchita Pal, Dipankar Banerjee, Huw Morgan

Abstract: This study addresses the limitations of single-viewpoint observations of Coronal Mass Ejections (CMEs) by presenting results from a 3D catalog of 360 CMEs during solar cycle 24, fitted using the GCS model. The dataset combines 326 previously analyzed CMEs and 34 newly examined events, categorized by their source regions into active region (AR) eruptions, active prominence (AP) eruptions, and promi… ▽ More This study addresses the limitations of single-viewpoint observations of Coronal Mass Ejections (CMEs) by presenting results from a 3D catalog of 360 CMEs during solar cycle 24, fitted using the GCS model. The dataset combines 326 previously analyzed CMEs and 34 newly examined events, categorized by their source regions into active region (AR) eruptions, active prominence (AP) eruptions, and prominence eruptions (PE). Estimates of errors are made using a bootstrap** approach. The findings highlight that the average 3D speed of CMEs is $\sim$1.3 times greater than the 2D speed. PE CMEs tend to be slow, with an average speed of 432 km $s^{-1}$. AR and AP speeds are higher, at 723 km $s^{-1}$ and 813 km $s^{-1}$, respectively, with the latter having fewer slow CMEs. The distinctive behavior of AP CMEs is attributed to factors like overlying magnetic field distribution or geometric complexities leading to less accurate GCS fits. A linear fit of projected speed to width gives a gradient of 2 km $s^{-1}deg^{-1}$, which increases to 5 km $s^{-1}deg^{-1}$ when the GCS-fitted `true' parameters are used. Notably, AR CMEs exhibit a high gradient of 7 km $s^{-1}deg^{-1}$, while AP CMEs show a gradient of 4 km $s^{-1}deg^{-1}$. PE CMEs, however, lack a significant speed-width relationship. We show that fitting multi-viewpoint CME images to a geometrical model such as GCS is important to study the statistical properties of CMEs, and can lead to a deeper insight into CME behavior that is essential for improving future space weather forecasting. △ Less

Submitted 11 February, 2024; originally announced February 2024.

Comments: Accepted for publication in Space Weather Journal

arXiv:2402.04215 [pdf, other]

Universal distribution of the number of minima for random walks and Lévy flights

Authors: Anupam Kundu, Satya N. Majumdar, Gregory Schehr

Abstract: We compute exactly the full distribution of the number $m$ of local minima in a one-dimensional landscape generated by a random walk or a Lévy flight. We consider two different ensembles of landscapes, one with a fixed number of steps $N$ and the other till the first-passage time of the random walk to the origin. We show that the distribution of $m$ is drastically different in the two ensembles (G… ▽ More We compute exactly the full distribution of the number $m$ of local minima in a one-dimensional landscape generated by a random walk or a Lévy flight. We consider two different ensembles of landscapes, one with a fixed number of steps $N$ and the other till the first-passage time of the random walk to the origin. We show that the distribution of $m$ is drastically different in the two ensembles (Gaussian in the former case, while having a power-law tail in the latter $m^{-3/2}$ in the latter case). However, the most striking aspect of our results is that, in each case, the distribution is completely universal for all $m$ (and not just for large $m$), i.e., independent of the jump distribution in the random walk. This means that the distributions are exactly identical for Lévy flights and random walks with finite jump variance. Our analytical results are in excellent agreement with our numerical simulations. △ Less

Submitted 14 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Comments: Typos corrected, submitted version. Main text: 6 pages, 3 figures. Supplementary material: 20 pages, 12 figures

arXiv:2402.03351 [pdf, ps, other]

Single and entangled atomic systems in thermal bath and the Fulling-Davies-Unruh effect

Authors: Arnab Mukherjee, Sunandan Gangopadhyay, Archan S. Majumdar

Abstract: We revisit the Fulling-Davies-Unruh effect in the context of two-level single and entangled atomic systems that are either uniformly accelerated or static in a thermal bath. We consider the interaction between the systems and a massless scalar field, covering the scenarios of free space as well as within a cavity. Through the calculation of atomic transition rates, it is found that in free space t… ▽ More We revisit the Fulling-Davies-Unruh effect in the context of two-level single and entangled atomic systems that are either uniformly accelerated or static in a thermal bath. We consider the interaction between the systems and a massless scalar field, covering the scenarios of free space as well as within a cavity. Through the calculation of atomic transition rates, it is found that in free space there is an equivalence between a uniformly accelerated atom with respect to an observer with that of a single atom which is static with respect to the observer and immersed in a thermal bath, as long as the temperature of the thermal bath matches the Unruh temperature. This equivalence breaks down in the presence of a cavity. For two-atom systems, we consider the initial state to be in a general pure entangled form. We find that in this case, the equivalence between the accelerated and static thermal bath scenarios holds only under specific limiting conditions in free space but breaks down completely in a cavity set-up. △ Less

Submitted 25 January, 2024; originally announced February 2024.

Comments: 32 pages LaTeX, Comments are welcome. arXiv admin note: substantial text overlap with arXiv:2305.08867

arXiv:2401.14873 [pdf, ps, other]

Lessons from discrete light-cone quantization for physics at null infinity: Bosons in two dimensions

Authors: Glenn Barnich, Sucheta Majumdar, Simone Speziale, Wen-Di Tan

Abstract: Motivated by issues in the context of asymptotically flat spacetimes at null infinity, we discuss in the simplest example of a massless scalar field in two dimensions several subtleties that arise when setting up the canonical formulation on a single or on two intersecting null hyperplanes with a special emphasis on the infinite-dimensional global and conformal symmetries and their canonical gener… ▽ More Motivated by issues in the context of asymptotically flat spacetimes at null infinity, we discuss in the simplest example of a massless scalar field in two dimensions several subtleties that arise when setting up the canonical formulation on a single or on two intersecting null hyperplanes with a special emphasis on the infinite-dimensional global and conformal symmetries and their canonical generators, the free data, a consistent treatment of zero modes, matching conditions, and implications for quantization of massless versus massive fields. △ Less

Submitted 20 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Comments: 52 pages, 3 figures, cosmetic changes

arXiv:2401.09246 [pdf, other]

Work Distribution for Unzip** Processes

Authors: P. Werner, A. K. Hartmann, S. N. Majumdar

Abstract: A simple zipper model is introduced, representing in a simplified way, e.g., the folded DNA double helix or hairpin structures in RNA. The double stranded hairpin is connected to a heat bath at temperature $T$ and subject to an external force $f$, which couples to the free length $L$ of the unzipped sequence. Increasing the force, leads to an zip**/unzip** first-order phase transition at a cri… ▽ More A simple zipper model is introduced, representing in a simplified way, e.g., the folded DNA double helix or hairpin structures in RNA. The double stranded hairpin is connected to a heat bath at temperature $T$ and subject to an external force $f$, which couples to the free length $L$ of the unzipped sequence. Increasing the force, leads to an zip**/unzip** first-order phase transition at a critical force $f_c(T)$ in the thermodynamic limit of a very large chain. We compute analytically, as a function of temperature $T$ and force $f$, the full distribution $P(L)$ of free lengths in the thermodynamic limit and show that it is qualitatively very different for $f<f_c$, $f=f_c$ and $f>f_c$. Next we consider quasistatic work processes where the force is incremented according to a linear protocol. Having obtained $P(L)$ already allows us to derive an analytical expression for the work distribution $P(W)$ in the zipped phase $f<f_c$ for a long chain. We compute the large-deviation tails of the work distribution explicitly. Our analytical result for the work distribution is compared over a large range of the support down to probabilities as small as $10^{-200}$ with numerical simulations, which were performed by applying sophisticated large-deviation algorithms. △ Less

Submitted 17 January, 2024; originally announced January 2024.

Comments: 14 pages, 9 figures

arXiv:2401.05947 [pdf, other]

Send Message to the Future? Blockchain-based Time Machines for Decentralized Reveal of Locked Information

Authors: Zhuolun Li, Srijoni Majumdar, Evangelos Pournaras

Abstract: Conditional information reveal systems automate the release of information upon meeting specific predefined conditions, such as time or location. This paper introduces a breakthrough in the understanding, design and application of conditional information reveal systems that are highly secure and decentralized. By designing a new practical timed-release cryptography system and a verifiable secret s… ▽ More Conditional information reveal systems automate the release of information upon meeting specific predefined conditions, such as time or location. This paper introduces a breakthrough in the understanding, design and application of conditional information reveal systems that are highly secure and decentralized. By designing a new practical timed-release cryptography system and a verifiable secret sharing scheme, a novel data sharing system is devised on the blockchain that `sends messages in the future' with highly accurate decryption times. This paper provides a complete evaluation portfolio of this pioneering paradigm, including analytical results, a validation of its robustness in the Tamarin Prover and a performance evaluation of a real-world, open-source system prototype deployed across the globe. Using real-world election data, we also demonstrate the applicability of this innovative system in e-voting, illustrating its capacity to secure and ensure fair electronic voting processes. △ Less

Submitted 24 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

arXiv:2401.02645 [pdf, ps, other]

Information entropy in excited states in confined quantum systems

Authors: Sangita Majumdar, Neetik Mukherjee, Amlan K. Roy

Abstract: The present contribution constitutes a brief account of information theoretical analysis in several representative model as well as real quantum mechanical systems. There has been an overwhelming interest to study such measures in various quantum systems, as evidenced by a vast amount of publications in the literature that has taken place in recent years. However, while such works are numerous in… ▽ More The present contribution constitutes a brief account of information theoretical analysis in several representative model as well as real quantum mechanical systems. There has been an overwhelming interest to study such measures in various quantum systems, as evidenced by a vast amount of publications in the literature that has taken place in recent years. However, while such works are numerous in so-called \emph{free} systems, there is a genuine lack of these in their constrained counterparts. With this in mind, this chapter will focus on some of the recent exciting progresses that has been witnessed in our laboratory \cite{sen06,roy14mpla,roy14mpla_manning,roy15ijqc, roy16ijqc, mukherjee15,mukherjee16,majumdar17,mukherjee18a,mukherjee18b,mukherjee18c,mukherjee18d,majumdar20,mukherjee21,majumdar21a, majumdar21b}, and elsewhere, with special emphasis on following prototypical systems, namely, (i) double well (DW) potential (symmetric and asymmetric) (ii) \emph{free}, as well as a \emph{confined hydrogen atom} (CHA) enclosed in a spherical impenetrable cavity (iii) a many-electron atom under similar enclosed environment. △ Less

Submitted 5 January, 2024; originally announced January 2024.

Comments: 13 figures

arXiv:2401.01935 [pdf, other]

doi 10.1093/mnras/stae078

Extracting the Global 21-cm signal from Cosmic Dawn and Epoch of Reionization in the presence of Foreground and Ionosphere

Authors: Anshuman Tripathi, Abhirup Datta, Madhurima Choudhury, Suman Majumdar

Abstract: Detection of redshifted \ion{H}{i} 21-cm emission is a potential probe for investigating the Universe's first billion years. However, given the significantly brighter foreground, detecting 21-cm is observationally difficult. The Earth's ionosphere considerably distorts the signal at low frequencies by introducing directional-dependent effects. Here, for the first time, we report the use of Artific… ▽ More Detection of redshifted \ion{H}{i} 21-cm emission is a potential probe for investigating the Universe's first billion years. However, given the significantly brighter foreground, detecting 21-cm is observationally difficult. The Earth's ionosphere considerably distorts the signal at low frequencies by introducing directional-dependent effects. Here, for the first time, we report the use of Artificial Neural Networks (ANNs) to extract the global 21cm signal characteristics from the composite all-sky averaged signal, including foreground and ionospheric effects such as refraction, absorption, and thermal emission from the ionosphere's F and D-layers. We assume a 'perfect' instrument and neglect instrumental calibration and beam effects. To model the ionospheric effect, we considered the static and time-varying ionospheric conditions for the mid-latitude region where LOFAR is situated. In this work, we trained the ANN model for various situations using a synthetic set of the global 21cm signals created by altering its parameter space based on the "$\rm \tanh$" parameterized model and the Accelerated Reionization Era Simulations (ARES) algorithm. The obtained result shows that the ANN model can extract the global signal parameters with an accuracy of $\ge 96 \% $ in the final study when we include foreground and ionospheric effects. On the other hand, a similar ANN model can extract the signal parameters from the final prediction dataset with an accuracy ranging from $97 \%$ to $98 \%$ when considering more realistic sets of the global 21cm signals based on physical models. △ Less

Submitted 28 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

Comments: 21 pages, 19 figures, Published in MNRAS

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 528, Issue 2, February 2024, Pages 1945-1964

arXiv:2401.01236 [pdf, other]

An operational approach to classifying measurement incompatibility

Authors: Arun Kumar Das, Saheli Mukherjee, Debashis Saha, Debarshi Das, A. S. Majumdar

Abstract: Measurement incompatibility has proved to be an important resource for information-processing tasks. In this work, we analyze various levels of incompatibility of measurement sets. We provide operational classification of measurement incompatibility with respect to two elementary classical operations, viz., coarse-graining of measurement outcomes and convex mixing of different measurements. We der… ▽ More Measurement incompatibility has proved to be an important resource for information-processing tasks. In this work, we analyze various levels of incompatibility of measurement sets. We provide operational classification of measurement incompatibility with respect to two elementary classical operations, viz., coarse-graining of measurement outcomes and convex mixing of different measurements. We derive analytical criteria for determining when a set of projective measurements is fully incompatible with respect to coarse-graining or convex mixing. Robustness against white noise is investigated for mutually unbiased bases that can sustain full incompatibility. Furthermore, we propose operational witnesses for different levels of incompatibility subject to classical operations, using the input-output statistics of Bell-type experiments as well as experiments in the prepare-and-measure scenario. △ Less

Submitted 2 January, 2024; originally announced January 2024.

Comments: 11 pages, Preliminary draft, Comments are welcome

arXiv:2312.17279 [pdf, other]

Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition

Authors: Vahid Noroozi, Somshubra Majumdar, Ankur Kumar, Jagadeesh Balam, Boris Ginsburg

Abstract: In this paper, we propose an efficient and accurate streaming speech recognition model based on the FastConformer architecture. We adapted the FastConformer architecture for streaming applications through: (1) constraining both the look-ahead and past contexts in the encoder, and (2) introducing an activation caching mechanism to enable the non-autoregressive encoder to operate autoregressively du… ▽ More In this paper, we propose an efficient and accurate streaming speech recognition model based on the FastConformer architecture. We adapted the FastConformer architecture for streaming applications through: (1) constraining both the look-ahead and past contexts in the encoder, and (2) introducing an activation caching mechanism to enable the non-autoregressive encoder to operate autoregressively during inference. The proposed model is thoughtfully designed in a way to eliminate the accuracy disparity between the train and inference time which is common for many streaming models. Furthermore, our proposed encoder works with various decoder configurations including Connectionist Temporal Classification (CTC) and RNN-Transducer (RNNT) decoders. Additionally, we introduced a hybrid CTC/RNNT architecture which utilizes a shared encoder with both a CTC and RNNT decoder to boost the accuracy and save computation. We evaluate the proposed model on LibriSpeech dataset and a multi-domain large scale dataset and demonstrate that it can achieve better accuracy with lower latency and inference time compared to a conventional buffered streaming model baseline. We also showed that training a model with multiple latencies can achieve better accuracy than single latency models while it enables us to support multiple latencies with a single model. Our experiments also showed the hybrid architecture would not only speedup the convergence of the CTC decoder but also improves the accuracy of streaming models compared to single decoder models. △ Less

Submitted 2 May, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

Comments: Shorter version accepted to ICASSP 2024

arXiv:2312.13439 [pdf, other]

Active particle in one dimension subjected to resetting with memory

Authors: Denis Boyer, Satya N. Majumdar

Abstract: The study of diffusion with preferential returns to places visited in the past has attracted an increased attention in recent years. In these highly non-Markov processes, a standard diffusive particle intermittently resets at a given rate to previously visited positions. At each reset, a position to be revisited is randomly chosen with a probability proportional to the accumulated amount of time s… ▽ More The study of diffusion with preferential returns to places visited in the past has attracted an increased attention in recent years. In these highly non-Markov processes, a standard diffusive particle intermittently resets at a given rate to previously visited positions. At each reset, a position to be revisited is randomly chosen with a probability proportional to the accumulated amount of time spent by the particle at that position. These preferential revisits typically generate a very slow diffusion, logarithmic in time, but still with a Gaussian position distribution at late times. Here we consider an active version of this model, where between resets the particle is self-propelled with constant speed and switches direction in one dimension according to a telegraphic noise. Hence there are two sources of non-Markovianity in the problem. We exactly derive the position distribution in Fourier space, as well as the variance of the position at all times. The crossover from the short-time ballistic regime, dominated by activity, to the large-time anomalous logarithmic growth induced by memory is studied. We also analytically derive a large deviation principle for the position, which exhibits a logarithmic time-scaling instead of the usual algebraic form. Interestingly, at large distances, the large deviations become independent of time and match the non-equilibrium steady state of a particle under resetting to its starting position only. △ Less

Submitted 6 May, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 18 pages, 3 figures

Journal ref: Phys. Rev. E 109, 054105 (2024)

arXiv:2312.13095 [pdf, other]

doi 10.1103/PhysRevB.109.205148

Antiferromagnetic order enhanced by local dissipation

Authors: Oscar Bouverot-Dupuis, Saptarshi Majumdar, Alberto Rosso, Laura Foini

Abstract: We study an XXZ spin chain at zero magnetization coupled to a collection of local harmonic baths at zero temperature. We map this system on a (1+1)D effective field theory using bosonization, where the effect of the bath is taken care of in an exact manner. We provide analytical and numerical evidence of the existence of two phases at zero temperature: a Luttinger liquid (LL) and an antiferromagne… ▽ More We study an XXZ spin chain at zero magnetization coupled to a collection of local harmonic baths at zero temperature. We map this system on a (1+1)D effective field theory using bosonization, where the effect of the bath is taken care of in an exact manner. We provide analytical and numerical evidence of the existence of two phases at zero temperature: a Luttinger liquid (LL) and an antiferromagnetic phase (AFM), separated by a phase transition akin to the Berezinsky--Kosterlitz--Thouless (BKT) type. While the bath is responsible for the LL-AFM transition for subohmic baths, the LL-AFM transition for superohmic baths is due to the interactions within the spin chain. △ Less

Submitted 12 May, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 20 pages, 11 figures

Journal ref: Phys. Rev. B 109, 205148 (2024)

arXiv:2312.03040 [pdf, other]

doi 10.1007/s11128-024-04397-8

Device-Independent Quantum Secure Direct Communication Under Non-Markovian Quantum Channels

Authors: Pritam Roy, Subhankar Bera, Shashank Gupta, A. S. Majumdar

Abstract: Device-independent quantum secure direct communication (DI-QSDC) is a promising primitive in quantum cryptography aimed towards addressing the problems of device imperfections and key management. However, significant effort is required to tackle practical challenges such as the distance limitation due to the decohering effects of quantum channels. Here, we explore the constructive effect of non-Ma… ▽ More Device-independent quantum secure direct communication (DI-QSDC) is a promising primitive in quantum cryptography aimed towards addressing the problems of device imperfections and key management. However, significant effort is required to tackle practical challenges such as the distance limitation due to the decohering effects of quantum channels. Here, we explore the constructive effect of non-Markovian noise to improve the performance of DI-QSDC. Considering two different environmental dynamics modelled by the amplitude dam** and the dephasing channels, we show that for both cases non-Markovianty leads to a considerable improvement over Markovian dynamics in terms of three benchmark performance criteria of the DI-QSDC task. Specifically, we find that non-Markovian noise (i) enhances the protocol security measured by Bell violation, (ii) leads to a lower quantum bit error rate, and (iii) enables larger communication distances by increasing the capacity of secret communication. △ Less

Submitted 6 May, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

Comments: 13 pages, 10 figures, comments are welcome

Journal ref: Quantum Inf Process 23, 170 (2024)

arXiv:2312.02570 [pdf, other]

doi 10.1103/PhysRevE.109.L032106

Dynamically emergent correlations between particles in a switching harmonic trap

Authors: Marco Biroli, Manas Kulkarni, Satya N. Majumdar, Gregory Schehr

Abstract: We study a one dimensional gas of $N$ noninteracting diffusing particles in a harmonic trap, whose stiffness switches between two values $μ_1$ and $μ_2$ with constant rates $r_1$ and $r_2$ respectively. Despite the absence of direct interaction between the particles, we show that strong correlations between them emerge in the stationary state at long times, induced purely by the dynamics itself. W… ▽ More We study a one dimensional gas of $N$ noninteracting diffusing particles in a harmonic trap, whose stiffness switches between two values $μ_1$ and $μ_2$ with constant rates $r_1$ and $r_2$ respectively. Despite the absence of direct interaction between the particles, we show that strong correlations between them emerge in the stationary state at long times, induced purely by the dynamics itself. We compute exactly the joint distribution of the positions of the particles in the stationary state, which allows us to compute several physical observables analytically. In particular, we show that the extreme value statistics (EVS), i.e., the distribution of the position of the rightmost particle has a nontrivial shape in the large $N$ limit. The scaling function characterizing this EVS has a finite support with a tunable shape (by varying the parameters). Remarkably, this scaling function turns out to be universal. First, it also describes the distribution of the position of the $k$-th rightmost particle in a $1d$ trap. Moreover, the distribution of the position of the particle farthest from the center of the harmonic trap in $d$ dimensions is also described by the same scaling function for all $d \geq 1$. Numerical simulations are in excellent agreement with our analytical predictions. △ Less

Submitted 5 December, 2023; originally announced December 2023.

Comments: Main text: 6 pages + Supp. Mat.: 16 pages

Journal ref: Phys. Rev. E 109, L032106 (2024)

arXiv:2311.17689 [pdf, other]

doi 10.1103/PhysRevE.109.044150

Occupation time of a system of Brownian particles on the line with steplike initial condition

Authors: Ivan N. Burenev, Satya N. Majumdar, Alberto Rosso

Abstract: We consider a system of non-interacting Brownian particles on the line with steplike initial condition and study the statistics of the occupation time on the positive half-line. We demonstrate that this system exhibits long-lasting memory effects of the initialization. Specifically, we calculate the mean and the variance of the occupation time, demonstrating that the memory effects in the variance… ▽ More We consider a system of non-interacting Brownian particles on the line with steplike initial condition and study the statistics of the occupation time on the positive half-line. We demonstrate that this system exhibits long-lasting memory effects of the initialization. Specifically, we calculate the mean and the variance of the occupation time, demonstrating that the memory effects in the variance are determined by a generalized compressibility (or Fano factor), associated with the initial condition. In the particular case of the uncorrelated uniform initial condition we conduct a detailed study of two probability distributions of the occupation time: annealed (averaged over all possible initial configurations) and quenched (for a typical configuration). We show that at large times both the annealed and the quenched distributions admit large deviation form and we compute analytically the associated rate functions. We verify our analytical predictions via numerical simulations using Importance Sampling Monte-Carlo strategy. △ Less

Submitted 30 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

Comments: v2, 17 pages, 10 figures

Journal ref: Phys. Rev. E 109, 044150 (2024)

arXiv:2311.17062 [pdf, other]

Impact of astrophysical scatter on the Epoch of Reionization [H I]$_{\rm 21cm}$ bispectrum

Authors: Chandra Shekhar Murmu, Kanan K. Datta, Suman Majumdar, Thomas R. Greve

Abstract: It is believed that the first star-forming galaxies are the main drivers of cosmic reionization. It is usually assumed that there is a one-to-one relationship between the star formation rate (SFR) inside a galaxy and the host halo mass in semi-analytical/numerical modeling of large-scale ionization maps during the epoch of reionization. However, more accurate simulations and observations suggest t… ▽ More It is believed that the first star-forming galaxies are the main drivers of cosmic reionization. It is usually assumed that there is a one-to-one relationship between the star formation rate (SFR) inside a galaxy and the host halo mass in semi-analytical/numerical modeling of large-scale ionization maps during the epoch of reionization. However, more accurate simulations and observations suggest that the SFR and ionizing luminosity in galaxies may vary considerably even if the host halo mass is the same. This astrophysical scatter can introduce an additional non-Gaussianity in the [H I]$_{21\rm cm}$ signal, which might not be captured adequately in the power spectrum. In this work, we have studied the impact of the scatter on the [H I]$_{21\rm cm}$ bispectrum using semi-numerical simulations. We find that the scatter primarily affects small ionized regions, whereas the large ionized bubbles remain largely unaffected. Although, the fractional change in the [H I]$_{21\rm cm}$ bispectra due to the scatter is found to be more than a factor of $10$ at large scales ($\lesssim 1\, {\rm Mpc}^{-1}$), it is found to be statistically insignificant. However, at small scales ($k\sim2.55$ Mpc$^{-1}$), we have found the impact due to the scatter to be high in magnitude ($|\langle ΔB \rangle/B_{\text{no-scatter}}| \sim 1$) and statistically significant ($|\langleΔB\rangle/σ_{ΔB}| \gtrsim 5$) at neutral fraction, $\overline{x}_{\rm HI}\sim 0.8$. We have also found that in the most optimistic scenario, SKA1-Low might be able to detect these signatures of astrophysical scatter, at $\sim 3σ$ and $\sim 5σ$ detection significance for $\overline{x}_{\rm HI} \sim$ 0.8 and 0.9 respectively, for the equilateral [H I]$_{21\rm cm}$ bispectrum. △ Less

Submitted 10 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

Comments: 22 pages, 7 figures, comments are welcome, prepared for submission to JCAP

arXiv:2311.09156 [pdf, other]

doi 10.21468/SciPostPhys.16.4.092

Radiative Asymptotic Symmetries of 3D Einstein-Maxwell Theory

Authors: Jorrit Bosma, Marc Geiller, Sucheta Majumdar, Blagoje Oblak

Abstract: We study the null asymptotic structure of Einstein-Maxwell theory in three-dimensional (3D) spacetimes. Although devoid of bulk gravitational degrees of freedom, the system admits a massless photon and can therefore accommodate electromagnetic radiation. We derive fall-off conditions for the Maxwell field that contain both Coulombic and radiative modes with non-vanishing news. The latter produces… ▽ More We study the null asymptotic structure of Einstein-Maxwell theory in three-dimensional (3D) spacetimes. Although devoid of bulk gravitational degrees of freedom, the system admits a massless photon and can therefore accommodate electromagnetic radiation. We derive fall-off conditions for the Maxwell field that contain both Coulombic and radiative modes with non-vanishing news. The latter produces non-integrability and fluxes in the asymptotic surface charges, and gives rise to a non-trivial 3D Bondi mass loss formula. The resulting solution space is thus analogous to a dimensional reduction of 4D pure gravity, with the role of gravitational radiation played by its electromagnetic cousin. We use this simplified setup to investigate choices of charge brackets in detail, and compute in particular the recently introduced Koszul bracket. When the latter is applied to Wald-Zoupas charges, which are conserved in the absence of news, it leads to the field-dependent central extension found earlier in [arXiv:1503.00856]. We also consider (Anti-)de Sitter asymptotics to further exhibit the analogy between this model and 4D gravity with leaky boundary conditions. △ Less

Submitted 5 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

Comments: 50 pages, 1 figure. v2: added references and a paragraph in the conclusion on supersymmetric extensions

Journal ref: SciPost Phys. 16, 092 (2024)

arXiv:2311.06923 [pdf, other]

doi 10.1209/0295-5075/ad2ba3

Optimal mean first-passage time of a run-and-tumble particle in a class of one-dimensional confining potentials

Authors: Mathis Guéneau, Satya N. Majumdar, Gregory Schehr

Abstract: We consider a run-and-tumble particle (RTP) in one dimension, subjected to a telegraphic noise with a constant rate $γ$, and in the presence of an external confining potential $V(x) = α|x|^p$ with $p \geq 1$. We compute the mean first-passage time (MFPT) at the origin $τ_γ(x_0)$ for an RTP starting at $x_0$. We obtain a closed form expression for $τ_γ(x_0)$ for all $p \geq 1$, which becomes fully… ▽ More We consider a run-and-tumble particle (RTP) in one dimension, subjected to a telegraphic noise with a constant rate $γ$, and in the presence of an external confining potential $V(x) = α|x|^p$ with $p \geq 1$. We compute the mean first-passage time (MFPT) at the origin $τ_γ(x_0)$ for an RTP starting at $x_0$. We obtain a closed form expression for $τ_γ(x_0)$ for all $p \geq 1$, which becomes fully explicit in the case $p=1$, $p=2$ and in the limit $p \to \infty$. For generic $p>1$ we find that there exists an optimal rate $γ_{\rm opt}$ that minimizes the MFPT and we characterize in detail its dependence on $x_0$. We find that $γ_{\rm opt} \propto 1/x_0$ as $x_0 \to 0$, while $γ_{\rm opt}$ converges to a nontrivial constant as $x_0 \to \infty$. In contrast, for $p=1$, there is no finite optimum and $γ_{\rm opt} \to \infty$ in this case. These analytical results are confirmed by our numerical simulations. △ Less

Submitted 19 January, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

Comments: Main text: 7+eps pages, 5 figures. Supp. Mat.: 12 pages (revised version)

Journal ref: EPL 145 61002 (2024)

arXiv:2311.03374 [pdf, other]

Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023

Authors: Srijoni Majumdar, Soumen Paul, Debjyoti Paul, Ayan Bandyopadhyay, Samiran Chattopadhyay, Partha Pratim Das, Paul D Clough, Prasenjit Majumder

Abstract: The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework based on human and large language model generated labels. In this track, there is a binary classification task to classify comments as useful and not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs e… ▽ More The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework based on human and large language model generated labels. In this track, there is a binary classification task to classify comments as useful and not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs extracted from open source github C based projects and an additional dataset generated individually by teams using large language models. Overall 56 experiments have been submitted by 17 teams from various universities and software companies. The submissions have been evaluated quantitatively using the F1-Score and qualitatively based on the type of features developed, the supervised learning model used and their corresponding hyper-parameters. The labels generated from large language models increase the bias in the prediction model but lead to less over-fitted results. △ Less

Submitted 27 October, 2023; originally announced November 2023.

Comments: Overview Paper of the Information Retrieval of Software Engineering Track at the Forum for Information Retrieval, 2023

arXiv:2310.20272 [pdf, other]

Beyond Best-Fits and Model Selection -- Introducing "Reliability" of cusp-core inference of dark matter halos

Authors: Manush Manju, Subhabrata Majumdar

Abstract: We introduce the notion of a Bayesian analysis motivated `reliability' that gives a truer distinction of cusp-core and other halo-parameters (like mass-concentration) in an ensemble of observed galaxies. Our approach goes beyond the standard statistical techniques of parameter estimation and model fitting. We create hundreds of thousands of realistic mock SPARC RCs, with both cuspy and cored DM de… ▽ More We introduce the notion of a Bayesian analysis motivated `reliability' that gives a truer distinction of cusp-core and other halo-parameters (like mass-concentration) in an ensemble of observed galaxies. Our approach goes beyond the standard statistical techniques of parameter estimation and model fitting. We create hundreds of thousands of realistic mock SPARC RCs, with both cuspy and cored DM density profiles as model inputs. These RCs carefully incorporate the details of SPARC data such as the nature of observed uncertainties and different sources of scatters arising from observation, presence of baryons, DM mass-concentration, etc. Bayesian analysis of these mock RCs enables us to reconstruct and identify the parameter space in galaxy observable and theory where one can venture beyond best-fits to a preferred DM halo model or model selections between different density models. We find that it is imperative to choose low stellar surface density ($Σ_{\star}$) galaxies for reliable cusp-vs-core distinction; for example, RC data for galaxies with $Σ_{\star} \leq 2.5$ is needed for a 75\% confidence in distinguishing cusps from cores. Similarly, we also find that for correct estimations of the halo masses and concentrations, the RCs need to be measured to at least a radial distance $\geq 0.8r_s$ where $r_s$ is the scale radii of the corresponding DM halo density profiles. Out of the total $\sim$ 135 SPARC galaxies, using our reliability criteria, we find that only 21 RCs clear the bar to be used for any unbiased cusp-core distinction as well as DM halo mass-concentration estimates at $\geq$75\% reliability confidence level. With $\geq$66\% ( $\geq$50\%) reliability settings, the sample size increases to 44 (59). Interestingly, in the $\geq 75$\% reliable subsample, there are 5 times more galaxies that are reliably cored than cuspy. [Abridged] △ Less

Submitted 31 October, 2023; originally announced October 2023.

Comments: For submission to JCAP. Feedback welcome. 45 pages, 23 figures. A single key figure is Fig-15

arXiv:2310.18757 [pdf, other]

Analyzing the 21-cm signal brightness temperature in the Universe with inhomogeneities

Authors: Shashank Shekhar Pandey, Ashadul Halder, A. S. Majumdar

Abstract: We explore the 21-cm signal in our Universe containing inhomogeneous matter distribution at considerably large scales. Employing Buchert's averaging procedure in the context of a model of spacetime with multiple inhomogeneous domains, we evaluate the effect of our model parameters on the observable 21-cm signal brightness temperature. Our model parameters are constrained through the Markov Chain M… ▽ More We explore the 21-cm signal in our Universe containing inhomogeneous matter distribution at considerably large scales. Employing Buchert's averaging procedure in the context of a model of spacetime with multiple inhomogeneous domains, we evaluate the effect of our model parameters on the observable 21-cm signal brightness temperature. Our model parameters are constrained through the Markov Chain Monte Carlo method using the Union 2.1 supernova Ia observational data. We find that a significant dip in the brightness temperature compared to the $Λ$CDM prediction could arise as an effect of the inhomogeneities present in the Universe. △ Less

Submitted 28 October, 2023; originally announced October 2023.

Comments: 17 pages, 7 figures

arXiv:2310.17152 [pdf]

Technical Note: Feasibility of translating 3.0T-trained Deep-Learning Segmentation Models Out-of-the-Box on Low-Field MRI 0.55T Knee-MRI of Healthy Controls

Authors: Rupsa Bhattacharjee, Zehra Akkaya, Johanna Luitjens, Pan Su, Yang Yang, Valentina Pedoia, Sharmila Majumdar

Abstract: In the current study, our purpose is to evaluate the feasibility of applying deep learning (DL) enabled algorithms to quantify bilateral knee biomarkers in healthy controls scanned at 0.55T and compared with 3.0T. The current study assesses the performance of standard in-practice bone, and cartilage segmentation algorithms at 0.55T, both qualitatively and quantitatively, in terms of comparing segm… ▽ More In the current study, our purpose is to evaluate the feasibility of applying deep learning (DL) enabled algorithms to quantify bilateral knee biomarkers in healthy controls scanned at 0.55T and compared with 3.0T. The current study assesses the performance of standard in-practice bone, and cartilage segmentation algorithms at 0.55T, both qualitatively and quantitatively, in terms of comparing segmentation performance, areas of improvement, and compartment-wise cartilage thickness values between 0.55T vs. 3.0T. Initial results demonstrate a usable to good technical feasibility of translating existing quantitative deep-learning-based image segmentation techniques, trained on 3.0T, out of 0.55T for knee MRI, in a multi-vendor acquisition environment. Especially in terms of segmenting cartilage compartments, the models perform almost equivalent to 3.0T in terms of Likert ranking. The 0.55T low-field sustainable and easy-to-install MRI, as demonstrated, thus, can be utilized for evaluating knee cartilage thickness and bone segmentations aided by established DL algorithms trained at higher-field strengths out-of-the-box initially. This could be utilized at the far-spread point-of-care locations with a lack of radiologists available to manually segment low-field images, at least till a decent base of low-field data pool is collated. With further fine-tuning with manual labeling of low-field data or utilizing synthesized higher SNR images from low-field images, OA biomarker quantification performance is potentially guaranteed to be further improved. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: 11 Pages, 3 Figures, 2 Tables

arXiv:2310.16420 [pdf, ps, other]

Linear statistics for Coulomb gases: higher order cumulants

Authors: Benjamin De Bruyne, Pierre Le Doussal, Satya N. Majumdar, Gregory Schehr

Abstract: We consider $N$ classical particles interacting via the Coulomb potential in spatial dimension $d$ and in the presence of an external trap, at equilibrium at inverse temperature $β$. In the large $N$ limit, the particles are confined within a droplet of finite size. We study smooth linear statistics, i.e. the fluctuations of sums of the form ${\cal L}_N = \sum_{i=1}^N f({\bf x}_i)$, where… ▽ More We consider $N$ classical particles interacting via the Coulomb potential in spatial dimension $d$ and in the presence of an external trap, at equilibrium at inverse temperature $β$. In the large $N$ limit, the particles are confined within a droplet of finite size. We study smooth linear statistics, i.e. the fluctuations of sums of the form ${\cal L}_N = \sum_{i=1}^N f({\bf x}_i)$, where ${\bf x}_i$'s are the positions of the particles and where $f({\bf x}_i)$ is a sufficiently regular function. There exists at present standard results for the first and second moments of ${\cal L}_N$ in the large $N$ limit, as well as associated Central Limit Theorems in general dimension and for a wide class of confining potentials. Here we obtain explicit expressions for the higher order cumulants of ${\cal L}_N$ at large $N$, when the function $f({\bf x})=f(|{\bf x}|)$ and the confining potential are both rotationnally invariant. A remarkable feature of our results is that these higher cumulants depend only on the value of $f'(|{\bf x}|)$ and its higher order derivatives evaluated exactly at the boundary of the droplet, which in this case is a $d$-dimensional sphere. In the particular two-dimensional case $d=2$ at the special value $β=2$, a connection to the Ginibre ensemble allows us to derive these results in an alternative way using the tools of determinantal point processes. Finally we also obtain the large deviation form of the full probability distribution function of ${\cal L}_N$. △ Less

Submitted 25 October, 2023; originally announced October 2023.

Comments: 19 pages

Journal ref: J. Phys. A: Math. Theor. 57, 155002 (2024)

arXiv:2310.15579 [pdf, other]

doi 10.1093/mnras/stad3273

The monopole and quadrupole moments of the Epoch of Reionization (EoR) 21-cm bispectrum

Authors: Sukhdeep Singh Gill, Suman Pramanick, Somnath Bharadwaj, Abinash Kumar Shaw, Suman Majumdar

Abstract: We study the monopole ($\bar{B}^0_0$) and quadrupole ($\bar{B}^0_2$) moments of the 21-cm bispectrum (BS) from EoR simulations and present results for squeezed and stretched triangles. Both $\bar{B}^0_0$ and $\bar{B}^0_2$ are positive at the early stage of EoR where the mean neutral hydrogen (HI) density fraction $\bar{x}_{\rm HI} \approx 0.99$. The subsequent evolution of $\bar{B}^0_0$ and… ▽ More We study the monopole ($\bar{B}^0_0$) and quadrupole ($\bar{B}^0_2$) moments of the 21-cm bispectrum (BS) from EoR simulations and present results for squeezed and stretched triangles. Both $\bar{B}^0_0$ and $\bar{B}^0_2$ are positive at the early stage of EoR where the mean neutral hydrogen (HI) density fraction $\bar{x}_{\rm HI} \approx 0.99$. The subsequent evolution of $\bar{B}^0_0$ and $\bar{B}^0_2$ at large and intermediate scales $(k=0.29$ and $0.56 \, {\rm Mpc}^{-1}$ respectively) is punctuated by two sign changes which mark transitions in the HI distribution. The first sign flip where $\bar{B}^0_0$ becomes negative occurs in the intermediate stages of EoR $(\bar{x}_{\rm HI} > 0.5)$, at large scale first followed by the intermediate scale. This marks the emergence of distinct ionized bubbles in the neutral background. $\bar{B}^0_2$ is relatively less affected by this transition, and it mostly remains positive even when $\bar{B}^0_0$ becomes negative. The second sign flip, which affects both $\bar{B}^0_0$ and $\bar{B}^0_2$, occurs at the late stage of EoR $(\bar{x}_{\rm HI} < 0.5)$. This marks a transition in the topology of the HI distribution, after which we have distinct HI islands in an ionized background. This causes $\bar{B}^0_0$ to become positive. The negative $\bar{B}^0_2$ is a definite indication that the HI islands survive only in under-dense regions. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Comments: Accepted for publication in MNRAS

Report number: MNRAS, Volume 527, Issue 1

Journal ref: 2024

Showing 1–50 of 864 results for author: Majumdar, S