-
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data
Authors:
Krishna C. Puvvada,
Piotr Żelasko,
He Huang,
Oleksii Hrinchuk,
Nithin Rao Koluguri,
Kunal Dhawan,
Somshubra Majumdar,
Elena Rastorgueva,
Zhehuai Chen,
Vitaly Lavrukhin,
Jagadeesh Balam,
Boris Ginsburg
Abstract:
Recent advances in speech recognition and translation rely on hundreds of thousands of hours of Internet speech data. We argue that state-of-the-art accuracy can be reached without relying on web-scale data. Canary, a multilingual ASR and speech translation model, outperforms current state-of-the-art models (Whisper, OWSM, and Seamless-M4T) on English, French, Spanish, and German, while being trained on an order of magnitude less data than these models. Three key factors enable such a data-efficient model: (1) a FastConformer-based attention encoder-decoder architecture, (2) training on synthetic data generated with machine translation, and (3) advanced training techniques: data-balancing, dynamic data blending, dynamic bucketing, and noise-robust fine-tuning. The model, weights, and training code will be open-sourced.
Submitted 28 June, 2024;
originally announced June 2024.
-
Comparing sampling techniques to chart parameter space of 21 cm Global signal with Artificial Neural Networks
Authors:
Anshuman Tripathi,
Gursharanjit Kaur,
Abhirup Datta,
Suman Majumdar
Abstract:
Understanding the first billion years of the universe requires studying two critical epochs: the Epoch of Reionization (EoR) and Cosmic Dawn (CD). However, due to limited data, the properties of the Intergalactic Medium (IGM) during these periods remain poorly understood, leading to a vast parameter space for the global 21cm signal. Training an Artificial Neural Network (ANN) with a narrowly defined parameter space can result in biased inferences. To mitigate this, the training dataset must be uniformly drawn from the entire parameter space to cover all possible signal realizations. However, drawing all possible realizations is computationally challenging, necessitating the sampling of a representative subset of this space. This study aims to identify optimal sampling techniques for the extensive dimensionality and volume of the 21cm signal parameter space. The optimally sampled training set will be used to train the ANN to infer from the global signal experiment. We investigate three sampling techniques: random, Latin Hypercube (stratified), and Hammersley Sequence (quasi-Monte Carlo) sampling, and compare their outcomes. Our findings reveal that sufficient samples must be drawn for robust and accurate ANN model training, regardless of the sampling technique employed. The required sample size depends primarily on two factors: the complexity of the data and the number of free parameters. More free parameters necessitate drawing more realizations. Among the sampling techniques utilized, we find that ANN models trained with Hammersley Sequence sampling demonstrate greater robustness compared to those trained with Latin Hypercube and Random sampling.
Submitted 22 June, 2024;
originally announced June 2024.
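The three sampling schemes compared in the abstract above (random, Latin Hypercube, and Hammersley Sequence) can be illustrated on the unit hypercube with the minimal sketch below. It is not the authors' pipeline: the function names, the prime-base list, and the use of plain Python lists are all assumptions made for illustration, and samples would still need to be rescaled to the physical parameter ranges.

```python
import random

# Hypothetical helper names; unit-cube samples only (rescale to real parameter ranges separately).

def random_sampling(n, d, rng):
    """n independent uniform points in [0, 1)^d."""
    return [[rng.random() for _ in range(d)] for _ in range(n)]

def latin_hypercube(n, d, rng):
    """Stratified sampling: each dimension is cut into n equal bins,
    and every bin contains exactly one point."""
    columns = []
    for _ in range(d):
        strata = list(range(n))
        rng.shuffle(strata)  # random pairing of bins across dimensions
        columns.append([(s + rng.random()) / n for s in strata])
    return [[columns[j][i] for j in range(d)] for i in range(n)]

def radical_inverse(i, base):
    """Mirror the base-`base` digits of i about the radix point."""
    inv, f = 0.0, 1.0 / base
    while i > 0:
        inv += (i % base) * f
        i //= base
        f /= base
    return inv

PRIMES = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]

def hammersley(n, d):
    """Quasi-Monte Carlo point set: first coordinate i/n, remaining
    coordinates are radical inverses in successive prime bases."""
    if d - 1 > len(PRIMES):
        raise ValueError("extend PRIMES for higher dimensions")
    return [[i / n] + [radical_inverse(i, PRIMES[j]) for j in range(d - 1)]
            for i in range(n)]

if __name__ == "__main__":
    rng = random.Random(0)
    print(latin_hypercube(4, 2, rng))
    print(hammersley(4, 2))
```

The Hammersley set is deterministic for a given sample count, whereas the other two depend on the random seed; that difference is one reason the three schemes can cover a high-dimensional space differently at the same sample size.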
-
Instruction Data Generation and Unsupervised Adaptation for Speech Language Models
Authors:
Vahid Noroozi,
Zhehuai Chen,
Somshubra Majumdar,
Steve Huang,
Jagadeesh Balam,
Boris Ginsburg
Abstract:
In this paper, we propose three methods for generating synthetic samples to train and evaluate multimodal large language models capable of processing both text and speech inputs. Addressing the scarcity of samples containing both modalities, synthetic data generation emerges as a crucial strategy to enhance the performance of such systems and facilitate the modeling of cross-modal relationships between the speech and text domains. Our process employs large language models to generate textual components and text-to-speech systems to generate speech components. The proposed methods offer a practical and effective means to expand the training dataset for these models. Experimental results show progress in achieving an integrated understanding of text and speech. We also highlight the potential of using unlabeled speech data to generate synthetic samples comparable in quality to those with available transcriptions, enabling the expansion of these models to more languages.
Submitted 18 June, 2024;
originally announced June 2024.
-
Generative AI Voting: Fair Collective Choice is Resilient to LLM Biases and Inconsistencies
Authors:
Srijoni Majumdar,
Edith Elkind,
Evangelos Pournaras
Abstract:
Scaling up deliberative and voting participation is a longstanding endeavor -- a cornerstone for direct democracy and legitimate collective choice. Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) provide unprecedented opportunities, but also alarming risks for digital democracy. AI personal assistants can overcome cognitive bandwidth limitations of humans, providing decision support capabilities or even direct AI representation of human voters at large scale. However, the quality of this representation and what underlying biases manifest when delegating collective decision making to LLMs is an alarming and timely challenge to tackle. By rigorously emulating with high realism more than 50K LLM voting personas in 81 real-world voting elections, we show that different LLMs (GPT 3, GPT 3.5, and Llama2) come with biases and significant inconsistencies in complex preferential ballot formats, compared to simpler and more consistent majoritarian elections. Strikingly, fair voting aggregation methods, such as equal shares, prove to be a win-win: fairer voting outcomes for humans with fairer AI representation. This novel underlying relationship proves paramount for democratic resilience in progressive scenarios with low voter turnout and voter fatigue supported by AI representatives: voter abstention is mitigated by recovering highly representative voting outcomes that are fairer. These insights provide remarkable foundations for science, policymakers and citizens in explaining and mitigating AI risks in democratic innovations.
Submitted 30 May, 2024;
originally announced June 2024.
-
Nemotron-4 340B Technical Report
Authors:
Nvidia,
Bo Adler,
Niket Agarwal,
Ashwath Aithal,
Dong H. Anh,
Pallab Bhattacharya,
Annika Brundyn,
Jared Casper,
Bryan Catanzaro,
Sharon Clay,
Jonathan Cohen,
Sirshak Das,
Ayush Dattagupta,
Olivier Delalleau,
Leon Derczynski,
Yi Dong,
Daniel Egert,
Ellie Evans,
Aleksander Ficek,
Denys Fridman,
Shaona Ghosh,
Boris Ginsburg,
Igor Gitman,
Tomasz Grzegorzek
, et al. (58 additional authors not shown)
Abstract:
We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and their outputs. These models perform competitively with open access models on a wide range of evaluation benchmarks, and were sized to fit on a single DGX H100 with 8 GPUs when deployed in FP8 precision. We believe that the community can benefit from these models in various research studies and commercial applications, especially for generating synthetic data to train smaller language models. Notably, over 98% of data used in our model alignment process is synthetically generated, showcasing the effectiveness of these models in generating synthetic data. To further support open research and facilitate model development, we are also open-sourcing the synthetic data generation pipeline used in our model alignment process.
Submitted 17 June, 2024;
originally announced June 2024.
-
garak: A Framework for Security Probing Large Language Models
Authors:
Leon Derczynski,
Erick Galinkin,
Jeffrey Martin,
Subho Majumdar,
Nanna Inie
Abstract:
As Large Language Models (LLMs) are deployed and integrated into thousands of applications, the need for scalable evaluation of how models respond to adversarial attacks grows rapidly. However, LLM security is a moving target: models produce unpredictable output, are constantly updated, and the potential adversary is highly diverse: anyone with access to the internet and a decent command of natural language. Further, what constitutes a security weakness in one context may not be an issue in a different context; one-size-fits-all guardrails remain theoretical. In this paper, we argue that it is time to rethink what constitutes ``LLM security'', and pursue a holistic approach to LLM security evaluation, where exploration and discovery of issues are central. To this end, this paper introduces garak (Generative AI Red-teaming and Assessment Kit), a framework which can be used to discover and identify vulnerabilities in a target LLM or dialog system. garak probes an LLM in a structured fashion to discover potential vulnerabilities. The outputs of the framework describe a target model's weaknesses, contribute to an informed discussion of what composes vulnerabilities in unique contexts, and can inform alignment and policy discussions for LLM deployment.
Submitted 16 June, 2024;
originally announced June 2024.
-
On the Carrollian Nature of the Light Front
Authors:
Sucheta Majumdar
Abstract:
Motivated by recent advances in non-Lorentzian physics, we revisit the light-cone formulation of quantum field theories. We discuss some interesting subalgebras within the light-cone Poincaré algebra, with a key emphasis on the Carroll, Bargmann, and Galilean kinds. We show that theories on the light front possess a Hamiltonian of the magnetic Carroll type, thereby proposing a straightforward method for deriving magnetic Carroll Hamiltonian actions from Lorentzian field theories.
Submitted 14 June, 2024;
originally announced June 2024.
-
Optimal demonstration of generalized quantum contextuality
Authors:
Soumyabrata Hazra,
Debashis Saha,
Anubhav Chaturvedi,
Subhankar Bera,
A. S. Majumdar
Abstract:
Finding a set of empirical criteria fulfilled by any theory that satisfies the generalized notion of noncontextuality is a challenging task of both operational and foundational importance. The conventional approach of deriving facet inequalities from the relevant noncontextual polytope is computationally demanding. Specifically, the noncontextual polytope is a product of two polytopes, one for preparations and the other for measurements, and the dimension of the former typically increases polynomially with the number of measurements. This work presents an alternative methodology for constructing a polytope that encompasses the actual noncontextual polytope while ensuring that the dimension of the polytope associated with the preparations remains constant regardless of the number of measurements and their outcome size. In particular, the facet inequalities of this polytope serve as necessary conditions for noncontextuality. To demonstrate the efficacy of our methodology, we apply it to nine distinct contextuality scenarios involving four to nine preparations and two to three measurements to obtain the respective sets of facet inequalities. Additionally, we retrieve the maximum quantum violations of these inequalities. Our investigation uncovers many novel non-trivial noncontextuality inequalities and reveals intriguing aspects and applications of quantum contextual correlations.
Submitted 13 June, 2024;
originally announced June 2024.
-
Resetting by rescaling: exact results for a diffusing particle in one-dimension
Authors:
Marco Biroli,
Yannick Feld,
Alexander K. Hartmann,
Satya N. Majumdar,
Gregory Schehr
Abstract:
In this paper, we study a simple model of a diffusive particle on a line, undergoing a stochastic resetting with rate $r$, via rescaling its current position by a factor $a$, which can be either positive or negative. For $|a|<1$, the position distribution becomes stationary at long times and we compute this limiting distribution exactly for all $|a|<1$. This symmetric distribution has a Gaussian shape near its peak at $x=0$, but decays exponentially for large $|x|$. We also studied the mean first-passage time (MFPT) $T(0)$ to a target located at a distance $L$ from the initial position (the origin) of the particle. As a function of the initial position $x$, the MFPT $T(x)$ satisfies a nonlocal second order differential equation and we have solved it explicitly for $0 \leq a < 1$. For $-1<a\leq 0$, we also solved it analytically but up to a constant factor $κ$ whose value can be determined independently from numerical simulations. Our results show that, for all $-1<a<1$, the MFPT $T(0)$ (starting from the origin) shows a minimum at $r=r^*(a)$. However, the optimised MFPT $T_{\rm opt}(a)$ turns out to be a monotonically increasing function of $a$ for $-1<a<1$. This demonstrates that, compared to the standard resetting to the origin ($a=0$), while the positive rescaling is not beneficial for the search of a target, the negative rescaling is. Thus resetting via rescaling followed by a reflection around the origin expedites the search of a target in one dimension.
Submitted 12 June, 2024;
originally announced June 2024.
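The resetting-by-rescaling dynamics described in the abstract above can be explored numerically with a minimal sketch such as the following. It is not the authors' code: the discrete-time scheme, the function names, and the parameter values are assumptions chosen for illustration. The comparison value comes from a simple second-moment balance that is not stated in the abstract: diffusion adds $2D$ per unit time while each rescaling multiplies $x^2$ by $a^2$, so $d\langle x^2\rangle/dt = 2D - r(1-a^2)\langle x^2\rangle$, which gives a stationary variance $2D/[r(1-a^2)]$ for $|a|<1$.

```python
import math
import random

def simulate_rescaling_reset(a, r, D, dt, n_steps, seed=0):
    """Time-averaged <x^2> for a 1D diffusing particle whose position is
    rescaled x -> a*x at rate r (simple discrete-time scheme, an assumption)."""
    rng = random.Random(seed)
    sigma = math.sqrt(2.0 * D * dt)   # std of one diffusive increment
    p_reset = r * dt                  # probability of a rescaling event per step
    x = 0.0
    sum_sq = 0.0
    for _ in range(n_steps):
        if rng.random() < p_reset:
            x *= a                    # resetting via rescaling (a < 0 also reflects)
        else:
            x += sigma * rng.gauss(0.0, 1.0)
        sum_sq += x * x
    return sum_sq / n_steps

def stationary_variance(a, r, D):
    """Stationary <x^2> from the second-moment balance, valid for |a| < 1."""
    return 2.0 * D / (r * (1.0 - a * a))

if __name__ == "__main__":
    a, r, D = 0.5, 1.0, 1.0
    estimate = simulate_rescaling_reset(a, r, D, dt=0.01, n_steps=400_000)
    print(estimate, stationary_variance(a, r, D))
```

Setting `a = 0` recovers standard resetting to the origin, and a negative `a` combines the rescaling with a reflection around the origin, the case the abstract identifies as beneficial for target search.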
-
Impact of the Epoch of Reionization sources on the 21-cm bispectrum
Authors:
Leon Noble,
Mohd Kamran,
Suman Majumdar,
Chandra Shekhar Murmu,
Raghunath Ghara,
Garrelt Mellema,
Ilian T. Iliev,
Jonathan R. Pritchard
Abstract:
The morphology of the 21-cm signal emitted by the neutral hydrogen present in the intergalactic medium (IGM) during the Epoch of Reionization (EoR) depends both on the properties of the sources of ionizing radiation and on the underlying physical processes within the IGM. Variation in the morphology of the IGM 21-cm signal due to the different sources of the EoR is expected to have a significant impact on the 21-cm bispectrum, which is one of the crucial observable statistics that can evaluate the non-Gaussianity present in the signal and which can be estimated from radio interferometric observations of the EoR. Here we present the 21-cm bispectrum for different reionization scenarios assuming different simulated models for the sources of reionization. We also demonstrate how well the 21-cm bispectrum can distinguish between different IGM 21-cm signal morphologies, arising due to the differences in the reionization scenarios, which will help us shed light on the nature of the sources of ionizing photons. Our estimated large-scale bispectrum for all unique $k$-triangle shapes shows a significant difference in their magnitude and sign across different reionization scenarios. Additionally, our focused analysis of bispectrum for a few specific $k$-triangle shapes (e.g. squeezed-limit, linear, and shapes in the vicinity of the squeezed-limit) shows that the large scale 21-cm bispectrum can distinguish between reionization scenarios that show inside-out, outside-in and a combination of inside-out and outside-in morphologies. These results highlight the potential of using the 21-cm bispectrum for constraining different reionization scenarios.
Submitted 5 June, 2024;
originally announced June 2024.
-
Number of distinct and common sites visited by $N$ independent random walkers
Authors:
Satya N. Majumdar,
Gregory Schehr
Abstract:
In this Chapter, we consider a model of $N$ independent random walkers, each of duration $t$, and each starting from the origin, on a lattice in $d$ dimensions. We focus on two observables, namely $D_N(t)$ and $C_N(t)$ denoting respectively the number of distinct and common sites visited by the walkers. For large $t$, where the lattice random walkers converge to independent Brownian motions, we compute exactly the mean $\langle D_N(t) \rangle$ and $\langle C_N(t) \rangle$. Our main interest is in the $N$-dependence of these quantities. While for $\langle D_N(t) \rangle$ the $N$-dependence only appears in the prefactor of the power-law growth with time, a more interesting behavior emerges for $\langle C_N(t) \rangle$. For this latter case, we show that there is a ``phase transition'' in the $(N, d)$ plane where the two critical lines $d=2$ and $d=d_c(N) = 2N/(N-1)$ separate three phases of the growth of $\langle C_N(t)\rangle$. The results are extended to the mean number of sites visited exactly by $K$ of the $N$ walkers. Furthermore in $d=1$, the full distributions of $D_N(t)$ and $C_N(t)$ are computed, exploiting a mapping to the extreme value statistics. Extensions to two other models, namely $N$ independent Brownian bridges and $N$ independent resetting Brownian motions/bridges are also discussed.
Submitted 31 May, 2024;
originally announced May 2024.
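As a quick worked illustration of the critical line quoted in the abstract above (simple arithmetic on the stated formula, not an additional result from the chapter): $d_c(2) = 4$, $d_c(3) = 3$, $d_c(4) = 8/3$, and $d_c(N) \to 2$ as $N \to \infty$, so the window $2 < d < d_c(N)$ between the two critical lines narrows as more walkers are added.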
-
Importance Sampling for counting statistics in one-dimensional systems
Authors:
Ivan N. Burenev,
Satya N. Majumdar,
Alberto Rosso
Abstract:
In this paper, we consider the problem of numerical investigation of the counting statistics for a class of one-dimensional systems. Importance sampling, the cornerstone technique usually implemented for such problems, critically hinges on selecting an appropriate biased distribution. While exponential tilt in the observable stands as the conventional choice for various problems, its efficiency in the context of counting statistics may be significantly hindered by the genuine discreteness of the observable. To address this challenge, we propose an alternative strategy which we call importance sampling with the local tilt. We demonstrate the efficiency of the proposed approach through the analysis of three prototypical examples: a set of independent Gaussian random variables, Dyson gas, and Symmetric Simple Exclusion Process (SSEP) with a steplike initial condition.
Submitted 28 May, 2024;
originally announced May 2024.
-
Representation noising effectively prevents harmful fine-tuning on LLMs
Authors:
Domenic Rosati,
Jan Wehner,
Kai Williams,
Łukasz Bartoszcze,
David Atanasov,
Robie Gonzales,
Subhabrata Majumdar,
Carsten Maple,
Hassan Sajjad,
Frank Rudzicz
Abstract:
Releasing open-source large language models (LLMs) presents a dual-use risk since bad actors can easily fine-tune these models for harmful purposes. Even without the open release of weights, weight stealing and fine-tuning APIs make closed models vulnerable to harmful fine-tuning attacks (HFAs). While safety measures like preventing jailbreaks and improving safety guardrails are important, such measures can easily be reversed through fine-tuning. In this work, we propose Representation Noising (RepNoise), a defence mechanism that is effective even when attackers have access to the weights and the defender no longer has any control. RepNoise works by removing information about harmful representations such that it is difficult to recover them during fine-tuning. Importantly, our defence is also able to generalize across different subsets of harm that have not been seen during the defence process. Our method does not degrade the general capability of LLMs and retains the ability to train the model on harmless tasks. We provide empirical evidence that the effectiveness of our defence lies in its "depth": the degree to which information about harmful representations is removed across all layers of the LLM.
Submitted 23 May, 2024;
originally announced May 2024.
-
Flux rope modeling of the 2022 Sep 5 CME observed by Parker Solar Probe and Solar Orbiter from 0.07 to 0.69 au
Authors:
Emma E. Davies,
Hannah T. Rüdisser,
Ute V. Amerstorfer,
Christian Möstl,
Maike Bauer,
Eva Weiler,
Tanja Amerstorfer,
Satabdwa Majumdar,
Phillip Hess,
Andreas J. Weiss,
Martin A. Reiss,
Lucie M. Green,
David M. Long,
Teresa Nieves-Chinchilla,
Domenico Trotta,
Timothy S. Horbury,
Helen O'Brien,
Edward Fauchon-Jones,
Jean Morris,
Christopher J. Owen,
Stuart D. Bale,
Justin C. Kasper
Abstract:
As both Parker Solar Probe (PSP) and Solar Orbiter (SolO) reach heliocentric distances closer to the Sun, they present an exciting opportunity to study the structure of CMEs in the inner heliosphere. We present an analysis of the global flux rope structure of the 2022 September 5 CME event that impacted PSP at a heliocentric distance of only 0.07 au and SolO at 0.69 au. We compare in situ measurements at PSP and SolO to determine global and local expansion measures, finding a good agreement between magnetic field relationships with heliocentric distance, but significant differences with respect to flux rope size. We use PSP/WISPR images as input to the ELEvoHI model, providing a direct link between remote and in situ observations; we find a large discrepancy between the resulting modeled arrival times, suggesting that the underlying model assumptions may not be suitable when using data obtained close to the Sun, where the drag regime is markedly different in comparison to larger heliocentric distances. Finally, we fit the SolO/MAG and PSP/FIELDS data independently with the 3DCORE model and find that many parameters are consistent between spacecraft; however, challenges are apparent when reconstructing a global 3D structure that aligns with arrival times at PSP and Solar Orbiter, likely due to the large radial and longitudinal separations between spacecraft. From our model results, it is clear that the solar wind background speed and drag regime strongly affect the modeled expansion and propagation of CMEs and need to be taken into consideration.
Submitted 17 May, 2024;
originally announced May 2024.
-
Power-law relaxation of a confined diffusing particle subject to resetting with memory
Authors:
Denis Boyer,
Satya N. Majumdar
Abstract:
We study the relaxation of a Brownian particle with long range memory under confinement in one dimension. The particle diffuses in an arbitrary confining potential and resets at random times to previously visited positions, chosen with a probability proportional to the local time spent there by the particle since the initial time. This model mimics an animal which moves erratically in its home range and returns preferentially to familiar places from time to time, as observed in nature. The steady state density of the position is given by the equilibrium Boltzmann-Gibbs distribution, as in standard diffusion, while the transient part of the density can be obtained through a mapping of the Fokker-Planck equation of the process to a Schrödinger eigenvalue problem. Due to memory, the approach at large time toward the steady state is critically self-organised, in the sense that it always follows a sluggish power-law form, in contrast to the exponential decay that characterises Markov processes. The exponent of this power-law depends in a simple way on the resetting rate and on the leading relaxation rate of the Brownian particle in the absence of resetting. We apply these findings to several exactly solvable examples, such as the harmonic, V-shaped and box potentials.
Submitted 22 May, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
PARSAC: Fast, Human-quality Floorplanning for Modern SoCs with Complex Design Constraints
Authors:
Hesham Mostafa,
Uday Mallappa,
Mikhail Galkin,
Mariano Phielipp,
Somdeb Majumdar
Abstract:
The floorplanning of Systems-on-a-Chip (SoCs) and of chip sub-systems is a crucial step in the physical design flow as it determines the optimal shapes and locations of the blocks that make up the system. Simulated Annealing (SA) has been the method of choice for tackling classical floorplanning problems where the objective is to minimize wire-length and the total placement area. The goal in industry-relevant floorplanning problems, however, is not only to minimize area and wire-length, but to do that while respecting hard placement constraints that specify the general area and/or the specific locations for the placement of some blocks. We show that simply incorporating these constraints into the SA objective function leads to sub-optimal, and often illegal, solutions. We propose the Constraints-Aware Simulated Annealing (CA-SA) method and show that it strongly outperforms vanilla SA in floorplanning problems with hard placement constraints. We developed a new floorplanning tool on top of CA-SA: PARSAC (Parallel Simulated Annealing with Constraints). PARSAC is an efficient, easy-to-use, and massively parallel floorplanner. Unlike current SA-based or learning-based floorplanning tools that cannot effectively incorporate hard placement-constraints, PARSAC can quickly construct the Pareto-optimal legal solutions front for constrained floorplanning problems. PARSAC also outperforms traditional SA on legacy floorplanning benchmarks. PARSAC is available as an open-source repository for researchers to replicate and build on our result.
Submitted 8 May, 2024;
originally announced May 2024.
-
FloorSet -- a VLSI Floorplanning Dataset with Design Constraints of Real-World SoCs
Authors:
Uday Mallappa,
Hesham Mostafa,
Mikhail Galkin,
Mariano Phielipp,
Somdeb Majumdar
Abstract:
Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of nearly 10E250. As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark th…
▽ More
Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of nearly 10E250. As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark that comprises a large training dataset and performance metrics that better reflect real-world constraints and objectives compared to existing benchmarks. To address this need, we present FloorSet -- two comprehensive datasets of synthetic fixed-outline floorplan layouts that reflect the distribution of real SoCs. Each dataset has 1M training samples and 100 test samples where each sample is a synthetic floor-plan. FloorSet-Prime comprises fully-abutted rectilinear partitions and near-optimal wire-length. A simplified dataset that reflects early design phases, FloorSet-Lite comprises rectangular partitions, with under 5 percent white-space and near-optimal wire-length. Both datasets define hard constraints seen in modern design flows such as shape constraints, edge-affinity, grou** constraints, and pre-placement constraints. FloorSet is intended to spur fundamental research on large-scale constrained optimization problems. Crucially, FloorSet alleviates the core issue of reproducibility in modern ML driven solutions to such problems. FloorSet is available as an open-source repository for the research community.
Submitted 27 June, 2024; v1 submitted 8 May, 2024;
originally announced May 2024.
-
Fair Voting Outcomes with Impact and Novelty Compromises? Unraveling Biases of Equal Shares in Participatory Budgeting
Authors:
Sajan Maharjan,
Srijoni Majumdar,
Evangelos Pournaras
Abstract:
Participatory budgeting, as a paradigm for democratic innovations, engages citizens in the distribution of a public budget to projects, which they propose and vote for implementation. So far, voting algorithms have been devised and studied in social choice literature to elect projects that are popular, while others prioritize a proportional representation of voters' preferences, for instance, equal shares. However, the anticipated impact and novelty in the broader society by the winning projects, as selected by different algorithms, remain totally under-explored, lacking both a universal theory of impact for voting and a rigorous framework for impact and novelty assessments. This paper tackles this grand challenge towards new axiomatic foundations for designing effective and fair voting methods. This is via new and striking insights derived from a large-scale analysis of biases over 345 real-world voting outcomes, characterized for the first time by a novel portfolio of impact and novelty metrics. We find strong causal evidence that equal shares comes with impact loss in several infrastructural projects of different cost levels that have been so far over-represented. However, it also comes with a novel, yet over-represented, impact gain in welfare, education and culture. We discuss broader implications of these results and how impact loss can be mitigated at the stage of campaign design and project ideation.
Submitted 9 May, 2024; v1 submitted 8 May, 2024;
originally announced May 2024.
-
Project Hephaistos - II. Dyson sphere candidates from Gaia DR3, 2MASS, and WISE
Authors:
Matías Suazo,
Erik Zackrisson,
Priyatam K. Mahto,
Fabian Lundell,
Carl Nettelblad,
Andreas J. Korn,
Jason T. Wright,
Suman Majumdar
Abstract:
The search for extraterrestrial intelligence is currently being pursued using multiple techniques and in different wavelength bands. Dyson spheres, megastructures that could be constructed by advanced civilizations to harness the radiation energy of their host stars, represent a potential technosignature that, in principle, may be hiding in public data already collected as part of large astronomical surveys. In this study, we present a comprehensive search for partial Dyson spheres by analyzing optical and infrared observations from Gaia, 2MASS, and WISE. We develop a pipeline that employs multiple filters to identify potential candidates and reject interlopers in a sample of five million objects, and that incorporates a convolutional neural network to help identify confusion in WISE data. Finally, the pipeline identifies 7 candidates deserving of further analysis. All of these objects are M-dwarfs, for which astrophysical phenomena cannot easily account for the observed infrared excess emission.
Submitted 5 May, 2024;
originally announced May 2024.
-
Does carrier localization affect the anomalous Hall effect?
Authors:
Prasanta Chowdhury,
Mohamad Numan,
Shuvankar Gupta,
Souvik Chatterjee,
Saurav Giri,
Subham Majumdar
Abstract:
The effect of carrier localization due to electron-electron interaction in the anomalous Hall effect is elusive and there are contradictory results in the literature. To address the issue, we report here a detailed transport study, including Hall measurements, on the $β$-Mn type cubic compound Co$_7$Zn$_7$Mn$_6$ with chiral crystal structure, which lacks global mirror symmetry. The alloy orders magnetically below $T_c$ = 204 K, and is reported to show a spin glass state at low temperature. The longitudinal resistivity ($ρ_{xx}$) shows a pronounced upturn below $T_{min}$ = 75 K, which is found to be associated with carrier localization due to quantum interference effect. The upturn in $ρ_{xx}$ shows a $T^{1/2}$ dependence and it is practically insensitive to the externally applied magnetic field, which indicates that electron-electron interaction is primarily responsible for the low-$T$ upturn. The studied sample shows a considerable anomalous Hall effect below $T_c$. We found that the localization effect is present in the ordinary Hall coefficient ($R_0$), but we failed to observe any signature of localization in the anomalous Hall resistivity or conductivity. The absence of localization effect in the anomalous Hall effect in Co$_7$Zn$_7$Mn$_6$ may be due to large carrier density, and it warrants further theoretical investigations, particularly with systems having broken mirror symmetry.
Submitted 25 April, 2024;
originally announced April 2024.
-
WaveCatBoost for Probabilistic Forecasting of Regional Air Quality Data
Authors:
Jintu Borah,
Tanujit Chakraborty,
Md. Shahrul Md. Nadzir,
Mylene G. Cayetano,
Shubhankar Majumdar
Abstract:
Accurate and reliable air quality forecasting is essential for protecting public health, sustainable development, pollution control, and enhanced urban planning. This letter presents a novel WaveCatBoost architecture designed to forecast the real-time concentrations of air pollutants by combining the maximal overlapping discrete wavelet transform (MODWT) with the CatBoost model. This hybrid approach efficiently transforms time series into high-frequency and low-frequency components, thereby extracting signal from noise and improving prediction accuracy and robustness. Evaluation of two distinct regional datasets, from the Central Air Pollution Control Board (CPCB) sensor network and a low-cost air quality sensor system (LAQS), underscores the superior performance of our proposed methodology in real-time forecasting compared to the state-of-the-art statistical and deep learning architectures. Moreover, we employ a conformal prediction strategy to provide probabilistic bands with our forecasts.
Submitted 8 April, 2024;
originally announced April 2024.
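The "probabilistic bands" mentioned in the WaveCatBoost abstract can be illustrated with a minimal split-conformal sketch. The abstract does not say which conformal variant is used, so the split-conformal recipe, the function names, and the toy calibration numbers below are all assumptions made for illustration, not the authors' procedure.

```python
import math

def conformal_halfwidth(calib_true, calib_pred, alpha=0.1):
    """Half-width of a split-conformal band at miscoverage level alpha.

    Assumption: absolute residuals on a held-out calibration set are used
    as nonconformity scores (a common choice, not confirmed by the abstract).
    """
    scores = sorted(abs(y - p) for y, p in zip(calib_true, calib_pred))
    n = len(scores)
    # Finite-sample corrected rank of the score to use as the quantile.
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        # Too few calibration points for this alpha: the band is unbounded.
        return math.inf
    return scores[rank - 1]

def prediction_band(point_forecast, halfwidth):
    """Symmetric band around a point forecast."""
    return point_forecast - halfwidth, point_forecast + halfwidth

# Hypothetical calibration data (observed vs. forecast pollutant levels).
calib_true = [10.0, 12.0, 9.0, 11.0, 13.0, 10.5, 9.5, 12.5, 11.5]
calib_pred = [10.5, 11.0, 9.2, 11.1, 12.0, 10.0, 10.0, 12.0, 11.0]

q = conformal_halfwidth(calib_true, calib_pred, alpha=0.2)
low, high = prediction_band(11.0, q)
print(q, low, high)
```

The point forecaster is treated as a black box here, which is why the same wrapper could sit on top of any model, wavelet-based or not.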
-
Noninteracting particles in a harmonic trap with a stochastically driven center
Authors:
Sanjib Sabhapandit,
Satya N. Majumdar
Abstract:
We study a system of $N$ noninteracting particles on a line in the presence of a harmonic trap $U(x)=μ\bigl[x-z(t)\bigr]^2/2$, where the trap center $z(t)$ undergoes a bounded stochastic modulation. We show that this stochastic modulation drives the system into a nonequilibrium stationary state, where the joint distribution of the positions of the particles is not factorizable. This indicates strong correlations between the positions of the particles that are not inbuilt, but rather get generated by the dynamics itself. Moreover, we show that the stationary joint distribution can be fully characterized and has a special conditionally independent and identically distributed (CIID) structure. This special structure allows us to compute several observables analytically even in such a strongly correlated system, for an arbitrary bounded drive $z(t)$. These observables include the average density profile, the correlations between particle positions, the order and gap statistics, as well as the full counting statistics. We then apply our general results to two specific examples where (i) $z(t)$ represents a dichotomous telegraphic noise, and (ii) $z(t)$ represents an Ornstein-Uhlenbeck process. Our analytical predictions are verified in numerical simulations, finding excellent agreement.
Submitted 3 April, 2024;
originally announced April 2024.
-
Minimizing the Profligacy of Searches with Reset
Authors:
John C. Sunil,
Richard A. Blythe,
Martin R. Evans,
Satya N. Majumdar
Abstract:
We introduce the profligacy of a search process as a competition between its expected cost and the probability of finding the target. The arbiter of the competition is a parameter $λ$ that represents how much a searcher invests into increasing the chance of success. Minimizing the profligacy with respect to the search strategy specifies the optimal search. We show that in the case of diffusion with stochastic resetting, the amount of resetting in the optimal strategy has a highly nontrivial dependence on model parameters resulting in classical continuous transitions, discontinuous transitions and tricritical points as well as non-standard discontinuous transitions exhibiting re-entrant behavior and overhangs.
Submitted 29 March, 2024;
originally announced April 2024.
-
Full counting statistics of 1d short-range Riesz gases in confinement
Authors:
Jitendra Kethepalli,
Manas Kulkarni,
Anupam Kundu,
Satya N. Majumdar,
David Mukamel,
Grégory Schehr
Abstract:
We investigate the full counting statistics (FCS) of a harmonically confined 1d short-range Riesz gas consisting of $N$ particles in equilibrium at finite temperature. The particles interact with each other through a repulsive power-law interaction with an exponent $k>1$ which includes the Calogero-Moser model for $k=2$. We examine the probability distribution of the number of particles in a finite domain $[-W, W]$, called the number distribution and denoted by $\mathcal{N}(W, N)$. We analyze the probability distribution of $\mathcal{N}(W, N)$ and show that it exhibits a large deviation form for large $N$ characterised by a speed $N^{\frac{3k+2}{k+2}}$ and by a large deviation function of the fraction $c = \mathcal{N}(W, N)/N$ of the particles inside the domain and $W$. We show that the density profiles that create the large deviations display interesting shape transitions as one varies $c$ and $W$. This is manifested by a third-order phase transition exhibited by the large deviation function that has discontinuous third derivatives. Monte-Carlo (MC) simulations show good agreement with our analytical expressions for the corresponding density profiles. We find that the typical fluctuations of $\mathcal{N}(W, N)$, obtained from our field theoretic calculations, are Gaussian distributed with a variance that scales as $N^{ν_k}$, with $ν_k = (2-k)/(2+k)$. We also present some numerical findings on the mean and the variance. Furthermore, we adapt our formalism to study the index distribution (where the domain is semi-infinite, $(-\infty, W]$), linear statistics (the variance), thermodynamic pressure and bulk modulus.
Submitted 27 March, 2024;
originally announced March 2024.
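The two scaling exponents quoted in the Riesz gas abstract, the large-deviation speed $N^{(3k+2)/(k+2)}$ and the variance $N^{ν_k}$ with $ν_k = (2-k)/(2+k)$, can be tabulated with exact rational arithmetic. This is only a small worked evaluation of the formulas as stated; the function names are hypothetical.

```python
from fractions import Fraction

def speed_exponent(k):
    """Exponent of N in the large-deviation speed, (3k + 2) / (k + 2)."""
    k = Fraction(k)
    if k <= 1:
        raise ValueError("the abstract considers short-range exponents k > 1")
    return (3 * k + 2) / (k + 2)

def variance_exponent(k):
    """Exponent nu_k of N in the variance, (2 - k) / (2 + k)."""
    k = Fraction(k)
    if k <= 1:
        raise ValueError("the abstract considers short-range exponents k > 1")
    return (2 - k) / (2 + k)

# k = 2 is the Calogero-Moser case mentioned in the abstract.
for k in (Fraction(3, 2), 2, 3):
    print(k, speed_exponent(k), variance_exponent(k))
```

Evaluating the stated formula at $k=2$ gives $ν_k = 0$, and for $k>2$ the exponent is negative, so at the level of this power law the variance does not grow with $N$ in those cases.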
-
Cost of excursions until first crossing of the origin for random walks and Lévy flights: an exact general formula
Authors:
Francesco Mori,
Satya N. Majumdar,
Pierpaolo Vivo
Abstract:
We consider a discrete-time random walk on a line starting at $x_0\geq 0$ where a cost is incurred at each jump. We obtain an exact analytical formula for the distribution of the total cost of a trajectory until the process crosses the origin for the first time. The formula is valid for arbitrary jump distribution and cost function (heavy- and light-tailed alike), provided they are symmetric and continuous. We analyze the formula in different scaling regimes, and find a high degree of universality with respect to the details of the jump distribution and the cost function. Applications are given to the motion of an active run-and-tumble particle in one dimension and extensions to multiple cost variables are considered. The analytical results are in perfect agreement with numerical simulations.
Submitted 20 May, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
Decorrelation of a leader by the increasing number of followers
Authors:
Satya N. Majumdar,
Gregory Schehr
Abstract:
We compute the connected two-time correlator of the maximum $M_N(t)$ of $N$ independent Gaussian stochastic processes (GSP) characterised by a common correlation coefficient $ρ$ that depends on the two times $t_1$ and $t_2$. We show analytically that this correlator, for fixed times $t_1$ and $t_2$, decays for large $N$ as a power law $N^{-γ}$ (with logarithmic corrections) with a decorrelation exponent $γ= (1-ρ)/(1+ ρ)$ that depends only on $ρ$, but otherwise is universal for any GSP. We study several examples of physical processes including the fractional Brownian motion (fBm) with Hurst exponent $H$ and the Ornstein-Uhlenbeck (OU) process. For the fBm, $ρ$ is only a function of $τ= \sqrt{t_1/t_2}$ and we find an interesting ``freezing'' transition at a critical value $τ= τ_c=(3-\sqrt{5})/2$. For $τ< τ_c$, there is an optimal $H^*(τ) > 0$ that maximises the exponent $γ$ and this maximal value freezes to $γ= 1/3$ for $τ>τ_c$. For the OU process, we show that $γ= {\rm tanh}(μ\,|t_1-t_2|/2)$ where $μ$ is the stiffness of the harmonic trap. Numerical simulations confirm our analytical predictions.
Submitted 11 March, 2024;
originally announced March 2024.
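The two expressions for the decorrelation exponent in the abstract above, the general $γ = (1-ρ)/(1+ρ)$ and the Ornstein-Uhlenbeck result $γ = \tanh(μ|t_1-t_2|/2)$, can be checked against each other numerically. The sketch assumes the stationary OU correlation coefficient $ρ = e^{-μ|t_1-t_2|}$, which the abstract does not state explicitly; the function names and parameter values are hypothetical.

```python
import math

def decorrelation_exponent(rho):
    """General exponent gamma = (1 - rho) / (1 + rho) from the abstract."""
    return (1.0 - rho) / (1.0 + rho)

def ou_correlation(mu, t1, t2):
    """Assumed stationary OU correlation coefficient exp(-mu * |t1 - t2|)."""
    return math.exp(-mu * abs(t1 - t2))

# Arbitrary illustrative parameters.
mu, t1, t2 = 0.7, 1.0, 3.5

gamma_general = decorrelation_exponent(ou_correlation(mu, t1, t2))
gamma_closed = math.tanh(mu * abs(t1 - t2) / 2.0)

# Critical ratio for the fBm freezing transition quoted in the abstract.
tau_c = (3.0 - math.sqrt(5.0)) / 2.0

print(gamma_general, gamma_closed, tau_c)
```

Under that assumption the agreement is just the identity $(1-e^{-x})/(1+e^{-x}) = \tanh(x/2)$ with $x = μ|t_1-t_2|$.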
-
Correcting Projection Effects in CMEs using GCS-based Large Statistics of Multi-viewpoint Observations
Authors:
Harshita Gandhi,
Ritesh Patel,
Vaibhav Pant,
Satabdwa Majumdar,
Sanchita Pal,
Dipankar Banerjee,
Huw Morgan
Abstract:
This study addresses the limitations of single-viewpoint observations of Coronal Mass Ejections (CMEs) by presenting results from a 3D catalog of 360 CMEs during solar cycle 24, fitted using the GCS model. The dataset combines 326 previously analyzed CMEs and 34 newly examined events, categorized by their source regions into active region (AR) eruptions, active prominence (AP) eruptions, and prominence eruptions (PE). Estimates of errors are made using a bootstrapping approach. The findings highlight that the average 3D speed of CMEs is $\sim$1.3 times greater than the 2D speed. PE CMEs tend to be slow, with an average speed of 432 km $s^{-1}$. AR and AP speeds are higher, at 723 km $s^{-1}$ and 813 km $s^{-1}$, respectively, with the latter having fewer slow CMEs. The distinctive behavior of AP CMEs is attributed to factors like overlying magnetic field distribution or geometric complexities leading to less accurate GCS fits. A linear fit of projected speed to width gives a gradient of 2 km $s^{-1}deg^{-1}$, which increases to 5 km $s^{-1}deg^{-1}$ when the GCS-fitted `true' parameters are used. Notably, AR CMEs exhibit a high gradient of 7 km $s^{-1}deg^{-1}$, while AP CMEs show a gradient of 4 km $s^{-1}deg^{-1}$. PE CMEs, however, lack a significant speed-width relationship. We show that fitting multi-viewpoint CME images to a geometrical model such as GCS is important to study the statistical properties of CMEs, and can lead to a deeper insight into CME behavior that is essential for improving future space weather forecasting.
Submitted 11 February, 2024;
originally announced February 2024.
-
Universal distribution of the number of minima for random walks and Lévy flights
Authors:
Anupam Kundu,
Satya N. Majumdar,
Gregory Schehr
Abstract:
We compute exactly the full distribution of the number $m$ of local minima in a one-dimensional landscape generated by a random walk or a Lévy flight. We consider two different ensembles of landscapes, one with a fixed number of steps $N$ and the other till the first-passage time of the random walk to the origin. We show that the distribution of $m$ is drastically different in the two ensembles (Gaussian in the former case, while having a power-law tail $m^{-3/2}$ in the latter case). However, the most striking aspect of our results is that, in each case, the distribution is completely universal for all $m$ (and not just for large $m$), i.e., independent of the jump distribution in the random walk. This means that the distributions are exactly identical for Lévy flights and random walks with finite jump variance. Our analytical results are in excellent agreement with our numerical simulations.
Submitted 14 February, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
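A small sketch can make the universality claim in the abstract above plausible for the fixed-$N$ ensemble. It is an illustration only, not the paper's derivation, and it assumes that a local minimum means a strict interior minimum of the path. Under that assumption, step $i$ is a minimum exactly when a negative jump is followed by a positive one, so the count depends only on the sequence of jump signs. For symmetric, continuous jump distributions those signs are independent fair coin flips, whatever the tails. The function names and the sample jumps are hypothetical.

```python
from itertools import accumulate

def count_minima_from_path(jumps, x0=0.0):
    """Count strict interior local minima of the walk built from the jumps."""
    path = list(accumulate(jumps, initial=x0))
    return sum(
        1
        for i in range(1, len(path) - 1)
        if path[i] < path[i - 1] and path[i] < path[i + 1]
    )

def count_minima_from_signs(jumps):
    """Count positions where a negative jump is followed by a positive one."""
    return sum(1 for a, b in zip(jumps, jumps[1:]) if a < 0 and b > 0)

# Hand-picked jumps (exactly representable in binary floating point).
jumps = [1.0, -0.5, 2.0, -3.0, -1.0, 0.5, 0.25, -0.125]

m_path = count_minima_from_path(jumps)
m_signs = count_minima_from_signs(jumps)
print(m_path, m_signs)
```

This sign argument does not by itself cover the first-passage ensemble, where the stopping time depends on the jump magnitudes as well.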
-
Single and entangled atomic systems in thermal bath and the Fulling-Davies-Unruh effect
Authors:
Arnab Mukherjee,
Sunandan Gangopadhyay,
Archan S. Majumdar
Abstract:
We revisit the Fulling-Davies-Unruh effect in the context of two-level single and entangled atomic systems that are either uniformly accelerated or static in a thermal bath. We consider the interaction between the systems and a massless scalar field, covering the scenarios of free space as well as within a cavity. Through the calculation of atomic transition rates, it is found that in free space there is an equivalence between an atom that is uniformly accelerated with respect to an observer and a single atom that is static with respect to the observer and immersed in a thermal bath, as long as the temperature of the thermal bath matches the Unruh temperature. This equivalence breaks down in the presence of a cavity. For two-atom systems, we consider the initial state to be in a general pure entangled form. We find that in this case, the equivalence between the accelerated and static thermal bath scenarios holds only under specific limiting conditions in free space but breaks down completely in a cavity set-up.
Submitted 25 January, 2024;
originally announced February 2024.
-
Lessons from discrete light-cone quantization for physics at null infinity: Bosons in two dimensions
Authors:
Glenn Barnich,
Sucheta Majumdar,
Simone Speziale,
Wen-Di Tan
Abstract:
Motivated by issues in the context of asymptotically flat spacetimes at null infinity, we discuss in the simplest example of a massless scalar field in two dimensions several subtleties that arise when setting up the canonical formulation on a single or on two intersecting null hyperplanes with a special emphasis on the infinite-dimensional global and conformal symmetries and their canonical generators, the free data, a consistent treatment of zero modes, matching conditions, and implications for quantization of massless versus massive fields.
Submitted 20 May, 2024; v1 submitted 26 January, 2024;
originally announced January 2024.
-
Work Distribution for Unzipping Processes
Authors:
P. Werner,
A. K. Hartmann,
S. N. Majumdar
Abstract:
A simple zipper model is introduced, representing in a simplified way, e.g., the folded DNA double helix or hairpin structures in RNA. The double stranded hairpin is connected to a heat bath at temperature $T$ and subject to an external force $f$, which couples to the free length $L$ of the unzipped sequence. Increasing the force leads to a zipping/unzipping first-order phase transition at a critical force $f_c(T)$ in the thermodynamic limit of a very large chain. We compute analytically, as a function of temperature $T$ and force $f$, the full distribution $P(L)$ of free lengths in the thermodynamic limit and show that it is qualitatively very different for $f<f_c$, $f=f_c$ and $f>f_c$. Next we consider quasistatic work processes where the force is incremented according to a linear protocol. Having obtained $P(L)$ already allows us to derive an analytical expression for the work distribution $P(W)$ in the zipped phase $f<f_c$ for a long chain. We compute the large-deviation tails of the work distribution explicitly. Our analytical result for the work distribution is compared over a large range of the support down to probabilities as small as $10^{-200}$ with numerical simulations, which were performed by applying sophisticated large-deviation algorithms.
Submitted 17 January, 2024;
originally announced January 2024.
-
Send Message to the Future? Blockchain-based Time Machines for Decentralized Reveal of Locked Information
Authors:
Zhuolun Li,
Srijoni Majumdar,
Evangelos Pournaras
Abstract:
Conditional information reveal systems automate the release of information upon meeting specific predefined conditions, such as time or location. This paper introduces a breakthrough in the understanding, design and application of conditional information reveal systems that are highly secure and decentralized. By designing a new practical timed-release cryptography system and a verifiable secret sharing scheme, a novel data sharing system is devised on the blockchain that `sends messages in the future' with highly accurate decryption times. This paper provides a complete evaluation portfolio of this pioneering paradigm, including analytical results, a validation of its robustness in the Tamarin Prover and a performance evaluation of a real-world, open-source system prototype deployed across the globe. Using real-world election data, we also demonstrate the applicability of this innovative system in e-voting, illustrating its capacity to secure and ensure fair electronic voting processes.
Submitted 24 May, 2024; v1 submitted 11 January, 2024;
originally announced January 2024.
-
Information entropy in excited states in confined quantum systems
Authors:
Sangita Majumdar,
Neetik Mukherjee,
Amlan K. Roy
Abstract:
The present contribution constitutes a brief account of information theoretical analysis in several representative model as well as real quantum mechanical systems. There has been an overwhelming interest in studying such measures in various quantum systems, as evidenced by the vast number of publications that have appeared in the literature in recent years. However, while such works are numerous in so-called \emph{free} systems, there is a genuine lack of these in their constrained counterparts. With this in mind, this chapter will focus on some of the recent exciting progress that has been witnessed in our laboratory \cite{sen06,roy14mpla,roy14mpla_manning,roy15ijqc, roy16ijqc, mukherjee15,mukherjee16,majumdar17,mukherjee18a,mukherjee18b,mukherjee18c,mukherjee18d,majumdar20,mukherjee21,majumdar21a, majumdar21b}, and elsewhere, with special emphasis on the following prototypical systems, namely, (i) double well (DW) potential (symmetric and asymmetric) (ii) \emph{free}, as well as a \emph{confined hydrogen atom} (CHA) enclosed in a spherical impenetrable cavity (iii) a many-electron atom under similar enclosed environment.
Submitted 5 January, 2024;
originally announced January 2024.
-
Extracting the Global 21-cm signal from Cosmic Dawn and Epoch of Reionization in the presence of Foreground and Ionosphere
Authors:
Anshuman Tripathi,
Abhirup Datta,
Madhurima Choudhury,
Suman Majumdar
Abstract:
Detection of redshifted \ion{H}{i} 21-cm emission is a potential probe for investigating the Universe's first billion years. However, given the significantly brighter foreground, detecting the 21-cm signal is observationally difficult. The Earth's ionosphere considerably distorts the signal at low frequencies by introducing direction-dependent effects. Here, for the first time, we report the use of Artificial Neural Networks (ANNs) to extract the global 21cm signal characteristics from the composite all-sky averaged signal, including foreground and ionospheric effects such as refraction, absorption, and thermal emission from the ionosphere's F and D-layers. We assume a 'perfect' instrument and neglect instrumental calibration and beam effects. To model the ionospheric effect, we considered the static and time-varying ionospheric conditions for the mid-latitude region where LOFAR is situated. In this work, we trained the ANN model for various situations using a synthetic set of the global 21cm signals created by altering its parameter space based on the "$\rm \tanh$" parameterized model and the Accelerated Reionization Era Simulations (ARES) algorithm. The obtained result shows that the ANN model can extract the global signal parameters with an accuracy of $\ge 96 \% $ in the final study when we include foreground and ionospheric effects. On the other hand, a similar ANN model can extract the signal parameters from the final prediction dataset with an accuracy ranging from $97 \%$ to $98 \%$ when considering more realistic sets of the global 21cm signals based on physical models.
Submitted 28 January, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
An operational approach to classifying measurement incompatibility
Authors:
Arun Kumar Das,
Saheli Mukherjee,
Debashis Saha,
Debarshi Das,
A. S. Majumdar
Abstract:
Measurement incompatibility has proved to be an important resource for information-processing tasks. In this work, we analyze various levels of incompatibility of measurement sets. We provide operational classification of measurement incompatibility with respect to two elementary classical operations, viz., coarse-graining of measurement outcomes and convex mixing of different measurements. We derive analytical criteria for determining when a set of projective measurements is fully incompatible with respect to coarse-graining or convex mixing. Robustness against white noise is investigated for mutually unbiased bases that can sustain full incompatibility. Furthermore, we propose operational witnesses for different levels of incompatibility subject to classical operations, using the input-output statistics of Bell-type experiments as well as experiments in the prepare-and-measure scenario.
Submitted 2 January, 2024;
originally announced January 2024.
-
Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition
Authors:
Vahid Noroozi,
Somshubra Majumdar,
Ankur Kumar,
Jagadeesh Balam,
Boris Ginsburg
Abstract:
In this paper, we propose an efficient and accurate streaming speech recognition model based on the FastConformer architecture. We adapted the FastConformer architecture for streaming applications through: (1) constraining both the look-ahead and past contexts in the encoder, and (2) introducing an activation caching mechanism to enable the non-autoregressive encoder to operate autoregressively during inference. The proposed model is designed to eliminate the accuracy disparity between training and inference time, which is common for many streaming models. Furthermore, our proposed encoder works with various decoder configurations including Connectionist Temporal Classification (CTC) and RNN-Transducer (RNNT) decoders. Additionally, we introduced a hybrid CTC/RNNT architecture which utilizes a shared encoder with both a CTC and RNNT decoder to boost the accuracy and save computation. We evaluate the proposed model on the LibriSpeech dataset and a multi-domain large-scale dataset and demonstrate that it can achieve better accuracy with lower latency and inference time compared to a conventional buffered streaming model baseline. We also showed that training a model with multiple latencies can achieve better accuracy than single-latency models while enabling us to support multiple latencies with a single model. Our experiments also showed that the hybrid architecture not only speeds up the convergence of the CTC decoder but also improves the accuracy of streaming models compared to single-decoder models.
Submitted 2 May, 2024; v1 submitted 27 December, 2023;
originally announced December 2023.
-
Active particle in one dimension subjected to resetting with memory
Authors:
Denis Boyer,
Satya N. Majumdar
Abstract:
The study of diffusion with preferential returns to places visited in the past has attracted increased attention in recent years. In these highly non-Markov processes, a standard diffusive particle intermittently resets at a given rate to previously visited positions. At each reset, a position to be revisited is randomly chosen with a probability proportional to the accumulated amount of time spent by the particle at that position. These preferential revisits typically generate a very slow diffusion, logarithmic in time, but still with a Gaussian position distribution at late times. Here we consider an active version of this model, where between resets the particle is self-propelled with constant speed and switches direction in one dimension according to a telegraphic noise. Hence there are two sources of non-Markovianity in the problem. We exactly derive the position distribution in Fourier space, as well as the variance of the position at all times. The crossover from the short-time ballistic regime, dominated by activity, to the large-time anomalous logarithmic growth induced by memory is studied. We also analytically derive a large deviation principle for the position, which exhibits a logarithmic time-scaling instead of the usual algebraic form. Interestingly, at large distances, the large deviations become independent of time and match the non-equilibrium steady state of a particle under resetting to its starting position only.
Submitted 6 May, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
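The dynamics described in the abstract above (constant-speed motion with telegraphic direction switching, interrupted by resets to previously visited positions weighted by the time spent there) can be illustrated with a minimal discrete-time sketch. This is not the authors' code: the function name, the parameter names `v0`, `gamma`, `r`, `dt`, and the time discretization are all assumptions made for illustration. The sketch relies on the fact that picking a uniformly random past time step and returning to the position held at that step weights each position by the time spent there.

```python
import numpy as np

def simulate_active_memory_reset(n_steps, dt=0.01, v0=1.0, gamma=1.0, r=0.5, seed=0):
    """Hypothetical sketch: 1D active particle with memory-based resetting.

    v0    : self-propulsion speed (assumed name)
    gamma : direction-switching rate of the telegraphic noise (assumed name)
    r     : resetting rate (assumed name)
    Returns the array of positions x[0], ..., x[n_steps].
    """
    rng = np.random.default_rng(seed)
    x = np.zeros(n_steps + 1)
    sigma = 1.0 if rng.random() < 0.5 else -1.0  # random initial direction
    for k in range(n_steps):
        if rng.random() < r * dt:
            # Reset: a uniformly chosen past time step selects a position
            # with probability proportional to the time spent there.
            j = rng.integers(0, k + 1)
            x[k + 1] = x[j]
        else:
            # Telegraphic noise: flip direction with probability gamma * dt.
            if rng.random() < gamma * dt:
                sigma = -sigma
            x[k + 1] = x[k] + v0 * sigma * dt
    return x
```

Averaging `x[-1] ** 2` over many seeds would give a numerical estimate of the variance at the final time, which could then be compared against the ballistic and logarithmic regimes mentioned in the abstract.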
-
Antiferromagnetic order enhanced by local dissipation
Authors:
Oscar Bouverot-Dupuis,
Saptarshi Majumdar,
Alberto Rosso,
Laura Foini
Abstract:
We study an XXZ spin chain at zero magnetization coupled to a collection of local harmonic baths at zero temperature. We map this system onto a (1+1)D effective field theory using bosonization, where the effect of the bath is treated exactly. We provide analytical and numerical evidence of the existence of two phases at zero temperature: a Luttinger liquid (LL) and an antiferromagnetic phase (AFM), separated by a phase transition akin to the Berezinsky--Kosterlitz--Thouless (BKT) type. While the bath is responsible for the LL-AFM transition for subohmic baths, the LL-AFM transition for superohmic baths is due to the interactions within the spin chain.
Submitted 12 May, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Device-Independent Quantum Secure Direct Communication Under Non-Markovian Quantum Channels
Authors:
Pritam Roy,
Subhankar Bera,
Shashank Gupta,
A. S. Majumdar
Abstract:
Device-independent quantum secure direct communication (DI-QSDC) is a promising primitive in quantum cryptography aimed towards addressing the problems of device imperfections and key management. However, significant effort is required to tackle practical challenges such as the distance limitation due to the decohering effects of quantum channels. Here, we explore the constructive effect of non-Markovian noise to improve the performance of DI-QSDC. Considering two different environmental dynamics modelled by the amplitude damping and the dephasing channels, we show that for both cases non-Markovianity leads to a considerable improvement over Markovian dynamics in terms of three benchmark performance criteria of the DI-QSDC task. Specifically, we find that non-Markovian noise (i) enhances the protocol security measured by Bell violation, (ii) leads to a lower quantum bit error rate, and (iii) enables larger communication distances by increasing the capacity of secret communication.
Submitted 6 May, 2024; v1 submitted 5 December, 2023;
originally announced December 2023.
-
Dynamically emergent correlations between particles in a switching harmonic trap
Authors:
Marco Biroli,
Manas Kulkarni,
Satya N. Majumdar,
Gregory Schehr
Abstract:
We study a one dimensional gas of $N$ noninteracting diffusing particles in a harmonic trap, whose stiffness switches between two values $μ_1$ and $μ_2$ with constant rates $r_1$ and $r_2$ respectively. Despite the absence of direct interaction between the particles, we show that strong correlations between them emerge in the stationary state at long times, induced purely by the dynamics itself. We compute exactly the joint distribution of the positions of the particles in the stationary state, which allows us to compute several physical observables analytically. In particular, we show that the extreme value statistics (EVS), i.e., the distribution of the position of the rightmost particle has a nontrivial shape in the large $N$ limit. The scaling function characterizing this EVS has a finite support with a tunable shape (by varying the parameters). Remarkably, this scaling function turns out to be universal. First, it also describes the distribution of the position of the $k$-th rightmost particle in a $1d$ trap. Moreover, the distribution of the position of the particle farthest from the center of the harmonic trap in $d$ dimensions is also described by the same scaling function for all $d \geq 1$. Numerical simulations are in excellent agreement with our analytical predictions.
Submitted 5 December, 2023;
originally announced December 2023.
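The mechanism in the abstract above, where independent diffusing particles become correlated because they all feel the same switching stiffness, can be illustrated with a short Euler-Maruyama sketch. This is not the authors' code: the function name, the diffusion constant `D`, the time step `dt`, and the reading of `rates[0]` as the rate of leaving the first stiffness value are all assumptions made for illustration.

```python
import numpy as np

def simulate_switching_trap(n_particles, n_steps, dt=1e-3,
                            mu=(1.0, 4.0), rates=(1.0, 1.0), D=1.0, seed=0):
    """Hypothetical sketch: N noninteracting particles in a shared switching trap.

    mu    : the two stiffness values (mu_1, mu_2)
    rates : switching rates; rates[i] is assumed to be the rate of leaving mu[i]
    D     : diffusion constant (assumed name and value)
    Returns the particle positions after n_steps time steps.
    """
    rng = np.random.default_rng(seed)
    x = np.zeros(n_particles)
    state = 0  # index of the current stiffness value
    for _ in range(n_steps):
        # The stiffness is shared by all particles; this common noise is
        # the only source of correlations between them.
        if rng.random() < rates[state] * dt:
            state = 1 - state
        noise = np.sqrt(2.0 * D * dt) * rng.standard_normal(n_particles)
        x += -mu[state] * x * dt + noise
    return x
```

Running this for many seeds and recording `x.max()` would give an empirical histogram of the rightmost-particle position, the extreme value statistic discussed in the abstract.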
-
Occupation time of a system of Brownian particles on the line with steplike initial condition
Authors:
Ivan N. Burenev,
Satya N. Majumdar,
Alberto Rosso
Abstract:
We consider a system of non-interacting Brownian particles on the line with steplike initial condition and study the statistics of the occupation time on the positive half-line. We demonstrate that this system exhibits long-lasting memory effects of the initialization. Specifically, we calculate the mean and the variance of the occupation time, demonstrating that the memory effects in the variance are determined by a generalized compressibility (or Fano factor), associated with the initial condition. In the particular case of the uncorrelated uniform initial condition we conduct a detailed study of two probability distributions of the occupation time: annealed (averaged over all possible initial configurations) and quenched (for a typical configuration). We show that at large times both the annealed and the quenched distributions admit a large deviation form and we compute analytically the associated rate functions. We verify our analytical predictions via numerical simulations using an importance sampling Monte Carlo strategy.
Submitted 30 April, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
Impact of astrophysical scatter on the Epoch of Reionization [H I]$_{\rm 21cm}$ bispectrum
Authors:
Chandra Shekhar Murmu,
Kanan K. Datta,
Suman Majumdar,
Thomas R. Greve
Abstract:
It is believed that the first star-forming galaxies are the main drivers of cosmic reionization. It is usually assumed that there is a one-to-one relationship between the star formation rate (SFR) inside a galaxy and the host halo mass in semi-analytical/numerical modeling of large-scale ionization maps during the epoch of reionization. However, more accurate simulations and observations suggest that the SFR and ionizing luminosity in galaxies may vary considerably even if the host halo mass is the same. This astrophysical scatter can introduce an additional non-Gaussianity in the [H I]$_{21\rm cm}$ signal, which might not be captured adequately in the power spectrum. In this work, we have studied the impact of the scatter on the [H I]$_{21\rm cm}$ bispectrum using semi-numerical simulations. We find that the scatter primarily affects small ionized regions, whereas the large ionized bubbles remain largely unaffected. Although the fractional change in the [H I]$_{21\rm cm}$ bispectra due to the scatter is found to be more than a factor of $10$ at large scales ($\lesssim 1\, {\rm Mpc}^{-1}$), it is found to be statistically insignificant. However, at small scales ($k\sim2.55$ Mpc$^{-1}$), we have found the impact due to the scatter to be high in magnitude ($|\langle ΔB \rangle/B_{\text{no-scatter}}| \sim 1$) and statistically significant ($|\langleΔB\rangle/σ_{ΔB}| \gtrsim 5$) at neutral fraction, $\overline{x}_{\rm HI}\sim 0.8$. We have also found that in the most optimistic scenario, SKA1-Low might be able to detect these signatures of astrophysical scatter, at $\sim 3σ$ and $\sim 5σ$ detection significance for $\overline{x}_{\rm HI} \sim$ 0.8 and 0.9 respectively, for the equilateral [H I]$_{21\rm cm}$ bispectrum.
Submitted 10 April, 2024; v1 submitted 28 November, 2023;
originally announced November 2023.
-
Radiative Asymptotic Symmetries of 3D Einstein-Maxwell Theory
Authors:
Jorrit Bosma,
Marc Geiller,
Sucheta Majumdar,
Blagoje Oblak
Abstract:
We study the null asymptotic structure of Einstein-Maxwell theory in three-dimensional (3D) spacetimes. Although devoid of bulk gravitational degrees of freedom, the system admits a massless photon and can therefore accommodate electromagnetic radiation. We derive fall-off conditions for the Maxwell field that contain both Coulombic and radiative modes with non-vanishing news. The latter produces non-integrability and fluxes in the asymptotic surface charges, and gives rise to a non-trivial 3D Bondi mass loss formula. The resulting solution space is thus analogous to a dimensional reduction of 4D pure gravity, with the role of gravitational radiation played by its electromagnetic cousin. We use this simplified setup to investigate choices of charge brackets in detail, and compute in particular the recently introduced Koszul bracket. When the latter is applied to Wald-Zoupas charges, which are conserved in the absence of news, it leads to the field-dependent central extension found earlier in [arXiv:1503.00856]. We also consider (Anti-)de Sitter asymptotics to further exhibit the analogy between this model and 4D gravity with leaky boundary conditions.
Submitted 5 March, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Optimal mean first-passage time of a run-and-tumble particle in a class of one-dimensional confining potentials
Authors:
Mathis Guéneau,
Satya N. Majumdar,
Gregory Schehr
Abstract:
We consider a run-and-tumble particle (RTP) in one dimension, subjected to a telegraphic noise with a constant rate $γ$, and in the presence of an external confining potential $V(x) = α|x|^p$ with $p \geq 1$. We compute the mean first-passage time (MFPT) at the origin $τ_γ(x_0)$ for an RTP starting at $x_0$. We obtain a closed form expression for $τ_γ(x_0)$ for all $p \geq 1$, which becomes fully explicit in the case $p=1$, $p=2$ and in the limit $p \to \infty$. For generic $p>1$ we find that there exists an optimal rate $γ_{\rm opt}$ that minimizes the MFPT and we characterize in detail its dependence on $x_0$. We find that $γ_{\rm opt} \propto 1/x_0$ as $x_0 \to 0$, while $γ_{\rm opt}$ converges to a nontrivial constant as $x_0 \to \infty$. In contrast, for $p=1$, there is no finite optimum and $γ_{\rm opt} \to \infty$ in this case. These analytical results are confirmed by our numerical simulations.
Submitted 19 January, 2024; v1 submitted 12 November, 2023;
originally announced November 2023.
-
Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023
Authors:
Srijoni Majumdar,
Soumen Paul,
Debjyoti Paul,
Ayan Bandyopadhyay,
Samiran Chattopadhyay,
Partha Pratim Das,
Paul D Clough,
Prasenjit Majumder
Abstract:
The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework based on human and large language model generated labels. In this track, there is a binary classification task to classify comments as useful or not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs extracted from open-source C-based GitHub projects and an additional dataset generated individually by teams using large language models. Overall, 56 experiments have been submitted by 17 teams from various universities and software companies. The submissions have been evaluated quantitatively using the F1-Score and qualitatively based on the type of features developed, the supervised learning model used and their corresponding hyper-parameters. The labels generated from large language models increase the bias in the prediction model but lead to less over-fitted results.
Submitted 27 October, 2023;
originally announced November 2023.
-
Beyond Best-Fits and Model Selection -- Introducing "Reliability" of cusp-core inference of dark matter halos
Authors:
Manush Manju,
Subhabrata Majumdar
Abstract:
We introduce the notion of a Bayesian analysis motivated `reliability' that gives a truer distinction of cusp-core and other halo-parameters (like mass-concentration) in an ensemble of observed galaxies. Our approach goes beyond the standard statistical techniques of parameter estimation and model fitting. We create hundreds of thousands of realistic mock SPARC RCs, with both cuspy and cored DM density profiles as model inputs. These RCs carefully incorporate the details of SPARC data such as the nature of observed uncertainties and different sources of scatter arising from observation, presence of baryons, DM mass-concentration, etc. Bayesian analysis of these mock RCs enables us to reconstruct and identify the parameter space in galaxy observable and theory where one can venture beyond best-fits to a preferred DM halo model or model selections between different density models. We find that it is imperative to choose low stellar surface density ($Σ_{\star}$) galaxies for reliable cusp-vs-core distinction; for example, RC data for galaxies with $Σ_{\star} \leq 2.5$ is needed for a 75\% confidence in distinguishing cusps from cores. Similarly, we also find that for correct estimations of the halo masses and concentrations, the RCs need to be measured to at least a radial distance $\geq 0.8r_s$ where $r_s$ is the scale radius of the corresponding DM halo density profile. Out of the total $\sim$ 135 SPARC galaxies, using our reliability criteria, we find that only 21 RCs clear the bar to be used for any unbiased cusp-core distinction as well as DM halo mass-concentration estimates at $\geq$75\% reliability confidence level. With $\geq$66\% ( $\geq$50\%) reliability settings, the sample size increases to 44 (59). Interestingly, in the $\geq 75$\% reliable subsample, there are 5 times more galaxies that are reliably cored than cuspy. [Abridged]
Submitted 31 October, 2023;
originally announced October 2023.
-
Analyzing the 21-cm signal brightness temperature in the Universe with inhomogeneities
Authors:
Shashank Shekhar Pandey,
Ashadul Halder,
A. S. Majumdar
Abstract:
We explore the 21-cm signal in our Universe containing inhomogeneous matter distribution at considerably large scales. Employing Buchert's averaging procedure in the context of a model of spacetime with multiple inhomogeneous domains, we evaluate the effect of our model parameters on the observable 21-cm signal brightness temperature. Our model parameters are constrained through the Markov Chain Monte Carlo method using the Union 2.1 supernova Ia observational data. We find that a significant dip in the brightness temperature compared to the $Λ$CDM prediction could arise as an effect of the inhomogeneities present in the Universe.
Submitted 28 October, 2023;
originally announced October 2023.
-
Technical Note: Feasibility of translating 3.0T-trained Deep-Learning Segmentation Models Out-of-the-Box on Low-Field MRI 0.55T Knee-MRI of Healthy Controls
Authors:
Rupsa Bhattacharjee,
Zehra Akkaya,
Johanna Luitjens,
Pan Su,
Yang Yang,
Valentina Pedoia,
Sharmila Majumdar
Abstract:
In the current study, our purpose is to evaluate the feasibility of applying deep learning (DL) enabled algorithms to quantify bilateral knee biomarkers in healthy controls scanned at 0.55T and compared with 3.0T. The current study assesses the performance of standard in-practice bone and cartilage segmentation algorithms at 0.55T, both qualitatively and quantitatively, in terms of comparing segmentation performance, areas of improvement, and compartment-wise cartilage thickness values between 0.55T and 3.0T. Initial results demonstrate a usable to good technical feasibility of translating existing quantitative deep-learning-based image segmentation techniques, trained on 3.0T, out-of-the-box to 0.55T for knee MRI, in a multi-vendor acquisition environment. Especially in terms of segmenting cartilage compartments, the models perform almost equivalently to 3.0T in terms of Likert ranking. The 0.55T low-field, sustainable, and easy-to-install MRI, as demonstrated, can thus be utilized for evaluating knee cartilage thickness and bone segmentations aided by established DL algorithms trained at higher field strengths, out-of-the-box initially. This could be utilized at widely dispersed point-of-care locations that lack radiologists available to manually segment low-field images, at least until a decent pool of low-field data is collated. With further fine-tuning using manual labeling of low-field data or synthesized higher-SNR images from low-field images, OA biomarker quantification performance can potentially be further improved.
Submitted 26 October, 2023;
originally announced October 2023.
-
Linear statistics for Coulomb gases: higher order cumulants
Authors:
Benjamin De Bruyne,
Pierre Le Doussal,
Satya N. Majumdar,
Gregory Schehr
Abstract:
We consider $N$ classical particles interacting via the Coulomb potential in spatial dimension $d$ and in the presence of an external trap, at equilibrium at inverse temperature $β$. In the large $N$ limit, the particles are confined within a droplet of finite size. We study smooth linear statistics, i.e. the fluctuations of sums of the form ${\cal L}_N = \sum_{i=1}^N f({\bf x}_i)$, where ${\bf x}_i$'s are the positions of the particles and where $f({\bf x}_i)$ is a sufficiently regular function. There exist at present standard results for the first and second moments of ${\cal L}_N$ in the large $N$ limit, as well as associated Central Limit Theorems in general dimension and for a wide class of confining potentials. Here we obtain explicit expressions for the higher order cumulants of ${\cal L}_N$ at large $N$, when the function $f({\bf x})=f(|{\bf x}|)$ and the confining potential are both rotationally invariant. A remarkable feature of our results is that these higher cumulants depend only on the value of $f'(|{\bf x}|)$ and its higher order derivatives evaluated exactly at the boundary of the droplet, which in this case is a $d$-dimensional sphere. In the particular two-dimensional case $d=2$ at the special value $β=2$, a connection to the Ginibre ensemble allows us to derive these results in an alternative way using the tools of determinantal point processes. Finally we also obtain the large deviation form of the full probability distribution function of ${\cal L}_N$.
Submitted 25 October, 2023;
originally announced October 2023.
-
The monopole and quadrupole moments of the Epoch of Reionization (EoR) 21-cm bispectrum
Authors:
Sukhdeep Singh Gill,
Suman Pramanick,
Somnath Bharadwaj,
Abinash Kumar Shaw,
Suman Majumdar
Abstract:
We study the monopole ($\bar{B}^0_0$) and quadrupole ($\bar{B}^0_2$) moments of the 21-cm bispectrum (BS) from EoR simulations and present results for squeezed and stretched triangles. Both $\bar{B}^0_0$ and $\bar{B}^0_2$ are positive at the early stage of EoR where the mean neutral hydrogen (HI) density fraction $\bar{x}_{\rm HI} \approx 0.99$. The subsequent evolution of $\bar{B}^0_0$ and $\bar{B}^0_2$ at large and intermediate scales $(k=0.29$ and $0.56 \, {\rm Mpc}^{-1}$ respectively) is punctuated by two sign changes which mark transitions in the HI distribution. The first sign flip where $\bar{B}^0_0$ becomes negative occurs in the intermediate stages of EoR $(\bar{x}_{\rm HI} > 0.5)$, at large scale first followed by the intermediate scale. This marks the emergence of distinct ionized bubbles in the neutral background. $\bar{B}^0_2$ is relatively less affected by this transition, and it mostly remains positive even when $\bar{B}^0_0$ becomes negative. The second sign flip, which affects both $\bar{B}^0_0$ and $\bar{B}^0_2$, occurs at the late stage of EoR $(\bar{x}_{\rm HI} < 0.5)$. This marks a transition in the topology of the HI distribution, after which we have distinct HI islands in an ionized background. This causes $\bar{B}^0_0$ to become positive. The negative $\bar{B}^0_2$ is a definite indication that the HI islands survive only in under-dense regions.
Submitted 24 October, 2023;
originally announced October 2023.