-
RG Interfaces from Double-Trace Deformations
Authors:
Simone Giombi,
Elizabeth Helfenberger,
Himanshu Khanchandani
Abstract:
We study a class of interface conformal field theories obtained by taking a large $N$ CFT and turning on a relevant double-trace deformation over half space. At low energies, this leads to a conformal interface separating two CFTs which are related by RG flow. We set up the large $N$ expansion of these models by employing a Hubbard-Stratonovich transformation over half space, and use this approach…
▽ More
We study a class of interface conformal field theories obtained by taking a large $N$ CFT and turning on a relevant double-trace deformation over half space. At low energies, this leads to a conformal interface separating two CFTs which are related by RG flow. We set up the large $N$ expansion of these models by employing a Hubbard-Stratonovich transformation over half space, and use this approach to compute some of the defect CFT data. We also calculate the free energy of the theory in the case of spherical interface, which encodes a conformal anomaly coefficient for even dimensional interface, and the analog of the $g$-function for odd-dimensional interface. These models have a dual description in terms of a gravitational theory in AdS where a bulk scalar field satisfies different boundary conditions on each half of the AdS boundary. We review this construction and show that the results of the large $N$ expansion on the CFT side are in precise agreement with the holographic predictions.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Perfect Matching Complexes of Polygonal Line Tiling
Authors:
Himanshu Chandrakar,
Anurag Singh
Abstract:
The perfect matching complex of a simple graph G is a simplicial complex having facets (maximal faces) as the perfect matchings of G. This article discusses the perfect matching complex of polygonal line tiling and the $\left(2 \times n\right)$-grid graph in particular. We use tools from discrete Morse theory to show that the perfect matching complex of any polygonal line tiling is either contract…
▽ More
The perfect matching complex of a simple graph G is a simplicial complex having facets (maximal faces) as the perfect matchings of G. This article discusses the perfect matching complex of polygonal line tiling and the $\left(2 \times n\right)$-grid graph in particular. We use tools from discrete Morse theory to show that the perfect matching complex of any polygonal line tiling is either contractible or homotopically equivalent to a wedge of spheres. While proving our results, we also characterise all the matchings that can not be extended to form a perfect matching.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Cosmological tests of the dark energy models in Finsler-Randers Space-time
Authors:
Z. Nekouee,
Himanshu Chaudhary,
S. K. Narasimhamurthy,
S. K. J. Pacif,
Manjunath Malligawad
Abstract:
The Finsler-Randers space-time offers a novel perspective on cosmic dynamics, departing from the constraints of General Relativity. This paper thoroughly investigates two dark energy models resulting from the parametrization of $H$ within this geometric framework. We have conducted some geometrical and physical analysis of the dark energy models in Finslerian geometry. First, We have derived the f…
▽ More
The Finsler-Randers space-time offers a novel perspective on cosmic dynamics, departing from the constraints of General Relativity. This paper thoroughly investigates two dark energy models resulting from the parametrization of $H$ within this geometric framework. We have conducted some geometrical and physical analysis of the dark energy models in Finslerian geometry. First, We have derived the field equations governing the universe's evolution within the Finsler-Randers formalism, incorporating the presence of dark energy. Through this, we explore its implications on cosmological phenomena, including cosmic expansion, late-time behavior of the universe, cosmological phase transition, and a few more. Also, we employ observational data such as Cosmic Chronometer, Supernovae, Gamma-Ray Bursts, Quasar, and baryon acoustic oscillations to constrain the parameters associated with dark energy in the Finsler-Randers universe. Comparing theoretical predictions with empirical observations, we assess the model viability and discern any deviations from the standard $Λ$CDM cosmology. Our findings offer intriguing insights into the nature of dark energy within this alternative gravitational framework, providing a deeper understanding of its role in sha** cosmic evolution. The implications of our results extend to fundamental cosmology, hinting at new avenues for research to unravel the mysteries surrounding dark energy and the geometric structure of the universe within non-standard gravitational theories.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Efficient Document Ranking with Learnable Late Interactions
Authors:
Ziwei Ji,
Himanshu Jain,
Andreas Veit,
Sashank J. Reddi,
Sadeep Jayasumana,
Ankit Singh Rawat,
Aditya Krishna Menon,
Felix Yu,
Sanjiv Kumar
Abstract:
Cross-Encoder (CE) and Dual-Encoder (DE) models are two fundamental approaches for query-document relevance in information retrieval. To predict relevance, CE models use joint query-document embeddings, while DE models maintain factorized query and document embeddings; usually, the former has higher quality while the latter benefits from lower latency. Recently, late-interaction models have been p…
▽ More
Cross-Encoder (CE) and Dual-Encoder (DE) models are two fundamental approaches for query-document relevance in information retrieval. To predict relevance, CE models use joint query-document embeddings, while DE models maintain factorized query and document embeddings; usually, the former has higher quality while the latter benefits from lower latency. Recently, late-interaction models have been proposed to realize more favorable latency-quality tradeoffs, by using a DE structure followed by a lightweight scorer based on query and document token embeddings. However, these lightweight scorers are often hand-crafted, and there is no understanding of their approximation power; further, such scorers require access to individual document token embeddings, which imposes an increased latency and storage burden. In this paper, we propose novel learnable late-interaction models (LITE) that resolve these issues. Theoretically, we prove that LITE is a universal approximator of continuous scoring functions, even for relatively small embedding dimension. Empirically, LITE outperforms previous late-interaction models such as ColBERT on both in-domain and zero-shot re-ranking tasks. For instance, experiments on MS MARCO passage re-ranking show that LITE not only yields a model with better generalization, but also lowers latency and requires 0.25x storage compared to ColBERT.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Investigating the Robustness of LLMs on Math Word Problems
Authors:
Ujjwala Anantheswaran,
Himanshu Gupta,
Kevin Scaria,
Shreyas Verma,
Chitta Baral,
Swaroop Mishra
Abstract:
Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial MWPs. Our experim…
▽ More
Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial MWPs. Our experiments reveal that LLMs are susceptible to distraction by numerical noise, resulting in an average relative performance drop of ~26% on adversarial MWPs. To mitigate this, we fine-tune LLMs (Llama-2, Mistral) on the adversarial samples from our dataset. Fine-tuning on adversarial training instances improves performance on adversarial MWPs by ~8%, indicating increased robustness to noise and better ability to identify relevant data for reasoning. Finally, to assess the generalizability of our prompting framework, we introduce GSM-8K-Adv, an adversarial variant of the GSM-8K benchmark. LLMs continue to struggle when faced with adversarial information, reducing performance by up to ~6%.
△ Less
Submitted 30 May, 2024;
originally announced June 2024.
-
Dynamics of Cyclic Contractions
Authors:
H. Baranwal,
A. K. B. Chand
Abstract:
Cyclic contractions generalize the usual contractivities in metric spaces and $b$-MSs. In this paper, we enhance several fixed point theorems related to cyclic (i) Banach self-maps, (ii) Chatterjea contractivities, (iii) Kannan self-map**s, (iv) Ćirić and Hardy-Rogers, and (v) Reich contractions including local versions in $b$-metric spaces while also delineating the associated dynamics. Especia…
▽ More
Cyclic contractions generalize the usual contractivities in metric spaces and $b$-MSs. In this paper, we enhance several fixed point theorems related to cyclic (i) Banach self-maps, (ii) Chatterjea contractivities, (iii) Kannan self-map**s, (iv) Ćirić and Hardy-Rogers, and (v) Reich contractions including local versions in $b$-metric spaces while also delineating the associated dynamics. Especially noteworthy is the expansion of the results concerning both fixed and periodic points, which are substantiated across a wider spectrum of ratio values for the aforementioned cyclic contractions within this class of spaces. Additionally, the convergence of Picard iterations towards the fixed point is rigorously established.
△ Less
Submitted 24 June, 2024; v1 submitted 8 April, 2024;
originally announced June 2024.
-
NAC-QFL: Noise Aware Clustered Quantum Federated Learning
Authors:
Himanshu Sahu,
Hari Prabhat Gupta
Abstract:
Recent advancements in quantum computing, alongside successful deployments of quantum communication, hold promises for revolutionizing mobile networks. While Quantum Machine Learning (QML) presents opportunities, it contends with challenges like noise in quantum devices and scalability. Furthermore, the high cost of quantum communication constrains the practical application of QML in real-world sc…
▽ More
Recent advancements in quantum computing, alongside successful deployments of quantum communication, hold promises for revolutionizing mobile networks. While Quantum Machine Learning (QML) presents opportunities, it contends with challenges like noise in quantum devices and scalability. Furthermore, the high cost of quantum communication constrains the practical application of QML in real-world scenarios. This paper introduces a noise-aware clustered quantum federated learning system that addresses noise mitigation, limited quantum device capacity, and high quantum communication costs in distributed QML. It employs noise modelling and clustering to select devices with minimal noise and distribute QML tasks efficiently. Using circuit partitioning to deploy smaller models on low-noise devices and aggregating similar devices, the system enhances distributed QML performance and reduces communication costs. Leveraging circuit cutting, QML techniques are more effective for smaller circuit sizes and fidelity. We conduct experimental evaluations to assess the performance of the proposed system. Additionally, we introduce a noisy dataset for QML to demonstrate the impact of noise on proposed accuracy.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
The influence of dislocations on R-phase transformations in a NiTi shape memory alloy
Authors:
Himanshu Vashishtha,
David M. Collins
Abstract:
The ability to control the stress-induced phase transformation of the shape memory alloy, NiTi, is an important technological challenge that must be understood for their wide application in devices that can exploit their reversible strain properties. This study elucidates the direct relationship between dislocation density and the \textit{R}-phase transformation, including its formation temperatur…
▽ More
The ability to control the stress-induced phase transformation of the shape memory alloy, NiTi, is an important technological challenge that must be understood for their wide application in devices that can exploit their reversible strain properties. This study elucidates the direct relationship between dislocation density and the \textit{R}-phase transformation, including its formation temperature from interrupted annealing of rolled NiTi samples. Deformation is shown to determine the enthalpy change required for the B2$\rightarrow$\textit{R}-phase transformation, with associated transformation temperatures being modifiable via dislocation density and recovery processes. Recovery is shown to be rapid, highly heterogeneous and sensitive to crystal orientation. Grains with a $\langle100\rangle$ direction close to the macroscopic rolling direction recover more rapidly than $\langle110\rangle$ and $\langle111\rangle$ orientated grains. Considered to be governed by processing induced residual stresses and resultant crystallographic dependent annihilation/slip pathways, there are opportunities to tune B2$\rightarrow$\textit{R}-phase transformation on either a grain-averaged or an orientation dependant per-grain basis.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
QCD-Gravity double-copy in the Regge regime: shock wave propagators
Authors:
Himanshu Raj,
Raju Venugopalan
Abstract:
In recent work, we demonstrated a double-copy relation between inclusive gluon radiation in shock wave collisions of ultrarelativistic nuclei and inclusive graviton radiation in trans-Planckian gravitational shock wave collisions. We compute here the corresponding gravitational shock wave propagators in general relativity and demonstrate that they too obey a double copy relation to gluon shock wav…
▽ More
In recent work, we demonstrated a double-copy relation between inclusive gluon radiation in shock wave collisions of ultrarelativistic nuclei and inclusive graviton radiation in trans-Planckian gravitational shock wave collisions. We compute here the corresponding gravitational shock wave propagators in general relativity and demonstrate that they too obey a double copy relation to gluon shock wave propagators computed previously. These results provide key input in a renormalization group approach towards computing the high frequency radiation spectrum in close black hole encounters.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Multi-Objective Control Co-design Using Graph-Based Optimization for Offshore Wind Farm Grid Integration
Authors:
Himanshu Sharma,
Wei Wang,
Bowen Huang,
Thiagarajan Ramachandran,
Veronica Adetola
Abstract:
Offshore wind farms have emerged as a popular renewable energy source that can generate substantial electric power with a low environmental impact. However, integrating these farms into the grid poses significant complexities. To address these issues, optimal-sized energy storage can provide potential solutions and help improve the reliability, efficiency, and flexibility of the grid. Nevertheless…
▽ More
Offshore wind farms have emerged as a popular renewable energy source that can generate substantial electric power with a low environmental impact. However, integrating these farms into the grid poses significant complexities. To address these issues, optimal-sized energy storage can provide potential solutions and help improve the reliability, efficiency, and flexibility of the grid. Nevertheless, limited studies have attempted to perform energy storage sizing while including design and operations (i.e., control co-design) for offshore wind farms. As a result, the present work develops a control co-design optimization formulation to optimize multiple objectives and identify Pareto optimal solutions. The graph-based optimization framework is proposed to address the complexity of the system, allowing the optimization problem to be decomposed for large power systems. The IEEE-9 bus system is treated as an onshore AC grid with two offshore wind farms connected via a multi-terminal DC grid for our use case. The developed methodology successfully identifies the Pareto front during the control co-design optimization, enabling decision-makers to select the best compromise solution for multiple objectives.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
A Comparison of the Performance of the Molecular Dynamics Simulation Package GROMACS Implemented in the SYCL and CUDA Programming Models
Authors:
L. Apanasevich,
Yogesh Kale,
Himanshu Sharma,
Ana Marija Sokovic
Abstract:
For many years, systems running Nvidia-based GPU architectures have dominated the heterogeneous supercomputer landscape. However, recently GPU chipsets manufactured by Intel and AMD have cut into this market and can now be found in some of the worlds fastest supercomputers. The June 2023 edition of the TOP500 list of supercomputers ranks the Frontier supercomputer at the Oak Ridge National Laborat…
▽ More
For many years, systems running Nvidia-based GPU architectures have dominated the heterogeneous supercomputer landscape. However, recently GPU chipsets manufactured by Intel and AMD have cut into this market and can now be found in some of the worlds fastest supercomputers. The June 2023 edition of the TOP500 list of supercomputers ranks the Frontier supercomputer at the Oak Ridge National Laboratory in Tennessee as the top system in the world. This system features AMD Instinct 250 X GPUs and is currently the only true exascale computer in the world.The first framework that enabled support for heterogeneous platforms across multiple hardware vendors was OpenCL, in 2009. Since then a number of frameworks have been developed to support vendor agnostic heterogeneous environments including OpenMP, OpenCL, Kokkos, and SYCL. SYCL, which combines the concepts of OpenCL with the flexibility of single-source C++, is one of the more promising programming models for heterogeneous computing devices. One key advantage of this framework is that it provides a higher-level programming interface that abstracts away many of the hardware details than the other frameworks. This makes SYCL easier to learn and to maintain across multiple architectures and vendors. In n recent years, there has been growing interest in using heterogeneous computing architectures to accelerate molecular dynamics simulations. Some of the more popular molecular dynamics simulations include Amber, NAMD, and Gromacs. However, to the best of our knowledge, only Gromacs has been successfully ported to SYCL to date. In this paper, we compare the performance of GROMACS compiled using the SYCL and CUDA frameworks for a variety of standard GROMACS benchmarks. In addition, we compare its performance across three different Nvidia GPU chipsets, P100, V100, and A100.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
A micromechanical study of heat treatment induced hardening in α-brass
Authors:
Jonathan Birch,
Emily Jenkins,
Anastasia Vrettou,
Mohammed Said,
Himanshu Vashishtha,
Thomas Connolley,
Jeff Brooks,
David M. Collins
Abstract:
The mechanisms that govern a previously unexplained hardening effect of a single phase Cu-30wt%Zn α-brass after heating have been investigated. After cold-work, the alloy possesses an increased yield strength and hardening rate only when heat treated to temperatures close to 220{^\circ}C, and is otherwise softer. Crystallographic texture and microstructure, explored using electron backscatter diff…
▽ More
The mechanisms that govern a previously unexplained hardening effect of a single phase Cu-30wt%Zn α-brass after heating have been investigated. After cold-work, the alloy possesses an increased yield strength and hardening rate only when heat treated to temperatures close to 220{^\circ}C, and is otherwise softer. Crystallographic texture and microstructure, explored using electron backscatter diffraction (EBSD), describe the deformation heterogeneity including twin development, as a function of heat treatment. When heated, an increased area fraction of deformation twins is observed, with dimensions reaching a critical size that maximises the resistance to dislocation slip in the parent grains. The effect is shown to dominate over other alloy characteristics including short range order, giving serrated yielding during tensile testing which is mostly eliminated after heating. In-situ X-ray diffraction during tensile testing corroborates these findings; dislocation-related line broadening and lattice strain development between as worked and heated α-brass is directly related to the interaction between the dislocations and the population of deformation twins. The experiments unambiguously disprove that other thermally-induced microstructure features contribute to thermal hardening. Specifically, the presence of recrystallised grains or second phases do not play a role. As these heat treatments match annealing conditions subjected to α-brass during deformation-related manufacturing processes, the results here are considered critical to understand, predict and exploit, where appropriate, any beneficial process-induced structural behaviour.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Shadow and strong gravitational lensing of new wormhole solutions supported by embedding Class-I condition
Authors:
Niyaz Uddin Molla,
Himanshu Chaudhary,
Ujjal Debnath,
G. Mustafa,
S. K. Maurya
Abstract:
This study deals with the new class of embedded wormhole solutions in the background of general relativity. Two newly calculated wormhole solutions satisfy all the required properties. All the energy conditions are discussed through their validity regions for the different ranges of involved parameters. In maximum regions, all energy conditions are violated. We investigate the shadow and strong gr…
▽ More
This study deals with the new class of embedded wormhole solutions in the background of general relativity. Two newly calculated wormhole solutions satisfy all the required properties. All the energy conditions are discussed through their validity regions for the different ranges of involved parameters. In maximum regions, all energy conditions are violated. We investigate the shadow and strong gravitational lensing by the wormhole throat for the two new wormhole models, namely Model-I and Model-II. The present paper considers the wormhole throat to act as a photon sphere. We first derive null geodesics using the Hamilton-Jacobi separation method to investigate the shadow and strong gravitational lensing caused by the wormhole throat. We then numerically obtain the radius of wormhole shadow, strong deflection angle, and various lensing observables by taking the example of supermassive black M87* and Sgr A* in the context of both Model-I and Model-II. Kee** all other parameters fixed, it is observed that the parameters $ζ_1$ and $ζ_2$ for Model-I; and $χ_1$ and $χ_2$ for Model-II have significant effects on the wormhole shadow and strong gravitational lensing phenomena. Our conclusion is that it is possible to detect relativistic images, such as Einstein rings, produced by wormholes with throat radii of $r_{th}=3M$. Additionally, current technology enables us to test hypotheses related to astrophysical wormholes.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Novel Optimized Designs of Modulo $2n+1$ Adder for Quantum Computing
Authors:
Bhaskar Gaur,
Himanshu Thapliyal
Abstract:
Quantum modular adders are one of the most fundamental yet versatile quantum computation operations. They help implement functions of higher complexity, such as subtraction and multiplication, which are used in applications such as quantum cryptanalysis, quantum image processing, and securing communication. To the best of our knowledge, there is no existing design of quantum modulo $(2n+1)$ adder.…
▽ More
Quantum modular adders are one of the most fundamental yet versatile quantum computation operations. They help implement functions of higher complexity, such as subtraction and multiplication, which are used in applications such as quantum cryptanalysis, quantum image processing, and securing communication. To the best of our knowledge, there is no existing design of quantum modulo $(2n+1)$ adder. In this work, we propose four quantum adders targeted specifically for modulo $(2n+1)$ addition. These adders can provide both regular and modulo $(2n+1)$ sum concurrently, enhancing their application in residue number system based arithmetic. Our first design, QMA1, is a novel quantum modulo $(2n+1)$ adder. The second proposed adder, QMA2, optimizes the utilization of quantum gates within the QMA1, resulting in 37.5% reduced CNOT gate count, 46.15% reduced CNOT depth, and 26.5% decrease in both Toffoli gates and depth. We propose a third adder QMA3 that uses zero resets, a dynamic circuits based feature that reuses qubits, leading to 25% savings in qubit count. Our fourth design, QMA4, demonstrates the benefit of incorporating additional zero resets to achieve a purer zero state, reducing quantum state preparation errors. Notably, we conducted experiments using 5-qubit configurations of the proposed modulo $(2n+1)$ adders on the IBM Washington, a 127-qubit quantum computer based on the Eagle R1 architecture, to demonstrate a 28.8% reduction in QMA1's error of which: (i) 18.63% error reduction happens due to gate and depth reduction in QMA2, and (ii) 2.53% drop in error due to qubit reduction in QMA3, and (iii) 7.64% error decreased due to application of additional zero resets in QMA4.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
A Human-in-the-Loop Approach to Improving Cross-Text Prosody Transfer
Authors:
Himanshu Maurya,
Atli Sigurgeirsson
Abstract:
Text-To-Speech (TTS) prosody transfer models can generate varied prosodic renditions, for the same text, by conditioning on a reference utterance. These models are trained with a reference that is identical to the target utterance. But when the reference utterance differs from the target text, as in cross-text prosody transfer, these models struggle to separate prosody from text, resulting in redu…
▽ More
Text-To-Speech (TTS) prosody transfer models can generate varied prosodic renditions, for the same text, by conditioning on a reference utterance. These models are trained with a reference that is identical to the target utterance. But when the reference utterance differs from the target text, as in cross-text prosody transfer, these models struggle to separate prosody from text, resulting in reduced perceived naturalness. To address this, we propose a Human-in-the-Loop (HitL) approach. HitL users adjust salient correlates of prosody to make the prosody more appropriate for the target text, while maintaining the overall reference prosodic effect. Human adjusted renditions maintain the reference prosody while being rated as more appropriate for the target text $57.8\%$ of the time. Our analysis suggests that limited user effort suffices for these improvements, and that closeness in the latent reference space is not a reliable prosodic similarity metric for the cross-text condition.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Enhancing Presentation Slide Generation by LLMs with a Multi-Staged End-to-End Approach
Authors:
Sambaran Bandyopadhyay,
Himanshu Maheshwari,
Anandhavelu Natarajan,
Apoorv Saxena
Abstract:
Generating presentation slides from a long document with multimodal elements such as text and images is an important task. This is time consuming and needs domain expertise if done manually. Existing approaches for generating a rich presentation from a document are often semi-automatic or only put a flat summary into the slides ignoring the importance of a good narrative. In this paper, we address…
▽ More
Generating presentation slides from a long document with multimodal elements such as text and images is an important task. This is time consuming and needs domain expertise if done manually. Existing approaches for generating a rich presentation from a document are often semi-automatic or only put a flat summary into the slides ignoring the importance of a good narrative. In this paper, we address this research gap by proposing a multi-staged end-to-end model which uses a combination of LLM and VLM. We have experimentally shown that compared to applying LLMs directly with state-of-the-art prompting, our proposed multi-staged solution is better in terms of automated metrics and human evaluation.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
Information scrambling in quantum-walks
Authors:
Himanshu Sahu
Abstract:
We study information scrambling -- a spread of initially localized quantum information into the system's many degree of freedom -- in discrete-time quantum walks. We consider out-of-time-ordered correlators (OTOC) and K-complexity as probe of information scrambling. The OTOC for local spin operators in all directions has a light-cone structure which is ``shell-like''. As the wavefront passes, the…
▽ More
We study information scrambling -- a spread of initially localized quantum information into the system's many degree of freedom -- in discrete-time quantum walks. We consider out-of-time-ordered correlators (OTOC) and K-complexity as probe of information scrambling. The OTOC for local spin operators in all directions has a light-cone structure which is ``shell-like''. As the wavefront passes, the OTOC approaches to zero in the long-time limit, showing no signature of scrambling. The introduction of spatial or temporal disorder changes the shape of the light-cone akin to localization of wavefuction. We formulate the K-complexity in system with discrete-time evolution, and show that it grows linearly in discrete-time quantum walk. The presence of disorder modifies this growth to sub-linear. Our study present interesting case to explore many-body phenomenon in discrete-time quantum walk using scrambling.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Residue Number System (RNS) based Distributed Quantum Addition
Authors:
Bhaskar Gaur,
Travis S. Humble,
Himanshu Thapliyal
Abstract:
Quantum Arithmetic faces limitations such as noise and resource constraints in the current Noisy Intermediate Scale Quantum (NISQ) era quantum computers. We propose using Distributed Quantum Computing (DQC) to overcome these limitations by substituting a higher depth quantum addition circuit with Residue Number System (RNS) based quantum modulo adders. The RNS-based distributed quantum addition ci…
▽ More
Quantum Arithmetic faces limitations such as noise and resource constraints in the current Noisy Intermediate Scale Quantum (NISQ) era quantum computers. We propose using Distributed Quantum Computing (DQC) to overcome these limitations by substituting a higher depth quantum addition circuit with Residue Number System (RNS) based quantum modulo adders. The RNS-based distributed quantum addition circuits possess lower depth and are distributed across multiple quantum computers/jobs, resulting in higher noise resilience. We propose the Quantum Superior Modulo Addition based on RNS Tool (QSMART), which can generate RNS sets of quantum adders based on multiple factors such as depth, range, and efficiency. We also propose a novel design of Quantum Diminished-1 Modulo (2n + 1) Adder (QDMA), which forms a crucial part of RNS-based distributed quantum addition and the QSMART tool. We demonstrate the higher noise resilience of the Residue Number System (RNS) based distributed quantum addition by conducting simulations modeling Quantinuum's H1 ion trap-based quantum computer. Our simulations demonstrate that RNS-based distributed quantum addition has 11.36% to 133.15% higher output probability over 6-bit to 10-bit non-distributed quantum full adders, indicating higher noise fidelity. Furthermore, we present a scalable way of achieving distributed quantum addition higher than limited otherwise by the 20-qubit range of Quantinuum H1.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Effective Field Theory of Conformal Boundaries
Authors:
Oleksandr Diatlyk,
Himanshu Khanchandani,
Fedor K. Popov,
Yifan Wang
Abstract:
We introduce an effective field theory (EFT) for conformal impurity by considering a pair of transversely displaced impurities and integrating out modes with mass inversely proportional to the separation distance. This EFT captures the universal signature of the impurity seen by a heavy local operator. We focus on the case of conformal boundaries and derive universal formulas from this EFT for the…
▽ More
We introduce an effective field theory (EFT) for conformal impurity by considering a pair of transversely displaced impurities and integrating out modes with mass inversely proportional to the separation distance. This EFT captures the universal signature of the impurity seen by a heavy local operator. We focus on the case of conformal boundaries and derive universal formulas from this EFT for the boundary structure constants at high energy. We point out that the more familiar thermal EFT for conformal field theory is a special case of this EFT with distinguished conformal boundaries. We also derive, for general conformal impurities, non-positivity and convexity-like constraints on the Casimir energy which determines the leading EFT coefficient.
△ Less
Submitted 9 June, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Learning to Recover from Plan Execution Errors during Robot Manipulation: A Neuro-symbolic Approach
Authors:
Namasivayam Kalithasan,
Arnav Tuli,
Vishal Bindal,
Himanshu Gaurav Singh,
Parag Singla,
Rohan Paul
Abstract:
Automatically detecting and recovering from failures is an important but challenging problem for autonomous robots. Most of the recent work on learning to plan from demonstrations lacks the ability to detect and recover from errors in the absence of an explicit state representation and/or a (sub-) goal check function. We propose an approach (blending learning with symbolic search) for automated er…
▽ More
Automatically detecting and recovering from failures is an important but challenging problem for autonomous robots. Most of the recent work on learning to plan from demonstrations lacks the ability to detect and recover from errors in the absence of an explicit state representation and/or a (sub-) goal check function. We propose an approach (blending learning with symbolic search) for automated error discovery and recovery, without needing annotated data of failures. Central to our approach is a neuro-symbolic state representation, in the form of dense scene graph, structured based on the objects present within the environment. This enables efficient learning of the transition function and a discriminator that not only identifies failures but also localizes them facilitating fast re-planning via computation of heuristic distance function. We also present an anytime version of our algorithm, where instead of recovering to the last correct state, we search for a sub-goal in the original plan minimizing the total distance to the goal given a re-planning budget. Experiments on a physics simulator with a variety of simulated failures show the effectiveness of our approach compared to existing baselines, both in terms of efficiency as well as accuracy of our recovery mechanism.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
The estimation of parameters of generalized cosmic Chaplygin gas and viscous modified Chaplygin gas and Accretions around Black Hole in the background of Einstein-Aether gravity
Authors:
Puja Mukherjee,
Ujjal Debnath,
Himanshu Chaudhary,
G. Mustafa
Abstract:
In this paper, we have investigated the phenomenon of accelerated cosmic expansion in the late universe and the mass accretion process of a 4-dimensional Einstein-Aether black hole. Starting with the basics of Einstein-Aether gravity theory, we have first considered the field equations and two eminent models of Chaplygin gas, viz. generalized cosmic Chaplygin gas model and viscous modified Chaplyg…
▽ More
In this paper, we have investigated the phenomenon of accelerated cosmic expansion in the late universe and the mass accretion process of a 4-dimensional Einstein-Aether black hole. Starting with the basics of Einstein-Aether gravity theory, we have first considered the field equations and two eminent models of Chaplygin gas, viz. generalized cosmic Chaplygin gas model and viscous modified Chaplygin gas model. Then, we obtained the energy density and Hubble parameters equations for these models in terms of some dimensionless density parameters and some unknown parameters. After finding the required parameters, we proceeded with the mass accretion process. For both models, we obtained the equation of mass in terms of the redshift function and represented the change of mass of the black hole graphically with redshift. At the same time, we have made a graphical comparison between the above-mentioned models and the $Λ$CDM model of the universe. Eventually, we have concluded that the mass of a 4-dimensional black hole will increase along the universe's evolution in the backdrop of Einstein-Aether gravity.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Dark Energy Model in Einstein and Horava-Lifshitz Gravity with a new Parametrization of $ω(z)$: Model Comparison, Analysis and Observational Constraint
Authors:
Ujjal Debnath,
Himanshu Chaudhary,
Niyaz Uddin Molla,
S. K. J. Pacif,
G. Mustafa
Abstract:
We present a novel dynamical dark energy model within the frameworks of both Einstein gravity and Horava-Lifshitz gravity. Utilizing a new parametrization of the dark energy equation of state $ω(z)$, we derive solutions to the field equations. By employing recent cosmological datasets, such as cosmic chronometer datasets, Type Ia Supernovae datasets, Baryonic Oscillation datasets, and the recent H…
▽ More
We present a novel dynamical dark energy model within the frameworks of both Einstein gravity and Horava-Lifshitz gravity. Utilizing a new parametrization of the dark energy equation of state $ω(z)$, we derive solutions to the field equations. By employing recent cosmological datasets, such as cosmic chronometer datasets, Type Ia Supernovae datasets, Baryonic Oscillation datasets, and the recent Hubble constant value measured by the Hubble Space Telescope and the SH0ES Team as an additional prior. We validate our model and determine optimal parameter values. Furthermore, we analyze the evolution of the Universe by showing the redshift dependence plots of key cosmological parameters through graphical representations. We also perform diagnostic analyses to compare our model with the standard model. Using the Akaike Information Criterion (AIC), we compare the three models and find that all of them are supported by the current data, making it impossible to discard any of them. Our model aligns well with recent observations and unveils intriguing features of the Universe, particularly the late-time behavior of the Universe.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
The ORT and the uGMRT Pulsar Monitoring Program : Pulsar Timing Irregularities & the Gaussian Process Realization
Authors:
Himanshu Grover,
Bhal Chandra Joshi,
Jaikhomba Singha,
Erbil Gügercinoğlu,
Paramasivan Arumugam,
Debades Bandyopadhyay,
James O. Chibueze,
Shantanu Desai,
Innocent O. Eya,
Anu Kundu,
Johnson O. Urama
Abstract:
The spin-down law of pulsars is generally perturbed by two types of timing irregularities: glitches and timing noise. Glitches are sudden changes in the rotational frequency of pulsars, while timing noise is a discernible stochastic wandering in the phase, period, or spin-down rate of a pulsar. We present the timing results of a sample of glitching pulsars observed using the Ooty Radio Telescope (…
▽ More
The spin-down law of pulsars is generally perturbed by two types of timing irregularities: glitches and timing noise. Glitches are sudden changes in the rotational frequency of pulsars, while timing noise is a discernible stochastic wandering in the phase, period, or spin-down rate of a pulsar. We present the timing results of a sample of glitching pulsars observed using the Ooty Radio Telescope (ORT) and the upgraded Giant Metrewave Radio Telescope (uGMRT). Our findings include timing noise analysis for 17 pulsars, with seven being reported for the first time. We detected five glitches in four pulsars and a glitch-like event in J1825-0935. The frequency evolution of glitch in pulsars, J0742-2822 and J1740-3015, is presented for the first time. Additionally, we report timing noise results for three glitching pulsars. The timing noise was analyzed separately in the pre-glitch region and post-glitch regions. We observed an increase in the red noise parameters in the post-glitch regions, where exponential recovery was considered in the noise analysis. Timing noise can introduce ambiguities in the correct evaluation of glitch observations. Hence, it is important to consider timing noise in glitch analysis. We propose an innovative glitch verification approach designed to discern between a glitch and strong timing noise. The novel glitch analysis technique is also demonstrated using the observed data.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Presentations are not always linear! GNN meets LLM for Document-to-Presentation Transformation with Attribution
Authors:
Himanshu Maheshwari,
Sambaran Bandyopadhyay,
Aparna Garimella,
Anandhavelu Natarajan
Abstract:
Automatically generating a presentation from the text of a long document is a challenging and useful problem. In contrast to a flat summary, a presentation needs to have a better and non-linear narrative, i.e., the content of a slide can come from different and non-contiguous parts of the given document. However, it is difficult to incorporate such non-linear map** of content to slides and ensur…
▽ More
Automatically generating a presentation from the text of a long document is a challenging and useful problem. In contrast to a flat summary, a presentation needs to have a better and non-linear narrative, i.e., the content of a slide can come from different and non-contiguous parts of the given document. However, it is difficult to incorporate such non-linear map** of content to slides and ensure that the content is faithful to the document. LLMs are prone to hallucination and their performance degrades with the length of the input document. Towards this, we propose a novel graph based solution where we learn a graph from the input document and use a combination of graph neural network and LLM to generate a presentation with attribution of content for each slide. We conduct thorough experiments to show the merit of our approach compared to directly using LLMs for this task.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Hydrogen peroxide forms spontaneously in water (bulk, film, or microdroplet) via reduction of dissolved oxygen at solid-water interface
Authors:
Muzzamil Ahmad Eatoo,
Himanshu Mishra
Abstract:
Zare and co-workers have recently claimed that hydrogen peroxide is spontaneously generated on the air-water interface of sprayed microdroplets, i.e., that H2O2 forms without an external energy source or co-reactant or catalyst. Specifically, they find that the H2O2(aq) concentration in sprayed microdroplets increases by a factor of 3.5 (or 2.5) as the spray chamber's relative humidity (RH) is cha…
▽ More
Zare and co-workers have recently claimed that hydrogen peroxide is spontaneously generated on the air-water interface of sprayed microdroplets, i.e., that H2O2 forms without an external energy source or co-reactant or catalyst. Specifically, they find that the H2O2(aq) concentration in sprayed microdroplets increases by a factor of 3.5 (or 2.5) as the spray chamber's relative humidity (RH) is changed from 15% to 50% (or from 15% to 95%). Building on these results, they imply causation for the seasonality of viral infections arising from the RH-dependent H2O2 generation in environmental microdroplets. Here, we present an alternative explanation for their observations.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
From Questions to Insightful Answers: Building an Informed Chatbot for University Resources
Authors:
Subash Neupane,
Elias Hossain,
Jason Keith,
Himanshu Tripathi,
Farbod Ghiasi,
Noorbakhsh Amiri Golilarz,
Amin Amirlatifi,
Sudip Mittal,
Shahram Rahimi
Abstract:
This paper presents BARKPLUG V.2, a Large Language Model (LLM)-based chatbot system built using Retrieval Augmented Generation (RAG) pipelines to enhance the user experience and access to information within academic settings.The objective of BARKPLUG V.2 is to provide information to users about various campus resources, including academic departments, programs, campus facilities, and student resou…
▽ More
This paper presents BARKPLUG V.2, a Large Language Model (LLM)-based chatbot system built using Retrieval Augmented Generation (RAG) pipelines to enhance the user experience and access to information within academic settings.The objective of BARKPLUG V.2 is to provide information to users about various campus resources, including academic departments, programs, campus facilities, and student resources at a university setting in an interactive fashion. Our system leverages university data as an external data corpus and ingests it into our RAG pipelines for domain-specific question-answering tasks. We evaluate the effectiveness of our system in generating accurate and pertinent responses for Mississippi State University, as a case study, using quantitative measures, employing frameworks such as Retrieval Augmented Generation Assessment(RAGAS). Furthermore, we evaluate the usability of this system via subjective satisfaction surveys using the System Usability Scale (SUS). Our system demonstrates impressive quantitative performance, with a mean RAGAS score of 0.96, and experience, as validated by usability assessments.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Defining subsystems in Hilbert spaces with non-Euclidean metric
Authors:
Himanshu Badhani,
Sibasish Ghosh
Abstract:
This work outlines a consistent method of identifying subsystems in finite-dimensional Hilbert spaces, independent of the underlying inner-product structure. It has been well established that Hilbert spaces with modified inner-product, defined through the so-called metric operator, turn out to be the most natural ways to represent certain phenomena such as those involving balanced gain and loss re…
▽ More
This work outlines a consistent method of identifying subsystems in finite-dimensional Hilbert spaces, independent of the underlying inner-product structure. It has been well established that Hilbert spaces with modified inner-product, defined through the so-called metric operator, turn out to be the most natural ways to represent certain phenomena such as those involving balanced gain and loss resulting in pseudo-Hermitian Hamiltonians. For composite systems undergoing pseudo-Hermitian evolution, defining the subsystems is generally considered feasible only when the metric operator is chosen to have a tensor product form so that a partial trace operation can be well defined. In this work, we use arguments from algebraic quantum mechanics to show that the subsystems can be well-defined in every metric space -- irrespective of whether or not the metric is of tensor product form. This is done by identifying subsystems with a decomposition of the underlying $C^*$-algebra into commuting sub-algebras. We show that different subsystem decompositions correspond to choosing different equivalence classes of the GNS representation. Furthermore, given a form of pseudo-Hermitian Hamiltonian, the choice of the Hamiltonian compatible metric characterizes the subsystem decomposition and as a consequence, the entanglement structure in the system. We clarify how each of the subsystems, defined this way, can be tomographically constructed and that these subsystems satisfy the no-signaling principle. With these results, we put all the choices of the metric operator on an equal footing.
△ Less
Submitted 5 June, 2024; v1 submitted 13 May, 2024;
originally announced May 2024.
-
Optimized Generation of Entanglement by Real-Time Ordering of Swap** Operations
Authors:
Ranjani G Sundaram,
Himanshu Gupta
Abstract:
Long-distance quantum communication in quantum networks faces significant challenges due to the constraints imposed by the no-cloning theorem. Most existing quantum communication protocols rely on the a priori distribution of entanglement pairs (EPs), a process known to incur considerable latency due to its stochastic nature. In this work, we consider the problem of minimizing the latency of estab…
▽ More
Long-distance quantum communication in quantum networks faces significant challenges due to the constraints imposed by the no-cloning theorem. Most existing quantum communication protocols rely on the a priori distribution of entanglement pairs (EPs), a process known to incur considerable latency due to its stochastic nature. In this work, we consider the problem of minimizing the latency of establishing an EP across a pair of nodes in a quantum network. While prior research has primarily focused on minimizing the expected generation latency by selecting {\em static} entanglement routes and/or swap** trees in advance, our approach considers a real-time adaptive strategy -- wherein the order of entanglement-swap** operations (hence, the swap** tree used) is progressively determined at runtime based on the runtime success/failure of the stochastic events. In this context, we present a greedy algorithm that iteratively determines the best route and/or entanglement-swap** operation to perform at each stage based on the current network. We evaluate our schemes on randomly generated networks and observe a reduction in latency of up to 40% from the optimal offline approach.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Distributed Quantum Computation with Minimum Circuit Execution Time over Quantum Networks
Authors:
Ranjani G Sundaram,
Himanshu Gupta,
C. R. Ramakrishnan
Abstract:
Present quantum computers are constrained by limited qubit capacity and restricted physical connectivity, leading to challenges in large-scale quantum computations. Distributing quantum computations across a network of quantum computers is a promising way to circumvent these challenges and facilitate large quantum computations. However, distributed quantum computations require entanglements (to ex…
▽ More
Present quantum computers are constrained by limited qubit capacity and restricted physical connectivity, leading to challenges in large-scale quantum computations. Distributing quantum computations across a network of quantum computers is a promising way to circumvent these challenges and facilitate large quantum computations. However, distributed quantum computations require entanglements (to execute remote gates) which can incur significant generation latency and, thus, lead to decoherence of qubits. In this work, we consider the problem of distributing quantum circuits across a quantum network to minimize the execution time. The problem entails map** the circuit qubits to network memories, including within each computer since limited connectivity within computers can affect the circuit execution time. We provide two-step solutions for the above problem: In the first step, we allocate qubits to memories to minimize the estimated execution time; for this step, we design an efficient algorithm based on an approximation algorithm for the max-quadratic-assignment problem. In the second step, we determine an efficient execution scheme, including generating required entanglements with minimum latency under the network resource and decoherence constraints; for this step, we develop two algorithms with appropriate performance guarantees under certain settings or assumptions. We consider multiple protocols for executing remote gates, viz., telegates and cat-entanglements. With extensive simulations over NetSquid, a quantum network simulator, we demonstrate the effectiveness of our developed techniques and show that they outperform a scheme based on prior work by up to 95%.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Sufficient conditions for total positivity, compounds, and Dodgson condensation
Authors:
Shaun Fallat,
Himanshu Gupta,
Charles R. Johnson
Abstract:
A $n$-by-$n$ matrix is called totally positive ($TP$) if all its minors are positive and $TP_k$ if all of its $k$-by-$k$ submatrices are $TP$. For an arbitrary totally positive matrix or $TP_k$ matrix, we investigate if the $r$th compound ($1<r<n$) is in turn $TP$ or $TP_k$, and demonstrate a strong negative resolution in general. Focus is then shifted to Dodgson's algorithm for calculating the de…
▽ More
A $n$-by-$n$ matrix is called totally positive ($TP$) if all its minors are positive and $TP_k$ if all of its $k$-by-$k$ submatrices are $TP$. For an arbitrary totally positive matrix or $TP_k$ matrix, we investigate if the $r$th compound ($1<r<n$) is in turn $TP$ or $TP_k$, and demonstrate a strong negative resolution in general. Focus is then shifted to Dodgson's algorithm for calculating the determinant of a generic matrix, and we analyze whether the associated condensed matrices are possibly totally positive or $TP_k$. We also show that all condensed matrices associated with a $TP$ Hankel matrix are $TP$.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language
Authors:
Ronny Paul,
Himanshu Buckchash,
Shantipriya Parida,
Dilip K. Prasad
Abstract:
Sámi, an indigenous language group comprising multiple languages, faces digital marginalization due to the limited availability of data and sophisticated language models designed for its linguistic intricacies. This work focuses on increasing technological participation for the Sámi language. We draw the attention of the ML community towards the language modeling problem of Ultra Low Resource (ULR…
▽ More
Sámi, an indigenous language group comprising multiple languages, faces digital marginalization due to the limited availability of data and sophisticated language models designed for its linguistic intricacies. This work focuses on increasing technological participation for the Sámi language. We draw the attention of the ML community towards the language modeling problem of Ultra Low Resource (ULR) languages. ULR languages are those for which the amount of available textual resources is very low, and the speaker count for them is also very low. ULRLs are also not supported by mainstream Large Language Models (LLMs) like ChatGPT, due to which gathering artificial training data for them becomes even more challenging. Mainstream AI foundational model development has given less attention to this category of languages. Generally, these languages have very few speakers, making it hard to find them. However, it is important to develop foundational models for these ULR languages to promote inclusion and the tangible abilities and impact of LLMs. To this end, we have compiled the available Sámi language resources from the web to create a clean dataset for training language models. In order to study the behavior of modern LLM models with ULR languages (Sámi), we have experimented with different kinds of LLMs, mainly at the order of $\sim$ seven billion parameters. We have also explored the effect of multilingual LLM training for ULRLs. We found that the decoder-only models under a sequential multilingual training scenario perform better than joint multilingual training, whereas multilingual training with high semantic overlap, in general, performs better than training from scratch.This is the first study on the Sámi language for adapting non-statistical language models that use the latest developments in the field of natural language processing (NLP).
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Information revival without backflow: non-causal explanations of non-Markovianity
Authors:
Francesco Buscemi,
Rajeev Gangwar,
Kaumudibikash Goswami,
Himanshu Badhani,
Tanmoy Pandit,
Brij Mohan,
Siddhartha Das,
Manabendra Nath Bera
Abstract:
The study of information revivals, witnessing the violation of certain data-processing inequalities, has provided an important paradigm in the study of non-Markovian processes. Although often used interchangeably, we argue here that the notions of ``revivals'' and ``backflows'', i.e., flows of information from the environment back into the system, are distinct: an information revival can occur wit…
▽ More
The study of information revivals, witnessing the violation of certain data-processing inequalities, has provided an important paradigm in the study of non-Markovian processes. Although often used interchangeably, we argue here that the notions of ``revivals'' and ``backflows'', i.e., flows of information from the environment back into the system, are distinct: an information revival can occur without any backflow ever taking place. In this paper, we examine in detail the phenomenon of non-causal revivals and relate them to the theory of short Markov chains and squashed non-Markovianity. As a byproduct, we demonstrate that focusing on processes with actual backflows, while excluding those with only non-causal revivals, resolves the issue of non-convexity of Markovianity, thus enabling the construction of a convex resource theory of genuine non-Markovianity.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs
Authors:
Feiyang Kang,
Hoang Anh Just,
Yifan Sun,
Himanshu Jahagirdar,
Yuanzhi Zhang,
Rongxing Du,
Anit Kumar Sahu,
Ruoxi Jia
Abstract:
This work focuses on leveraging and selecting from vast, unlabeled, open data to pre-fine-tune a pre-trained language model. The goal is to minimize the need for costly domain-specific data for subsequent fine-tuning while achieving desired performance levels. While many data selection algorithms have been designed for small-scale applications, rendering them unsuitable for our context, some emerg…
▽ More
This work focuses on leveraging and selecting from vast, unlabeled, open data to pre-fine-tune a pre-trained language model. The goal is to minimize the need for costly domain-specific data for subsequent fine-tuning while achieving desired performance levels. While many data selection algorithms have been designed for small-scale applications, rendering them unsuitable for our context, some emerging methods do cater to language data scales. However, they often prioritize data that aligns with the target distribution. While this strategy may be effective when training a model from scratch, it can yield limited results when the model has already been pre-trained on a different distribution. Differing from prior work, our key idea is to select data that nudges the pre-training distribution closer to the target distribution. We show the optimality of this approach for fine-tuning tasks under certain conditions. We demonstrate the efficacy of our methodology across a diverse array of tasks (NLU, NLG, zero-shot) with models up to 2.7B, showing that it consistently surpasses other selection methods. Moreover, our proposed method is significantly faster than existing techniques, scaling to millions of samples within a single GPU hour. Our code is open-sourced (Code repository: https://anonymous.4open.science/r/DV4LLM-D761/ ). While fine-tuning offers significant potential for enhancing performance across diverse tasks, its associated costs often limit its widespread adoption; with this work, we hope to lay the groundwork for cost-effective fine-tuning, making its benefits more accessible.
△ Less
Submitted 4 May, 2024;
originally announced May 2024.
-
Change of polarization degree of light beams on propagation in curved space
Authors:
You-Lin Chuang,
Himanshu Parihar
Abstract:
Even in free space, which is commonly considered of as a flat space-time in most settings, the degree of polarization of a partially spatially coherent light beam changes as it travels. Similarly, the polarization degree would change when a partially spatially coherent light beam propagates in a curved space-time. The difference of the polarization degree between the curved space and flat space ca…
▽ More
Even in free space, which is commonly considered of as a flat space-time in most settings, the degree of polarization of a partially spatially coherent light beam changes as it travels. Similarly, the polarization degree would change when a partially spatially coherent light beam propagates in a curved space-time. The difference of the polarization degree between the curved space and flat space can reveal the essential structure of the curved space. In this work, we consider a simplest case of curved space known as Schwarzschild spacetime. We can simulate the Schwarzschild space-time as an optical material with an effective refractive index. The difference of the polarization degree of a light beam propagating in curved space and flat space can be achieved up to $ 5\% $, which is detectable in practical measurement. In addition, we have found that the partially spatially coherent light source is necessary for obtaining significant changes in polarization degree. Our results provide an alternative method to estimate the Schwarzschild radius of a massive object with the optical polarization degree measurement.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Optimized Distribution of Entanglement Graph States in Quantum Networks
Authors:
Xiaojie Fan,
Caitao Zhan,
Himanshu Gupta,
C. R. Ramakrishnan
Abstract:
Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantu…
▽ More
Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantum networks lie at the heart of the success of future quantum information technologies. In quantum networks, multipartite entangled states distributed over the network help implement and support many quantum network applications for communications, sensing, and computing. Our work focuses on develo** optimal techniques to generate and distribute multipartite entanglement states efficiently. Prior works on generating general multipartite entanglement states have focused on the objective of minimizing the number of maximally entangled pairs (EPs) while ignoring the heterogeneity of the network nodes and links as well as the stochastic nature of underlying processes. In this work, we develop a hypergraph based linear programming framework that delivers optimal (under certain assumptions) generation schemes for general multipartite entanglement represented by graph states, under the network resources, decoherence, and fidelity constraints, while considering the stochasticity of the underlying processes. We illustrate our technique by develo** generation schemes for the special cases of path and tree graph states, and discuss optimized generation schemes for more general classes of graph states. Using extensive simulations over a quantum network simulator (NetSquid), we demonstrate the effectiveness of our developed techniques and show that they outperform prior known schemes by up to orders of magnitude.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
Advancing Healthcare Automation: Multi-Agent System for Medical Necessity Justification
Authors:
Himanshu Pandey,
Akhil Amod,
Shivang
Abstract:
Prior Authorization delivers safe, appropriate, and cost-effective care that is medically justified with evidence-based guidelines. However, the process often requires labor-intensive manual comparisons between patient medical records and clinical guidelines, that is both repetitive and time-consuming. Recent developments in Large Language Models (LLMs) have shown potential in addressing complex m…
▽ More
Prior Authorization delivers safe, appropriate, and cost-effective care that is medically justified with evidence-based guidelines. However, the process often requires labor-intensive manual comparisons between patient medical records and clinical guidelines, that is both repetitive and time-consuming. Recent developments in Large Language Models (LLMs) have shown potential in addressing complex medical NLP tasks with minimal supervision. This paper explores the application of Multi-Agent System (MAS) that utilize specialized LLM agents to automate Prior Authorization task by breaking them down into simpler and manageable sub-tasks. Our study systematically investigates the effects of various prompting strategies on these agents and benchmarks the performance of different LLMs. We demonstrate that GPT-4 achieves an accuracy of 86.2% in predicting checklist item-level judgments with evidence, and 95.6% in determining overall checklist judgment. Additionally, we explore how these agents can contribute to explainability of steps taken in the process, thereby enhancing trust and transparency in the system.
△ Less
Submitted 6 July, 2024; v1 submitted 27 April, 2024;
originally announced April 2024.
-
Why Some Metal Ions Spontaneously Form Nanoparticles in Water Microdroplets? Disentangling the Contributions of Air-Water Interface and Bulk Redox Chemistry
Authors:
Muzzamil Ahmad Eatoo,
Nimer Wehbe,
Najeh Kharbatia,
Xianrong Guo,
Himanshu Mishra
Abstract:
Water microdroplets containing 100 micromolar HAuCl4 have been shown to reduce gold ions into gold nanoparticles spontaneously. It has been suggested that this chemical transformation is driven by ultrahigh electric fields at the air-water interface, albeit without mechanistic insight. We investigated the fate of several metallic salts in water, methanol, ethanol, and acetonitrile in bulk and micr…
▽ More
Water microdroplets containing 100 micromolar HAuCl4 have been shown to reduce gold ions into gold nanoparticles spontaneously. It has been suggested that this chemical transformation is driven by ultrahigh electric fields at the air-water interface, albeit without mechanistic insight. We investigated the fate of several metallic salts in water, methanol, ethanol, and acetonitrile in bulk and microdroplets. This revealed that when HAuCl4 (or PtCl4) is added to bulk water (or methanol or ethanol), metal NPs appear spontaneously. Over time, the nanoparticles grow in bulk, as evidenced by the solution's changing colors. If the same bulk solution is sprayed pneumatically and collected, the NP size has no significant enhancement. Interestingly, the reduction of metal ions is accompanied by the oxidation of water (or alcohols); however, these redox reactions are minimal in acetonitrile. We establish that the spontaneous reduction of metal ions is (i) not limited to water or gold ions, (ii) not driven by the air-water interface of microdroplets, and (iii) appears to be a general phenomenon for solvents containing hydroxyl groups. These results advance our understanding of liquids in general and should be relevant in soil chemistry, biogeochemistry, electrochemistry, and green chemistry.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
Host star properties of hot, warm and cold Jupiters in the solar neighborhood from \textit{Gaia} DR3: clues to formation pathways
Authors:
Bihan Banerjee,
Mayank Narang,
P. Manoj,
Thomas Henning,
Himanshu Tyagi,
Arun Surya,
Prasanta K. Nayak,
Mihir Tripathi
Abstract:
Giant planets exhibit diverse orbital properties, hinting at their distinct formation and dynamic histories. In this paper, using $\textit{Gaia}$ DR3, we investigate if and how the orbital properties of Jupiters are linked to their host star properties, particularly their metallicity and age. We obtain metallicities for main sequence stars of spectral type F, G, and K, hosting hot, warm, and cold…
▽ More
Giant planets exhibit diverse orbital properties, hinting at their distinct formation and dynamic histories. In this paper, using $\textit{Gaia}$ DR3, we investigate if and how the orbital properties of Jupiters are linked to their host star properties, particularly their metallicity and age. We obtain metallicities for main sequence stars of spectral type F, G, and K, hosting hot, warm, and cold Jupiters with varying eccentricities. We compute the velocity dispersion of host stars of these three groups using kinematic information from $\textit{Gaia}$ DR3 and obtain average ages using velocity dispersion-age relation. We find that host stars of hot Jupiters are relatively metal-rich ([Fe/H]=$0.18 \pm 0.13$) and young ( median age $3.97 \pm 0.51$ Gyr) compared to the host stars of cold Jupiters in nearly circular orbits, which are relatively metal-poor ($0.03 \pm 0.18$) and older (median age $6.07 \pm 0.79$ Gyr). Host stars of cold Jupiters in high eccentric orbits, on the other hand, show metallicities similar to that of the hosts of hot Jupiters, but are older, on average (median age $6.25 \pm 0.92$ Gyr). The similarity in metallicity between hosts of hot Jupiters and hosts of cold Jupiters in high eccentric orbits supports high eccentricity migration as the potential origin of hot Jupiters, with the latter serving as the progenitors. However, the average age difference between them suggests that the older hot Jupiters may have been engulfed by the star in a timescale of $\sim 6$ Gyr. This allows us to estimate the value of stellar tidal quality factor $Q'_\ast\sim10^{6\pm1}$.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
JWST detections of amorphous and crystalline HDO ice toward massive protostars
Authors:
Katerina Slavicinska,
Ewine F. van Dishoeck,
Łukasz Tychoniec,
Pooneh Nazari,
Adam E. Rubinstein,
Robert Gutermuth,
Himanshu Tyagi,
Yuan Chen,
Nashanty G. C. Brunken,
Will R. M. Rocha,
P. Manoj,
Mayank Narang,
S. Thomas Megeath,
Yao-Lun Yang,
Leslie W. Looney,
John J. Tobin,
Henrik Beuther,
Tyler L. Bourke,
Harold Linnartz,
Samuel Federman,
Dan M. Watson,
Hendrik Linz
Abstract:
This work aims to utilize the increased sensitivity and resolution of the JWST to quantify the HDO/H$_{2}$O ratio in ices toward young stellar objects (YSOs) and to determine if the HDO/H$_{2}$O ratios measured in the gas phase toward massive YSOs (MYSOs) are representative of the ratios in their ice envelopes. Two protostars observed in the Investigating Protostellar Accretion (IPA) program using…
▽ More
This work aims to utilize the increased sensitivity and resolution of the JWST to quantify the HDO/H$_{2}$O ratio in ices toward young stellar objects (YSOs) and to determine if the HDO/H$_{2}$O ratios measured in the gas phase toward massive YSOs (MYSOs) are representative of the ratios in their ice envelopes. Two protostars observed in the Investigating Protostellar Accretion (IPA) program using JWST NIRSpec were analyzed: HOPS 370, an intermediate-mass YSO (IMYSO), and IRAS 20126+4104, a MYSO. The HDO ice toward these sources was detected above the 3$σ$ level and quantified via its 4.1 $μ$m band. The contributions from the CH$_{3}$OH combination modes to the observed optical depth in this spectral region were constrained via the CH$_{3}$OH 3.53 $μ$m band to ensure that the integrated optical depth of the HDO feature was not overestimated. H$_{2}$O ice was quantified via its 3 $μ$m band. From these fits, ice HDO/H$_{2}$O abundance ratios of 4.6$\pm$1.8$\times$10$^{-3}$ and 2.6$\pm$1.2$\times$10$^{-3}$ are obtained for HOPS 370 and IRAS 20126+4104, respectively. The simultaneous detections of both crystalline HDO and crystalline H$_{2}$O corroborate the assignment of the observed feature at 4.1 $μ$m to HDO ice. The ice HDO/H$_{2}$O ratios are similar to the highest reported gas HDO/H$_{2}$O ratios measured toward MYSOs as well as the hot inner regions of isolated low-mass protostars, suggesting that at least some of the gas HDO/H$_{2}$O ratios measured toward massive hot cores are representative of the HDO/H$_{2}$O ratios in ices. The need for an H$_{2}$O-rich CH$_{3}$OH component in the CH$_{3}$OH ice analysis supports recent experimental and observational results that indicate that some CH$_{3}$OH ice may form prior to the CO freeze-out stage in H$_{2}$O-rich ice layers.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Comparison of On-Orbit Manual Attitude Control Methods for Non-Docking Spacecraft Through Virtual Reality Simulation
Authors:
Ajit Krishnan,
Himanshu Vishwakarma,
Maharudra Kharsade,
Pradipta Biswas
Abstract:
On-orbit manual attitude control of manned spacecraft is accomplished using external visual references and some method of three axis attitude control. All past, present, and developmental spacecraft feature the capability to manually control attitude for deorbit. National Aeronautics and Space Administration (NASA) spacecraft permit an aircraft windshield type front view, wherein an arc of the Ear…
▽ More
On-orbit manual attitude control of manned spacecraft is accomplished using external visual references and some method of three axis attitude control. All past, present, and developmental spacecraft feature the capability to manually control attitude for deorbit. National Aeronautics and Space Administration (NASA) spacecraft permit an aircraft windshield type front view, wherein an arc of the Earths horizon is visible to the crew in deorbit attitude. Russian and Chinese spacecraft permit the crew a bottom view wherein the entire circular Earth horizon disk is visible to the crew in deorbit attitude. Our study compared these two types of external views for efficiency in achievement of deorbit attitude. We used a Unity Virtual Reality (VR) spacecraft simulator that we built in house. The task was to accurately achieve deorbit attitude while in a 400 km circular orbit. Six military test pilots and six civilians with gaming experience flew the task using two methods of visual reference. Comparison was based on time taken, fuel consumed, cognitive workload assessment and user preference. We used ocular parameters, EEG, NASA TLX and IBM SUS to quantify our results. Our study found that the bottom view was easier to operate for manual deorbit task. Additionally, we realized that a VR based system can work as a training simulator for manual on-orbit flight path control tasks by pilots and non pilots. Results from our study can be used for design of manual on orbit attitude control of present and future spacecrafts.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Sup3r: A Semi-Supervised Algorithm for increasing Sparsity, Stability, and Separability in Hierarchy Of Time-Surfaces architectures
Authors:
Marco Rasetto,
Himanshu Akolkar,
Ryad Benosman
Abstract:
The Hierarchy Of Time-Surfaces (HOTS) algorithm, a neuromorphic approach for feature extraction from event data, presents promising capabilities but faces challenges in accuracy and compatibility with neuromorphic hardware. In this paper, we introduce Sup3r, a Semi-Supervised algorithm aimed at addressing these challenges. Sup3r enhances sparsity, stability, and separability in the HOTS networks.…
▽ More
The Hierarchy Of Time-Surfaces (HOTS) algorithm, a neuromorphic approach for feature extraction from event data, presents promising capabilities but faces challenges in accuracy and compatibility with neuromorphic hardware. In this paper, we introduce Sup3r, a Semi-Supervised algorithm aimed at addressing these challenges. Sup3r enhances sparsity, stability, and separability in the HOTS networks. It enables end-to-end online training of HOTS networks replacing external classifiers, by leveraging semi-supervised learning. Sup3r learns class-informative patterns, mitigates confounding features, and reduces the number of processed events. Moreover, Sup3r facilitates continual and incremental learning, allowing adaptation to data distribution shifts and learning new tasks without forgetting. Preliminary results on N-MNIST demonstrate that Sup3r achieves comparable accuracy to similarly sized Artificial Neural Networks trained with back-propagation. This work showcases the potential of Sup3r to advance the capabilities of HOTS networks, offering a promising avenue for neuromorphic algorithms in real-world applications.
△ Less
Submitted 30 April, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
FastVPINNs: Tensor-Driven Acceleration of VPINNs for Complex Geometries
Authors:
Thivin Anandh,
Divij Ghose,
Himanshu Jain,
Sashikumaar Ganesan
Abstract:
Variational Physics-Informed Neural Networks (VPINNs) utilize a variational loss function to solve partial differential equations, mirroring Finite Element Analysis techniques. Traditional hp-VPINNs, while effective for high-frequency problems, are computationally intensive and scale poorly with increasing element counts, limiting their use in complex geometries. This work introduces FastVPINNs, a…
▽ More
Variational Physics-Informed Neural Networks (VPINNs) utilize a variational loss function to solve partial differential equations, mirroring Finite Element Analysis techniques. Traditional hp-VPINNs, while effective for high-frequency problems, are computationally intensive and scale poorly with increasing element counts, limiting their use in complex geometries. This work introduces FastVPINNs, a tensor-based advancement that significantly reduces computational overhead and improves scalability. Using optimized tensor operations, FastVPINNs achieve a 100-fold reduction in the median training time per epoch compared to traditional hp-VPINNs. With proper choice of hyperparameters, FastVPINNs surpass conventional PINNs in both speed and accuracy, especially in problems with high-frequency solutions. Demonstrated effectiveness in solving inverse problems on complex domains underscores FastVPINNs' potential for widespread application in scientific and engineering challenges, opening new avenues for practical implementations in scientific machine learning.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Effects of Superradiance in Active Galactic Nuclei
Authors:
Priyanka Sarmah,
Himanshu Verma,
Kingman Cheung,
Joseph Silk
Abstract:
A spinning supermassive black hole (SMBH) at the core of an active galactic nucleus (AGN) provides room for the elusive ultra-light scalar particles (ULSP) to be produced through a phenomenon called \textit{superradiance}. As a result of this phenomenon, a cloud of scalar particles forms around the black hole by draining the spin angular momentum of the SMBH. In this work, we present a study of th…
▽ More
A spinning supermassive black hole (SMBH) at the core of an active galactic nucleus (AGN) provides room for the elusive ultra-light scalar particles (ULSP) to be produced through a phenomenon called \textit{superradiance}. As a result of this phenomenon, a cloud of scalar particles forms around the black hole by draining the spin angular momentum of the SMBH. In this work, we present a study of the superradiant instability due to a scalar field in the vicinity of the central SMBH in an AGN. We begin by showing that the time-evolution of the gravitational coupling $α$ in a realistic ambiance created by the accretion disk around the SMBH in AGN leads to interesting consequences such as the amplified growth of the scalar cloud, enhancement of the gravitational wave emission rate, and appearance of higher modes of superradiance within the age of the Universe ($\sim 10^{10}$ years). We then explore the consequence of superradiance on the characteristics of the AGN. Using the Novikov-Thorne model for an accretion disk, we divide the full spectrum into three distinct wavelength bands- X-ray ($10^{-4}-10^{-2}~μ$m), UV (0.010-0.4~$μ$m), and Vis-IR (0.4~$μ$m-100~$μ$m) and observe sudden drops in the time-variations of the luminosities across these bands and Eddington ratio ($f_{\textrm{Edd}}$) with a characteristic timescale of superradiance. Using a uniform distribution of spin and mass of the SMBHs in AGNs, we demonstrate the appearance of depleted regions and accumulations along the boundaries of these regions in the planes of different band-luminosities and $f_{\textrm{Edd}}$. Finally, we discuss some possible signatures of superradiance that can be drawn from the observed time-variation of the AGN luminosities.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Sketch-Plan-Generalize: Continual Few-Shot Learning of Inductively Generalizable Spatial Concepts
Authors:
Namasivayam Kalithasan,
Sachit Sachdeva,
Himanshu Gaurav Singh,
Vishal Bindal,
Arnav Tuli,
Gurarmaan Singh Panjeta,
Divyanshu Aggarwal,
Rohan Paul,
Parag Singla
Abstract:
Our goal is to enable embodied agents to learn inductively generalizable spatial concepts, e.g., learning staircase as an inductive composition of towers of increasing height. Given a human demonstration, we seek a learning architecture that infers a succinct ${program}$ representation that explains the observed instance. Additionally, the approach should generalize inductively to novel structures…
▽ More
Our goal is to enable embodied agents to learn inductively generalizable spatial concepts, e.g., learning staircase as an inductive composition of towers of increasing height. Given a human demonstration, we seek a learning architecture that infers a succinct ${program}$ representation that explains the observed instance. Additionally, the approach should generalize inductively to novel structures of different sizes or complex structures expressed as a hierarchical composition of previously learned concepts. Existing approaches that use code generation capabilities of pre-trained large (visual) language models, as well as purely neural models, show poor generalization to a-priori unseen complex concepts. Our key insight is to factor inductive concept learning as (i) ${\it Sketch:}$ detecting and inferring a coarse signature of a new concept (ii) ${\it Plan:}$ performing MCTS search over grounded action sequences (iii) ${\it Generalize:}$ abstracting out grounded plans as inductive programs. Our pipeline facilitates generalization and modular reuse, enabling continual concept learning. Our approach combines the benefits of the code generation ability of large language models (LLM) along with grounded neural representations, resulting in neuro-symbolic programs that show stronger inductive generalization on the task of constructing complex structures in relation to LLM-only and neural-only approaches. Furthermore, we demonstrate reasoning and planning capabilities with learned concepts for embodied instruction following.
△ Less
Submitted 29 May, 2024; v1 submitted 11 April, 2024;
originally announced April 2024.
-
JWST/MIRI detection of suprathermal OH rotational emissions: probing the dissociation of the water by Lyman alpha photons near the protostar HOPS 370
Authors:
David A. Neufeld,
P. Manoj,
Himanshu Tyagi,
Mayank Narang,
Dan M. Watson,
S. Thomas Megeath,
Ewine F. Van Dishoeck,
Robert A. Gutermuth,
Thomas Stanke,
Yao-Lun Yang,
Adam E. Rubinstein,
Guillem Anglada,
Henrik Beuther,
Alessio Caratti o Garatti,
Neal J. Evans II,
Samuel Federman,
William J. Fischer,
Joel Green,
Pamela Klaassen,
Leslie W. Looney,
Mayra Osorio,
Pooneh Nazari,
John J. Tobin,
Lukasz Tychoniec,
Scott Wolk
Abstract:
Using the MIRI/MRS spectrometer on JWST, we have detected pure rotational, suprathermal OH emissions from the vicinity of the intermediate-mass protostar HOPS 370 (OMC2/FIR3). These emissions are observed from shocked knots in a jet/outflow, and originate in states of rotational quantum number as high as 46 that possess excitation energies as large as $E_U/k = 4.65 \times 10^4$ K. The relative str…
▽ More
Using the MIRI/MRS spectrometer on JWST, we have detected pure rotational, suprathermal OH emissions from the vicinity of the intermediate-mass protostar HOPS 370 (OMC2/FIR3). These emissions are observed from shocked knots in a jet/outflow, and originate in states of rotational quantum number as high as 46 that possess excitation energies as large as $E_U/k = 4.65 \times 10^4$ K. The relative strengths of the observed OH lines provide a powerful diagnostic of the ultraviolet radiation field in a heavily-extinguished region ($A_V \sim 10 - 20$) where direct UV observations are impossible. To high precision, the OH line strengths are consistent with a picture in which the suprathermal OH states are populated following the photodissociation of water in its $\tilde B - X$ band by ultraviolet radiation produced by fast ($\sim 80\,\rm km\,s^{-1}$) shocks along the jet. The observed dominance of emission from symmetric ($A^\prime$) OH states over that from antisymmetric ($A^{\prime\prime}$) states provides a distinctive signature of this particular population mechanism. Moreover, the variation of intensity with rotational quantum number suggests specifically that Ly$α$ radiation is responsible for the photodissociation of water, an alternative model with photodissociation by a 10$^4$ K blackbody being disfavored at a high level of significance. Using measurements of the Br$α$ flux to estimate the Ly$α$ production rate, we find that $\sim 4\%$ of the Ly$α$ photons are absorbed by water. Combined with direct measurements of water emissions in the $ν_2 = 1 -0$ band, the OH observations promise to provide key constraints on future models for the diffusion of Ly$α$ photons in the vicinity of a shock front.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Defect Fusion and Casimir Energy in Higher Dimensions
Authors:
Oleksandr Diatlyk,
Himanshu Khanchandani,
Fedor K. Popov,
Yifan Wang
Abstract:
We study the operator algebra of extended conformal defects in more than two spacetime dimensions. Such algebra structure encodes the combined effect of multiple impurities on physical observables at long distances as well as the interactions among the impurities. These features are formalized by a fusion product which we define for a pair of defects, after isolating divergences that capture the e…
▽ More
We study the operator algebra of extended conformal defects in more than two spacetime dimensions. Such algebra structure encodes the combined effect of multiple impurities on physical observables at long distances as well as the interactions among the impurities. These features are formalized by a fusion product which we define for a pair of defects, after isolating divergences that capture the effective potential between the defects, which generalizes the usual Casimir energy. We discuss general properties of the corresponding fusion algebra and contrast with the more familiar cases that involve topological defects. We also describe the relation to a different defect setup in the shape of a wedge. We provide explicit examples to illustrate these properties using line defects and interfaces in the Wilson-Fisher CFT and the Gross-Neveu(-Yukawa) CFT and determine the defect fusion data thereof.
△ Less
Submitted 7 June, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
Positivity preservers over finite fields
Authors:
Dominique Guillot,
Himanshu Gupta,
Prateek Kumar Vishwakarma
Abstract:
We resolve an algebraic version of Schoenberg's celebrated theorem [Duke Math. J., 1942] characterizing entrywise matrix transforms that preserve positive definiteness. Compared to the classical real and complex settings, we consider matrices with entries in a finite field and obtain a complete characterization of such preservers for matrices of a fixed dimension. When the dimension of the matrice…
▽ More
We resolve an algebraic version of Schoenberg's celebrated theorem [Duke Math. J., 1942] characterizing entrywise matrix transforms that preserve positive definiteness. Compared to the classical real and complex settings, we consider matrices with entries in a finite field and obtain a complete characterization of such preservers for matrices of a fixed dimension. When the dimension of the matrices is at least $3$, we prove that, surprisingly, the positivity preservers are precisely the positive multiples of the field's automorphisms. Our work makes crucial use of the well-known character-sum bound due to Weil, and of a result of Carlitz [Proc. Amer. Math. Soc., 1960] that provides a characterization of the automorphisms of Paley graphs.
△ Less
Submitted 25 April, 2024; v1 submitted 29 March, 2024;
originally announced April 2024.
-
Optimal State Estimation in the Presence of Non-Gaussian Uncertainty via Wasserstein Distance Minimization
Authors:
Himanshu Prabhat,
Raktim Bhattacharya
Abstract:
This paper presents a novel distribution-agnostic Wasserstein distance-based estimation framework. The goal is to determine an optimal map combining prior estimate with measurement likelihood such that posterior estimation error optimally reaches the Dirac delta distribution with minimal effort. The Wasserstein metric is used to quantify the effort of transporting from one distribution to another.…
▽ More
This paper presents a novel distribution-agnostic Wasserstein distance-based estimation framework. The goal is to determine an optimal map combining prior estimate with measurement likelihood such that posterior estimation error optimally reaches the Dirac delta distribution with minimal effort. The Wasserstein metric is used to quantify the effort of transporting from one distribution to another. We hypothesize that minimizing the Wasserstein distance between the posterior error and the Dirac delta distribution results in optimal information fusion and posterior state uncertainty. Framework validation is demonstrated by the successful recovery of the classical Kalman filter for linear systems with Gaussian uncertainties. Notably, the proposed Wasserstein filter does not rely on particle representation of uncertainty. Furthermore, the classical result for the Gaussian Sum Filter (GSF) is retrieved from the Wasserstein framework. This approach analytically exhibits the suboptimality of GSF and enables the use of nonlinear optimization techniques to enhance the accuracy of the Gaussian sum estimator.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Strain aided drastic reduction in lattice thermal conductivity and improved thermoelectric properties in Janus MXenes
Authors:
Himanshu Murari,
Swati Shaw,
Subhradip Ghosh
Abstract:
Surface and strain engineering are among the cheaper ways to modulate structure property relations in materials. Due to their compositional flexibilities, MXenes, the family of two-dimensional materials, provide enough opportunity for surface engineering. In this work, we have explored the possibility of improving thermoelectric efficiency of MXenes through these routes. The Janus MXenes obtained…
▽ More
Surface and strain engineering are among the cheaper ways to modulate structure property relations in materials. Due to their compositional flexibilities, MXenes, the family of two-dimensional materials, provide enough opportunity for surface engineering. In this work, we have explored the possibility of improving thermoelectric efficiency of MXenes through these routes. The Janus MXenes obtained by modifications of the transition metal constituents and the functional groups passivating their surfaces are considered as surface engineered materials on which bi-axial strain is applied in a systematic way. We find that in the three Janus compounds Zr$_{2}$COS, ZrHfO$_{2}$ and ZrHfCOS, tensile strain modifies the electronic and lattice thermoelectric parameters such that the thermoelectric efficiency can be maximised. A remarkable reduction in the lattice thermal conductivity due to increased anharmonicity and elevation in Seebeck coefficient are obtained by application of moderate tensile strain. With the help of first-principles electronic structure method and semi-classical Boltzmann transport theory we analyse the interplay of structural parameters, electronic and dynamical properties to understand the effects of strain and surface modifications on thermoelectric properties of these systems. Our detailed calculations and in depth analysis lead not only to the microscopic understanding of the influences of surface and strain engineering in these three systems, but also provide enough insights for adopting this approach and improve thermoelectric efficiencies in similar systems.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
BFT-PoLoc: A Byzantine Fortified Trigonometric Proof of Location Protocol using Internet Delays
Authors:
Peiyao Sheng,
Vishal Sevani,
Ranvir Rana,
Himanshu Tyagi,
Pramod Viswanath
Abstract:
Internet platforms depend on accurately determining the geographical locations of online users to deliver targeted services (e.g., advertising). The advent of decentralized platforms (blockchains) emphasizes the importance of geographically distributed nodes, making the validation of locations more crucial. In these decentralized settings, mutually non-trusting participants need to {\em prove} the…
▽ More
Internet platforms depend on accurately determining the geographical locations of online users to deliver targeted services (e.g., advertising). The advent of decentralized platforms (blockchains) emphasizes the importance of geographically distributed nodes, making the validation of locations more crucial. In these decentralized settings, mutually non-trusting participants need to {\em prove} their locations to each other. The incentives for claiming desired location include decentralization properties (validators of a blockchain), explicit rewards for improving coverage (physical infrastructure blockchains) and regulatory compliance -- and entice participants towards prevaricating their true location malicious via VPNs, tampering with internet delays, or compromising other parties (challengers) to misrepresent their location. Traditional delay-based geolocation methods focus on reducing the noise in measurements and are very vulnerable to wilful divergences from prescribed protocol.
In this paper we use Internet delay measurements to securely prove the location of IP addresses while being immune to a large fraction of Byzantine actions. Our core methods are to endow Internet telemetry tools (e.g., **) with cryptographic primitives (signatures and hash functions) together with Byzantine resistant data inferences subject to Euclidean geometric constraints. We introduce two new networking protocols, robust against Byzantine actions: Proof of Internet Geometry (PoIG) converts delay measurements into precise distance estimates across the Internet; Proof of Location (PoLoc) enables accurate and efficient multilateration of a specific IP address. The key algorithmic innovations are in conducting ``Byzantine fortified trigonometry" (BFT) inferences of data, endowing low rank matrix completion methods with Byzantine resistance.
△ Less
Submitted 28 March, 2024; v1 submitted 19 March, 2024;
originally announced March 2024.