Search | arXiv e-print repository

arXiv:2406.19771 [pdf]

Unveiling photon-photon coupling induced transparency and absorption

Authors: Kuldeep Kumar Shrivastava, Ansuman Sahu, Biswanath Bhoi, Rajeev Singh

Abstract: This study presents the theoretical foundations of an analogous electromagnetically induced transparency (EIT) and absorption (EIA) which we are referring as coupling induced transparency (CIT) and absorption (CIA) respectively, along with an exploration of the transition between these phenomena. We provide a concise phenomenological description with analytical expressions for transmission spectra… ▽ More This study presents the theoretical foundations of an analogous electromagnetically induced transparency (EIT) and absorption (EIA) which we are referring as coupling induced transparency (CIT) and absorption (CIA) respectively, along with an exploration of the transition between these phenomena. We provide a concise phenomenological description with analytical expressions for transmission spectra and dispersion elucidating how the interplay of coherent and dissipative interactions in a coupled system results in the emergence of level repulsion and attraction, corresponding to CIT and CIA, respectively. The model is validated through numerical simulations using a hybrid system comprising a split ring resonator (SRR) and electric inductive-capacitive (ELC) resonator in planar geometry. We analyse two cases while kee** ELC parameters constant; one involving a dynamic adjustment of the SRR size with a fixed split gap, and the other entailing a varying gap while maintaining a constant SRR size. Notably, in the first case, the dispersion profile of the transmission signal demonstrates level repulsion, while the second case results in level attraction, effectively showcasing CIT and CIA, respectively. These simulated findings not only align with the theoretical model but also underscore the versatility of our approach. Subsequently, we expand our model to a more general case, demonstrating that a controlled transition from CIT to CIA is achievable by manipulating the dissipation rate of individual modes within the hybrid system, leading to either coherent or dissipative interactions between the modes. The results provide a pathway for designing hybrid systems that can control the group velocity of light, offering potential applications in the fields of optical switching and quantum information technology. △ Less

Submitted 28 June, 2024; originally announced June 2024.

arXiv:2405.10354 [pdf, other]

Dynamical System Analysis for Scalar Field Potential in Teleparallel Gravity

Authors: S. A. Kadam, Ananya Sahu, S. K. Tripathy, B. Mishra

Abstract: In this paper, we have presented a power law cosmological model and its dynamical system analysis in $f(T,φ)$ gravity, where $T$ is the torsion scalar and $φ$ is the canonical scalar field. The two well-motivated forms of the non-minimal coupling function $F(φ)$, the exponential form and the power law form, with exponential potential function, are investigated. The dynamical system analysis is per… ▽ More In this paper, we have presented a power law cosmological model and its dynamical system analysis in $f(T,φ)$ gravity, where $T$ is the torsion scalar and $φ$ is the canonical scalar field. The two well-motivated forms of the non-minimal coupling function $F(φ)$, the exponential form and the power law form, with exponential potential function, are investigated. The dynamical system analysis is performed by establishing the dimensionless dynamical variables, and the critical points were obtained. The evolution of standard density parameters is analysed for each case. The behaviour of the equation of state (EoS) and deceleration parameter show agreement with the result of cosmological observations. The model parameters are constrained using the existence and the stability conditions of the critical points describing different epochs of the evolution of the Universe. △ Less

Submitted 16 May, 2024; originally announced May 2024.

Comments: 17 pages, 8 figures, constructive comments appreciated

arXiv:2405.05127 [pdf, other]

Evolution of Spin in the Intermediate Polar CC Sculptoris

Authors: John A. Paice, S. Scaringi, N. Castro Segura, A. Sahu, K. Ilkiewicz, Deanne L. Coppejans, D. De Martino, C. Knigge, M. Veresvarska

Abstract: We report on spin variations in the intermediate polar and cataclysmic variable CC Scl, as seen by the Transiting Exoplanet Survey Satellite (TESS). By studying both the spin period and its harmonic, we find that the spin has varied since it was first observed in 2011. We find the latest spin value for the source to be 389.473(6)s, equivalent to 0.00450779(7) days, 0.02s shorter than the first val… ▽ More We report on spin variations in the intermediate polar and cataclysmic variable CC Scl, as seen by the Transiting Exoplanet Survey Satellite (TESS). By studying both the spin period and its harmonic, we find that the spin has varied since it was first observed in 2011. We find the latest spin value for the source to be 389.473(6)s, equivalent to 0.00450779(7) days, 0.02s shorter than the first value measured. A linear fit to these and intermediate data give a rate of change of spin ~-4.26(2.66)e10^-11 and a characteristic timescale tau~2.90e10^5 years, in line with other known intermediate polars with varying spin. The spin profile of this source also matches theoretical spin profiles of high-inclination intermediate polars, and furthermore, appears to have changed in shape over a period of three years. Such `spin-up' in an intermediate polar is considered to be from mass accretion onto the white dwarf (the primary), and we note the presence of dwarf nova eruptions in this source as being a possible catalyst of the variations. △ Less

Submitted 8 May, 2024; originally announced May 2024.

Comments: 5 pages, 6 figures. Accepted into MNRAS Letters

arXiv:2405.02774 [pdf, other]

Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs

Authors: Feiyang Kang, Hoang Anh Just, Yifan Sun, Himanshu Jahagirdar, Yuanzhi Zhang, Rongxing Du, Anit Kumar Sahu, Ruoxi Jia

Abstract: This work focuses on leveraging and selecting from vast, unlabeled, open data to pre-fine-tune a pre-trained language model. The goal is to minimize the need for costly domain-specific data for subsequent fine-tuning while achieving desired performance levels. While many data selection algorithms have been designed for small-scale applications, rendering them unsuitable for our context, some emerg… ▽ More This work focuses on leveraging and selecting from vast, unlabeled, open data to pre-fine-tune a pre-trained language model. The goal is to minimize the need for costly domain-specific data for subsequent fine-tuning while achieving desired performance levels. While many data selection algorithms have been designed for small-scale applications, rendering them unsuitable for our context, some emerging methods do cater to language data scales. However, they often prioritize data that aligns with the target distribution. While this strategy may be effective when training a model from scratch, it can yield limited results when the model has already been pre-trained on a different distribution. Differing from prior work, our key idea is to select data that nudges the pre-training distribution closer to the target distribution. We show the optimality of this approach for fine-tuning tasks under certain conditions. We demonstrate the efficacy of our methodology across a diverse array of tasks (NLU, NLG, zero-shot) with models up to 2.7B, showing that it consistently surpasses other selection methods. Moreover, our proposed method is significantly faster than existing techniques, scaling to millions of samples within a single GPU hour. Our code is open-sourced (Code repository: https://anonymous.4open.science/r/DV4LLM-D761/ ). While fine-tuning offers significant potential for enhancing performance across diverse tasks, its associated costs often limit its widespread adoption; with this work, we hope to lay the groundwork for cost-effective fine-tuning, making its benefits more accessible. △ Less

Submitted 4 May, 2024; originally announced May 2024.

Comments: Published as a conference paper at ICLR 2024

arXiv:2404.15487 [pdf, other]

Minimum Consistent Subset in Trees and Interval Graphs

Authors: Aritra Banik, Sayani Das, Anil Maheshwari, Bubai Manna, Subhas C Nandy, Krishna Priya K M, Bodhayan Roy, Sasanka Roy, Abhishek Sahu

Abstract: In the Minimum Consistent Subset (MCS) problem, we are presented with a connected simple undirected graph $G=(V,E)$, consisting of a vertex set $V$ of size $n$ and an edge set $E$. Each vertex in $V$ is assigned a color from the set $\{1,2,\ldots, c\}$. The objective is to determine a subset $V' \subseteq V$ with minimum possible cardinality, such that for every vertex $v \in V$, at least one of i… ▽ More In the Minimum Consistent Subset (MCS) problem, we are presented with a connected simple undirected graph $G=(V,E)$, consisting of a vertex set $V$ of size $n$ and an edge set $E$. Each vertex in $V$ is assigned a color from the set $\{1,2,\ldots, c\}$. The objective is to determine a subset $V' \subseteq V$ with minimum possible cardinality, such that for every vertex $v \in V$, at least one of its nearest neighbors in $V'$ (measured in terms of the hop distance) shares the same color as $v$. The decision problem, indicating whether there exists a subset $V'$ of cardinality at most $l$ for some positive integer $l$, is known to be NP-complete even for planar graphs. In this paper, we establish that the MCS problem for trees, when the number of colors $c$ is considered an input parameter, is NP-complete. We propose a fixed-parameter tractable (FPT) algorithm for MCS on trees running in $O(2^{6c}n^6)$ time, significantly improving the currently best-known algorithm whose running time is $O(2^{4c}n^{2c+3})$. In an effort to comprehensively understand the computational complexity of the MCS problem across different graph classes, we extend our investigation to interval graphs. We show that it remains NP-complete for interval graphs, thus enriching graph classes where MCS remains intractable. △ Less

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.09464 [pdf]

Information Gain, Operator Spreading, and Sensitivity to Perturbations as Quantifiers of Chaos in Quantum Systems

Authors: Abinash Sahu

Abstract: We adopt a continuous weak measurement tomography protocol to explore the signatures of chaos in the quantum system(s). We generate the measurement record as a series of expectation values of an observable evolving under the desired dynamics, which can show a transition from integrability to chaos. We find that the rate of information gain depends on the degree of chaos in the dynamics, the choice… ▽ More We adopt a continuous weak measurement tomography protocol to explore the signatures of chaos in the quantum system(s). We generate the measurement record as a series of expectation values of an observable evolving under the desired dynamics, which can show a transition from integrability to chaos. We find that the rate of information gain depends on the degree of chaos in the dynamics, the choice of initial observable, and how well the operator is aligned along the density matrix. The amount of operator spreading in the Krylov subspace, as quantified by the fidelity in quantum tomography and various other metrics of information gain, increases with the degree of chaos in the system. We study operator spreading in many-body quantum systems by its potential to generate an informationally complete measurement record. Our quantifiers for operator spreading are more consistent indicators of quantum chaos than Krylov complexity. Our study gives an operational interpretation for operator spreading in terms of fidelity gain in quantum tomography. Continuing in our journey of finding the footprints of chaos in the quantum domain, we explore the growth of errors in noisy tomography. For random states, when the measurement record is obtained from a random operator, the subsequent drop in the fidelity obtained is inversely correlated to the degree of chaos in the dynamics. This gives us an operational interpretation of Loschmidt echo for operators by connecting it to the performance of quantum tomography. We find a quantity to capture the scrambling of errors, an out-of-time-ordered correlator (OTOC) between two operators under perturbed and unperturbed dynamics that serves as a signature of chaos. Our results demonstrate a fundamental link between Loschmidt echo and scrambling of errors, as captured by OTOCs, with operational consequences in quantum information processing. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: PhD Thesis

arXiv:2404.04623 [pdf, other]

An Automated Machine Learning Approach to Inkjet Printed Component Analysis: A Step Toward Smart Additive Manufacturing

Authors: Abhishek Sahu, Peter H. Aaen, Praveen Damacharla

Abstract: In this paper, we present a machine learning based architecture for microwave characterization of inkjet printed components on flexible substrates. Our proposed architecture uses several machine learning algorithms and automatically selects the best algorithm to extract the material parameters (ink conductivity and dielectric properties) from on-wafer measurements. Initially, the mutual dependence… ▽ More In this paper, we present a machine learning based architecture for microwave characterization of inkjet printed components on flexible substrates. Our proposed architecture uses several machine learning algorithms and automatically selects the best algorithm to extract the material parameters (ink conductivity and dielectric properties) from on-wafer measurements. Initially, the mutual dependence between material parameters of the inkjet printed coplanar waveguides (CPWs) and EM-simulated propagation constants is utilized to train the machine learning models. Next, these machine learning models along with measured propagation constants are used to extract the ink conductivity and dielectric properties of the test prototypes. To demonstrate the applicability of our proposed approach, we compare and contrast four heuristic based machine learning models. It is shown that eXtreme Gradient Boosted Trees Regressor (XGB) and Light Gradient Boosting (LGB) algorithms perform best for the characterization problem under study. △ Less

Submitted 6 April, 2024; originally announced April 2024.

Comments: 2024 IEEE Texas Symposium on Wireless & Micrwowave Circuits and Systems

arXiv:2403.07328 [pdf, other]

Satisfiability to Coverage in Presence of Fairness, Matroid, and Global Constraints

Authors: Tanmay Inamdar, Pallavi Jain, Daniel Lokshtanov, Abhishek Sahu, Saket Saurabh, Anannya Upasana

Abstract: In MaxSAT with Cardinality Constraint problem (CC-MaxSAT), we are given a CNF-formula $Φ$, and $k \ge 0$, and the goal is to find an assignment $β$ with at most $k$ variables set to true (also called a weight $k$-assignment) such that the number of clauses satisfied by $β$ is maximized. MaxCov can be seen as a special case of CC-MaxSAT, where the formula $Φ$ is monotone, i.e., does not contain any… ▽ More In MaxSAT with Cardinality Constraint problem (CC-MaxSAT), we are given a CNF-formula $Φ$, and $k \ge 0$, and the goal is to find an assignment $β$ with at most $k$ variables set to true (also called a weight $k$-assignment) such that the number of clauses satisfied by $β$ is maximized. MaxCov can be seen as a special case of CC-MaxSAT, where the formula $Φ$ is monotone, i.e., does not contain any negative literals. CC-MaxSAT and MaxCov are extremely well-studied problems in the approximation algorithms as well as parameterized complexity literature. Our first contribution is that the two problems are equivalent to each other in the context of FPT-Approximation parameterized by $k$ (approximation is in terms of number of clauses satisfied/elements covered). We give a randomized reduction from CC-MaxSAT to MaxCov in time $O(1/ε)^{k} \cdot (m+n)^{O(1)}$ that preserves the approximation guarantee up to a factor of $1-ε$. Furthermore, this reduction also works in the presence of fairness and matroid constraints. Armed with this reduction, we focus on designing FPT-Approximation schemes (FPT-ASes) for MaxCov and its generalizations. Our algorithms are based on a novel combination of a variety of ideas, including a carefully designed probability distribution that exploits sparse coverage functions. These algorithms substantially generalize the results in Jain et al. [SODA 2023] for CC-MaxSAT and MaxCov for $K_{d,d}$-free set systems (i.e., no $d$ sets share $d$ elements), as well as a recent FPT-AS for Matroid-Constrained MaxCov by Sellier [ESA 2023] for frequency-$d$ set systems. △ Less

Submitted 12 March, 2024; originally announced March 2024.

Comments: Abstract shortened due to arxiv restrictions

arXiv:2403.05904 [pdf, other]

CFD analysis of the influence of solvent viscosity ratio on the cree** flow of viscoelastic fluid over a channel-confined circular cylinder

Authors: Pratyush Kumar Mohanty, Akhilesh Kumar Sahu, Ram Prakash Bharti

Abstract: In this study, the role of solvent viscosity ratio ($β$) on the cree** flow characteristics of Oldroyd-B fluid over a channel-confined circular cylinder has been explored numerically. The hydrodynamic model equations have been solved by RheoTool, an open-source toolbox based on OpenFOAM, employing the finite volume method for extensive ranges of Deborah number ($De = 0.025-1.5$) and solvent visc… ▽ More In this study, the role of solvent viscosity ratio ($β$) on the cree** flow characteristics of Oldroyd-B fluid over a channel-confined circular cylinder has been explored numerically. The hydrodynamic model equations have been solved by RheoTool, an open-source toolbox based on OpenFOAM, employing the finite volume method for extensive ranges of Deborah number ($De = 0.025-1.5$) and solvent viscosity ratio ($β= 0.1-0.9$) for the fixed wall blockage ($B = 0.5$). The present investigation has undergone extensive validation, with available literature under specific limited conditions, before obtaining detailed results for the relevant flow phenomena such as streamline, pressure and stress contour profiles, pressure coefficient ($C_p$), wall shear stress ($τ_w$), normal stress ($τ_{xx}$), first normal stress difference ($N_{1}$), and drag coefficient ($C_{\text{D}}$).The flow profiles have exhibited a distinctive behavior characterized by a loss of symmetry in the presence of pronounced viscoelastic and polymeric effects. The results for low $De$ notably align closely with those for Newtonian fluids, and the drag coefficient ($C_D$) remains relatively constant regardless of $β$, as the viscoelastic influence is somewhat subdued. As $De$ increases, the influence of viscoelasticity becomes more pronounced, while a decrease in $β$ leads to an escalation in polymeric effects; an increase in the $C_D$ value is observed as $β$ increases. Within this parameter range, the prevailing force governing the flow is the pressure drag force. △ Less

Submitted 9 March, 2024; originally announced March 2024.

Comments: 36 pages, 13 figures

arXiv:2403.04265 [pdf, other]

Conflict and Fairness in Resource Allocation

Authors: Susobhan Bandopadhyay, Aritra Banik, Sushmita Gupta, Pallavi Jain, Abhishek Sahu, Saket Saurabh, Prafullkumar Tale

Abstract: In the standard model of fair allocation of resources to agents, every agent has some utility for every resource, and the goal is to assign resources to agents so that the agents' welfare is maximized. Motivated by job scheduling, interest in this problem dates back to the work of Deuermeyer et al. [SIAM J. on Algebraic Discrete Methods'82]. Recent works consider the compatibility between resource… ▽ More In the standard model of fair allocation of resources to agents, every agent has some utility for every resource, and the goal is to assign resources to agents so that the agents' welfare is maximized. Motivated by job scheduling, interest in this problem dates back to the work of Deuermeyer et al. [SIAM J. on Algebraic Discrete Methods'82]. Recent works consider the compatibility between resources and assign only mutually compatible resources to an agent. We study a fair allocation problem in which we are given a set of agents, a set of resources, a utility function for every agent over a set of resources, and a {\it conflict graph} on the set of resources (where an edge denotes incompatibility). The goal is to assign resources to the agents such that $(i)$ the set of resources allocated to an agent are compatible with each other, and $(ii)$ the minimum satisfaction of an agent is maximized, where the satisfaction of an agent is the sum of the utility of the assigned resources. Chiarelli et al. [Algorithmica'22] explore this problem from the classical complexity perspective to draw the boundary between the cases that are polynomial-time solvable and those that are \NP-hard. In this article, we study the parameterized complexity of the problem (and its variants) by considering several natural and structural parameters. △ Less

Submitted 7 March, 2024; originally announced March 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2309.04995

arXiv:2402.09539 [pdf, other]

Singular hypersurfaces and thin shells in cosmology

Authors: Abhisek Sahu

Abstract: We analyse spherically symmetric geometries, combining a cosmological patch and a Schwarzschild black hole patch joined via a singular co-dimension 1 hypersurface. In a general analysis applicable to dimensions greater than three, assuming an arbitrary homogeneous and isotropic cosmology, we derive the stress-energy tensor of the hypersurface in terms of the cosmological energy density. This analy… ▽ More We analyse spherically symmetric geometries, combining a cosmological patch and a Schwarzschild black hole patch joined via a singular co-dimension 1 hypersurface. In a general analysis applicable to dimensions greater than three, assuming an arbitrary homogeneous and isotropic cosmology, we derive the stress-energy tensor of the hypersurface in terms of the cosmological energy density. This analysis reveals a novel exact solution featuring radiation within the cosmology and a shell composed of pressureless dust. Exploring the parameter space yields twenty-two distinct solution families, including `bubble of cosmology' and `Swiss cheese' spacetimes. Notably, solutions with a negative cosmological constant exhibit a holographic dual. Additionally, we provide a pedagogical introduction to hypersurfaces in general relativity and a practical approach for constructing thin shell spacetimes. △ Less

Submitted 14 February, 2024; originally announced February 2024.

Comments: 26 pages, 32 figures

arXiv:2402.00553 [pdf, other]

Classifying optical (out)bursts in cataclysmic variables: the distinct observational characteristics of dwarf novae, micronovae, stellar flares and magnetic gating

Authors: Krystian Ilkiewicz, Simone Scaringi, Martina Veresvarska, Domitilla De Martino, Colin Littlefield, Christian Knigge, John A. Paice, Anwesha Sahu

Abstract: Cataclysmic variables can experience short optical brightenings, which are commonly attributed to phenomena such as dwarf novae outbursts, micronovae, donor flares or magnetic gating bursts. Since these events exhibit similar observational characteristics, their identification has often been ambiguous. In particular, magnetic gating bursts and micronovae have been suggested as alternative interpre… ▽ More Cataclysmic variables can experience short optical brightenings, which are commonly attributed to phenomena such as dwarf novae outbursts, micronovae, donor flares or magnetic gating bursts. Since these events exhibit similar observational characteristics, their identification has often been ambiguous. In particular, magnetic gating bursts and micronovae have been suggested as alternative interpretations of the same phenomena. Here we show that the timescales and energies separate the optical brightenings into separate clusters consistent with their different classifications. This suggest that micronovae and magnetic gating bursts are in fact separate phenomena. Based on our finding we develop diagnostic diagrams that can distinguish between these bursts/flares based on their properties. We demonstrate the effectiveness of this approach on observations of a newly identified intermediate polar, CTCV J0333-4451, which we classify as a magnetic gating system. CTCV J0333-4451 is the third high spin-to-orbital period ratio intermediate polar with magnetic gating, suggesting that these bursts are common among these rare systems. △ Less

Submitted 1 February, 2024; originally announced February 2024.

Comments: 8 pages, 4 figures, accepted to ApJL

arXiv:2401.11773 [pdf, other]

The fast transient AT 2023clx in the nearby LINER galaxy NGC 3799, as a tidal disruption event of a very low-mass star

Authors: P. Charalampopoulos, R. Kotak, T. Wevers, G. Leloudas, T. Kravtsov, P. Ramsden, T. M. Reynolds, A. Aamer, J. P. Anderson, I. Arcavi, Y. -Z. Cai, T. -W. Chen, M. Dennefeld, L. Galbany, M. Gromadzki, C. P. Gutiérrez, N. Ihanec, T. Kangas, E. Kankare, E. Kool, A. Lawrence, L. Makrygianni, S. Mattila, T. E. Müller-Bravo, M. Nicholl , et al. (7 additional authors not shown)

Abstract: We present an extensive analysis of the optical and UV properties of AT2023clx, the closest TDE to date, that occurred in the nucleus of the interacting LINER galaxy, NGC3799 (z=0.01107). From several standard methods, we estimate the mass of the central SMBH to be ~ 10^6 Msol. After correcting for the host reddening (E(B-V) = 0.177 mag) we measured its peak absolute g-band magnitude to be -18.25\… ▽ More We present an extensive analysis of the optical and UV properties of AT2023clx, the closest TDE to date, that occurred in the nucleus of the interacting LINER galaxy, NGC3799 (z=0.01107). From several standard methods, we estimate the mass of the central SMBH to be ~ 10^6 Msol. After correcting for the host reddening (E(B-V) = 0.177 mag) we measured its peak absolute g-band magnitude to be -18.25\pm0.05 mag, and its peak bolometric luminosity to be L_pk=(3.24\pm0.36)x10^43erg/s, making AT2023clx an intermediate luminosity TDE. The first distinctive feature of AT2023clx is that it rose to peak within only 10.4\pm2.5 days, making it the fastest rising TDE to date. Our SMBH mass estimate rules out the possibility of an intermediate mass BH as the reason of the fast rise. Dense spectral follow-up revealed a blue continuum that cools slowly and broad Balmer and HeII lines as well as weak HeI emission, features that are typically seen in TDEs. A flat Balmer decrement (~ 1.58) suggests that the lines are collisionally excited rather than being produced via photoionisation, as in typical active galactic nuclei. A second distinctive feature, seen for the first time in TDE spectra, is a sharp, narrow emission peak at a rest wavelength of ~6353 A. This feature is clearly visible up to 10d post-peak; we attribute it to clumpy material preceding the bulk outflow, and manifested as a high-velocity component of Ha (-9584km/s). The third distinctive feature is a break observed in the near-UV light curves that is reflected as a dip in the temperature evolution around ~18-28 days post-peak. Combining these findings, we propose a scenario for AT2023clx involving the disruption of a very low-mass star (<=0.1Msol) with an outflow launched in our line-of-sight with disruption properties that led to circularisation and prompt and efficient accretion disc formation, observed through a low-density photosphere. △ Less

Submitted 22 January, 2024; originally announced January 2024.

Comments: Submitted to A&A. Comments are welcome!

arXiv:2401.03415 [pdf, other]

A Polynomial Kernel for Proper Helly Circular-arc Vertex Deletion

Authors: Akanksha Agrawal, Satyabrata Jana, Abhishek Sahu

Abstract: A proper Helly circular-arc graph is an intersection graph of a set of arcs on a circle such that none of the arcs properly contains any other arc and every set of pairwise intersecting arcs has a common intersection. The Proper Helly Circular-arc Vertex Deletion problem takes as input a graph $G$ and an integer $k$, and the goal is to check if we can remove at most $k$ vertices from the graph to… ▽ More A proper Helly circular-arc graph is an intersection graph of a set of arcs on a circle such that none of the arcs properly contains any other arc and every set of pairwise intersecting arcs has a common intersection. The Proper Helly Circular-arc Vertex Deletion problem takes as input a graph $G$ and an integer $k$, and the goal is to check if we can remove at most $k$ vertices from the graph to obtain a proper Helly circular-arc graph; the parameter is $k$. Recently, Cao et al.~[MFCS 2023] obtained an FPT algorithm for this (and related) problem. In this work, we obtain a polynomial kernel for the problem. △ Less

Submitted 7 January, 2024; originally announced January 2024.

Comments: 25 pages, 3 figures, In LATIN 2024

arXiv:2311.10548 [pdf, other]

Efficient Profit Maximization in Reliability Concerned Static Vehicular Cloud System

Authors: Suvarthi Sarkar, Akshat Arun, Harshit Surekha, Aryabartta Sahu

Abstract: Modern electric VUs are equipped with a variety of increasingly potent computing, communication, and storage resources, and with this tremendous computation power in their arsenal can be used to enhance the computing power of regular cloud systems, which is termed as vehicular cloud. Unlike in the traditional cloud computing resources, these vehicular cloud resource moves around and participates i… ▽ More Modern electric VUs are equipped with a variety of increasingly potent computing, communication, and storage resources, and with this tremendous computation power in their arsenal can be used to enhance the computing power of regular cloud systems, which is termed as vehicular cloud. Unlike in the traditional cloud computing resources, these vehicular cloud resource moves around and participates in the vehicular cloud for a sporadic duration at parking places, shop** malls, etc. This introduces the dynamic nature of vehicular resource participation in the vehicular cloud. As the user-submitted task gets allocated on these vehicular units for execution and the dynamic stay nature of vehicular units, enforce the system to ensure the reliability of task execution by allocating multiple redundant vehicular units for the task. In this work, we are maximizing the profit of vehicular cloud by ensuring the reliability of task execution where user tasks come online manner with different revenue, execution, and deadline. We propose an efficient approach to solve this problem by considering (a) task classification based on the deadline and laxity of the task, (b) ordering of tasks for task admission based on the expected profit of the task, (c) classification of vehicular units based in expected residency time and reliability concerned redundant allocation of tasks of vehicular units considering this classification and (d) handing dynamic scenario of the vehicular unit leaving the cloud system by copying the maximum percentage of executed virtual machine of the task to the substitute unit. We compared our proposed profit maximization approach with the state of art approach and showed that our approach outperforms the state of art approach with an extra 10\% to 20\% profit margin. △ Less

Submitted 17 November, 2023; originally announced November 2023.

arXiv:2310.13681 [pdf, other]

Towards Realistic Mechanisms That Incentivize Federated Participation and Contribution

Authors: Marco Bornstein, Amrit Singh Bedi, Anit Kumar Sahu, Furqan Khan, Furong Huang

Abstract: Edge device participation in federating learning (FL) is typically studied through the lens of device-server communication (e.g., device dropout) and assumes an undying desire from edge devices to participate in FL. As a result, current FL frameworks are flawed when implemented in realistic settings, with many encountering the free-rider dilemma. In a step to push FL towards realistic settings, we… ▽ More Edge device participation in federating learning (FL) is typically studied through the lens of device-server communication (e.g., device dropout) and assumes an undying desire from edge devices to participate in FL. As a result, current FL frameworks are flawed when implemented in realistic settings, with many encountering the free-rider dilemma. In a step to push FL towards realistic settings, we propose RealFM: the first federated mechanism that (1) realistically models device utility, (2) incentivizes data contribution and device participation, (3) provably removes the free-rider dilemma, and (4) relaxes assumptions on data homogeneity and data sharing. Compared to previous FL mechanisms, RealFM allows for a non-linear relationship between model accuracy and utility, which improves the utility gained by the server and participating devices. On real-world data, RealFM improves device and server utility, as well as data contribution, by over 3 and 4 magnitudes respectively compared to baselines. △ Less

Submitted 22 May, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

Comments: 24 pages, 11 figures

arXiv:2308.08513 [pdf, other]

doi 10.1103/PhysRevB.108.224306

Quantifying operator spreading and chaos in Krylov subspaces with quantum state reconstruction

Authors: Abinash Sahu, Naga Dileep Varikuti, Bishal Kumar Das, Vaibhav Madhok

Abstract: We study operator spreading in many-body quantum systems by its potential to generate an informationally complete measurement record in quantum tomography. We adopt continuous weak measurement tomography for this purpose. We generate the measurement record as a series of expectation values of an observable evolving under the desired dynamics, which can show a transition from integrability to compl… ▽ More We study operator spreading in many-body quantum systems by its potential to generate an informationally complete measurement record in quantum tomography. We adopt continuous weak measurement tomography for this purpose. We generate the measurement record as a series of expectation values of an observable evolving under the desired dynamics, which can show a transition from integrability to complete chaos. We find that the amount of operator spreading, as quantified by the fidelity in quantum tomography, increases with the degree of chaos in the system. We also observe a remarkable increase in information gain when the dynamics transitions from integrable to nonintegrable. We find our approach in quantifying operator spreading is a more consistent indicator of quantum chaos than Krylov complexity as the latter may correlate/anti-correlate or show no explicit behavior with the level of chaos in the dynamics. We support our argument through various metrics of information gain for two models: the Ising spin chain with a tilted magnetic field and the Heisenberg XXZ spin chain with an integrability-breaking field. Our paper gives an operational interpretation for operator spreading in quantum chaos. △ Less

Submitted 11 December, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

Comments: 17 pages, 9 figures, updated Manuscript, accepted for publication in Phys. Rev. B

Journal ref: Phys. Rev. B 108, 224306 (2023)

arXiv:2308.02013 [pdf, other]

Federated Representation Learning for Automatic Speech Recognition

Authors: Guruprasad V Ramesh, Gopinath Chennupati, Milind Rao, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo

Abstract: Federated Learning (FL) is a privacy-preserving paradigm, allowing edge devices to learn collaboratively without sharing data. Edge devices like Alexa and Siri are prospective sources of unlabeled audio data that can be tapped to learn robust audio representations. In this work, we bring Self-supervised Learning (SSL) and FL together to learn representations for Automatic Speech Recognition respec… ▽ More Federated Learning (FL) is a privacy-preserving paradigm, allowing edge devices to learn collaboratively without sharing data. Edge devices like Alexa and Siri are prospective sources of unlabeled audio data that can be tapped to learn robust audio representations. In this work, we bring Self-supervised Learning (SSL) and FL together to learn representations for Automatic Speech Recognition respecting data privacy constraints. We use the speaker and chapter information in the unlabeled speech dataset, Libri-Light, to simulate non-IID speaker-siloed data distributions and pre-train an LSTM encoder with the Contrastive Predictive Coding framework with FedSGD. We show that the pre-trained ASR encoder in FL performs as well as a centrally pre-trained model and produces an improvement of 12-15% (WER) compared to no pre-training. We further adapt the federated pre-trained models to a new language, French, and show a 20% (WER) improvement over no pre-training. △ Less

Submitted 7 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

Comments: Accepted at ISCA SPSC Symposium 3rd Symposium on Security and Privacy in Speech Communication, 2023

arXiv:2307.02460 [pdf, other]

Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources

Authors: Feiyang Kang, Hoang Anh Just, Anit Kumar Sahu, Ruoxi Jia

Abstract: Traditionally, data selection has been studied in settings where all samples from prospective sources are fully revealed to a machine learning developer. However, in practical data exchange scenarios, data providers often reveal only a limited subset of samples before an acquisition decision is made. Recently, there have been efforts to fit scaling laws that predict model performance at any size a… ▽ More Traditionally, data selection has been studied in settings where all samples from prospective sources are fully revealed to a machine learning developer. However, in practical data exchange scenarios, data providers often reveal only a limited subset of samples before an acquisition decision is made. Recently, there have been efforts to fit scaling laws that predict model performance at any size and data source composition using the limited available samples. However, these scaling functions are black-box, computationally expensive to fit, highly susceptible to overfitting, or/and difficult to optimize for data selection. This paper proposes a framework called <projektor>, which predicts model performance and supports data selection decisions based on partial samples of prospective data sources. Our approach distinguishes itself from existing work by introducing a novel *two-stage* performance inference process. In the first stage, we leverage the Optimal Transport distance to predict the model's performance for any data mixture ratio within the range of disclosed data sizes. In the second stage, we extrapolate the performance to larger undisclosed data sizes based on a novel parameter-free map** technique inspired by neural scaling laws. We further derive an efficient gradient-based method to select data sources based on the projected model performance. Evaluation over a diverse range of applications demonstrates that <projektor> significantly improves existing performance scaling approaches in terms of both the accuracy of performance inference and the computation costs associated with constructing the performance predictor. Also, <projektor> outperforms by a wide margin in data selection effectiveness compared to a range of other off-the-shelf solutions. △ Less

Submitted 5 July, 2023; originally announced July 2023.

Comments: An extended abstract of this work appears in Data-centric Machine Learning Research (DMLR) Workshop at 40th International Conference on Machine Learning, Honolulu HI, USA. July 29, 2023

arXiv:2307.00142 [pdf, other]

BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark for Short-Term Load Forecasting

Authors: Patrick Emami, Abhijeet Sahu, Peter Graf

Abstract: Short-term forecasting of residential and commercial building energy consumption is widely used in power systems and continues to grow in importance. Data-driven short-term load forecasting (STLF), although promising, has suffered from a lack of open, large-scale datasets with high building diversity. This has hindered exploring the pretrain-then-fine-tune paradigm for STLF. To help address this,… ▽ More Short-term forecasting of residential and commercial building energy consumption is widely used in power systems and continues to grow in importance. Data-driven short-term load forecasting (STLF), although promising, has suffered from a lack of open, large-scale datasets with high building diversity. This has hindered exploring the pretrain-then-fine-tune paradigm for STLF. To help address this, we present BuildingsBench, which consists of: 1) Buildings-900K, a large-scale dataset of 900K simulated buildings representing the U.S. building stock; and 2) an evaluation platform with over 1,900 real residential and commercial buildings from 7 open datasets. BuildingsBench benchmarks two under-explored tasks: zero-shot STLF, where a pretrained model is evaluated on unseen buildings without fine-tuning, and transfer learning, where a pretrained model is fine-tuned on a target building. The main finding of our benchmark analysis is that synthetically pretrained models generalize surprisingly well to real commercial buildings. An exploration of the effect of increasing dataset size and diversity on zero-shot commercial building performance reveals a power-law with diminishing returns. We also show that fine-tuning pretrained models on real commercial and residential buildings improves performance for a majority of target buildings. We hope that BuildingsBench encourages and facilitates future research on generalizable STLF. All datasets and code can be accessed from https://github.com/NREL/BuildingsBench. △ Less

Submitted 10 January, 2024; v1 submitted 30 June, 2023; originally announced July 2023.

Comments: NeurIPS 2023 Datasets & Benchmarks Track camera-ready version. 35 pages. Code available at https://github.com/NREL/BuildingsBench/ and data available at https://data.openei.org/submissions/5859

arXiv:2306.15072 [pdf, other]

A Firewall Optimization for Threat-Resilient Micro-Segmentation in Power System Networks

Authors: Abhijeet Sahu, Patrick Wlazlo, Nastassja Gaudet, Ana Goulart, Edmond Rogers, Katherine Davis

Abstract: Electric power delivery relies on a communications backbone that must be secure. SCADA systems are essential to critical grid functions and include industrial control systems (ICS) protocols such as the Distributed Network Protocol-3 (DNP3). These protocols are vulnerable to cyber threats that power systems, as cyber-physical critical infrastructure, must be protected against. For this reason, the… ▽ More Electric power delivery relies on a communications backbone that must be secure. SCADA systems are essential to critical grid functions and include industrial control systems (ICS) protocols such as the Distributed Network Protocol-3 (DNP3). These protocols are vulnerable to cyber threats that power systems, as cyber-physical critical infrastructure, must be protected against. For this reason, the NERC Critical Infrastructure Protection standard CIP-005-5 specifies that an electronic system perimeter is needed, accomplished with firewalls. This paper presents how these electronic system perimeters can be optimally found and generated using a proposed meta-heuristic approach for optimal security zone formation for large-scale power systems. Then, to implement the optimal firewall rules in a large scale power system model, this work presents a prototype software tool that takes the optimization results and auto-configures the firewall nodes for different utilities in a cyber-physical testbed. Using this tool, firewall policies are configured for all the utilities and their substations within a synthetic 2000-bus model, assuming two different network topologies. Results generate the optimal electronic security perimeters to protect a power system's data flows and compare the number of firewalls, monetary cost, and risk alerts from path analysis. △ Less

Submitted 26 June, 2023; originally announced June 2023.

Comments: 12 pages, 22 figures

arXiv:2306.13143 [pdf, other]

Bubbles of cosmology in AdS/CFT

Authors: Abhisek Sahu, Petar Simidzija, Mark Van Raamsdonk

Abstract: Gravitational effective theories associated with holographic CFTs have cosmological solutions, which are typically big-bang / big-crunch cosmologies. These solutions are not asymptotically AdS, so they are not dual to finite-energy states of the CFT. However, we can find solutions with arbitrarily large spherical bubbles of such cosmologies embedded in asymptotically AdS spacetimes where the exter… ▽ More Gravitational effective theories associated with holographic CFTs have cosmological solutions, which are typically big-bang / big-crunch cosmologies. These solutions are not asymptotically AdS, so they are not dual to finite-energy states of the CFT. However, we can find solutions with arbitrarily large spherical bubbles of such cosmologies embedded in asymptotically AdS spacetimes where the exterior of the bubble is Schwarzschild-AdS. In this paper, we explore such solutions and their possible CFT dual descriptions. Starting with a cosmological solution with $Λ< 0$ plus arbitrary matter density, radiation density, and spatial curvature, we show that a comoving bubble of arbitrary size can be embedded in a geometry with AdS-Schwarzschild exterior across a thin-shell domain wall comprised of pressureless matter. We show that in most cases (in particular, for arbitrarily large bubbles with an arbitrarily small negative spatial curvature) the entropy of the black hole exceeds the (radiation) entropy in the cosmological bubble, suggesting that a faithful CFT description is possible. We show that unlike the case of a de Sitter bubble, the Euclidean continuation of these cosmological solutions is sensible and suggests a specific construction of CFT states dual to the cosmological solutions via Euclidean path integral. △ Less

Submitted 22 June, 2023; originally announced June 2023.

Comments: 30 pages, 10 figures

arXiv:2306.12015 [pdf, other]

Federated Self-Learning with Weak Supervision for Speech Recognition

Authors: Milind Rao, Gopinath Chennupati, Gautam Tiwari, Anit Kumar Sahu, Anirudh Raju, Ariya Rastrow, Jasha Droppo

Abstract: Automatic speech recognition (ASR) models with low-footprint are increasingly being deployed on edge devices for conversational agents, which enhances privacy. We study the problem of federated continual incremental learning for recurrent neural network-transducer (RNN-T) ASR models in the privacy-enhancing scheme of learning on-device, without access to ground truth human transcripts or machine t… ▽ More Automatic speech recognition (ASR) models with low-footprint are increasingly being deployed on edge devices for conversational agents, which enhances privacy. We study the problem of federated continual incremental learning for recurrent neural network-transducer (RNN-T) ASR models in the privacy-enhancing scheme of learning on-device, without access to ground truth human transcripts or machine transcriptions from a stronger ASR model. In particular, we study the performance of a self-learning based scheme, with a paired teacher model updated through an exponential moving average of ASR models. Further, we propose using possibly noisy weak-supervision signals such as feedback scores and natural language understanding semantics determined from user behavior across multiple turns in a session of interactions with the conversational agent. These signals are leveraged in a multi-task policy-gradient training approach to improve the performance of self-learning for ASR. Finally, we show how catastrophic forgetting can be mitigated by combining on-device learning with a memory-replay approach using selected historical datasets. These innovations allow for 10% relative improvement in WER on new use cases with minimal degradation on other test sets in the absence of strong-supervision signals such as ground-truth transcriptions. △ Less

Submitted 21 June, 2023; originally announced June 2023.

Comments: Proceedings of ICASSP 2023

arXiv:2306.12012 [pdf, other]

doi 10.21437/Interspeech.2023-2205

Learning When to Trust Which Teacher for Weakly Supervised ASR

Authors: Aakriti Agrawal, Milind Rao, Anit Kumar Sahu, Gopinath Chennupati, Andreas Stolcke

Abstract: Automatic speech recognition (ASR) training can utilize multiple experts as teacher models, each trained on a specific domain or accent. Teacher models may be opaque in nature since their architecture may be not be known or their training cadence is different from that of the student ASR model. Still, the student models are updated incrementally using the pseudo-labels generated independently by t… ▽ More Automatic speech recognition (ASR) training can utilize multiple experts as teacher models, each trained on a specific domain or accent. Teacher models may be opaque in nature since their architecture may be not be known or their training cadence is different from that of the student ASR model. Still, the student models are updated incrementally using the pseudo-labels generated independently by the expert teachers. In this paper, we exploit supervision from multiple domain experts in training student ASR models. This training strategy is especially useful in scenarios where few or no human transcriptions are available. To that end, we propose a Smart-Weighter mechanism that selects an appropriate expert based on the input audio, and then trains the student model in an unsupervised setting. We show the efficacy of our approach using LibriSpeech and LibriLight benchmarks and find an improvement of 4 to 25\% over baselines that uniformly weight all the experts, use a single expert model, or combine experts using ROVER. △ Less

Submitted 21 June, 2023; originally announced June 2023.

Comments: Proceedings of INTERSPEECH 2023

Journal ref: Proc. Interspeech, Aug. 2023, pp. 381-385

arXiv:2306.04209 [pdf, other]

doi 10.1103/PhysRevE.109.014209

Probing Dynamical Sensitivity of a Non-KAM System Through Out-of-Time-Order Correlators

Authors: Naga Dileep Varikuti, Abinash Sahu, Arul Lakshminarayan, Vaibhav Madhok

Abstract: Non-KAM (Kolmogorov-Arnold-Moser) systems, when perturbed by weak time-dependent fields, offer a fast route to classical chaos through an abrupt breaking of invariant phase space tori. In this work, we employ out-of-time-order correlators (OTOCs) to study the dynamical sensitivity of a perturbed non-KAM system in the quantum limit as the parameter that characterizes the $\textit{resonance}$ condit… ▽ More Non-KAM (Kolmogorov-Arnold-Moser) systems, when perturbed by weak time-dependent fields, offer a fast route to classical chaos through an abrupt breaking of invariant phase space tori. In this work, we employ out-of-time-order correlators (OTOCs) to study the dynamical sensitivity of a perturbed non-KAM system in the quantum limit as the parameter that characterizes the $\textit{resonance}$ condition is slowly varied. For this purpose, we consider a quantized kicked harmonic oscillator (KHO) model, which displays stochastic webs resembling Arnold's diffusion that facilitate large-scale diffusion in the phase space. Although the Lyapunov exponent of the KHO at resonances remains close to zero in the weak perturbative regime, making the system weakly chaotic in the conventional sense, the classical phase space undergoes significant structural changes. Motivated by this, we study the OTOCs when the system is in resonance and contrast the results with the non-resonant case. At resonances, we observe that the long-time dynamics of the OTOCs are sensitive to these structural changes, where they grow quadratically as opposed to linear or stagnant growth at non-resonances. On the other hand, our findings suggest that the short-time dynamics remain relatively more stable and show the exponential growth found in the literature for unstable fixed points. The numerical results are backed by analytical expressions derived for a few special cases. We will then extend our findings concerning the non-resonant cases to a broad class of near-integrable KAM systems. △ Less

Submitted 11 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: 17 pages, 6 figures. Close to the accepted version in Phys. Rev. E

Journal ref: Phys. Rev. E 109, 014209 (2024)

arXiv:2305.16861 [pdf, other]

Low and High Frequency Vibrations Synergistically Enhance Singlet Exciton Fission Through Robust Vibronic Resonances

Authors: Atandrita Bhattacharyya, Amitav Sahu, Sanjoy Patra, Vivek Tiwari

Abstract: Singlet exciton fission (SEF) is initiated by ultrafast internal conversion of a singlet exciton into a correlated triplet pair (TT)1. The `reaction coordinates' for ultrafast SEF even in archetypal systems such as pentacene thin film remain unclear with synthetic design principles broadly relying on tailoring electronic couplings to achieve new templates for efficient SEF materials. Spectroscopic… ▽ More Singlet exciton fission (SEF) is initiated by ultrafast internal conversion of a singlet exciton into a correlated triplet pair (TT)1. The `reaction coordinates' for ultrafast SEF even in archetypal systems such as pentacene thin film remain unclear with synthetic design principles broadly relying on tailoring electronic couplings to achieve new templates for efficient SEF materials. Spectroscopic detection of vibrational coherences in the (TT)1 photoproduct has motivated theoretical investigations into a possible role of vibronic resonance in driving SEF, akin to that reported in several photosynthetic proteins. However, a precise understanding of how prominent low-frequency vibrations and their modulation of intermolecular orbital overlaps, equally prominent high-frequency vibrations, and order of magnitude larger Huang-Rhys factors in SEF chromophores compared to photosynthetic pigments, collectively influence the mechanistic details of SEF remains starkly lacking. Here we address this gap and identify previously unrecognized effects which are quite contrasting from those known in photosynthesis excitons, and vitally enhance non-adiabatic internal conversion in SEF. Our findings have direct implications for the broad experimental interest in synthetically tailoring molecules to promote vibronically enhanced internal conversion. △ Less

Submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.16469 [pdf, other]

Bayesian Reinforcement Learning for Automatic Voltage Control under Cyber-Induced Uncertainty

Authors: Abhijeet Sahu, Katherine Davis

Abstract: Voltage control is crucial to large-scale power system reliable operation, as timely reactive power support can help prevent widespread outages. However, there is currently no built in mechanism for power systems to ensure that the voltage control objective to maintain reliable operation will survive or sustain the uncertainty caused under adversary presence. Hence, this work introduces a Bayesian… ▽ More Voltage control is crucial to large-scale power system reliable operation, as timely reactive power support can help prevent widespread outages. However, there is currently no built in mechanism for power systems to ensure that the voltage control objective to maintain reliable operation will survive or sustain the uncertainty caused under adversary presence. Hence, this work introduces a Bayesian Reinforcement Learning (BRL) approach for power system control problems, with focus on sustained voltage control under uncertainty in a cyber-adversarial environment. This work proposes a data-driven BRL-based approach for automatic voltage control by formulating and solving a Partially-Observable Markov Decision Problem (POMDP), where the states are partially observable due to cyber intrusions. The techniques are evaluated on the WSCC and IEEE 14 bus systems. Additionally, BRL techniques assist in automatically finding a threshold for exploration and exploitation in various RL techniques. △ Less

Submitted 25 May, 2023; originally announced May 2023.

Comments: 11 pages

arXiv:2304.11866 [pdf, other]

Some results on Continuous dependence of fractal functions on the Sierpiński gasket

Authors: Vishal Agrawal, Ajay Prajapati, Abhilash Sahu, Tanmoy Som

Abstract: In this article, we show that $α$-fractal functions defined on Sierpiński gasket (denoted by $\triangle$) depend continuously on the parameters involved in the construction. In the latter part of this article, the continuous dependence of parameters on $α$-fractal functions defined on $\triangle$ is shown graphically. In this article, we show that $α$-fractal functions defined on Sierpiński gasket (denoted by $\triangle$) depend continuously on the parameters involved in the construction. In the latter part of this article, the continuous dependence of parameters on $α$-fractal functions defined on $\triangle$ is shown graphically. △ Less

Submitted 24 April, 2023; originally announced April 2023.

Comments: 10 Pages, 16 figures

MSC Class: 28A80; 41A10

arXiv:2304.01586 [pdf]

Influence of Gold-Selenium Precursor Ratio on Synthesis and Structural Stability of α- and β-AuSe

Authors: Aditya Kumar Sahu, Satyabrata Raj

Abstract: Gold selenide (AuSe) is a multilayer compound yet to be thoroughly studied. The colloidal synthesis and characterization of gold selenide nanoparticles are described, emphasizing the effect of different gold-to-selenium precursor ratios and temperatures on the crystal structure and form. The structural characterization is done using an X-ray diffraction pattern. The coexistence of the α- and β-AuS… ▽ More Gold selenide (AuSe) is a multilayer compound yet to be thoroughly studied. The colloidal synthesis and characterization of gold selenide nanoparticles are described, emphasizing the effect of different gold-to-selenium precursor ratios and temperatures on the crystal structure and form. The structural characterization is done using an X-ray diffraction pattern. The coexistence of the α- and β-AuSe phases is observed in all synthesized samples. The morphologies of the mainly α-AuSe sample are nanobelts, whereas the primarily β-AuSe phase sample has a nanoplate-like structure, according to the TEM and SEM data. All of the samples had Raman vibrational modes with mixed phases. The effect of high pressure on as-prepared AuSe samples has been studied in this work. The introduction of external pressure and temperature allows both phases to transition. Pressure lowers the existence of other phase modes, and the corresponding dominating sample modes are entirely significant in our sample. The phase transition pressure was observed using Raman scattering. Our findings show that 2D AuSe has a lot of promise for multifunctional applications, encouraging more research on these systems. △ Less

Submitted 4 April, 2023; originally announced April 2023.

Comments: 18 pages, 9 figures

arXiv:2303.10866 [pdf, other]

An Improved Exact Algorithm for Knot-Free Vertex Deletion

Authors: Ajaykrishnan E S, Soumen Maity, Abhishek Sahu, Saket Saurabh

Abstract: A knot $K$ in a directed graph $D$ is a strongly connected component of size at least two such that there is no arc $(u,v)$ with $u \in V(K)$ and $v\notin V(K)$. Given a directed graph $D=(V,E)$, we study Knot-Free Vertex Deletion (KFVD), where the goal is to remove the minimum number of vertices such that the resulting graph contains no knots. This problem naturally emerges from its application i… ▽ More A knot $K$ in a directed graph $D$ is a strongly connected component of size at least two such that there is no arc $(u,v)$ with $u \in V(K)$ and $v\notin V(K)$. Given a directed graph $D=(V,E)$, we study Knot-Free Vertex Deletion (KFVD), where the goal is to remove the minimum number of vertices such that the resulting graph contains no knots. This problem naturally emerges from its application in deadlock resolution since knots are deadlocks in the OR-model of distributed computation. The fastest known exact algorithm in literature for KFVD runs in time $\mathcal{O}^\star(1.576^n)$. In this paper, we present an improved exact algorithm running in time $\mathcal{O}^\star(1.4549^n)$, where $n$ is the number of vertices in $D$. We also prove that the number of inclusion wise minimal knot-free vertex deletion sets is $\mathcal{O}^\star(1.4549^n)$ and construct a family of graphs with $Ω(1.4422^n)$ minimal knot-free vertex deletion sets △ Less

Submitted 20 March, 2023; originally announced March 2023.

arXiv:2303.10624 [pdf, other]

PFSL: Personalized & Fair Split Learning with Data & Label Privacy for thin clients

Authors: Manas Wadhwa, Gagan Raj Gupta, Ashutosh Sahu, Rahul Saini, Vidhi Mittal

Abstract: The traditional framework of federated learning (FL) requires each client to re-train their models in every iteration, making it infeasible for resource-constrained mobile devices to train deep-learning (DL) models. Split learning (SL) provides an alternative by using a centralized server to offload the computation of activations and gradients for a subset of the model but suffers from problems of… ▽ More The traditional framework of federated learning (FL) requires each client to re-train their models in every iteration, making it infeasible for resource-constrained mobile devices to train deep-learning (DL) models. Split learning (SL) provides an alternative by using a centralized server to offload the computation of activations and gradients for a subset of the model but suffers from problems of slow convergence and lower accuracy. In this paper, we implement PFSL, a new framework of distributed split learning where a large number of thin clients perform transfer learning in parallel, starting with a pre-trained DL model without sharing their data or labels with a central server. We implement a lightweight step of personalization of client models to provide high performance for their respective data distributions. Furthermore, we evaluate performance fairness amongst clients under a work fairness constraint for various scenarios of non-i.i.d. data distributions and unequal sample sizes. Our accuracy far exceeds that of current SL algorithms and is very close to that of centralized learning on several real-life benchmarks. It has a very low computation cost compared to FL variants and promises to deliver the full benefits of DL to extremely thin, resource-constrained clients. △ Less

Submitted 19 March, 2023; originally announced March 2023.

Comments: To be published in : THE 23RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON Cluster, Cloud and Internet Computing. Granted: Open Research Objects (ORO) and Research Objects Reviewed (ROR) badges. See https://www.niso.org/publications/rp-31-2021-badging for definitions of the badges. Code available at: https://github.com/mnswdhw/PFSL

arXiv:2302.00276 [pdf, other]

Coherence Transfer and Destructive Interference in Two-Dimensional Coherence Maps

Authors: Amitav Sahu, Vivek Tiwari

Abstract: Coherence maps (CMs) in multidimensional spectroscopy report total interference of all quantum coherent pathways. Detailed understanding of how this interference manifests spectroscopically is vital for deciphering mechanistic origins of impulsively generated wavepackets, but currently lacking. Here we explain the origin of recently reported diagonal node-like features in CMs of bacteriochlorophyl… ▽ More Coherence maps (CMs) in multidimensional spectroscopy report total interference of all quantum coherent pathways. Detailed understanding of how this interference manifests spectroscopically is vital for deciphering mechanistic origins of impulsively generated wavepackets, but currently lacking. Here we explain the origin of recently reported diagonal node-like features in CMs of bacteriochlorophyll monomers and photosynthetic reaction centers (RCs), where the apparent resemblance in the two disparate systems was reportedly perplexing. We show that both spectroscopic signatures have distinct physical origins. Node-like lineshapes in monomers arise from unique phase twists caused by destructive interference between ground and excited state vibrational coherences. In contrast, nodal lines in RCs are explained by coherence transfer of vibrational wavepackets which do not participate in the ultrafast energy transfer and their destructive interference with ground state pathways. Our results resolve recent spectroscopic observations and illustrate new mechanistic insights gained from understanding interference effects in multidimensional spectroscopy. △ Less

Submitted 1 February, 2023; originally announced February 2023.

arXiv:2211.16882 [pdf, other]

MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves

Authors: Pranjali Pathre, Anurag Sahu, Ashwin Rao, Avinash Prabhu, Meher Shashwat Nigam, Tanvi Karandikar, Harit Pandya, K. Madhava Krishna

Abstract: In this paper, we propose and showcase, for the first time, monocular multi-view layout estimation for warehouse racks and shelves. Unlike typical layout estimation methods, MVRackLay estimates multi-layered layouts, wherein each layer corresponds to the layout of a shelf within a rack. Given a sequence of images of a warehouse scene, a dual-headed Convolutional-LSTM architecture outputs segmented… ▽ More In this paper, we propose and showcase, for the first time, monocular multi-view layout estimation for warehouse racks and shelves. Unlike typical layout estimation methods, MVRackLay estimates multi-layered layouts, wherein each layer corresponds to the layout of a shelf within a rack. Given a sequence of images of a warehouse scene, a dual-headed Convolutional-LSTM architecture outputs segmented racks, the front and the top view layout of each shelf within a rack. With minimal effort, such an output is transformed into a 3D rendering of all racks, shelves and objects on the shelves, giving an accurate 3D depiction of the entire warehouse scene in terms of racks, shelves and the number of objects on each shelf. MVRackLay generalizes to a diverse set of warehouse scenes with varying number of objects on each shelf, number of shelves and in the presence of other such racks in the background. Further, MVRackLay shows superior performance vis-a-vis its single view counterpart, RackLay, in layout accuracy, quantized in terms of the mean IoU and mAP metrics. We also showcase a multi-view stitching of the 3D layouts resulting in a representation of the warehouse scene with respect to a global reference frame akin to a rendering of the scene from a SLAM pipeline. To the best of our knowledge, this is the first such work to portray a 3D rendering of a warehouse scene in terms of its semantic components - Racks, Shelves and Objects - all from a single monocular camera. △ Less

Submitted 30 November, 2022; originally announced November 2022.

Journal ref: IEEE International Conference on Robotics and Biomimetics (ROBIO) 2022

arXiv:2211.11221 [pdf, other]

Information scrambling and the growth of errors in noisy tomography -- a quantum signature of chaos

Authors: Abinash Sahu, Naga Dileep Varikuti, Vaibhav Madhok

Abstract: How does quantum chaos lead to rapid scrambling of information as well as errors across a system when one introduces perturbations in the dynamics? What are its consequences for the reliability of quantum simulations and quantum information processing? We employ continuous measurement quantum tomography as a paradigm to study these questions. The measurement record is generated as a sequence of ex… ▽ More How does quantum chaos lead to rapid scrambling of information as well as errors across a system when one introduces perturbations in the dynamics? What are its consequences for the reliability of quantum simulations and quantum information processing? We employ continuous measurement quantum tomography as a paradigm to study these questions. The measurement record is generated as a sequence of expectation values of a Hermitian observable evolving under repeated application of the Floquet map of the quantum kicked top. Interestingly, we find that the reconstruction fidelity initially increases regardless of the degree of chaos or the strength of perturbations in the dynamics. For random states, when the measurement record is obtained from a random initial observable, the subsequent drop in the fidelity obtained is inversely correlated to the degree of chaos in the dynamics. More importantly, this also gives us an operational interpretation of Loschmidt echo for operators by connecting it to the performance of quantum tomography. We define a quantity to capture the scrambling of errors, an out-of-time-ordered correlator (OTOC) between two operators under perturbed and unperturbed system dynamics that serves as a signature of chaos and quantifies the spread of errors. Our results demonstrate not only a fundamental link between Loschmidt echo and scrambling of errors, as captured by OTOCs, but that such a link can have operational consequences in quantum information processing. △ Less

Submitted 12 June, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

Comments: 10 pages, 4 figures, new title, minor changes in the abstract and the main text with better presentation

arXiv:2210.08239 [pdf, other]

High-sensitivity Fluorescence-Detected Multidimensional Electronic Spectroscopy Through Continuous Pump-probe Delay Scan

Authors: Amitav Sahu, Vivek N. Bhat, Sanjoy Patra, Vivek Tiwari

Abstract: Background-free fluorescence detection in multidimensional electronic spectroscopy promises high sensitivity compared to conventional approaches. Here we explore the sensitivity limits of multidimensional electronic spectroscopy. We present a fluorescence-detected multidimensional electronic spectrometer based on a visible white-light continuum. As a demonstration of sensitivity, we report room te… ▽ More Background-free fluorescence detection in multidimensional electronic spectroscopy promises high sensitivity compared to conventional approaches. Here we explore the sensitivity limits of multidimensional electronic spectroscopy. We present a fluorescence-detected multidimensional electronic spectrometer based on a visible white-light continuum. As a demonstration of sensitivity, we report room temperature two-dimensional coherence maps of vibrational quantum coherences in a laser dye at optical densities ~2-3 orders of magnitude lower than conventional approaches. This high sensitivity is enabled by a combination of biased sampling along the optical coherence time axes and a rapid scan of the waiting time T dimension at each time step. A combination of acousto-optic phase modulation and phase-sensitive lock-in detection enables simultaneous collection of rephasing and non-rephasing signals and measurements of room temperature vibrational wavepackets even at the lowest ODs. Alternative faster data collection schemes, enabled by the flexibility of continuous pump-probe scanning approach, are also demonstrated. △ Less

Submitted 15 October, 2022; originally announced October 2022.

arXiv:2209.08363 [pdf]

TBL-induced energy transmission into a double wall backed enclosure system computed in a cloud-based Python-FE environment

Authors: Biplab Ranjan Adhikary, Atanu Sahu, Partha Bhattacharya

Abstract: We propose a fully coupled numerical model to predict turbulent boundary layer (TBL) induced energy transmission behavior for a double-wall backed enclosure system in a finite element (FE) framework computed in cloud-based Python environment. Goody single point wall-pressure spectrum and Corcos spatial correlation function are used to generate the TBL cross-power spectra. Mindlins first order shea… ▽ More We propose a fully coupled numerical model to predict turbulent boundary layer (TBL) induced energy transmission behavior for a double-wall backed enclosure system in a finite element (FE) framework computed in cloud-based Python environment. Goody single point wall-pressure spectrum and Corcos spatial correlation function are used to generate the TBL cross-power spectra. Mindlins first order shear deformation model is considered for the panels and a fully coupled TBL-structure-acoustic model is developed using the FE approach to predict the acoustic power level inside the enclosure for variable gap distance between the panels. The model is developed in a way to capture the contribution of orthotropic lamina sequence, frequency-dependent structural dam**, and stiffening orientation in predicting the energy transmission into a double-wall backed enclosure. Thus, a new numerical model is presented that enables the designers with more precise energy transmission quantification with greater flexibility in terms of the number of panel leaves, geometry, and boundary conditions of the enclosure system, backed by double wall made of isotropic or orthotropic laminates. △ Less

Submitted 17 September, 2022; originally announced September 2022.

Comments: 9 pages, 2 figures. arXiv admin note: substantial text overlap with arXiv:2208.11155

arXiv:2209.07321 [pdf]

A coupled FE-BE approach for vibro-acoustic response prediction of laminated composite panels due to turbulent boundary layer excitation involving Cholesky decomposition

Authors: Biplab Ranjan Adhikary, Atanu Sahu, Partha Bhattacharya

Abstract: An original numerical framework is developed in the present research work in order to estimate the free field sound radiation from baffled structural panels subjected to turbulent boundary layer-induced excitation. A semi-analytical method is used to estimate the TBL wall pressure spectrum which is decomposed using Cholesky technique to obtain random wall pressure in the frequency domain. Structur… ▽ More An original numerical framework is developed in the present research work in order to estimate the free field sound radiation from baffled structural panels subjected to turbulent boundary layer-induced excitation. A semi-analytical method is used to estimate the TBL wall pressure spectrum which is decomposed using Cholesky technique to obtain random wall pressure in the frequency domain. Structural panels are modeled using the finite element technique and a coupled finite element boundary element modeling technique is developed to estimate the sound power level radiating into the free field. Results are obtained for laminated composite structural panels with various fiber orientations and significant findings are discussed. The developed technique has the potential to be further extended for complex structures in terms of geometry, material properties, and boundary conditions. The complete numerical toolbox, developed in an in-house MATLAB environment, enables the prediction of turbulent structure acoustic coupled behavior at an early design stage. △ Less

Submitted 15 September, 2022; originally announced September 2022.

Comments: 14 pages, 11 figures

arXiv:2208.13884 [pdf, other]

Toward a Mathematical Vulnerability Propagation and Defense Model in Smart Grid Networks

Authors: Abhijeet Sahu, Bin Mai, Katherine Davis, Ana Goulart

Abstract: For reducing threat propagation within an inter-connected network, it is essential to distribute the defense investment optimally. Most electric power utilities are resource constrained, yet how to account for costs while designing threat reduction techniques is not well understood. Hence, in this work, a vulnerability propagation and a defense model is proposed based on an epidemic model. The new… ▽ More For reducing threat propagation within an inter-connected network, it is essential to distribute the defense investment optimally. Most electric power utilities are resource constrained, yet how to account for costs while designing threat reduction techniques is not well understood. Hence, in this work, a vulnerability propagation and a defense model is proposed based on an epidemic model. The new defense mechanism is then validated through sensitivity of the propagation parameters on the optimal investment with two-node and N-node cases. Further, the model efficacy is evaluated with implementation in one of the communication networks of a cyber-physical power system. Topological impact on the optimal nodal investment is also emphasized. Optimal investment of the neighbors with less degree were found to be highly sensitive to fluctuation in vulnerability exploitability probability. △ Less

Submitted 29 August, 2022; originally announced August 2022.

Comments: 7 pages, 20 figures

arXiv:2208.11155 [pdf]

A coupled FE-RRM-based numerical model for analysis of energy transmission loss through stiffened double-wall panel due to TBL excitation

Authors: Biplab Ranjan Adhikary, Atanu Sahu, Partha Bhattacharya

Abstract: We propose a fully coupled numerical model to predict energy transmission through a turbulent boundary layer (TBL) excited stiffened double-leaf flexible aircraft panel using a finite element (FE) framework. Mindlin first order shear deformation model is adopted for the panels and a TBL-structure-acoustic coupling model is developed using finite element-radiation resistance matrix (FE-RRM) approac… ▽ More We propose a fully coupled numerical model to predict energy transmission through a turbulent boundary layer (TBL) excited stiffened double-leaf flexible aircraft panel using a finite element (FE) framework. Mindlin first order shear deformation model is adopted for the panels and a TBL-structure-acoustic coupling model is developed using finite element-radiation resistance matrix (FE-RRM) approach to predict the transmission loss (TL) through double-leaf panels with variable thickness and stiffener orientation. The model is also capable to capture the contribution of orthotropic lamina sequence and frequency-dependent structural dam** in predicting the TL. Thus, a new numerical model is proposed that enables the designers with greater flexibility in terms of the number of panel leaves, boundary, and stiffening condition of the aircraft panel-cavity-panel system, made of isotropic or orthotropic laminates. △ Less

Submitted 23 August, 2022; originally announced August 2022.

Comments: 9 pages, 2 figures

arXiv:2208.08669 [pdf]

Effect of confinement on flow around a rotating elliptic cylinder in laminar flow regime

Authors: Prateek Gupta, Sibasish Panda, Akhilesh Kumar Sahu, Deepak Kumar

Abstract: The flow phenomena around a rotating elliptic cylinder in a channel is studied numerically. The value of the confinement parameter βis varied as \frac{1}{k}, where k = 2, 4, 6, and 8 respectively, to demonstrate the vortex-shedding patterns around the cylinder in the downstream wake. The non-dimensional rotation rate αtakes up 0.5, 1, and 2 as its value. Additionally, the Reynolds number (\textit{… ▽ More The flow phenomena around a rotating elliptic cylinder in a channel is studied numerically. The value of the confinement parameter βis varied as \frac{1}{k}, where k = 2, 4, 6, and 8 respectively, to demonstrate the vortex-shedding patterns around the cylinder in the downstream wake. The non-dimensional rotation rate αtakes up 0.5, 1, and 2 as its value. Additionally, the Reynolds number (\textit{Re}) based on the cylinder diameter is taken to be 50, 100, and 150 respectively. A parametric study is performed to explain the changes in drag coefficient \textit{(C_{D})}, lift coefficient \textit{(C_{L})}, and moment coefficient \textit{(C_{M})} with variations of β, α, and \textit{Re}. The Fast-Fourier transform (FFT) of the time-periodic lift signals is presented to understand the shedding frequency characteristics, and the \textit{C_{M}} values are analyzed for cases of autorotation. Despite the introduction of significant confinement and cylinder rotation, complete suppression of vortex shedding is not observed for the considered parameter space. Autorotation is observed and becomes prominent with decrease in non-dimensional rotation rate and increase in confinement and Reynolds number. △ Less

Submitted 12 December, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

Comments: Prateek Gupta and Sibasish Panda have contributed equally to this work and are co-first authors. 25 pages, 15 figures, 5 tables

arXiv:2208.00796 [pdf, ps, other]

doi 10.1142/S1793042123500707

Expressing $q$-series in terms of building blocks of Hecke-type double-sums

Authors: Eric T. Mortenson, Ankit Sahu

Abstract: We express recent double-sums studied by Wang, Yee, and Liu in terms of two types of Hecke-type double-sum building blocks. When possible we determine the (mock) modularity. We also express a recent $q$-hypergeometric function of Andrews as a mixed mock modular form. We express recent double-sums studied by Wang, Yee, and Liu in terms of two types of Hecke-type double-sum building blocks. When possible we determine the (mock) modularity. We also express a recent $q$-hypergeometric function of Andrews as a mixed mock modular form. △ Less

Submitted 1 August, 2022; originally announced August 2022.

Journal ref: International Journal of Number Theory, 16, (2023), no. 6, 1429-1451

arXiv:2207.09078 [pdf, other]

doi 10.1145/3534678.3539174

ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale

Authors: Gopinath Chennupati, Milind Rao, Gurpreet Chadha, Aaron Eakin, Anirudh Raju, Gautam Tiwari, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo, Andy Oberlin, Buddha Nandanoor, Prahalad Venkataramanan, Zheng Wu, Pankaj Sitpure

Abstract: Incremental learning is one paradigm to enable model building and updating at scale with streaming data. For end-to-end automatic speech recognition (ASR) tasks, the absence of human annotated labels along with the need for privacy preserving policies for model building makes it a daunting challenge. Motivated by these challenges, in this paper we use a cloud based framework for production systems… ▽ More Incremental learning is one paradigm to enable model building and updating at scale with streaming data. For end-to-end automatic speech recognition (ASR) tasks, the absence of human annotated labels along with the need for privacy preserving policies for model building makes it a daunting challenge. Motivated by these challenges, in this paper we use a cloud based framework for production systems to demonstrate insights from privacy preserving incremental learning for automatic speech recognition (ILASR). By privacy preserving, we mean, usage of ephemeral data which are not human annotated. This system is a step forward for production levelASR models for incremental/continual learning that offers near real-time test-bed for experimentation in the cloud for end-to-end ASR, while adhering to privacy-preserving policies. We show that the proposed system can improve the production models significantly(3%) over a new time period of six months even in the absence of human annotated labels with varying levels of weak supervision and large batch sizes in incremental learning. This improvement is 20% over test sets with new words and phrases in the new time period. We demonstrate the effectiveness of model building in a privacy-preserving incremental fashion for ASR while further exploring the utility of having an effective teacher model and use of large batch sizes. △ Less

Submitted 22 July, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

Comments: 9 pages

arXiv:2206.10815 [pdf, other]

FedBC: Calibrating Global and Local Models via Federated Learning Beyond Consensus

Authors: Amrit Singh Bedi, Chen Fan, Alec Koppel, Anit Kumar Sahu, Brian M. Sadler, Furong Huang, Dinesh Manocha

Abstract: In this work, we quantitatively calibrate the performance of global and local models in federated learning through a multi-criterion optimization-based framework, which we cast as a constrained program. The objective of a device is its local objective, which it seeks to minimize while satisfying nonlinear constraints that quantify the proximity between the local and the global model. By considerin… ▽ More In this work, we quantitatively calibrate the performance of global and local models in federated learning through a multi-criterion optimization-based framework, which we cast as a constrained program. The objective of a device is its local objective, which it seeks to minimize while satisfying nonlinear constraints that quantify the proximity between the local and the global model. By considering the Lagrangian relaxation of this problem, we develop a novel primal-dual method called Federated Learning Beyond Consensus (\texttt{FedBC}). Theoretically, we establish that \texttt{FedBC} converges to a first-order stationary point at rates that matches the state of the art, up to an additional error term that depends on a tolerance parameter introduced to scalarize the multi-criterion formulation. Finally, we demonstrate that \texttt{FedBC} balances the global and local model test accuracy metrics across a suite of datasets (Synthetic, MNIST, CIFAR-10, Shakespeare), achieving competitive performance with state-of-the-art. △ Less

Submitted 1 February, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

arXiv:2204.08069 [pdf, other]

Self-Aware Personalized Federated Learning

Authors: Huili Chen, Jie Ding, Eric Tramel, Shuang Wu, Anit Kumar Sahu, Salman Avestimehr, Tao Zhang

Abstract: In the context of personalized federated learning (FL), the critical challenge is to balance local model improvement and global model tuning when the personal and global objectives may not be exactly aligned. Inspired by Bayesian hierarchical models, we develop a self-aware personalized FL method where each client can automatically balance the training of its local personal model and the global mo… ▽ More In the context of personalized federated learning (FL), the critical challenge is to balance local model improvement and global model tuning when the personal and global objectives may not be exactly aligned. Inspired by Bayesian hierarchical models, we develop a self-aware personalized FL method where each client can automatically balance the training of its local personal model and the global model that implicitly contributes to other clients' training. Such a balance is derived from the inter-client and intra-client uncertainty quantification. A larger inter-client variation implies more personalization is needed. Correspondingly, our method uses uncertainty-driven local training steps and aggregation rule instead of conventional local fine-tuning and sample size-based aggregation. With experimental studies on synthetic data, Amazon Alexa audio data, and public datasets such as MNIST, FEMNIST, CIFAR10, and Sent140, we show that our proposed method can achieve significantly improved personalization performance compared with the existing counterparts. △ Less

Submitted 17 April, 2022; originally announced April 2022.

arXiv:2204.07298 [pdf, ps, other]

Cartan Connection for h-Matsumoto change

Authors: M. K. Gupta, Abha Sahu, Suman Sharma

Abstract: In the present paper, we have studied the Matsumoto change $\overline{L}(x,y)= \frac{L^{2}(x,y)}{L(x,y) - β(x,y)} $ with an \textsl{h-}vector $b_{i}(x,y)$. We have derived some fundamental tensors for this transformation. We have also obtained the necessary and sufficient condition for which the Cartan connection coefficients for both the spaces $F^{n}=(M^n,L)$ and… ▽ More In the present paper, we have studied the Matsumoto change $\overline{L}(x,y)= \frac{L^{2}(x,y)}{L(x,y) - β(x,y)} $ with an \textsl{h-}vector $b_{i}(x,y)$. We have derived some fundamental tensors for this transformation. We have also obtained the necessary and sufficient condition for which the Cartan connection coefficients for both the spaces $F^{n}=(M^n,L)$ and $\overline{F}^{\,n}=(M^{n},\overline{L})$ are same. △ Less

Submitted 14 April, 2022; originally announced April 2022.

arXiv:2204.02593 [pdf, other]

Nonlinear gradient map**s and stochastic optimization: A general framework with applications to heavy-tail noise

Authors: Dusan Jakovetic, Dragana Bajovic, Anit Kumar Sahu, Soummya Kar, Nemanja Milosevic, Dusan Stamenkovic

Abstract: We introduce a general framework for nonlinear stochastic gradient descent (SGD) for the scenarios when gradient noise exhibits heavy tails. The proposed framework subsumes several popular nonlinearity choices, like clipped, normalized, signed or quantized gradient, but we also consider novel nonlinearity choices. We establish for the considered class of methods strong convergence guarantees assum… ▽ More We introduce a general framework for nonlinear stochastic gradient descent (SGD) for the scenarios when gradient noise exhibits heavy tails. The proposed framework subsumes several popular nonlinearity choices, like clipped, normalized, signed or quantized gradient, but we also consider novel nonlinearity choices. We establish for the considered class of methods strong convergence guarantees assuming a strongly convex cost function with Lipschitz continuous gradients under very general assumptions on the gradient noise. Most notably, we show that, for a nonlinearity with bounded outputs and for the gradient noise that may not have finite moments of order greater than one, the nonlinear SGD's mean squared error (MSE), or equivalently, the expected cost function's optimality gap, converges to zero at rate~$O(1/t^ζ)$, $ζ\in (0,1)$. In contrast, for the same noise setting, the linear SGD generates a sequence with unbounded variances. Furthermore, for the nonlinearities that can be decoupled component wise, like, e.g., sign gradient or component-wise clip**, we show that the nonlinear SGD asymptotically (locally) achieves a $O(1/t)$ rate in the weak convergence sense and explicitly quantify the corresponding asymptotic variance. Experiments show that, while our framework is more general than existing studies of SGD under heavy-tail noise, several easy-to-implement nonlinearities from our framework are competitive with state of the art alternatives on real data sets with heavy tail noises. △ Less

Submitted 6 April, 2022; originally announced April 2022.

Comments: Submitted for publication Nov 2021

arXiv:2204.00760 [pdf, ps, other]

The Isoperimetric Problem In Randers Planes

Authors: Arti Sahu, Ranadip Gangopadhyay, Hemangi Madhusudan Shah, Bankteshwar Tiwari

Abstract: In this paper, the isoperimetric problem in Randers planes, $(\mathbb{R}^2,F=α+β)$, which are slight deformation of the Euclidean plane $(\mathbb{R}^2,α)$ by suitable one forms $β$, have been studied. We prove that the circles centred at the origin achieves the local maximum area of the isoperimetric problem with respect to well known volume forms in Finsler geometry. In this paper, the isoperimetric problem in Randers planes, $(\mathbb{R}^2,F=α+β)$, which are slight deformation of the Euclidean plane $(\mathbb{R}^2,α)$ by suitable one forms $β$, have been studied. We prove that the circles centred at the origin achieves the local maximum area of the isoperimetric problem with respect to well known volume forms in Finsler geometry. △ Less

Submitted 2 April, 2022; originally announced April 2022.

MSC Class: 53B40; 52B60

arXiv:2203.07692 [pdf, other]

doi 10.1103/PhysRevE.106.024209

Effect of chaos on information gain in quantum tomography

Authors: Abinash Sahu, Sreeram PG, Vaibhav Madhok

Abstract: Does chaos in the dynamics enable information gain in quantum tomography or impede it? We address this question by considering continuous measurement tomography in which the measurement record is obtained as a sequence of expectation values of a Hermitian observable evolving under the repeated application of the Floquet map of the quantum kicked top. For a given dynamics and Hermitian observables,… ▽ More Does chaos in the dynamics enable information gain in quantum tomography or impede it? We address this question by considering continuous measurement tomography in which the measurement record is obtained as a sequence of expectation values of a Hermitian observable evolving under the repeated application of the Floquet map of the quantum kicked top. For a given dynamics and Hermitian observables, we observe completely opposite behavior in the tomography of well-localized spin coherent states compared to random states. As the chaos in the dynamics increases, the reconstruction fidelity of spin coherent states decreases. This contrasts with the previous results connecting information gain in tomography of random states with the degree of chaos in the dynamics that drives the system. The rate of information gain and hence the fidelity obtained in tomography depends not only on the degree of chaos in the dynamics and to what extent it causes the initial observable to spread in various directions of the operator space but, more importantly, how well these directions are aligned with the density matrix to be estimated. Our study also gives an operational interpretation for operator spreading in terms of fidelity gain in an actual quantum information tomography protocol. △ Less

Submitted 27 August, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: 11 pages, 6 figures, published version with modified title

Journal ref: Phys. Rev. E 106, 024209 (2022)

arXiv:2202.00807 [pdf, other]

Federated Learning Challenges and Opportunities: An Outlook

Authors: Jie Ding, Eric Tramel, Anit Kumar Sahu, Shuang Wu, Salman Avestimehr, Tao Zhang

Abstract: Federated learning (FL) has been developed as a promising framework to leverage the resources of edge devices, enhance customers' privacy, comply with regulations, and reduce development costs. Although many methods and applications have been developed for FL, several critical challenges for practical FL systems remain unaddressed. This paper provides an outlook on FL development, categorized into… ▽ More Federated learning (FL) has been developed as a promising framework to leverage the resources of edge devices, enhance customers' privacy, comply with regulations, and reduce development costs. Although many methods and applications have been developed for FL, several critical challenges for practical FL systems remain unaddressed. This paper provides an outlook on FL development, categorized into five emerging directions of FL, namely algorithm foundation, personalization, hardware and security constraints, lifelong learning, and nonstandard data. Our unique perspectives are backed by practical observations from large-scale federated systems for edge devices. △ Less

Submitted 1 February, 2022; originally announced February 2022.

Comments: This paper provides an outlook on FL development as part of the ICASSP 2022 special session entitled "Frontiers of Federated Learning: Applications, Challenges, and Opportunities"

arXiv:2201.03789 [pdf, other]

Partial Model Averaging in Federated Learning: Performance Guarantees and Benefits

Authors: Sunwoo Lee, Anit Kumar Sahu, Chaoyang He, Salman Avestimehr

Abstract: Local Stochastic Gradient Descent (SGD) with periodic model averaging (FedAvg) is a foundational algorithm in Federated Learning. The algorithm independently runs SGD on multiple workers and periodically averages the model across all the workers. When local SGD runs with many workers, however, the periodic averaging causes a significant model discrepancy across the workers making the global loss c… ▽ More Local Stochastic Gradient Descent (SGD) with periodic model averaging (FedAvg) is a foundational algorithm in Federated Learning. The algorithm independently runs SGD on multiple workers and periodically averages the model across all the workers. When local SGD runs with many workers, however, the periodic averaging causes a significant model discrepancy across the workers making the global loss converge slowly. While recent advanced optimization methods tackle the issue focused on non-IID settings, there still exists the model discrepancy issue due to the underlying periodic model averaging. We propose a partial model averaging framework that mitigates the model discrepancy issue in Federated Learning. The partial averaging encourages the local models to stay close to each other on parameter space, and it enables to more effectively minimize the global loss. Given a fixed number of iterations and a large number of workers (128), the partial averaging achieves up to 2.2% higher validation accuracy than the periodic full averaging. △ Less

Submitted 11 January, 2022; originally announced January 2022.

Showing 1–50 of 118 results for author: Sahu, A