Search | arXiv e-print repository

Linear Algebra and Galois Theory

Abstract: In \cite{GQ2008} R. Gow and R. Quinlan have cast a new look on the endomorphism algebra of a $K$-vector space $V$ of dimension $n$ assuming that $K$ has a Galois extension $L$ of degree $n$. In this approach the $K$-space $L$ may serve as a model for $V$ and Galois-theoretic ideas and results may be applied to elucidate the structure of endomorphisms and other important objects of linear algebra. In particular, this leads to the clarification of the structure of a rank-one endomorphism, the trace of an endomorphism, criteria for linear independence, etc. We present an exposition of these results using the language of tensor algebra wherever possible to provide shorter and more conceptual proofs.

Submitted 28 May, 2024; originally announced May 2024.

Comments: 7 pages

MSC Class: 12F10; 15A03; 15A04

arXiv:2405.10490 [pdf]

doi 10.1145/3637528.3671591

Neural Optimization with Adaptive Heuristics for Intelligent Marketing System

Authors: Changshuai Wei, Benjamin Zelditch, Joyce Chen, Andre Assuncao Silva T Ribeiro, Jingyi Kenneth Tay, Borja Ocejo Elizondo, Keerthi Selvaraj, Aman Gupta, Licurgo Benemann De Almeida

Abstract: Computational marketing has become increasingly important in today's digital world, facing challenges such as massive heterogeneous data, multi-channel customer journeys, and limited marketing budgets. In this paper, we propose a general framework for marketing AI systems, the Neural Optimization with Adaptive Heuristics (NOAH) framework. NOAH is the first general framework for marketing optimization that considers both to-business (2B) and to-consumer (2C) products, as well as both owned and paid channels. We describe key modules of the NOAH framework, including prediction, optimization, and adaptive heuristics, providing examples for bidding and content optimization. We then detail the successful application of NOAH to LinkedIn's email marketing system, showcasing significant wins over the legacy ranking system. Additionally, we share details and insights that are broadly useful, particularly on: (i) addressing delayed feedback with lifetime value, (ii) performing large-scale linear programming with randomization, (iii) improving retrieval with audience expansion, (iv) reducing signal dilution in targeting tests, and (v) handling zero-inflated heavy-tail metrics in statistical testing.

Submitted 25 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: KDD 2024

ACM Class: G.3; G.1.6; I.2

arXiv:2401.13754 [pdf, other]

Multi-Function Multi-Way Analog Technology for Sustainable Machine Intelligence Computation

Authors: Vassilis Kalantzis, Mark S. Squillante, Shashanka Ubaru, Tayfun Gokmen, Chai Wah Wu, Anshul Gupta, Haim Avron, Tomasz Nowicki, Malte Rasch, Murat Onen, Vanessa Lopez Marrero, Effendi Leobandung, Yasuteru Kohda, Wilfried Haensch, Lior Horesh

Abstract: Numerical computation is essential to many areas of artificial intelligence (AI), whose computing demands continue to grow dramatically, yet their continued scaling is jeopardized by the slowdown in Moore's law. Multi-function multi-way analog (MFMWA) technology, a computing architecture comprising arrays of memristors supporting in-memory computation of matrix operations, can offer tremendous improvements in computation and energy, but at the expense of inherent unpredictability and noise. We devise novel randomized algorithms tailored to MFMWA architectures that mitigate the detrimental impact of imperfect analog computations while realizing their potential benefits across various areas of AI, such as applications in computer vision. Through analysis, measurements from analog devices, and simulations of larger systems, we demonstrate orders of magnitude reduction in both computation and energy with accuracy similar to digital computers.

Submitted 24 January, 2024; originally announced January 2024.

MSC Class: 65F10; C3; G1

ACM Class: G.1.3

arXiv:2401.12332 [pdf, other]

A Precise Characterization of SGD Stability Using Loss Surface Geometry

Authors: Gregory Dexter, Borja Ocejo, Sathiya Keerthi, Aman Gupta, Ayan Acharya, Rajiv Khanna

Abstract: Stochastic Gradient Descent (SGD) stands as a cornerstone optimization algorithm with proven real-world empirical successes but relatively limited theoretical understanding. Recent research has illuminated a key factor contributing to its practical efficacy: the implicit regularization it instigates. Several studies have investigated the linear stability property of SGD in the vicinity of a stationary point as a predictive proxy for sharpness and generalization error in overparameterized neural networks (Wu et al., 2022; Jastrzebski et al., 2019; Cohen et al., 2021). In this paper, we delve deeper into the relationship between linear stability and sharpness. More specifically, we meticulously delineate the necessary and sufficient conditions for linear stability, contingent on hyperparameters of SGD and the sharpness at the optimum. Towards this end, we introduce a novel coherence measure of the loss Hessian that encapsulates pertinent geometric properties of the loss function that are relevant to the linear stability of SGD. It enables us to provide a simplified sufficient condition for identifying linear instability at an optimum. Notably, compared to previous works, our analysis relies on significantly milder assumptions and is applicable for a broader class of loss functions than known before, encompassing not only mean-squared error but also cross-entropy loss.

Submitted 22 January, 2024; originally announced January 2024.

Comments: To appear at ICLR 2024

arXiv:2401.09364 [pdf, other]

Anticipating Tipping Points for Disordered Traffic: Critical Slowing Down on the Onset of Congestion

Authors: Shankha Narayan Chattopadhyay, Arvind Kumar Gupta

Abstract: Regime shifts are quite common in complex systems like cell regulations, disease transmissions, ecosystems, marine ice instability, etc. Several statistical indicators known as early warning signals (EWS) have been theorized to anticipate these abrupt transitions in advance. These regime shifts happen when a parameter that influences the overall dynamics crosses some critical value. This critical threshold is known as the tipping point. In the vicinity of a tipping point, perturbations gradually increase; as a consequence, the system state swings extensively around the quasi-static attractor and the local dynamics become progressively slower, which is known as critical slowing down (CSD). Because of this CSD, statistical measures such as variance and lag-1 autocorrelation, which serve as EWSs, increase. From the point of view of physics, a free flow can become congested when the mean car density crosses its tipping point. Recently, for a lane-based traffic system described by a continuum model, a study revealed that analysis of the generic EWSs serves as a good measure to predict upstream stop-and-go traffic jams. Here, we introduce EWSs to anticipate traffic jams for heterogeneous disordered traffic relevant to non-lane-based systems. We analyze a lattice hydrodynamic area occupancy model with passing and, through numerical simulations, show the emergence of kink or chaotic jams. We also provide a proper framework for the prediction of traffic jams via different EWSs. From simulated data, we demonstrate that EWSs are sensitive as tipping is approached.

Submitted 17 January, 2024; originally announced January 2024.

Comments: 12 pages, 4 figures

arXiv:2401.06020 [pdf, other]

Dynamic Capital Requirements for Markov Decision Processes

Authors: William B. Haskell, Abhishek Gupta, Shiping Shao

Abstract: We build on the theory of capital requirements (CRs) to create a new framework for modeling dynamic risk preferences. The key question is how to evaluate the risk of a payoff stream sequentially as new information is revealed. In our model, we associate each payoff stream with a disbursement strategy and a premium schedule to form a triple of stochastic processes. We characterize risk preferences in terms of a single set that we call the risk frontier which characterizes acceptable triples. We then propose the generalized capital requirement (GCR) which evaluates the risk of a payoff stream by minimizing the premium schedule over acceptable triples. We apply this model to a risk-aware decision maker (DM) who controls a Markov decision process (MDP) and wants to find a policy to minimize the GCR of its payoff stream. The resulting GCR-MDP recovers many well-known risk-aware MDPs as special cases. To make this approach computationally viable, we obtain the temporal decomposition of the GCR in terms of the risk frontier. Then, we connect the temporal decomposition with the notion of an information state to compactly capture the dependence of DM's risk preferences on the problem history, where augmented dynamic programming can be used to compute an optimal policy. We report numerical experiments for the GCR-minimizing newsvendor.

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2312.11869 [pdf, other]

Pinned Billiard Balls Simulation (WXML Autumn 2023 report)

Authors: Bella Cochran, Anish Gupta, Colin Holzman-Klima, Wanchaloem Wunkaew, Krzysztof Burdzy, Raghavendra Tripathi

Abstract: Systems of pinned billiard balls serve as simplified models of collisions, where all particles remain fixed in their positions while their (pseudo-)velocities evolve in accordance with the laws of conservation of energy and momentum. For some families of ball configurations, Athreya, Burdzy, and Duarte have established the maximum upper bound for the number of pseudo-collisions, thereby demonstrating that the number of collisions is finite. The result has been extended to all ball configurations. In this project, we do extensive simulations to study two specific configurations. First, we consider balls arranged in a half-space and assign a single ball an inward (pseudo-)velocity. Simulations suggest that in the long run, most of the energy is concentrated near the boundary. Second, when the balls are arranged on a flat torus, we find that in the stationary regime, the distributions of the velocity components are i.i.d. normal. Additionally, we find that the components of the velocities in the direction of impact between two touching balls are uncorrelated.

Submitted 19 December, 2023; originally announced December 2023.

MSC Class: 60K35

arXiv:2311.00836 [pdf, ps, other]

Effective filtering approach for joint parameter-state estimation in SDEs via Rao-Blackwellization and modularization

Authors: Zhou Fang, Ankit Gupta, Mustafa Khammash

Abstract: Stochastic filtering is a vibrant area of research in both control theory and statistics, with broad applications in many scientific fields. Despite its extensive historical development, an effective method for joint parameter-state estimation in SDEs is still lacking. The state-of-the-art particle filtering methods suffer from either sample degeneracy or information loss, with both issues stemming from the dynamics of the particles generated to represent system parameters. This paper provides a novel and effective approach for joint parameter-state estimation in SDEs via Rao-Blackwellization and modularization. Our method operates in two layers: the first layer estimates the system states using a bootstrap particle filter, and the second layer marginalizes out system parameters explicitly. This strategy circumvents the need to generate particles representing system parameters, thereby mitigating their associated problems of sample degeneracy and information loss. Moreover, our method employs a modularization approach when integrating out the parameters, which significantly reduces the computational complexity. All these designs ensure the superior performance of our method. Finally, a numerical example is presented to illustrate that our method outperforms existing approaches by a large margin.

Submitted 1 November, 2023; originally announced November 2023.

Comments: 8 pages, 2 figures

MSC Class: 62M20; 62F15; 65C05; 92-08; 93E11

arXiv:2310.15682 [pdf, ps, other]

Tensor product of irreducible characters of $\mathrm{GL}_2(\mathbb{F}_q)$

Authors: Archita Gupta, M Hassain

Abstract: We decompose the tensor product of two irreducible representations of $\mathrm{GL}_2(\mathbb{F}_q)$ for odd $q$ and classify the pairs such that their tensor product is multiplicity free. We also classify the pairs such that their tensor product has unique decomposition property. We additionally characterize the self-dual irreducible representations of $\mathrm{GL}_2(\mathbb{F}_q).$

Submitted 24 October, 2023; originally announced October 2023.

Comments: 8 pages

MSC Class: 20C15; 20G05; 20G40

arXiv:2310.12176 [pdf, ps, other]

Common Fixed Point Theorems with Generalized TAC-Contraction on Partial b-Metric Space

Authors: Anuradha Gupta, Rahul Mansotra

Abstract: In this paper, we give common coincidence point and common fixed point theorems for four self maps in the setting of generalized TAC-contraction in partial b-metric space. Also, we give an example to authenticate the viability of the results.

Submitted 16 October, 2023; originally announced October 2023.

MSC Class: 47H10; 54H25

arXiv:2310.12161 [pdf, ps, other]

Fixed Point Theorems Using Interpolative Boyd-Wong Type Contractions And Interpolative Matkowski Type Contractions on Partial Sb-Metric Space

Authors: Anuradha Gupta, Rahul Mansotra

Abstract: In this article, we define and explore the topological properties of partial Sb-metric space. We define interpolative Boyd-Wong type contraction and interpolative Matkowski type contractions in the setting of partial Sb-metric space and obtain fixed point results for the same.

Submitted 27 September, 2023; originally announced October 2023.

arXiv:2310.03340 [pdf, ps, other]

Constant rank subspaces of alternating bilinear forms from Galois Theory

Authors: Ashish Gupta, Sugata Mandal

Abstract: Let $L/K$ be a cyclic extension of degree $n = 2m$. It is known that the space $\text{Alt}_K(L)$ of alternating $K$-bilinear forms (skew-forms) on $L$ decomposes into a direct sum of $K$-subspaces $A^{σ^i}$ indexed by the elements of $\text{Gal}(L/K) = \langle σ\rangle$. It is also known that the components $A^{σ^i}$ can have nice constant-rank properties. We enhance and enrich these constant-rank results and show that the component $A^σ$ often decomposes directly into a sum of constant rank subspaces, that is, subspaces all of whose non-zero skew-forms have a fixed rank $r$. In particular, this is always true when $-1 \not \in L^2$. As a result we deduce a decomposition of $\text{Alt}_K(L)$ into subspaces of constant rank in several interesting situations. We also establish that a subspace of dimension $\frac{n}{2}$ all of whose nonzero skew-forms are non-degenerate can always be found in $A^{σ^i}$ where $σ^i$ has order divisible by $2$.

Submitted 5 October, 2023; originally announced October 2023.

Comments: 16 pages. Suggestions are welcomed

MSC Class: 12F05; 12F10; 15A63

arXiv:2310.02941 [pdf, ps, other]

Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

Authors: Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff

Abstract: This paper studies Hoeffding's inequality for Markov chains under the generalized concentrability condition defined via integral probability metric (IPM). The generalized concentrability condition establishes a framework that interpolates and extends the existing hypotheses of Markov chain Hoeffding-type inequalities. The flexibility of our framework allows Hoeffding's inequality to be applied beyond the ergodic Markov chains in the traditional sense. We demonstrate the utility by applying our framework to several non-asymptotic analyses arising from the field of machine learning, including (i) a generalization bound for empirical risk minimization with Markovian samples, (ii) a finite sample guarantee for Polyak-Ruppert averaging of SGD, and (iii) a new regret bound for rested Markovian bandits with general state space.

Submitted 4 October, 2023; originally announced October 2023.

arXiv:2309.14699 [pdf, ps, other]

Triviality of the automorphism group of the multiparameter quantum affine $n$-space

Authors: Ashish Gupta, Sugata Mandal

Abstract: A multiparameter quantum affine space of rank $n$ is the $\mathbb F$-algebra generated by indeterminates $X_1, \cdots, X_n$ satisfying $X_iX_j = q_{ij} X_jX_i \ (1 \le i < j \le n)$ where $q_{ij}$ are nonzero scalars in $\mathbb F^\ast$. The corresponding quantum torus is generated by the $X_i$ together with their inverses subject to the same relations. So far the automorphisms of a quantum affine space have been considered mainly in the uniparameter case, that is, $q_{ij} = q$. We remove this restriction here. Necessary and sufficient conditions are obtained for the quantum affine space to be rigid, that is, the only automorphisms are the trivial ones arising from the action of the torus $(\mathbb F^\ast)^n$. These conditions are based on the multiparameters $q_{ij}$ and also on the subgroup of $\mathbb F^\ast$ generated by these multiparameters. We employ the results in J. Alev and M. Chamarie, Derivations et automorphismes de quelques algebras quantiques, Communications in Algebra, 1992 (20), 1787-1802, and point out a small error in a main theorem in this paper which however remains valid with a small modification. We also note that a quantum affine space whose corresponding quantum torus has dimension one necessarily has a trivial automorphism group. This is a consequence of a result of J. M. Osborne, D. S. Passman, Derivations of Skew Polynomial Rings, J. Algebra, 1995, 176, 417-448. We expand the known list of examples of quantum tori that have dimension one and are thus hereditary noetherian domains.

Submitted 26 September, 2023; originally announced September 2023.

Comments: 12 pages. arXiv admin note: text overlap with arXiv:2102.02859

MSC Class: 16S38; 16S35; 16S36; 16W20

arXiv:2308.08988 [pdf, ps, other]

A Dirichlet character analogue of Ramanujan's formula for odd zeta values

Authors: Anushree Gupta, Md Kashif Jamal, Nilmoni Karak, Bibekananda Maji

Abstract: In 2001, Kanemitsu, Tanigawa, and Yoshimoto studied the following generalized Lambert series, $$ \sum_{n=1}^{\infty} \frac{n^{N-2h} }{\exp(n^N x)-1}, $$ for $N \in \mathbb{N}$ and $h\in \mathbb{Z}$ with some restriction on $h$. Recently, Dixit and the last author pointed out that this series has already been present in the Lost Notebook of Ramanujan with a more general form. However, Ramanujan did not provide any transformation identity for it. In the same paper, Dixit and the last author found an elegant generalization of Ramanujan's celebrated identity for $ζ(2m+1)$ while extending the results of Kanemitsu et al. In a subsequent work, Kanemitsu et al. explored another extended version of the aforementioned series, namely, $$\sum_{r=1}^{q}\sum_{n=1}^{\infty} \frac{χ(r)n^{N-2h}{\exp\left(-\frac{r}{q}n^N x\right)}}{1-\exp({-n^N x})},$$ where $χ$ denotes a Dirichlet character modulo $q$, $N\in 2\mathbb{N}$ and with some restriction on the variable $h$. In the current paper, we investigate the above series for {\it any} $N \in \mathbb{N}$ and $h \in \mathbb{Z}$. We obtain a Dirichlet character analogue of Dixit and the last author's identity and thereby derive a two variable generalization of Ramanujan's identity for $ζ(2m+1)$. Moreover, we establish a new identity for $L(1/3, χ)$ analogous to Ramanujan's famous identity for $ζ(1/2)$.

Submitted 29 September, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

Comments: 24 pages, comments are welcome!

MSC Class: Primary 11M06; Secondary 11J81

arXiv:2308.05230 [pdf, ps, other]

Generalized Weighted Composition Operators on Vector-Valued Weighted Bergman Space

Authors: Anuradha Gupta, Geeta Yadav

Abstract: In this research article the necessary and sufficient conditions for the norm of composition operator $C_Φ$ on $\mathcal{A}_α^2(H)$ to be one are obtained. Moreover, $C_Φ$ is unitary on $\mathcal{A}_α^2(H)$ if and only if it is a co-isometry. The necessary and sufficient conditions for Hermitian and normal composition operators on $\mathcal{A}_α^2(H)$ are also explored. Also, the characterization for boundedness of generalized weighted composition operator is obtained under some condition on $Φ.$

Submitted 9 August, 2023; originally announced August 2023.

Comments: 15 pages

MSC Class: 47B33; 47B38

arXiv:2307.11676 [pdf, ps, other]

Ascent and Descent of Composition Operators on Orlicz-Lorentz space

Authors: Neha Bhatia, Anuradha Gupta

Abstract: The aim of this paper is to discuss the characterizations of the composition operators on Orlicz-Lorentz space to have finite ascent (or descent).

Submitted 21 July, 2023; originally announced July 2023.

MSC Class: 47B33; 47B38; 46E30

arXiv:2306.15221 [pdf, other]

[Re] Double Sampling Randomized Smoothing

Authors: Aryan Gupta, Sarthak Gupta, Abhay Kumar, Harsh Dugar

Abstract: This paper is a contribution to the reproducibility challenge in the field of machine learning, specifically addressing the issue of certifying the robustness of neural networks (NNs) against adversarial perturbations. The proposed Double Sampling Randomized Smoothing (DSRS) framework overcomes the limitations of existing methods by using an additional smoothing distribution to improve the robustness certification. The paper provides a clear manifestation of DSRS for a generalized family of Gaussian smoothing and a computationally efficient method for implementation. The experiments on MNIST and CIFAR-10 demonstrate the effectiveness of DSRS, consistently certifying larger robust radii compared to other methods. Also, various ablation studies are conducted to further analyze the hyperparameters and effect of adversarial training methods on the certified radius by the proposed framework.

Submitted 27 June, 2023; originally announced June 2023.

arXiv:2305.17552 [pdf, other]

Online Nonstochastic Model-Free Reinforcement Learning

Authors: Udaya Ghai, Arushi Gupta, Wenhan Xia, Karan Singh, Elad Hazan

Abstract: We investigate robust model-free reinforcement learning algorithms designed for environments that may be dynamic or even adversarial. Traditional state-based policies often struggle to accommodate the challenges imposed by the presence of unmodeled disturbances in such settings. Moreover, optimizing linear state-based policies poses an obstacle for efficient optimization, leading to nonconvex objectives, even in benign environments like linear dynamical systems. Drawing inspiration from recent advancements in model-based control, we introduce a novel class of policies centered on disturbance signals. We define several categories of these signals, which we term pseudo-disturbances, and develop corresponding policy classes based on them. We provide efficient and practical algorithms for optimizing these policies. Next, we examine the task of online adaptation of reinforcement learning agents in the face of adversarial disturbances. Our methods seamlessly integrate with any black-box model-free approach, yielding provable regret guarantees when dealing with linear dynamics. These regret guarantees unconditionally improve the best-known results for bandit linear control in having no dependence on the state-space dimension. We evaluate our method over various standard RL benchmarks and demonstrate improved robustness.

Submitted 31 October, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

Comments: Camera-ready version for NeurIPS 2023

arXiv:2303.07464 [pdf, other]

doi 10.1175/JPO-D-23-0247.1

Understanding Stokes drift mechanism via crest and trough phase estimates

Authors: Anirban Guha, Akanksha Gupta

Abstract: By providing mathematical estimates, this paper answers a fundamental question -- "what leads to Stokes drift"? Although overwhelmingly understood for water waves, Stokes drift is a generic mechanism that stems from kinematics and occurs in any non-transverse wave in fluids. To showcase its generality, we undertake a comparative study of the pathline equation of sound (1D) and intermediate-depth water (2D) waves. Although we obtain a closed-form solution $\mathbf{x}(t)$ for the specific case of linear sound waves, a more generic and meaningful approach involves the application of asymptotic methods and expressing variables in terms of the Lagrangian phase $θ$. We show that the latter reduces the 2D pathline equation of water waves to 1D. Using asymptotic methods, we solve the respective pathline equation for sound and water waves, and for each case, we obtain a parametric representation of particle position $\mathbf{x}(θ)$ and elapsed time $t(θ)$. Such a parametric description has allowed us to obtain second-order-accurate expressions for the time duration, horizontal displacement, and average horizontal velocity of a particle in the crest and trough phases. All these quantities are of higher magnitude in the crest phase in comparison to the trough, leading to a forward drift, i.e. Stokes drift. We also explore particle trajectory due to second-order Stokes waves and compare it with linear waves. While finite amplitude waves modify the estimates obtained from linear waves, the understanding acquired from linear waves is generally found to be valid.

Submitted 24 February, 2024; v1 submitted 13 March, 2023; originally announced March 2023.

arXiv:2301.07338 [pdf, ps, other]

Generalizations of Chainability and Compactness, and the Hypertopologies

Authors: Ajit Kumar Gupta, Saikat Mukherjee

Abstract: We study two properties for subsets of a metric space. One of them is a generalization of chainability, finite chainability, and Menger convexity for metric spaces, while the other is a generalization of compactness. We explore the basic results related to these two properties. Further, in the perspective of these properties, we explore relations among the Hausdorff, Vietoris, and locally finite hypertopologies.

Submitted 18 January, 2023; originally announced January 2023.

MSC Class: 54B20

arXiv:2301.06198 [pdf, other]

Generalized Neural Closure Models with Interpretability

Authors: Abhinav Gupta, Pierre F. J. Lermusiaux

Abstract: Improving the predictive capability and computational cost of dynamical models is often at the heart of augmenting computational physics with machine learning (ML). However, most learning results are limited in interpretability and generalization over different computational grid resolutions, initial and boundary conditions, domain geometries, and physical or problem-specific parameters. In the present study, we simultaneously address all these challenges by developing the novel and versatile methodology of unified neural partial delay differential equations. We augment existing/low-fidelity dynamical models directly in their partial differential equation (PDE) forms with both Markovian and non-Markovian neural network (NN) closure parameterizations. The melding of the existing models with NNs in the continuous spatiotemporal space followed by numerical discretization automatically allows for the desired generalizability. The Markovian term is designed to enable extraction of its analytical form and thus provides interpretability. The non-Markovian terms allow accounting for inherently missing time delays needed to represent the real world. We obtain adjoint PDEs in the continuous form, thus enabling direct implementation across differentiable and non-differentiable computational physics codes, different ML frameworks, and treatment of nonuniformly-spaced spatiotemporal training data.
We demonstrate the new generalized neural closure models (gnCMs) framework using four sets of experiments based on advecting nonlinear waves, shocks, and ocean acidification models. Our learned gnCMs discover missing physics, find leading numerical error terms, discriminate among candidate functional forms in an interpretable fashion, achieve generalization, and compensate for the lack of complexity in simpler models. Finally, we analyze the computational advantages of our new framework.

Submitted 18 May, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

Comments: 26 pages, 7 figures, 11 pages of supplementary information

MSC Class: 68T07 (Primary) 37M05; 35A99; 86-08 (Secondary)

ACM Class: J.2; I.2; I.6

arXiv:2212.04343 [pdf, other]

Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization

Authors: Kayhan Behdin, Qingquan Song, Aman Gupta, David Durfee, Ayan Acharya, Sathiya Keerthi, Rahul Mazumder

Abstract: Modern deep learning models are over-parameterized, where the optimization setup strongly affects the generalization performance. A key element of reliable optimization for these systems is the modification of the loss function. Sharpness-Aware Minimization (SAM) modifies the underlying loss function to guide descent methods towards flatter minima, which arguably have better generalization abilities. In this paper, we focus on a variant of SAM known as mSAM, which, during training, averages the updates generated by adversarial perturbations across several disjoint shards of a mini-batch. Recent work suggests that mSAM can outperform SAM in terms of test accuracy. However, a comprehensive empirical study of mSAM is missing from the literature -- previous results have mostly been limited to specific architectures and datasets. To that end, this paper presents a thorough empirical evaluation of mSAM on various tasks and datasets. We provide a flexible implementation of mSAM and compare the generalization performance of mSAM to the performance of SAM and vanilla training on different image classification and natural language processing tasks. We also conduct careful experiments to understand the computational cost of training with mSAM, its sensitivity to hyperparameters and its correlation with the flatness of the loss landscape. Our analysis reveals that mSAM yields superior generalization performance and flatter minima, compared to SAM, across a wide range of tasks without significantly increasing computational costs.

Submitted 6 December, 2022; originally announced December 2022.

arXiv:2212.02232 [pdf, other]

Nucleation and development of multiple cracks in thin composite fibers

Authors: Arnav Gupta, Timothy J. Healey

Abstract: We study the nucleation and development of crack patterns in thin composite fibers under tension in this work. A fiber comprises an elastic core and an outer layer of a weaker brittle material. In recent tensile experiments on such composites, multiple cracks were observed to develop simultaneously on the outer layer. We propose here a simple one-dimensional model to predict this phenomenon. We idealize the problem as two axially loaded rods coupled by a linear interfacial condition. The latter can be regarded as an adhesive that resists slip between the two materials. One rod is modeled as a brittle material, and the other as a linearly elastic material, both undergoing finite deformations.

Submitted 14 November, 2022; originally announced December 2022.

arXiv:2211.12409 [pdf, other]

A Light-speed Linear Program Solver for Personalized Recommendation with Diversity Constraints

Authors: Haoyue Wang, Miao Cheng, Kinjal Basu, Aman Gupta, Keerthi Selvaraj, Rahul Mazumder

Abstract: We study a structured linear program (LP) that emerges in the need of ranking candidates or items in personalized recommender systems. Since the candidate set is only known in real time, the LP also needs to be formed and solved in real time. Latency and user experience are major considerations, requiring the LP to be solved within just a few milliseconds. Although typical instances of the problem are not very large in size, this stringent time limit appears to be beyond the capability of most existing (commercial) LP solvers, which can take $20$ milliseconds or more to find a solution. Thus, reliable methods that address the real-world complication of latency become necessary. In this paper, we propose a fast specialized LP solver for a structured problem with diversity constraints. Our method solves the dual problem, making use of the piece-wise affine structure of the dual objective function, with an additional screening technique that helps reduce the dimensionality of the problem as the algorithm progresses. Experiments reveal that our method can solve the problem within roughly 1 millisecond, yielding a 20x improvement in speed over efficient off-the-shelf LP solvers. This speed-up can help improve the quality of recommendations without affecting user experience, highlighting how optimization can provide solid orthogonal value to machine-learned recommender systems.

Submitted 22 November, 2022; originally announced November 2022.

arXiv:2209.12937 [pdf, ps, other]

Robustness to Modeling Errors in Risk-Sensitive Markov Decision Problems with Markov Risk Measures

Authors: Shiping Shao, Abhishek Gupta, William B. Haskell

Abstract: We consider risk-sensitive Markov decision processes (MDPs), where the MDP model is influenced by a parameter which takes values in a compact metric space. We identify sufficient conditions under which small perturbations in the model parameters lead to small changes in the optimal value function and optimal policy. We further establish the robustness of the risk-sensitive optimal policies to modeling errors. Implications of the results for data-driven decision-making, decision-making with preference uncertainty, and systems with changing noise distributions are discussed.

Submitted 26 September, 2022; originally announced September 2022.

Comments: 24 pages, submitted to SIAM Journal on Control and Optimization

arXiv:2209.08519 [pdf, ps, other]

Centrally endo-AIP Modules

Authors: Shiv Kumar, Ashok Ji Gupta

Abstract: In this paper, we introduce the concept of centrally endo-AIP modules. We call a module M centrally endo-AIP if the left annihilator of any fully invariant submodule N of M in the endomorphism ring S = End(M) is a centrally s-unital ideal of S. We discuss some properties of centrally endo-AIP modules. We also study the endomorphism ring of centrally endo-AIP modules and characterize quasi-Baer modules in terms of centrally endo-AIP modules.

Submitted 18 September, 2022; originally announced September 2022.

MSC Class: 16D10; 16D40; 16S50

arXiv:2209.04176 [pdf, ps, other]

Modules in which pure submodule is essential in a direct summand

Authors: Kaushal Gupta, Shiv Kumar, Ashok Ji Gupta

Abstract: In this paper, we study the class of modules that have the property that every pure submodule is essential in a direct summand. These modules are termed pure extending modules, and they form a proper generalisation of extending modules. Examples and counterexamples are given. We study some properties of pure extending modules and characterize regular rings, semisimple rings, local rings and PDS rings in terms of pure extending modules.

Submitted 9 September, 2022; originally announced September 2022.

MSC Class: 16D40; 16D60; 16E50

arXiv:2208.10999 [pdf, ps, other]

Self-adjoint and co-isometry composition and weighted composition operators on Fock-type spaces

Authors: Anuradha Gupta, Geeta Yadav

Abstract: In this paper we obtain characterizations for the adjoint of a composition and weighted composition operator to be a composition and weighted composition operator on $F_ψ^2,$ respectively. We study the co-isometry composition and weighted composition operators on $F_ψ^2.$

Submitted 23 August, 2022; originally announced August 2022.

Comments: 13 pages

MSC Class: 47B33; 47B38

arXiv:2208.03038 [pdf, other]

Leveraging Distributional Bias for Reactive Collision Avoidance under Uncertainty: A Kernel Embedding Approach

Authors: Anish Gupta, Arun Kumar Singh, K. Madhava Krishna

Abstract: Many commodity sensors that measure the robot and dynamic obstacle's state have non-Gaussian noise characteristics. Yet, many current approaches treat the underlying uncertainty in motion and perception as Gaussian, primarily to ensure computational tractability. On the other hand, existing planners working with non-Gaussian uncertainty do not shed light on leveraging distributional characteristics of motion and perception noise, such as bias, for efficient collision avoidance. This paper fills this gap by interpreting reactive collision avoidance as a distribution matching problem between the collision constraint violations and Dirac Delta distribution. To ensure fast reactivity in the planner, we embed each distribution in Reproducing Kernel Hilbert Space and reformulate the distribution matching as minimizing the Maximum Mean Discrepancy (MMD) between the two distributions. We show that evaluating the MMD for a given control input boils down to just matrix-matrix products. We leverage this insight to develop a simple control sampling approach for reactive collision avoidance with dynamic and uncertain obstacles. We advance the state-of-the-art in two respects. First, we conduct an extensive empirical study to show that our planner can infer distributional bias from sample-level information. Consequently, it uses this insight to guide the robot to good homotopy. We also highlight how a Gaussian approximation of the underlying uncertainty can lose the bias estimate and guide the robot to unfavorable states with a high collision probability.
Second, we show tangible comparative advantages of the proposed distribution matching approach for collision avoidance with previous non-parametric and Gaussian approximated methods of reactive collision avoidance.

Submitted 22 September, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

arXiv:2205.11855 [pdf, ps, other]

Lebesgue Number and Total Boundedness

Authors: Ajit Kumar Gupta, Saikat Mukherjee

Abstract: A generalization of the Lebesgue number lemma is obtained. It is proved that, if each countably infinite locally finite open cover of a chainable metric space $X$ has a Lebesgue number, then $X$ is totally bounded. A property of metric spaces which is a generalization of connectedness and Menger convexity is introduced. It is observed that Atsujiness and compactness are equivalent for a metric space with this introduced property as well as for a chainable metric space.

Submitted 24 May, 2022; originally announced May 2022.

Comments: 6

MSC Class: 54E50; 54D05

arXiv:2205.11842 [pdf, ps, other]

Induced Homeomorphism and Atsuji Hyperspaces

Authors: Ajit Kumar Gupta, Saikat Mukherjee

Abstract: Given uniformly homeomorphic metric spaces $X$ and $Y$, it is proved that the hyperspaces $C(X)$ and $C(Y)$ are uniformly homeomorphic, where $C(X)$ denotes the collection of all nonempty closed subsets of $X$, and is endowed with Hausdorff distance. Gerald Beer has proved that the hyperspace $C(X)$ is Atsuji when $X$ is either compact or uniformly discrete. An Atsuji space is a generalization of compact metric spaces as well as of uniformly discrete spaces. In this article, we investigate the space $C(X)$ when $X$ is Atsuji, and a class of Atsuji subspaces of $C(X)$ is obtained. Using the obtained results, some fixed point results for continuous maps on Atsuji spaces are obtained.

Submitted 24 May, 2022; originally announced May 2022.

Comments: 9

MSC Class: 54B20

arXiv:2204.12234 [pdf, ps, other]

Σ-dual Rickart modules

Authors: Shiv Kumar, Ashok Ji Gupta

Abstract: In this paper, we dualize the concept of Σ-Rickart modules as Σ-dual Rickart modules. An R-module M is said to be Σ-dual Rickart if the direct sum of arbitrary copies of M is dual Rickart. We prove that each cohereditary module over a Noetherian ring is Σ-dual Rickart. We introduce the notion of strongly cogenerated modules and characterize Σ-dual Rickart modules in terms of strongly cogenerated modules. We also study some properties of Σ-dual Rickart modules and find connections with semisimple Artinian rings, regular rings, semi-hereditary rings and FP-injective modules. Further, we study the endomorphism ring of Σ-dual Rickart modules.

Submitted 17 August, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

arXiv:2201.11722 [pdf, other]

doi 10.1109/CDC51059.2022.9992982

Change Detection of Markov Kernels with Unknown Pre and Post Change Kernel

Authors: Hao Chen, Jiacheng Tang, Abhishek Gupta

Abstract: In this paper, we develop a new change detection algorithm for detecting a change in the Markov kernel over a metric space in which the post-change kernel is unknown. Under the assumption that the pre- and post-change Markov kernels are uniformly ergodic, we derive an upper bound on the mean delay and a lower bound on the mean time between false alarms. A numerical simulation is provided to demonstrate the effectiveness of our method.

Submitted 5 September, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

Comments: 7 pages, 4 figures

arXiv:2111.07520 [pdf]

Introducing Services and Protocols for Inter-Hub Transportation in the Physical Internet

Authors: Sahrish Jaleel Shaikh, Benoit Montreuil, Moussa Hodjat-Shamami, Ashish Gupta

Abstract: The Physical Internet (PI) puts high emphasis on enabling logistics to reliably perform at the speed mandated by and promised to customers, and to do so efficiently and sustainably. To do so, goods to be moved are encapsulated in modular containers and these are flowed from hub to hub in relay mode. At each hub, PI enables fast and efficient dynamic consolidation of sets of containers to be shipped together to next hubs. Each consolidated set is assigned to an appropriate vehicle so as to enact the targeted transport. In this paper, we address the case where transportation service providers are available to provide vehicles and trailers of distinct dimensions on demand according to openly agreed and/or contracted terms. We describe the essence of such terms, notably relative to expected frequency distribution of transport requests, and expectations about time between request and arrival at hub. In such a context, we introduce rigorous generic protocols that can be applied at each hub so as to dynamically generate consolidation sets of modular containers and requests for on-demand transportation services, in an efficient, resilient, and sustainable way ensuring reliable pickup and delivery within the promised time windows. We demonstrate the performance of such protocols using a simulation-based experiment for a national intercity express parcel logistic network. We finally provide conclusive remarks and promising avenues for field implementation and further research.

Submitted 14 November, 2021; originally announced November 2021.

Comments: IPIC 2021 International Physical Internet Conference

arXiv:2111.06308 [pdf, other]

Online Discrepancy with Recourse for Vectors and Graphs

Authors: Anupam Gupta, Vijaykrishna Gurunathan, Ravishankar Krishnaswamy, Amit Kumar, Sahil Singla

Abstract: The vector-balancing problem is a fundamental problem in discrepancy theory: given T vectors in $[-1,1]^n$, find a signing $σ(a) \in \{\pm 1\}$ of each vector $a$ to minimize the discrepancy $\| \sum_{a} σ(a) \cdot a \|_{\infty}$. This problem has been extensively studied in the static/offline setting. In this paper we initiate its study in the fully-dynamic setting with recourse: the algorithm sees a stream of T insertions and deletions of vectors, and at each time must maintain a low-discrepancy signing, while also minimizing the amortized recourse (the number of times any vector changes its sign) per update. For general vectors, we show algorithms which almost match Spencer's $O(\sqrt{n})$ offline discrepancy bound, with ${O}(n\cdot poly\!\log T)$ amortized recourse per update. The crucial idea is to compute a basic feasible solution to the linear relaxation in a distributed and recursive manner, which helps find a low-discrepancy signing. To bound recourse we argue that only a small part of the instance needs to be re-computed at each update. Since vector balancing has also been greatly studied for sparse vectors, we then give algorithms for low-discrepancy edge orientation, where we dynamically maintain signings for 2-sparse vectors. Alternatively, this can be seen as orienting a dynamic set of edges of an n-vertex graph to minimize the absolute difference between in- and out-degrees at any vertex. We present a deterministic algorithm with $O(poly\!\log n)$ discrepancy and $O(poly\!\log n)$ amortized recourse.
The core ideas are to dynamically maintain an expander-decomposition with low recourse and then to show that, as the expanders change over time, a natural local-search algorithm converges quickly (i.e., with low recourse) to a low-discrepancy solution. We also give strong lower bounds for local-search discrepancy minimization algorithms.

Submitted 11 November, 2021; originally announced November 2021.

Comments: 29 pages. Appears in SODA 2022

arXiv:2110.04114 [pdf, ps, other]

Self-adjoint and co-isometry composition and weighted composition operator on general weighted Hardy space

Authors: Anuradha Gupta, Geeta Yadav

Abstract: In this paper we study the self-adjoint and co-isometry composition and weighted composition operator on $H_E(ζ).$ We also discuss the conditions under which the adjoint operator of a composition and weighted composition operator on $H_E(ζ)$ is some composition and weighted composition operator, respectively.

Submitted 8 October, 2021; originally announced October 2021.

Comments: 12 pages

MSC Class: 47B33; 47B37

arXiv:2107.10755 [pdf, ps, other]

Point Singularities in Incompatible Elasticity

Authors: Animesh Pandey, Anurag Gupta

Abstract: The equations of stress equilibrium and strain compatibility/incompatibility are discussed for fields with point singularities in a planar domain. The sufficiency (or insufficiency) of the smooth maps, obtained by restricting the singular fields to the domain away from the singularities, in completely characterizing the equations of equilibrium and compatibility/incompatibility over the entire domain, is established and illustrated with examples. The uniqueness of the solution to the stress problem of incompatible linear elasticity, allowing for singular fields, is proved. The uniqueness fails when the problem is considered solely in terms of the restricted maps. As applications of our framework, a general stress solution, in response to point supported body force and defect fields, is derived and a generalized notion of the force acting on a defect is developed.

Submitted 22 July, 2021; originally announced July 2021.

arXiv:2107.08363 [pdf, other]

Rotating Binaries

Authors: Anant Gupta, Idriss J. Aberkane, Sourangshu Ghosh, Adrian Abold, Alexander Rahn, Eldar Sultanow

Abstract: This paper investigates the behaviour of rotating binaries. A rotation by $r$ digits to the left of a binary number $B$ exhibits in particular cases the divisibility $l\mid N_1(B)\cdot r+1$, where $l$ is the bit-length of $B$ and $N_1(B)$ is the Hamming weight of $B$, that is the number of ones in $B$. The integer $r$ is called the left-rotational distance. We investigate the connection between this rotational distance, the length and the Hamming weight of binary numbers. Moreover we follow the question under which circumstances the above mentioned divisibility is true. We have found out and will demonstrate that this divisibility occurs for $kn+c$ cycles.

Submitted 18 July, 2021; originally announced July 2021.

Comments: 16 Pages, 5 Tables, 12 References

MSC Class: 11-11; 11D72 (Primary) 11D45; 68R99 (Secondary)

arXiv:2107.02851 [pdf, ps, other]

Product isometry of generalized weighted composition operator on general weighted Hardy space

Authors: Anuradha Gupta, Geeta Yadav

Abstract: We obtain necessary and sufficient conditions for the composition and weighted composition operator and product of composition operators to be isometry and unitary on $H_{E}(ξ).$ With the help of a counterexample we also prove that the product of two non-isometric composition operator and weighted composition operator can be isometry on $H_{E}(ξ)$. We also completely characterize the boundedness of generalized weighted composition operators on $H_{E}(ξ).$

Submitted 21 October, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

Comments: 16 pages

MSC Class: 47B33; 47B37

arXiv:2107.02636 [pdf, ps, other]

Isometries of the product of composition operators on weighted Bergman space

Authors: Anuradha Gupta, Geeta Yadav

Abstract: In this paper the necessary and sufficient conditions for the product of composition operators to be isometry are obtained on weighted Bergman space. With the help of a counter example we also proved that unlike on $\mathcal{H}^2(\mathbb{D})$ and $\mathcal{A}_α^2(\mathbb{D}),$ the composition operator on $\mathcal{S}^2(\mathbb{D})$ induced by an analytic self map on $\mathbb{D}$ with fixed origin need not be of norm one. We have generalized the Schwartz's well known result on $\mathcal{A}_α^2(\mathbb{D})$ which characterizes the almost multiplicative operator on $\mathcal{H}^2(\mathbb{D}).$

Submitted 6 July, 2021; originally announced July 2021.

Comments: 8 pages

MSC Class: 47B33; 47B38

arXiv:2106.04797 [pdf, ps, other]

On Ramanujan's formula for $ζ(1/2)$ and $ζ(2m+1)$

Authors: Anushree Gupta, Bibekananda Maji

Abstract: Page 332 of Ramanujan's Lost Notebook contains a compelling identity for $ζ(1/2)$, which has been studied by many mathematicians over the years. On the same page, Ramanujan also recorded the series, \begin{align*} \frac{1^r}{\exp(1^s x) - 1} + \frac{2^r}{\exp(2^s x) - 1} + \frac{3^r}{\exp(3^s x) - 1} + \cdots, \end{align*} where $s$ is a positive integer and $r-s$ is any even integer. Unfortunately, Ramanujan does not give any formula for it. This series was rediscovered by Kanemitsu, Tanigawa, and Yoshimoto, although they studied it only when $r-s$ is a negative even integer. Recently, Dixit and the second author generalized the work of Kanemitsu et al. and obtained a transformation formula for the aforementioned series when $r-s$ is any even integer. While extending the work of Kanemitsu et al., Dixit and the second author obtained a beautiful generalization of Ramanujan's formula for odd zeta values. In the current paper, we investigate transformation formulas for an infinite series, and interestingly, we derive Ramanujan's formula for $ζ(1/2)$, Wigert's formula for $ζ(1/k)$, as well as Ramanujan's formula for $ζ(2m+1)$. Furthermore, we obtain a new identity for $ζ(-1/2)$ in the spirit of Ramanujan.

Submitted 9 June, 2021; originally announced June 2021.

Comments: 32 pages, Comments are welcome!

MSC Class: Primary 11M06; Secondary 11J81

arXiv:2105.10710 [pdf, other]

Diophantus Equations and Partially Ordered Sets

Authors: Addea Gupta

Abstract: In [1] it is shown that the Diophantine equation $(k!)^n+k^n=(n!)^k+n^k$ only has the trivial solution $n=k$, and $(k!)^n-k^n=(n!)^k-n^k$ only has the solutions $n=k$, $(n, k)=(1, 2),$ and $(2, 1)$. In this article we find all solutions of the Diophantine equations $a_1!a_2!\cdots a_n! \pm a_1a_2 \cdots a_n = b_1!b_2! \cdots b_k! \pm b_1b_2 \cdots b_k$, where $a_i$ majorizes $b_i$. Furthermore, we find a sufficient condition on a function $f:N\to R^+$ to guarantee that $f$ gives a monotone function on the poset of all finite sequences of natural numbers. We then use that to solve other Diophantine equations involving factorials and generalize the results of [2]. We also explore similar Diophantine equations for the Fibonacci sequence and other sequences of natural numbers given by linear recursions of the form $A_{n+2}=aA_{n+1}+bA_{n}$.

Submitted 22 May, 2021; originally announced May 2021.

Comments: 12 pages, 1 figure

MSC Class: 06A06; 11D72; 11B65; 33B15
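
The two base equations quoted in the abstract above can be checked numerically on a small range. The following is only a minimal brute-force sketch, not the method of the paper: the function name `solutions` and the search bound `LIMIT` are hypothetical choices made for illustration, and a finite search cannot prove the general statement.

```python
from math import factorial


def solutions(sign, limit):
    """Return all pairs (n, k) with 1 <= n, k <= limit that satisfy
    (k!)^n + sign * k^n == (n!)^k + sign * n^k, where sign is +1 or -1."""
    found = []
    for n in range(1, limit + 1):
        for k in range(1, limit + 1):
            lhs = factorial(k) ** n + sign * k ** n
            rhs = factorial(n) ** k + sign * n ** k
            if lhs == rhs:
                found.append((n, k))
    return found


LIMIT = 8  # hypothetical search bound, chosen only to keep the run fast

# Drop the trivial diagonal n == k and keep the remaining solutions.
plus_nontrivial = [(n, k) for n, k in solutions(+1, LIMIT) if n != k]
minus_nontrivial = [(n, k) for n, k in solutions(-1, LIMIT) if n != k]

print("plus sign, n != k:", plus_nontrivial)
print("minus sign, n != k:", minus_nontrivial)
```

Python integers have arbitrary precision, so the comparison is exact even though the factorial powers grow quickly. Within this bound, the statement quoted from [1] predicts no off-diagonal solutions for the plus sign and only $(1, 2)$ and $(2, 1)$ for the minus sign.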

arXiv:2102.03953 [pdf, ps, other]

Inscribed triangles of Jordan curves in $\mathbb{R}^{n}$

Authors: Aryaman Gupta, Simon Rubinstein-Salzedo

Abstract: Nielsen's theorem states that any triangle can be inscribed in a planar Jordan curve. We prove a generalisation of this theorem, extending to any Jordan curve $J$ embedded in $\mathbb{R}^{n}$, for a restricted set of triangles. We then conclude by investigating a condition under which a given point of $J$ inscribes an equilateral triangle in particular.

Submitted 7 February, 2021; originally announced February 2021.

Comments: Feedback welcome!

arXiv:2102.02859 [pdf, ps, other]

Automorphisms of Quantum Polynomials

Authors: Ashish Gupta

Abstract: An important step in the determination of the automorphism group of the quantum torus of rank $n$ (or twisted group algebra of $\mathbb Z^n$) is the determination of its so-called non-scalar automorphisms. We present a new algorithmic approach towards this problem based on the bivector representation $\wedge^2 : \mathrm{GL}(n, \mathbb Z) \rightarrow \mathrm{GL}(\binom{n}{2}, \mathbb Z)$ of $\mathrm{GL}(n, \mathbb Z)$ and thus compute the non-scalar automorphism group $\mathrm{Aut}(\mathbb Z^n, λ)$ in several new cases. As an application of our ideas we show that the quantum polynomial algebra (multiparameter quantum affine space of rank $n$) has only scalar (or toric) automorphisms provided that the torsion-free rank of the subgroup generated by the defining multiparameters is no less than $\binom{n - 1}{2} + 1$, thus improving an earlier result. We also investigate the question: when is a multiparameter quantum affine space free of so-called linear automorphisms other than those arising from the action of the $n$-torus ${(\mathbb F^\ast)}^n$?

Submitted 4 February, 2021; originally announced February 2021.

arXiv:2102.00568 [pdf, other]

An Algorithm to Warm Start Perturbed (WASP) Constrained Dynamic Programs

Authors: Abhishek Gupta, Shreshta Rajakumar Deshpande, Marcello Canova

Abstract: Receding horizon optimal control problems compute the solution at each time step to operate the system on a near-optimal path. However, in many practical cases, the boundary conditions, such as external inputs, constraint equations, or the objective function, vary only marginally from one time step to the next. In this case, recomputing the optimal solution at each time step represents a significant burden for real-time applications. This paper proposes a novel algorithm to approximately solve a perturbed constrained dynamic program that significantly reduces the computational burden when the objective function and the constraints are perturbed slightly. The method hinges on determining closed-form expressions for first-order perturbations in the optimal strategy and the Lagrange multipliers of the perturbed constrained dynamic programming problem. This information can be used to initialize any algorithm (such as the method of Lagrange multipliers, or the augmented Lagrangian method) to solve the perturbed dynamic programming problem with minimal computational resources.

Submitted 31 January, 2021; originally announced February 2021.

Comments: This work has been submitted to Automatica for possible publication and is under review. Paper summary: 14 pages, 3 figures

arXiv:2101.10814

Spread and defend infection in graphs

Authors: Arya Tanmay Gupta

Abstract: The spread of an infection, a contagion, meme, emotion, message and various other spreadable objects has been discussed in several works. Burning and firefighting have been discussed in particular on static graphs. Graph burning simulates the notion of the spread of "fire" throughout a graph (plus, one unburned node burned at each time-step); graph firefighting simulates the defending of nodes by placing firefighters on the nodes which have not already been burned while the fire is being spread (started by only a single fire source). This article studies a combination of firefighting and burning on a graph class which is a variation (generalization) of temporal graphs. Nodes can be infected from "outside" a network. We present a notion of both upgrading (of unburned nodes, similar to firefighting) and repairing (of infected nodes). The nodes which are burned, firefighted, or repaired are chosen probabilistically, so a variable number of nodes can be infected, upgraded and repaired in each time step. In the model presented in this article, both burning and firefighting proceed concurrently; we introduce such a system to enable the community to study the notion of spread of an infection and the notion of upgrade/repair against each other. The graph class that we study (on which these processes are simulated) is a variation of the temporal graph class in which at each time-step, probabilistically, a communication takes place (iff an edge exists in that time step). In addition, a node can be "worn out" and thus can be removed from the network, and a new healthy node can be added to the network as well. This class of graphs enables systems with high complexity to be simulated and studied.

Submitted 16 November, 2023; v1 submitted 5 January, 2021; originally announced January 2021.

Comments: incomplete work. major revision required

arXiv:2101.10255 [pdf, ps, other]

Consistent specification testing under spatial dependence

Authors: Abhimanyu Gupta, Xi Qu

Abstract: We propose a series-based nonparametric specification test for a regression function when data are spatially dependent, the `space' being of a general economic or social nature. Dependence can be parametric, parametric with increasing dimension, semiparametric or any combination thereof, thus covering a vast variety of settings. These include spatial error models of varying types and levels of complexity. Under a new smooth spatial dependence condition, our test statistic is asymptotically standard normal. To prove the latter property, we establish a central limit theorem for quadratic forms in linear processes in an increasing dimension setting. Finite sample performance is investigated in a simulation study, with a bootstrap method also justified and illustrated, and empirical examples illustrate the test with real-world data.

Submitted 29 August, 2022; v1 submitted 25 January, 2021; originally announced January 2021.

Comments: 70 pages

arXiv:2012.13869 [pdf, other]

doi 10.1098/rspa.2020.1004

Neural Closure Models for Dynamical Systems

Authors: Abhinav Gupta, Pierre F. J. Lermusiaux

Abstract: Complex dynamical systems are used for predictions in many domains. Because of computational costs, models are truncated, coarsened, or aggregated. As the neglected and unresolved terms become important, the utility of model predictions diminishes. We develop a novel, versatile, and rigorous methodology to learn non-Markovian closure parameterizations for known-physics/low-fidelity models using data from high-fidelity simulations. The new "neural closure models" augment low-fidelity models with neural delay differential equations (nDDEs), motivated by the Mori-Zwanzig formulation and the inherent delays in complex dynamical systems. We demonstrate that neural closures efficiently account for truncated modes in reduced-order-models, capture the effects of subgrid-scale processes in coarse models, and augment the simplification of complex biological and physical-biogeochemical models. We find that using non-Markovian over Markovian closures improves long-term prediction accuracy and requires smaller networks. We derive adjoint equations and network architectures needed to efficiently implement the new discrete and distributed nDDEs, for any time-integration schemes and allowing nonuniformly-spaced temporal training data. The performance of discrete over distributed delays in closure models is explained using information theory, and we find an optimal amount of past information for a specified architecture. Finally, we analyze computational complexity and explain the limited additional cost due to neural closure models.

Submitted 13 July, 2021; v1 submitted 27 December, 2020; originally announced December 2020.

Comments: 29 pages, 9 figures, 13 pages of supplementary information

MSC Class: 68T01 (Primary) 37M05; 34A99; 86-08 (Secondary)

ACM Class: J.2; I.2.m

Journal ref: Proc. R. Soc. A 477-2252 (2021): 20201004

arXiv:2012.08208 [pdf, other]

A 55-line code for large-scale parallel topology optimization in 2D and 3D

Authors: Abhinav Gupta, Rajib Chowdhury, Anupam Chakrabarti, Timon Rabczuk

Abstract: This paper presents a 55-line code written in Python for 2D and 3D topology optimization (TO) based on the open-source finite element computing software (FEniCS), equipped with various finite element tools and solvers. PETSc is used as the linear algebra back-end, which results in significantly less computational time than standard Python libraries. The code is designed based on the popular solid isotropic material with penalization (SIMP) methodology. Extensions to multiple load cases, different boundary conditions, and incorporation of passive elements are also presented. Thus, this implementation is the most compact implementation of SIMP-based topology optimization for 3D as well as 2D problems. Utilizing the concept of Euclidean distance matrix to vectorize the computation of the weight matrix for the filter, we have achieved a substantial reduction in the computational time and have also made it possible for the code to work with complex ground structure configurations. We have also presented the code's extension to large-scale topology optimization problems with support for parallel computations on complex structural configurations, which could help students and researchers explore novel insights into the TO problem with dense meshes. The complete code is given in Appendix A and is also available at \url{https://github.com/iitrabhi/topo-fenics}.

Submitted 15 December, 2020; originally announced December 2020.

Showing 1–50 of 154 results for author: Gupta, A