Search | arXiv e-print repository

$i$Trust: Trust-Region Optimisation with Ising Machines

Authors: Sayantan Pramanik, Kaumudibikash Goswami, Sourav Chatterjee, M Girish Chandra

Abstract: In this work, we present a heretofore unseen application of Ising machines to perform trust region-based optimisation with box constraints. This is done by considering a specific form of opto-electronic oscillator-based coherent Ising machines with clipped transfer functions, and proposing appropriate modifications to facilitate trust-region optimisation. The enhancements include the inclusion of… ▽ More In this work, we present a heretofore unseen application of Ising machines to perform trust region-based optimisation with box constraints. This is done by considering a specific form of opto-electronic oscillator-based coherent Ising machines with clipped transfer functions, and proposing appropriate modifications to facilitate trust-region optimisation. The enhancements include the inclusion of non-symmetric coupling and linear terms, modulation of noise, and compatibility with convex-projections to improve its convergence. The convergence of the modified Ising machine has been shown under the reasonable assumptions of convexity or invexity. The mathematical structures of the modified Ising machine and trust-region methods have been exploited to design a new trust-region method to effectively solve unconstrained optimisation problems in many scenarios, such as machine learning and optimisation of parameters in variational quantum algorithms. Hence, the proposition is useful for both classical and quantum-classical hybrid scenarios. Finally, the convergence of the Ising machine-based trust-region method, has also been proven analytically, establishing the feasibility of the technique. △ Less

Submitted 6 June, 2024; originally announced July 2024.

Comments: This is a first draft; proofs of the lemmas, theorems, and corollaries herein will be included in the next version, along with experimental results. Reviews, comments, and discussions are welcome

arXiv:2406.02794 [pdf, other]

PriME: Privacy-aware Membership profile Estimation in networks

Authors: Abhinav Chakraborty, Sayak Chatterjee, Sagnik Nandy

Abstract: This paper presents a novel approach to estimating community membership probabilities for network vertices generated by the Degree Corrected Mixed Membership Stochastic Block Model while preserving individual edge privacy. Operating within the $\varepsilon$-edge local differential privacy framework, we introduce an optimal private algorithm based on a symmetric edge flip mechanism and spectral clu… ▽ More This paper presents a novel approach to estimating community membership probabilities for network vertices generated by the Degree Corrected Mixed Membership Stochastic Block Model while preserving individual edge privacy. Operating within the $\varepsilon$-edge local differential privacy framework, we introduce an optimal private algorithm based on a symmetric edge flip mechanism and spectral clustering for accurate estimation of vertex community memberships. We conduct a comprehensive analysis of the estimation risk and establish the optimality of our procedure by providing matching lower bounds to the minimax risk under privacy constraints. To validate our approach, we demonstrate its performance through numerical simulations and its practical application to real-world data. This work represents a significant step forward in balancing accurate community membership estimation with stringent privacy preservation in network data analysis. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2405.01193 [pdf, ps, other]

On Hyperbolicity of Spirallike Circularlike domain

Authors: Sanjoy Chatterjee, Golam Mostafa Mondal

Abstract: In this paper, we prove that a spirallike circularlike domain is Kobayashi hyperbolic if and only if its core is empty. In particular, we show that such a domain is Kobayashi hyperbolic if and only if it is (biholomorphic to) a bounded domain. We also propose a problem in this area. In this paper, we prove that a spirallike circularlike domain is Kobayashi hyperbolic if and only if its core is empty. In particular, we show that such a domain is Kobayashi hyperbolic if and only if it is (biholomorphic to) a bounded domain. We also propose a problem in this area. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: Preliminary draft. Comments are welcome

MSC Class: 32F45; 32H02; 32Q02

arXiv:2404.02001 [pdf, other]

Liouville Theory: An Introduction to Rigorous Approaches

Authors: Sourav Chatterjee, Edward Witten

Abstract: In recent years, a surprisingly direct and simple rigorous understanding of quantum Liouville theory has developed. We aim here to make this material more accessible to physicists working on quantum field theory. In recent years, a surprisingly direct and simple rigorous understanding of quantum Liouville theory has developed. We aim here to make this material more accessible to physicists working on quantum field theory. △ Less

Submitted 4 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: 41 pp, added references in v. 2

arXiv:2401.10507 [pdf, ps, other]

A scaling limit of $\mathrm{SU}(2)$ lattice Yang-Mills-Higgs theory

Authors: Sourav Chatterjee

Abstract: The construction of non-Abelian Euclidean Yang-Mills theories in dimension four, as scaling limits of lattice Yang-Mills theories or otherwise, is a central open question of mathematical physics. This paper takes the following small step towards this goal. In any dimension $d\ge 2$, we construct a scaling limit of $\mathrm{SU}(2)$ lattice Yang-Mills theory coupled to a Higgs field transforming in… ▽ More The construction of non-Abelian Euclidean Yang-Mills theories in dimension four, as scaling limits of lattice Yang-Mills theories or otherwise, is a central open question of mathematical physics. This paper takes the following small step towards this goal. In any dimension $d\ge 2$, we construct a scaling limit of $\mathrm{SU}(2)$ lattice Yang-Mills theory coupled to a Higgs field transforming in the fundamental representation of $\mathrm{SU}(2)$. The scaling limit is obtained by sending the gauge coupling constant $g$ to zero and the Higgs length $α$ to infinity slower than $g^{-1}$, but faster than $g^{-1+1/49d}$. After unitary gauge fixing and taking the lattice scaling to zero as a constant multiple of $αg$, a stereographic projection of the gauge field is shown to converge to a scale-invariant massive Gaussian field. This gives the first construction of a scaling limit of a non-Abelian lattice Yang-Mills theory in a dimension higher than two, as well as the first rigorous proof of mass generation by the Higgs mechanism in such a theory. Analogous results are proved for $\mathrm{U}(1)$ theory as well. The question of constructing a non-Gaussian scaling limit remains open. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 45 pages

MSC Class: 70S15; 81T13; 81T25; 82B20

arXiv:2312.04290 [pdf, other]

Convergence Analysis of Opto-Electronic Oscillator based Coherent Ising Machines

Authors: Sayantan Pramanik, Sourav Chatterjee, Harshkumar Oza

Abstract: Ising machines are purported to be better at solving large-scale combinatorial optimisation problems better than conventional von Neumann computers. However, these Ising machines are widely believed to be heuristics, whose promise is observed empirically rather than obtained theoretically. We bridge this gap by considering an opto-electronic oscillator based coherent Ising machine, and providing t… ▽ More Ising machines are purported to be better at solving large-scale combinatorial optimisation problems better than conventional von Neumann computers. However, these Ising machines are widely believed to be heuristics, whose promise is observed empirically rather than obtained theoretically. We bridge this gap by considering an opto-electronic oscillator based coherent Ising machine, and providing the first analytical proof that under reasonable assumptions, the OEO-CIM is not a heuristic approach. We find and prove bounds on its performance in terms of the expected difference between the objective value at the final iteration and the optimal one, and on the number of iterations required by it. In the process, we emphasise on some of its limitations such as the inability to handle asymmetric coupling between spins, and the absence of external magnetic field applied on them (both of which are necessary in many optimisation problems), along with some issues in its convergence. We overcome these limitations by proposing suitable adjustments and prove that the improved architecture is guaranteed to converge to the optimum of the relaxed objective function. △ Less

Submitted 7 December, 2023; originally announced December 2023.

arXiv:2311.05711 [pdf, other]

Discrete Dynamics and Supergeometry

Authors: Subhobrata Chatterjee, Andrew Waldron, Cem Yetişmişoğlu

Abstract: We formulate a geometric measurement theory of dynamical classical systems possessing both continuous and discrete degrees of freedom. The approach is covariant with respect to choices of clocks and canonically incorporates laboratories. The latter are embedded symplectic submanifolds of an odd-dimensional symplectic structure. When suitably defined, symplectic geometry in odd dimensions is exactl… ▽ More We formulate a geometric measurement theory of dynamical classical systems possessing both continuous and discrete degrees of freedom. The approach is covariant with respect to choices of clocks and canonically incorporates laboratories. The latter are embedded symplectic submanifolds of an odd-dimensional symplectic structure. When suitably defined, symplectic geometry in odd dimensions is exactly the structure needed for covariance. A fundamentally probabilistic viewpoint allows classical supergeometries to describe discrete dynamics. We solve the problem of how to construct probabilistic measures on supermanifolds given a (possibly odd dimensional) supersymplectic structure. This relies on a superanalog of the Hodge star for differential forms and a description of probabilities by convex cones. We also show how stochastic processes such as Markov chains can be described by supergeometry. △ Less

Submitted 9 November, 2023; originally announced November 2023.

Comments: 41 pages, 1 figure, LaTeX

arXiv:2310.10876 [pdf, ps, other]

Spectral gap of nonreversible Markov chains

Authors: Sourav Chatterjee

Abstract: We define the spectral gap of a Markov chain on a finite state space as the second-smallest singular value of the generator of the chain, generalizing the usual definition of spectral gap for reversible chains. We then define the relaxation time of the chain as the inverse of this spectral gap, and show that this relaxation time can be characterized, for any Markov chain, as the time required for… ▽ More We define the spectral gap of a Markov chain on a finite state space as the second-smallest singular value of the generator of the chain, generalizing the usual definition of spectral gap for reversible chains. We then define the relaxation time of the chain as the inverse of this spectral gap, and show that this relaxation time can be characterized, for any Markov chain, as the time required for convergence of empirical averages. This relaxation time is related to the Cheeger constant and the mixing time of the chain through inequalities that are similar to the reversible case, and the path argument can be used to get upper bounds. Several examples are worked out. An interesting finding from the examples is that the time for convergence of empirical averages in nonreversible chains can often be substantially smaller than the mixing time. △ Less

Submitted 13 November, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: 40 pages. Minor corrections and simplifications in this revision

MSC Class: 60J10

arXiv:2310.04401 [pdf, other]

Neighbour Sum Patterns : Chessboards to Toroidal Worlds

Authors: Sayan Dutta, Ayanava Mandal, Sohom Gupta, Sourin Chatterjee

Abstract: We say that a chessboard filled with integer entries satisfies the neighbour-sum property if the number appearing on each cell is the sum of entries in its neighbouring cells, where neighbours are cells sharing a common edge or vertex. We show that an $n\times n$ chessboard satisfies this property if and only if $n\equiv 5\pmod 6$. Existence of solutions is further investigated of rectangular, tor… ▽ More We say that a chessboard filled with integer entries satisfies the neighbour-sum property if the number appearing on each cell is the sum of entries in its neighbouring cells, where neighbours are cells sharing a common edge or vertex. We show that an $n\times n$ chessboard satisfies this property if and only if $n\equiv 5\pmod 6$. Existence of solutions is further investigated of rectangular, toroidal boards, as well as on Neumann neighbourhoods, including a nice connection to discrete harmonic functions. Construction of solutions on infinite boards are also presented. Finally, answers to three dimensional analogues of these boards are explored using properties of cyclotomic polynomials and relevant ideas conjectured. △ Less

Submitted 6 October, 2023; originally announced October 2023.

MSC Class: 11C20; 39A06; 12H99; 15B05

arXiv:2309.10864 [pdf, other]

A dynamic mean-field statistical model of academic collaboration

Authors: Soumendu Sundar Mukherjee, Tamojit Sadhukhan, Shirshendu Chatterjee

Abstract: There is empirical evidence that collaboration in academia has increased significantly during the past few decades, perhaps due to the breathtaking advancements in communication and technology during this period. Multi-author articles have become more frequent than single-author ones. Interdisciplinary collaboration is also on the rise. Although there have been several studies on the dynamical asp… ▽ More There is empirical evidence that collaboration in academia has increased significantly during the past few decades, perhaps due to the breathtaking advancements in communication and technology during this period. Multi-author articles have become more frequent than single-author ones. Interdisciplinary collaboration is also on the rise. Although there have been several studies on the dynamical aspects of collaboration networks, systematic statistical models which theoretically explain various empirically observed features of such networks have been lacking. In this work, we propose a dynamic mean-field model and an associated estimation framework for academic collaboration networks. We primarily focus on how the degree of collaboration of a typical author, rather than the local structure of her collaboration network, changes over time. We consider several popular indices of collaboration from the literature and study their dynamics under the proposed model. In particular, we obtain exact formulae for the expectations and temporal rates of change of these indices. Through extensive simulation experiments, we demonstrate that the proposed model has enough flexibility to capture various phenomena characteristic of real-world collaboration networks. Using metadata on papers from the arXiv repository, we empirically study the mean-field collaboration dynamics in disciplines such as Computer Science, Mathematics and Physics. △ Less

Submitted 19 September, 2023; originally announced September 2023.

Comments: 27 pages, 20 figures

arXiv:2309.05355 [pdf, ps, other]

Parallel transport on a Lie 2-group bundle over a Lie groupoid along Haefliger paths

Authors: Saikat Chatterjee, Adittya Chaudhuri

Abstract: We prove a Lie 2-group torsor version of the well-known one-one correspondence between fibered categories and pseudofunctors. Consequently, we obtain a weak version of the principal Lie group bundle over a Lie groupoid. The correspondence also enables us to extend a particular class of principal 2-bundles to be defined over differentiable stacks. We show that the differential geometric connection… ▽ More We prove a Lie 2-group torsor version of the well-known one-one correspondence between fibered categories and pseudofunctors. Consequently, we obtain a weak version of the principal Lie group bundle over a Lie groupoid. The correspondence also enables us to extend a particular class of principal 2-bundles to be defined over differentiable stacks. We show that the differential geometric connection structures introduced in the authors' previous work, combine nicely with the underlying fibration structure of a principal 2-bundle over a Lie groupoid. This interrelation allows us to derive a notion of parallel transport in the framework of principal 2-bundles over Lie groupoids along a particular class of Haefliger paths. The corresponding parallel transport functor is shown to be smooth. We apply our results to examine the parallel transport on an associated VB-groupoid. △ Less

Submitted 11 September, 2023; originally announced September 2023.

MSC Class: Primary 53C08; Secondary 22A22; 58H05

arXiv:2308.04359 [pdf, ps, other]

doi 10.1016/j.jmaa.2023.128008

A characterization of bounded balanced convex domains in $\mathbb{C}^n$

Authors: Sanjoy Chatterjee, Golam Mostafa Mondal

Abstract: In this paper, we investigate the characterization of balanced bounded convex domains in $\mathbb{C}^n$ in terms of the squeezing function. As an application, we provide a characterization of the polydisc in $\mathbb{C}^n$. In this paper, we investigate the characterization of balanced bounded convex domains in $\mathbb{C}^n$ in terms of the squeezing function. As an application, we provide a characterization of the polydisc in $\mathbb{C}^n$. △ Less

Submitted 13 February, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

Comments: 12 pages. Introduction revised and Remark 1.3 added for greater clarity. Typos corrected; proof of Proposition 2.9 extensively rewritten. This version is being posted because v2 mistakenly concatenates two documents. Published in J. Math. Anal. Appl.; DOI of the published article given elsewhere on this page>>

MSC Class: 32F45; 32H02

arXiv:2307.07634 [pdf, ps, other]

Features of a spin glass in the random field Ising model

Authors: Sourav Chatterjee

Abstract: A longstanding open question in the theory of disordered systems is whether short-range models, such as the random field Ising model or the Edwards-Anderson model, can indeed have the famous properties that characterize mean-field spin glasses at nonzero temperature. This article shows that this is at least partially possible in the case of the random field Ising model. Consider the Ising model on… ▽ More A longstanding open question in the theory of disordered systems is whether short-range models, such as the random field Ising model or the Edwards-Anderson model, can indeed have the famous properties that characterize mean-field spin glasses at nonzero temperature. This article shows that this is at least partially possible in the case of the random field Ising model. Consider the Ising model on a discrete $d$-dimensional cube under free boundary condition, subjected to a very weak i.i.d. random external field, where the field strength is inversely proportional to the square-root of the number of sites. It turns out that in $d\ge 2$ and at subcritical temperatures, this model has some of the key features of a mean-field spin glass. Namely, (a) the site overlap exhibits one step of replica symmetry breaking, (b) the quenched distribution of the overlap is non-self-averaging, and (c) the overlap has the Parisi ultrametric property. Furthermore, it is shown that for Gaussian disorder, replica symmetry does not break if the field strength is taken to be stronger than the one prescribed above, and non-self-averaging fails if it is weaker, showing that the above order of field strength is the only one that allows all three properties to hold. However, the model does not have two other features of mean-field models. Namely, (a) it does not satisfy the Ghirlanda-Guerra identities, and (b) it has only two pure states instead of many. △ Less

Submitted 7 March, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

Comments: 35 pages. Final version, to appear in Comm. Math. Phys

MSC Class: 82B44; 82D30

arXiv:2307.05429 [pdf, ps, other]

A study of spirallike domains: polynomial convexity, Loewner chains and dense holomorphic curves

Authors: Sanjoy Chatterjee, Sushil Gorai

Abstract: In this paper, we prove that the closure of a bounded pseudoconvex domain, which is spirallike with respect to a globally asymptotic stable holomorphic vector field, is polynomially convex. We also provide a necessary and sufficient condition, in terms of polynomial convexity, on a univalent function defined on a strongly convex domain for embedding it into a filtering Loewner chain. Next, we prov… ▽ More In this paper, we prove that the closure of a bounded pseudoconvex domain, which is spirallike with respect to a globally asymptotic stable holomorphic vector field, is polynomially convex. We also provide a necessary and sufficient condition, in terms of polynomial convexity, on a univalent function defined on a strongly convex domain for embedding it into a filtering Loewner chain. Next, we provide an application of our first result. We show that for any bounded pseudoconvex strictly spirallike domain $Ω$ in $\mathbb{C}^n$ and given any connected complex manifold $Y$, there exists a holomorphic map from the unit disc to the space of all holomorphic maps from $Ω$ to $Y$. This also yields us the existence of $\mathcal{O}(Ω, Y)$-universal map for any generalized translation on $Ω$, which, in turn, is connected to the hypercyclicity of certain composition operators on the space of manifold valued holomorphic maps. △ Less

Submitted 11 July, 2023; originally announced July 2023.

Comments: 25 pages, comments are welcome

MSC Class: 32E20; 32H02; 30K20

arXiv:2306.16521 [pdf, ps, other]

Enumerative Theory for the Tsetlin Library

Authors: Sourav Chatterjee, Persi Diaconis, Gene B. Kim

Abstract: The Tsetlin library is a well-studied Markov chain on the symmetric group $S_n$. It has stationary distribution $π(σ)$ the Luce model, a nonuniform distribution on $S_n$, which appears in psychology, horse race betting, and tournament poker. Simple enumerative questions, such as ``what is the distribution of the top $k$ cards?'' or ``what is the distribution of the bottom $k$ cards?'' are long ope… ▽ More The Tsetlin library is a well-studied Markov chain on the symmetric group $S_n$. It has stationary distribution $π(σ)$ the Luce model, a nonuniform distribution on $S_n$, which appears in psychology, horse race betting, and tournament poker. Simple enumerative questions, such as ``what is the distribution of the top $k$ cards?'' or ``what is the distribution of the bottom $k$ cards?'' are long open. We settle these questions and draw attention to a host of parallel questions on the extension to the chambers of a hyperplane arrangement. △ Less

Submitted 28 June, 2023; originally announced June 2023.

Comments: 23 pages, 1 figure

MSC Class: 60B15; 60J99

arXiv:2305.12512 [pdf, ps, other]

Central Limit Theorem for Gram-Schmidt Random Walk Design

Authors: Sabyasachi Chatterjee, Partha S. Dey, Subhajit Goswami

Abstract: We prove a central limit theorem for the Horvitz-Thompson estimator based on the Gram-Schmidt Walk (GSW) design, recently developed in Harshaw et al.(2022). In particular, we consider the version of the GSW design which uses randomized pivot order, thereby answering an open question raised in the same article. We deduce this under minimal and global assumptions involving only the problem parameter… ▽ More We prove a central limit theorem for the Horvitz-Thompson estimator based on the Gram-Schmidt Walk (GSW) design, recently developed in Harshaw et al.(2022). In particular, we consider the version of the GSW design which uses randomized pivot order, thereby answering an open question raised in the same article. We deduce this under minimal and global assumptions involving only the problem parameters such as the (sum) potential outcome vector and the covariate matrix. As an interesting consequence of our analysis we also obtain the precise limiting variance of the estimator in terms of these parameters which is smaller than the previously known upper bound. The main ingredients are a simplified skeletal process approximating the GSW design and concentration phenomena for random matrices obtained from random sampling using the Stein's method for exchangeable pairs. △ Less

Submitted 5 June, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

Comments: 35 pages. Some typo's fixed in the arxiv abstract to fit arxiv's abstract requirements

MSC Class: 60F05; 62K99; 62D20; 60G42; 62E20

arXiv:2302.00479 [pdf, other]

Characterising Solutions of Anomalous Cancellation

Authors: Satvik Saha, Sohom Gupta, Sayan Dutta, Sourin Chatterjee

Abstract: Anomalous cancellation of fractions is a mathematically inaccurate method where cancelling the common digits of the numerator and denominator correctly reduces it. While it appears to be accidentally successful, the property of anomalous cancellation is intricately connected to the number of digits of the denominator as well as the base in which the fraction is represented. Previous work have been… ▽ More Anomalous cancellation of fractions is a mathematically inaccurate method where cancelling the common digits of the numerator and denominator correctly reduces it. While it appears to be accidentally successful, the property of anomalous cancellation is intricately connected to the number of digits of the denominator as well as the base in which the fraction is represented. Previous work have been mostly surrounding three digit solutions or specific properties of the same. This paper seeks to get general results regarding the structure of numbers that follow the cancellation property (denoted by $P^*_{\ell; k}$) and an estimate of the total number of solutions possible in a given base representation. In particular, interesting properties regarding the saturation of the number of solutions in general and $p^n$ bases (where $p$ is a prime) have been studied in detail. △ Less

Submitted 31 January, 2023; originally announced February 2023.

Comments: 17 pages, 1 figure

MSC Class: 11D72; 00A08

arXiv:2301.04112 [pdf, ps, other]

Spin glass phase at zero temperature in the Edwards-Anderson model

Authors: Sourav Chatterjee

Abstract: While the analysis of mean-field spin glass models has seen tremendous progress in the last twenty years, lattice spin glasses have remained largely intractable. This article presents the solutions to a number of questions about the Edwards-Anderson model of short-range spin glasses (in all dimensions) that were raised in the physics literature many years ago. First, it is shown that the ground st… ▽ More While the analysis of mean-field spin glass models has seen tremendous progress in the last twenty years, lattice spin glasses have remained largely intractable. This article presents the solutions to a number of questions about the Edwards-Anderson model of short-range spin glasses (in all dimensions) that were raised in the physics literature many years ago. First, it is shown that the ground state is sensitive to small perturbations of the disorder, in the sense that a small amount of noise gives rise to a new ground state that is nearly orthogonal to the old one with respect to the site overlap inner product. Second, it is shown that one can overturn a macroscopic fraction of the spins in the ground state with an energy cost that is negligible compared to the size of the boundary of the overturned region - a feature that is believed to be typical of spin glasses but clearly absent in ferromagnets. The third result is that the boundary of the overturned region in dimension $d$ has fractal dimension strictly greater than $d-1$, confirming a prediction from physics. The fourth result is that the correlations between bonds in the ground state can decay at most like the inverse of the distance. This contrasts with the random field Ising model, where it has been shown recently that the correlation decays exponentially in distance in dimension two. The fifth result is that the expected size of the critical droplet of a bond grows at least like a power of the volume. Taken together, these results comprise the first mathematical proof of glassy behavior in a short-range spin glass model. △ Less

Submitted 28 February, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

Comments: 27 pages. Some minor corrections in this revision

MSC Class: 82B44; 82D30

arXiv:2212.10064 [pdf, other]

AdverSAR: Adversarial Search and Rescue via Multi-Agent Reinforcement Learning

Authors: Aowabin Rahman, Arnab Bhattacharya, Thiagarajan Ramachandran, Sayak Mukherjee, Himanshu Sharma, Ted Fujimoto, Samrat Chatterjee

Abstract: Search and Rescue (SAR) missions in remote environments often employ autonomous multi-robot systems that learn, plan, and execute a combination of local single-robot control actions, group primitives, and global mission-oriented coordination and collaboration. Often, SAR coordination strategies are manually designed by human experts who can remotely control the multi-robot system and enable semi-a… ▽ More Search and Rescue (SAR) missions in remote environments often employ autonomous multi-robot systems that learn, plan, and execute a combination of local single-robot control actions, group primitives, and global mission-oriented coordination and collaboration. Often, SAR coordination strategies are manually designed by human experts who can remotely control the multi-robot system and enable semi-autonomous operations. However, in remote environments where connectivity is limited and human intervention is often not possible, decentralized collaboration strategies are needed for fully-autonomous operations. Nevertheless, decentralized coordination may be ineffective in adversarial environments due to sensor noise, actuation faults, or manipulation of inter-agent communication data. In this paper, we propose an algorithmic approach based on adversarial multi-agent reinforcement learning (MARL) that allows robots to efficiently coordinate their strategies in the presence of adversarial inter-agent communications. In our setup, the objective of the multi-robot team is to discover targets strategically in an obstacle-strewn geographical area by minimizing the average time needed to find the targets. It is assumed that the robots have no prior knowledge of the target locations, and they can interact with only a subset of neighboring robots at any time. Based on the centralized training with decentralized execution (CTDE) paradigm in MARL, we utilize a hierarchical meta-learning framework to learn dynamic team-coordination modalities and discover emergent team behavior under complex cooperative-competitive scenarios. The effectiveness of our approach is demonstrated on a collection of prototype grid-world environments with different specifications of benign and adversarial agents, target locations, and agent rewards. △ Less

Submitted 20 December, 2022; originally announced December 2022.

arXiv:2212.03921 [pdf, other]

Online Distributed Algorithm for Optimal Power Flow problem with Regret Analysis

Authors: Sushobhan Chatterjee, Rachel Kalpana Kalaimani

Abstract: We investigate the distributed DC-Optimal Power Flow (DC-OPF) problem for a dynamic and uncertain environment. The unpredictable supply of renewable resources and varying prices of the electricity market are a few factors responsible for the uncertainty. We propose to address this problem using the framework of online convex optimization, where the cost functions are not known apriori because of t… ▽ More We investigate the distributed DC-Optimal Power Flow (DC-OPF) problem for a dynamic and uncertain environment. The unpredictable supply of renewable resources and varying prices of the electricity market are a few factors responsible for the uncertainty. We propose to address this problem using the framework of online convex optimization, where the cost functions are not known apriori because of the uncertainty and are revealed only incrementally over time. We also consider a distributed setting, where each agent (generators and loads) in the power network is only privy to their own local objectives and constraints but can communicate with their neighbours. A distributed online algorithm is proposed based on the modified primal-dual approach. The performance of the online algorithm is evaluated using the regret (static) function, which is the difference between the actual cost incurred by employing the proposed algorithm and the optimal fixed decision in hindsight. Since we deal with a constrained optimization problem, analogous to the notion of regret the accumulation of the constraint violation is also calculated at each step. We establish a sub-linear bound on the static regret and constraint violation under suitable assumptions on step-size and cost function. Finally, we use the standard IEEE-14 bus system to demonstrate the performance of our algorithm. △ Less

Submitted 9 August, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

Comments: 11 pages, 4 figures, Under Review

arXiv:2211.04702 [pdf, other]

A survey of some recent developments in measures of association

Authors: Sourav Chatterjee

Abstract: This paper surveys some recent developments in measures of association related to a new coefficient of correlation introduced by the author. A straightforward extension of this coefficient to standard Borel spaces (which includes all Polish spaces), overlooked in the literature so far, is proposed at the end of the survey. This paper surveys some recent developments in measures of association related to a new coefficient of correlation introduced by the author. A straightforward extension of this coefficient to standard Borel spaces (which includes all Polish spaces), overlooked in the literature so far, is proposed at the end of the survey. △ Less

Submitted 9 August, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

Comments: 22 pages. Minor changes in this revision

MSC Class: 62H20; 62H15

arXiv:2209.07028 [pdf, other]

Estimating large causal polytrees from small samples

Authors: Sourav Chatterjee, Mathukumalli Vidyasagar

Abstract: We consider the problem of estimating a large causal polytree from a relatively small i.i.d. sample. This is motivated by the problem of determining causal structure when the number of variables is very large compared to the sample size, such as in gene regulatory networks. We give an algorithm that recovers the tree with high accuracy in such settings. The algorithm works under essentially no dis… ▽ More We consider the problem of estimating a large causal polytree from a relatively small i.i.d. sample. This is motivated by the problem of determining causal structure when the number of variables is very large compared to the sample size, such as in gene regulatory networks. We give an algorithm that recovers the tree with high accuracy in such settings. The algorithm works under essentially no distributional or modeling assumptions other than some mild non-degeneracy conditions. △ Less

Submitted 29 March, 2024; v1 submitted 14 September, 2022; originally announced September 2022.

Comments: 26 pages. An R package has been developed (see link in the article), and a real data example has been added

MSC Class: 62D20

arXiv:2208.11107 [pdf, ps, other]

Approximations on Spirallike domains of $\mathbb{C}^{n}$

Authors: Sanjoy Chatterjee, Sushil Gorai

Abstract: In this paper, we first show that any domain $\Om$ in $\cn(n \geq 2)$, which is spirallike with respect to a complete holomorphic globally asymptotic stable vector field $F$, is a Runge domain. Next, we prove an Andersén-Lempert type approximation theorem: any biholomorphism $Φ\colon \Om \to Φ(\Om)$, with $Φ(\Om)$ is Runge, can be approximated by automorphisms of $\mathbb{C}^{n}$ uniformly on comp… ▽ More In this paper, we first show that any domain $\Om$ in $\cn(n \geq 2)$, which is spirallike with respect to a complete holomorphic globally asymptotic stable vector field $F$, is a Runge domain. Next, we prove an Andersén-Lempert type approximation theorem: any biholomorphism $Φ\colon \Om \to Φ(\Om)$, with $Φ(\Om)$ is Runge, can be approximated by automorphisms of $\mathbb{C}^{n}$ uniformly on compacts, in the following two cases. \begin{itemize} \item [(i)] The domain $\Om\subset\cn$ is a spirallike with respect to a linear vector field $A$, where $2\max\{\rlλ:λ\inσ(A)\}<\min\{\rlλ:λ\inσ(A)\}$. \item [(ii)] The domain $\Om$ is spirallike with respect to complete globally exponentially stable vector field $F$, with a certain rate of the convergence of the flow of the vector field $F$ in $\Om $ \end{itemize} We further show that, if $J(Φ) \equiv 1$ (and $div(F)$ is constant in the situation (ii)) then the biholomorphism $Φ\colon \Om \to Φ(\Om)$ can be approximated by volume preserving automorphism of $\cn$ in both the cases mentioned above. As an application of our approximation results, we show that any Loewner PDE in a complete hyperbolic domain $\Om$ which satisfies (i) or (ii) mentioned above admits an essentially unique univalent solution with values in $\cn$. We also provide an example of a Hartogs domain in $\mathbb{C}^{2}$ which spirallike with respect to a complete holomorphic vector field $F(z_{1},z_{2})=(-2z_{1},-3z_{2}+z_{1}z_{2})$, but the domain is not spirallike with respect to any linear vector field. Some more examples are provided at the end of this paper. △ Less

Submitted 4 March, 2024; v1 submitted 23 August, 2022; originally announced August 2022.

Comments: 39 page, comments are welcome

MSC Class: 32M17; 32E30; 30C45

arXiv:2208.02492 [pdf, ps, other]

An invariance principle for the 1D KPZ equation

Authors: Arka Adhikari, Sourav Chatterjee

Abstract: Consider a discrete one-dimensional random surface whose height at a point grows as a function of the heights at neighboring points plus an independent random noise. Assuming that this function is equivariant under constant shifts, symmetric in its arguments, and at least six times continuously differentiable in a neighborhood of the origin, we show that as the variance of the noise goes to zero,… ▽ More Consider a discrete one-dimensional random surface whose height at a point grows as a function of the heights at neighboring points plus an independent random noise. Assuming that this function is equivariant under constant shifts, symmetric in its arguments, and at least six times continuously differentiable in a neighborhood of the origin, we show that as the variance of the noise goes to zero, any such process converges to the Cole-Hopf solution of the 1D KPZ equation under a suitable scaling of space and time. This proves an invariance principle for the 1D KPZ equation, in the spirit of Donsker's invariance principle for Brownian motion. △ Less

Submitted 1 September, 2023; v1 submitted 4 August, 2022; originally announced August 2022.

Comments: 32 pages. To appear in Ann. Probab

MSC Class: 60F05; 82C05

arXiv:2208.01365 [pdf, other]

Concentration inequalities for correlated network-valued processes with applications to community estimation and changepoint analysis

Authors: Sayak Chatterjee, Shirshendu Chatterjee, Soumendu Sundar Mukherjee, Anirban Nath, Sharmodeep Bhattacharyya

Abstract: Network-valued time series are currently a common form of network data. However, the study of the aggregate behavior of network sequences generated from network-valued stochastic processes is relatively rare. Most of the existing research focuses on the simple setup where the networks are independent (or conditionally independent) across time, and all edges are updated synchronously at each time s… ▽ More Network-valued time series are currently a common form of network data. However, the study of the aggregate behavior of network sequences generated from network-valued stochastic processes is relatively rare. Most of the existing research focuses on the simple setup where the networks are independent (or conditionally independent) across time, and all edges are updated synchronously at each time step. In this paper, we study the concentration properties of the aggregated adjacency matrix and the corresponding Laplacian matrix associated with network sequences generated from lazy network-valued stochastic processes, where edges update asynchronously, and each edge follows a lazy stochastic process for its updates independent of the other edges. We demonstrate the usefulness of these concentration results in proving consistency of standard estimators in community estimation and changepoint estimation problems. We also conduct a simulation study to demonstrate the effect of the laziness parameter, which controls the extent of temporal correlation, on the accuracy of community and changepoint estimation. △ Less

Submitted 2 August, 2022; originally announced August 2022.

Comments: 27 pages, 4 figures

arXiv:2205.15022 [pdf, ps, other]

doi 10.22130/scma.2023.554520.1117

Some basic results on fuzzy strong $φ$-b-normed linear spaces

Authors: Abhishikta Das, T. Bag, S. Chatterjee

Abstract: In this paper, definition of fuzzy strong $φ$-b-normed linear space is given. Here the scalar function |c| is replaced by a general function $φ$(c) where φ satisfies some properties. Some basic results on finite dimensional fuzzy strong $φ$-b-normed linear space are studied. In this paper, definition of fuzzy strong $φ$-b-normed linear space is given. Here the scalar function |c| is replaced by a general function $φ$(c) where φ satisfies some properties. Some basic results on finite dimensional fuzzy strong $φ$-b-normed linear space are studied. △ Less

Submitted 25 May, 2022; originally announced May 2022.

Comments: 10 pages

MSC Class: 54A40; 03E72

Journal ref: Sahand Communications in Mathematical Analysis (SCMA) 20 (2) (2023) 183.196

arXiv:2205.06894 [pdf, ps, other]

A random walk on the Rado graph

Authors: Sourav Chatterjee, Persi Diaconis, Laurent Miclo

Abstract: The Rado graph, also known as the random graph $G(\infty, p)$, is a classical limit object for finite graphs. We study natural ball walks as a way of understanding the geometry of this graph. For the walk started at $i$, we show that order $\log_2^*i$ steps are sufficient, and for infinitely many $i$, necessary for convergence to stationarity. The proof involves an application of Hardy's inequalit… ▽ More The Rado graph, also known as the random graph $G(\infty, p)$, is a classical limit object for finite graphs. We study natural ball walks as a way of understanding the geometry of this graph. For the walk started at $i$, we show that order $\log_2^*i$ steps are sufficient, and for infinitely many $i$, necessary for convergence to stationarity. The proof involves an application of Hardy's inequality for trees. △ Less

Submitted 13 May, 2022; originally announced May 2022.

Comments: 43 pages

arXiv:2205.02264 [pdf, other]

DeepBayes -- an estimator for parameter estimation in stochastic nonlinear dynamical models

Authors: Anubhab Ghosh, Mohamed Abdalmoaty, Saikat Chatterjee, Håkan Hjalmarsson

Abstract: Stochastic nonlinear dynamical systems are ubiquitous in modern, real-world applications. Yet, estimating the unknown parameters of stochastic, nonlinear dynamical models remains a challenging problem. The majority of existing methods employ maximum likelihood or Bayesian estimation. However, these methods suffer from some limitations, most notably the substantial computational time for inference… ▽ More Stochastic nonlinear dynamical systems are ubiquitous in modern, real-world applications. Yet, estimating the unknown parameters of stochastic, nonlinear dynamical models remains a challenging problem. The majority of existing methods employ maximum likelihood or Bayesian estimation. However, these methods suffer from some limitations, most notably the substantial computational time for inference coupled with limited flexibility in application. In this work, we propose DeepBayes estimators that leverage the power of deep recurrent neural networks in learning an estimator. The method consists of first training a recurrent neural network to minimize the mean-squared estimation error over a set of synthetically generated data using models drawn from the model set of interest. The a priori trained estimator can then be used directly for inference by evaluating the network with the estimation data. The deep recurrent neural network architectures can be trained offline and ensure significant time savings during inference. We experiment with two popular recurrent neural networks -- long short term memory network (LSTM) and gated recurrent unit (GRU). We demonstrate the applicability of our proposed method on different example models and perform detailed comparisons with state-of-the-art approaches. We also provide a study on a real-world nonlinear benchmark problem. The experimental evaluations show that the proposed approach is asymptotically as good as the Bayes estimator. △ Less

Submitted 4 May, 2022; originally announced May 2022.

arXiv:2203.16587 [pdf, other]

Spatially Adaptive Online Prediction of Piecewise Regular Functions

Authors: Sabyasachi Chatterjee, Subhajit Goswami

Abstract: We consider the problem of estimating piecewise regular functions in an online setting, i.e., the data arrive sequentially and at any round our task is to predict the value of the true function at the next revealed point using the available data from past predictions. We propose a suitably modified version of a recently developed online learning algorithm called the slee** experts aggregation al… ▽ More We consider the problem of estimating piecewise regular functions in an online setting, i.e., the data arrive sequentially and at any round our task is to predict the value of the true function at the next revealed point using the available data from past predictions. We propose a suitably modified version of a recently developed online learning algorithm called the slee** experts aggregation algorithm. We show that this estimator satisfies oracle risk bounds simultaneously for all local regions of the domain. As concrete instantiations of the expert aggregation algorithm proposed here, we study an online mean aggregation and an online linear regression aggregation algorithm where experts correspond to the set of dyadic subrectangles of the domain. The resulting algorithms are near linear time computable in the sample size. We specifically focus on the performance of these online algorithms in the context of estimating piecewise polynomial and bounded variation function classes in the fixed design setup. The simultaneous oracle risk bounds we obtain for these estimators in this context provide new and improved (in certain aspects) guarantees even in the batch setting and are not available for the state of the art batch learning estimators. △ Less

Submitted 30 March, 2022; originally announced March 2022.

Comments: 34 pages, 12 figures

MSC Class: 62G05; 62G08

arXiv:2203.16462 [pdf, other]

Convergence of gradient descent for deep neural networks

Authors: Sourav Chatterjee

Abstract: This article presents a criterion for convergence of gradient descent to a global minimum, which is then used to show that gradient descent with proper initialization converges to a global minimum when training any feedforward neural network with smooth and strictly increasing activation functions, provided that the input dimension is greater than or equal to the number of data points. The main di… ▽ More This article presents a criterion for convergence of gradient descent to a global minimum, which is then used to show that gradient descent with proper initialization converges to a global minimum when training any feedforward neural network with smooth and strictly increasing activation functions, provided that the input dimension is greater than or equal to the number of data points. The main difference with prior work is that the width of the network can be a fixed number instead of growing as some multiple or power of the number of data points. △ Less

Submitted 17 December, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

Comments: 23 pages. The article has been majorly reorganized, by deleting some unnecessary materials. Some minor errors have been fixed. (Theorem numbers have changed in this revision.)

arXiv:2203.16451 [pdf, ps, other]

Distributed Optimization of Average Consensus Containment with Multiple Stationary Leaders

Authors: Sushobhan Chatterjee, Rachel Kalpana Kalaimani

Abstract: In this paper, we consider the problem of containment control of multi-agent systems with multiple stationary leaders, interacting over a directed network. While, containment control refers to just ensuring that the follower agents reach the convex hull of the leaders states, we focus on the problem where the followers achieve a consensus to the average values of the leaders states. We propose an… ▽ More In this paper, we consider the problem of containment control of multi-agent systems with multiple stationary leaders, interacting over a directed network. While, containment control refers to just ensuring that the follower agents reach the convex hull of the leaders states, we focus on the problem where the followers achieve a consensus to the average values of the leaders states. We propose an algorithm that can be implemented in a distributed manner to achieve the above consensus among followers. Next we optimize the convergence rate of the followers to the average consensus by proper choice of weights for the interaction graph. This optimization is also performed in a distributed manner using Alternating Direction Method of Multipliers (ADMM). Finally, we complement our results by illustrating them with numerical examples. △ Less

Submitted 30 March, 2022; originally announced March 2022.

Comments: Accepted in 2022 European Control Conference

arXiv:2203.04369 [pdf, ps, other]

Element-wise Estimation Error of Generalized Fused Lasso

Authors: Teng Zhang, Sabyasachi Chatterjee

Abstract: The main result of this article is that we obtain an elementwise error bound for the Fused Lasso estimator for any general convex loss function $ρ$. We then focus on the special cases when either $ρ$ is the square loss function (for mean regression) or is the quantile loss function (for quantile regression) for which we derive new pointwise error bounds. Even though error bounds for the usual Fuse… ▽ More The main result of this article is that we obtain an elementwise error bound for the Fused Lasso estimator for any general convex loss function $ρ$. We then focus on the special cases when either $ρ$ is the square loss function (for mean regression) or is the quantile loss function (for quantile regression) for which we derive new pointwise error bounds. Even though error bounds for the usual Fused Lasso estimator and its quantile version have been studied before; our bound appears to be new. This is because all previous works bound a global loss function like the sum of squared error, or a sum of Huber losses in the case of quantile regression in Padilla and Chatterjee (2021). Clearly, element wise bounds are stronger than global loss error bounds as it reveals how the loss behaves locally at each point. Our element wise error bound also has a clean and explicit dependence on the tuning parameter $λ$ which informs the user of a good choice of $λ$. In addition, our bound is nonasymptotic with explicit constants and is able to recover almost all the known results for Fused Lasso (both mean and quantile regression) with additional improvements in some cases. △ Less

Submitted 18 March, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

arXiv:2201.02654 [pdf, other]

A Cross Validation Framework for Signal Denoising with Applications to Trend Filtering, Dyadic CART and Beyond

Authors: Anamitra Chaudhuri, Sabyasachi Chatterjee

Abstract: This paper formulates a general cross validation framework for signal denoising. The general framework is then applied to nonparametric regression methods such as Trend Filtering and Dyadic CART. The resulting cross validated versions are then shown to attain nearly the same rates of convergence as are known for the optimally tuned analogues. There did not exist any previous theoretical analyses o… ▽ More This paper formulates a general cross validation framework for signal denoising. The general framework is then applied to nonparametric regression methods such as Trend Filtering and Dyadic CART. The resulting cross validated versions are then shown to attain nearly the same rates of convergence as are known for the optimally tuned analogues. There did not exist any previous theoretical analyses of cross validated versions of Trend Filtering or Dyadic CART. To illustrate the generality of the framework we also propose and study cross validated versions of two fundamental estimators; lasso for high dimensional linear regression and singular value thresholding for matrix estimation. Our general framework is inspired by the ideas in Chatterjee and Jafarov (2015) and is potentially applicable to a wide range of estimation methods which use tuning parameters. △ Less

Submitted 3 May, 2023; v1 submitted 7 January, 2022; originally announced January 2022.

MSC Class: Primary 62G05; 62G08

arXiv:2112.08535 [pdf, other]

Fractional cyber-neural systems -- a brief survey

Authors: Emily Reed, Sarthak Chatterjee, Guilherme Ramos, Paul Bogdan, Sérgio Pequito

Abstract: Neurotechnology has made great strides in the last 20 years. However, we still have a long way to go to commercialize many of these technologies as we lack a unified framework to study cyber-neural systems (CNS) that bring the hardware, software, and the neural system together. Dynamical systems play a key role in develo** these technologies as they capture different aspects of the brain and pro… ▽ More Neurotechnology has made great strides in the last 20 years. However, we still have a long way to go to commercialize many of these technologies as we lack a unified framework to study cyber-neural systems (CNS) that bring the hardware, software, and the neural system together. Dynamical systems play a key role in develo** these technologies as they capture different aspects of the brain and provide insight into their function. Converging evidence suggests that fractional-order dynamical systems are advantageous in modeling neural systems because of their compact representation and accuracy in capturing the long-range memory exhibited in neural behavior. In this brief survey, we provide an overview of fractional CNS that entails fractional-order systems in the context of CNS. In particular, we introduce basic definitions required for the analysis and synthesis of fractional CNS, encompassing system identification, state estimation, and closed-loop control. Additionally, we provide an illustration of some applications in the context of CNS and draw some possible future research directions. Ultimately, advancements in these three areas will be critical in develo** the next generation of CNS, which will, ultimately, improve people's quality of life. △ Less

Submitted 15 December, 2021; originally announced December 2021.

Comments: 67 pages, 13 figures

arXiv:2111.12813 [pdf, ps, other]

A state space for 3D Euclidean Yang-Mills theories

Authors: Sky Cao, Sourav Chatterjee

Abstract: It is believed that Euclidean Yang-Mills theories behave like the massless Gaussian free field (GFF) at short distances. This makes it impossible to define the main observables for these theories - the Wilson loop observables - in dimensions greater than two, because line integrals of the GFF do not exist in such dimensions. Taking forward a proposal of Charalambous and Gross, this article shows t… ▽ More It is believed that Euclidean Yang-Mills theories behave like the massless Gaussian free field (GFF) at short distances. This makes it impossible to define the main observables for these theories - the Wilson loop observables - in dimensions greater than two, because line integrals of the GFF do not exist in such dimensions. Taking forward a proposal of Charalambous and Gross, this article shows that it is possible to define Euclidean Yang-Mills theories on the 3D unit torus as "random distributional gauge orbits", provided that they indeed behave like the GFF in a certain sense. One of the main technical tools is the existence of the Yang-Mills heat flow on the 3D torus starting from GFF-like initial data, which is established in a companion paper. A key consequence of this construction is that under the GFF assumption, one can define a notion of "regularized Wilson loop observables" for Euclidean Yang-Mills theories on the 3D unit torus. △ Less

Submitted 19 November, 2023; v1 submitted 24 November, 2021; originally announced November 2021.

Comments: 73 pages. Final draft. To appear Comm. Math. Phys

MSC Class: 81T13; 82B28; 60B05

arXiv:2111.10652 [pdf, ps, other]

The Yang-Mills heat flow with random distributional initial data

Authors: Sky Cao, Sourav Chatterjee

Abstract: We construct local solutions to the Yang-Mills heat flow (in the DeTurck gauge) for a certain class of random distributional initial data, which includes the 3D Gaussian free field. The main idea, which goes back to work of Bourgain as well as work of Da Prato-Debussche, is to decompose the solution into a rougher linear part and a smoother nonlinear part, and to control the latter by probabilisti… ▽ More We construct local solutions to the Yang-Mills heat flow (in the DeTurck gauge) for a certain class of random distributional initial data, which includes the 3D Gaussian free field. The main idea, which goes back to work of Bourgain as well as work of Da Prato-Debussche, is to decompose the solution into a rougher linear part and a smoother nonlinear part, and to control the latter by probabilistic arguments. In a companion work, we use the main results of this paper to propose a way towards the construction of 3D Yang-Mills measures. △ Less

Submitted 25 August, 2022; v1 submitted 20 November, 2021; originally announced November 2021.

Comments: 79 pages. Minor changes in this revision

MSC Class: 35R60; 35A01; 60G60; 81T13

arXiv:2110.10294 [pdf, other]

Existence of stationary ballistic deposition on the infinite lattice

Authors: Sourav Chatterjee

Abstract: Ballistic deposition is one of the many models of interface growth that are believed to be in the KPZ universality class, but have so far proved to be largely intractable mathematically. In this model, blocks of size one fall independently as Poisson processes at each site on the $d$-dimensional lattice, and either attach themselves to the column growing at that site, or to the side of an adjacent… ▽ More Ballistic deposition is one of the many models of interface growth that are believed to be in the KPZ universality class, but have so far proved to be largely intractable mathematically. In this model, blocks of size one fall independently as Poisson processes at each site on the $d$-dimensional lattice, and either attach themselves to the column growing at that site, or to the side of an adjacent column, whichever comes first. It is not hard to see that if we subtract off the height of the column at the origin from the heights of the other columns, the resulting interface process is Markovian. The main result of this article is that this Markov process has at least one invariant probability measure. We conjecture that the invariant measure is not unique, and provide some partial evidence. △ Less

Submitted 18 May, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

Comments: 25 pages, 2 figures. To appear in Random Structures and Algorithms. Proof of the main theorem slightly shortened in this revision

MSC Class: 60K35; 60J10; 60G10

arXiv:2110.10245 [pdf, other]

Regret Minimization in Isotonic, Heavy-Tailed Contextual Bandits via Adaptive Confidence Bands

Authors: Sabyasachi Chatterjee, Subhabrata Sen

Abstract: In this paper we initiate a study of non parametric contextual bandits under shape constraints on the mean reward function. Specifically, we study a setting where the context is one dimensional, and the mean reward function is isotonic with respect to this context. We propose a policy for this problem and show that it attains minimax rate optimal regret. Moreover, we show that the same policy enjo… ▽ More In this paper we initiate a study of non parametric contextual bandits under shape constraints on the mean reward function. Specifically, we study a setting where the context is one dimensional, and the mean reward function is isotonic with respect to this context. We propose a policy for this problem and show that it attains minimax rate optimal regret. Moreover, we show that the same policy enjoys automatic adaptation; that is, for subclasses of the parameter space where the true mean reward functions are also piecewise constant with $k$ pieces, this policy remains minimax rate optimal simultaneously for all $k \geq 1.$ Automatic adaptation phenomena are well-known for shape constrained problems in the offline setting; %The phenomenon of automatic adaptation of shape constrained methods is known to occur in offline problems; we show that such phenomena carry over to the online setting. The main technical ingredient underlying our policy is a procedure to derive confidence bands for an underlying isotonic function using the isotonic quantile estimator. The confidence band we propose is valid under heavy tailed noise, and its average width goes to $0$ at an adaptively optimal rate. We consider this to be an independent contribution to the isotonic regression literature. △ Less

Submitted 19 October, 2021; originally announced October 2021.

arXiv:2110.08665 [pdf, other]

Quantile Regression by Dyadic CART

Authors: Oscar Hernan Madrid Padilla, Sabyasachi Chatterjee

Abstract: In this paper we propose and study a version of the Dyadic Classification and Regression Trees (DCART) estimator from Donoho (1997) for (fixed design) quantile regression in general dimensions. We refer to this proposed estimator as the QDCART estimator. Just like the mean regression version, we show that a) a fast dynamic programming based algorithm with computational complexity $O(N \log N)$ exi… ▽ More In this paper we propose and study a version of the Dyadic Classification and Regression Trees (DCART) estimator from Donoho (1997) for (fixed design) quantile regression in general dimensions. We refer to this proposed estimator as the QDCART estimator. Just like the mean regression version, we show that a) a fast dynamic programming based algorithm with computational complexity $O(N \log N)$ exists for computing the QDCART estimator and b) an oracle risk bound (trading off squared error and a complexity parameter of the true signal) holds for the QDCART estimator. This oracle risk bound then allows us to demonstrate that the QDCART estimator enjoys adaptively rate optimal estimation guarantees for piecewise constant and bounded variation function classes. In contrast to existing results for the DCART estimator which requires subgaussianity of the error distribution, for our estimation guarantees to hold we do not need any restrictive tail decay assumptions on the error distribution. For instance, our results hold even when the error distribution has no first moment such as the Cauchy distribution. Apart from the Dyadic CART method, we also consider other variant methods such as the Optimal Regression Tree (ORT) estimator introduced in Chatterjee and Goswami (2019). In particular, we also extend the ORT estimator to the quantile setting and establish that it enjoys analogous guarantees. Thus, this paper extends the scope of these globally optimal regression tree based methodologies to be applicable for heavy tailed data. We then perform extensive numerical experiments on both simulated and real data which illustrate the usefulness of the proposed methods. △ Less

Submitted 16 October, 2021; originally announced October 2021.

arXiv:2110.01062 [pdf, ps, other]

doi 10.1007/s00220-022-04492-w

Local KPZ behavior under arbitrary scaling limits

Authors: Sourav Chatterjee

Abstract: One of the main difficulties in proving convergence of discrete models of surface growth to the Kardar-Parisi-Zhang (KPZ) equation in dimensions higher than one is that the correct way to take a scaling limit, so that the limit is nontrivial, is not known in a rigorous sense. To understand KPZ growth without being hindered by this issue, this article introduces a notion of "local KPZ behavior", wh… ▽ More One of the main difficulties in proving convergence of discrete models of surface growth to the Kardar-Parisi-Zhang (KPZ) equation in dimensions higher than one is that the correct way to take a scaling limit, so that the limit is nontrivial, is not known in a rigorous sense. To understand KPZ growth without being hindered by this issue, this article introduces a notion of "local KPZ behavior", which roughly means that the instantaneous growth of the surface at a point decomposes into the sum of a Laplacian term, a gradient squared term, a noise term that behaves like white noise, and a remainder term that is negligible compared to the other three terms and their sum. The main result is that for a general class of surfaces, which contains the model of directed polymers in a random environment as a special case, local KPZ behavior occurs under arbitrary scaling limits, in any dimension. △ Less

Submitted 29 July, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

Comments: 32 pages. Minor revisions in this update. To appear in Comm. Math. Phys

MSC Class: 6H15; 82C41; 35R60

arXiv:2108.04323 [pdf, ps, other]

Isomorphisms between random graphs

Authors: Sourav Chatterjee, Persi Diaconis

Abstract: Consider two independent Erdős-Rényi $G(N,1/2)$ graphs. We show that with probability tending to $1$ as $N\to\infty$, the largest induced isomorphic subgraph has size either $\lfloor x_N-\varepsilon_N\rfloor$ or $\lfloor x_N+\varepsilon_N \rfloor$, where $x_N=4\log_2 N -2 \log_2 \log_2 N - 2\log_2(4/e)+1$ and $\varepsilon_N = (4\log_2 N)^{-1/2}$. Using similar techniques, we also show that if… ▽ More Consider two independent Erdős-Rényi $G(N,1/2)$ graphs. We show that with probability tending to $1$ as $N\to\infty$, the largest induced isomorphic subgraph has size either $\lfloor x_N-\varepsilon_N\rfloor$ or $\lfloor x_N+\varepsilon_N \rfloor$, where $x_N=4\log_2 N -2 \log_2 \log_2 N - 2\log_2(4/e)+1$ and $\varepsilon_N = (4\log_2 N)^{-1/2}$. Using similar techniques, we also show that if $Γ_1$ and $Γ_2$ are independent $G(n,1/2)$ and $G(N,1/2)$ random graphs, then $Γ_2$ contains an isomorphic copy of $Γ_1$ as an induced subgraph with high probability if $n\le \lfloor y_N - \varepsilon_N \rfloor$ and does not contain an isomorphic copy of $Γ_1$ as an induced subgraph with high probability if $n>\lfloor y_N+\varepsilon_N \rfloor$, where $y_N=2\log_2 N+1$ and $\varepsilon_N$ is as above. △ Less

Submitted 30 December, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

Comments: 17 pages. To appear in J. Combin. Theory B

MSC Class: 05C80; 05C60; 60C05

arXiv:2108.00538 [pdf, ps, other]

doi 10.1007/s00205-022-01798-w

Convergence of deterministic growth models

Authors: Sourav Chatterjee, Panagiotis E. Souganidis

Abstract: We prove the uniform in space and time convergence of the scaled heights of large classes of deterministic growth models that are monotone and equivariant under translations by constants. The limits are characterized as the unique (viscosity solutions) of first- or second-order partial differential equations depending on whether the growth models are scaled hyperbolically or parabolically. The res… ▽ More We prove the uniform in space and time convergence of the scaled heights of large classes of deterministic growth models that are monotone and equivariant under translations by constants. The limits are characterized as the unique (viscosity solutions) of first- or second-order partial differential equations depending on whether the growth models are scaled hyperbolically or parabolically. The results greatly simplify and extend a recent work by the first author to more general surface growth models. The proofs are based on the methodology developed by Barles and the second author to prove convergence of approximation schemes. △ Less

Submitted 6 December, 2021; v1 submitted 1 August, 2021; originally announced August 2021.

Comments: 28 pages. One new example and several other major changes in this revision

arXiv:2107.14347 [pdf, other]

Subcritical Connectivity and Some Exact Tail Exponents in High Dimensional Percolation

Authors: Shirshendu Chatterjee, Jack Hanson, Philippe Sosoe

Abstract: In high dimensional percolation at parameter $p < p_c$, the one-arm probability $π_p(n)$ is known to decay exponentially on scale $(p_c - p)^{-1/2}$. We show the same statement for the ratio $π_p(n) / π_{p_c}(n)$, establishing a form of a hypothesis of scaling theory. As part of our study, we provide sharp estimates (with matching upper and lower bounds) for several quantities of interest at the… ▽ More In high dimensional percolation at parameter $p < p_c$, the one-arm probability $π_p(n)$ is known to decay exponentially on scale $(p_c - p)^{-1/2}$. We show the same statement for the ratio $π_p(n) / π_{p_c}(n)$, establishing a form of a hypothesis of scaling theory. As part of our study, we provide sharp estimates (with matching upper and lower bounds) for several quantities of interest at the critical probability $p_c$. These include the tail behavior of volumes of, and chemical distances within, spanning clusters, along with the scaling of the two-point function at "mesoscopic distance" from the boundary of half-spaces. As a corollary, we obtain the tightness of the number of spanning clusters of a diameter $n$ box on scale $n^{d-6}$; this result complements a lower bound of Aizenman. △ Less

Submitted 29 July, 2021; originally announced July 2021.

Comments: 63 pages, 6 figures

MSC Class: 60K35 (Primary) 82B43; 82B27 (Secondary)

arXiv:2107.13747 [pdf, ps, other]

doi 10.1016/j.geomphys.2022.104509

Atiyah sequence and Gauge transformations of a principal $2$-bundle over a Lie groupoid

Authors: Saikat Chatterjee, Adittya Chaudhuri, Praphulla Koushik

Abstract: In this paper, a notion of a principal $2$-bundle over a Lie groupoid has been introduced. For such principal $2$-bundles, we produced a short exact sequence of VB-groupoids, namely, the Atiyah sequence. Two notions of connection structures viz. strict connections and semi-strict connections on a principal $2$-bundle arising respectively, from a retraction of the Atiyah sequence and a retraction u… ▽ More In this paper, a notion of a principal $2$-bundle over a Lie groupoid has been introduced. For such principal $2$-bundles, we produced a short exact sequence of VB-groupoids, namely, the Atiyah sequence. Two notions of connection structures viz. strict connections and semi-strict connections on a principal $2$-bundle arising respectively, from a retraction of the Atiyah sequence and a retraction up to a natural isomorphism have been introduced. We constructed a class of principal $\mathbb{G}=[G_1\rightrightarrows G_0]$-bundles and connections from a given principal $G_0$-bundle $E_0\rightarrow X_0$ over $[X_1\rightrightarrows X_0]$ with connection. An existence criterion for the connections on a principal $2$-bundle over a proper, étale Lie groupoid is proposed. The action of the $2$-group of gauge transformations on the category of strict and semi-strict connections has been studied. Finally we noted an extended symmetry of the category of semi-strict connections. △ Less

Submitted 4 August, 2021; v1 submitted 29 July, 2021; originally announced July 2021.

MSC Class: Primary 53C08; Secondary 22A22; 58H05

Journal ref: 2022

arXiv:2106.08489 [pdf, other]

Lorenz System State Stability Identification using Neural Networks

Authors: Megha Subramanian, Ramakrishna Tipireddy, Samrat Chatterjee

Abstract: Nonlinear dynamical systems such as Lorenz63 equations are known to be chaotic in nature and sensitive to initial conditions. As a result, a small perturbation in the initial conditions results in deviation in state trajectory after a few time steps. The algorithms and computational resources needed to accurately identify the system states vary depending on whether the solution is in transition re… ▽ More Nonlinear dynamical systems such as Lorenz63 equations are known to be chaotic in nature and sensitive to initial conditions. As a result, a small perturbation in the initial conditions results in deviation in state trajectory after a few time steps. The algorithms and computational resources needed to accurately identify the system states vary depending on whether the solution is in transition region or not. We refer to the transition and non-transition regions as unstable and stable regions respectively. We label a system state to be stable if it's immediate past and future states reside in the same regime. However, at a given time step we don't have the prior knowledge about whether system is in stable or unstable region. In this paper, we develop and train a feed forward (multi-layer perceptron) Neural Network to classify the system states of a Lorenz system as stable and unstable. We pose this task as a supervised learning problem where we train the neural network on Lorenz system which have states labeled as stable or unstable. We then test the ability of the neural network models to identify the stable and unstable states on a different Lorenz system that is generated using different initial conditions. We also evaluate the classification performance in the mismatched case i.e., when the initial conditions for training and validation data are sampled from different intervals. We show that certain normalization schemes can greatly improve the performance of neural networks in especially these mismatched scenarios. The classification framework developed in the paper can be a preprocessor for a larger context of sequential decision making framework where the decision making is performed based on observed stable or unstable states. △ Less

Submitted 15 June, 2021; originally announced June 2021.

arXiv:2106.02290 [pdf, other]

Matrix completion with data-dependent missingness probabilities

Authors: Sohom Bhattacharya, Sourav Chatterjee

Abstract: The problem of completing a large matrix with lots of missing entries has received widespread attention in the last couple of decades. Two popular approaches to the matrix completion problem are based on singular value thresholding and nuclear norm minimization. Most of the past works on this subject assume that there is a single number $p$ such that each entry of the matrix is available independe… ▽ More The problem of completing a large matrix with lots of missing entries has received widespread attention in the last couple of decades. Two popular approaches to the matrix completion problem are based on singular value thresholding and nuclear norm minimization. Most of the past works on this subject assume that there is a single number $p$ such that each entry of the matrix is available independently with probability $p$ and missing otherwise. This assumption may not be realistic for many applications. In this work, we replace it with the assumption that the probability that an entry is available is an unknown function $f$ of the entry itself. For example, if the entry is the rating given to a movie by a viewer, then it seems plausible that high value entries have greater probability of being available than low value entries. We propose two new estimators, based on singular value thresholding and nuclear norm minimization, to recover the matrix under this assumption. The estimators involve no tuning parameters, and are shown to be consistent under a low rank assumption. We also provide a consistent estimator of the unknown function $f$. △ Less

Submitted 22 April, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

Comments: 28 pages, 9 figures. To appear in IEEE Trans. Inf. Theory

arXiv:2105.05933 [pdf, ps, other]

Weak convergence of directed polymers to deterministic KPZ at high temperature

Authors: Sourav Chatterjee

Abstract: It is shown that when $d\ge 3$, the growing random surface generated by the $(d+1)$-dimensional directed polymer model at sufficiently high temperature, after being smoothed by taking microscopic local averages, converges to a solution of the deterministic KPZ equation in a suitable scaling limit. It is shown that when $d\ge 3$, the growing random surface generated by the $(d+1)$-dimensional directed polymer model at sufficiently high temperature, after being smoothed by taking microscopic local averages, converges to a solution of the deterministic KPZ equation in a suitable scaling limit. △ Less

Submitted 12 May, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

Comments: 26 pages. Minor changes in this revision. To appear in Ann. de l'Institut Henri Poincare Probab. Stat

MSC Class: 82C41; 60G60; 39A12

arXiv:2104.09409 [pdf, other]

Discrete-Time Fractional-Order Dynamical Networks Minimum-Energy State Estimation

Authors: Sarthak Chatterjee, Andrea Alessandretti, A. Pedro Aguiar, Sérgio Pequito

Abstract: Fractional-order dynamical networks are increasingly being used to model and describe processes demonstrating long-term memory or complex interlaced dependencies amongst the spatial and temporal components of a wide variety of dynamical networks. Notable examples include networked control systems or neurophysiological networks which are created using electroencephalographic (EEG) or blood-oxygen-l… ▽ More Fractional-order dynamical networks are increasingly being used to model and describe processes demonstrating long-term memory or complex interlaced dependencies amongst the spatial and temporal components of a wide variety of dynamical networks. Notable examples include networked control systems or neurophysiological networks which are created using electroencephalographic (EEG) or blood-oxygen-level-dependent (BOLD) data. As a result, the estimation of the states of fractional-order dynamical networks poses an important problem. To this effect, this paper addresses the problem of minimum-energy state estimation for discrete-time fractional-order dynamical networks (DT-FODN), where the state and output equations are affected by an additive noise that is considered to be deterministic, bounded, and unknown. Specifically, we derive the corresponding estimator and show that the resulting estimation error is exponentially input-to-state stable with respect to the disturbances and to a signal that is decreasing with the increase of the accuracy of the adopted approximation model. An illustrative example shows the effectiveness of the proposed method on real-world neurophysiological networks. △ Less

Submitted 2 August, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

Comments: 9 pages, 7 figures

arXiv:2103.14975 [pdf, other]

On Learning Discrete-Time Fractional-Order Dynamical Systems

Authors: Sarthak Chatterjee, Sérgio Pequito

Abstract: Discrete-time fractional-order dynamical systems (DT-FODS) have found innumerable applications in the context of modeling spatiotemporal behaviors associated with long-term memory. Applications include neurophysiological signals such as electroencephalogram (EEG) and electrocorticogram (ECoG). Although learning the spatiotemporal parameters of DT-FODS is not a new problem, when dealing with neurop… ▽ More Discrete-time fractional-order dynamical systems (DT-FODS) have found innumerable applications in the context of modeling spatiotemporal behaviors associated with long-term memory. Applications include neurophysiological signals such as electroencephalogram (EEG) and electrocorticogram (ECoG). Although learning the spatiotemporal parameters of DT-FODS is not a new problem, when dealing with neurophysiological signals we need to guarantee performance standards. Therefore, we need to understand the trade-offs between sample complexity and estimation accuracy of the system parameters. Simply speaking, we need to address the question of how many measurements we need to collect to identify the system parameters up to an uncertainty level. In this paper, we address the problem of identifying the spatial and temporal parameters of DT-FODS. The main result is the first result on non-asymptotic finite-sample complexity guarantees of identifying DT-FODS. Finally, we provide evidence of the efficacy of our method in the context of forecasting real-life intracranial EEG time series collected from patients undergoing epileptic seizures. △ Less

Submitted 3 October, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

Comments: 6 pages, 2 figures

arXiv:2103.09199 [pdf, ps, other]

Superconcentration in surface growth

Authors: Sourav Chatterjee

Abstract: Height functions of growing random surfaces are often conjectured to be superconcentrated, meaning that their variances grow sublinearly in time. This article introduces a new concept, called subroughness, meaning that there exist two distinct points such that the expected squared difference between the heights at these points grows sublinearly in time. The main result of the paper is that superco… ▽ More Height functions of growing random surfaces are often conjectured to be superconcentrated, meaning that their variances grow sublinearly in time. This article introduces a new concept, called subroughness, meaning that there exist two distinct points such that the expected squared difference between the heights at these points grows sublinearly in time. The main result of the paper is that superconcentration is equivalent to subroughness in a class of growing random surfaces. The result is applied to establish superconcentration in a variant of the restricted solid-on-solid (RSOS) model and in a variant of the ballistic deposition model, and give new proofs of superconcentration in directed last-passage percolation and directed polymers. △ Less

Submitted 7 May, 2022; v1 submitted 16 March, 2021; originally announced March 2021.

Comments: 32 pages. Minor edits in this revision. To appear in Random Structures and Algorithms

MSC Class: 82C41; 60E15

Showing 1–50 of 196 results for author: Chatterjee, S