-
$i$Trust: Trust-Region Optimisation with Ising Machines
Authors:
Sayantan Pramanik,
Kaumudibikash Goswami,
Sourav Chatterjee,
M Girish Chandra
Abstract:
In this work, we present a heretofore unseen application of Ising machines to perform trust region-based optimisation with box constraints. This is done by considering a specific form of opto-electronic oscillator-based coherent Ising machines with clipped transfer functions, and proposing appropriate modifications to facilitate trust-region optimisation. The enhancements include the inclusion of…
▽ More
In this work, we present a heretofore unseen application of Ising machines to perform trust region-based optimisation with box constraints. This is done by considering a specific form of opto-electronic oscillator-based coherent Ising machines with clipped transfer functions, and proposing appropriate modifications to facilitate trust-region optimisation. The enhancements include the inclusion of non-symmetric coupling and linear terms, modulation of noise, and compatibility with convex-projections to improve its convergence. The convergence of the modified Ising machine has been shown under the reasonable assumptions of convexity or invexity. The mathematical structures of the modified Ising machine and trust-region methods have been exploited to design a new trust-region method to effectively solve unconstrained optimisation problems in many scenarios, such as machine learning and optimisation of parameters in variational quantum algorithms. Hence, the proposition is useful for both classical and quantum-classical hybrid scenarios. Finally, the convergence of the Ising machine-based trust-region method, has also been proven analytically, establishing the feasibility of the technique.
△ Less
Submitted 6 June, 2024;
originally announced July 2024.
-
PriME: Privacy-aware Membership profile Estimation in networks
Authors:
Abhinav Chakraborty,
Sayak Chatterjee,
Sagnik Nandy
Abstract:
This paper presents a novel approach to estimating community membership probabilities for network vertices generated by the Degree Corrected Mixed Membership Stochastic Block Model while preserving individual edge privacy. Operating within the $\varepsilon$-edge local differential privacy framework, we introduce an optimal private algorithm based on a symmetric edge flip mechanism and spectral clu…
▽ More
This paper presents a novel approach to estimating community membership probabilities for network vertices generated by the Degree Corrected Mixed Membership Stochastic Block Model while preserving individual edge privacy. Operating within the $\varepsilon$-edge local differential privacy framework, we introduce an optimal private algorithm based on a symmetric edge flip mechanism and spectral clustering for accurate estimation of vertex community memberships. We conduct a comprehensive analysis of the estimation risk and establish the optimality of our procedure by providing matching lower bounds to the minimax risk under privacy constraints. To validate our approach, we demonstrate its performance through numerical simulations and its practical application to real-world data. This work represents a significant step forward in balancing accurate community membership estimation with stringent privacy preservation in network data analysis.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
On Hyperbolicity of Spirallike Circularlike domain
Authors:
Sanjoy Chatterjee,
Golam Mostafa Mondal
Abstract:
In this paper, we prove that a spirallike circularlike domain is Kobayashi hyperbolic if and only if its core is empty. In particular, we show that such a domain is Kobayashi hyperbolic if and only if it is (biholomorphic to) a bounded domain. We also propose a problem in this area.
In this paper, we prove that a spirallike circularlike domain is Kobayashi hyperbolic if and only if its core is empty. In particular, we show that such a domain is Kobayashi hyperbolic if and only if it is (biholomorphic to) a bounded domain. We also propose a problem in this area.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Liouville Theory: An Introduction to Rigorous Approaches
Authors:
Sourav Chatterjee,
Edward Witten
Abstract:
In recent years, a surprisingly direct and simple rigorous understanding of quantum Liouville theory has developed. We aim here to make this material more accessible to physicists working on quantum field theory.
In recent years, a surprisingly direct and simple rigorous understanding of quantum Liouville theory has developed. We aim here to make this material more accessible to physicists working on quantum field theory.
△ Less
Submitted 4 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
A scaling limit of $\mathrm{SU}(2)$ lattice Yang-Mills-Higgs theory
Authors:
Sourav Chatterjee
Abstract:
The construction of non-Abelian Euclidean Yang-Mills theories in dimension four, as scaling limits of lattice Yang-Mills theories or otherwise, is a central open question of mathematical physics. This paper takes the following small step towards this goal. In any dimension $d\ge 2$, we construct a scaling limit of $\mathrm{SU}(2)$ lattice Yang-Mills theory coupled to a Higgs field transforming in…
▽ More
The construction of non-Abelian Euclidean Yang-Mills theories in dimension four, as scaling limits of lattice Yang-Mills theories or otherwise, is a central open question of mathematical physics. This paper takes the following small step towards this goal. In any dimension $d\ge 2$, we construct a scaling limit of $\mathrm{SU}(2)$ lattice Yang-Mills theory coupled to a Higgs field transforming in the fundamental representation of $\mathrm{SU}(2)$. The scaling limit is obtained by sending the gauge coupling constant $g$ to zero and the Higgs length $α$ to infinity slower than $g^{-1}$, but faster than $g^{-1+1/49d}$. After unitary gauge fixing and taking the lattice scaling to zero as a constant multiple of $αg$, a stereographic projection of the gauge field is shown to converge to a scale-invariant massive Gaussian field. This gives the first construction of a scaling limit of a non-Abelian lattice Yang-Mills theory in a dimension higher than two, as well as the first rigorous proof of mass generation by the Higgs mechanism in such a theory. Analogous results are proved for $\mathrm{U}(1)$ theory as well. The question of constructing a non-Gaussian scaling limit remains open.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
Convergence Analysis of Opto-Electronic Oscillator based Coherent Ising Machines
Authors:
Sayantan Pramanik,
Sourav Chatterjee,
Harshkumar Oza
Abstract:
Ising machines are purported to be better at solving large-scale combinatorial optimisation problems better than conventional von Neumann computers. However, these Ising machines are widely believed to be heuristics, whose promise is observed empirically rather than obtained theoretically. We bridge this gap by considering an opto-electronic oscillator based coherent Ising machine, and providing t…
▽ More
Ising machines are purported to be better at solving large-scale combinatorial optimisation problems better than conventional von Neumann computers. However, these Ising machines are widely believed to be heuristics, whose promise is observed empirically rather than obtained theoretically. We bridge this gap by considering an opto-electronic oscillator based coherent Ising machine, and providing the first analytical proof that under reasonable assumptions, the OEO-CIM is not a heuristic approach. We find and prove bounds on its performance in terms of the expected difference between the objective value at the final iteration and the optimal one, and on the number of iterations required by it. In the process, we emphasise on some of its limitations such as the inability to handle asymmetric coupling between spins, and the absence of external magnetic field applied on them (both of which are necessary in many optimisation problems), along with some issues in its convergence. We overcome these limitations by proposing suitable adjustments and prove that the improved architecture is guaranteed to converge to the optimum of the relaxed objective function.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Discrete Dynamics and Supergeometry
Authors:
Subhobrata Chatterjee,
Andrew Waldron,
Cem Yetişmişoğlu
Abstract:
We formulate a geometric measurement theory of dynamical classical systems possessing both continuous and discrete degrees of freedom. The approach is covariant with respect to choices of clocks and canonically incorporates laboratories. The latter are embedded symplectic submanifolds of an odd-dimensional symplectic structure. When suitably defined, symplectic geometry in odd dimensions is exactl…
▽ More
We formulate a geometric measurement theory of dynamical classical systems possessing both continuous and discrete degrees of freedom. The approach is covariant with respect to choices of clocks and canonically incorporates laboratories. The latter are embedded symplectic submanifolds of an odd-dimensional symplectic structure. When suitably defined, symplectic geometry in odd dimensions is exactly the structure needed for covariance. A fundamentally probabilistic viewpoint allows classical supergeometries to describe discrete dynamics. We solve the problem of how to construct probabilistic measures on supermanifolds given a (possibly odd dimensional) supersymplectic structure. This relies on a superanalog of the Hodge star for differential forms and a description of probabilities by convex cones. We also show how stochastic processes such as Markov chains can be described by supergeometry.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Spectral gap of nonreversible Markov chains
Authors:
Sourav Chatterjee
Abstract:
We define the spectral gap of a Markov chain on a finite state space as the second-smallest singular value of the generator of the chain, generalizing the usual definition of spectral gap for reversible chains. We then define the relaxation time of the chain as the inverse of this spectral gap, and show that this relaxation time can be characterized, for any Markov chain, as the time required for…
▽ More
We define the spectral gap of a Markov chain on a finite state space as the second-smallest singular value of the generator of the chain, generalizing the usual definition of spectral gap for reversible chains. We then define the relaxation time of the chain as the inverse of this spectral gap, and show that this relaxation time can be characterized, for any Markov chain, as the time required for convergence of empirical averages. This relaxation time is related to the Cheeger constant and the mixing time of the chain through inequalities that are similar to the reversible case, and the path argument can be used to get upper bounds. Several examples are worked out. An interesting finding from the examples is that the time for convergence of empirical averages in nonreversible chains can often be substantially smaller than the mixing time.
△ Less
Submitted 13 November, 2023; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Neighbour Sum Patterns : Chessboards to Toroidal Worlds
Authors:
Sayan Dutta,
Ayanava Mandal,
Sohom Gupta,
Sourin Chatterjee
Abstract:
We say that a chessboard filled with integer entries satisfies the neighbour-sum property if the number appearing on each cell is the sum of entries in its neighbouring cells, where neighbours are cells sharing a common edge or vertex. We show that an $n\times n$ chessboard satisfies this property if and only if $n\equiv 5\pmod 6$. Existence of solutions is further investigated of rectangular, tor…
▽ More
We say that a chessboard filled with integer entries satisfies the neighbour-sum property if the number appearing on each cell is the sum of entries in its neighbouring cells, where neighbours are cells sharing a common edge or vertex. We show that an $n\times n$ chessboard satisfies this property if and only if $n\equiv 5\pmod 6$. Existence of solutions is further investigated of rectangular, toroidal boards, as well as on Neumann neighbourhoods, including a nice connection to discrete harmonic functions. Construction of solutions on infinite boards are also presented. Finally, answers to three dimensional analogues of these boards are explored using properties of cyclotomic polynomials and relevant ideas conjectured.
△ Less
Submitted 6 October, 2023;
originally announced October 2023.
-
A dynamic mean-field statistical model of academic collaboration
Authors:
Soumendu Sundar Mukherjee,
Tamojit Sadhukhan,
Shirshendu Chatterjee
Abstract:
There is empirical evidence that collaboration in academia has increased significantly during the past few decades, perhaps due to the breathtaking advancements in communication and technology during this period. Multi-author articles have become more frequent than single-author ones. Interdisciplinary collaboration is also on the rise. Although there have been several studies on the dynamical asp…
▽ More
There is empirical evidence that collaboration in academia has increased significantly during the past few decades, perhaps due to the breathtaking advancements in communication and technology during this period. Multi-author articles have become more frequent than single-author ones. Interdisciplinary collaboration is also on the rise. Although there have been several studies on the dynamical aspects of collaboration networks, systematic statistical models which theoretically explain various empirically observed features of such networks have been lacking. In this work, we propose a dynamic mean-field model and an associated estimation framework for academic collaboration networks. We primarily focus on how the degree of collaboration of a typical author, rather than the local structure of her collaboration network, changes over time. We consider several popular indices of collaboration from the literature and study their dynamics under the proposed model. In particular, we obtain exact formulae for the expectations and temporal rates of change of these indices. Through extensive simulation experiments, we demonstrate that the proposed model has enough flexibility to capture various phenomena characteristic of real-world collaboration networks. Using metadata on papers from the arXiv repository, we empirically study the mean-field collaboration dynamics in disciplines such as Computer Science, Mathematics and Physics.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Parallel transport on a Lie 2-group bundle over a Lie groupoid along Haefliger paths
Authors:
Saikat Chatterjee,
Adittya Chaudhuri
Abstract:
We prove a Lie 2-group torsor version of the well-known one-one correspondence between fibered categories and pseudofunctors. Consequently, we obtain a weak version of the principal Lie group bundle over a Lie groupoid. The correspondence also enables us to extend a particular class of principal 2-bundles to be defined over differentiable stacks. We show that the differential geometric connection…
▽ More
We prove a Lie 2-group torsor version of the well-known one-one correspondence between fibered categories and pseudofunctors. Consequently, we obtain a weak version of the principal Lie group bundle over a Lie groupoid. The correspondence also enables us to extend a particular class of principal 2-bundles to be defined over differentiable stacks. We show that the differential geometric connection structures introduced in the authors' previous work, combine nicely with the underlying fibration structure of a principal 2-bundle over a Lie groupoid. This interrelation allows us to derive a notion of parallel transport in the framework of principal 2-bundles over Lie groupoids along a particular class of Haefliger paths. The corresponding parallel transport functor is shown to be smooth. We apply our results to examine the parallel transport on an associated VB-groupoid.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
A characterization of bounded balanced convex domains in $\mathbb{C}^n$
Authors:
Sanjoy Chatterjee,
Golam Mostafa Mondal
Abstract:
In this paper, we investigate the characterization of balanced bounded convex domains in $\mathbb{C}^n$ in terms of the squeezing function. As an application, we provide a characterization of the polydisc in $\mathbb{C}^n$.
In this paper, we investigate the characterization of balanced bounded convex domains in $\mathbb{C}^n$ in terms of the squeezing function. As an application, we provide a characterization of the polydisc in $\mathbb{C}^n$.
△ Less
Submitted 13 February, 2024; v1 submitted 8 August, 2023;
originally announced August 2023.
-
Features of a spin glass in the random field Ising model
Authors:
Sourav Chatterjee
Abstract:
A longstanding open question in the theory of disordered systems is whether short-range models, such as the random field Ising model or the Edwards-Anderson model, can indeed have the famous properties that characterize mean-field spin glasses at nonzero temperature. This article shows that this is at least partially possible in the case of the random field Ising model. Consider the Ising model on…
▽ More
A longstanding open question in the theory of disordered systems is whether short-range models, such as the random field Ising model or the Edwards-Anderson model, can indeed have the famous properties that characterize mean-field spin glasses at nonzero temperature. This article shows that this is at least partially possible in the case of the random field Ising model. Consider the Ising model on a discrete $d$-dimensional cube under free boundary condition, subjected to a very weak i.i.d. random external field, where the field strength is inversely proportional to the square-root of the number of sites. It turns out that in $d\ge 2$ and at subcritical temperatures, this model has some of the key features of a mean-field spin glass. Namely, (a) the site overlap exhibits one step of replica symmetry breaking, (b) the quenched distribution of the overlap is non-self-averaging, and (c) the overlap has the Parisi ultrametric property. Furthermore, it is shown that for Gaussian disorder, replica symmetry does not break if the field strength is taken to be stronger than the one prescribed above, and non-self-averaging fails if it is weaker, showing that the above order of field strength is the only one that allows all three properties to hold. However, the model does not have two other features of mean-field models. Namely, (a) it does not satisfy the Ghirlanda-Guerra identities, and (b) it has only two pure states instead of many.
△ Less
Submitted 7 March, 2024; v1 submitted 14 July, 2023;
originally announced July 2023.
-
A study of spirallike domains: polynomial convexity, Loewner chains and dense holomorphic curves
Authors:
Sanjoy Chatterjee,
Sushil Gorai
Abstract:
In this paper, we prove that the closure of a bounded pseudoconvex domain, which is spirallike with respect to a globally asymptotic stable holomorphic vector field, is polynomially convex. We also provide a necessary and sufficient condition, in terms of polynomial convexity, on a univalent function defined on a strongly convex domain for embedding it into a filtering Loewner chain. Next, we prov…
▽ More
In this paper, we prove that the closure of a bounded pseudoconvex domain, which is spirallike with respect to a globally asymptotic stable holomorphic vector field, is polynomially convex. We also provide a necessary and sufficient condition, in terms of polynomial convexity, on a univalent function defined on a strongly convex domain for embedding it into a filtering Loewner chain. Next, we provide an application of our first result. We show that for any bounded pseudoconvex strictly spirallike domain $Ω$ in $\mathbb{C}^n$ and given any connected complex manifold $Y$, there exists a holomorphic map from the unit disc to the space of all holomorphic maps from $Ω$ to $Y$. This also yields us the existence of $\mathcal{O}(Ω, Y)$-universal map for any generalized translation on $Ω$, which, in turn, is connected to the hypercyclicity of certain composition operators on the space of manifold valued holomorphic maps.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Enumerative Theory for the Tsetlin Library
Authors:
Sourav Chatterjee,
Persi Diaconis,
Gene B. Kim
Abstract:
The Tsetlin library is a well-studied Markov chain on the symmetric group $S_n$. It has stationary distribution $π(σ)$ the Luce model, a nonuniform distribution on $S_n$, which appears in psychology, horse race betting, and tournament poker. Simple enumerative questions, such as ``what is the distribution of the top $k$ cards?'' or ``what is the distribution of the bottom $k$ cards?'' are long ope…
▽ More
The Tsetlin library is a well-studied Markov chain on the symmetric group $S_n$. It has stationary distribution $π(σ)$ the Luce model, a nonuniform distribution on $S_n$, which appears in psychology, horse race betting, and tournament poker. Simple enumerative questions, such as ``what is the distribution of the top $k$ cards?'' or ``what is the distribution of the bottom $k$ cards?'' are long open. We settle these questions and draw attention to a host of parallel questions on the extension to the chambers of a hyperplane arrangement.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Central Limit Theorem for Gram-Schmidt Random Walk Design
Authors:
Sabyasachi Chatterjee,
Partha S. Dey,
Subhajit Goswami
Abstract:
We prove a central limit theorem for the Horvitz-Thompson estimator based on the Gram-Schmidt Walk (GSW) design, recently developed in Harshaw et al.(2022). In particular, we consider the version of the GSW design which uses randomized pivot order, thereby answering an open question raised in the same article. We deduce this under minimal and global assumptions involving only the problem parameter…
▽ More
We prove a central limit theorem for the Horvitz-Thompson estimator based on the Gram-Schmidt Walk (GSW) design, recently developed in Harshaw et al.(2022). In particular, we consider the version of the GSW design which uses randomized pivot order, thereby answering an open question raised in the same article. We deduce this under minimal and global assumptions involving only the problem parameters such as the (sum) potential outcome vector and the covariate matrix. As an interesting consequence of our analysis we also obtain the precise limiting variance of the estimator in terms of these parameters which is smaller than the previously known upper bound. The main ingredients are a simplified skeletal process approximating the GSW design and concentration phenomena for random matrices obtained from random sampling using the Stein's method for exchangeable pairs.
△ Less
Submitted 5 June, 2023; v1 submitted 21 May, 2023;
originally announced May 2023.
-
Characterising Solutions of Anomalous Cancellation
Authors:
Satvik Saha,
Sohom Gupta,
Sayan Dutta,
Sourin Chatterjee
Abstract:
Anomalous cancellation of fractions is a mathematically inaccurate method where cancelling the common digits of the numerator and denominator correctly reduces it. While it appears to be accidentally successful, the property of anomalous cancellation is intricately connected to the number of digits of the denominator as well as the base in which the fraction is represented. Previous work have been…
▽ More
Anomalous cancellation of fractions is a mathematically inaccurate method where cancelling the common digits of the numerator and denominator correctly reduces it. While it appears to be accidentally successful, the property of anomalous cancellation is intricately connected to the number of digits of the denominator as well as the base in which the fraction is represented. Previous work have been mostly surrounding three digit solutions or specific properties of the same. This paper seeks to get general results regarding the structure of numbers that follow the cancellation property (denoted by $P^*_{\ell; k}$) and an estimate of the total number of solutions possible in a given base representation. In particular, interesting properties regarding the saturation of the number of solutions in general and $p^n$ bases (where $p$ is a prime) have been studied in detail.
△ Less
Submitted 31 January, 2023;
originally announced February 2023.
-
Spin glass phase at zero temperature in the Edwards-Anderson model
Authors:
Sourav Chatterjee
Abstract:
While the analysis of mean-field spin glass models has seen tremendous progress in the last twenty years, lattice spin glasses have remained largely intractable. This article presents the solutions to a number of questions about the Edwards-Anderson model of short-range spin glasses (in all dimensions) that were raised in the physics literature many years ago. First, it is shown that the ground st…
▽ More
While the analysis of mean-field spin glass models has seen tremendous progress in the last twenty years, lattice spin glasses have remained largely intractable. This article presents the solutions to a number of questions about the Edwards-Anderson model of short-range spin glasses (in all dimensions) that were raised in the physics literature many years ago. First, it is shown that the ground state is sensitive to small perturbations of the disorder, in the sense that a small amount of noise gives rise to a new ground state that is nearly orthogonal to the old one with respect to the site overlap inner product. Second, it is shown that one can overturn a macroscopic fraction of the spins in the ground state with an energy cost that is negligible compared to the size of the boundary of the overturned region - a feature that is believed to be typical of spin glasses but clearly absent in ferromagnets. The third result is that the boundary of the overturned region in dimension $d$ has fractal dimension strictly greater than $d-1$, confirming a prediction from physics. The fourth result is that the correlations between bonds in the ground state can decay at most like the inverse of the distance. This contrasts with the random field Ising model, where it has been shown recently that the correlation decays exponentially in distance in dimension two. The fifth result is that the expected size of the critical droplet of a bond grows at least like a power of the volume. Taken together, these results comprise the first mathematical proof of glassy behavior in a short-range spin glass model.
△ Less
Submitted 28 February, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
AdverSAR: Adversarial Search and Rescue via Multi-Agent Reinforcement Learning
Authors:
Aowabin Rahman,
Arnab Bhattacharya,
Thiagarajan Ramachandran,
Sayak Mukherjee,
Himanshu Sharma,
Ted Fujimoto,
Samrat Chatterjee
Abstract:
Search and Rescue (SAR) missions in remote environments often employ autonomous multi-robot systems that learn, plan, and execute a combination of local single-robot control actions, group primitives, and global mission-oriented coordination and collaboration. Often, SAR coordination strategies are manually designed by human experts who can remotely control the multi-robot system and enable semi-a…
▽ More
Search and Rescue (SAR) missions in remote environments often employ autonomous multi-robot systems that learn, plan, and execute a combination of local single-robot control actions, group primitives, and global mission-oriented coordination and collaboration. Often, SAR coordination strategies are manually designed by human experts who can remotely control the multi-robot system and enable semi-autonomous operations. However, in remote environments where connectivity is limited and human intervention is often not possible, decentralized collaboration strategies are needed for fully-autonomous operations. Nevertheless, decentralized coordination may be ineffective in adversarial environments due to sensor noise, actuation faults, or manipulation of inter-agent communication data. In this paper, we propose an algorithmic approach based on adversarial multi-agent reinforcement learning (MARL) that allows robots to efficiently coordinate their strategies in the presence of adversarial inter-agent communications. In our setup, the objective of the multi-robot team is to discover targets strategically in an obstacle-strewn geographical area by minimizing the average time needed to find the targets. It is assumed that the robots have no prior knowledge of the target locations, and they can interact with only a subset of neighboring robots at any time. Based on the centralized training with decentralized execution (CTDE) paradigm in MARL, we utilize a hierarchical meta-learning framework to learn dynamic team-coordination modalities and discover emergent team behavior under complex cooperative-competitive scenarios. The effectiveness of our approach is demonstrated on a collection of prototype grid-world environments with different specifications of benign and adversarial agents, target locations, and agent rewards.
△ Less
Submitted 20 December, 2022;
originally announced December 2022.
-
Online Distributed Algorithm for Optimal Power Flow problem with Regret Analysis
Authors:
Sushobhan Chatterjee,
Rachel Kalpana Kalaimani
Abstract:
We investigate the distributed DC-Optimal Power Flow (DC-OPF) problem for a dynamic and uncertain environment. The unpredictable supply of renewable resources and varying prices of the electricity market are a few factors responsible for the uncertainty. We propose to address this problem using the framework of online convex optimization, where the cost functions are not known apriori because of t…
▽ More
We investigate the distributed DC-Optimal Power Flow (DC-OPF) problem for a dynamic and uncertain environment. The unpredictable supply of renewable resources and varying prices of the electricity market are a few factors responsible for the uncertainty. We propose to address this problem using the framework of online convex optimization, where the cost functions are not known apriori because of the uncertainty and are revealed only incrementally over time. We also consider a distributed setting, where each agent (generators and loads) in the power network is only privy to their own local objectives and constraints but can communicate with their neighbours. A distributed online algorithm is proposed based on the modified primal-dual approach. The performance of the online algorithm is evaluated using the regret (static) function, which is the difference between the actual cost incurred by employing the proposed algorithm and the optimal fixed decision in hindsight. Since we deal with a constrained optimization problem, analogous to the notion of regret the accumulation of the constraint violation is also calculated at each step. We establish a sub-linear bound on the static regret and constraint violation under suitable assumptions on step-size and cost function. Finally, we use the standard IEEE-14 bus system to demonstrate the performance of our algorithm.
△ Less
Submitted 9 August, 2023; v1 submitted 7 December, 2022;
originally announced December 2022.
-
A survey of some recent developments in measures of association
Authors:
Sourav Chatterjee
Abstract:
This paper surveys some recent developments in measures of association related to a new coefficient of correlation introduced by the author. A straightforward extension of this coefficient to standard Borel spaces (which includes all Polish spaces), overlooked in the literature so far, is proposed at the end of the survey.
This paper surveys some recent developments in measures of association related to a new coefficient of correlation introduced by the author. A straightforward extension of this coefficient to standard Borel spaces (which includes all Polish spaces), overlooked in the literature so far, is proposed at the end of the survey.
△ Less
Submitted 9 August, 2023; v1 submitted 9 November, 2022;
originally announced November 2022.
-
Estimating large causal polytrees from small samples
Authors:
Sourav Chatterjee,
Mathukumalli Vidyasagar
Abstract:
We consider the problem of estimating a large causal polytree from a relatively small i.i.d. sample. This is motivated by the problem of determining causal structure when the number of variables is very large compared to the sample size, such as in gene regulatory networks. We give an algorithm that recovers the tree with high accuracy in such settings. The algorithm works under essentially no dis…
▽ More
We consider the problem of estimating a large causal polytree from a relatively small i.i.d. sample. This is motivated by the problem of determining causal structure when the number of variables is very large compared to the sample size, such as in gene regulatory networks. We give an algorithm that recovers the tree with high accuracy in such settings. The algorithm works under essentially no distributional or modeling assumptions other than some mild non-degeneracy conditions.
△ Less
Submitted 29 March, 2024; v1 submitted 14 September, 2022;
originally announced September 2022.
-
Approximations on Spirallike domains of $\mathbb{C}^{n}$
Authors:
Sanjoy Chatterjee,
Sushil Gorai
Abstract:
In this paper, we first show that any domain $\Om$ in $\cn(n \geq 2)$, which is spirallike with respect to a complete holomorphic globally asymptotic stable vector field $F$, is a Runge domain. Next, we prove an Andersén-Lempert type approximation theorem: any biholomorphism $Φ\colon \Om \to Φ(\Om)$, with $Φ(\Om)$ is Runge, can be approximated by automorphisms of $\mathbb{C}^{n}$ uniformly on comp…
▽ More
In this paper, we first show that any domain $\Om$ in $\cn(n \geq 2)$, which is spirallike with respect to a complete holomorphic globally asymptotic stable vector field $F$, is a Runge domain. Next, we prove an Andersén-Lempert type approximation theorem: any biholomorphism $Φ\colon \Om \to Φ(\Om)$, with $Φ(\Om)$ is Runge, can be approximated by automorphisms of $\mathbb{C}^{n}$ uniformly on compacts, in the following two cases.
\begin{itemize}
\item [(i)] The domain $\Om\subset\cn$ is a spirallike with respect to a linear vector field $A$, where $2\max\{\rlλ:λ\inσ(A)\}<\min\{\rlλ:λ\inσ(A)\}$.
\item [(ii)] The domain $\Om$ is spirallike with respect to complete globally exponentially stable vector field $F$, with a certain rate of the convergence of the flow of the vector field $F$ in $\Om $
\end{itemize}
We further show that, if $J(Φ) \equiv 1$ (and $div(F)$ is constant in the situation (ii)) then the biholomorphism $Φ\colon \Om \to Φ(\Om)$ can be approximated by volume preserving automorphism of $\cn$ in both the cases mentioned above. As an application of our approximation results, we show that any Loewner PDE in a complete hyperbolic domain $\Om$ which satisfies (i) or (ii) mentioned above admits an essentially unique univalent solution with values in $\cn$. We also provide an example of a Hartogs domain in $\mathbb{C}^{2}$ which spirallike with respect to a complete holomorphic vector field $F(z_{1},z_{2})=(-2z_{1},-3z_{2}+z_{1}z_{2})$, but the domain is not spirallike with respect to any linear vector field. Some more examples are provided at the end of this paper.
△ Less
Submitted 4 March, 2024; v1 submitted 23 August, 2022;
originally announced August 2022.
-
An invariance principle for the 1D KPZ equation
Authors:
Arka Adhikari,
Sourav Chatterjee
Abstract:
Consider a discrete one-dimensional random surface whose height at a point grows as a function of the heights at neighboring points plus an independent random noise. Assuming that this function is equivariant under constant shifts, symmetric in its arguments, and at least six times continuously differentiable in a neighborhood of the origin, we show that as the variance of the noise goes to zero,…
▽ More
Consider a discrete one-dimensional random surface whose height at a point grows as a function of the heights at neighboring points plus an independent random noise. Assuming that this function is equivariant under constant shifts, symmetric in its arguments, and at least six times continuously differentiable in a neighborhood of the origin, we show that as the variance of the noise goes to zero, any such process converges to the Cole-Hopf solution of the 1D KPZ equation under a suitable scaling of space and time. This proves an invariance principle for the 1D KPZ equation, in the spirit of Donsker's invariance principle for Brownian motion.
△ Less
Submitted 1 September, 2023; v1 submitted 4 August, 2022;
originally announced August 2022.
-
Concentration inequalities for correlated network-valued processes with applications to community estimation and changepoint analysis
Authors:
Sayak Chatterjee,
Shirshendu Chatterjee,
Soumendu Sundar Mukherjee,
Anirban Nath,
Sharmodeep Bhattacharyya
Abstract:
Network-valued time series are currently a common form of network data. However, the study of the aggregate behavior of network sequences generated from network-valued stochastic processes is relatively rare. Most of the existing research focuses on the simple setup where the networks are independent (or conditionally independent) across time, and all edges are updated synchronously at each time s…
▽ More
Network-valued time series are currently a common form of network data. However, the study of the aggregate behavior of network sequences generated from network-valued stochastic processes is relatively rare. Most of the existing research focuses on the simple setup where the networks are independent (or conditionally independent) across time, and all edges are updated synchronously at each time step. In this paper, we study the concentration properties of the aggregated adjacency matrix and the corresponding Laplacian matrix associated with network sequences generated from lazy network-valued stochastic processes, where edges update asynchronously, and each edge follows a lazy stochastic process for its updates independent of the other edges. We demonstrate the usefulness of these concentration results in proving consistency of standard estimators in community estimation and changepoint estimation problems. We also conduct a simulation study to demonstrate the effect of the laziness parameter, which controls the extent of temporal correlation, on the accuracy of community and changepoint estimation.
△ Less
Submitted 2 August, 2022;
originally announced August 2022.
-
Some basic results on fuzzy strong $φ$-b-normed linear spaces
Authors:
Abhishikta Das,
T. Bag,
S. Chatterjee
Abstract:
In this paper, definition of fuzzy strong $φ$-b-normed linear space is given. Here the scalar function |c| is replaced by a general function $φ$(c) where φ satisfies some properties. Some basic results on finite dimensional fuzzy strong $φ$-b-normed linear space are studied.
In this paper, definition of fuzzy strong $φ$-b-normed linear space is given. Here the scalar function |c| is replaced by a general function $φ$(c) where φ satisfies some properties. Some basic results on finite dimensional fuzzy strong $φ$-b-normed linear space are studied.
△ Less
Submitted 25 May, 2022;
originally announced May 2022.
-
A random walk on the Rado graph
Authors:
Sourav Chatterjee,
Persi Diaconis,
Laurent Miclo
Abstract:
The Rado graph, also known as the random graph $G(\infty, p)$, is a classical limit object for finite graphs. We study natural ball walks as a way of understanding the geometry of this graph. For the walk started at $i$, we show that order $\log_2^*i$ steps are sufficient, and for infinitely many $i$, necessary for convergence to stationarity. The proof involves an application of Hardy's inequalit…
▽ More
The Rado graph, also known as the random graph $G(\infty, p)$, is a classical limit object for finite graphs. We study natural ball walks as a way of understanding the geometry of this graph. For the walk started at $i$, we show that order $\log_2^*i$ steps are sufficient, and for infinitely many $i$, necessary for convergence to stationarity. The proof involves an application of Hardy's inequality for trees.
△ Less
Submitted 13 May, 2022;
originally announced May 2022.
-
DeepBayes -- an estimator for parameter estimation in stochastic nonlinear dynamical models
Authors:
Anubhab Ghosh,
Mohamed Abdalmoaty,
Saikat Chatterjee,
Håkan Hjalmarsson
Abstract:
Stochastic nonlinear dynamical systems are ubiquitous in modern, real-world applications. Yet, estimating the unknown parameters of stochastic, nonlinear dynamical models remains a challenging problem. The majority of existing methods employ maximum likelihood or Bayesian estimation. However, these methods suffer from some limitations, most notably the substantial computational time for inference…
▽ More
Stochastic nonlinear dynamical systems are ubiquitous in modern, real-world applications. Yet, estimating the unknown parameters of stochastic, nonlinear dynamical models remains a challenging problem. The majority of existing methods employ maximum likelihood or Bayesian estimation. However, these methods suffer from some limitations, most notably the substantial computational time for inference coupled with limited flexibility in application. In this work, we propose DeepBayes estimators that leverage the power of deep recurrent neural networks in learning an estimator. The method consists of first training a recurrent neural network to minimize the mean-squared estimation error over a set of synthetically generated data using models drawn from the model set of interest. The a priori trained estimator can then be used directly for inference by evaluating the network with the estimation data. The deep recurrent neural network architectures can be trained offline and ensure significant time savings during inference. We experiment with two popular recurrent neural networks -- long short term memory network (LSTM) and gated recurrent unit (GRU). We demonstrate the applicability of our proposed method on different example models and perform detailed comparisons with state-of-the-art approaches. We also provide a study on a real-world nonlinear benchmark problem. The experimental evaluations show that the proposed approach is asymptotically as good as the Bayes estimator.
△ Less
Submitted 4 May, 2022;
originally announced May 2022.
-
Spatially Adaptive Online Prediction of Piecewise Regular Functions
Authors:
Sabyasachi Chatterjee,
Subhajit Goswami
Abstract:
We consider the problem of estimating piecewise regular functions in an online setting, i.e., the data arrive sequentially and at any round our task is to predict the value of the true function at the next revealed point using the available data from past predictions. We propose a suitably modified version of a recently developed online learning algorithm called the slee** experts aggregation al…
▽ More
We consider the problem of estimating piecewise regular functions in an online setting, i.e., the data arrive sequentially and at any round our task is to predict the value of the true function at the next revealed point using the available data from past predictions. We propose a suitably modified version of a recently developed online learning algorithm called the slee** experts aggregation algorithm. We show that this estimator satisfies oracle risk bounds simultaneously for all local regions of the domain. As concrete instantiations of the expert aggregation algorithm proposed here, we study an online mean aggregation and an online linear regression aggregation algorithm where experts correspond to the set of dyadic subrectangles of the domain. The resulting algorithms are near linear time computable in the sample size. We specifically focus on the performance of these online algorithms in the context of estimating piecewise polynomial and bounded variation function classes in the fixed design setup. The simultaneous oracle risk bounds we obtain for these estimators in this context provide new and improved (in certain aspects) guarantees even in the batch setting and are not available for the state of the art batch learning estimators.
△ Less
Submitted 30 March, 2022;
originally announced March 2022.
-
Convergence of gradient descent for deep neural networks
Authors:
Sourav Chatterjee
Abstract:
This article presents a criterion for convergence of gradient descent to a global minimum, which is then used to show that gradient descent with proper initialization converges to a global minimum when training any feedforward neural network with smooth and strictly increasing activation functions, provided that the input dimension is greater than or equal to the number of data points. The main di…
▽ More
This article presents a criterion for convergence of gradient descent to a global minimum, which is then used to show that gradient descent with proper initialization converges to a global minimum when training any feedforward neural network with smooth and strictly increasing activation functions, provided that the input dimension is greater than or equal to the number of data points. The main difference with prior work is that the width of the network can be a fixed number instead of growing as some multiple or power of the number of data points.
△ Less
Submitted 17 December, 2022; v1 submitted 30 March, 2022;
originally announced March 2022.
-
Distributed Optimization of Average Consensus Containment with Multiple Stationary Leaders
Authors:
Sushobhan Chatterjee,
Rachel Kalpana Kalaimani
Abstract:
In this paper, we consider the problem of containment control of multi-agent systems with multiple stationary leaders, interacting over a directed network. While, containment control refers to just ensuring that the follower agents reach the convex hull of the leaders states, we focus on the problem where the followers achieve a consensus to the average values of the leaders states. We propose an…
▽ More
In this paper, we consider the problem of containment control of multi-agent systems with multiple stationary leaders, interacting over a directed network. While, containment control refers to just ensuring that the follower agents reach the convex hull of the leaders states, we focus on the problem where the followers achieve a consensus to the average values of the leaders states. We propose an algorithm that can be implemented in a distributed manner to achieve the above consensus among followers. Next we optimize the convergence rate of the followers to the average consensus by proper choice of weights for the interaction graph. This optimization is also performed in a distributed manner using Alternating Direction Method of Multipliers (ADMM). Finally, we complement our results by illustrating them with numerical examples.
△ Less
Submitted 30 March, 2022;
originally announced March 2022.
-
Element-wise Estimation Error of Generalized Fused Lasso
Authors:
Teng Zhang,
Sabyasachi Chatterjee
Abstract:
The main result of this article is that we obtain an elementwise error bound for the Fused Lasso estimator for any general convex loss function $ρ$. We then focus on the special cases when either $ρ$ is the square loss function (for mean regression) or is the quantile loss function (for quantile regression) for which we derive new pointwise error bounds. Even though error bounds for the usual Fuse…
▽ More
The main result of this article is that we obtain an elementwise error bound for the Fused Lasso estimator for any general convex loss function $ρ$. We then focus on the special cases when either $ρ$ is the square loss function (for mean regression) or is the quantile loss function (for quantile regression) for which we derive new pointwise error bounds. Even though error bounds for the usual Fused Lasso estimator and its quantile version have been studied before; our bound appears to be new. This is because all previous works bound a global loss function like the sum of squared error, or a sum of Huber losses in the case of quantile regression in Padilla and Chatterjee (2021). Clearly, element wise bounds are stronger than global loss error bounds as it reveals how the loss behaves locally at each point. Our element wise error bound also has a clean and explicit dependence on the tuning parameter $λ$ which informs the user of a good choice of $λ$. In addition, our bound is nonasymptotic with explicit constants and is able to recover almost all the known results for Fused Lasso (both mean and quantile regression) with additional improvements in some cases.
△ Less
Submitted 18 March, 2022; v1 submitted 8 March, 2022;
originally announced March 2022.
-
A Cross Validation Framework for Signal Denoising with Applications to Trend Filtering, Dyadic CART and Beyond
Authors:
Anamitra Chaudhuri,
Sabyasachi Chatterjee
Abstract:
This paper formulates a general cross validation framework for signal denoising. The general framework is then applied to nonparametric regression methods such as Trend Filtering and Dyadic CART. The resulting cross validated versions are then shown to attain nearly the same rates of convergence as are known for the optimally tuned analogues. There did not exist any previous theoretical analyses o…
▽ More
This paper formulates a general cross validation framework for signal denoising. The general framework is then applied to nonparametric regression methods such as Trend Filtering and Dyadic CART. The resulting cross validated versions are then shown to attain nearly the same rates of convergence as are known for the optimally tuned analogues. There did not exist any previous theoretical analyses of cross validated versions of Trend Filtering or Dyadic CART. To illustrate the generality of the framework we also propose and study cross validated versions of two fundamental estimators; lasso for high dimensional linear regression and singular value thresholding for matrix estimation. Our general framework is inspired by the ideas in Chatterjee and Jafarov (2015) and is potentially applicable to a wide range of estimation methods which use tuning parameters.
△ Less
Submitted 3 May, 2023; v1 submitted 7 January, 2022;
originally announced January 2022.
-
Fractional cyber-neural systems -- a brief survey
Authors:
Emily Reed,
Sarthak Chatterjee,
Guilherme Ramos,
Paul Bogdan,
Sérgio Pequito
Abstract:
Neurotechnology has made great strides in the last 20 years. However, we still have a long way to go to commercialize many of these technologies as we lack a unified framework to study cyber-neural systems (CNS) that bring the hardware, software, and the neural system together. Dynamical systems play a key role in develo** these technologies as they capture different aspects of the brain and pro…
▽ More
Neurotechnology has made great strides in the last 20 years. However, we still have a long way to go to commercialize many of these technologies as we lack a unified framework to study cyber-neural systems (CNS) that bring the hardware, software, and the neural system together. Dynamical systems play a key role in develo** these technologies as they capture different aspects of the brain and provide insight into their function. Converging evidence suggests that fractional-order dynamical systems are advantageous in modeling neural systems because of their compact representation and accuracy in capturing the long-range memory exhibited in neural behavior. In this brief survey, we provide an overview of fractional CNS that entails fractional-order systems in the context of CNS. In particular, we introduce basic definitions required for the analysis and synthesis of fractional CNS, encompassing system identification, state estimation, and closed-loop control. Additionally, we provide an illustration of some applications in the context of CNS and draw some possible future research directions. Ultimately, advancements in these three areas will be critical in develo** the next generation of CNS, which will, ultimately, improve people's quality of life.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
A state space for 3D Euclidean Yang-Mills theories
Authors:
Sky Cao,
Sourav Chatterjee
Abstract:
It is believed that Euclidean Yang-Mills theories behave like the massless Gaussian free field (GFF) at short distances. This makes it impossible to define the main observables for these theories - the Wilson loop observables - in dimensions greater than two, because line integrals of the GFF do not exist in such dimensions. Taking forward a proposal of Charalambous and Gross, this article shows t…
▽ More
It is believed that Euclidean Yang-Mills theories behave like the massless Gaussian free field (GFF) at short distances. This makes it impossible to define the main observables for these theories - the Wilson loop observables - in dimensions greater than two, because line integrals of the GFF do not exist in such dimensions. Taking forward a proposal of Charalambous and Gross, this article shows that it is possible to define Euclidean Yang-Mills theories on the 3D unit torus as "random distributional gauge orbits", provided that they indeed behave like the GFF in a certain sense. One of the main technical tools is the existence of the Yang-Mills heat flow on the 3D torus starting from GFF-like initial data, which is established in a companion paper. A key consequence of this construction is that under the GFF assumption, one can define a notion of "regularized Wilson loop observables" for Euclidean Yang-Mills theories on the 3D unit torus.
△ Less
Submitted 19 November, 2023; v1 submitted 24 November, 2021;
originally announced November 2021.
-
The Yang-Mills heat flow with random distributional initial data
Authors:
Sky Cao,
Sourav Chatterjee
Abstract:
We construct local solutions to the Yang-Mills heat flow (in the DeTurck gauge) for a certain class of random distributional initial data, which includes the 3D Gaussian free field. The main idea, which goes back to work of Bourgain as well as work of Da Prato-Debussche, is to decompose the solution into a rougher linear part and a smoother nonlinear part, and to control the latter by probabilisti…
▽ More
We construct local solutions to the Yang-Mills heat flow (in the DeTurck gauge) for a certain class of random distributional initial data, which includes the 3D Gaussian free field. The main idea, which goes back to work of Bourgain as well as work of Da Prato-Debussche, is to decompose the solution into a rougher linear part and a smoother nonlinear part, and to control the latter by probabilistic arguments. In a companion work, we use the main results of this paper to propose a way towards the construction of 3D Yang-Mills measures.
△ Less
Submitted 25 August, 2022; v1 submitted 20 November, 2021;
originally announced November 2021.
-
Existence of stationary ballistic deposition on the infinite lattice
Authors:
Sourav Chatterjee
Abstract:
Ballistic deposition is one of the many models of interface growth that are believed to be in the KPZ universality class, but have so far proved to be largely intractable mathematically. In this model, blocks of size one fall independently as Poisson processes at each site on the $d$-dimensional lattice, and either attach themselves to the column growing at that site, or to the side of an adjacent…
▽ More
Ballistic deposition is one of the many models of interface growth that are believed to be in the KPZ universality class, but have so far proved to be largely intractable mathematically. In this model, blocks of size one fall independently as Poisson processes at each site on the $d$-dimensional lattice, and either attach themselves to the column growing at that site, or to the side of an adjacent column, whichever comes first. It is not hard to see that if we subtract off the height of the column at the origin from the heights of the other columns, the resulting interface process is Markovian. The main result of this article is that this Markov process has at least one invariant probability measure. We conjecture that the invariant measure is not unique, and provide some partial evidence.
△ Less
Submitted 18 May, 2022; v1 submitted 19 October, 2021;
originally announced October 2021.
-
Regret Minimization in Isotonic, Heavy-Tailed Contextual Bandits via Adaptive Confidence Bands
Authors:
Sabyasachi Chatterjee,
Subhabrata Sen
Abstract:
In this paper we initiate a study of non parametric contextual bandits under shape constraints on the mean reward function. Specifically, we study a setting where the context is one dimensional, and the mean reward function is isotonic with respect to this context. We propose a policy for this problem and show that it attains minimax rate optimal regret. Moreover, we show that the same policy enjo…
▽ More
In this paper we initiate a study of non parametric contextual bandits under shape constraints on the mean reward function. Specifically, we study a setting where the context is one dimensional, and the mean reward function is isotonic with respect to this context. We propose a policy for this problem and show that it attains minimax rate optimal regret. Moreover, we show that the same policy enjoys automatic adaptation; that is, for subclasses of the parameter space where the true mean reward functions are also piecewise constant with $k$ pieces, this policy remains minimax rate optimal simultaneously for all $k \geq 1.$ Automatic adaptation phenomena are well-known for shape constrained problems in the offline setting;
%The phenomenon of automatic adaptation of shape constrained methods is known to occur in offline problems;
we show that such phenomena carry over to the online setting.
The main technical ingredient underlying our policy is a procedure to derive confidence bands for an underlying isotonic function using the isotonic quantile estimator. The confidence band we propose is valid under heavy tailed noise, and its average width goes to $0$ at an adaptively optimal rate. We consider this to be an independent contribution to the isotonic regression literature.
△ Less
Submitted 19 October, 2021;
originally announced October 2021.
-
Quantile Regression by Dyadic CART
Authors:
Oscar Hernan Madrid Padilla,
Sabyasachi Chatterjee
Abstract:
In this paper we propose and study a version of the Dyadic Classification and Regression Trees (DCART) estimator from Donoho (1997) for (fixed design) quantile regression in general dimensions. We refer to this proposed estimator as the QDCART estimator. Just like the mean regression version, we show that a) a fast dynamic programming based algorithm with computational complexity $O(N \log N)$ exi…
▽ More
In this paper we propose and study a version of the Dyadic Classification and Regression Trees (DCART) estimator from Donoho (1997) for (fixed design) quantile regression in general dimensions. We refer to this proposed estimator as the QDCART estimator. Just like the mean regression version, we show that a) a fast dynamic programming based algorithm with computational complexity $O(N \log N)$ exists for computing the QDCART estimator and b) an oracle risk bound (trading off squared error and a complexity parameter of the true signal) holds for the QDCART estimator. This oracle risk bound then allows us to demonstrate that the QDCART estimator enjoys adaptively rate optimal estimation guarantees for piecewise constant and bounded variation function classes. In contrast to existing results for the DCART estimator which requires subgaussianity of the error distribution, for our estimation guarantees to hold we do not need any restrictive tail decay assumptions on the error distribution. For instance, our results hold even when the error distribution has no first moment such as the Cauchy distribution. Apart from the Dyadic CART method, we also consider other variant methods such as the Optimal Regression Tree (ORT) estimator introduced in Chatterjee and Goswami (2019). In particular, we also extend the ORT estimator to the quantile setting and establish that it enjoys analogous guarantees. Thus, this paper extends the scope of these globally optimal regression tree based methodologies to be applicable for heavy tailed data. We then perform extensive numerical experiments on both simulated and real data which illustrate the usefulness of the proposed methods.
△ Less
Submitted 16 October, 2021;
originally announced October 2021.
-
Local KPZ behavior under arbitrary scaling limits
Authors:
Sourav Chatterjee
Abstract:
One of the main difficulties in proving convergence of discrete models of surface growth to the Kardar-Parisi-Zhang (KPZ) equation in dimensions higher than one is that the correct way to take a scaling limit, so that the limit is nontrivial, is not known in a rigorous sense. To understand KPZ growth without being hindered by this issue, this article introduces a notion of "local KPZ behavior", wh…
▽ More
One of the main difficulties in proving convergence of discrete models of surface growth to the Kardar-Parisi-Zhang (KPZ) equation in dimensions higher than one is that the correct way to take a scaling limit, so that the limit is nontrivial, is not known in a rigorous sense. To understand KPZ growth without being hindered by this issue, this article introduces a notion of "local KPZ behavior", which roughly means that the instantaneous growth of the surface at a point decomposes into the sum of a Laplacian term, a gradient squared term, a noise term that behaves like white noise, and a remainder term that is negligible compared to the other three terms and their sum. The main result is that for a general class of surfaces, which contains the model of directed polymers in a random environment as a special case, local KPZ behavior occurs under arbitrary scaling limits, in any dimension.
△ Less
Submitted 29 July, 2022; v1 submitted 3 October, 2021;
originally announced October 2021.
-
Isomorphisms between random graphs
Authors:
Sourav Chatterjee,
Persi Diaconis
Abstract:
Consider two independent Erdős-Rényi $G(N,1/2)$ graphs. We show that with probability tending to $1$ as $N\to\infty$, the largest induced isomorphic subgraph has size either $\lfloor x_N-\varepsilon_N\rfloor$ or $\lfloor x_N+\varepsilon_N \rfloor$, where $x_N=4\log_2 N -2 \log_2 \log_2 N - 2\log_2(4/e)+1$ and $\varepsilon_N = (4\log_2 N)^{-1/2}$. Using similar techniques, we also show that if…
▽ More
Consider two independent Erdős-Rényi $G(N,1/2)$ graphs. We show that with probability tending to $1$ as $N\to\infty$, the largest induced isomorphic subgraph has size either $\lfloor x_N-\varepsilon_N\rfloor$ or $\lfloor x_N+\varepsilon_N \rfloor$, where $x_N=4\log_2 N -2 \log_2 \log_2 N - 2\log_2(4/e)+1$ and $\varepsilon_N = (4\log_2 N)^{-1/2}$. Using similar techniques, we also show that if $Γ_1$ and $Γ_2$ are independent $G(n,1/2)$ and $G(N,1/2)$ random graphs, then $Γ_2$ contains an isomorphic copy of $Γ_1$ as an induced subgraph with high probability if $n\le \lfloor y_N - \varepsilon_N \rfloor$ and does not contain an isomorphic copy of $Γ_1$ as an induced subgraph with high probability if $n>\lfloor y_N+\varepsilon_N \rfloor$, where $y_N=2\log_2 N+1$ and $\varepsilon_N$ is as above.
△ Less
Submitted 30 December, 2022; v1 submitted 9 August, 2021;
originally announced August 2021.
-
Convergence of deterministic growth models
Authors:
Sourav Chatterjee,
Panagiotis E. Souganidis
Abstract:
We prove the uniform in space and time convergence of the scaled heights of large classes of deterministic growth models that are monotone and equivariant under translations by constants. The limits are characterized as the unique (viscosity solutions) of first- or second-order partial differential equations depending on whether the growth models are scaled hyperbolically or parabolically. The res…
▽ More
We prove the uniform in space and time convergence of the scaled heights of large classes of deterministic growth models that are monotone and equivariant under translations by constants. The limits are characterized as the unique (viscosity solutions) of first- or second-order partial differential equations depending on whether the growth models are scaled hyperbolically or parabolically. The results greatly simplify and extend a recent work by the first author to more general surface growth models. The proofs are based on the methodology developed by Barles and the second author to prove convergence of approximation schemes.
△ Less
Submitted 6 December, 2021; v1 submitted 1 August, 2021;
originally announced August 2021.
-
Subcritical Connectivity and Some Exact Tail Exponents in High Dimensional Percolation
Authors:
Shirshendu Chatterjee,
Jack Hanson,
Philippe Sosoe
Abstract:
In high dimensional percolation at parameter $p < p_c$, the one-arm probability $π_p(n)$ is known to decay exponentially on scale $(p_c - p)^{-1/2}$. We show the same statement for the ratio $π_p(n) / π_{p_c}(n)$, establishing a form of a hypothesis of scaling theory.
As part of our study, we provide sharp estimates (with matching upper and lower bounds) for several quantities of interest at the…
▽ More
In high dimensional percolation at parameter $p < p_c$, the one-arm probability $π_p(n)$ is known to decay exponentially on scale $(p_c - p)^{-1/2}$. We show the same statement for the ratio $π_p(n) / π_{p_c}(n)$, establishing a form of a hypothesis of scaling theory.
As part of our study, we provide sharp estimates (with matching upper and lower bounds) for several quantities of interest at the critical probability $p_c$. These include the tail behavior of volumes of, and chemical distances within, spanning clusters, along with the scaling of the two-point function at "mesoscopic distance" from the boundary of half-spaces. As a corollary, we obtain the tightness of the number of spanning clusters of a diameter $n$ box on scale $n^{d-6}$; this result complements a lower bound of Aizenman.
△ Less
Submitted 29 July, 2021;
originally announced July 2021.
-
Atiyah sequence and Gauge transformations of a principal $2$-bundle over a Lie groupoid
Authors:
Saikat Chatterjee,
Adittya Chaudhuri,
Praphulla Koushik
Abstract:
In this paper, a notion of a principal $2$-bundle over a Lie groupoid has been introduced. For such principal $2$-bundles, we produced a short exact sequence of VB-groupoids, namely, the Atiyah sequence. Two notions of connection structures viz. strict connections and semi-strict connections on a principal $2$-bundle arising respectively, from a retraction of the Atiyah sequence and a retraction u…
▽ More
In this paper, a notion of a principal $2$-bundle over a Lie groupoid has been introduced. For such principal $2$-bundles, we produced a short exact sequence of VB-groupoids, namely, the Atiyah sequence. Two notions of connection structures viz. strict connections and semi-strict connections on a principal $2$-bundle arising respectively, from a retraction of the Atiyah sequence and a retraction up to a natural isomorphism have been introduced. We constructed a class of principal $\mathbb{G}=[G_1\rightrightarrows G_0]$-bundles and connections from a given principal $G_0$-bundle $E_0\rightarrow X_0$ over $[X_1\rightrightarrows X_0]$ with connection. An existence criterion for the connections on a principal $2$-bundle over a proper, étale Lie groupoid is proposed. The action of the $2$-group of gauge transformations on the category of strict and semi-strict connections has been studied. Finally we noted an extended symmetry of the category of semi-strict connections.
△ Less
Submitted 4 August, 2021; v1 submitted 29 July, 2021;
originally announced July 2021.
-
Lorenz System State Stability Identification using Neural Networks
Authors:
Megha Subramanian,
Ramakrishna Tipireddy,
Samrat Chatterjee
Abstract:
Nonlinear dynamical systems such as Lorenz63 equations are known to be chaotic in nature and sensitive to initial conditions. As a result, a small perturbation in the initial conditions results in deviation in state trajectory after a few time steps. The algorithms and computational resources needed to accurately identify the system states vary depending on whether the solution is in transition re…
▽ More
Nonlinear dynamical systems such as Lorenz63 equations are known to be chaotic in nature and sensitive to initial conditions. As a result, a small perturbation in the initial conditions results in deviation in state trajectory after a few time steps. The algorithms and computational resources needed to accurately identify the system states vary depending on whether the solution is in transition region or not. We refer to the transition and non-transition regions as unstable and stable regions respectively. We label a system state to be stable if it's immediate past and future states reside in the same regime. However, at a given time step we don't have the prior knowledge about whether system is in stable or unstable region. In this paper, we develop and train a feed forward (multi-layer perceptron) Neural Network to classify the system states of a Lorenz system as stable and unstable. We pose this task as a supervised learning problem where we train the neural network on Lorenz system which have states labeled as stable or unstable. We then test the ability of the neural network models to identify the stable and unstable states on a different Lorenz system that is generated using different initial conditions. We also evaluate the classification performance in the mismatched case i.e., when the initial conditions for training and validation data are sampled from different intervals. We show that certain normalization schemes can greatly improve the performance of neural networks in especially these mismatched scenarios. The classification framework developed in the paper can be a preprocessor for a larger context of sequential decision making framework where the decision making is performed based on observed stable or unstable states.
△ Less
Submitted 15 June, 2021;
originally announced June 2021.
-
Matrix completion with data-dependent missingness probabilities
Authors:
Sohom Bhattacharya,
Sourav Chatterjee
Abstract:
The problem of completing a large matrix with lots of missing entries has received widespread attention in the last couple of decades. Two popular approaches to the matrix completion problem are based on singular value thresholding and nuclear norm minimization. Most of the past works on this subject assume that there is a single number $p$ such that each entry of the matrix is available independe…
▽ More
The problem of completing a large matrix with lots of missing entries has received widespread attention in the last couple of decades. Two popular approaches to the matrix completion problem are based on singular value thresholding and nuclear norm minimization. Most of the past works on this subject assume that there is a single number $p$ such that each entry of the matrix is available independently with probability $p$ and missing otherwise. This assumption may not be realistic for many applications. In this work, we replace it with the assumption that the probability that an entry is available is an unknown function $f$ of the entry itself. For example, if the entry is the rating given to a movie by a viewer, then it seems plausible that high value entries have greater probability of being available than low value entries. We propose two new estimators, based on singular value thresholding and nuclear norm minimization, to recover the matrix under this assumption. The estimators involve no tuning parameters, and are shown to be consistent under a low rank assumption. We also provide a consistent estimator of the unknown function $f$.
△ Less
Submitted 22 April, 2022; v1 submitted 4 June, 2021;
originally announced June 2021.
-
Weak convergence of directed polymers to deterministic KPZ at high temperature
Authors:
Sourav Chatterjee
Abstract:
It is shown that when $d\ge 3$, the growing random surface generated by the $(d+1)$-dimensional directed polymer model at sufficiently high temperature, after being smoothed by taking microscopic local averages, converges to a solution of the deterministic KPZ equation in a suitable scaling limit.
It is shown that when $d\ge 3$, the growing random surface generated by the $(d+1)$-dimensional directed polymer model at sufficiently high temperature, after being smoothed by taking microscopic local averages, converges to a solution of the deterministic KPZ equation in a suitable scaling limit.
△ Less
Submitted 12 May, 2022; v1 submitted 12 May, 2021;
originally announced May 2021.
-
Discrete-Time Fractional-Order Dynamical Networks Minimum-Energy State Estimation
Authors:
Sarthak Chatterjee,
Andrea Alessandretti,
A. Pedro Aguiar,
Sérgio Pequito
Abstract:
Fractional-order dynamical networks are increasingly being used to model and describe processes demonstrating long-term memory or complex interlaced dependencies amongst the spatial and temporal components of a wide variety of dynamical networks. Notable examples include networked control systems or neurophysiological networks which are created using electroencephalographic (EEG) or blood-oxygen-l…
▽ More
Fractional-order dynamical networks are increasingly being used to model and describe processes demonstrating long-term memory or complex interlaced dependencies amongst the spatial and temporal components of a wide variety of dynamical networks. Notable examples include networked control systems or neurophysiological networks which are created using electroencephalographic (EEG) or blood-oxygen-level-dependent (BOLD) data. As a result, the estimation of the states of fractional-order dynamical networks poses an important problem. To this effect, this paper addresses the problem of minimum-energy state estimation for discrete-time fractional-order dynamical networks (DT-FODN), where the state and output equations are affected by an additive noise that is considered to be deterministic, bounded, and unknown. Specifically, we derive the corresponding estimator and show that the resulting estimation error is exponentially input-to-state stable with respect to the disturbances and to a signal that is decreasing with the increase of the accuracy of the adopted approximation model. An illustrative example shows the effectiveness of the proposed method on real-world neurophysiological networks.
△ Less
Submitted 2 August, 2021; v1 submitted 19 April, 2021;
originally announced April 2021.
-
On Learning Discrete-Time Fractional-Order Dynamical Systems
Authors:
Sarthak Chatterjee,
Sérgio Pequito
Abstract:
Discrete-time fractional-order dynamical systems (DT-FODS) have found innumerable applications in the context of modeling spatiotemporal behaviors associated with long-term memory. Applications include neurophysiological signals such as electroencephalogram (EEG) and electrocorticogram (ECoG). Although learning the spatiotemporal parameters of DT-FODS is not a new problem, when dealing with neurop…
▽ More
Discrete-time fractional-order dynamical systems (DT-FODS) have found innumerable applications in the context of modeling spatiotemporal behaviors associated with long-term memory. Applications include neurophysiological signals such as electroencephalogram (EEG) and electrocorticogram (ECoG). Although learning the spatiotemporal parameters of DT-FODS is not a new problem, when dealing with neurophysiological signals we need to guarantee performance standards. Therefore, we need to understand the trade-offs between sample complexity and estimation accuracy of the system parameters. Simply speaking, we need to address the question of how many measurements we need to collect to identify the system parameters up to an uncertainty level. In this paper, we address the problem of identifying the spatial and temporal parameters of DT-FODS. The main result is the first result on non-asymptotic finite-sample complexity guarantees of identifying DT-FODS. Finally, we provide evidence of the efficacy of our method in the context of forecasting real-life intracranial EEG time series collected from patients undergoing epileptic seizures.
△ Less
Submitted 3 October, 2021; v1 submitted 27 March, 2021;
originally announced March 2021.
-
Superconcentration in surface growth
Authors:
Sourav Chatterjee
Abstract:
Height functions of growing random surfaces are often conjectured to be superconcentrated, meaning that their variances grow sublinearly in time. This article introduces a new concept, called subroughness, meaning that there exist two distinct points such that the expected squared difference between the heights at these points grows sublinearly in time. The main result of the paper is that superco…
▽ More
Height functions of growing random surfaces are often conjectured to be superconcentrated, meaning that their variances grow sublinearly in time. This article introduces a new concept, called subroughness, meaning that there exist two distinct points such that the expected squared difference between the heights at these points grows sublinearly in time. The main result of the paper is that superconcentration is equivalent to subroughness in a class of growing random surfaces. The result is applied to establish superconcentration in a variant of the restricted solid-on-solid (RSOS) model and in a variant of the ballistic deposition model, and give new proofs of superconcentration in directed last-passage percolation and directed polymers.
△ Less
Submitted 7 May, 2022; v1 submitted 16 March, 2021;
originally announced March 2021.