Search | arXiv e-print repository

Modified Legendre-Gauss Collocation Method for Solving Optimal Control Problems with Nonsmooth Solutions

Authors: Gabriela Abadia-Doyle, Anil V. Rao

Abstract: A modified form of Legendre-Gauss orthogonal direct collocation is developed for solving optimal control problems whose solutions are nonsmooth due to control discontinuities. This new method adds switch-time variables, control variables, and collocation conditions at both endpoints of a mesh interval, whereas these new variables and collocation conditions are not included in standard Legendre-Gau… ▽ More A modified form of Legendre-Gauss orthogonal direct collocation is developed for solving optimal control problems whose solutions are nonsmooth due to control discontinuities. This new method adds switch-time variables, control variables, and collocation conditions at both endpoints of a mesh interval, whereas these new variables and collocation conditions are not included in standard Legendre-Gauss orthogonal collocation. The modified Legendre-Gauss collocation method alters the search space of the resulting nonlinear programming problem and enables determining accurately the location of the nonsmoothness in the optimal control. The transformed adjoint system of the modified Legendre-Gauss collocation method is then derived and shown to satisfy a discrete form of the continuous variational necessary conditions for optimality. The method is motivated via a control-constrained triple-integrator minimum-time optimal control problem where the solution possesses a two-switch bang-bang optimal control structure. In addition, the method developed in this paper is compared with existing Gaussian quadrature collocation methods. The method developed in this paper is shown to be capable of accurately solving optimal control problems with a discontinuous optimal control. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: 23 pages, 7 figures. Submitted for publication consideration in the IEEE Transactions on Automatic Control. Also, an earlier version has been submitted for publication consideration in the 2024 IEEE Conference on Decision and Control to be held in Milan, Italy, 16-19 December 2024

arXiv:2406.04185 [pdf, ps, other]

Numerical Optimization Study of a Constrained Hypersonic Reentry Vehicle

Authors: Cale A. Byczkowski, Anil V. Rao

Abstract: The trajectory optimization of the atmospheric entry of a reusable launch vehicle is studied. The objective is to maximize the crossrange of the vehicle subject to two control-inequality path constraints, two state-inequality path constraints, and one mixed state-and-control inequality path constraint. In order to determine the complex switching structure in the activity of the path constraints, a… ▽ More The trajectory optimization of the atmospheric entry of a reusable launch vehicle is studied. The objective is to maximize the crossrange of the vehicle subject to two control-inequality path constraints, two state-inequality path constraints, and one mixed state-and-control inequality path constraint. In order to determine the complex switching structure in the activity of the path constraints, a recently developed method for solving state-path constrained optimal control problems is used. This recently developed method is designed to algorithmically locate the points of activation and deactivation in the path constraints and partition the domain of the independent variable into subdomains based on these activation and deactivation points. Additionally, in a domain where a state-inequality path constraint is found to be active, the method algorithmically determines and enforces the additional necessary conditions that apply on the constrained arc. A multiple-domain formulation of Legendre-Gauss-Radau direct collocation is then employed to transcribe the optimal control problem into a large sparse nonlinear programming problem. Two studies are performed which analyze a variety of problem formulations of the hypersonic reusable launch vehicle. Key features of the constrained trajectories are presented, and the method used is shown to obtain highly accurate solutions with minimal user intervention. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: 29 pages, 11 figures, 5 tables

arXiv:2403.00975 [pdf, other]

Equipment Health Assessment: Time Series Analysis for Wind Turbine Performance

Authors: Jana Backhus, Aniruddha Rajendra Rao, Chandrasekar Venkatraman, Abhishek Padmanabhan, A. Vinoth Kumar, Chetan Gupta

Abstract: In this study, we leverage SCADA data from diverse wind turbines to predict power output, employing advanced time series methods, specifically Functional Neural Networks (FNN) and Long Short-Term Memory (LSTM) networks. A key innovation lies in the ensemble of FNN and LSTM models, capitalizing on their collective learning. This ensemble approach outperforms individual models, ensuring stable and a… ▽ More In this study, we leverage SCADA data from diverse wind turbines to predict power output, employing advanced time series methods, specifically Functional Neural Networks (FNN) and Long Short-Term Memory (LSTM) networks. A key innovation lies in the ensemble of FNN and LSTM models, capitalizing on their collective learning. This ensemble approach outperforms individual models, ensuring stable and accurate power output predictions. Additionally, machine learning techniques are applied to detect wind turbine performance deterioration, enabling proactive maintenance strategies and health assessment. Crucially, our analysis reveals the uniqueness of each wind turbine, necessitating tailored models for optimal predictions. These insight underscores the importance of providing automatized customization for different turbines to keep human modeling effort low. Importantly, the methodologies developed in this analysis are not limited to wind turbines; they can be extended to predict and optimize performance in various machinery, highlighting the versatility and applicability of our research across diverse industrial contexts. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Comments: 19 Pages, 17 Figures, 3 Tables, Submitted at Applied Sciences (MDPI)

arXiv:2402.00196 [pdf, ps, other]

Badly approximable grids and k-divergent lattices

Authors: Nikolay Moshchevitin, Anurag Rao, Uri Shapira

Abstract: For an m by n real matrix A, we investigate the set of badly approximable targets for A as a subset of the m-torus. It is well known that this set is large in the sense that it is dense and has full Hausdorff dimension. We investigate the relationship between its measure and Diophantine properties of A. On the one hand, we give the first examples of a non-singular matrix A such that the set of bad… ▽ More For an m by n real matrix A, we investigate the set of badly approximable targets for A as a subset of the m-torus. It is well known that this set is large in the sense that it is dense and has full Hausdorff dimension. We investigate the relationship between its measure and Diophantine properties of A. On the one hand, we give the first examples of a non-singular matrix A such that the set of badly approximable targets has full measure with respect to some non-trivial algebraic measure on the torus. For this, we use transference theorems due to Jarnik and Khintchine, and the parametric geometry of numbers in the sense of Roy. On the other hand, we give a novel Diophantine condition on A that slightly strengthens non-singularity, and show that under the assumption that A satisfies this condition, the set of badly approximable targets is a null-set with respect to any non-trivial algebraic measure on the torus. For this we use naive homogeneous dynamics, harmonic analysis, and a novel concept we refer to as mixing convergence of measures. △ Less

Submitted 1 March, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

arXiv:2311.07569 [pdf, other]

Optimal Load Shedding for Public Safety Power Shutoffs

Authors: Aniruddha Rajendra Rao, Chandrasekar Venkatraman, Robert Ellis, Chetan Gupta

Abstract: Public utilities are faced with situations where high winds can bring trees and debris into contact with energized power lines and other equipments, which could ignite wildfires. As a result, they need to turn off power during severe weather to help prevent wildfires. This is called Public Safety Power Shutoff (PSPS). We present a method for load reduction using a multi-step genetic algorithm for… ▽ More Public utilities are faced with situations where high winds can bring trees and debris into contact with energized power lines and other equipments, which could ignite wildfires. As a result, they need to turn off power during severe weather to help prevent wildfires. This is called Public Safety Power Shutoff (PSPS). We present a method for load reduction using a multi-step genetic algorithm for Public Safety Power Shutoff events. The proposed method optimizes load shedding using partial load shedding based on load importance (critical loads like hospitals, fire stations, etc). The multi-step genetic algorithm optimizes load shedding while minimizing the impact on important loads and preserving grid stability. The effectiveness of the method is demonstrated through network examples. The results show that the proposed method achieves minimal load shedding while maintaining the critical loads at acceptable levels. This approach will help utilities to effectively manage PSPS events and reduce the risk of wildfires caused by the power lines. △ Less

Submitted 13 November, 2023; originally announced November 2023.

Comments: 10 pages, 5 figures, 3 Tables. Accepted at IEEE ETFG 2023

arXiv:2307.09054 [pdf, other]

K-divergent lattices

Authors: Guy Lachman, Anurag Rao, Uri Shapira, Yuval Yifrach

Abstract: We introduce a novel concept in topological dynamics, referred to as $k$-divergence, which extends the notion of divergent orbits. Motivated by questions in the theory of inhomogeneous Diophantine approximations, we investigate this notion in the dynamical system given by a certain flow on the space of unimodular lattices in $\mathbb{R}^d$. Our main result is the existence of $k$-divergent lattice… ▽ More We introduce a novel concept in topological dynamics, referred to as $k$-divergence, which extends the notion of divergent orbits. Motivated by questions in the theory of inhomogeneous Diophantine approximations, we investigate this notion in the dynamical system given by a certain flow on the space of unimodular lattices in $\mathbb{R}^d$. Our main result is the existence of $k$-divergent lattices for any $k\geq 0$. In fact, we utilize the emerging theory of parametric geometry of numbers and calculate the Hausdorff dimension of the set of $k$-divergent lattices. △ Less

Submitted 18 July, 2023; originally announced July 2023.

arXiv:2304.06130 [pdf, ps, other]

doi 10.1002/oca.3097

Method for Solving State-Path Constrained Optimal Control Problems Using Adaptive Radau Collocation

Authors: Cale A. Byczkowski, Anil V. Rao

Abstract: A new method is developed for accurately approximating the solution to state-variable inequality path constrained optimal control problems using a multiple-domain adaptive Legendre-Gauss-Radau collocation method. The method consists of the following parts. First, a structure detection method is developed to estimate switch times in the activation and deactivation of state-variable inequality path… ▽ More A new method is developed for accurately approximating the solution to state-variable inequality path constrained optimal control problems using a multiple-domain adaptive Legendre-Gauss-Radau collocation method. The method consists of the following parts. First, a structure detection method is developed to estimate switch times in the activation and deactivation of state-variable inequality path constraints. Second, using the detected structure, the domain is partitioned into multiple-domains where each domain corresponds to either a constrained or an unconstrained segment. Furthermore, additional decision variables are introduced in the multiple-domain formulation, where these additional decision variables represent the switch times of the detected active state-variable inequality path constraints. Within a constrained domain, the path constraint is differentiated with respect to the independent variable until the control appears explicitly, and this derivative is set to zero along the constrained arc while all preceding derivatives are set to zero at the start of the constrained arc. The time derivatives of the active state-variable inequality path constraints are computed using automatic differentiation and the properties of the chain rule. The method is demonstrated on two problems, the first being a benchmark optimal control problem which has a known analytical solution and the second being a challenging problem from the field of aerospace engineering in which there is no known analytical solution. When compared against previously developed adaptive Legendre-Gauss-Radau methods, the results show that the method developed in this paper is capable of computing accurate solutions to problems whose solution contain active state-variable inequality path constraints. △ Less

Submitted 4 January, 2024; v1 submitted 12 April, 2023; originally announced April 2023.

Comments: 31 pages, 7 figures, 5 tables

arXiv:2211.04111 [pdf, ps, other]

Generalised homotopy and commutativity principle

Authors: Ravi A. Rao, Sampat Sharma

Abstract: In this paper, we study the action of special $n\times n $ linear (resp. symplectic) matrices which are homotopic to identity on the right invertible $n\times m$ matrices. We also prove that the commutator subgroup of $\rm{O}_{2n}(R[X])$ is two stably elementary orthogonal for a local ring $R$ with $\frac{1}{2}\in R$ and $n\geq 3.$ In this paper, we study the action of special $n\times n $ linear (resp. symplectic) matrices which are homotopic to identity on the right invertible $n\times m$ matrices. We also prove that the commutator subgroup of $\rm{O}_{2n}(R[X])$ is two stably elementary orthogonal for a local ring $R$ with $\frac{1}{2}\in R$ and $n\geq 3.$ △ Less

Submitted 8 November, 2022; originally announced November 2022.

arXiv:2210.09299 [pdf, ps, other]

A dichotomy phenomenon for Bad minus normed Dirichlet

Authors: Dmitry Kleinbock, Anurag Rao

Abstract: Given a norm $ν$ on $\mathbb{R}^2$, the set of $ν$-Dirichlet improvable numbers $\mathbf{DI}_ν$ was defined and studied in the papers of Andersen-Duke (Acta Arith. 2021) and Kleinbock-Rao (Internat. Math. Res. Notices 2022). When $ν$ is the supremum norm, $\mathbf{DI}_ν= \mathbf{BA}\cup \mathbb{Q}$, where $\mathbf{BA}$ is the set of badly approximable numbers. Each of the sets $\mathbf{DI}_ν$, lik… ▽ More Given a norm $ν$ on $\mathbb{R}^2$, the set of $ν$-Dirichlet improvable numbers $\mathbf{DI}_ν$ was defined and studied in the papers of Andersen-Duke (Acta Arith. 2021) and Kleinbock-Rao (Internat. Math. Res. Notices 2022). When $ν$ is the supremum norm, $\mathbf{DI}_ν= \mathbf{BA}\cup \mathbb{Q}$, where $\mathbf{BA}$ is the set of badly approximable numbers. Each of the sets $\mathbf{DI}_ν$, like $\mathbf{BA}$, is of measure zero and satisfies the winning property of Schmidt. Hence for every norm $ν$, $\mathbf{BA} \cap \mathbf{DI}_ν$ is winning and thus has full Hausdorff dimension. In the present article we prove the following dichotomy phenomenon: either $\mathbf{BA} \subset \mathbf{DI}_ν$ or else $\mathbf{BA} \smallsetminus \mathbf{DI}_ν$ has full Hausdorff dimension. We give several examples for each of the two cases. The dichotomy is based on whether the critical locus of $ν$ intersects a precompact $g_t$-orbit, where $\{g_t\}$ is the one-parameter diagonal subgroup of $\operatorname{SL}_2(\mathbb{R})$ acting on the space $X$ of unimodular lattices in $\mathbb{R}^2$. Thus the aforementioned dichotomy follows from the following dynamical statement: for a lattice $Λ\in X$, either $g_\mathbb{R} Λ$ is unbounded (and then any precompact $g_{\mathbb{R}_{>0}}$-orbit must eventually avoid a neighborhood of $Λ$), or not, in which case the set of lattices in $X$ whose $g_{\mathbb{R}_{>0}}$-trajectories are precompact and contain $Λ$ in their closure has full Hausdorff dimension. △ Less

Submitted 31 August, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

Comments: Minor corrections following referee report

arXiv:2206.13683 [pdf, ps, other]

Minimum-Fuel Earth-Based Orbit Transfers Using Multiple-Domain Adaptive Radau Collocation

Authors: Brittanny V. Holden, Anil V. Rao

Abstract: A numerical optimization study of minimum-fuel Earth-based orbital transfers from low-Earth orbit (LEO) to either medium-Earth orbit (MEO), high-Earth orbit (HEO), or geostationary orbit (GEO), is performed. Various values of maximum allowable thrust acceleration are considered for each type of transfer (LEO-to-MEO, LEO-to-HEO, or LEO-to-GEO). A key aspect of the study performed in this paper is t… ▽ More A numerical optimization study of minimum-fuel Earth-based orbital transfers from low-Earth orbit (LEO) to either medium-Earth orbit (MEO), high-Earth orbit (HEO), or geostationary orbit (GEO), is performed. Various values of maximum allowable thrust acceleration are considered for each type of transfer (LEO-to-MEO, LEO-to-HEO, or LEO-to-GEO). A key aspect of the study performed in this paper is that the optimal thrusting structure is not assumed to be known a priori, but is determined as part of the solution process. In order to determine the optimal thrusting structure, a recently developed bang-bang and singular optimal control (BBSOC) method is employed together with multiple-domain Legendre-Gauss-Radau quadrature collocation. Key results obtained in this study include not only the number of switches in the optimized thrust, but also the total impulse. Furthermore, it is found that, as the maximum allowable thrust acceleration decreases, the total impulse is less than the total impulse obtained from a previous study where a burn-coast-burn thrusting structure was assumed a priori. For each type of transfer a particular value of maximum allowable thrust acceleration is chosen to highlight in more detail the key features of the optimal solutions. This study provides improved results over previous studies and provides improved insight into the optimal thrusting structure required in order to accomplish each type of orbital transfer using the least amount of fuel. △ Less

Submitted 27 June, 2022; originally announced June 2022.

Comments: 38 pages, 12 figures, 11 tables

arXiv:2205.00563 [pdf, other]

QC-LDPC Codes from Difference Matrices and Difference Covering Arrays

Authors: Diane Donovan, Asha Rao, Elif Üsküplü, E. Ş. Yazıcı

Abstract: We give a framework for generalizing LDPC code constructions that use Transversal Designs or related structures such as mutually orthogonal Latin squares. Our construction offers a broader range of code lengths and codes rates. Similar earlier constructions rely on the existence of finite fields of order a power of a prime. In contrast the LDPC codes constructed here are based on difference matric… ▽ More We give a framework for generalizing LDPC code constructions that use Transversal Designs or related structures such as mutually orthogonal Latin squares. Our construction offers a broader range of code lengths and codes rates. Similar earlier constructions rely on the existence of finite fields of order a power of a prime. In contrast the LDPC codes constructed here are based on difference matrices and difference covering arrays, structures available for any order $a$. They satisfy the RC constraint and have, for $a$ odd, length $a^2$ and rate $1-\frac{4a-3}{a^2}$, and for $a$ even, length $a^2-a$ and rate at least $1-\frac{4a-6}{a^2-a}$. When $3$ does not divide $a$, these LDPC codes have stop** distance at least $8$. When $a$ is odd and both $3$ and $5$ do not divide $a$, our construction delivers an infinite family of QC-LDPC codes with minimum distance at least $10$. The simplicity of the construction allows us to theoretically verify these properties and analytically determine lower bounds for the minimum distance and stop** distance of the code. The BER and FER performance of our codes over AWGN (via simulation) is at the least equivalent to codes constructed previously, while in some cases significantly outperforming them. △ Less

Submitted 1 May, 2022; originally announced May 2022.

arXiv:2203.11394 [pdf, ps, other]

Minimum-Time Reorientation of Axisymmetric Rigid Spacecraft Using Three Controls

Authors: Elisha R. Pager, Anil V. Rao

Abstract: A minimum-time reorientation of an axisymmetric rigid spacecraft controlled by three torques is studied. The orientation of the body is modeled such that the attitude kinematics are representative of a spin-stabilized spacecraft. The optimal control problem considered is shown to have a switching control structure. Moreover, under certain assumptions, the solutions contain segments that lie on a s… ▽ More A minimum-time reorientation of an axisymmetric rigid spacecraft controlled by three torques is studied. The orientation of the body is modeled such that the attitude kinematics are representative of a spin-stabilized spacecraft. The optimal control problem considered is shown to have a switching control structure. Moreover, under certain assumptions, the solutions contain segments that lie on a singular arc. A numerical optimization study is performed using a recently developed method that is designed to accurately solve bang-bang and singular optimal control problems. The optimality conditions for the resulting optimal control problem are derived and analyzed for a variety of cases. Also, the results obtained in this study are compared to a previous method existing in the literature. The key features of the optimized trajectories and controls are identified, and the aforementioned method for solving bang-bang and singular optimal control problems is shown to efficiently and accurately solve the problem under consideration. △ Less

Submitted 21 March, 2022; originally announced March 2022.

Comments: 30 pages, 11 figures, 4 tables

arXiv:2202.00304 [pdf, ps, other]

Rank two bundles on P^n with isolated cohomology

Authors: F. Malaspina, A. P. Rao

Abstract: The purpose of this paper is to study minimal monads associated to a rank two vector bundle $\mathcal E$ on $\mathbb P^n$. In particular, we study situations where $\mathcal E$ has $H^i_*(\mathcal E) =0$ for $1<i<n-1$, except for one pair of values $(k,n-k)$. We show that on $\mathbb P^8,$ if $H^3_*(\mathcal E)=H^4_*(\mathcal E)=0$, then $\mathcal E$ must be decomposable. More generally, we show t… ▽ More The purpose of this paper is to study minimal monads associated to a rank two vector bundle $\mathcal E$ on $\mathbb P^n$. In particular, we study situations where $\mathcal E$ has $H^i_*(\mathcal E) =0$ for $1<i<n-1$, except for one pair of values $(k,n-k)$. We show that on $\mathbb P^8,$ if $H^3_*(\mathcal E)=H^4_*(\mathcal E)=0$, then $\mathcal E$ must be decomposable. More generally, we show that for $n\geq 4k$, there is no indecomposable bundle $\mathcal E$ for which all intermediate cohomology modules except for $H^1_*, H^k_*, H^{n-k}_*, H^{n-1}_*$ are zero. △ Less

Submitted 1 February, 2022; originally announced February 2022.

Comments: 14 pages, no figures

arXiv:2201.01374 [pdf, ps, other]

Anti-concentration and the Exact Gap-Hamming Problem

Authors: Anup Rao, Amir Yehudayoff

Abstract: We prove anti-concentration bounds for the inner product of two independent random vectors, and use these bounds to prove lower bounds in communication complexity. We show that if $A,B$ are subsets of the cube $\{\pm 1\}^n$ with $|A| \cdot |B| \geq 2^{1.01 n}$, and $X \in A$ and $Y \in B$ are sampled independently and uniformly, then the inner product $\langle X,Y \rangle$ takes on any fixed value… ▽ More We prove anti-concentration bounds for the inner product of two independent random vectors, and use these bounds to prove lower bounds in communication complexity. We show that if $A,B$ are subsets of the cube $\{\pm 1\}^n$ with $|A| \cdot |B| \geq 2^{1.01 n}$, and $X \in A$ and $Y \in B$ are sampled independently and uniformly, then the inner product $\langle X,Y \rangle$ takes on any fixed value with probability at most $O(1/\sqrt{n})$. In fact, we prove the following stronger "smoothness" statement: $$ \max_{k } \big| \Pr[\langle X,Y \rangle = k] - \Pr[\langle X,Y \rangle = k+4]\big| \leq O(1/n).$$ We use these results to prove that the exact gap-hamming problem requires linear communication, resolving an open problem in communication complexity. We also conclude anti-concentration for structured distributions with low entropy. If $x \in \mathcal{Z}^n$ has no zero coordinates, and $B \subseteq \{\pm 1\}^n$ corresponds to a subspace of $\mathcal{F}_2^n$ of dimension $0.51n$, then $\max_k \Pr[\langle x,Y \rangle = k] \leq O(\sqrt{\ln (n)/n})$. △ Less

Submitted 4 January, 2022; originally announced January 2022.

arXiv:2111.07115 [pdf, ps, other]

Weighted uniform Diophantine approximation of systems of linear forms

Authors: Dmitry Kleinbock, Anurag Rao

Abstract: Following the development of weighted asymptotic approximation properties of matrices, we introduce the analogous uniform approximation properties (that is, study the improvability of Dirichlet's Theorem). An added feature is the use of general norms, rather than the supremum norm, to quantify the approximation. In terms of homogeneous dynamics, the approximation properties of an $m \times n$ matr… ▽ More Following the development of weighted asymptotic approximation properties of matrices, we introduce the analogous uniform approximation properties (that is, study the improvability of Dirichlet's Theorem). An added feature is the use of general norms, rather than the supremum norm, to quantify the approximation. In terms of homogeneous dynamics, the approximation properties of an $m \times n$ matrix are governed by a trajectory in $\mathrm{SL}_{m+n}({\mathbb R})/\mathrm{SL}_{m+n}({\mathbb Z})$ avoiding a compact subset of the space of lattices called the critical locus defined with respect to the corresponding norm. The trajectory is formed by the action of a one-parameter diagonal subgroup corresponding to the weights. We first state a very precise form of Dirichlet's theorem and prove it for some norms. Secondly we show, for these same norms, that the set of Dirichlet-improvable matrices has full Hausdorff dimension. Though the techniques used vary greatly depending on the chosen norm, we expect these results to hold in general. △ Less

Submitted 23 February, 2022; v1 submitted 13 November, 2021; originally announced November 2021.

Comments: 12 pages; Theorem 1.7 added, several misprints corrected

MSC Class: 11J13; 11J83; 11H06; 37A17

arXiv:2110.04523 [pdf, other]

An Empirical Study on Compressed Decentralized Stochastic Gradient Algorithms with Overparameterized Models

Authors: Arjun Ashok Rao, Hoi-To Wai

Abstract: This paper considers decentralized optimization with application to machine learning on graphs. The growing size of neural network (NN) models has motivated prior works on decentralized stochastic gradient algorithms to incorporate communication compression. On the other hand, recent works have demonstrated the favorable convergence and generalization properties of overparameterized NNs. In this w… ▽ More This paper considers decentralized optimization with application to machine learning on graphs. The growing size of neural network (NN) models has motivated prior works on decentralized stochastic gradient algorithms to incorporate communication compression. On the other hand, recent works have demonstrated the favorable convergence and generalization properties of overparameterized NNs. In this work, we present an empirical analysis on the performance of compressed decentralized stochastic gradient (DSG) algorithms with overparameterized NNs. Through simulations on an MPI network environment, we observe that the convergence rates of popular compressed DSG algorithms are robust to the size of NNs. Our findings suggest a gap between theories and practice of the compressed DSG algorithms in the existing literature. △ Less

Submitted 9 October, 2021; originally announced October 2021.

Comments: 7 pages, 6 figures, accepted to APSIPA 2021

arXiv:2107.10679 [pdf, other]

Multilevel Contours on Bundles of Complex Planes

Authors: Arni S. R. Srinivasa Rao

Abstract: A new concept called multilevel contours is introduced through this article by the author. Theorems on contours constructed on a bundle of complex planes are stated and proved. Multilevel contours can transport information from one complex plane to another. Within a random environment, the behavior of contours and multilevel contours passing through the bundles of complex planes are studied. Furth… ▽ More A new concept called multilevel contours is introduced through this article by the author. Theorems on contours constructed on a bundle of complex planes are stated and proved. Multilevel contours can transport information from one complex plane to another. Within a random environment, the behavior of contours and multilevel contours passing through the bundles of complex planes are studied. Further properties of contours by a removal process of the data are studied. The concept of 'islands' and 'holes' within a bundle is introduced through this article. These all constructions help to understand the dynamics of the set of points of the bundle. Further research on the topics introduced here will be followed up by the author. These include closed approximations of the multilevel contour formations and their removal processes. The ideas and results presented in this article are novel. △ Less

Submitted 21 July, 2021; originally announced July 2021.

Comments: 76 pages and 13 Figures

MSC Class: 32L05; 60K3; 32H02

arXiv:2107.10298 [pdf, ps, other]

doi 10.2140/moscow.2022.11.97

Abundance of Dirichlet-improvable pairs with respect to arbitrary norms

Authors: Dmitry Kleinbock, Anurag Rao

Abstract: In a recent paper of Akhunzhanov and Shatskov the two-dimensional Dirichlet spectrum with respect to Euclidean norm was defined. We consider an analogous definition for arbitrary norms on $\mathbb{R}^2$ and prove that, for each such norm, the set of Dirichlet improvable pairs contains the set of badly approximable pairs, hence is hyperplane absolute winning. To prove this we make a careful study o… ▽ More In a recent paper of Akhunzhanov and Shatskov the two-dimensional Dirichlet spectrum with respect to Euclidean norm was defined. We consider an analogous definition for arbitrary norms on $\mathbb{R}^2$ and prove that, for each such norm, the set of Dirichlet improvable pairs contains the set of badly approximable pairs, hence is hyperplane absolute winning. To prove this we make a careful study of some classical results in the geometry of numbers due to Chalk--Rogers and Mahler to establish a Hajós--Minkowski type result for the critical locus of a cylinder. As a corollary, using a recent result of the first named author with Mirzadeh, we conclude that for any norm on $\mathbb{R}^2$ the top of the Dirichlet spectrum is not an isolated point. △ Less

Submitted 19 October, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

Comments: 16 pages, 3 figures; the new version has a stronger version of the main theorem with a simpler proof

MSC Class: 11J13; 11J83; 11H06; 37A17

Journal ref: Moscow J. Comb. Number Th. 11 (2022) 97-114

arXiv:2107.06309 [pdf, ps, other]

Tight bounds on the Fourier growth of bounded functions on the hypercube

Authors: Siddharth Iyer, Anup Rao, Victor Reis, Thomas Rothvoss, Amir Yehudayoff

Abstract: We give tight bounds on the degree $\ell$ homogenous parts $f_\ell$ of a bounded function $f$ on the cube. We show that if $f: \{\pm 1\}^n \rightarrow [-1,1]$ has degree $d$, then $\| f_\ell \|_\infty$ is bounded by $d^\ell/\ell!$, and $\| \hat{f}_\ell \|_1$ is bounded by $d^\ell e^{\binom{\ell+1}{2}} n^{\frac{\ell-1}{2}}$. We describe applications to pseudorandomness and learning theory. We use s… ▽ More We give tight bounds on the degree $\ell$ homogenous parts $f_\ell$ of a bounded function $f$ on the cube. We show that if $f: \{\pm 1\}^n \rightarrow [-1,1]$ has degree $d$, then $\| f_\ell \|_\infty$ is bounded by $d^\ell/\ell!$, and $\| \hat{f}_\ell \|_1$ is bounded by $d^\ell e^{\binom{\ell+1}{2}} n^{\frac{\ell-1}{2}}$. We describe applications to pseudorandomness and learning theory. We use similar methods to generalize the classical Pisier's inequality from convex analysis. Our analysis involves properties of real-rooted polynomials that may be useful elsewhere. △ Less

Submitted 19 July, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

arXiv:2106.14635 [pdf, ps, other]

Rao distances and Conformal Map**

Authors: Arni S. R. Srinivasa Rao, Steven G. Krantz

Abstract: In this article, we have described the Rao distance (due to C.R. Rao) and ideas of conformal map**s on 3D objects with angle preservations. Three propositions help us to construct distances between the points within the 3D objects in \mathbb{R}^{3} and line integrals within complex planes. We highlight the application of these concepts to virtual tourism. In this article, we have described the Rao distance (due to C.R. Rao) and ideas of conformal map**s on 3D objects with angle preservations. Three propositions help us to construct distances between the points within the 3D objects in \mathbb{R}^{3} and line integrals within complex planes. We highlight the application of these concepts to virtual tourism. △ Less

Submitted 11 June, 2021; originally announced June 2021.

Comments: 17 pages, 4 figures

MSC Class: 53B12; 30C20

Journal ref: Information Geometry (2021), Volume 45: pp43-56, Handbook of Statistics, Elsevier/North-Holland, Amsterdam

arXiv:2106.12199 [pdf, other]

doi 10.1137/21M1430005

Bayesian Joint Chance Constrained Optimization: Approximations and Statistical Consistency

Authors: Prateek Jaiswal, Harsha Honnappa, Vinayak A. Rao

Abstract: This paper considers data-driven chance-constrained stochastic optimization problems in a Bayesian framework. Bayesian posteriors afford a principled mechanism to incorporate data and prior knowledge into stochastic optimization problems. However, the computation of Bayesian posteriors is typically an intractable problem, and has spawned a large literature on approximate Bayesian computation. Here… ▽ More This paper considers data-driven chance-constrained stochastic optimization problems in a Bayesian framework. Bayesian posteriors afford a principled mechanism to incorporate data and prior knowledge into stochastic optimization problems. However, the computation of Bayesian posteriors is typically an intractable problem, and has spawned a large literature on approximate Bayesian computation. Here, in the context of chance-constrained optimization, we focus on the question of statistical consistency (in an appropriate sense) of the optimal value, computed using an approximate posterior distribution. To this end, we rigorously prove a frequentist consistency result demonstrating the convergence of the optimal value to the optimal value of a fixed, parameterized constrained optimization problem. We augment this by also establishing a probabilistic rate of convergence of the optimal value. We also prove the convex feasibility of the approximate Bayesian stochastic optimization problem. Finally, we demonstrate the utility of our approach on an optimal staffing problem for an M/M/c queueing model. △ Less

Submitted 30 September, 2022; v1 submitted 23 June, 2021; originally announced June 2021.

arXiv:2105.15129 [pdf]

Ancient Indian mathematics needs an honorific place in modern mathematics celebration

Authors: Steven G. Krantz, Arni S. R. Srinivasa Rao

Abstract: In point of fact the Indian tradition in mathematics is long and glorious. It dates back to earliest times, and indeed many of the Indian discoveries from 5000 years ago correspond rather naturally to modern mathematical results. In point of fact the Indian tradition in mathematics is long and glorious. It dates back to earliest times, and indeed many of the Indian discoveries from 5000 years ago correspond rather naturally to modern mathematical results. △ Less

Submitted 19 July, 2022; v1 submitted 25 May, 2021; originally announced May 2021.

Comments: 5 pages, 1 Table and 1 Figure

MSC Class: 01A32

arXiv:2105.06298 [pdf, ps, other]

PDE Models and Riemann-Stieltjes Integrals in Sustainability

Authors: Arni S. R. Srinivasa Rao, Sireesh Saride

Abstract: Understanding sustainability through modeling involves one of the complex and interdisciplinary activities where mathematics plays a key role. We provide arguments favoring the need for develo** global models for measuring the status of sustainability. A global model (applicable in broader perspective) and global sustainability indices are proposed which can be used with real-world data. The sol… ▽ More Understanding sustainability through modeling involves one of the complex and interdisciplinary activities where mathematics plays a key role. We provide arguments favoring the need for develo** global models for measuring the status of sustainability. A global model (applicable in broader perspective) and global sustainability indices are proposed which can be used with real-world data. The solutions of the proposed Partial Differential Equations (PDEs) are blended with the weight functions of Riemann Stieltjes integrals to capture the differential importance of sustainability associated factors. The ideas, methods, and models are new and are prepared for handling multi-dimensional and multi-variate data. A practically adaptable formula for measuring the sustainability index is developed with few key variables. We provide a real-world example arising in civil engineering applications with a numerical example to demonstrate our models. △ Less

Submitted 12 May, 2021; originally announced May 2021.

Comments: 27 pages and 5 Figures

MSC Class: 92D40; 35Q80; 26A42

Journal ref: Advances on Methodology and Applications of Statistics (Springer, 2021) - A Volume in Honor of C.R. Rao on the Occasion of his 100th Birthday

arXiv:2104.12296 [pdf, ps, other]

End-to-End Ascent-Entry Mission Performance Optimization Using Gaussian Quadrature Collocation

Authors: Alexander T. Miller, Anil V. Rao

Abstract: The performance optimization for a combined ascent-entry mission subject to constraints on heating rate and heating load is studied. The ascent vehicle is modeled as a three-stage rocket that places the vehicle onto a suborbital exo-atmopheric trajectory after which the vehicle undergoes an unpowered entry and descent to a vertically downward terminal condition. The entry vehicle is modeled as a h… ▽ More The performance optimization for a combined ascent-entry mission subject to constraints on heating rate and heating load is studied. The ascent vehicle is modeled as a three-stage rocket that places the vehicle onto a suborbital exo-atmopheric trajectory after which the vehicle undergoes an unpowered entry and descent to a vertically downward terminal condition. The entry vehicle is modeled as a high lift-to-drag ratio vehicle that is capable of withstanding high levels of thermal and structural loads. A performance index is designed to improve control margin while attenuating phugoid oscillations during atmospheric entry. Furthermore, a mission corresponding to a prototype launch and target point is used in this study. The trajectory optimization problem is formulated as a multiple-phase optimal control problem, and the optimal control problem is solved using an adaptive Gaussian quadrature collocation method. A key aspect of the optimized trajectories is that, for particular ranges of maximum allowable heating rate and heating load during entry, relatively small adjustments made during ascent can potentially decrease the control effort required during atmospheric entry. Outside of these ranges for maximum allowable heating rate and heating load, however, it is found that the required control effort increases and eventually saturates the commanded angle of attack upon initial descent. The key features of the optimized trajectories and controls are identified, and the approach developed in this paper provides a systematic method for end-to-end ascent-entry trajectory optimization. △ Less

Submitted 25 April, 2021; originally announced April 2021.

Comments: 38 pages, 20 figures, 10 tables

arXiv:2104.12247 [pdf, ps, other]

doi 10.1007/s10589-022-00350-6

Method for Solving Bang-Bang and Singular Optimal Control Problems using Adaptive Radau Collocation

Authors: Elisha R. Pager, Anil V. Rao

Abstract: A method is developed for solving bang-bang and singular optimal control problems using adaptive Legendre-Gauss-Radau (LGR) collocation. The method is divided into several parts. First, a structure detection method is developed that identifies switch times in the control and analyzes the corresponding switching function for segments where the solution is either bang-bang or singular. Second, after… ▽ More A method is developed for solving bang-bang and singular optimal control problems using adaptive Legendre-Gauss-Radau (LGR) collocation. The method is divided into several parts. First, a structure detection method is developed that identifies switch times in the control and analyzes the corresponding switching function for segments where the solution is either bang-bang or singular. Second, after the structure has been detected, the domain is decomposed into multiple domains such that the multiple-domain formulation includes additional decision variables that represent the switch times in the optimal control. In domains classified as bang-bang, the control is set to either its upper or lower limit. In domains identified as singular, the objective function is augmented with a regularization term to avoid the singular arc. An iterative procedure is then developed for singular domains to obtain a control that lies in close proximity to the singular control. The method is demonstrated on four examples, three of which have either a bang-bang and/or singular optimal control while the fourth has a smooth and nonsingular optimal control. The results demonstrate that the method of this paper provides accurate solutions to problems whose solutions are either bang-bang or singular when compared against previously developed mesh refinement methods that are not tailored for solving nonsmooth and/or singular optimal control problems, and produces results that are equivalent to those obtained using previously developed mesh refinement methods for optimal control problems whose solutions are smooth. △ Less

Submitted 26 January, 2022; v1 submitted 25 April, 2021; originally announced April 2021.

Comments: 37 pages, 6 figures, 5 tables To Appear in Computational Optimization and Applications

arXiv:2104.08972 [pdf, ps, other]

Nonsingular Euler Parameterizations for Motion of a Point Mass in Atmospheric Flight

Authors: Alexander T. Miller, Anil V. Rao

Abstract: Three parameterizations are developed for modeling translational motion of a point mass in atmosphere flight over a central rotating body. Unlike well-known parameterizations such as spherical coordinate parameterizations, where position and velocity are parameterized using a magnitude an an Euler angle rotation sequence, the method presented in this research employs Euler parameters. Consequently… ▽ More Three parameterizations are developed for modeling translational motion of a point mass in atmosphere flight over a central rotating body. Unlike well-known parameterizations such as spherical coordinate parameterizations, where position and velocity are parameterized using a magnitude an an Euler angle rotation sequence, the method presented in this research employs Euler parameters. Consequently, singularities and trigonometric functions are eliminated from the differential equations of motion. As a result, the new parameterizations presented in this paper offer computational advantages over standard parameterizations that employ Euler angle sequences. Finally, an example is studied where an atmospheric vehicle moves while in vertical flight, demonstrating the nonsingular nature of the formulations developed in this paper. △ Less

Submitted 18 April, 2021; originally announced April 2021.

Comments: 25 Pages, 3 Figures, 2 Tables. arXiv admin note: substantial text overlap with arXiv:2011.11158

arXiv:2102.02765

Online Discrepancy Minimization via Persistent Self-Balancing Walks

Authors: David Arbour, Drew Dimmery, Tung Mai, Anup Rao

Abstract: We study the online discrepancy minimization problem for vectors in $\mathbb{R}^d$ in the oblivious setting where an adversary is allowed fix the vectors $x_1, x_2, \ldots, x_n$ in arbitrary order ahead of time. We give an algorithm that maintains $O(\sqrt{\log(nd/δ)})$ discrepancy with probability $1-δ$, matching the lower bound given in [Bansal et al. 2020] up to an $O(\sqrt{\log \log n})$ facto… ▽ More We study the online discrepancy minimization problem for vectors in $\mathbb{R}^d$ in the oblivious setting where an adversary is allowed fix the vectors $x_1, x_2, \ldots, x_n$ in arbitrary order ahead of time. We give an algorithm that maintains $O(\sqrt{\log(nd/δ)})$ discrepancy with probability $1-δ$, matching the lower bound given in [Bansal et al. 2020] up to an $O(\sqrt{\log \log n})$ factor in the high-probability regime. We also provide results for the weighted and multi-color versions of the problem. △ Less

Submitted 5 February, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

Comments: The proof of Lemma 7 is incorrect. There is a serious issue that we don't know how to fix at the moment. We thank Yang, Nikhil and collaborators for bringing it to our attention

arXiv:2101.06309 [pdf, other]

Fundamental Tradeoffs in Distributionally Adversarial Training

Authors: Mohammad Mehrabi, Adel Javanmard, Ryan A. Rossi, Anup Rao, Tung Mai

Abstract: Adversarial training is among the most effective techniques to improve the robustness of models against adversarial perturbations. However, the full effect of this approach on models is not well understood. For example, while adversarial training can reduce the adversarial risk (prediction error against an adversary), it sometimes increase standard risk (generalization error when there is no adver… ▽ More Adversarial training is among the most effective techniques to improve the robustness of models against adversarial perturbations. However, the full effect of this approach on models is not well understood. For example, while adversarial training can reduce the adversarial risk (prediction error against an adversary), it sometimes increase standard risk (generalization error when there is no adversary). Even more, such behavior is impacted by various elements of the learning problem, including the size and quality of training data, specific forms of adversarial perturbations in the input, model overparameterization, and adversary's power, among others. In this paper, we focus on \emph{distribution perturbing} adversary framework wherein the adversary can change the test distribution within a neighborhood of the training data distribution. The neighborhood is defined via Wasserstein distance between distributions and the radius of the neighborhood is a measure of adversary's manipulative power. We study the tradeoff between standard risk and adversarial risk and derive the Pareto-optimal tradeoff, achievable over specific classes of models, in the infinite data limit with features dimension kept fixed. We consider three learning settings: 1) Regression with the class of linear models; 2) Binary classification under the Gaussian mixtures data model, with the class of linear classifiers; 3) Regression with the class of random features model (which can be equivalently represented as two-layer neural network with random first-layer weights). We show that a tradeoff between standard and adversarial risk is manifested in all three settings. We further characterize the Pareto-optimal tradeoff curves and discuss how a variety of factors, such as features correlation, adversary's power or the width of two-layer neural network would affect this tradeoff. △ Less

Submitted 15 January, 2021; originally announced January 2021.

Comments: 23 pages, 3 figures

arXiv:2101.05197 [pdf, ps, other]

PAC-Bayes Bounds on Variational Tempered Posteriors for Markov Models

Authors: Imon Banerjee, Vinayak A. Rao, Harsha Honnappa

Abstract: Datasets displaying temporal dependencies abound in science and engineering applications, with Markov models representing a simplified and popular view of the temporal dependence structure. In this paper, we consider Bayesian settings that place prior distributions over the parameters of the transition kernel of a Markov model, and seeks to characterize the resulting, typically intractable, poster… ▽ More Datasets displaying temporal dependencies abound in science and engineering applications, with Markov models representing a simplified and popular view of the temporal dependence structure. In this paper, we consider Bayesian settings that place prior distributions over the parameters of the transition kernel of a Markov model, and seeks to characterize the resulting, typically intractable, posterior distributions. We present a PAC-Bayesian analysis of variational Bayes (VB) approximations to tempered Bayesian posterior distributions, bounding the model risk of the VB approximations. Tempered posteriors are known to be robust to model misspecification, and their variational approximations do not suffer the usual problems of over confident approximations. Our results tie the risk bounds to the mixing and ergodic properties of the Markov data generating model. We illustrate the PAC-Bayes bounds through a number of example Markov models, and also consider the situation where the Markov model is misspecified. △ Less

Submitted 13 January, 2021; originally announced January 2021.

Comments: 14 pages main, 24 pages appendix and citations

arXiv:2012.04116 [pdf, other]

Minimum-Time Earth-to-Mars Interplanetary Orbit Transfer Using Adaptive Gaussian Quadrature Collocation

Authors: Brittanny V. Holden, Shan He, Anil V. Rao

Abstract: The problem of minimum-time, low-thrust, Earth-to-Mars interplanetary orbital trajectory optimization is considered. The minimum-time orbital transfer problem is modeled as a four-phase optimal control problem where the four phases correspond to planetary alignment, Earth escape, heliocentric transfer, and Mars capture. The four-phase optimal control problem is then solved using a direct collocati… ▽ More The problem of minimum-time, low-thrust, Earth-to-Mars interplanetary orbital trajectory optimization is considered. The minimum-time orbital transfer problem is modeled as a four-phase optimal control problem where the four phases correspond to planetary alignment, Earth escape, heliocentric transfer, and Mars capture. The four-phase optimal control problem is then solved using a direct collocation adaptive Gaussian quadrature collocation method. The following three models are used in the study: (1) circular planetary motion; (2) elliptic planetary motion; and (3) elliptic planetary motion with gravity perturbations, where the transfer begins in a geostationary orbit and terminates in a Mars-stationary orbit. Results for all three cases are provided, and one particular case is studied in detail to show the key features of the optimal solutions. Using the particular value thrust specific force of $0.00098\times 10^{-4}~\textrm{m}\cdot\textrm{s}^{-2}$, it was found that the minimum times for cases (1), (2), and (3) are, respectively, 215 d, 196 d, and 198 d with departure dates, respectively, of 1 July 2020, 30 June 2020, and 28 June 2020. Finally, the problem formulation developed in this study is compared against prior work on an Earth-to-Mars interplanetary orbit transfer where it is found that the results of this research show significant improvement in transfer time relative to the prior work. △ Less

Submitted 6 April, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

Comments: 41 pages, 8 figures, 7 tables

arXiv:2011.11158 [pdf, other]

Nonsingular Parameterization for Modeling Translational Motion Using Euler Parameters

Authors: Alexander T. Miller, Anil V. Rao

Abstract: A parameterization is described for quantifying translational motion of a point in three-dimensional Euclidean space. The parameterization is similar to well-known parameterizations such as spherical coordinates in that both position and velocity are decoupled into magnitude and orientation components. Unlike these standard parameterizations, where principal rotation sequences are employed, the me… ▽ More A parameterization is described for quantifying translational motion of a point in three-dimensional Euclidean space. The parameterization is similar to well-known parameterizations such as spherical coordinates in that both position and velocity are decoupled into magnitude and orientation components. Unlike these standard parameterizations, where principal rotation sequences are employed, the method presented in this research employs Euler parameters. By using Euler parameters instead of Euler angles, singularities and trigonometric functions are removed from the equations of motion. The parameterization is demonstrated on two examples, where it is found that the new parameterization offers both mathematical and computational advantages over other commonly used parameterizations. △ Less

Submitted 22 November, 2020; originally announced November 2020.

Comments: 14 pages, 3 figures

arXiv:2009.10754 [pdf, ps, other]

An Elementary Exposition of Pisier's Inequality

Authors: Siddharth Iyer, Anup Rao, Victor Reis, Thomas Rothvoss, Amir Yehudayoff

Abstract: Pisier's inequality is central in the study of normed spaces and has important applications in geometry. We provide an elementary proof of this inequality, which avoids some non-constructive steps from previous proofs. Our goal is to make the inequality and its proof more accessible, because we think they will find additional applications. We demonstrate this with a new type of restriction on the… ▽ More Pisier's inequality is central in the study of normed spaces and has important applications in geometry. We provide an elementary proof of this inequality, which avoids some non-constructive steps from previous proofs. Our goal is to make the inequality and its proof more accessible, because we think they will find additional applications. We demonstrate this with a new type of restriction on the Fourier spectrum of bounded functions on the discrete cube. △ Less

Submitted 22 September, 2020; originally announced September 2020.

arXiv:2007.10501 [pdf, ps, other]

A Warm Start Method for Solving Chance Constrained Optimal Control Problems

Authors: Rachel E. Kiel, Mrinal Kumar, Anil V. Rao

Abstract: A warm start method is developed for efficiently solving complex chance constrained optimal control problems. The warm start method addresses the computational challenges of solving chance constrained optimal control problems using biased kernel density estimators and Legendre-Gauss-Radau collocation with an $hp$ adaptive mesh refinement method. To address the computational challenges, the warm st… ▽ More A warm start method is developed for efficiently solving complex chance constrained optimal control problems. The warm start method addresses the computational challenges of solving chance constrained optimal control problems using biased kernel density estimators and Legendre-Gauss-Radau collocation with an $hp$ adaptive mesh refinement method. To address the computational challenges, the warm start method improves both the starting point for the chance constrained optimal control problem, as well as the efficiency of cycling through mesh refinement iterations. The improvement is accomplished by tuning a parameter of the kernel density estimator, as well as implementing a kernel switch as part of the solution process. Additionally, the number of samples for the biased kernel density estimator is set to incrementally increase through a series of mesh refinement iterations. Thus, the warm start method is a combination of tuning a parameter, a kernel switch, and an incremental increase in sample size. This warm start method is successfully applied to solve two challenging chance constrained optimal control problems in a computationally efficient manner using biased kernel density estimators and Legendre-Gauss-Radau collocation. △ Less

Submitted 20 July, 2020; originally announced July 2020.

Comments: 34 pages, 6 Figures, 8 Tables

arXiv:2003.13829 [pdf, ps, other]

Critical loci of convex domains in the plane

Authors: Dmitry Kleinbock, Anurag Rao, Srinivasan Sathiamurthy

Abstract: Let $K$ be a bounded convex domain in $\mathbb{R}^2$ symmetric about the origin. The critical locus of $K$ is defined to be the (non-empty compact) set of lattices $Λ$ in $\mathbb{R}^2$ of smallest possible covolume such that $Λ\cap K= \lbrace 0\rbrace$. These are classical objects in geometry of numbers; yet all previously known examples of critical loci were either finite sets or finite unions o… ▽ More Let $K$ be a bounded convex domain in $\mathbb{R}^2$ symmetric about the origin. The critical locus of $K$ is defined to be the (non-empty compact) set of lattices $Λ$ in $\mathbb{R}^2$ of smallest possible covolume such that $Λ\cap K= \lbrace 0\rbrace$. These are classical objects in geometry of numbers; yet all previously known examples of critical loci were either finite sets or finite unions of closed curves. In this paper we give a new construction which, in particular, furnishes examples of domains having critical locus of arbitrary Hausdorff dimension between $0$ and $1$. △ Less

Submitted 11 January, 2021; v1 submitted 30 March, 2020; originally announced March 2020.

Comments: new section added

arXiv:2003.11676 [pdf, ps, other]

Mesh Refinement Method for Solving Optimal Control Problems with Nonsmooth Solutions Using Jump Function Approximations

Authors: Alexander T. Miller, WIlliam W. Hager, Anil V. Rao

Abstract: A mesh refinement method is described for solving optimal control problems using Legendre-Gauss-Radau collocation. The method detects discontinuities in the control solution by employing an edge detection scheme based on jump function approximations. When discontinuities are identified, the mesh is refined with a targeted $h$-refinement approach whereby the discontinuity locations are bracketed wi… ▽ More A mesh refinement method is described for solving optimal control problems using Legendre-Gauss-Radau collocation. The method detects discontinuities in the control solution by employing an edge detection scheme based on jump function approximations. When discontinuities are identified, the mesh is refined with a targeted $h$-refinement approach whereby the discontinuity locations are bracketed with mesh points. The remaining smooth portions of the mesh are refined using previously developed techniques. The method is demonstrated on two examples, and results indicate that the method solves optimal control problems with discontinuous control solutions using fewer mesh refinement iterations and less computation time when compared with previously developed methods. △ Less

Submitted 25 March, 2020; originally announced March 2020.

Comments: 22 Pages, 8 Figures, 0 Tables

arXiv:2003.08010 [pdf, ps, other]

Method for Chance Constrained Optimal Control Using Biased Kernel Density Estimators

Authors: Rachel E. Keil, Alexander T. Miller, Mrinal Kumar, Anil V. Rao

Abstract: A method is developed to numerically solve chance constrained optimal control problems. The chance constraints are reformulated as nonlinear constraints that retain the probability properties of the original constraint. The reformulation transforms the chance constrained optimal control problem into a deterministic optimal control problem that can be solved numerically. The new method developed in… ▽ More A method is developed to numerically solve chance constrained optimal control problems. The chance constraints are reformulated as nonlinear constraints that retain the probability properties of the original constraint. The reformulation transforms the chance constrained optimal control problem into a deterministic optimal control problem that can be solved numerically. The new method developed in this paper approximates the chance constraints using Markov Chain Monte Carlo (MCMC) sampling and kernel density estimators whose kernels have integral functions that bound the indicator function. The nonlinear constraints resulting from the application of kernel density estimators are designed with bounds that do not violate the bounds of the original chance constraint. The method is tested on a non-trivial chance constrained modification of a soft lunar landing optimal control problem and the results are compared with results obtained using a conservative deterministic formulation of the optimal control problem. The results show that this new method efficiently solves chance constrained optimal control problems. △ Less

Submitted 27 May, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

Comments: 32 pages, 4 figures, 1 table

arXiv:2001.06976 [pdf, ps, other]

doi 10.1007/978-981-15-1611-5_13

The quotient Unimodular Vector group is nilpotent

Authors: Reema Khanna, Selby Jose, Sampat Sharma, Ravi A. Rao

Abstract: Jose-Rao introduced and studied the Special Unimodular Vector group $SUm_r(R)$ and $EUm_r(R)$, its Elementary Unimodular Vector subgroup. They proved that for $r \geq 2$, $EUm_r(R)$ is a normal subgroup of $SUm_r(R)$. The Jose-Rao theorem says that the quotient Unimodular Vector group, $SUm_r(R)/EUm_r(R)$, for $r \geq 2$, is a subgroup of the orthogonal quotient group… ▽ More Jose-Rao introduced and studied the Special Unimodular Vector group $SUm_r(R)$ and $EUm_r(R)$, its Elementary Unimodular Vector subgroup. They proved that for $r \geq 2$, $EUm_r(R)$ is a normal subgroup of $SUm_r(R)$. The Jose-Rao theorem says that the quotient Unimodular Vector group, $SUm_r(R)/EUm_r(R)$, for $r \geq 2$, is a subgroup of the orthogonal quotient group $SO_{2(r+1)}(R)/EO_{2(r + 1)}(R)$. The latter group is known to be nilpotent by the work of Hazrat-Vavilov, following methods of A. Bak; and so is the former. In this article we give a direct proof, following ideas of A. Bak, to show that the quotient Unimodular Vector group is nilpotent of class $\leq d = \dim(R)$. We also use the Quillen-Suslin theory, inspired by A. Bak's method, to prove that if $R = A[X]$, with $A$ a local ring, then the quotient Unimodular Vector group is abelian. △ Less

Submitted 20 January, 2020; originally announced January 2020.

Journal ref: Leavitt Path algebra and Classical K-Theory, (2020) 225-240. Indian statistical institute series. Springer, Singapore

arXiv:1910.00126 [pdf, ps, other]

A zero-one law for uniform Diophantine approximation in Euclidean norm

Authors: Dmitry Kleinbock, Anurag Rao

Abstract: We study a norm sensitive Diophantine approximation problem arising from the work of Davenport and Schmidt on the improvement of Dirichlet's theorem. Its supremum norm case was recently considered by the first-named author and Wadleigh, and here we extend the set-up by replacing the supremum norm with an arbitrary norm. This gives rise to a class of shrinking target problems for one-parameter diag… ▽ More We study a norm sensitive Diophantine approximation problem arising from the work of Davenport and Schmidt on the improvement of Dirichlet's theorem. Its supremum norm case was recently considered by the first-named author and Wadleigh, and here we extend the set-up by replacing the supremum norm with an arbitrary norm. This gives rise to a class of shrinking target problems for one-parameter diagonal flows on the space of lattices, with the targets being neighborhoods of the critical locus of a suitably scaled norm ball. We use methods from geometry of numbers and dynamics to generalize a result due to Andersen and Duke on measure zero and uncountability of the set of numbers for which Minkowski approximation theorem can be improved. The choice of the Euclidean norm on $\mathbb{R}^2$ corresponds to studying geodesics on a hyperbolic surface which visit a decreasing family of balls. An application of a dynamical Borel-Cantelli lemma of Maucourant produces a zero-one law for improvement of Dirichlet's theorem in Euclidean norm. △ Less

Submitted 18 August, 2020; v1 submitted 30 September, 2019; originally announced October 2019.

Comments: 27 pages; Theorem 1.3 replaced by a stronger version, new section and more detail of the proof added

MSC Class: 11J04; 11J13; 37A17; 37D40

arXiv:1909.04774 [pdf, other]

Coding for Sunflowers

Authors: Anup Rao

Abstract: A sunflower is a family of sets that have the same pairwise intersections. We simplify a recent result of Alweiss, Lovett, Wu and Zhang that gives an upper bound on the size of every family of sets of size $k$ that does not contain a sunflower. We show how to use the converse of Shannon's noiseless coding theorem to give a cleaner proof of their result. A sunflower is a family of sets that have the same pairwise intersections. We simplify a recent result of Alweiss, Lovett, Wu and Zhang that gives an upper bound on the size of every family of sets of size $k$ that does not contain a sunflower. We show how to use the converse of Shannon's noiseless coding theorem to give a cleaner proof of their result. △ Less

Submitted 25 February, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

Comments: Revised version includes an improved bound. This version is published by Discrete Analysis

arXiv:1909.03326 [pdf, ps, other]

Modified Legendre-Gauss-Radau Collocation Method for Solving Optimal Control Problems with Nonsmooth Solutions

Authors: Joseph D. Eide, William W. Hager, Anil V. Rao

Abstract: A new method is developed for solving optimal control problems whose solutions are nonsmooth. The method developed in this paper employs a modified form of the Legendre-Gauss-Radau orthogonal direct collocation method. This modified Legendre-Gauss-Radau method adds two variables and two constraints at the end of a mesh interval when compared with a previously developed standard Legendre-Gauss-Rada… ▽ More A new method is developed for solving optimal control problems whose solutions are nonsmooth. The method developed in this paper employs a modified form of the Legendre-Gauss-Radau orthogonal direct collocation method. This modified Legendre-Gauss-Radau method adds two variables and two constraints at the end of a mesh interval when compared with a previously developed standard Legendre-Gauss-Radau collocation method. The two additional variables are the time at the interface between two mesh intervals and the control at the end of each mesh interval. The two additional constraints are a collocation condition for those differential equations that depend upon the control and an inequality constraint on the control at the endpoint of each mesh interval. The additional constraints modify the search space of the nonlinear programming problem such that an accurate approximation to the location of the nonsmoothness is obtained. The transformed adjoint system of the modified Legendre-Gauss-Radau method is then developed. Using this transformed adjoint system, a method is developed to transform the Lagrange multipliers of the nonlinear programming problem to the costate of the optimal control problem. Furthermore, it is shown that the costate estimate satisfies one of the Weierstrass-Erdmann optimality conditions. Finally, the method developed in this paper is demonstrated on an example whose solution is nonsmooth. △ Less

Submitted 8 November, 2020; v1 submitted 7 September, 2019; originally announced September 2019.

Comments: 36 pages, 8 figures

arXiv:1905.12745 [pdf, other]

doi 10.2514/1.J058514

Comparison of Derivative Estimation Methods in Solving Optimal Control Problems Using Direct Collocation

Authors: Yunus M. Agamawi, Anil V. Rao

Abstract: A study is conducted to evaluate four derivative estimation methods when solving a large sparse nonlinear programming problem that arises from the approximation of an optimal control problem using a direct collocation method. In particular, the Taylor series-based finite-difference, bicomplex-step, and hyper-dual derivative estimation methods are evaluated and compared alongside a well known autom… ▽ More A study is conducted to evaluate four derivative estimation methods when solving a large sparse nonlinear programming problem that arises from the approximation of an optimal control problem using a direct collocation method. In particular, the Taylor series-based finite-difference, bicomplex-step, and hyper-dual derivative estimation methods are evaluated and compared alongside a well known automatic differentiation method. The performance of each derivative estimation method is assessed based on the number of iterations, the computation time per iteration, and the total computation time required to solve the nonlinear programming problem. The efficiency of each of the four derivative estimation methods is compared by solving three benchmark optimal control problems. It is found that while central finite-differencing is typically more efficient per iteration than either the hyper-dual or bicomplex-step, the latter two methods have significantly lower overall computation times due to the fact that fewer iterations are required by the nonlinear programming problem when compared with central finite-differencing. Furthermore, while the bicomplex-step and hyper-dual methods are similar in performance, the hyper-dual method is significantly easier to implement. Moreover, the automatic differentiation method is found to be substantially less computationally efficient than any of the three Taylor series-based methods. The results of this study show that the hyper-dual method offers several benefits over the other three methods both in terms of computational efficiency and ease of implementation. △ Less

Submitted 29 May, 2019; originally announced May 2019.

Comments: 26 pages, 2 figures, 3 tables

Journal ref: AIAA Journal 2020

arXiv:1905.11898 [pdf, other]

CGPOPS: A C++ Software for Solving Multiple-Phase Optimal Control Problems Using Adaptive Gaussian Quadrature Collocation and Sparse Nonlinear Programming

Authors: Yunus M. Agamawi, Anil V. Rao

Abstract: A general-purpose C++ software program called $\mathbb{CGPOPS}$ is described for solving multiple-phase optimal control problems using adaptive Gaussian quadrature collocation. The software employs a Legendre-Gauss-Radau direct orthogonal collocation method to transcribe the continuous-time optimal control problem into a large sparse nonlinear programming problem. A class of $hp$ mesh refinement m… ▽ More A general-purpose C++ software program called $\mathbb{CGPOPS}$ is described for solving multiple-phase optimal control problems using adaptive Gaussian quadrature collocation. The software employs a Legendre-Gauss-Radau direct orthogonal collocation method to transcribe the continuous-time optimal control problem into a large sparse nonlinear programming problem. A class of $hp$ mesh refinement methods are implemented which determine the number of mesh intervals and the degree of the approximating polynomial within each mesh interval to achieve a specified accuracy tolerance. The software is interfaced with the open source Newton NLP solver IPOPT. All derivatives required by the NLP solver are computed using either central finite differencing, bicomplex-step derivative approximation, hyper-dual derivative approximation, or automatic differentiation. The key components of the software are described in detail and the utility of the software is demonstrated on five optimal control problems of varying complexity. The software described in this article provides a computationally efficient and accurate approach for solving a wide variety of complex constrained optimal control problems. △ Less

Submitted 28 May, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

Comments: 38 pages, 15 figures, 19 tables

arXiv:1905.11895 [pdf, other]

Mesh Refinement Method for Solving Bang-Bang Optimal Control Problems Using Direct Collocation

Authors: Yunus M. Agamawi, William W. Hager, Anil V. Rao

Abstract: A mesh refinement method is developed for solving bang-bang optimal control problems using direct collocation. The method starts by finding a solution on a coarse mesh. Using this initial solution, the method then determines automatically if the Hamiltonian is linear with respect to the control, and, if so, estimates the locations of the discontinuities in the control. The switch times are estimat… ▽ More A mesh refinement method is developed for solving bang-bang optimal control problems using direct collocation. The method starts by finding a solution on a coarse mesh. Using this initial solution, the method then determines automatically if the Hamiltonian is linear with respect to the control, and, if so, estimates the locations of the discontinuities in the control. The switch times are estimated by determining the roots of the switching functions, where the switching functions are determined using estimates of the state and costate obtained from the collocation method. The accuracy of the switch times is then improved on subsequent meshes by dividing the original optimal control problem into multiple domains and including variables that define the locations of the switch times. While in principle any collocation method can be used, in this research the previously developed Legendre-Gauss-Radau collocation method is employed because it provides an accurate approximation of the costate which in turn improves the approximation of the switching functions. The method of this paper is designed to be used with a previously developed mesh refinement method in order to accurately approximate the solution in segments where the solution is smooth. The method is demonstrated on three examples where it is shown to accurately determine the switching structure of a bang-bang optimal control problem. When compared with previously developed mesh refinement methods, the results demonstrate that the method developed in this paper improves computational efficiency when solving bang-bang optimal control problems. △ Less

Submitted 30 May, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

Comments: 22 pages, 9 figures, 3 tables,

arXiv:1903.05220 [pdf, other]

On the Statistical Consistency of Risk-Sensitive Bayesian Decision-Making

Authors: Prateek Jaiswal, Harsha Honnappa, Vinayak A. Rao

Abstract: We study data-driven decision-making problems in the Bayesian framework, where the expectation in the Bayes risk is replaced by a risk-sensitive entropic risk measure. We focus on problems where calculating the posterior distribution is intractable, a typical situation in modern applications with large datasets and complex data generating models. We leverage a dual representation of the entropic r… ▽ More We study data-driven decision-making problems in the Bayesian framework, where the expectation in the Bayes risk is replaced by a risk-sensitive entropic risk measure. We focus on problems where calculating the posterior distribution is intractable, a typical situation in modern applications with large datasets and complex data generating models. We leverage a dual representation of the entropic risk measure to introduce a novel risk-sensitive variational Bayesian (RSVB) framework for jointly computing a risk-sensitive posterior approximation and the corresponding decision rule. The proposed RSVB framework can be used to extract computational methods for doing risk-sensitive approximate Bayesian inference. We show that our general framework includes two well-known computational methods for doing approximate Bayesian inference viz. naive VB and loss-calibrated VB. We also study the impact of these computational approximations on the predictive performance of the inferred decision rules and values. We compute the convergence rates of the RSVB approximate posterior and also of the corresponding optimal value and decision rules. We illustrate our theoretical findings in both parametric and nonparametric settings with the help of three examples: the single and multi-product newsvendor model and Gaussian process classification. △ Less

Submitted 9 September, 2021; v1 submitted 12 March, 2019; originally announced March 2019.

arXiv:1902.01902 [pdf, other]

Asymptotic Consistency of $α-$Rényi-Approximate Posteriors

Authors: Prateek Jaiswal, Vinayak A. Rao, Harsha Honnappa

Abstract: We study the asymptotic consistency properties of $α$-Rényi approximate posteriors, a class of variational Bayesian methods that approximate an intractable Bayesian posterior with a member of a tractable family of distributions, the member chosen to minimize the $α$-Rényi divergence from the true posterior. Unique to our work is that we consider settings with $α> 1$, resulting in approximations th… ▽ More We study the asymptotic consistency properties of $α$-Rényi approximate posteriors, a class of variational Bayesian methods that approximate an intractable Bayesian posterior with a member of a tractable family of distributions, the member chosen to minimize the $α$-Rényi divergence from the true posterior. Unique to our work is that we consider settings with $α> 1$, resulting in approximations that upperbound the log-likelihood, and consequently have wider spread than traditional variational approaches that minimize the Kullback-Liebler (KL) divergence from the posterior. Our primary result identifies sufficient conditions under which consistency holds, centering around the existence of a 'good' sequence of distributions in the approximating family that possesses, among other properties, the right rate of convergence to a limit distribution. We further characterize the good sequence by demonstrating that a sequence of distributions that converges too quickly cannot be a good sequence. We also extend our analysis to the setting where $α$ equals one, corresponding to the minimizer of the reverse KL divergence, and to models with local latent variables. We also illustrate the existence of good sequence with a number of examples. Our results complement a growing body of work focused on the frequentist properties of variational Bayesian methods. △ Less

Submitted 14 August, 2020; v1 submitted 5 February, 2019; originally announced February 2019.

arXiv:1811.06510 [pdf, ps, other]

Anti-concentration in most directions

Authors: Anup Rao, Amir Yehudayoff

Abstract: We prove anti-concentration bounds for the inner product of two independent random vectors. For example, we show that if $A,B$ are subsets of the cube $\{\pm 1\}^n$ with $|A| \cdot |B| \geq 2^{1.01 n}$, and $X \in A$ and $Y \in B$ are sampled independently and uniformly, then the inner product $\langle X, Y \rangle$ takes on any fixed value with probability at most $O(\tfrac{1}{\sqrt{n}})$. Extend… ▽ More We prove anti-concentration bounds for the inner product of two independent random vectors. For example, we show that if $A,B$ are subsets of the cube $\{\pm 1\}^n$ with $|A| \cdot |B| \geq 2^{1.01 n}$, and $X \in A$ and $Y \in B$ are sampled independently and uniformly, then the inner product $\langle X, Y \rangle$ takes on any fixed value with probability at most $O(\tfrac{1}{\sqrt{n}})$. Extending Halász work, we prove stronger bounds when the choices for $x$ are unstructured. We also describe applications to communication complexity, randomness extraction and additive combinatorics. △ Less

Submitted 4 March, 2019; v1 submitted 15 November, 2018; originally announced November 2018.

Comments: 23 pages

MSC Class: 60C05; 68Q87

arXiv:1810.03329 [pdf, ps, other]

The Pillars of Relative Quillen--Suslin Theory

Authors: Rabeya Basu, Reema Khanna, Ravi A. Rao

Abstract: We deduce the relative version of the equivalences relating the relative Local Global Principle and the Normality of the relative Elementary subgroups of the traditional classical groups, viz. general linear, symplectic and orthogonal groups. This generalizes our previous result for the absolute case. We deduce the relative version of the equivalences relating the relative Local Global Principle and the Normality of the relative Elementary subgroups of the traditional classical groups, viz. general linear, symplectic and orthogonal groups. This generalizes our previous result for the absolute case. △ Less

Submitted 8 October, 2018; originally announced October 2018.

Comments: 12 pages

MSC Class: 13C10; 11E57; 11E70; 15A63; 19B10; 19B14

arXiv:1810.02959 [pdf, other]

doi 10.1145/3394486.3403045

Higher-order Spectral Clustering for Heterogeneous Graphs

Authors: Aldo G. Carranza, Ryan A. Rossi, Anup Rao, Eunyee Koh

Abstract: Higher-order connectivity patterns such as small induced sub-graphs called graphlets (network motifs) are vital to understand the important components (modules/functional units) governing the configuration and behavior of complex networks. Existing work in higher-order clustering has focused on simple homogeneous graphs with a single node/edge type. However, heterogeneous graphs consisting of node… ▽ More Higher-order connectivity patterns such as small induced sub-graphs called graphlets (network motifs) are vital to understand the important components (modules/functional units) governing the configuration and behavior of complex networks. Existing work in higher-order clustering has focused on simple homogeneous graphs with a single node/edge type. However, heterogeneous graphs consisting of nodes and edges of different types are seemingly ubiquitous in the real-world. In this work, we introduce the notion of typed-graphlet that explicitly captures the rich (typed) connectivity patterns in heterogeneous networks. Using typed-graphlets as a basis, we develop a general principled framework for higher-order clustering in heterogeneous networks. The framework provides mathematical guarantees on the optimality of the higher-order clustering obtained. The experiments demonstrate the effectiveness of the framework quantitatively for three important applications including (i) clustering, (ii) link prediction, and (iii) graph compression. In particular, the approach achieves a mean improvement of 43x over all methods and graphs for clustering while achieving a 18.7% and 20.8% improvement for link prediction and graph compression, respectively. △ Less

Submitted 6 October, 2018; originally announced October 2018.

arXiv:1803.03979 [pdf, ps, other]

doi 10.1016/j.jpaa.2018.02.031

Stability results for projective modules over Rees algebras

Authors: Ravi A. Rao, Husney Parvez Sarwar

Abstract: We provide a class of commutative Noetherian domains $R$ of dimension $d$ such that every finitely generated projective $R$-module $P$ of rank $d$ splits off a free summand of rank one. On this class, we also show that $P$ is cancellative. At the end we give some applications to the number of generators of a module over the Rees algebras. We provide a class of commutative Noetherian domains $R$ of dimension $d$ such that every finitely generated projective $R$-module $P$ of rank $d$ splits off a free summand of rank one. On this class, we also show that $P$ is cancellative. At the end we give some applications to the number of generators of a module over the Rees algebras. △ Less

Submitted 11 March, 2018; originally announced March 2018.

Comments: 10 pages, to appear in JPAA

MSC Class: Primary: 13C10; 19A13; Secondary: 13A30

arXiv:1703.08292 [pdf, ps, other]

Homotopy and Commutativity Principle

Authors: Ravi A. Rao, Sampat Sharma

Abstract: In this article, we prove commutativity principal for linear, symplectic and transvection groups. This principle is a consequence of Quillen-Suslin local global principle and using a non-symmetric application of it as done by A. Bak. The existence of a Local-Global Principle enables us to prove similar results in various groups. We restrict ourselves to the classical symplectic, orthogonal groups… ▽ More In this article, we prove commutativity principal for linear, symplectic and transvection groups. This principle is a consequence of Quillen-Suslin local global principle and using a non-symmetric application of it as done by A. Bak. The existence of a Local-Global Principle enables us to prove similar results in various groups. We restrict ourselves to the classical symplectic, orthogonal groups (and their relative versions); and to the automorphism groups of a projective module (with a unimodular element), a symplectic module (with ahyperbolic summand), and an orthogonal module (with a hyperbolic symmand). We could show that the symplectic quotients were abelian, but we could only establish that the orthogonal quotients are solvable of length atmost two. We do believe that the orthogonal quotient groups are also abelian; and prove this when the base ring is a regular local ring containing a field. △ Less

Submitted 18 January, 2020; v1 submitted 24 March, 2017; originally announced March 2017.

Showing 1–50 of 95 results for author: Rao, A