Skip to main content

Showing 1–26 of 26 results for author: Vidyasagar, M

.
  1. arXiv:2312.02828  [pdf, ps, other

    stat.ML cs.LG math.OC math.PR

    Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications

    Authors: Rajeeva L. Karandikar, M. Vidyasagar

    Abstract: In this paper, we study the convergence properties of the Stochastic Gradient Descent (SGD) method for finding a stationary point of a given objective function $J(\cdot)$. The objective function is not required to be convex. Rather, our results apply to a class of ``invex'' functions, which have the property that every stationary point is also a global minimizer. First, it is assumed that… ▽ More

    Submitted 12 May, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 33 pages, 2figures

    MSC Class: 62L20; 60G17; 93D05

  2. arXiv:2304.00803  [pdf, ps, other

    cs.LG eess.SY

    A Tutorial Introduction to Reinforcement Learning

    Authors: Mathukumalli Vidyasagar

    Abstract: In this paper, we present a brief survey of Reinforcement Learning (RL), with particular emphasis on Stochastic Approximation (SA) as a unifying theme. The scope of the paper includes Markov Reward Processes, Markov Decision Processes, Stochastic Approximation algorithms, and widely used algorithms such as Temporal Difference Learning and $Q$-learning.

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 32 pages, 3 figures

  3. arXiv:2303.16241  [pdf, ps, other

    math.OC stat.ML

    Convergence of Momentum-Based Heavy Ball Method with Batch Updating and/or Approximate Gradients

    Authors: Tadipatri Uday Kiran Reddy, Mathukumalli Vidyasagar

    Abstract: In this paper, we study the well-known "Heavy Ball" method for convex and nonconvex optimization introduced by Polyak in 1964, and establish its convergence under a variety of situations. Traditionally, most algorithms use "full-coordinate update," that is, at each step, every component of the argument is updated. However, when the dimension of the argument is very high, it is more efficient to up… ▽ More

    Submitted 10 June, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: 33 pages, 6 figures

  4. arXiv:2209.07028  [pdf, other

    stat.ME cs.LG math.PR math.ST stat.ML

    Estimating large causal polytrees from small samples

    Authors: Sourav Chatterjee, Mathukumalli Vidyasagar

    Abstract: We consider the problem of estimating a large causal polytree from a relatively small i.i.d. sample. This is motivated by the problem of determining causal structure when the number of variables is very large compared to the sample size, such as in gene regulatory networks. We give an algorithm that recovers the tree with high accuracy in such settings. The algorithm works under essentially no dis… ▽ More

    Submitted 29 March, 2024; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: 26 pages. An R package has been developed (see link in the article), and a real data example has been added

    MSC Class: 62D20

  5. arXiv:2209.05372  [pdf, other

    math.OC stat.ML

    Convergence of Batch Updating Methods with Approximate Gradients and/or Noisy Measurements: Theory and Computational Results

    Authors: Tadipatri Uday Kiran Reddy, M. Vidyasagar

    Abstract: In this paper, we present a unified and general framework for analyzing the batch updating approach to nonlinear, high-dimensional optimization. The framework encompasses all the currently used batch updating approaches, and is applicable to nonconvex as well as convex functions. Moreover, the framework permits the use of noise-corrupted gradients, as well as first-order approximations to the grad… ▽ More

    Submitted 27 January, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 21 pages, 4 figures

  6. arXiv:2205.01303  [pdf, ps, other

    stat.ML cs.LG math.PR

    Convergence of Stochastic Approximation via Martingale and Converse Lyapunov Methods

    Authors: M. Vidyasagar

    Abstract: In this paper, we study the almost sure boundedness and the convergence of the stochastic approximation (SA) algorithm. At present, most available convergence proofs are based on the ODE method, and the almost sure boundedness of the iterations is an assumption and not a conclusion. In Borkar-Meyn (2000), it is shown that if the ODE has only one globally attractive equilibrium, then under addition… ▽ More

    Submitted 9 January, 2023; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: 20 pages; dedicated to Prof. Eduardo Sontag on the occasion of his 70th birthday, and to Prof. Rajeeva L. Karandikar on his 65th birthday

    MSC Class: 60G44

  7. arXiv:2109.03445  [pdf, ps, other

    stat.ML cs.AI cs.LG eess.SY math.PR

    Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning

    Authors: Rajeeva L. Karandikar, M. Vidyasagar

    Abstract: Ever since its introduction in the classic paper of Robbins and Monro in 1951, Stochastic Approximation (SA) has become a standard tool for finding a solution of an equation of the form $f(θ) = 0$, when only noisy measurements of $f(\cdot)$ are available. In most situations, \textit{every component} of the putative solution $θ_t$ is updated at each step $t$. In some applications such as $Q$-learni… ▽ More

    Submitted 20 February, 2024; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 28 pages

  8. arXiv:2101.09158  [pdf, other

    q-bio.PE

    SUTRA: A Novel Approach to Modelling Pandemics with Applications to COVID-19

    Authors: Manindra Agrawal, Madhuri Kanitkar, Deepu Phillip, Tanima Hajra, Arti Singh, Avaneesh Singh, Prabal Pratap Singh, Mathukumalli Vidyasagar

    Abstract: The Covid-19 pandemic has two key properties: (i) asymptomatic cases (both detected and undetected) that can result in new infections, and (ii) time-varying characteristics due to new variants, Non-Pharmaceutical Interventions etc. We develop a model called SUTRA (Susceptible, Undetected though infected, Tested positive, and Removed Analysis) that takes into account both of these two key propertie… ▽ More

    Submitted 25 October, 2022; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: 38 pages, 20 figures, 5 tables

  9. arXiv:2006.00045  [pdf, other

    q-bio.PE physics.soc-ph

    Estimating Hidden Asymptomatics, Herd Immunity Threshold and Lockdown Effects using a COVID-19 Specific Model

    Authors: Shaurya Kaushal, Abhineet Singh Rajput, Soumyadeep Bhattacharya, M. Vidyasagar, Aloke Kumar, Meher K. Prakash, Santosh Ansumali

    Abstract: A quantitative COVID-19 model that incorporates hidden asymptomatic patients is developed, and an analytic solution in parametric form is given. The model incorporates the impact of lockdown and resulting spatial migration of population due to announcement of lockdown. A method is presented for estimating the model parameters from real-world data. It is shown that increase of infections slows down… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

  10. arXiv:1910.03937  [pdf, ps, other

    stat.ML cs.LG math.CO math.NT

    New and Explicit Constructions of Unbalanced Ramanujan Bipartite Graphs

    Authors: Shantanu Prasad Burnwal, Kaneenika Sinha, Mathukumalli Vidyasagar

    Abstract: The objectives of this article are three-fold. Firstly, we present for the first time explicit constructions of an infinite family of \textit{unbalanced} Ramanujan bigraphs. Secondly, we revisit some of the known methods for constructing Ramanujan graphs and discuss the computational work required in actually implementing the various construction methods. The third goal of this article is to addre… ▽ More

    Submitted 12 November, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: This paper is a partial replacement of 1910.03937v1. The phase transition part of 1910.03937v1 will be uploaded as a separate submission

  11. arXiv:1908.00963  [pdf, ps, other

    stat.ML cs.LG

    Deterministic Completion of Rectangular Matrices Using Asymmetric Ramanujan Graphs: Exact and Stable Recovery

    Authors: Shantanu Prasad Burnwal, Mathukumalli Vidyasagar

    Abstract: In this paper we study the matrix completion problem: Suppose $X \in {\mathbb R}^{n_r \times n_c}$ is unknown except for a known upper bound $r$ on its rank. By measuring a small number $m \ll n_r n_c$ of elements of $X$, is it possible to recover $X$ exactly with noise-free measurements, or to construct a good approximation of $X$ with noisy measurements? Existing solutions to these problems invo… ▽ More

    Submitted 21 May, 2020; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: The original submission 1908.00963 has been split into two parts. The replacement submission is Part-1 of the revised version. Part-2 can also be found on arXiv

    MSC Class: 68T05

  12. arXiv:1808.03001  [pdf, other

    stat.ML cs.LG

    Compressed Sensing Using Binary Matrices of Nearly Optimal Dimensions

    Authors: Mahsa Lotfi, Mathukumalli Vidyasagar

    Abstract: In this paper, we study the problem of compressed sensing using binary measurement matrices and $\ell_1$-norm minimization (basis pursuit) as the recovery algorithm. We derive new upper and lower bounds on the number of measurements to achieve robust sparse recovery with binary matrices. We establish sufficient conditions for a column-regular binary matrix to satisfy the robust null space property… ▽ More

    Submitted 26 April, 2020; v1 submitted 8 August, 2018; originally announced August 2018.

    Comments: 28 pages, 3 figures, 5 tables

  13. arXiv:1710.07973  [pdf, ps, other

    stat.ML

    An Approach to One-Bit Compressed Sensing Based on Probably Approximately Correct Learning Theory

    Authors: Mehmet Eren Ahsen, Mathukumalli Vidyasagar

    Abstract: In this paper, the problem of one-bit compressed sensing (OBCS) is formulated as a problem in probably approximately correct (PAC) learning. It is shown that the Vapnik-Chervonenkis (VC-) dimension of the set of half-spaces in $\mathbb{R}^n$ generated by $k$-sparse vectors is bounded below by $k \lg (n/k)$ and above by $2k \lg (n/k)$, plus some round-off terms. By coupling this estimate with well-… ▽ More

    Submitted 22 October, 2017; originally announced October 2017.

    Comments: 28 pages, 4 figures

  14. arXiv:1710.07952  [pdf, ps, other

    eess.SY

    CLOT Norm Minimization for Continuous Hands-off Control

    Authors: Niharika Challapalli, Masaaki Nagahara, Mathukumalli Vidyasagar

    Abstract: In this paper, we consider hands-off control via minimization of the CLOT (Combined $L$-One and Two) norm. The maximum hands-off control is the $L^0$-optimal (or the sparsest) control among all feasible controls that are bounded by a specified value and transfer the state from a given initial state to the origin within a fixed time duration. In general, the maximum hands-off control is a bang-off-… ▽ More

    Submitted 22 October, 2017; originally announced October 2017.

    Comments: 38 pages, 20 figures. enlarged version of arXiv:1611.02071

  15. arXiv:1708.03608  [pdf, ps, other

    cs.IT cs.LG

    A Fast Noniterative Algorithm for Compressive Sensing Using Binary Measurement Matrices

    Authors: Mahsa Lotfi, Mathukumalli Vidyasagar

    Abstract: In this paper we present a new algorithm for compressive sensing that makes use of binary measurement matrices and achieves exact recovery of ultra sparse vectors, in a single pass and without any iterations. Due to its noniterative nature, our algorithm is hundreds of times faster than $\ell_1$-norm minimization, and methods based on expander graphs, both of which require multiple iterations. Our… ▽ More

    Submitted 21 May, 2018; v1 submitted 11 August, 2017; originally announced August 2017.

    Comments: 24 pages, 4 tables

  16. arXiv:1611.02071  [pdf, ps, other

    eess.SY

    Continuous Hands-off Control by CLOT Norm Minimization

    Authors: Niharika Challapalli, Masaaki Nagahara, Mathukumalli Vidyasagar

    Abstract: In this paper, we consider hands-off control via minimization of the CLOT (Combined L-One and Two) norm. The maximum hands-off control is the L0-optimal (or the sparsest) control among all feasible controls that are bounded by a specified value and transfer the state from a given initial state to the origin within a fixed time duration. In general, the maximum hands-off control is a bang-off-bang… ▽ More

    Submitted 7 November, 2016; originally announced November 2016.

    Comments: 8 pages, 18 figures, 3 tables

  17. arXiv:1606.05889  [pdf, ps, other

    stat.ML

    Tight Performance Bounds for Compressed Sensing With Conventional and Group Sparsity

    Authors: Shashank Ranjan, Mathukumalli Vidyasagar

    Abstract: In this paper, we study the problem of recovering a group sparse vector from a small number of linear measurements. In the past the common approach has been to use various "group sparsity-inducing" norms such as the Group LASSO norm for this purpose. By using the theory of convex relaxations, we show that it is also possible to use $\ell_1$-norm minimization for group sparse recovery. We introduce… ▽ More

    Submitted 28 July, 2018; v1 submitted 19 June, 2016; originally announced June 2016.

    Comments: 26 pages, one table, no figures. Revised version of a paper

  18. arXiv:1512.08673  [pdf, ps, other

    stat.ML

    Error Bounds for Compressed Sensing Algorithms With Group Sparsity: A Unified Approach

    Authors: M. Eren Ahsen, M. Vidyasagar

    Abstract: In compressed sensing, in order to recover a sparse or nearly sparse vector from possibly noisy measurements, the most popular approach is $\ell_1$-norm minimization. Upper bounds for the $\ell_2$- norm of the error between the true and estimated vectors are given in [1] and reviewed in [2], while bounds for the $\ell_1$-norm are given in [3]. When the unknown vector is not conventionally sparse b… ▽ More

    Submitted 29 December, 2015; originally announced December 2015.

    Comments: 28 pages, final version of 1401.6623, accepted for publication. arXiv admin note: substantial text overlap with arXiv:1401.6623

    MSC Class: 62J99

  19. arXiv:1410.8229  [pdf, ps, other

    stat.ML

    Two New Approaches to Compressed Sensing Exhibiting Both Robust Sparse Recovery and the Grou** Effect

    Authors: Mehmet Eren Ahsen, Niharika Challapalli, Mathukumalli Vidyasagar

    Abstract: In this paper we introduce a new optimization formulation for sparse regression and compressed sensing, called CLOT (Combined L-One and Two), wherein the regularizer is a convex combination of the $\ell_1$- and $\ell_2$-norms. This formulation differs from the Elastic Net (EN) formulation, in which the regularizer is a convex combination of the $\ell_1$- and $\ell_2$-norm squared. It is shown that… ▽ More

    Submitted 20 June, 2017; v1 submitted 29 October, 2014; originally announced October 2014.

    Comments: 22 pages, 3 figures, to appear in the Journal of Machine Learning Research

    MSC Class: 90C25

  20. arXiv:1402.5728  [pdf, other

    q-bio.QM cs.LG stat.ML

    Machine Learning Methods in the Computational Biology of Cancer

    Authors: Mathukumalli Vidyasagar

    Abstract: The objectives of this "perspective" paper are to review some recent advances in sparse feature selection for regression and classification, as well as compressed sensing, and to discuss how these might be used to develop tools to advance personalized cancer therapy. As an illustration of the possibilities, a new algorithm for sparse regression is presented, and is applied to predict the time to t… ▽ More

    Submitted 24 February, 2014; originally announced February 2014.

    Comments: 35 pages, three figures

    MSC Class: 62P10

  21. arXiv:1401.6623  [pdf, ps, other

    stat.ML

    Near-Ideal Behavior of Compressed Sensing Algorithms

    Authors: Mehmet Eren Ahsen, Mathukumalli Vidyasagar

    Abstract: In a recent paper, it is shown that the LASSO algorithm exhibits "near-ideal behavior," in the following sense: Suppose $y = Az + η$ where $A$ satisfies the restricted isometry property (RIP) with a sufficiently small constant, and $\Vert η\Vert_2 \leq ε$. Then minimizing $\Vert z \Vert_1$ subject to $\Vert y - Az \Vert_2 \leq ε$ leads to an estimate $\hat{x}$ whose error… ▽ More

    Submitted 20 April, 2014; v1 submitted 26 January, 2014; originally announced January 2014.

    Comments: 31 pages

    MSC Class: 62J07

  22. arXiv:1309.3663  [pdf, ps, other

    math.ST math.PR

    An Elementary Derivation of the Large Deviation Rate Function for Finite State Markov Chains

    Authors: Mathukumalli Vidyasagar

    Abstract: Large deviation theory is a branch of probability theory that is devoted to a study of the "rate" at which empirical estimates of various quantities converge to their true values. The object of study in this paper is the rate at which estimates of the doublet frequencies of a Markov chain over a finite alphabet converge to their true values. In case the Markov process is actually an i.i.d.\ proces… ▽ More

    Submitted 14 September, 2013; originally announced September 2013.

    Comments: 34 pages, no figures

  23. arXiv:1208.4066  [pdf, other

    q-bio.GN stat.AP

    Reverse Engineering Gene Interaction Networks Using the Phi-Mixing Coefficient

    Authors: Nitin Kumar Singh, M. Eren Ahsen, Shiva Mankala, Hyun-Seok Kim, Michael A. White, M. Vidyasagar

    Abstract: Constructing gene interaction networks (GINs) from high-throughput gene expression data is an important and challenging problem in systems biology. Existing algorithms produce networks that either have undirected and unweighted edges, or else are constrained to contain no cycles, both of which are biologically unrealistic. In the present paper we propose a new algorithm, based on a concept from pr… ▽ More

    Submitted 12 March, 2016; v1 submitted 20 August, 2012; originally announced August 2012.

    Comments: 19 pages, 6 figures

    MSC Class: 62P10; 92B15

  24. arXiv:1208.1720  [pdf, ps, other

    stat.CO q-bio.QM

    Mixing Coefficients Between Discrete and Real Random Variables: Computation and Properties

    Authors: Mehmet Eren Ahsen, Mathukumalli Vidyasagar

    Abstract: In this paper we study the problem of estimating the alpha-, beta- and phi-mixing coefficients between two random variables, that can either assume values in a finite set or the set of real numbers. In either case, explicit closed-form formulas for the beta-mixing coefficient are already known. Therefore for random variables assuming values in a finite set, our contributions are two-fold: (i) In t… ▽ More

    Submitted 3 July, 2013; v1 submitted 8 August, 2012; originally announced August 2012.

    Comments: 36 pages. Accepted for publication in IEEE Transactions on Automatic Control

    MSC Class: 93E25

  25. arXiv:1104.4521  [pdf, ps, other

    eess.SY cs.IT math.OC

    A Metric Between Probability Distributions on Finite Sets of Different Cardinalities and Applications to Order Reduction

    Authors: Mathukumalli Vidyasagar

    Abstract: With increasing use of digital control it is natural to view control inputs and outputs as stochastic processes assuming values over finite alphabets rather than in a Euclidean space. As control over networks becomes increasingly common, data compression by reducing the size of the input and output alphabets without losing the fidelity of representation becomes relevant. This requires us to define… ▽ More

    Submitted 6 September, 2011; v1 submitted 22 April, 2011; originally announced April 2011.

    Comments: 32 pages, no figures

    MSC Class: 93E99

  26. arXiv:math/0112208  [pdf, ps, other

    math.OC math.AG

    An Improved Bound on the VC-Dimension of Neural Networks with Polynomial Activation Functions

    Authors: J. Maurice Rojas, M. Vidyasagar

    Abstract: In this note, we derive an improved upper bound for the VC-dimension of neural networks with polynomial activation functions. This improved bound is based on a result of Rojas on the number of connected components of a semi-algebraic set.

    Submitted 1 February, 2002; v1 submitted 19 December, 2001; originally announced December 2001.

    Comments: 9 pages, submitted for publication. Various typos fixed and the proof of the main result has been streamlined