Skip to main content

Showing 1–6 of 6 results for author: Tang, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2208.02711  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Agnostic Learning of General ReLU Activation Using Gradient Descent

    Authors: Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

    Abstract: We provide a convergence analysis of gradient descent for the problem of agnostically learning a single ReLU function under Gaussian distributions. Unlike prior work that studies the setting of zero bias, we consider the more challenging scenario when the bias of the ReLU function is non-zero. Our main result establishes that starting from random initialization, in a polynomial number of iteration… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 28 oages

  2. arXiv:2107.10209  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Efficient Algorithms for Learning Depth-2 Neural Networks with General ReLU Activations

    Authors: Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

    Abstract: We present polynomial time and sample efficient algorithms for learning an unknown depth-2 feedforward neural network with general ReLU activations, under mild non-degeneracy assumptions. In particular, we consider learning an unknown network of the form $f(x) = {a}^{\mathsf{T}}σ({W}^\mathsf{T}x+b)$, where $x$ is drawn from the Gaussian distribution, and $σ(t) := \max(t,0)$ is the ReLU activation.… ▽ More

    Submitted 1 August, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: 45 pages (including appendix). This version fixes an error in the previous version of the paper

  3. arXiv:1811.01316  [pdf, other

    cs.LG stat.ML

    Nonlinear Collaborative Scheme for Deep Neural Networks

    Authors: Hui-Ling Zhen, Xi Lin, Alan Z. Tang, Zhenhua Li, Qingfu Zhang, Sam Kwong

    Abstract: Conventional research attributes the improvements of generalization ability of deep neural networks either to powerful optimizers or the new network design. Different from them, in this paper, we aim to link the generalization ability of a deep network to optimizing a new objective function. To this end, we propose a \textit{nonlinear collaborative scheme} for deep network training, with the key t… ▽ More

    Submitted 3 November, 2018; originally announced November 2018.

    Comments: 11 pages, 3 figures (20 subfigures), prepared to submit to IEEE Trans. on Neural Networks and Learning Systems

  4. arXiv:1409.0391  [pdf, ps, other

    stat.ME

    Estimating Linear Mixed-effects State Space Model Based on Disturbance Smoothing

    Authors: Jie Zhou, Ai** Tang

    Abstract: We extend the linear mixed-effects state model to accommodate the correlated individuals and investigate its parameter and state estimation based on disturbance smoothing in this paper. For parameter estimation, EM and score based algorithms are considered. Intermediate quantity of EM algorithm is investigated firstly from which the explicit recursive formulas for the maximizer of the intermediate… ▽ More

    Submitted 2 September, 2014; v1 submitted 1 September, 2014; originally announced September 2014.

    Comments: 27 pages, 2 figures

  5. Percolation under Noise: Detecting Explosive Percolation Using the Second Largest Component

    Authors: Wes Viles, Cedric E. Ginestet, Ariana Tang, Mark A. Kramer, Eric D. Kolaczyk

    Abstract: We consider the problem of distinguishing classical (Erdős-Rényi) percolation from explosive (Achlioptas) percolation, under noise. A statistical model of percolation is constructed allowing for the birth and death of edges as well as the presence of noise in the observations. This graph-valued stochastic process is composed of a latent and an observed non-stationary process, where the observed gr… ▽ More

    Submitted 15 January, 2014; originally announced January 2014.

    Comments: 9 pages and 8 figures. Submitted to Physics Review, Series E

    Journal ref: Phys. Rev. E 93, 052301 (2016)

  6. Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret

    Authors: Animashree Anandkumar, Nithin Michael, Ao Kevin Tang, Ananthram Swami

    Abstract: The problem of distributed learning and channel access is considered in a cognitive network with multiple secondary users. The availability statistics of the channels are initially unknown to the secondary users and are estimated using sensing decisions. There is no explicit information exchange or prior agreement among the secondary users. We propose policies for distributed learning and access w… ▽ More

    Submitted 8 June, 2010; originally announced June 2010.

    Comments: Submitted to IEEE JSAC on Advances in Cognitive Radio Networking and Communications, Dec. 2009, Revised May 2010