Skip to main content

Showing 1–6 of 6 results for author: Lozano, A C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2007.07484  [pdf, other

    cs.LG math.OC stat.ML

    A General Family of Stochastic Proximal Gradient Methods for Deep Learning

    Authors: Jihun Yun, Aurelie C. Lozano, Eunho Yang

    Abstract: We study the training of regularized neural networks where the regularizer can be non-smooth and non-convex. We propose a unified framework for stochastic proximal gradient descent, which we term ProxGen, that allows for arbitrary positive preconditioners and lower semi-continuous regularizers. Our framework encompasses standard stochastic proximal gradient methods without preconditioners as speci… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 21 pages

  2. arXiv:2007.00884   

    cs.LG stat.ML

    A Revision of Neural Tangent Kernel-based Approaches for Neural Networks

    Authors: Kyung-Su Kim, Aurélie C. Lozano, Eunho Yang

    Abstract: Recent theoretical works based on the neural tangent kernel (NTK) have shed light on the optimization and generalization of over-parameterized networks, and partially bridge the gap between their practical success and classical learning theory. Especially, using the NTK-based approach, the following three representative results were obtained: (1) A training error bound was derived to show that net… ▽ More

    Submitted 6 August, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: We spotted an error in the proof of Lemma A.4 and are investigating whether this can be corrected. Furthermore, the authors of the original paper have informed us that they are fixing the lemma upon which our theorem 3.2 builds. Therefore, we are removing the current version of our paper

  3. arXiv:1905.10757  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic Gradient Methods with Block Diagonal Matrix Adaptation

    Authors: Jihun Yun, Aurelie C. Lozano, Eunho Yang

    Abstract: Adaptive gradient approaches that automatically adjust the learning rate on a per-feature basis have been very popular for training deep networks. This rich class of algorithms includes Adagrad, RMSprop, Adam, and recent extensions. All these algorithms have adopted diagonal matrix adaptation, due to the prohibitive computational burden of manipulating full matrices in high-dimensions. In this pap… ▽ More

    Submitted 26 May, 2019; originally announced May 2019.

    Comments: 31 pages

  4. arXiv:1604.03915  [pdf, other

    cs.CV cs.LG

    Removing Clouds and Recovering Ground Observations in Satellite Image Sequences via Temporally Contiguous Robust Matrix Completion

    Authors: Jialei Wang, Peder A. Olsen, Andrew R. Conn, Aurelie C. Lozano

    Abstract: We consider the problem of removing and replacing clouds in satellite image sequences, which has a wide range of applications in remote sensing. Our approach first detects and removes the cloud-contaminated part of the image sequences. It then recovers the missing scenes from the clean parts using the proposed "TECROMAC" (TEmporally Contiguous RObust MAtrix Completion) objective. The objective fun… ▽ More

    Submitted 13 April, 2016; originally announced April 2016.

    Comments: To Appear In Conference on Computer Vision and Pattern Recognition (CVPR 2016)

  5. arXiv:1402.4624  [pdf, ps, other

    stat.ML cs.DS math.OC stat.ME

    Sparse Quantile Huber Regression for Efficient and Robust Estimation

    Authors: Aleksandr Y. Aravkin, Anju Kambadur, Aurelie C. Lozano, Ronny Luss

    Abstract: We consider new formulations and methods for sparse quantile regression in the high-dimensional setting. Quantile regression plays an important role in many applications, including outlier-robust exploratory analysis in gene selection. In addition, the sparsity consideration in quantile regression enables the exploration of the entire conditional distribution of the response variable given the pre… ▽ More

    Submitted 19 February, 2014; originally announced February 2014.

    Comments: 9 pages

    MSC Class: 62F35; 65K10

  6. arXiv:1210.4792  [pdf, ps, other

    stat.ML cs.LG

    Scalable Matrix-valued Kernel Learning for High-dimensional Nonlinear Multivariate Regression and Granger Causality

    Authors: Vikas Sindhwani, Minh Ha Quang, Aurelie C. Lozano

    Abstract: We propose a general matrix-valued multiple kernel learning framework for high-dimensional nonlinear multivariate regression problems. This framework allows a broad class of mixed norm regularizers, including those that induce sparsity, to be imposed on a dictionary of vector-valued Reproducing Kernel Hilbert Spaces. We develop a highly scalable and eigendecomposition-free algorithm that orchestra… ▽ More

    Submitted 7 March, 2013; v1 submitted 17 October, 2012; originally announced October 2012.

    Comments: 22 pages. Presentation changes; Corrections made to Theorem 2 (section 6.2) in this version