Skip to main content

Showing 1–11 of 11 results for author: Dexter, G

.
  1. arXiv:2403.12278  [pdf, other

    cs.LG math.NA

    Stochastic Rounding Implicitly Regularizes Tall-and-Thin Matrices

    Authors: Gregory Dexter, Christos Boutsikas, Linkai Ma, Ilse C. F. Ipsen, Petros Drineas

    Abstract: Motivated by the popularity of stochastic rounding in the context of machine learning and the training of large-scale deep neural network models, we consider stochastic nearness rounding of real matrices $\mathbf{A}$ with many more rows than columns. We provide novel theoretical evidence, supported by extensive experimental evaluation that, with high probability, the smallest singular value of a s… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    MSC Class: 68W20; 65F15; 65F22; 65G50; 15A18; 15A42

  2. arXiv:2401.12332  [pdf, other

    cs.LG math.OC

    A Precise Characterization of SGD Stability Using Loss Surface Geometry

    Authors: Gregory Dexter, Borja Ocejo, Sathiya Keerthi, Aman Gupta, Ayan Acharya, Rajiv Khanna

    Abstract: Stochastic Gradient Descent (SGD) stands as a cornerstone optimization algorithm with proven real-world empirical successes but relatively limited theoretical understanding. Recent research has illuminated a key factor contributing to its practical efficacy: the implicit regularization it instigates. Several studies have investigated the linear stability property of SGD in the vicinity of a statio… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: To appear at ICLR 2024

  3. arXiv:2310.19068  [pdf, ps, other

    cs.DS cs.LG

    Sketching Algorithms for Sparse Dictionary Learning: PTAS and Turnstile Streaming

    Authors: Gregory Dexter, Petros Drineas, David P. Woodruff, Taisuke Yasuda

    Abstract: Sketching algorithms have recently proven to be a powerful approach both for designing low-space streaming algorithms as well as fast polynomial time approximation schemes (PTAS). In this work, we develop new techniques to extend the applicability of sketching-based approaches to the sparse dictionary learning and the Euclidean $k$-means clustering problems. In particular, we initiate the study of… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: To appear in NeurIPS 2023

  4. arXiv:2305.05826  [pdf, ps, other

    cs.DS math.NA

    Universal Matrix Sparsifiers and Fast Deterministic Algorithms for Linear Algebra

    Authors: Rajarshi Bhattacharjee, Gregory Dexter, Cameron Musco, Archan Ray, Sushant Sachdeva, David P Woodruff

    Abstract: Let $\mathbf S \in \mathbb R^{n \times n}$ satisfy $\|\mathbf 1-\mathbf S\|_2\leεn$, where $\mathbf 1$ is the all ones matrix and $\|\cdot\|_2$ is the spectral norm. It is well-known that there exists such an $\mathbf S$ with just $O(n/ε^2)$ non-zero entries: we can let $\mathbf S$ be the scaled adjacency matrix of a Ramanujan expander graph. We show that such an $\mathbf S$ yields a $universal$… ▽ More

    Submitted 12 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 41 pages

    ACM Class: F.2.1; G.1.3; G.1.2; G.4; I.1.2

  5. arXiv:2303.14284  [pdf, ps, other

    cs.LG stat.ML

    Feature Space Sketching for Logistic Regression

    Authors: Gregory Dexter, Rajiv Khanna, Jawad Raheel, Petros Drineas

    Abstract: We present novel bounds for coreset construction, feature selection, and dimensionality reduction for logistic regression. All three approaches can be thought of as sketching the logistic regression inputs. On the coreset construction front, we resolve open problems from prior work and present novel bounds for the complexity of coreset construction methods. On the feature selection and dimensional… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  6. arXiv:2302.09693  [pdf, other

    stat.ML cs.LG

    mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

    Authors: Kayhan Behdin, Qingquan Song, Aman Gupta, Sathiya Keerthi, Ayan Acharya, Borja Ocejo, Gregory Dexter, Rajiv Khanna, David Durfee, Rahul Mazumder

    Abstract: Modern deep learning models are over-parameterized, where different optima can result in widely varying generalization performance. The Sharpness-Aware Minimization (SAM) technique modifies the fundamental loss function that steers gradient descent methods toward flatter minima, which are believed to exhibit enhanced generalization prowess. Our study delves into a specific variant of SAM known as… ▽ More

    Submitted 30 September, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2212.04343

  7. arXiv:2209.08722  [pdf, other

    cs.DS

    Faster Randomized Interior Point Methods for Tall/Wide Linear Programs

    Authors: Agniva Chowdhury, Gregory Dexter, Palma London, Haim Avron, Petros Drineas

    Abstract: Linear programming (LP) is an extremely useful tool which has been successfully applied to solve various problems in a wide range of areas, including operations research, engineering, economics, or even more abstract mathematical areas such as combinatorics. It is also used in many machine learning applications, such as $\ell_1$-regularized SVMs, basis pursuit, nonnegative matrix factorization, et… ▽ More

    Submitted 23 September, 2022; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: Extended version of the NeurIPS 2020 submission. arXiv admin note: substantial text overlap with arXiv:2003.08072

  8. arXiv:2202.01756  [pdf, other

    math.OC

    On the Convergence of Inexact Predictor-Corrector Methods for Linear Programming

    Authors: Gregory Dexter, Agniva Chowdhury, Haim Avron, Petros Drineas

    Abstract: Interior point methods (IPMs) are a common approach for solving linear programs (LPs) with strong theoretical guarantees and solid empirical performance. The time complexity of these methods is dominated by the cost of solving a linear system of equations at each iteration. In common applications of linear programming, particularly in machine learning and scientific computing, the size of this lin… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: 39 pages total. 12 pages main text. 4 figures

  9. arXiv:2109.07647  [pdf, other

    cs.DS math.NA

    Sublinear Time Eigenvalue Approximation via Random Sampling

    Authors: Rajarshi Bhattacharjee, Gregory Dexter, Petros Drineas, Cameron Musco, Archan Ray

    Abstract: We study the problem of approximating the eigenspectrum of a symmetric matrix $\mathbf A \in \mathbb{R}^{n \times n}$ with bounded entries (i.e., $\|\mathbf A\|_{\infty} \leq 1$). We present a simple sublinear time algorithm that approximates all eigenvalues of $\mathbf{A}$ up to additive error $\pm εn$ using those of a randomly sampled… ▽ More

    Submitted 21 July, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 58 pages, 4 figures

    MSC Class: F.2.1; G.1.3; G.1.2; G.4; I.1.2

  10. arXiv:2102.07937  [pdf, other

    cs.LG stat.ML

    Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees

    Authors: Gregory Dexter, Kevin Bello, Jean Honorio

    Abstract: Inverse Reinforcement Learning (IRL) is the problem of finding a reward function which describes observed/known expert behavior. The IRL setting is remarkably useful for automated control, in situations where the reward function is difficult to specify manually or as a means to extract agent preference. In this work, we provide a new IRL algorithm for the continuous state space setting with unknow… ▽ More

    Submitted 21 October, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Journal ref: Neural Information Processing Systems (NeurIPS), 2021

  11. arXiv:1801.01072  [pdf, other

    cs.IT

    Randomized Linear Algebra Approaches to Estimate the Von Neumann Entropy of Density Matrices

    Authors: Eugenia-Maria Kontopoulou, Gregory-Paul Dexter, Wojciech Szpankowski, Ananth Grama, Petros Drineas

    Abstract: Thevon Neumann entropy, named after John von Neumann, is an extension of the classical concept of entropy to the field of quantum mechanics. From a numerical perspective, von Neumann entropy can be computed simply by computing all eigenvalues of a density matrix, an operation that could be prohibitively expensive for large-scale density matrices. We present and analyze three randomized algorithms… ▽ More

    Submitted 3 February, 2020; v1 submitted 3 January, 2018; originally announced January 2018.