
Showing 1–8 of 8 results for author: Beaglehole, D

Searching in archive cs.
  1. arXiv:2406.15714 [pdf, other]

    cs.GT

    Fast, optimal, and dynamic electoral campaign budgeting by a generalized Colonel Blotto game

    Authors: Thomas Valles, Daniel Beaglehole

    Abstract: The Colonel Blotto game is a deeply studied theoretical model for competitive allocation environments including elections, advertising, and ecology. However, the original formulation of Colonel Blotto has had few practical implications due to the lack of fast algorithms to compute its optimal strategies and the limited applicability of its winner-take-all reward distribution. We demonstrate that t…

    Submitted 25 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2402.13728 [pdf, other]

    cs.LG stat.ML

    Average gradient outer product as a mechanism for deep neural collapse

    Authors: Daniel Beaglehole, Peter Súkeník, Marco Mondelli, Mikhail Belkin

    Abstract: Deep Neural Collapse (DNC) refers to the surprisingly rigid structure of the data representations in the final layers of Deep Neural Networks (DNNs). Though the phenomenon has been measured in a variety of settings, its emergence is typically explained via data-agnostic approaches, such as the unconstrained features model. In this work, we introduce a data-dependent setting where DNC forms due to…

    Submitted 23 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  3. arXiv:2402.05271 [pdf, other]

    stat.ML cs.AI cs.LG

    Feature learning as alignment: a structural property of gradient descent in non-linear neural networks

    Authors: Daniel Beaglehole, Ioannis Mitliagkas, Atish Agarwala

    Abstract: Understanding the mechanisms through which neural networks extract statistics from input-label pairs through feature learning is one of the most important unsolved problems in supervised learning. Prior works demonstrated that the gram matrices of the weights (the neural feature matrices, NFM) and the average gradient outer products (AGOP) become correlated during training, in a statement known as…

    Submitted 24 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2309.00570 [pdf, other]

    stat.ML cs.CV cs.LG

    Mechanism of feature learning in convolutional neural networks

    Authors: Daniel Beaglehole, Adityanarayanan Radhakrishnan, Parthe Pandit, Mikhail Belkin

    Abstract: Understanding the mechanism of how convolutional neural networks learn features from image data is a fundamental problem in machine learning and computer vision. In this work, we identify such a mechanism. We posit the Convolutional Neural Feature Ansatz, which states that covariances of filters in any convolutional layer are proportional to the average gradient outer product (AGOP) taken with res…

    Submitted 1 September, 2023; originally announced September 2023.

  5. arXiv:2212.13881 [pdf, other]

    cs.LG cs.AI stat.ML

    Mechanism of feature learning in deep fully connected networks and kernel machines that recursively learn features

    Authors: Adityanarayanan Radhakrishnan, Daniel Beaglehole, Parthe Pandit, Mikhail Belkin

    Abstract: In recent years neural networks have achieved impressive results on many technological and scientific tasks. Yet, the mechanism through which these models automatically select features, or patterns in data, for prediction remains unclear. Identifying such a mechanism is key to advancing performance and interpretability of neural networks and promoting reliable adoption of these models in scientifi…

    Submitted 9 May, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

  6. arXiv:2205.13525 [pdf, other]

    cs.LG

    On the Inconsistency of Kernel Ridgeless Regression in Fixed Dimensions

    Authors: Daniel Beaglehole, Mikhail Belkin, Parthe Pandit

    Abstract: "Benign overfitting", the ability of certain algorithms to interpolate noisy training data and yet perform well out-of-sample, has been a topic of considerable recent interest. We show, using a fixed design setup, that an important class of predictors, kernel machines with translation-invariant kernels, does not exhibit benign overfitting in fixed dimensions. In particular, the estimated predict…

    Submitted 12 April, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

  7. arXiv:2201.10758 [pdf, ps, other]

    cs.GT cs.DM cs.DS cs.MA

    Sampling Equilibria: Fast No-Regret Learning in Structured Games

    Authors: Daniel Beaglehole, Max Hopkins, Daniel Kane, Sihan Liu, Shachar Lovett

    Abstract: Learning and equilibrium computation in games are fundamental problems across computer science and economics, with applications ranging from politics to machine learning. Much of the work in this area revolves around a simple algorithm termed randomized weighted majority (RWM), also known as "Hedge" or "Multiplicative Weights Update," which is well known to achieve statistically optimal rat…

    Submitted 15 July, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

  8. arXiv:2108.05433 [pdf, other]

    cs.DS cs.LG

    Learning to Hash Robustly, Guaranteed

    Authors: Alexandr Andoni, Daniel Beaglehole

    Abstract: The indexing algorithms for high-dimensional nearest neighbor search (NNS) with the best worst-case guarantees are based on randomized Locality Sensitive Hashing (LSH) and its derivatives. In practice, many heuristic approaches exist to "learn" the best indexing method in order to speed up NNS, crucially adapting to the structure of the given dataset. Oftentimes, these heuristics outper…

    Submitted 7 July, 2022; v1 submitted 11 August, 2021; originally announced August 2021.