Showing 1–6 of 6 results for author: Gonzalvo, J

  1. arXiv:2405.18590

    stat.ML cs.LG

    A Margin-based Multiclass Generalization Bound via Geometric Complexity

    Authors: Michael Munn, Benoit Dherin, Javier Gonzalvo

    Abstract: There has been considerable effort to better understand the generalization capabilities of deep neural networks both as a means to unlock a theoretical understanding of their success as well as providing directions for further improvements. In this paper, we investigate margin-based multiclass generalization bounds for neural networks which rely on a recent complexity measure, the geometric comple…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted as an ICML 2023 workshop paper (Topology, Algebra and Geometry in Machine Learning)

    Journal ref: Proceedings of 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML), PMLR 221:189-205, 2023

  2. arXiv:2405.15706

    cs.LG

    The Impact of Geometric Complexity on Neural Collapse in Transfer Learning

    Authors: Michael Munn, Benoit Dherin, Javier Gonzalvo

    Abstract: Many of the recent remarkable advances in computer vision and language models can be attributed to the success of transfer learning via the pre-training of large foundation models. However, a theoretical framework which explains this empirical success is incomplete and remains an active area of research. Flatness of the loss surface and neural collapse have recently emerged as useful pre-training…

    Submitted 28 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:1906.01550

    stat.ML cs.LG

    Towards Task and Architecture-Independent Generalization Gap Predictors

    Authors: Scott Yak, Javier Gonzalvo, Hanna Mazzawi

    Abstract: Can we use deep learning to predict when deep learning works? Our results suggest the affirmative. We created a dataset by training 13,500 neural networks with different architectures, on different variations of spiral datasets, and using different optimization parameters. We used this dataset to train task-independent and architecture-independent generalization gap predictors for those neural net…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: 8 pages, 6 figures, 2 tables. To be presented at ICML 2019 "Understanding and Improving Generalization in Deep Learning" Workshop (poster)

  4. arXiv:1905.00080

    cs.LG stat.ML

    AdaNet: A Scalable and Flexible Framework for Automatically Learning Ensembles

    Authors: Charles Weill, Javier Gonzalvo, Vitaly Kuznetsov, Scott Yang, Scott Yak, Hanna Mazzawi, Eugen Hotaj, Ghassen Jerfel, Vladimir Macko, Ben Adlam, Mehryar Mohri, Corinna Cortes

    Abstract: AdaNet is a lightweight TensorFlow-based (Abadi et al., 2015) framework for automatically learning high-quality ensembles with minimal expert intervention. Our framework is inspired by the AdaNet algorithm (Cortes et al., 2017) which learns the structure of a neural network as an ensemble of subnetworks. We designed it to: (1) integrate with the existing TensorFlow ecosystem, (2) offer sensible de…

    Submitted 30 April, 2019; originally announced May 2019.

  5. arXiv:1903.06236

    cs.LG stat.ML

    Improving Neural Architecture Search Image Classifiers via Ensemble Learning

    Authors: Vladimir Macko, Charles Weill, Hanna Mazzawi, Javier Gonzalvo

    Abstract: Finding the best neural network architecture requires significant time, resources, and human expertise. These challenges are partially addressed by neural architecture search (NAS) which is able to find the best convolutional layer or cell that is then used as a building block for the network. However, once a good building block is found, manual design is still required to assemble the final archi…

    Submitted 14 March, 2019; originally announced March 2019.

  6. The Compact Linear Collider (CLIC) - 2018 Summary Report

    Authors: The CLIC and CLICdp collaborations: T. K. Charles, P. J. Giansiracusa, T. G. Lucas, R. P. Rassool, M. Volpi, C. Balazs, K. Afanaciev, V. Makarenko, A. Patapenka, I. Zhuk, C. Collette, M. J. Boland, A. C. Abusleme Hoffman, M. A. Diaz, F. Garay, Y. Chi, X. He, G. Pei, S. Pei, G. Shu, X. Wang, J. Zhang, et al. (671 additional authors not shown)

    Abstract: The Compact Linear Collider (CLIC) is a TeV-scale high-luminosity linear $e^+e^-$ collider under development at CERN. Following the CLIC conceptual design published in 2012, this report provides an overview of the CLIC project, its current status, and future developments. It presents the CLIC physics potential and reports on design, technology, and implementation aspects of the accelerator and the…

    Submitted 6 May, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: 112 pages, 59 figures; published as CERN Yellow Report Monograph Vol. 2/2018; corresponding editors: Philip N. Burrows, Nuria Catalan Lasheras, Lucie Linssen, Marko Petrič, Aidan Robson, Daniel Schulte, Eva Sicking, Steinar Stapnes

    Report number: CERN-2018-005-M