Skip to main content

Showing 1–3 of 3 results for author: Guiroy, S

.
  1. arXiv:2208.02377  [pdf, other

    cs.LG cs.AI stat.ML

    Improving Meta-Learning Generalization with Activation-Based Early-Stop**

    Authors: Simon Guiroy, Christopher Pal, Gonçalo Mordido, Sarath Chandar

    Abstract: Meta-Learning algorithms for few-shot learning aim to train neural networks capable of generalizing to novel tasks using only a few examples. Early-stop** is critical for performance, halting model training when it reaches optimal generalization to the new task distribution. Early-stop** mechanisms in Meta-Learning typically rely on measuring the model performance on labeled examples from a me… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted at CoLLAs 2022. To be published in Proceedings of Machine Learning Research (PMLR)

  2. arXiv:2110.06990  [pdf, other

    cs.LG cs.AI cs.CV

    Scaling Laws for the Few-Shot Adaptation of Pre-trained Image Classifiers

    Authors: Gabriele Prato, Simon Guiroy, Ethan Caballero, Irina Rish, Sarath Chandar

    Abstract: Empirical science of neural scaling laws is a rapidly growing area of significant importance to the future of machine learning, particularly in the light of recent breakthroughs achieved by large-scale pre-trained models such as GPT-3, CLIP and DALL-e. Accurately predicting the neural network performance with increasing resources such as data, compute and model size provides a more comprehensive e… ▽ More

    Submitted 18 October, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

  3. arXiv:1907.07287  [pdf, other

    cs.LG cs.CV stat.ML

    Towards Understanding Generalization in Gradient-Based Meta-Learning

    Authors: Simon Guiroy, Vikas Verma, Christopher Pal

    Abstract: In this work we study generalization of neural networks in gradient-based meta-learning by analyzing various properties of the objective landscapes. We experimentally demonstrate that as meta-training progresses, the meta-test solutions, obtained after adapting the meta-train solution of the model, to new tasks via few steps of gradient-based fine-tuning, become flatter, lower in loss, and further… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.