Skip to main content

Showing 1–3 of 3 results for author: de Laroussilhe, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2208.12754  [pdf, other

    cs.LG

    Task Selection for AutoML System Evaluation

    Authors: Jonathan Lorraine, Nihesh Anderson, Chansoo Lee, Quentin De Laroussilhe, Mehadi Hassen

    Abstract: Our goal is to assess if AutoML system changes - i.e., to the search space or hyperparameter optimization - will improve the final model's performance on production tasks. However, we cannot test the changes on production tasks. Instead, we only have access to limited descriptors about tasks that our AutoML system previously executed, like the number of data points or features. We also have a set… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

  2. arXiv:1902.00751  [pdf, other

    cs.LG cs.CL stat.ML

    Parameter-Efficient Transfer Learning for NLP

    Authors: Neil Houlsby, Andrei Giurgiu, Stanislaw Jastrzebski, Bruna Morrone, Quentin de Laroussilhe, Andrea Gesmundo, Mona Attariyan, Sylvain Gelly

    Abstract: Fine-tuning large pre-trained models is an effective transfer mechanism in NLP. However, in the presence of many downstream tasks, fine-tuning is parameter inefficient: an entire new model is required for every task. As an alternative, we propose transfer with adapter modules. Adapter modules yield a compact and extensible model; they add only a few trainable parameters per task, and new tasks can… ▽ More

    Submitted 13 June, 2019; v1 submitted 2 February, 2019; originally announced February 2019.

  3. arXiv:1812.10666  [pdf, other

    cs.LG stat.ML

    Neural Architecture Search Over a Graph Search Space

    Authors: Stanisław Jastrzębski, Quentin de Laroussilhe, Mingxing Tan, Xiao Ma, Neil Houlsby, Andrea Gesmundo

    Abstract: Neural Architecture Search (NAS) enabled the discovery of state-of-the-art architectures in many domains. However, the success of NAS depends on the definition of the search space. Current search spaces are defined as a static sequence of decisions and a set of available actions for each decision. Each possible sequence of actions defines an architecture. We propose a more expressive class of sear… ▽ More

    Submitted 31 July, 2019; v1 submitted 27 December, 2018; originally announced December 2018.