Showing 1–4 of 4 results for author: Chatelain, A

Searching in archive cs.
  1. arXiv:2110.08554  [pdf, other]

    cs.CL

    PAGnol: An Extra-Large French Generative Model

    Authors: Julien Launay, Elena Tommasone, Baptiste Pannier, François Boniface, Amélie Chatelain, Alessandro Cappelli, Iacopo Poli, Djamé Seddah

    Abstract: Access to large pre-trained models of varied architectures, in many different languages, is central to the democratization of NLP. We introduce PAGnol, a collection of French GPT models. Using scaling laws, we efficiently train PAGnol-XL (1.5B parameters) with the same computational budget as CamemBERT, a model 13 times smaller. PAGnol-XL is the largest model trained to date for the French langu…

    Submitted 16 October, 2021; originally announced October 2021.

  2. arXiv:2109.11928  [pdf, other]

    stat.ML cs.LG

    Is the Number of Trainable Parameters All That Actually Matters?

    Authors: Amélie Chatelain, Amine Djeghri, Daniel Hesslow, Julien Launay, Iacopo Poli

    Abstract: Recent work has identified simple empirical scaling laws for language models, linking compute budget, dataset size, model size, and autoregressive modeling loss. The validity of these simple power laws across orders of magnitude in model scale provides compelling evidence that larger models are also more capable models. However, scaling up models under the constraints of hardware and infrastructur…

    Submitted 24 September, 2021; originally announced September 2021.

  3. arXiv:2107.11814  [pdf, other]

    cs.AR cs.ET

    LightOn Optical Processing Unit: Scaling-up AI and HPC with a Non von Neumann co-processor

    Authors: Charles Brossollet, Alessandro Cappelli, Igor Carron, Charidimos Chaintoutis, Amélie Chatelain, Laurent Daudet, Sylvain Gigan, Daniel Hesslow, Florent Krzakala, Julien Launay, Safa Mokaadi, Fabien Moreau, Kilian Müller, Ruben Ohana, Gustave Pariente, Iacopo Poli, Elena Tommasone

    Abstract: We introduce LightOn's Optical Processing Unit (OPU), the first photonic AI accelerator chip available on the market for at-scale Non von Neumann computations, reaching 1500 TeraOPS. It relies on a combination of free-space optics with off-the-shelf components, together with a software API allowing a seamless integration within Python-based processing pipelines. We discuss a variety of use cases…

    Submitted 25 July, 2021; originally announced July 2021.

    Comments: Proceedings IEEE Hot Chips 33, 2021

  4. arXiv:2006.08697  [pdf, other]

    physics.comp-ph cs.LG

    Online Change Point Detection in Molecular Dynamics With Optical Random Features

    Authors: Amélie Chatelain, Elena Tommasone, Laurent Daudet, Iacopo Poli

    Abstract: Proteins are made of atoms constantly fluctuating, but can occasionally undergo large-scale changes. Such transitions are of biological interest, linking the structure of a protein to its function with a cell. Atomic-level simulations, such as Molecular Dynamics (MD), are used to study these events. However, molecular dynamics simulations produce time series with multiple observables, while chan…

    Submitted 17 June, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 15 pages, 12 figures