Showing 1–4 of 4 results for author: Tommasone, E

  1. arXiv:2407.07565 [pdf, other]

    cs.CL

    On Leakage of Code Generation Evaluation Datasets

    Authors: Alexandre Matton, Tom Sherborne, Dennis Aumiller, Elena Tommasone, Milad Alizadeh, Jingyi He, Raymond Ma, Maxime Voisin, Ellen Gilsenan-McMahon, Matthias Gallé

    Abstract: In this paper we consider contamination by code generation test sets, in particular in their use in modern large language models. We discuss three possible sources of such contamination and show findings supporting each of them: (i) direct data leakage, (ii) indirect data leakage through the use of synthetic data and (iii) overfitting to evaluation sets during model selection. Key to our findings…

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 4 main pages, 9 in total

  2. arXiv:2110.08554 [pdf, other]

    cs.CL

    PAGnol: An Extra-Large French Generative Model

    Authors: Julien Launay, Elena Tommasone, Baptiste Pannier, François Boniface, Amélie Chatelain, Alessandro Cappelli, Iacopo Poli, Djamé Seddah

    Abstract: Access to large pre-trained models of varied architectures, in many different languages, is central to the democratization of NLP. We introduce PAGnol, a collection of French GPT models. Using scaling laws, we efficiently train PAGnol-XL (1.5B parameters) with the same computational budget as CamemBERT, a model 13 times smaller. PAGnol-XL is the largest model trained to date for the French langu…

    Submitted 16 October, 2021; originally announced October 2021.

  3. arXiv:2107.11814 [pdf, other]

    cs.AR cs.ET

    LightOn Optical Processing Unit: Scaling-up AI and HPC with a Non von Neumann co-processor

    Authors: Charles Brossollet, Alessandro Cappelli, Igor Carron, Charidimos Chaintoutis, Amélie Chatelain, Laurent Daudet, Sylvain Gigan, Daniel Hesslow, Florent Krzakala, Julien Launay, Safa Mokaadi, Fabien Moreau, Kilian Müller, Ruben Ohana, Gustave Pariente, Iacopo Poli, Elena Tommasone

    Abstract: We introduce LightOn's Optical Processing Unit (OPU), the first photonic AI accelerator chip available on the market for at-scale Non von Neumann computations, reaching 1500 TeraOPS. It relies on a combination of free-space optics with off-the-shelf components, together with a software API allowing a seamless integration within Python-based processing pipelines. We discuss a variety of use cases…

    Submitted 25 July, 2021; originally announced July 2021.

    Comments: Proceedings IEEE Hot Chips 33, 2021

  4. arXiv:2006.08697 [pdf, other]

    physics.comp-ph cs.LG

    Online Change Point Detection in Molecular Dynamics With Optical Random Features

    Authors: Amélie Chatelain, Elena Tommasone, Laurent Daudet, Iacopo Poli

    Abstract: Proteins are made of atoms constantly fluctuating, but can occasionally undergo large-scale changes. Such transitions are of biological interest, linking the structure of a protein to its function within a cell. Atomic-level simulations, such as Molecular Dynamics (MD), are used to study these events. However, molecular dynamics simulations produce time series with multiple observables, while chan…

    Submitted 17 June, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 15 pages, 12 figures