Skip to main content

Showing 1–10 of 10 results for author: Gehring, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.00725  [pdf, other

    cs.SE cs.AI cs.CL cs.LG

    The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

    Authors: Michael Hassid, Tal Remez, Jonas Gehring, Roy Schwartz, Yossi Adi

    Abstract: It is a common belief that large language models (LLMs) are better than smaller-sized ones. However, larger models also require significantly more time and compute during inference. This begs the question: what happens when both models operate under the same budget? (e.g., compute, run-time). To address this question, we analyze code generation LLMs of various sizes and make comparisons such as ru… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  2. arXiv:2309.07062  [pdf, other

    cs.PL cs.AI cs.CL cs.LG

    Large Language Models for Compiler Optimization

    Authors: Chris Cummins, Volker Seeker, Dejan Grubisic, Mostafa Elhoushi, Youwei Liang, Baptiste Roziere, Jonas Gehring, Fabian Gloeckle, Kim Hazelwood, Gabriel Synnaeve, Hugh Leather

    Abstract: We explore the novel application of Large Language Models to code optimization. We present a 7B-parameter transformer model trained from scratch to optimize LLVM assembly for code size. The model takes as input unoptimized assembly and outputs a list of compiler options to best optimize the program. Crucially, during training, we ask the model to predict the instruction counts before and after opt… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  3. arXiv:2308.12950  [pdf, other

    cs.CL

    Code Llama: Open Foundation Models for Code

    Authors: Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, **gyu Liu, Romain Sauvestre, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom , et al. (1 additional authors not shown)

    Abstract: We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following ability for programming tasks. We provide multiple flavors to cover a wide range of applications: foundation models (Code Llama), Python specializations (Code Llama… ▽ More

    Submitted 31 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

  4. arXiv:2210.14685  [pdf, other

    cs.LG cs.AI cs.RO

    Leveraging Demonstrations with Latent Space Priors

    Authors: Jonas Gehring, Deepak Gopinath, Jungdam Won, Andreas Krause, Gabriel Synnaeve, Nicolas Usunier

    Abstract: Demonstrations provide insight into relevant state or action space regions, bearing great potential to boost the efficiency and practicality of reinforcement learning agents. In this work, we propose to leverage demonstration datasets by combining skill learning and sequence modeling. Starting with a learned joint latent space, we separately train a generative model of demonstration sequences and… ▽ More

    Submitted 13 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Published in Transactions on Machine Learning Research (03/2023)

  5. arXiv:2110.10809  [pdf, other

    cs.LG cs.AI cs.RO

    Hierarchical Skills for Efficient Exploration

    Authors: Jonas Gehring, Gabriel Synnaeve, Andreas Krause, Nicolas Usunier

    Abstract: In reinforcement learning, pre-trained low-level skills have the potential to greatly facilitate exploration. However, prior knowledge of the downstream task is required to strike the right balance between generality (fine-grained control) and specificity (faster learning) in skill design. In previous work on continuous control, the sensitivity of methods to this trade-off has not been addressed e… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: To appear in 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  6. arXiv:1812.00054  [pdf, other

    cs.LG cs.AI

    Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger

    Authors: Gabriel Synnaeve, Zeming Lin, Jonas Gehring, Dan Gant, Vegard Mella, Vasil Khalidov, Nicolas Carion, Nicolas Usunier

    Abstract: We formulate the problem of defogging as state estimation and future state prediction from previous, partial observations in the context of real-time strategy games. We propose to employ encoder-decoder neural networks for this task, and introduce proxy tasks and baselines for evaluation to assess their ability of capturing basic game rules and high-level dynamics. By combining convolutional neura… ▽ More

    Submitted 30 November, 2018; originally announced December 2018.

    Journal ref: Advances in Neural Information Processing Systems 31 (2018) 10759-10770

  7. arXiv:1811.08568  [pdf, other

    cs.LG stat.ML

    High-Level Strategy Selection under Partial Observability in StarCraft: Brood War

    Authors: Jonas Gehring, Da Ju, Vegard Mella, Daniel Gant, Nicolas Usunier, Gabriel Synnaeve

    Abstract: We consider the problem of high-level strategy selection in the adversarial setting of real-time strategy games from a reinforcement learning perspective, where taking an action corresponds to switching to the respective strategy. Here, a good strategy successfully counters the opponent's current and possible future strategies which can only be estimated using partial observations. We investigate… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

  8. arXiv:1708.02139  [pdf, other

    cs.AI

    STARDATA: A StarCraft AI Research Dataset

    Authors: Zeming Lin, Jonas Gehring, Vasil Khalidov, Gabriel Synnaeve

    Abstract: We release a dataset of 65646 StarCraft replays that contains 1535 million frames and 496 million player actions. We provide full game state data along with the original replays that can be viewed in StarCraft. The game state data was recorded every 3 frames which ensures suitability for a wide variety of machine learning tasks such as strategy classification, inverse reinforcement learning, imita… ▽ More

    Submitted 7 August, 2017; originally announced August 2017.

    Comments: To be presented at AIIDE17

  9. arXiv:1705.03122  [pdf, other

    cs.CL

    Convolutional Sequence to Sequence Learning

    Authors: Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin

    Abstract: The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to recurrent models, computations over all elements can be fully parallelized during training and optimization is easier since the number of non-linearities is fixed… ▽ More

    Submitted 24 July, 2017; v1 submitted 8 May, 2017; originally announced May 2017.

  10. arXiv:1611.02344  [pdf, other

    cs.CL

    A Convolutional Encoder Model for Neural Machine Translation

    Authors: Jonas Gehring, Michael Auli, David Grangier, Yann N. Dauphin

    Abstract: The prevalent approach to neural machine translation relies on bi-directional LSTMs to encode the source sentence. In this paper we present a faster and simpler architecture based on a succession of convolutional layers. This allows to encode the entire source sentence simultaneously compared to recurrent networks for which computation is constrained by temporal dependencies. On WMT'16 English-Rom… ▽ More

    Submitted 24 July, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: 13 pages