Skip to main content

Showing 1–7 of 7 results for author: Sartran, L

.
  1. arXiv:2308.03291  [pdf, other

    cs.LG cs.AI cs.CL

    SynJax: Structured Probability Distributions for JAX

    Authors: Miloš Stanojević, Laurent Sartran

    Abstract: The development of deep learning software libraries enabled significant progress in the field by allowing users to focus on modeling, while letting the library to take care of the tedious and time-consuming task of optimizing execution for modern hardware accelerators. However, this has benefited only particular types of deep learning models, such as Transformers, whose primitives map easily to th… ▽ More

    Submitted 15 October, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  2. arXiv:2305.07558  [pdf, other

    cs.CL cs.CV

    Measuring Progress in Fine-grained Vision-and-Language Understanding

    Authors: Emanuele Bugliarello, Laurent Sartran, Aishwarya Agrawal, Lisa Anne Hendricks, Aida Nematzadeh

    Abstract: While pretraining on large-scale image-text data from the Web has facilitated rapid progress on many vision-and-language (V&L) tasks, recent work has demonstrated that pretrained models lack "fine-grained" understanding, such as the ability to recognise relationships, verbs, and numbers in images. This has resulted in an increased interest in the community to either develop new benchmarks or model… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  3. arXiv:2211.15089  [pdf, other

    cs.CL cs.LG

    Continuous diffusion for categorical data

    Authors: Sander Dieleman, Laurent Sartran, Arman Roshannai, Nikolay Savinov, Yaroslav Ganin, Pierre H. Richemond, Arnaud Doucet, Robin Strudel, Chris Dyer, Conor Durkan, Curtis Hawthorne, Rémi Leblond, Will Grathwohl, Jonas Adler

    Abstract: Diffusion models have quickly become the go-to paradigm for generative modelling of perceptual signals (such as images and sound) through iterative refinement. Their success hinges on the fact that the underlying physical phenomena are continuous. For inherently discrete and categorical data such as language, various diffusion-inspired alternatives have been proposed. However, the continuous natur… ▽ More

    Submitted 15 December, 2022; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 26 pages, 8 figures; corrections and additional information about hyperparameters

  4. Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale

    Authors: Laurent Sartran, Samuel Barrett, Adhiguna Kuncoro, Miloš Stanojević, Phil Blunsom, Chris Dyer

    Abstract: We introduce Transformer Grammars (TGs), a novel class of Transformer language models that combine (i) the expressive power, scalability, and strong performance of Transformers and (ii) recursive syntactic compositions, which here are implemented through a special attention mask and deterministic transformation of the linearized tree. We find that TGs outperform various strong baselines on sentenc… ▽ More

    Submitted 6 December, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: 17 pages, 5 figures, 2 tables and 1 algorithm. To appear in TACL, to be presented at EMNLP 2022

  5. arXiv:2202.11444  [pdf, other

    cs.CL cs.AI cs.LG

    Enabling arbitrary translation objectives with Adaptive Tree Search

    Authors: Wang Ling, Wojciech Stokowiec, Domenic Donato, Laurent Sartran, Lei Yu, Austin Matthews, Chris Dyer

    Abstract: We introduce an adaptive tree search algorithm, that can find high-scoring outputs under translation models that make no assumptions about the form or structure of the search objective. This algorithm -- a deterministic variant of Monte Carlo tree search -- enables the exploration of new kinds of models that are unencumbered by constraints imposed to make decoding tractable, such as autoregressivi… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 17 pages, 3 figures

  6. arXiv:2006.03662  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Rapid Task-Solving in Novel Environments

    Authors: Sam Ritter, Ryan Faulkner, Laurent Sartran, Adam Santoro, Matt Botvinick, David Raposo

    Abstract: We propose the challenge of rapid task-solving in novel environments (RTS), wherein an agent must solve a series of tasks as rapidly as possible in an unfamiliar environment. An effective RTS agent must balance between exploring the unfamiliar environment and solving its current task, all while building a model of the new environment over which it can plan when faced with later tasks. While modern… ▽ More

    Submitted 19 April, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

  7. arXiv:1910.00553  [pdf, other

    cs.CL cs.LG

    Better Document-Level Machine Translation with Bayes' Rule

    Authors: Lei Yu, Laurent Sartran, Wojciech Stokowiec, Wang Ling, Lingpeng Kong, Phil Blunsom, Chris Dyer

    Abstract: We show that Bayes' rule provides an effective mechanism for creating document translation models that can be learned from only parallel sentences and monolingual documents---a compelling benefit as parallel documents are not always available. In our formulation, the posterior probability of a candidate translation is the product of the unconditional (prior) probability of the candidate output doc… ▽ More

    Submitted 2 July, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: Accepted by TACL