Showing 1–34 of 34 results for author: Scialom, T

  1. arXiv:2311.15930

    cs.CL cs.AI

    WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models

    Authors: Youssef Benchekroun, Megi Dervishi, Mark Ibrahim, Jean-Baptiste Gaya, Xavier Martinet, Grégoire Mialon, Thomas Scialom, Emmanuel Dupoux, Dieuwke Hupkes, Pascal Vincent

    Abstract: We propose WorldSense, a benchmark designed to assess the extent to which LLMs are consistently able to sustain tacit world models, by testing how they draw simple inferences from descriptions of simple arrangements of entities. Worldsense is a synthetic benchmark with three problem types, each with their own trivial control, which explicitly avoids bias by decorrelating the abstract structure of…

    Submitted 27 November, 2023; originally announced November 2023.

  2. arXiv:2311.12983

    cs.CL cs.AI

    GAIA: a benchmark for General AI Assistants

    Authors: Grégoire Mialon, Clémentine Fourrier, Craig Swift, Thomas Wolf, Yann LeCun, Thomas Scialom

    Abstract: We introduce GAIA, a benchmark for General AI Assistants that, if solved, would represent a milestone in AI research. GAIA proposes real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency. GAIA questions are conceptually simple for humans yet challenging for most advanced AIs: we show that human r…

    Submitted 21 November, 2023; originally announced November 2023.

  3. arXiv:2308.13418

    cs.LG cs.CV

    Nougat: Neural Optical Understanding for Academic Documents

    Authors: Lukas Blecher, Guillem Cucurull, Thomas Scialom, Robert Stojnic

    Abstract: Scientific knowledge is predominantly stored in books and scientific journals, often in the form of PDFs. However, the PDF format leads to a loss of semantic information, particularly for mathematical expressions. We propose Nougat (Neural Optical Understanding for Academic Documents), a Visual Transformer model that performs an Optical Character Recognition (OCR) task for processing scientific do…

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 17 pages, 10 figures

  4. arXiv:2308.12950

    cs.CL

    Code Llama: Open Foundation Models for Code

    Authors: Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, Jingyu Liu, Romain Sauvestre, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom, et al. (1 additional author not shown)

    Abstract: We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following ability for programming tasks. We provide multiple flavors to cover a wide range of applications: foundation models (Code Llama), Python specializations (Code Llama…

    Submitted 31 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

  5. arXiv:2307.09288

    cs.CL cs.AI

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    Authors: Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, et al. (43 additional authors not shown)

    Abstract: In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be…

    Submitted 19 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  6. arXiv:2302.07842

    cs.CL

    Augmented Language Models: a Survey

    Authors: Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-Yu, Asli Celikyilmaz, Edouard Grave, Yann LeCun, Thomas Scialom

    Abstract: This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools. The former is defined as decomposing a potentially complex task into simpler subtasks while the latter consists in calling external modules such as a code interpreter. LMs can leverage these augmentations separately or in combination via heuristics, or learn to do so from demo…

    Submitted 15 February, 2023; originally announced February 2023.

  7. arXiv:2302.04761

    cs.CL

    Toolformer: Language Models Can Teach Themselves to Use Tools

    Authors: Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, Thomas Scialom

    Abstract: Language models (LMs) exhibit remarkable abilities to solve new tasks from just a few examples or textual instructions, especially at scale. They also, paradoxically, struggle with basic functionality, such as arithmetic or factual lookup, where much simpler and smaller models excel. In this paper, we show that LMs can teach themselves to use external tools via simple APIs and achieve the best of…

    Submitted 9 February, 2023; originally announced February 2023.

  8. arXiv:2301.11312

    cs.CL

    LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization

    Authors: Laura Nguyen, Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Text Summarization is a popular task and an active area of research for the Natural Language Processing community. By definition, it requires to account for long input texts, a characteristic which poses computational challenges for neural models. Moreover, real-world documents come in a variety of complex, visually-rich, layouts. This information is of great relevance, whether to highlight salien…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: To be published in EACL 2023

  9. arXiv:2212.09689

    cs.CL cs.AI cs.LG

    Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor

    Authors: Or Honovich, Thomas Scialom, Omer Levy, Timo Schick

    Abstract: Instruction tuning enables pretrained language models to perform new tasks from inference-time natural language descriptions. These approaches rely on vast amounts of human supervision in the form of crowdsourced datasets or user interactions. In this work, we introduce Unnatural Instructions: a large dataset of creative and diverse instructions, collected with virtually no human labor. We collect…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 18 pages, 7 figures

  10. arXiv:2211.09085

    cs.CL stat.ML

    Galactica: A Large Language Model for Science

    Authors: Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, Robert Stojnic

    Abstract: Information overload is a major obstacle to scientific progress. The explosive growth in scientific literature and data has made it ever harder to discover useful insights in a large mass of information. Today scientific knowledge is accessed through search engines, but they are unable to organize scientific knowledge alone. In this paper we introduce Galactica: a large language model that can sto…

    Submitted 16 November, 2022; originally announced November 2022.

  11. arXiv:2211.05100

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  12. arXiv:2211.01482

    cs.CL cs.AI cs.LG

    RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question

    Authors: Alireza Mohammadshahi, Thomas Scialom, Majid Yazdani, Pouya Yanki, Angela Fan, James Henderson, Marzieh Saeidi

    Abstract: Existing metrics for evaluating the quality of automatically generated questions such as BLEU, ROUGE, BERTScore, and BLEURT compare the reference and predicted questions, providing a high score when there is a considerable lexical overlap or semantic similarity between the candidate and the reference questions. This approach has two major shortcomings. First, we need expensive human-provided refer…

    Submitted 26 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted to Findings of ACL 2023

  13. arXiv:2205.12394

    cs.CL

    MaskEval: Weighted MLM-Based Evaluation for Text Summarization and Simplification

    Authors: Yu Lu Liu, Rachel Bawden, Thomas Scialom, Benoît Sagot, Jackie Chi Kit Cheung

    Abstract: In text summarization and simplification, system outputs must be evaluated along multiple dimensions such as relevance, factual consistency, fluency, and grammaticality, and a wide range of possible outputs could be of high quality. These properties make the development of an adaptable, reference-less evaluation metric both necessary and challenging. We introduce MaskEval, a reference-less metric…

    Submitted 13 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  14. arXiv:2205.12393

    cs.CL

    Fine-tuned Language Models are Continual Learners

    Authors: Thomas Scialom, Tuhin Chakrabarty, Smaranda Muresan

    Abstract: Recent work on large language models relies on the intuition that most natural language processing tasks can be described via natural language instructions. Language models trained on these instructions show strong zero-shot performance on several standard datasets. However, these models even though impressive still perform poorly on a wide range of tasks outside of their respective training and e…

    Submitted 29 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  15. Which Discriminator for Cooperative Text Generation?

    Authors: Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, Ewa Kijak, Vincent Claveau

    Abstract: Language models generate texts by successively predicting probability distributions for next tokens given past ones. A growing field of interest tries to leverage external information in the decoding process so that the generated texts have desired properties, such as being more natural, non toxic, faithful, or having a specific writing style. A solution is to use a classifier at each generation s…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 6 pages, 2 figures, accepted to SIGIR 2022

  16. arXiv:2204.04991

    cs.CL

    TRUE: Re-evaluating Factual Consistency Evaluation

    Authors: Or Honovich, Roee Aharoni, Jonathan Herzig, Hagai Taitelbaum, Doron Kukliansy, Vered Cohen, Thomas Scialom, Idan Szpektor, Avinatan Hassidim, Yossi Matias

    Abstract: Grounded text generation systems often generate text that contains factual inconsistencies, hindering their real-world applicability. Automatic factual consistency evaluation may help alleviate this limitation by accelerating evaluation cycles, filtering inconsistent outputs and augmenting training data. While attracting increasing attention, such evaluation metrics are usually developed and evalu…

    Submitted 3 May, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted as a long paper to NAACL 2022 main conference

  17. arXiv:2201.12320

    cs.LG cs.CL

    Generative Cooperative Networks for Natural Language Generation

    Authors: Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, Benjamin Piwowarski

    Abstract: Generative Adversarial Networks (GANs) have known a tremendous success for many continuous generation tasks, especially in the field of image generation. However, for discrete outputs such as language, optimizing GANs remains an open problem with many instabilities, as no gradient can be properly back-propagated from the discriminator output to the generator parameters. An alternative is to learn…

    Submitted 28 January, 2022; originally announced January 2022.

  18. arXiv:2112.02721

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo, et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split…

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  19. arXiv:2110.09147

    cs.CL cs.AI cs.LG

    BEAMetrics: A Benchmark for Language Generation Evaluation Evaluation

    Authors: Thomas Scialom, Felix Hill

    Abstract: Natural language processing (NLP) systems are increasingly trained to generate open-ended text rather than classifying between responses. This makes research on evaluation metrics for generated language -- functions that score system output given the context and/or human reference responses -- of critical importance. However, different metrics have different strengths and biases, and reflect human…

    Submitted 18 October, 2021; originally announced October 2021.

  20. arXiv:2109.01078

    cs.CL

    Skim-Attention: Learning to Focus via Document Layout

    Authors: Laura Nguyen, Thomas Scialom, Jacopo Staiano, Benjamin Piwowarski

    Abstract: Transformer-based pre-training techniques of text and layout have proven effective in a number of document understanding tasks. Despite this success, multimodal pre-training models suffer from very high computational and memory costs. Motivated by human reading strategies, this paper presents Skim-Attention, a new attention mechanism that takes advantage of the structure of the document and its la…

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: 15 pages, 6 figures, to be published in EMNLP 2021 Findings

  21. arXiv:2108.12560

    cs.CL cs.CV

    QACE: Asking Questions to Evaluate an Image Caption

    Authors: Hwanhee Lee, Thomas Scialom, Seunghyun Yoon, Franck Dernoncourt, Kyomin Jung

    Abstract: In this paper, we propose QACE, a new metric based on Question Answering for Caption Evaluation. QACE generates questions on the evaluated caption and checks its content by asking the questions on either the reference caption or the source image. We first develop QACE-Ref that compares the answers of the evaluated caption to its reference, and report competitive results with the state-of-the-art m…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: EMNLP 2021 Findings

  22. arXiv:2106.06363

    cs.CL cs.AI

    To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Due to the discrete nature of words, language GANs require to be optimized from rewards provided by discriminator networks, via reinforcement learning methods. This is a much harder setting than for continuous tasks, which enjoy gradient flows from discriminators to generators, usually leading to dramatic learning instabilities. However, we claim that this can be solved by making discriminator and…

    Submitted 11 June, 2021; originally announced June 2021.

  23. arXiv:2104.07560

    cs.CL

    Rethinking Automatic Evaluation in Sentence Simplification

    Authors: Thomas Scialom, Louis Martin, Jacopo Staiano, Éric Villemonte de la Clergerie, Benoît Sagot

    Abstract: Automatic evaluation remains an open research question in Natural Language Generation. In the context of Sentence Simplification, this is particularly challenging: the task requires by nature to replace complex words with simpler ones that shares the same meaning. This limits the effectiveness of n-gram based metrics like BLEU. Going hand in hand with the recent advances in NLG, new metrics have b…

    Submitted 16 April, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: updated affiliation and link to data

  24. arXiv:2104.07555

    cs.CL

    Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic Evaluation

    Authors: Clément Rebuffel, Thomas Scialom, Laure Soulier, Benjamin Piwowarski, Sylvain Lamprier, Jacopo Staiano, Geoffrey Scoutheeten, Patrick Gallinari

    Abstract: QuestEval is a reference-less metric used in text-to-text tasks, that compares the generated summaries directly to the source text, by automatically asking and answering questions. Its adaptation to Data-to-Text tasks is not straightforward, as it requires multimodal Question Generation and Answering systems on the considered tasks, which are seldom available. To this purpose, we propose a method…

    Submitted 7 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP 2021

  25. arXiv:2103.12693

    cs.CL

    QuestEval: Summarization Asks for Fact-based Evaluation

    Authors: Thomas Scialom, Paul-Alexis Dray, Patrick Gallinari, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano, Alex Wang

    Abstract: Summarization evaluation remains an open research problem: current metrics such as ROUGE are known to be limited and to correlate poorly with human judgments. To alleviate this issue, recent work has proposed evaluation metrics which rely on question answering models to assess whether a summary contains all the relevant information in its source document. Though promising, the proposed approaches…

    Submitted 9 April, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: project page: https://github.com/recitalAI/QuestEval

  26. arXiv:2010.12643

    cs.CL

    Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering

    Authors: Arij Riabi, Thomas Scialom, Rachel Keraron, Benoît Sagot, Djamé Seddah, Jacopo Staiano

    Abstract: Coupled with the availability of large scale datasets, deep learning architectures have enabled rapid progress on the Question Answering task. However, most of those datasets are in English, and the performances of state-of-the-art multilingual models are significantly lower when evaluated on non-English data. Due to high data collection costs, it is not realistic to obtain annotated data for each…

    Submitted 14 October, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: 7 pages

  27. arXiv:2010.03369

    cs.CL cs.AI

    Toward Stance-based Personas for Opinionated Dialogues

    Authors: Thomas Scialom, Serra Sinem Tekiroglu, Jacopo Staiano, Marco Guerini

    Abstract: In the context of chit-chat dialogues it has been shown that endowing systems with a persona profile is important to produce more coherent and meaningful conversations. Still, the representation of such personas has thus far been limited to a fact-based representation (e.g. "I have two cats."). We argue that these representations remain superficial w.r.t. the complexity of human personality. In th…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted at Findings of EMNLP 2020

  28. arXiv:2007.00968

    cs.CL

    Project PIAF: Building a Native French Question-Answering Dataset

    Authors: Rachel Keraron, Guillaume Lancrenon, Mathilde Bras, Frédéric Allary, Gilles Moyse, Thomas Scialom, Edmundo-Pavel Soriano-Morales, Jacopo Staiano

    Abstract: Motivated by the lack of data for non-English languages, in particular for the evaluation of downstream tasks such as Question Answering, we present a participatory effort to collect a native French Question Answering Dataset. Furthermore, we describe and publicly release the annotation tool developed for our collection effort, along with the data obtained and preliminary baselines.

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: LREC 2020

  29. arXiv:2006.04643

    cs.CL cs.LG

    ColdGANs: Taming Language GANs with Cautious Sampling Strategies

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Training regimes based on Maximum Likelihood Estimation (MLE) suffer from known limitations, often leading to poorly generated text sequences. At the root of these limitations is the mismatch between training and inference, i.e. the so-called exposure bias, exacerbated by considering only the reference texts as correct, while in practice several alternative formulations could be as good. Generativ…

    Submitted 8 June, 2020; originally announced June 2020.

  30. arXiv:2004.14900

    cs.CL

    MLSUM: The Multilingual Summarization Corpus

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: We present MLSUM, the first large-scale MultiLingual SUMmarization dataset. Obtained from online newspapers, it contains 1.5M+ article/summary pairs in five different languages -- namely, French, German, Spanish, Russian, Turkish. Together with English newspapers from the popular CNN/Daily mail dataset, the collected data form a large scale multilingual dataset which can enable new research direct…

    Submitted 30 April, 2020; originally announced April 2020.

  31. arXiv:2002.10832

    cs.CL cs.CV cs.LG

    What BERT Sees: Cross-Modal Transfer for Visual Question Generation

    Authors: Thomas Scialom, Patrick Bordes, Paul-Alexis Dray, Jacopo Staiano, Patrick Gallinari

    Abstract: Pre-trained language models have recently contributed to significant advances in NLP tasks. Recently, multi-modal versions of BERT have been developed, using heavy pre-training relying on vast corpora of aligned textual and image data, primarily applied to classification tasks such as VQA. In this paper, we are interested in evaluating the visual capabilities of BERT out-of-the-box, by avoiding pr…

    Submitted 16 December, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: INLG 2020

  32. arXiv:2002.10375

    cs.CL cs.LG

    Discriminative Adversarial Search for Abstractive Summarization

    Authors: Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: We introduce a novel approach for sequence decoding, Discriminative Adversarial Search (DAS), which has the desirable properties of alleviating the effects of exposure bias without requiring external metrics. Inspired by Generative Adversarial Networks (GANs), wherein a discriminator is used to improve the generator, our method differs from GANs in that the generator parameters are not updated at…

    Submitted 30 August, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  33. arXiv:1911.03350

    cs.CL cs.AI

    Ask to Learn: A Study on Curiosity-driven Question Generation

    Authors: Thomas Scialom, Jacopo Staiano

    Abstract: We propose a novel text generation task, namely Curiosity-driven Question Generation. We start from the observation that the Question Generation task has traditionally been considered as the dual problem of Question Answering, hence tackling the problem of generating a question given the text that contains its answer. Such questions can be used to evaluate machine reading comprehension. However, i…

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 13 pages, 3 figures

  34. arXiv:1909.01610

    cs.CL cs.AI cs.IR

    Answers Unite! Unsupervised Metrics for Reinforced Summarization Models

    Authors: Thomas Scialom, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

    Abstract: Abstractive summarization approaches based on Reinforcement Learning (RL) have recently been proposed to overcome classical likelihood maximization. RL enables to consider complex, possibly non-differentiable, metrics that globally assess the quality and relevance of the generated outputs. ROUGE, the most used summarization metric, is known to suffer from bias towards lexical similarity as well as…

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted at EMNLP 2019