Showing 1–4 of 4 results for author: Rebuffel, C

  1. arXiv:2104.07555

    cs.CL

    Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic Evaluation

    Authors: Clément Rebuffel, Thomas Scialom, Laure Soulier, Benjamin Piwowarski, Sylvain Lamprier, Jacopo Staiano, Geoffrey Scoutheeten, Patrick Gallinari

    Abstract: QuestEval is a reference-less metric used in text-to-text tasks, that compares the generated summaries directly to the source text, by automatically asking and answering questions. Its adaptation to Data-to-Text tasks is not straightforward, as it requires multimodal Question Generation and Answering systems on the considered tasks, which are seldom available. To this purpose, we propose a method…

    Submitted 7 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP 2021

  2. arXiv:2102.02810

    cs.CL cs.AI cs.LG cs.NE

    Controlling Hallucinations at Word Level in Data-to-Text Generation

    Authors: Clément Rebuffel, Marco Roberti, Laure Soulier, Geoffrey Scoutheeten, Rossella Cancelliere, Patrick Gallinari

    Abstract: Data-to-Text Generation (DTG) is a subfield of Natural Language Generation aiming at transcribing structured data in natural language descriptions. The field has been recently boosted by the use of neural-based generators which exhibit on one side great syntactic skills without the need of hand-crafted pipelines; on the other side, the quality of the generated text reflects the quality of the trai…

    Submitted 9 July, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: 20 pages, 6 figures, 5 tables (excluding Appendix). Source code: https://github.com/KaijuML/dtt-multi-branch

    MSC Class: 68T50 (Primary); 68T07 (Secondary); 68T05

    ACM Class: I.2.6; I.2.7

  3. arXiv:2010.10866

    cs.CL

    PARENTing via Model-Agnostic Reinforcement Learning to Correct Pathological Behaviors in Data-to-Text Generation

    Authors: Clément Rebuffel, Laure Soulier, Geoffrey Scoutheeten, Patrick Gallinari

    Abstract: In language generation models conditioned by structured data, the classical training via maximum likelihood almost always leads models to pick up on dataset divergence (i.e., hallucinations or omissions), and to incorporate them erroneously in their own generations at inference. In this work, we build on top of previous Reinforcement Learning based approaches and show that a model-agnostic framewor…

    Submitted 22 October, 2020; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted at the 13th International Conference on Natural Language Generation (INLG 2020)

  4. arXiv:1912.10011

    cs.CL cs.IR cs.LG

    A Hierarchical Model for Data-to-Text Generation

    Authors: Clément Rebuffel, Laure Soulier, Geoffrey Scoutheeten, Patrick Gallinari

    Abstract: Transcribing structured data into natural language descriptions has emerged as a challenging task, referred to as "data-to-text". These structures generally regroup multiple elements, as well as their attributes. Most attempts rely on translation encoder-decoder methods which linearize elements into a sequence. This however loses most of the structure contained in the data. In this work, we propos…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Accepted at the 42nd European Conference on IR Research, ECIR 2020