Skip to main content

Showing 1–17 of 17 results for author: Ribeiro, L F R

.
  1. arXiv:2406.03592  [pdf, other

    cs.CL cs.AI

    Measuring Retrieval Complexity in Question Answering Systems

    Authors: Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg, Leonardo F. R. Ribeiro, Alessandro Moschitti

    Abstract: In this paper, we investigate which questions are challenging for retrieval-based Question Answering (QA). We (i) propose retrieval complexity (RC), a novel metric conditioned on the completeness of retrieved documents, which measures the difficulty of answering questions, and (ii) propose an unsupervised pipeline to measure RC given an arbitrary retrieval system. Our proposed pipeline measures RC… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 (findings)

  2. arXiv:2404.01701  [pdf, other

    cs.CL

    On the Role of Summary Content Units in Text Summarization Evaluation

    Authors: Marcel Nawrath, Agnieszka Nowak, Tristan Ratz, Danilo C. Walenta, Juri Opitz, Leonardo F. R. Ribeiro, João Sedoc, Daniel Deutsch, Simon Mille, Yixin Liu, Lining Zhang, Sebastian Gehrmann, Saad Mahamood, Miruna Clinciu, Khyathi Chandu, Yufang Hou

    Abstract: At the heart of the Pyramid evaluation method for text summarization lie human written summary content units (SCUs). These SCUs are concise sentences that decompose a summary into small facts. Such SCUs can be used to judge the quality of a candidate summary, possibly partially automated via natural language inference (NLI) systems. Interestingly, with the aim to fully automate the Pyramid evaluat… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 10 Pages, 3 Figures, 3 Tables, camera ready version accepted at NAACL 2024

  3. arXiv:2310.10623  [pdf, other

    cs.CL cs.AI cs.LG

    Generating Summaries with Controllable Readability Levels

    Authors: Leonardo F. R. Ribeiro, Mohit Bansal, Markus Dreyer

    Abstract: Readability refers to how easily a reader can understand a written text. Several factors affect the readability level, such as the complexity of the text, its subject matter, and the reader's background knowledge. Generating summaries based on different readability levels is critical for enabling knowledge consumption by diverse audiences. However, current text generation approaches lack refined c… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted as an EMNLP 2023 main paper

  4. arXiv:2305.07716  [pdf, other

    cs.RO cs.AI

    Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2 into a Robot Language Model for Grounded Task Planning

    Authors: Georgia Chalvatzaki, Ali Younes, Daljeet Nandha, An Le, Leonardo F. R. Ribeiro, Iryna Gurevych

    Abstract: Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 21 pages, 6 figures

  5. arXiv:2210.10695  [pdf, other

    cs.IR cs.CL

    Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking

    Authors: Tim Baumgärtner, Leonardo F. R. Ribeiro, Nils Reimers, Iryna Gurevych

    Abstract: Pairing a lexical retriever with a neural re-ranking model has set state-of-the-art performance on large-scale information retrieval datasets. This pipeline covers scenarios like question answering or navigational queries, however, for information-seeking scenarios, users often provide information on whether a document is relevant to their query in form of clicks or explicit feedback. Therefore, i… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022

  6. arXiv:2208.09316  [pdf, other

    cs.CL

    UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA

    Authors: Rachneet Sachdeva, Haritz Puerto, Tim Baumgärtner, Sewin Tariverdian, Hao Zhang, Kexin Wang, Hossain Shaikh Saadi, Leonardo F. R. Ribeiro, Iryna Gurevych

    Abstract: Question Answering (QA) systems are increasingly deployed in applications where they support real-world decisions. However, state-of-the-art models rely on deep neural networks, which are difficult to interpret by humans. Inherently interpretable models or post hoc explainability methods can help users to comprehend how a model arrives at its prediction and, if successful, increase their trust in… ▽ More

    Submitted 20 October, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: Accepted at AACL 2022 as Demo Paper

  7. arXiv:2206.11249  [pdf, other

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di **, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter , et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an… ▽ More

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  8. arXiv:2204.06508  [pdf, other

    cs.CL cs.AI cs.LG

    FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations

    Authors: Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal

    Abstract: Despite recent improvements in abstractive summarization, most current approaches generate summaries that are not factually consistent with the source document, severely restricting their trust and usage in real-world applications. Recent works have shown promising improvements in factuality error identification using text or dependency arc entailments; however, they do not consider the entire sem… ▽ More

    Submitted 19 July, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: NAACL 2022 (15 pages)

  9. arXiv:2203.13693  [pdf, other

    cs.CL cs.IR

    UKP-SQUARE: An Online Platform for Question Answering Research

    Authors: Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych

    Abstract: Recent advances in NLP and information retrieval have given rise to a diverse set of question answering tasks that are of different formats (e.g., extractive, abstractive), require different model architectures (e.g., generative, discriminative), and setups (e.g., with or without retrieval). Despite having a large number of powerful, specialized QA pipelines (which we refer to as Skills) that cons… ▽ More

    Submitted 28 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 Demo Track

  10. arXiv:2109.03808  [pdf, other

    cs.CL

    Smelting Gold and Silver for Improved Multilingual AMR-to-Text Generation

    Authors: Leonardo F. R. Ribeiro, Jonas Pfeiffer, Yue Zhang, Iryna Gurevych

    Abstract: Recent work on multilingual AMR-to-text generation has exclusively focused on data augmentation strategies that utilize silver AMR. However, this assumes a high quality of generated AMRs, potentially limiting the transferability to the target task. In this paper, we investigate different techniques for automatically generating AMR annotations, where we aim to study which source of information yiel… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted as a conference paper to EMNLP 2021

  11. arXiv:2103.09120  [pdf, other

    cs.CL

    Structural Adapters in Pretrained Language Models for AMR-to-text Generation

    Authors: Leonardo F. R. Ribeiro, Yue Zhang, Iryna Gurevych

    Abstract: Pretrained language models (PLM) have recently advanced graph-to-text generation, where the input graph is linearized into a sequence and fed into the PLM to obtain its representation. However, efficiently encoding the graph structure in PLMs is challenging because such models were pretrained on natural language, and modeling structured data may lead to catastrophic forgetting of distributional kn… ▽ More

    Submitted 8 September, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Accepted as a long conference paper to EMNLP 2021

  12. arXiv:2007.08426  [pdf, other

    cs.CL

    Investigating Pretrained Language Models for Graph-to-Text Generation

    Authors: Leonardo F. R. Ribeiro, Martin Schmitt, Hinrich Schütze, Iryna Gurevych

    Abstract: Graph-to-text generation aims to generate fluent texts from graph-based data. In this paper, we investigate two recently proposed pretrained language models (PLMs) and analyze the impact of different task-adaptive pretraining strategies for PLMs in graph-to-text generation. We present a study across three graph domains: meaning representations, Wikipedia knowledge graphs (KGs) and scientific KGs.… ▽ More

    Submitted 27 September, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted as a long paper to NLP4ConvAI, EMNLP2021

  13. arXiv:2006.09242  [pdf, other

    cs.CL

    Modeling Graph Structure via Relative Position for Text Generation from Knowledge Graphs

    Authors: Martin Schmitt, Leonardo F. R. Ribeiro, Philipp Dufter, Iryna Gurevych, Hinrich Schütze

    Abstract: We present Graformer, a novel Transformer-based encoder-decoder architecture for graph-to-text generation. With our novel graph self-attention, the encoding of a node relies on all nodes in the input graph - not only direct neighbors - facilitating the detection of global patterns. We represent the relation between two nodes as the length of the shortest path between them. Graformer learns to weig… ▽ More

    Submitted 27 April, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Accepted as a long paper at TextGraphs 2021

  14. arXiv:2005.11787  [pdf, ps, other

    cs.CL

    Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers

    Authors: Anne Lauscher, Olga Majewska, Leonardo F. R. Ribeiro, Iryna Gurevych, Nikolai Rozanov, Goran Glavaš

    Abstract: Following the major success of neural language models (LMs) such as BERT or GPT-2 on a variety of language understanding tasks, recent work focused on injecting (structured) knowledge from external resources into these models. While on the one hand, joint pretraining (i.e., training from scratch, adding objectives based on external knowledge to the primary LM objective) may be prohibitively comput… ▽ More

    Submitted 11 October, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: EMNLP 2020 - DeeLIO, ECML 2020 - DECODEML, 5 pages, 4 tables, 3 references

  15. arXiv:2001.11003  [pdf, other

    cs.CL

    Modeling Global and Local Node Contexts for Text Generation from Knowledge Graphs

    Authors: Leonardo F. R. Ribeiro, Yue Zhang, Claire Gardent, Iryna Gurevych

    Abstract: Recent graph-to-text models generate text from graph-based data using either global or local aggregation to learn node representations. Global node encoding allows explicit communication between two distant nodes, thereby neglecting graph topology as all nodes are directly connected. In contrast, local node encoding considers the relations between neighbor nodes capturing the graph structure, but… ▽ More

    Submitted 22 June, 2020; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2020; Author's final version; pre-MIT Press publication version

  16. arXiv:1909.00352  [pdf, other

    cs.CL

    Enhancing AMR-to-Text Generation with Dual Graph Representations

    Authors: Leonardo F. R. Ribeiro, Claire Gardent, Iryna Gurevych

    Abstract: Generating text from graph-based data, such as Abstract Meaning Representation (AMR), is a challenging task due to the inherent difficulty in how to properly encode the structure of a graph with labeled edges. To address this difficulty, we propose a novel graph-to-sequence model that encodes different but complementary perspectives of the structural information contained in the AMR graph. The mod… ▽ More

    Submitted 1 September, 2019; originally announced September 2019.

    Comments: Accepted as a long conference paper to EMNLP 2019

  17. arXiv:1704.03165  [pdf, other

    cs.SI cs.LG stat.ML

    struc2vec: Learning Node Representations from Structural Identity

    Authors: Leonardo F. R. Ribeiro, Pedro H. P. Savarese, Daniel R. Figueiredo

    Abstract: Structural identity is a concept of symmetry in which network nodes are identified according to the network structure and their relationship to other nodes. Structural identity has been studied in theory and practice over the past decades, but only recently has it been addressed with representational learning techniques. This work presents struc2vec, a novel and flexible framework for learning lat… ▽ More

    Submitted 3 July, 2017; v1 submitted 11 April, 2017; originally announced April 2017.

    Comments: 10 pages, KDD2017, Research Track