Skip to main content

Showing 1–7 of 7 results for author: Simoes, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.05617  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.CY

    Debiasing Methods for Fairer Neural Models in Vision and Language Research: A Survey

    Authors: Otávio Parraga, Martin D. More, Christian M. Oliveira, Nathan S. Gavenski, Lucas S. Kupssinskü, Adilson Medronha, Luis V. Moura, Gabriel S. Simões, Rodrigo C. Barros

    Abstract: Despite being responsible for state-of-the-art results in several computer vision and natural language processing tasks, neural networks have faced harsh criticism due to some of their current shortcomings. One of them is that neural networks are correlation machines prone to model biases within the data instead of focusing on actual useful causal relationships. This problem is particularly seriou… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: Submitted to ACM Computing Surveys - Special Issue on Trustworthy AI

  2. arXiv:2203.15108  [pdf, other

    cs.CL

    A Well-Composed Text is Half Done! Composition Sampling for Diverse Conditional Generation

    Authors: Shashi Narayan, Gonçalo Simões, Yao Zhao, Joshua Maynez, Dipanjan Das, Michael Collins, Mirella Lapata

    Abstract: We propose Composition Sampling, a simple but effective method to generate diverse outputs for conditional generation of higher quality compared to previous stochastic decoding strategies. It builds on recently proposed plan-based neural generation models (Narayan et al, 2021) that are trained to first create a composition of the output and then generate by conditioning on it and the input. Our ap… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 21 pages, ACL 2022

  3. arXiv:2104.07606  [pdf, other

    cs.CL

    Planning with Learned Entity Prompts for Abstractive Summarization

    Authors: Shashi Narayan, Yao Zhao, Joshua Maynez, Gonçalo Simoes, Vitaly Nikolaev, Ryan McDonald

    Abstract: We introduce a simple but flexible mechanism to learn an intermediate plan to ground the generation of abstractive summaries. Specifically, we prepend (or prompt) target summaries with entity chains -- ordered sequences of entities mentioned in the summary. Transformer-based sequence-to-sequence models are then trained to generate the entity chain and then continue generating the summary condition… ▽ More

    Submitted 5 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Accepted to appear at TACL (19 pages, pre-MIT Press publication version)

  4. arXiv:2004.14535  [pdf, other

    cs.CL

    Text Segmentation by Cross Segment Attention

    Authors: Michal Lukasik, Boris Dadachev, Gonçalo Simões, Kishore Papineni

    Abstract: Document and discourse segmentation are two fundamental NLP tasks pertaining to breaking up text into constituents, which are commonly used to help downstream tasks such as information retrieval or text summarization. In this work, we propose three transformer-based architectures and provide comprehensive comparisons with previously proposed approaches on three standard datasets. We establish a ne… ▽ More

    Submitted 7 December, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: 10 pages, 4 figures

  5. arXiv:2004.11026  [pdf, ps, other

    cs.CL

    QURIOUS: Question Generation Pretraining for Text Generation

    Authors: Shashi Narayan, Gonçalo Simoes, Ji Ma, Hannah Craighead, Ryan Mcdonald

    Abstract: Recent trends in natural language processing using pretraining have shifted focus towards pretraining and fine-tuning approaches for text generation. Often the focus has been on task-agnostic approaches that generalize the language modeling objective. We propose question generation as a pretraining method, which better aligns with the text generation objectives. Our text generation models pretrain… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: 9 pages

  6. arXiv:1805.08237  [pdf, other

    cs.CL

    Morphosyntactic Tagging with a Meta-BiLSTM Model over Context Sensitive Token Encodings

    Authors: Bernd Bohnet, Ryan McDonald, Goncalo Simoes, Daniel Andor, Emily Pitler, Joshua Maynez

    Abstract: The rise of neural networks, and particularly recurrent neural networks, has produced significant advances in part-of-speech tagging accuracy. One characteristic common among these models is the presence of rich initial word encodings. These encodings typically are composed of a recurrent character-based representation with learned and pre-trained word embeddings. However, these encodings do not c… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Journal ref: ACL 2018

  7. arXiv:1302.4874  [pdf, other

    cs.CL cs.LG

    A Labeled Graph Kernel for Relationship Extraction

    Authors: Gonçalo Simões, Helena Galhardas, David Matos

    Abstract: In this paper, we propose an approach for Relationship Extraction (RE) based on labeled graph kernels. The kernel we propose is a particularization of a random walk kernel that exploits two properties previously studied in the RE literature: (i) the words between the candidate entities or connecting them in a syntactic representation are particularly likely to carry information regarding the relat… ▽ More

    Submitted 20 February, 2013; originally announced February 2013.