Skip to main content

Showing 1–3 of 3 results for author: Lopes, A V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.01826  [pdf, other

    cs.CL cs.AI

    One Wide Feedforward is All You Need

    Authors: Telmo Pessoa Pires, António V. Lopes, Yannick Assogba, Hendra Setiawan

    Abstract: The Transformer architecture has two main non-embedding components: Attention and the Feed Forward Network (FFN). Attention captures interdependencies between words regardless of their position, while the FFN non-linearly transforms each input token independently. In this work we explore the role of the FFN, and find that despite taking up a significant fraction of the model's parameters, it is hi… ▽ More

    Submitted 21 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: Accepted at WMT23 (EMNLP 2023)

  2. arXiv:1907.10352  [pdf, other

    cs.CL

    Unbabel's Participation in the WMT19 Translation Quality Estimation Shared Task

    Authors: Fabio Kepler, Jonay Trénous, Marcos Treviso, Miguel Vera, António Góis, M. Amin Farajian, António V. Lopes, André F. T. Martins

    Abstract: We present the contribution of the Unbabel team to the WMT 2019 Shared Task on Quality Estimation. We participated on the word, sentence, and document-level tracks, encompassing 3 language pairs: English-German, English-Russian, and English-French. Our submissions build upon the recent OpenKiwi framework: we combine linear, neural, and predictor-estimator systems with new transfer learning approac… ▽ More

    Submitted 11 September, 2019; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: In Proceedings of the Fourth Conference on Machine Translation (WMT) 2019: https://www.aclweb.org/anthology/W19-5406/

  3. arXiv:1905.13068  [pdf, other

    cs.CL

    Unbabel's Submission to the WMT2019 APE Shared Task: BERT-based Encoder-Decoder for Automatic Post-Editing

    Authors: António V. Lopes, M. Amin Farajian, Gonçalo M. Correia, Jonay Trenous, André F. T. Martins

    Abstract: This paper describes Unbabel's submission to the WMT2019 APE Shared Task for the English-German language pair. Following the recent rise of large, powerful, pre-trained models, we adapt the BERT pretrained model to perform Automatic Post-Editing in an encoder-decoder framework. Analogously to dual-encoder architectures we develop a BERT-based encoder-decoder (BED) model in which a single pretraine… ▽ More

    Submitted 29 June, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: Updated sections 2.2 and 4