Skip to main content

Showing 51–62 of 62 results for author: Costa-jussà, M R

.
  1. arXiv:1907.00735  [pdf, other

    cs.CL

    From Bilingual to Multilingual Neural Machine Translation by Incremental Training

    Authors: Carlos Escolano, Marta R. Costa-Jussà, José A. R. Fonollosa

    Abstract: Multilingual Neural Machine Translation approaches are based on the use of task-specific models and the addition of one more language can only be done by retraining the whole system. In this work, we propose a new training schedule that allows the system to scale to more languages without modification of the previous components based on joint training and language-independent encoder/decoder modul… ▽ More

    Submitted 11 July, 2019; v1 submitted 28 June, 2019; originally announced July 2019.

    Comments: Accepted paper at ACL 2019 Student Research Workshop. arXiv admin note: substantial text overlap with arXiv:1905.06831

  2. arXiv:1905.06831  [pdf, other

    cs.CL

    Towards Interlingua Neural Machine Translation

    Authors: Carlos Escolano, Marta R. Costa-jussà, José A. R. Fonollosa

    Abstract: Common intermediate language representation in neural machine translation can be used to extend bilingual to multilingual systems by incremental training. In this paper, we propose a new architecture based on introducing an interlingual loss as an additional training objective. By adding and forcing this interlingual loss, we are able to train multiple encoders and decoders for each language, shar… ▽ More

    Submitted 8 December, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1810.06351

  3. arXiv:1905.06596  [pdf, other

    cs.CL cs.LG

    Joint Source-Target Self Attention with Locality Constraints

    Authors: José A. R. Fonollosa, Noe Casas, Marta R. Costa-jussà

    Abstract: The dominant neural machine translation models are based on the encoder-decoder structure, and many of them rely on an unconstrained receptive field over source and target sequences. In this paper we study a new architecture that breaks with both conventions. Our simplified architecture consists in the decoder part of a transformer model, based on self-attention, but with locality constraints appl… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

  4. arXiv:1904.08783  [pdf, other

    cs.CL cs.LG

    Evaluating the Underlying Gender Bias in Contextualized Word Embeddings

    Authors: Christine Basta, Marta R. Costa-jussà, Noe Casas

    Abstract: Gender bias is highly impacting natural language processing applications. Word embeddings have clearly been proven both to keep and amplify gender biases that are present in current data sources. Recently, contextualized word embeddings have enhanced previous word embedding techniques by computing word vector representations dependent on the sentence they appear in. In this paper, we study the i… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

  5. arXiv:1901.03116  [pdf, other

    cs.CL

    Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques

    Authors: Joel Escudé Font, Marta R. Costa-jussà

    Abstract: Neural machine translation has significantly pushed forward the quality of the field. However, there are remaining big issues with the output translations and one of them is fairness. Neural models are trained on large text corpora which contain biases and stereotypes. As a consequence, models inherit these social biases. Recent methods have shown results in reducing gender bias in other natural l… ▽ More

    Submitted 2 June, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

    Comments: This paper has been accepted for publication at the 1st ACL Workshop on Gender Bias for Natural Language Processing (2019)

  6. arXiv:1810.06351  [pdf, other

    cs.CL

    (Self-Attentive) Autoencoder-based Universal Language Representation for Machine Translation

    Authors: Carlos Escolano, Marta R. Costa-jussà, José A. R. Fonollosa

    Abstract: Universal language representation is the holy grail in machine translation (MT). Thanks to the new neural MT approach, it seems that there are good perspectives towards this goal. In this paper, we propose a new architecture based on combining variational autoencoders with encoder-decoders and introducing an interlingual loss as an additional training objective. By adding and forcing this interlin… ▽ More

    Submitted 15 October, 2018; originally announced October 2018.

    Comments: 7 pages, 4 figures

  7. arXiv:1807.00651  [pdf, other

    cs.CL

    A Neural Approach to Language Variety Translation

    Authors: Marta R. Costa-jussà, Marcos Zampieri, Santanu Pal

    Abstract: In this paper we present the first neural-based machine translation system trained to translate between standard national varieties of the same language. We take the pair Brazilian - European Portuguese as an example and compare the performance of this method to a phrase-based statistical machine translation system. We report a performance improvement of 0.9 BLEU points in translating from Europea… ▽ More

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial)

  8. arXiv:1803.07139  [pdf, other

    cs.CL cs.AI

    English-Catalan Neural Machine Translation in the Biomedical Domain through the cascade approach

    Authors: Marta R. Costa-jussà, Noe Casas, Maite Melero

    Abstract: This paper describes the methodology followed to build a neural machine translation system in the biomedical domain for the English-Catalan language pair. This task can be considered a low-resourced task from the point of view of the domain and the language pair. To face this task, this paper reports experiments on a cascade pivot strategy through Spanish for the neural machine translation using t… ▽ More

    Submitted 26 April, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

    Comments: Full workshop proceedings can be found at https://multilingualbio.bsc.es/wp-content/uploads/2018/03/LREC-2018-PROCEEDINGS-MultilingualBIO.pdf

    Journal ref: Proceedings of workshop "MultilingualBIO: Multilingual Biomedical Text Processing" of the 11th Edition of the Language Resources and Evaluation Conference, 2018

  9. arXiv:1707.07469  [pdf, other

    cs.CL cs.LG

    Character-level Intra Attention Network for Natural Language Inference

    Authors: Han Yang, Marta R. Costa-jussà, José A. R. Fonollosa

    Abstract: Natural language inference (NLI) is a central problem in language understanding. End-to-end artificial neural networks have reached state-of-the-art performance in NLI field recently. In this paper, we propose Character-level Intra Attention Network (CIAN) for the NLI task. In our model, we use the character-level convolutional network to replace the standard word embedding layer, and we use the… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

    Comments: EMNLP Workshop RepEval 2017: The Second Workshop on Evaluating Vector Space Representations for NLP

  10. arXiv:1610.02209  [pdf, ps, other

    cs.CL stat.ML

    Morphology Generation for Statistical Machine Translation using Deep Learning Techniques

    Authors: Marta R. Costa-jussà, Carlos Escolano

    Abstract: Morphology in unbalanced languages remains a big challenge in the context of machine translation. In this paper, we propose to de-couple machine translation from morphology generation in order to better deal with the problem. We investigate the morphology simplification with a reasonable trade-off between expected gain and generation complexity. For the Chinese-Spanish task, optimum morphological… ▽ More

    Submitted 6 February, 2017; v1 submitted 7 October, 2016; originally announced October 2016.

  11. arXiv:1603.00810  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    Character-based Neural Machine Translation

    Authors: Marta R. Costa-Jussà, José A. R. Fonollosa

    Abstract: Neural Machine Translation (MT) has reached state-of-the-art results. However, one of the main challenges that neural MT still faces is dealing with very large vocabularies and morphologically rich languages. In this paper, we propose a neural MT system using character-based embeddings in combination with convolutional and highway layers to replace the standard lookup-based word representations. T… ▽ More

    Submitted 30 June, 2016; v1 submitted 2 March, 2016; originally announced March 2016.

    Comments: Accepted for publication at ACL 2016

  12. Evaluating Indirect Strategies for Chinese-Spanish Statistical Machine Translation

    Authors: Marta R. Costa-jussà, Carlos A. Henríquez, Rafael E. Banchs

    Abstract: Although, Chinese and Spanish are two of the most spoken languages in the world, not much research has been done in machine translation for this language pair. This paper focuses on investigating the state-of-the-art of Chinese-to-Spanish statistical machine translation (SMT), which nowadays is one of the most popular approaches to machine translation. For this purpose, we report details of the av… ▽ More

    Submitted 3 February, 2014; originally announced February 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 45, pages 761-780, 2012