Search | arXiv e-print repository

arXiv:2012.09823 [pdf, ps, other]

doi 10.18653/v1/2020.coling-main.574

Continual Lifelong Learning in Natural Language Processing: A Survey

Authors: Magdalena Biesialska, Katarzyna Biesialska, Marta R. Costa-jussà

Abstract: Continual learning (CL) aims to enable information systems to learn from a continuous data stream across time. However, it is difficult for existing deep learning architectures to learn a new task without largely forgetting previously acquired knowledge. Furthermore, CL is particularly challenging for language learning, as natural language is ambiguous: it is discrete, compositional, and its meani… ▽ More Continual learning (CL) aims to enable information systems to learn from a continuous data stream across time. However, it is difficult for existing deep learning architectures to learn a new task without largely forgetting previously acquired knowledge. Furthermore, CL is particularly challenging for language learning, as natural language is ambiguous: it is discrete, compositional, and its meaning is context-dependent. In this work, we look at the problem of CL through the lens of various NLP tasks. Our survey discusses major challenges in CL and current methods applied in neural network models. We also provide a critical review of the existing CL evaluation methods and datasets in NLP. Finally, we present our outlook on future research directions. △ Less

Submitted 17 December, 2020; originally announced December 2020.

Comments: COLING 2020

Journal ref: Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Barcelona, Spain (Online), pp. 6523--6541

arXiv:2005.10048 [pdf, other]

doi 10.18653/v1/2020.acl-srw.36

Enhancing Word Embeddings with Knowledge Extracted from Lexical Resources

Authors: Magdalena Biesialska, Bardia Rafieian, Marta R. Costa-jussà

Abstract: In this work, we present an effective method for semantic specialization of word vector representations. To this end, we use traditional word embeddings and apply specialization methods to better capture semantic relations between words. In our approach, we leverage external knowledge from rich lexical resources such as BabelNet. We also show that our proposed post-specialization method based on a… ▽ More In this work, we present an effective method for semantic specialization of word vector representations. To this end, we use traditional word embeddings and apply specialization methods to better capture semantic relations between words. In our approach, we leverage external knowledge from rich lexical resources such as BabelNet. We also show that our proposed post-specialization method based on an adversarial neural network with the Wasserstein distance allows to gain improvements over state-of-the-art methods on two tasks: word similarity and dialog state tracking. △ Less

Submitted 20 May, 2020; originally announced May 2020.

Comments: Accepted to ACL 2020 SRW

arXiv:2003.05574 [pdf, other]

doi 10.1007/978-3-030-59491-6_4

Sentiment Analysis with Contextual Embeddings and Self-Attention

Authors: Katarzyna Biesialska, Magdalena Biesialska, Henryk Rybinski

Abstract: In natural language the intended meaning of a word or phrase is often implicit and depends on the context. In this work, we propose a simple yet effective method for sentiment analysis using contextual embeddings and a self-attention mechanism. The experimental results for three languages, including morphologically rich Polish and German, show that our model is comparable to or even outperforms st… ▽ More In natural language the intended meaning of a word or phrase is often implicit and depends on the context. In this work, we propose a simple yet effective method for sentiment analysis using contextual embeddings and a self-attention mechanism. The experimental results for three languages, including morphologically rich Polish and German, show that our model is comparable to or even outperforms state-of-the-art models. In all cases the superiority of models leveraging contextual embeddings is demonstrated. Finally, this work is intended as a step towards introducing a universal, multilingual sentiment classifier. △ Less

Submitted 5 October, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

Comments: Accepted at the 25th International Symposium on Methodologies for Intelligent Systems (ISMIS 2020)

arXiv:2002.09213 [pdf, ps, other]

doi 10.3233/FAIA200317

Refinement of Unsupervised Cross-Lingual Word Embeddings

Authors: Magdalena Biesialska, Marta R. Costa-jussà

Abstract: Cross-lingual word embeddings aim to bridge the gap between high-resource and low-resource languages by allowing to learn multilingual word representations even without using any direct bilingual signal. The lion's share of the methods are projection-based approaches that map pre-trained embeddings into a shared latent space. These methods are mostly based on the orthogonal transformation, which a… ▽ More Cross-lingual word embeddings aim to bridge the gap between high-resource and low-resource languages by allowing to learn multilingual word representations even without using any direct bilingual signal. The lion's share of the methods are projection-based approaches that map pre-trained embeddings into a shared latent space. These methods are mostly based on the orthogonal transformation, which assumes language vector spaces to be isomorphic. However, this criterion does not necessarily hold, especially for morphologically-rich languages. In this paper, we propose a self-supervised method to refine the alignment of unsupervised bilingual word embeddings. The proposed model moves vectors of words and their corresponding translations closer to each other as well as enforces length- and center-invariance, thus allowing to better align cross-lingual embeddings. The experimental results demonstrate the effectiveness of our approach, as in most cases it outperforms state-of-the-art methods in a bilingual lexicon induction task. △ Less

Submitted 21 February, 2020; originally announced February 2020.

Comments: Accepted at the 24th European Conference on Artificial Intelligence (ECAI 2020)

arXiv:1908.01192 [pdf, other]

doi 10.18653/v1/W19-5424

The TALP-UPC System for the WMT Similar Language Task: Statistical vs Neural Machine Translation

Authors: Magdalena Biesialska, Lluis Guardia, Marta R. Costa-jussà

Abstract: Although the problem of similar language translation has been an area of research interest for many years, yet it is still far from being solved. In this paper, we study the performance of two popular approaches: statistical and neural. We conclude that both methods yield similar results; however, the performance varies depending on the language pair. While the statistical approach outperforms the… ▽ More Although the problem of similar language translation has been an area of research interest for many years, yet it is still far from being solved. In this paper, we study the performance of two popular approaches: statistical and neural. We conclude that both methods yield similar results; however, the performance varies depending on the language pair. While the statistical approach outperforms the neural one by a difference of 6 BLEU points for the Spanish-Portuguese language pair, the proposed neural model surpasses the statistical one by a difference of 2 BLEU points for Czech-Polish. In the former case, the language similarity (based on perplexity) is much higher than in the latter case. Additionally, we report negative results for the system combination with back-translation. Our TALP-UPC system submission won 1st place for Czech-to-Polish and 2nd place for Spanish-to-Portuguese in the official evaluation of the 1st WMT Similar Language Translation task. △ Less

Submitted 3 August, 2019; originally announced August 2019.

Comments: WMT 2019 Shared Task paper

Showing 1–5 of 5 results for author: Biesialska, M