Showing 1–2 of 2 results for author: Chinea-Rios, M
-
Zero and Few-shot Learning for Author Profiling
Authors:
Mara Chinea-Rios,
Thomas Müller,
Gretel Liz De la Peña Sarracén,
Francisco Rangel,
Marc Franco-Salvador
Abstract:
Author profiling classifies author characteristics by analyzing how language is shared among people. In this work, we study that task from a low-resource viewpoint: using little or no training data. We explore different zero- and few-shot models based on entailment and evaluate our systems on several profiling tasks in Spanish and English. In addition, we study the effect of both the entailment hypothesis and the size of the few-shot training sample. We find that entailment-based models outperform supervised text classifiers based on XLM-RoBERTa and that we can reach 80% of the accuracy of previous approaches using less than 50% of the training data on average.
Submitted 17 May, 2022; v1 submitted 22 April, 2022;
originally announced April 2022.
-
Neural Networks Classifier for Data Selection in Statistical Machine Translation
Authors:
Álvaro Peris,
Mara Chinea-Rios,
Francisco Casacuberta
Abstract:
We address the data selection problem in statistical machine translation (SMT) as a classification task. The new data selection method is based on a neural network classifier. We describe the method and present empirical results showing that it provides better translation quality than a state-of-the-art method (i.e., cross-entropy). Moreover, the empirical results reported are consistent across different language pairs.
Submitted 21 December, 2016; v1 submitted 16 December, 2016;
originally announced December 2016.