Showing 1–2 of 2 results for author: Chinea-Rios, M

Search v0.5.6 released 2020-02-24

arXiv:2204.10543 [pdf, other]

cs.CL

Zero and Few-shot Learning for Author Profiling

Authors: Mara Chinea-Rios, Thomas Müller, Gretel Liz De la Peña Sarracén, Francisco Rangel, Marc Franco-Salvador

Abstract: Author profiling classifies author characteristics by analyzing how language is shared among people. In this work, we study that task from a low-resource viewpoint: using little or no training data. We explore different zero and few-shot models based on entailment and evaluate our systems on several profiling tasks in Spanish and English. In addition, we study the effect of both the entailment hyp… ▽ More Author profiling classifies author characteristics by analyzing how language is shared among people. In this work, we study that task from a low-resource viewpoint: using little or no training data. We explore different zero and few-shot models based on entailment and evaluate our systems on several profiling tasks in Spanish and English. In addition, we study the effect of both the entailment hypothesis and the size of the few-shot training sample. We find that entailment-based models out-perform supervised text classifiers based on roberta-XLM and that we can reach 80% of the accuracy of previous approaches using less than 50\% of the training data on average. △ Less

Submitted 17 May, 2022; v1 submitted 22 April, 2022; originally announced April 2022.
arXiv:1612.05555 [pdf, other]

cs.CL

Neural Networks Classifier for Data Selection in Statistical Machine Translation

Authors: Álvaro Peris, Mara Chinea-Rios, Francisco Casacuberta

Abstract: We address the data selection problem in statistical machine translation (SMT) as a classification task. The new data selection method is based on a neural network classifier. We present a new method description and empirical results proving that our data selection method provides better translation quality, compared to a state-of-the-art method (i.e., Cross entropy). Moreover, the empirical resul… ▽ More We address the data selection problem in statistical machine translation (SMT) as a classification task. The new data selection method is based on a neural network classifier. We present a new method description and empirical results proving that our data selection method provides better translation quality, compared to a state-of-the-art method (i.e., Cross entropy). Moreover, the empirical results reported are coherent across different language pairs. △ Less

Submitted 21 December, 2016; v1 submitted 16 December, 2016; originally announced December 2016.

Comments: Submitted to EACL'17

Search v0.5.6 released 2020-02-24