-
arXiv:1904.04055 [pdf, ps, other]
Evaluating KGR10 Polish word embeddings in the recognition of temporal expressions using BiLSTM-CRF
Abstract: The article introduces a new set of Polish word embeddings, built using KGR10 corpus, which contains more than 4 billion words. These embeddings are evaluated in the problem of recognition of temporal expressions (timexes) for the Polish language. We described the process of KGR10 corpus creation and a new approach to the recognition problem using Bidirectional Long-Short Term Memory (BiLSTM) netw… ▽ More
Submitted 3 April, 2019; originally announced April 2019.
Comments: Presented at TFML 2019 (Theoretical Foundations of Machine Learning)