Skip to main content

Showing 1–2 of 2 results for author: Tsygankova, T

.
  1. arXiv:2006.09627  [pdf

    cs.CL

    Building Low-Resource NER Models Using Non-Speaker Annotation

    Authors: Tatiana Tsygankova, Francesca Marini, Stephen Mayhew, Dan Roth

    Abstract: In low-resource natural language processing (NLP), the key problems are a lack of target language training data, and a lack of native speakers to create it. Cross-lingual methods have had notable success in addressing these concerns, but in certain common circumstances, such as insufficient pre-training corpora or languages far from the source language, their performance suffers. In this work we p… ▽ More

    Submitted 26 April, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Accepted to DASH-LA 2021, workshop at NAACL 2021

  2. arXiv:1903.11222  [pdf, other

    cs.CL

    ner and pos when nothing is capitalized

    Authors: Stephen Mayhew, Tatiana Tsygankova, Dan Roth

    Abstract: For those languages which use it, capitalization is an important signal for the fundamental NLP tasks of Named Entity Recognition (NER) and Part of Speech (POS) tagging. In fact, it is such a strong signal that model performance on these tasks drops sharply in common lowercased scenarios, such as noisy web text or machine translation outputs. In this work, we perform a systematic analysis of solut… ▽ More

    Submitted 31 August, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    Comments: Accepted to EMNLP2019