Showing 1–9 of 9 results for author: Poerner, N

  1. arXiv:2107.00927 [pdf, other]

    cs.CL cs.LG

    Data Centric Domain Adaptation for Historical Text with OCR Errors

    Authors: Luisa März, Stefan Schweter, Nina Poerner, Benjamin Roth, Hinrich Schütze

    Abstract: We propose new methods for in-domain and cross-domain Named Entity Recognition (NER) on historical data for Dutch and French. For the cross-domain case, we address domain shift by integrating unsupervised in-domain data via contextualized string embeddings; and OCR errors by injecting synthetic OCR errors into the source domain and address data centric domain adaptation. We propose a general appro…

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 14 pages, 2 figures, 6 tables

  2. arXiv:2004.03354 [pdf, other]

    cs.CL

    Inexpensive Domain Adaptation of Pretrained Language Models: Case Studies on Biomedical NER and Covid-19 QA

    Authors: Nina Poerner, Ulli Waltinger, Hinrich Schütze

    Abstract: Domain adaptation of Pretrained Language Models (PTLMs) is typically achieved by unsupervised pretraining on target-domain text. While successful, this approach is expensive in terms of hardware, runtime and CO_2 emissions. Here, we propose a cheaper alternative: We train Word2Vec on target-domain text and align the resulting word vectors with the wordpiece vectors of a general-domain PTLM. We eva…

    Submitted 27 June, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

  3. arXiv:1911.03700 [pdf, other]

    cs.CL

    Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity

    Authors: Nina Poerner, Ulli Waltinger, Hinrich Schütze

    Abstract: We address the task of unsupervised Semantic Textual Similarity (STS) by ensembling diverse pre-trained sentence encoders into sentence meta-embeddings. We apply, extend and evaluate different meta-embedding methods from the word embedding literature at the sentence level, including dimensionality reduction (Yin and Schütze, 2016), generalized Canonical Correlation Analysis (Rastogi et al., 2015)…

    Submitted 24 June, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

  4. arXiv:1911.03681 [pdf, other]

    cs.CL

    E-BERT: Efficient-Yet-Effective Entity Embeddings for BERT

    Authors: Nina Poerner, Ulli Waltinger, Hinrich Schütze

    Abstract: We present a novel way of injecting factual knowledge about entities into the pretrained BERT model (Devlin et al., 2019): We align Wikipedia2Vec entity vectors (Yamada et al., 2016) with BERT's native wordpiece vector space and use the aligned entity vectors as if they were wordpiece vectors. The resulting entity-enhanced version of BERT (called E-BERT) is similar in spirit to ERNIE (Zhang et al.…

    Submitted 1 May, 2020; v1 submitted 9 November, 2019; originally announced November 2019.

  5. arXiv:1906.10924 [pdf, other]

    cs.CL cs.AI cs.LG

    Interpretable Question Answering on Knowledge Bases and Text

    Authors: Alona Sydorova, Nina Poerner, Benjamin Roth

    Abstract: Interpretability of machine learning (ML) models becomes more relevant with their increasing adoption. In this work, we address the interpretability of ML based question answering (QA) models on a combination of knowledge bases (KB) and text documents. We adapt post hoc explanation methods such as LIME and input perturbation (IP) and compare them with the self-explanatory attention mechanism of th…

    Submitted 26 June, 2019; originally announced June 2019.

  6. arXiv:1811.00066 [pdf, other]

    cs.CL

    Aligning Very Small Parallel Corpora Using Cross-Lingual Word Embeddings and a Monogamy Objective

    Authors: Nina Poerner, Masoud Jalili Sabet, Benjamin Roth, Hinrich Schütze

    Abstract: Count-based word alignment methods, such as the IBM models or fast-align, struggle on very small parallel corpora. We therefore present an alternative approach based on cross-lingual word embeddings (CLWEs), which are trained on purely monolingual data. Our main contribution is an unsupervised objective to adapt CLWEs to parallel corpora. In experiments on between 25 and 500 sentences, our method…

    Submitted 31 October, 2018; originally announced November 2018.

  7. arXiv:1809.07291 [pdf, other]

    cs.CL

    Interpretable Textual Neuron Representations for NLP

    Authors: Nina Poerner, Benjamin Roth, Hinrich Schütze

    Abstract: Input optimization methods, such as Google Deep Dream, create interpretable representations of neurons for computer vision DNNs. We propose and evaluate ways of transferring this technology to NLP. Our results suggest that gradient ascent with a Gumbel softmax layer produces n-gram representations that outperform naive corpus search in terms of target neuron activation. The representations highlig…

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: BlackboxNLP Workshop at EMNLP 2018 (Extended Abstract)

  8. Neural Architectures for Open-Type Relation Argument Extraction

    Authors: Benjamin Roth, Costanza Conforti, Nina Poerner, Sanjeev Karn, Hinrich Schütze

    Abstract: In this work, we introduce the task of Open-Type Relation Argument Extraction (ORAE): Given a corpus, a query entity Q and a knowledge base relation (e.g., "Q authored notable work with title X"), the model has to extract an argument of non-standard entity type (entities that cannot be extracted by a standard named entity tagger, e.g. X: the title of a book or a work of art) from the corpus. A dist…

    Submitted 30 September, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Journal ref: Nat. Lang. Eng. 25 (2019) 219-238

  9. arXiv:1801.06422 [pdf, other]

    cs.CL

    Evaluating neural network explanation methods using hybrid documents and morphological agreement

    Authors: Nina Poerner, Benjamin Roth, Hinrich Schütze

    Abstract: The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explore post hoc explanation methods. We conduct the first comprehensive evaluation of explanation methods for NLP. To this end, we design two novel evaluation paradigms that cover two important classes of NLP problems: small context and large context problems. Both paradigms require no manual annotation…

    Submitted 6 May, 2019; v1 submitted 19 January, 2018; originally announced January 2018.