
Showing 151–200 of 220 results for author: Schütze, H

  1. arXiv:1906.03608  [pdf, other]

    cs.CL cs.LG

    Probing for Semantic Classes: Diagnosing the Meaning Content of Word Embeddings

    Authors: Yadollah Yaghoobzadeh, Katharina Kann, Timothy J. Hazen, Eneko Agirre, Hinrich Schütze

    Abstract: Word embeddings typically represent different meanings of a word in a single conflated vector. Empirical analysis of embeddings of ambiguous words is currently limited by the small size of manually annotated resources and by the fact that word senses are treated as unrelated individual concepts. We present a large dataset based on manual Wikipedia annotations and word senses, where word senses fro…

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: 14 pages, Accepted at ACL 2019

  2. arXiv:1906.01973  [pdf, other]

    cs.CL

    A Hierarchical Decoder with Three-level Hierarchical Attention to Generate Abstractive Summaries of Interleaved Texts

    Authors: Sanjeev Kumar Karn, Francine Chen, Yan-Ying Chen, Ulli Waltinger, Hinrich Schütze

    Abstract: Interleaved texts, where posts belonging to different threads occur in one sequence, are a common occurrence, e.g., online chat conversations. To quickly obtain an overview of such texts, existing systems first disentangle the posts by threads and then extract summaries from those threads. The major issues with such systems are error propagation and non-fluent summary. To address those, we propose…

    Submitted 9 April, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

  3. arXiv:1906.01393  [pdf, other]

    cs.CL cs.AI

    SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference

    Authors: Martin Schmitt, Hinrich Schütze

    Abstract: We present SherLIiC, a testbed for lexical inference in context (LIiC), consisting of 3985 manually annotated inference rule candidates (InfCands), accompanied by (i) ~960k unlabeled InfCands, and (ii) ~190k typed textual relations between Freebase entities extracted from the large entity-linked corpus ClueWeb09. Each InfCand consists of one of these relations, expressed as a lemmatized dependency…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted as a long paper to ACL 2019

  4. arXiv:1904.09688  [pdf, other]

    cs.CL

    Fine-Grained Argument Unit Recognition and Classification

    Authors: Dietrich Trautmann, Johannes Daxenberger, Christian Stab, Hinrich Schütze, Iryna Gurevych

    Abstract: Prior work has commonly defined argument retrieval from heterogeneous document collections as a sentence-level classification task. Consequently, argument retrieval suffers both from low recall and from sentence segmentation errors making it difficult for humans and machines to consume the arguments. In this work, we argue that the task should be performed on a more fine-grained level of sequence…

    Submitted 21 November, 2019; v1 submitted 21 April, 2019; originally announced April 2019.

    Comments: AAAI 2020

  5. arXiv:1904.09447  [pdf, other]

    cs.CL cs.AI

    An Unsupervised Joint System for Text Generation from Knowledge Graphs and Semantic Parsing

    Authors: Martin Schmitt, Sahand Sharifzadeh, Volker Tresp, Hinrich Schütze

    Abstract: Knowledge graphs (KGs) can vary greatly from one domain to another. Therefore supervised approaches to both graph-to-text generation and text-to-graph knowledge extraction (semantic parsing) will always suffer from a shortage of domain-specific parallel graph-text data; at the same time, adapting a model trained on a different domain is often impossible due to little or no overlap in entities and…

    Submitted 17 November, 2020; v1 submitted 20 April, 2019; originally announced April 2019.

    Comments: Accepted as long paper to EMNLP 2020

  6. arXiv:1904.08654  [pdf, other]

    cs.CL

    Analytical Methods for Interpretable Ultradense Word Embeddings

    Authors: Philipp Dufter, Hinrich Schütze

    Abstract: Word embeddings are useful for a wide variety of tasks, but they lack interpretability. By rotating word spaces, interpretable dimensions can be identified while preserving the information contained in the embeddings without any loss. In this work, we investigate three methods for making word spaces interpretable by rotation: Densifier (Rothe et al., 2016), linear SVMs and DensRay, a new method we…

    Submitted 13 September, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: EMNLP 2019

  7. arXiv:1904.06707  [pdf, ps, other]

    cs.CL cs.LG

    Rare Words: A Major Problem for Contextualized Embeddings And How to Fix it by Attentive Mimicking

    Authors: Timo Schick, Hinrich Schütze

    Abstract: Pretraining deep neural network architectures with a language modeling objective has brought large improvements for many natural language processing tasks. Exemplified by BERT, a recently proposed such architecture, we demonstrate that despite being trained on huge amounts of data, deep language models still struggle to understand rare words. To fix this problem, we adapt Attentive Mimicking, a me…

    Submitted 4 December, 2019; v1 submitted 14 April, 2019; originally announced April 2019.

    Comments: To appear at AAAI 2020

  8. arXiv:1904.01617  [pdf, other]

    cs.CL

    Attentive Mimicking: Better Word Embeddings by Attending to Informative Contexts

    Authors: Timo Schick, Hinrich Schütze

    Abstract: Learning high-quality embeddings for rare words is a hard problem because of sparse context information. Mimicking (Pinter et al., 2017) has been proposed as a solution: given embeddings learned by a standard algorithm, a model is first trained to reproduce embeddings of frequent words from their surface form and then used to compute embeddings for rare words. In this paper, we introduce attentive…

    Submitted 5 April, 2019; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: Accepted at NAACL2019

  9. arXiv:1811.03866  [pdf, ps, other]

    cs.CL cs.AI

    Learning Semantic Representations for Novel Words: Leveraging Both Form and Context

    Authors: Timo Schick, Hinrich Schütze

    Abstract: Word embeddings are a key component of high-performing natural language processing (NLP) systems, but it remains a challenge to learn good representations for novel words on the fly, i.e., for words that did not occur in the training data. The general problem setting is that word embeddings are induced on an unlabeled training corpus and then a model is trained that embeds novel words into this in…

    Submitted 9 November, 2018; originally announced November 2018.

    Comments: AAAI 2019

  10. arXiv:1811.02230  [pdf, other]

    cs.CL

    CIS at TAC Cold Start 2015: Neural Networks and Coreference Resolution for Slot Filling

    Authors: Heike Adel, Hinrich Schütze

    Abstract: This paper describes the CIS slot filling system for the TAC Cold Start evaluations 2015. It extends and improves the system we have built for the evaluation last year. This paper mainly describes the changes to our last year's system. Especially, it focuses on the coreference and classification component. For coreference, we have performed several analysis and prepared a resource to simplify our…

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: TAC KBP 2015

  11. arXiv:1811.00586  [pdf, other]

    cs.CL

    Multilingual Embeddings Jointly Induced from Contexts and Concepts: Simple, Strong and Scalable

    Authors: Philipp Dufter, Mengjie Zhao, Hinrich Schütze

    Abstract: Word embeddings induced from local context are prevalent in NLP. A simple and effective context-based multilingual embedding learner is Levy et al. (2017)'s S-ID (sentence ID) method. Another line of work induces high-performing multilingual embeddings from concepts (Dufter et al., 2018). In this paper, we propose Co+Co, a simple and scalable method that combines context-based and concept-based le…

    Submitted 30 April, 2020; v1 submitted 1 November, 2018; originally announced November 2018.

  12. arXiv:1811.00066  [pdf, other]

    cs.CL

    Aligning Very Small Parallel Corpora Using Cross-Lingual Word Embeddings and a Monogamy Objective

    Authors: Nina Poerner, Masoud Jalili Sabet, Benjamin Roth, Hinrich Schütze

    Abstract: Count-based word alignment methods, such as the IBM models or fast-align, struggle on very small parallel corpora. We therefore present an alternative approach based on cross-lingual word embeddings (CLWEs), which are trained on purely monolingual data. Our main contribution is an unsupervised objective to adapt CLWEs to parallel corpora. In experiments on between 25 and 500 sentences, our method…

    Submitted 31 October, 2018; originally announced November 2018.

  13. arXiv:1810.10499  [pdf, other]

    cs.CL cs.AI

    Multi-Multi-View Learning: Multilingual and Multi-Representation Entity Typing

    Authors: Yadollah Yaghoobzadeh, Hinrich Schütze

    Abstract: Knowledge bases (KBs) are paramount in NLP. We employ multiview learning for increasing accuracy and coverage of entity type information in KBs. We rely on two metaviews: language and representation. For language, we consider high-resource and low-resource languages from Wikipedia. For representation, we consider representations based on the context distribution of the entity (i.e., on its embeddi…

    Submitted 24 October, 2018; originally announced October 2018.

    Comments: 7 pages, Accepted at EMNLP 2018

  14. arXiv:1810.05102  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Neural Relation Extraction Within and Across Sentence Boundaries

    Authors: Pankaj Gupta, Subburam Rajaram, Hinrich Schütze, Bernt Andrassy, Thomas Runkler

    Abstract: Past work in relation extraction mostly focuses on binary relation between entity pairs within single sentence. Recently, the NLP community has gained interest in relation extraction in entity pairs spanning multiple sentences. In this paper, we propose a novel architecture for this task: inter-sentential dependency-based neural networks (iDepNN). iDepNN models the shortest and augmented dependenc…

    Submitted 14 January, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: AAAI2019

  15. arXiv:1810.03947  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    textTOvec: Deep Contextualized Neural Autoregressive Topic Models of Language with Distributed Compositional Prior

    Authors: Pankaj Gupta, Yatin Chaudhary, Florian Buettner, Hinrich Schütze

    Abstract: We address two challenges of probabilistic topic modelling in order to better estimate the probability of a word in a given context, i.e., P(word|context): (1) No Language Structure in Context: Probabilistic topic models ignore word order by summarizing a given context as a "bag-of-word" and consequently the semantics of words in the context is lost. The LSTM-LM learns a vector-space representatio…

    Submitted 23 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: Published in #ICLR2019 International Conference on Learning Representations

  16. arXiv:1809.08733  [pdf, other]

    cs.CL

    Neural Transductive Learning and Beyond: Morphological Generation in the Minimal-Resource Setting

    Authors: Katharina Kann, Hinrich Schütze

    Abstract: Neural state-of-the-art sequence-to-sequence (seq2seq) models often do not perform well for small training sets. We address paradigm completion, the morphological task of, given a partial paradigm, generating all missing forms. We propose two new methods for the minimal-resource setting: (i) Paradigm transduction: Since we assume only few paradigms available for training, neural seq2seq models are…

    Submitted 9 May, 2019; v1 submitted 23 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018

  17. arXiv:1809.07291  [pdf, other]

    cs.CL

    Interpretable Textual Neuron Representations for NLP

    Authors: Nina Poerner, Benjamin Roth, Hinrich Schütze

    Abstract: Input optimization methods, such as Google Deep Dream, create interpretable representations of neurons for computer vision DNNs. We propose and evaluate ways of transferring this technology to NLP. Our results suggest that gradient ascent with a gumbel softmax layer produces n-gram representations that outperform naive corpus search in terms of target neuron activation. The representations highlig…

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: BlackboxNLP Workshop at EMNLP 2018 (Extended Abstract)

  18. arXiv:1809.06709  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Document Informed Neural Autoregressive Topic Models with Distributional Prior

    Authors: Pankaj Gupta, Yatin Chaudhary, Florian Buettner, Hinrich Schütze

    Abstract: We address two challenges in topic models: (1) Context information around words helps in determining their actual meaning, e.g., "networks" used in the contexts "artificial neural networks" vs. "biological neuron networks". Generative topic models infer topic-word distributions, taking no or only little context into account. Here, we extend a neural autoregressive topic model to exploit the full c…

    Submitted 14 January, 2019; v1 submitted 15 September, 2018; originally announced September 2018.

    Comments: AAAI2019. arXiv admin note: substantial text overlap with arXiv:1808.03793

  19. arXiv:1808.04208  [pdf, other]

    cs.CL

    Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech Tagging

    Authors: Apostolos Kemos, Heike Adel, Hinrich Schütze

    Abstract: Character-level models of tokens have been shown to be effective at dealing with within-token noise and out-of-vocabulary words. But these models still rely on correct token boundaries. In this paper, we propose a novel end-to-end character-level model and demonstrate its effectiveness in multilingual settings and when token boundaries are noisy. Our model is a semi-Markov conditional random field…

    Submitted 2 January, 2020; v1 submitted 13 August, 2018; originally announced August 2018.

    Comments: NAACL 2019

  20. arXiv:1808.03793  [pdf, other]

    cs.IR cs.CL cs.LG

    Document Informed Neural Autoregressive Topic Models

    Authors: Pankaj Gupta, Florian Buettner, Hinrich Schütze

    Abstract: Context information around words helps in determining their actual meaning, for example "networks" used in contexts of artificial neural networks or biological neuron networks. Generative topic models infer topic-word distributions, taking no or only little context into account. Here, we extend a neural autoregressive topic model to exploit the full context information around words in a document i…

    Submitted 11 August, 2018; originally announced August 2018.

  21. arXiv:1808.01591  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    LISA: Explaining Recurrent Neural Network Judgments via Layer-wIse Semantic Accumulation and Example to Pattern Transformation

    Authors: Pankaj Gupta, Hinrich Schütze

    Abstract: Recurrent neural networks (RNNs) are temporal networks and cumulative in nature that have shown promising results in various natural language processing tasks. Despite their success, it still remains a challenge to understand their hidden behavior. In this work, we analyze and interpret the cumulative nature of RNN via a proposed technique named as Layer-wIse-Semantic-Accumulation (LISA) for expla…

    Submitted 5 August, 2018; originally announced August 2018.

    Comments: 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP2018) workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP)

  22. arXiv:1807.11535  [pdf, other]

    cs.CL

    News Article Teaser Tweets and How to Generate Them

    Authors: Sanjeev Kumar Karn, Mark Buckley, Ulli Waltinger, Hinrich Schütze

    Abstract: In this work, we define the task of teaser generation and provide an evaluation benchmark and baseline systems for the process of generating teasers. A teaser is a short reading suggestion for an article that is illustrative and includes curiosity-arousing elements to entice potential readers to read particular news items. Teasers are one of the main vehicles for transmitting news to social media…

    Submitted 18 April, 2019; v1 submitted 30 July, 2018; originally announced July 2018.

    Journal ref: 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)

  23. arXiv:1807.07186  [pdf, other]

    cs.CL cs.AI

    Evaluating Word Embeddings in Multi-label Classification Using Fine-grained Name Typing

    Authors: Yadollah Yaghoobzadeh, Katharina Kann, Hinrich Schütze

    Abstract: Embedding models typically associate each word with a single real-valued vector, representing its different properties. Evaluation methods, therefore, need to analyze the accuracy and completeness of these properties in embeddings. This requires fine-grained analysis of embedding subspaces. Multi-label classification is an appropriate way to do so. We propose a new evaluation method for word embed…

    Submitted 18 July, 2018; originally announced July 2018.

    Comments: 6 pages, The 3rd Workshop on Representation Learning for NLP (RepL4NLP @ ACL2018)

  24. arXiv:1807.05371  [pdf, other]

    eess.SP

    Adaptive Hierarchical Sensing for the Efficient Sampling of Sparse and Compressible Signals

    Authors: Henry Schütze, Erhardt Barth, Thomas Martinetz

    Abstract: We present the novel adaptive hierarchical sensing algorithm K-AHS, which samples sparse or compressible signals with a measurement complexity equal to that of Compressed Sensing (CS). In contrast to CS, K-AHS is adaptive as sensing vectors are selected while sampling, depending on previous measurements. Prior to sampling, the user chooses a transform domain in which the signal of interest is spar…

    Submitted 14 July, 2018; originally announced July 2018.

  25. arXiv:1807.02854  [pdf, other]

    cs.IR cs.CL cs.LG

    Replicated Siamese LSTM in Ticketing System for Similarity Learning and Retrieval in Asymmetric Texts

    Authors: Pankaj Gupta, Bernt Andrassy, Hinrich Schütze

    Abstract: The goal of our industrial ticketing system is to retrieve a relevant solution for an input query, by matching with historical tickets stored in knowledge base. A query is comprised of subject and description, while a historical ticket consists of subject, description and solution. To retrieve a relevant solution, we use textual similarity paradigm to learn similarity in the query and historical t…

    Submitted 8 July, 2018; originally announced July 2018.

    Comments: In the 27th International Conference on Computational Linguistics (COLING 2018) workshop on Semantic Deep Learning (SemDeep-3)

  26. arXiv:1806.04523  [pdf, other]

    cs.CL

    Recurrent One-Hop Predictions for Reasoning over Knowledge Graphs

    Authors: Wenpeng Yin, Yadollah Yaghoobzadeh, Hinrich Schütze

    Abstract: Large scale knowledge graphs (KGs) such as Freebase are generally incomplete. Reasoning over multi-hop (mh) KG paths is thus an important capability that is needed for question answering or other NLP tasks that require knowledge about the world. mh-KG reasoning includes diverse scenarios, e.g., given a head entity and a relation path, predict the tail entity; or given two entities connected by som…

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: COLING'2018 camera-ready

  27. arXiv:1805.00254  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG cs.NE

    Joint Bootstrapping Machines for High Confidence Relation Extraction

    Authors: Pankaj Gupta, Benjamin Roth, Hinrich Schütze

    Abstract: Semi-supervised bootstrapping techniques for relationship extraction from text iteratively expand a set of initial seed instances. Due to the lack of labeled data, a key challenge in bootstrapping is semantic drift: if a false positive instance is added during an iteration, then all following iterations are contaminated. We introduce BREX, a new bootstrapping method that protects against such cont…

    Submitted 1 May, 2018; originally announced May 2018.

    Comments: In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018)

  28. arXiv:1804.08813  [pdf, other]

    cs.CL

    End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions

    Authors: Wenpeng Yin, Hinrich Schütze, Dan Roth

    Abstract: This work deals with SciTail, a natural entailment challenge derived from a multi-choice question answering problem. The premises and hypotheses in SciTail were generated with no awareness of each other, and did not specifically aim at the entailment task. This makes it more challenging than other entailment data sets and more directly useful to the end-task -- question answering. We propose DEIST…

    Submitted 14 May, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: ACL'2018 camera-ready; 6 pages, 3 figures

  29. arXiv:1804.06024  [pdf, ps, other]

    cs.CL

    Fortification of Neural Morphological Segmentation Models for Polysynthetic Minimal-Resource Languages

    Authors: Katharina Kann, Manuel Mager, Ivan Meza-Ruiz, Hinrich Schütze

    Abstract: Morphological segmentation for polysynthetic languages is challenging, because a word may consist of many individual morphemes and training data can be extremely scarce. Since neural sequence-to-sequence (seq2seq) models define the state of the art for morphological segmentation in high-resource settings and for (mostly) European languages, we first show that they also obtain competitive performan…

    Submitted 16 April, 2018; originally announced April 2018.

    Comments: Long Paper, 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

  30. Neural Architectures for Open-Type Relation Argument Extraction

    Authors: Benjamin Roth, Costanza Conforti, Nina Poerner, Sanjeev Karn, Hinrich Schütze

    Abstract: In this work, we introduce the task of Open-Type Relation Argument Extraction (ORAE): Given a corpus, a query entity Q and a knowledge base relation (e.g.,"Q authored notable work with title X"), the model has to extract an argument of non-standard entity type (entities that cannot be extracted by a standard named entity tagger, e.g. X: the title of a book or a work of art) from the corpus. A dist…

    Submitted 30 September, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Journal ref: Nat. Lang. Eng. 25 (2019) 219-238

  31. arXiv:1801.06807  [pdf, other]

    cs.CL

    Embedding Learning Through Multilingual Concept Induction

    Authors: Philipp Dufter, Mengjie Zhao, Martin Schmitt, Alexander Fraser, Hinrich Schütze

    Abstract: We present a new method for estimating vector space representations of words: embedding learning by concept induction. We test this method on a highly parallel corpus and learn semantic representations of words in 1259 different languages in a single common space. An extensive experimental evaluation on crosslingual word similarity and sentiment analysis indicates that concept-based multilingual e…

    Submitted 27 June, 2018; v1 submitted 21 January, 2018; originally announced January 2018.

    Comments: ACL 2018

  32. arXiv:1801.06422  [pdf, other]

    cs.CL

    Evaluating neural network explanation methods using hybrid documents and morphological agreement

    Authors: Nina Poerner, Benjamin Roth, Hinrich Schütze

    Abstract: The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explore post hoc explanation methods. We conduct the first comprehensive evaluation of explanation methods for NLP. To this end, we design two novel evaluation paradigms that cover two important classes of NLP problems: small context and large context problems. Both paradigms require no manual annotation…

    Submitted 6 May, 2019; v1 submitted 19 January, 2018; originally announced January 2018.

  33. arXiv:1711.05626  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Deep Temporal-Recurrent-Replicated-Softmax for Topical Trends over Time

    Authors: Pankaj Gupta, Subburam Rajaram, Hinrich Schütze, Bernt Andrassy

    Abstract: Dynamic topic modeling facilitates the identification of topical trends over time in temporal collections of unstructured documents. We introduce a novel unsupervised neural dynamic topic model named as Recurrent Neural Network-Replicated Softmax Model (RNNRSM), where the discovered topics at each time influence the topic discovery in the subsequent time steps. We account for the temporal ordering…

    Submitted 1 May, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018)

  34. arXiv:1710.09753  [pdf, other]

    cs.CL

    Impact of Coreference Resolution on Slot Filling

    Authors: Heike Adel, Hinrich Schütze

    Abstract: In this paper, we demonstrate the importance of coreference resolution for natural language processing on the example of the TAC Slot Filling shared task. We illustrate the strengths and weaknesses of automatic coreference resolution systems and provide experimental results to show that they improve performance in the slot filling end-to-end setting. Finally, we publish KBPchains, a resource conta…

    Submitted 26 October, 2017; originally announced October 2017.

    Comments: 5 pages

  35. arXiv:1710.00519  [pdf, other]

    cs.CL

    Attentive Convolution: Equipping CNNs with RNN-style Attention Mechanisms

    Authors: Wenpeng Yin, Hinrich Schütze

    Abstract: In NLP, convolutional neural networks (CNNs) have benefited less than recurrent neural networks (RNNs) from attention mechanisms. We hypothesize that this is because the attention in CNNs has been mainly implemented as attentive pooling (i.e., it is applied to pooling) rather than as attentive convolution (i.e., it is integrated into convolution). Convolution is the differentiator of CNNs in that…

    Submitted 13 November, 2018; v1 submitted 2 October, 2017; originally announced October 2017.

    Comments: Camera-ready for TACL. 16 pages

  36. arXiv:1708.02275  [pdf, other]

    cs.CL

    Corpus-level Fine-grained Entity Typing

    Authors: Yadollah Yaghoobzadeh, Heike Adel, Hinrich Schütze

    Abstract: This paper addresses the problem of corpus-level entity typing, i.e., inferring from a large corpus that an entity is a member of a class such as "food" or "artist". The application of entity typing we are interested in is knowledge base completion, specifically, to learn which classes an entity is a member of. We propose FIGMENT to tackle this problem. FIGMENT is embedding-based and combines (i)…

    Submitted 6 June, 2018; v1 submitted 7 August, 2017; originally announced August 2017.

    Comments: 24 pages. arXiv admin note: text overlap with arXiv:1701.02025, arXiv:1606.07901

    Journal ref: JAIR, Vol 61 (2018)

  37. arXiv:1707.07719  [pdf, other]

    cs.CL

    Global Normalization of Convolutional Neural Networks for Joint Entity and Relation Classification

    Authors: Heike Adel, Hinrich Schütze

    Abstract: We introduce globally normalized convolutional neural networks for joint entity classification and relation extraction. In particular, we propose a way to utilize a linear-chain conditional random field output layer for predicting entity types and relations between entities at the same time. Our experiments show that global normalization outperforms a locally normalized softmax layer on a benchmar…

    Submitted 7 August, 2018; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: EMNLP 2017

  38. arXiv:1705.06106  [pdf, other]

    cs.CL

    Unlabeled Data for Morphological Generation With Character-Based Sequence-to-Sequence Models

    Authors: Katharina Kann, Hinrich Schütze

    Abstract: We present a semi-supervised way of training a character-based encoder-decoder recurrent neural network for morphological reinflection, the task of generating one inflected word form from another. This is achieved by using unlabeled tokens or random strings as training data for an autoencoding task, adapting a network for morphological reinflection, and performing multi-task training. We thus use…

    Submitted 21 July, 2017; v1 submitted 17 May, 2017; originally announced May 2017.

    Comments: Accepted at SCLeM 2017

  39. arXiv:1704.08914  [pdf, other]

    cs.CL cs.AI cs.LG

    Past, Present, Future: A Computational Investigation of the Typology of Tense in 1000 Languages

    Authors: Ehsaneddin Asgari, Hinrich Schütze

    Abstract: We present SuperPivot, an analysis method for low-resource languages that occur in a superparallel corpus, i.e., in a corpus that contains an order of magnitude more languages than parallel corpora currently in use. We show that SuperPivot performs well for the crosslingual analysis of the linguistic phenomenon of tense. We produce analysis results for more than 1000 languages, conducting - to the…

    Submitted 14 September, 2017; v1 submitted 28 April, 2017; originally announced April 2017.

    Journal ref: Extended version of EMNLP 2017

  40. arXiv:1704.01610  [pdf, ps, other]

    cs.IR

    A Subjective Logic Formalisation of the Principle of Polyrepresentation for Information Needs

    Authors: Christina Lioma, Birger Larsen, Hinrich Schütze, Peter Ingwersen

    Abstract: Interactive Information Retrieval refers to the branch of Information Retrieval that considers the retrieval process with respect to a wide range of contexts, which may affect the user's information seeking experience. The identification and representation of such contexts has been the object of the principle of Polyrepresentation, a theoretical framework for reasoning about different representati…

    Submitted 5 April, 2017; originally announced April 2017.

  41. arXiv:1704.00052  [pdf, other]

    cs.CL

    One-Shot Neural Cross-Lingual Transfer for Paradigm Completion

    Authors: Katharina Kann, Ryan Cotterell, Hinrich Schütze

    Abstract: We present a novel cross-lingual transfer method for paradigm completion, the task of mapping a lemma to its inflected forms, using a neural encoder-decoder model, the state of the art for the monolingual task. We use labeled data from a high-resource language to increase performance on a low-resource language. In experiments on 21 language pairs from four different language families, we obtain up…

    Submitted 31 March, 2017; originally announced April 2017.

    Comments: Accepted at ACL 2017

  42. arXiv:1702.01923  [pdf, other]

    cs.CL

    Comparative Study of CNN and RNN for Natural Language Processing

    Authors: Wenpeng Yin, Katharina Kann, Mo Yu, Hinrich Schütze

    Abstract: Deep neural networks (DNN) have revolutionized the field of natural language processing (NLP). Convolutional neural network (CNN) and recurrent neural network (RNN), the two main types of DNN architectures, are widely explored to handle various NLP tasks. CNN is supposed to be good at extracting position-invariant features and RNN at modeling units in sequence. The state of the art on many NLP tas…

    Submitted 7 February, 2017; originally announced February 2017.

    Comments: 7 pages, 11 figures

  43. arXiv:1701.02149  [pdf, other]

    cs.CL

    Task-Specific Attentive Pooling of Phrase Alignments Contributes to Sentence Matching

    Authors: Wenpeng Yin, Hinrich Schütze

    Abstract: This work studies comparatively two typical sentence matching tasks: textual entailment (TE) and answer selection (AS), observing that weaker phrase alignments are more critical in TE, while stronger phrase alignments deserve more attention in AS. The key to reach this observation lies in phrase detection, phrase representation, phrase alignment, and more importantly how to connect those aligned p…

    Submitted 9 January, 2017; originally announced January 2017.

    Comments: EACL'2017 long paper. arXiv admin note: substantial text overlap with arXiv:1604.06896

  44. arXiv:1701.02025  [pdf, other]

    cs.CL cs.AI

    Multi-level Representations for Fine-Grained Typing of Knowledge Base Entities

    Authors: Yadollah Yaghoobzadeh, Hinrich Schütze

    Abstract: Entities are essential elements of natural language. In this paper, we present methods for learning multi-level representations of entities on three complementary levels: character (character patterns in entity names extracted, e.g., by neural networks), word (embeddings of words in entity names) and entity (entity embeddings). We investigate state-of-the-art learning methods on each level and fin…

    Submitted 16 January, 2017; v1 submitted 8 January, 2017; originally announced January 2017.

    Comments: 13 pages, in EACL 2017

  45. arXiv:1701.00946  [pdf, other]

    cs.CL

    Joint Semantic Synthesis and Morphological Analysis of the Derived Word

    Authors: Ryan Cotterell, Hinrich Schütze

    Abstract: Much like sentences are composed of words, words themselves are composed of smaller units. For example, the English word questionably can be analyzed as question+able+ly. However, this structural decomposition of the word does not directly give us a semantic representation of the word's meaning. Since morphology obeys the principle of compositionality, the semantics of the word can be systematical…

    Submitted 10 November, 2018; v1 submitted 4 January, 2017; originally announced January 2017.

    Comments: TACL 2017 (presented at ACL 2017)

  46. arXiv:1612.07495  [pdf, other]

    cs.CL

    Noise Mitigation for Neural Entity Typing and Relation Extraction

    Authors: Yadollah Yaghoobzadeh, Heike Adel, Hinrich Schütze

    Abstract: In this paper, we address two different types of noise in information extraction models: noise from distant supervision and noise from pipeline input features. Our target tasks are entity typing and relation extraction. For the first noise type, we introduce multi-instance multi-label learning algorithms using neural network models, and apply them to fine-grained entity typing for the first time.…

    Submitted 10 January, 2017; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: EACL 2017; the first two authors contributed equally to this work

  47. arXiv:1612.06549  [pdf, other]

    cs.CL

    Exploring Different Dimensions of Attention for Uncertainty Detection

    Authors: Heike Adel, Hinrich Schütze

    Abstract: Neural networks with attention have proven effective for many natural language processing tasks. In this paper, we develop attention mechanisms for uncertainty detection. In particular, we generalize standardly used attention mechanisms by introducing external attention and sequence-preserving attention. These novel architectures differ from standard approaches in that they use external resources…

    Submitted 10 January, 2017; v1 submitted 20 December, 2016; originally announced December 2016.

    Comments: accepted at EACL 2017

  48. arXiv:1612.06027  [pdf, other]

    cs.CL

    Neural Multi-Source Morphological Reinflection

    Authors: Katharina Kann, Ryan Cotterell, Hinrich Schütze

    Abstract: We explore the task of multi-source morphological reinflection, which generalizes the standard, single-source version. The input consists of (i) a target tag and (ii) multiple pairs of source form and source tag for a lemma. The motivation is that it is beneficial to have access to more than one source form since different source forms can provide complementary information, e.g., different stems.…

    Submitted 22 January, 2017; v1 submitted 18 December, 2016; originally announced December 2016.

    Comments: Accepted at EACL 2017. Camera Ready Version

  49. arXiv:1610.00479  [pdf, ps, other]

    cs.CL

    Nonsymbolic Text Representation

    Authors: Hinrich Schuetze, Heike Adel, Ehsaneddin Asgari

    Abstract: We introduce the first generic text representation model that is completely nonsymbolic, i.e., it does not require the availability of a segmentation or tokenization method that attempts to identify words or other symbolic units in text. This applies to training the parameters of the model on a training corpus as well as to applying it when computing the representation of a new text. We show that…

    Submitted 1 May, 2017; v1 submitted 3 October, 2016; originally announced October 2016.

  50. arXiv:1606.07902  [pdf, ps, other]

    cs.CL

    Intrinsic Subspace Evaluation of Word Embedding Representations

    Authors: Yadollah Yaghoobzadeh, Hinrich Schütze

    Abstract: We introduce a new methodology for intrinsic evaluation of word representations. Specifically, we identify four fundamental criteria based on the characteristics of natural language that pose difficulties to NLP systems; and develop tests that directly show whether or not representations contain the subspaces necessary to satisfy these criteria. Current intrinsic evaluations are mostly based on th…

    Submitted 25 June, 2016; originally announced June 2016.

    Comments: Long paper accepted at ACL 2016

    MSC Class: 68T50