Skip to main content

Showing 1–43 of 43 results for author: Adel, H

.
  1. arXiv:2406.18708  [pdf, other

    cs.LG cs.CL

    Learn it or Leave it: Module Composition and Pruning for Continual Learning

    Authors: Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze

    Abstract: In real-world environments, continual learning is essential for machine learning models, as they need to acquire new knowledge incrementally without forgetting what they have already learned. While pretrained language models have shown impressive capabilities on various static tasks, applying them to continual learning poses significant challenges, including avoiding catastrophic forgetting, facil… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2404.18585  [pdf, other

    cs.CL

    FREB-TQA: A Fine-Grained Robustness Evaluation Benchmark for Table Question Answering

    Authors: Wei Zhou, Mohsen Mesgar, Heike Adel, Annemarie Friedrich

    Abstract: Table Question Answering (TQA) aims at composing an answer to a question based on tabular data. While prior research has shown that TQA models lack robustness, understanding the underlying cause and nature of this issue remains predominantly unclear, posing a significant obstacle to the development of robust TQA systems. In this paper, we formalize three major desiderata for a fine-grained evaluat… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL 2024

  3. arXiv:2404.00790  [pdf, other

    cs.LG cs.CL

    Rehearsal-Free Modular and Compositional Continual Learning for Language Models

    Authors: Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze

    Abstract: Continual learning aims at incrementally acquiring new knowledge while not forgetting existing knowledge. To overcome catastrophic forgetting, methods are either rehearsal-based, i.e., store data examples from previous tasks for data replay, or isolate parameters dedicated to each task. However, rehearsal-based methods raise privacy and memory issues, and parameter-isolation continual learning doe… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  4. arXiv:2403.05338  [pdf, other

    cs.CL

    Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings

    Authors: Wei Zhou, Heike Adel, Hendrik Schuff, Ngoc Thang Vu

    Abstract: Attribution scores indicate the importance of different input parts and can, thus, explain model behaviour. Currently, prompt-based models are gaining popularity, i.a., due to their easier adaptability in low-resource settings. However, the quality of attribution scores extracted from prompt-based models has not been investigated yet. In this work, we address this topic by analyzing attribution sc… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  5. arXiv:2310.15269  [pdf, other

    cs.LG cs.CL

    GradSim: Gradient-Based Language Grou** for Effective Multilingual Training

    Authors: Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze

    Abstract: Most languages of the world pose low-resource challenges to natural language processing models. With multilingual training, knowledge can be shared among languages. However, not all languages positively influence each other and it is an open research question how to select the most suitable set of languages for multilingual training and avoid negative interference among languages whose characteris… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  6. arXiv:2305.02679  [pdf, other

    cs.CL cs.HC

    Neighboring Words Affect Human Interpretation of Saliency Explanations

    Authors: Alon Jacovi, Hendrik Schuff, Heike Adel, Ngoc Thang Vu, Yoav Goldberg

    Abstract: Word-level saliency explanations ("heat maps over words") are often used to communicate feature-attribution in text-based models. Recent studies found that superficial factors such as word length can distort human interpretation of the communicated saliency scores. We conduct a user study to investigate how the marking of a word's neighboring words affect the explainee's perception of the word's i… ▽ More

    Submitted 6 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL 2023

  7. NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis

    Authors: Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze

    Abstract: This paper describes our system developed for the SemEval-2023 Task 12 "Sentiment Analysis for Low-resource African Languages using Twitter Dataset". Sentiment analysis is one of the most widely studied applications in natural language processing. However, most prior work still focuses on a small number of high-resource languages. Building reliable sentiment analysis systems for low-resource langu… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

  8. arXiv:2302.06868  [pdf, other

    cs.CL cs.AI

    SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains

    Authors: Koustava Goswami, Lukas Lange, Jun Araki, Heike Adel

    Abstract: Prompting pre-trained language models leads to promising results across natural language processing tasks but is less effective when applied in low-resource domains, due to the domain gap between the pre-training data and the downstream task. In this work, we bridge this gap with a novel and lightweight prompting methodology called SwitchPrompt for the adaptation of language models trained on data… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: Accepted at EACL 2023 Main Conference

  9. arXiv:2210.07126  [pdf, other

    cs.CL cs.AI cs.HC

    Challenges in Explanation Quality Evaluation

    Authors: Hendrik Schuff, Heike Adel, Peng Qi, Ngoc Thang Vu

    Abstract: While much research focused on producing explanations, it is still unclear how the produced explanations' quality can be evaluated in a meaningful way. Today's predominant approach is to quantify explanations using proxy scores which compare explanations to (human-annotated) gold explanations. This approach assumes that explanations which reach higher proxy scores will also provide a greater benef… ▽ More

    Submitted 9 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 41 pages, 11 figures

  10. arXiv:2205.10399  [pdf, other

    cs.CL cs.LG

    Multilingual Normalization of Temporal Expressions with Masked Language Models

    Authors: Lukas Lange, Jannik Strötgen, Heike Adel, Dietrich Klakow

    Abstract: The detection and normalization of temporal expressions is an important task and preprocessing step for many applications. However, prior work on normalization is rule-based, which severely limits the applicability in real-world multilingual settings, due to the costly creation of new rules. We propose a novel neural method for normalizing temporal expressions based on masked language modeling. Ou… ▽ More

    Submitted 10 February, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted at EACL 2023

  11. arXiv:2201.11569  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Human Interpretation of Saliency-based Explanation Over Text

    Authors: Hendrik Schuff, Alon Jacovi, Heike Adel, Yoav Goldberg, Ngoc Thang Vu

    Abstract: While a lot of research in explainable AI focuses on producing effective explanations, less work is devoted to the question of how people understand and interpret the explanation. In this work, we focus on this question through a study of saliency-based explanations over textual data. Feature-attribution explanations of text models aim to communicate which parts of the input text were more influen… ▽ More

    Submitted 17 June, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: FAccT 2022

  12. CLIN-X: pre-trained language models and a study on cross-task transfer for concept extraction in the clinical domain

    Authors: Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow

    Abstract: The field of natural language processing (NLP) has recently seen a large change towards using pre-trained language models for solving almost any task. Despite showing great improvements in benchmark datasets for various tasks, these models often perform sub-optimal in non-standard domains like the clinical domain where a large gap between pre-training documents and target documents is observed. In… ▽ More

    Submitted 20 May, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: This article has been accepted for publication in Bioinformatics \c{opyright}: 2022 The Author(s). Published by Oxford University Press. All rights reserved. The published manuscript can be found here: https://doi.org/10.1093/bioinformatics/btac297

  13. arXiv:2109.08597  [pdf, other

    cs.CL cs.LG

    Boosting Transformers for Job Expression Extraction and Classification in a Low-Resource Setting

    Authors: Lukas Lange, Heike Adel, Jannik Strötgen

    Abstract: In this paper, we explore possible improvements of transformer models in a low-resource setting. In particular, we present our approaches to tackle the first two of three subtasks of the MEDDOPROF competition, i.e., the extraction and classification of job expressions in Spanish clinical texts. As neither language nor domain experts, we experiment with the multilingual XLM-R transformer model and… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Published at IberLEF 2021. Best system of the NER and CLASS tracks of the MEDDOPROF shared task

  14. arXiv:2109.07833  [pdf, other

    cs.CL cs.HC

    Does External Knowledge Help Explainable Natural Language Inference? Automatic Evaluation vs. Human Ratings

    Authors: Hendrik Schuff, Hsiu-Yu Yang, Heike Adel, Ngoc Thang Vu

    Abstract: Natural language inference (NLI) requires models to learn and apply commonsense knowledge. These reasoning abilities are particularly important for explainable NLI systems that generate a natural language explanation in addition to their label prediction. The integration of external knowledge has been shown to improve NLI systems, here we investigate whether it can also improve their explanation c… ▽ More

    Submitted 13 October, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: BlackboxNLP @ EMNLP2021

  15. arXiv:2107.12220  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Thought Flow Nets: From Single Predictions to Trains of Model Thought

    Authors: Hendrik Schuff, Heike Adel, Ngoc Thang Vu

    Abstract: When humans solve complex problems, they typically create a sequence of ideas (involving an intuitive decision, reflection, error correction, etc.) in order to reach a conclusive decision. Contrary to this, today's models are mostly trained to map an input to one single and fixed output. In this paper, we investigate how we can give models the opportunity of a second, third and $k$-th thought. Tak… ▽ More

    Submitted 14 March, 2023; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: 15 pages, 7 figures

  16. arXiv:2104.10899  [pdf, other

    cs.CL

    Enriched Attention for Robust Relation Extraction

    Authors: Heike Adel, Jannik Strötgen

    Abstract: The performance of relation extraction models has increased considerably with the rise of neural networks. However, a key issue of neural relation extraction is robustness: the models do not scale well to long sentences with multiple entities and relations. In this work, we address this problem with an enriched attention mechanism. Attention allows the model to focus on parts of the input sentence… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

  17. arXiv:2104.08078  [pdf, other

    cs.CL cs.LG

    To Share or not to Share: Predicting Sets of Sources for Model Transfer Learning

    Authors: Lukas Lange, Jannik Strötgen, Heike Adel, Dietrich Klakow

    Abstract: In low-resource settings, model transfer can help to overcome a lack of labeled data for many tasks and domains. However, predicting useful transfer sources is a challenging problem, as even the most similar sources might lead to unexpected negative transfer results. Thus, ranking methods based on task and text similarity -- as suggested in prior work -- may not be sufficient to identify promising… ▽ More

    Submitted 29 October, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP 2021

  18. arXiv:2010.12322  [pdf, other

    cs.CL cs.LG

    NLNDE at CANTEMIST: Neural Sequence Labeling and Parsing Approaches for Clinical Concept Extraction

    Authors: Lukas Lange, Xiang Dai, Heike Adel, Jannik Strötgen

    Abstract: The recognition and normalization of clinical information, such as tumor morphology mentions, is an important, but complex process consisting of multiple subtasks. In this paper, we describe our system for the CANTEMIST shared task, which is able to extract, normalize and rank ICD codes from Spanish electronic health records using neural sequence labeling and parsing approaches with context-aware… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: IberLEF 2020

  19. arXiv:2010.12309  [pdf, other

    cs.CL cs.LG

    A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios

    Authors: Michael A. Hedderich, Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow

    Abstract: Deep neural networks and huge language models are becoming omnipresent in natural language applications. As they are known for requiring large amounts of training data, there is a growing body of work to improve the performance in low-resource settings. Motivated by the recent fundamental changes towards neural models and the popular pre-train and fine-tune paradigm, we survey promising approaches… ▽ More

    Submitted 9 April, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted at NAACL 2021

  20. arXiv:2010.12305  [pdf, other

    cs.CL cs.LG

    FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations

    Authors: Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow

    Abstract: Combining several embeddings typically improves performance in downstream tasks as different embeddings encode different information. It has been shown that even models using embeddings from transformers still benefit from the inclusion of standard word embeddings. However, the combination of embeddings of different types and dimensions is challenging. As an alternative to attention-based meta-emb… ▽ More

    Submitted 29 October, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted at EMNLP 2021

  21. arXiv:2010.11683  [pdf, ps, other

    cs.CL

    An Analysis of Simple Data Augmentation for Named Entity Recognition

    Authors: Xiang Dai, Heike Adel

    Abstract: Simple yet effective data augmentation techniques have been proposed for sentence-level and sentence-pair natural language processing tasks. Inspired by these efforts, we design and compare data augmentation for named entity recognition, which is usually modeled as a token-level sequence labeling problem. Through experiments on two data sets from the biomedical and materials science domains (i2b2-… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: COLING 2020

  22. arXiv:2010.06283  [pdf, other

    cs.CL

    F1 is Not Enough! Models and Evaluation Towards User-Centered Explainable Question Answering

    Authors: Hendrik Schuff, Heike Adel, Ngoc Thang Vu

    Abstract: Explainable question answering systems predict an answer together with an explanation showing why the answer has been selected. The goal is to enable users to assess the correctness of the system and understand its reasoning process. However, we show that current models and evaluation settings have shortcomings regarding the coupling of answer and explanation which might cause serious issues in us… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  23. arXiv:2007.01030  [pdf, other

    cs.CL cs.LG

    NLNDE: The Neither-Language-Nor-Domain-Experts' Way of Spanish Medical Document De-Identification

    Authors: Lukas Lange, Heike Adel, Jannik Strötgen

    Abstract: Natural language processing has huge potential in the medical domain which recently led to a lot of research in this field. However, a prerequisite of secure processing of medical documents, e.g., patient notes and clinical trials, is the proper de-identification of privacy-sensitive information. In this paper, we describe our NLNDE system, with which we participated in the MEDDOCAN competition, t… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: Published at IberLEF 2019. Winning System of the MEDDOCAN shared task

  24. NLNDE: Enhancing Neural Sequence Taggers with Attention and Noisy Channel for Robust Pharmacological Entity Detection

    Authors: Lukas Lange, Heike Adel, Jannik Strötgen

    Abstract: Named entity recognition has been extensively studied on English news texts. However, the transfer to other domains and languages is still a challenging problem. In this paper, we describe the system with which we participated in the first subtrack of the PharmaCoNER competition of the BioNLP Open Shared Tasks 2019. Aiming at pharmacological entity detection in Spanish texts, the task provides a n… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: Published at BioNLP-OST@EMNLP 2019

  25. arXiv:2006.03039  [pdf, other

    cs.CL cs.LG

    The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain

    Authors: Annemarie Friedrich, Heike Adel, Federico Tomazic, Johannes Hingerl, Renou Benteau, Anika Maruscyk, Lukas Lange

    Abstract: This paper presents a new challenging information extraction task in the domain of materials science. We develop an annotation scheme for marking information on experiments related to solid oxide fuel cells in scientific publications, such as involved materials and measurement conditions. With this paper, we publish our annotation guidelines, as well as our SOFC-Exp corpus consisting of 45 open-ac… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: Accepted for publication at ACL 2020

  26. arXiv:2005.09397  [pdf, other

    cs.CL cs.LG

    Closing the Gap: Joint De-Identification and Concept Extraction in the Clinical Domain

    Authors: Lukas Lange, Heike Adel, Jannik Strötgen

    Abstract: Exploiting natural language processing in the clinical domain requires de-identification, i.e., anonymization of personal information in texts. However, current research considers de-identification and downstream tasks, such as concept extraction, only in isolation and does not study the effects of de-identification on other tasks. In this paper, we close this gap by reporting concept extraction p… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: ACL 2020

  27. arXiv:2005.09392  [pdf, other

    cs.CL cs.LG

    Adversarial Alignment of Multilingual Models for Extracting Temporal Expressions from Text

    Authors: Lukas Lange, Anastasiia Iurshina, Heike Adel, Jannik Strötgen

    Abstract: Although temporal tagging is still dominated by rule-based systems, there have been recent attempts at neural temporal taggers. However, all of them focus on monolingual settings. In this paper, we explore multilingual methods for the extraction of temporal expressions from text and investigate adversarial training for aligning embedding spaces to one common space. With this, we create a single mu… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: RepL4NLP at ACL 2020

  28. arXiv:2005.09389  [pdf, other

    cs.CL cs.LG

    On the Choice of Auxiliary Languages for Improved Sequence Tagging

    Authors: Lukas Lange, Heike Adel, Jannik Strötgen

    Abstract: Recent work showed that embeddings from related languages can improve the performance of sequence tagging, even for monolingual models. In this analysis paper, we investigate whether the best auxiliary language can be predicted based on language distances and show that the most related language is not always the best auxiliary language. Further, we show that attention-based meta-embeddings can eff… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: RepL4NLP at ACL 2020

  29. Type-aware Convolutional Neural Networks for Slot Filling

    Authors: Heike Adel, Hinrich Schütze

    Abstract: The slot filling task aims at extracting answers for queries about entities from text, such as "Who founded Apple". In this paper, we focus on the relation classification component of a slot filling system. We propose type-aware convolutional neural networks to benefit from the mutual dependencies between entity and relation classification. In particular, we explore different ways of integrating t… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: Journal of Artificial Intelligence Research (JAIR), volume 66

  30. arXiv:1902.11145  [pdf, other

    cs.CL

    Adversarial Training for Satire Detection: Controlling for Confounding Variables

    Authors: Robert McHardy, Heike Adel, Roman Klinger

    Abstract: The automatic detection of satire vs. regular news is relevant for downstream applications (for instance, knowledge base population) and to improve the understanding of linguistic characteristics of satire. Recent approaches build upon corpora which have been labeled automatically based on article sources. We hypothesize that this encourages the models to learn characteristics for different public… ▽ More

    Submitted 1 March, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: Accepted for publication at NAACL 2019

  31. arXiv:1811.02230  [pdf, other

    cs.CL

    CIS at TAC Cold Start 2015: Neural Networks and Coreference Resolution for Slot Filling

    Authors: Heike Adel, Hinrich Schütze

    Abstract: This paper describes the CIS slot filling system for the TAC Cold Start evaluations 2015. It extends and improves the system we have built for the evaluation last year. This paper mainly describes the changes to our last year's system. Especially, it focuses on the coreference and classification component. For coreference, we have performed several analysis and prepared a resource to simplify our… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: TAC KBP 2015

  32. arXiv:1808.04736  [pdf, other

    cs.CL

    Adversarial Neural Networks for Cross-lingual Sequence Tagging

    Authors: Heike Adel, Anton Bryl, David Weiss, Aliaksei Severyn

    Abstract: We study cross-lingual sequence tagging with little or no labeled data in the target language. Adversarial training has previously been shown to be effective for training cross-lingual sentence classifiers. However, it is not clear if language-agnostic representations enforced by an adversarial language discriminator will also enable effective transfer for token-level prediction tasks. Therefore,… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

  33. arXiv:1808.04208  [pdf, other

    cs.CL

    Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech Tagging

    Authors: Apostolos Kemos, Heike Adel, Hinrich Schütze

    Abstract: Character-level models of tokens have been shown to be effective at dealing with within-token noise and out-of-vocabulary words. But these models still rely on correct token boundaries. In this paper, we propose a novel end-to-end character-level model and demonstrate its effectiveness in multilingual settings and when token boundaries are noisy. Our model is a semi-Markov conditional random field… ▽ More

    Submitted 2 January, 2020; v1 submitted 13 August, 2018; originally announced August 2018.

    Comments: NAACL 2019

  34. arXiv:1710.09753  [pdf, other

    cs.CL

    Impact of Coreference Resolution on Slot Filling

    Authors: Heike Adel, Hinrich Schütze

    Abstract: In this paper, we demonstrate the importance of coreference resolution for natural language processing on the example of the TAC Slot Filling shared task. We illustrate the strengths and weaknesses of automatic coreference resolution systems and provide experimental results to show that they improve performance in the slot filling end-to-end setting. Finally, we publish KBPchains, a resource conta… ▽ More

    Submitted 26 October, 2017; originally announced October 2017.

    Comments: 5 pages

  35. Syntactic and Semantic Features For Code-Switching Factored Language Models

    Authors: Heike Adel, Ngoc Thang Vu, Katrin Kirchhoff, Dominic Telaar, Tanja Schultz

    Abstract: This paper presents our latest investigations on different features for factored language models for Code-Switching speech and their effect on automatic speech recognition (ASR) performance. We focus on syntactic and semantic features which can be extracted from Code-Switching text data and integrate them into factored language models. Different possible factors, such as words, part-of-speech tags… ▽ More

    Submitted 4 October, 2017; originally announced October 2017.

    Comments: IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume: 23, Issue: 3, March 2015)

  36. arXiv:1708.02275  [pdf, other

    cs.CL

    Corpus-level Fine-grained Entity Ty**

    Authors: Yadollah Yaghoobzadeh, Heike Adel, Hinrich Schütze

    Abstract: This paper addresses the problem of corpus-level entity ty**, i.e., inferring from a large corpus that an entity is a member of a class such as "food" or "artist". The application of entity ty** we are interested in is knowledge base completion, specifically, to learn which classes an entity is a member of. We propose FIGMENT to tackle this problem. FIGMENT is embedding- based and combines (i)… ▽ More

    Submitted 6 June, 2018; v1 submitted 7 August, 2017; originally announced August 2017.

    Comments: 24 pages. arXiv admin note: text overlap with arXiv:1701.02025, arXiv:1606.07901

    Journal ref: JAIR, Vol 61 (2018)

  37. arXiv:1707.07719  [pdf, other

    cs.CL

    Global Normalization of Convolutional Neural Networks for Joint Entity and Relation Classification

    Authors: Heike Adel, Hinrich Schütze

    Abstract: We introduce globally normalized convolutional neural networks for joint entity classification and relation extraction. In particular, we propose a way to utilize a linear-chain conditional random field output layer for predicting entity types and relations between entities at the same time. Our experiments show that global normalization outperforms a locally normalized softmax layer on a benchmar… ▽ More

    Submitted 7 August, 2018; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: EMNLP 2017

  38. arXiv:1612.07495  [pdf, other

    cs.CL

    Noise Mitigation for Neural Entity Ty** and Relation Extraction

    Authors: Yadollah Yaghoobzadeh, Heike Adel, Hinrich Schütze

    Abstract: In this paper, we address two different types of noise in information extraction models: noise from distant supervision and noise from pipeline input features. Our target tasks are entity ty** and relation extraction. For the first noise type, we introduce multi-instance multi-label learning algorithms using neural network models, and apply them to fine-grained entity ty** for the first time.… ▽ More

    Submitted 10 January, 2017; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: EACL 2017; the first two authors contributed equally to this work

  39. arXiv:1612.06549  [pdf, other

    cs.CL

    Exploring Different Dimensions of Attention for Uncertainty Detection

    Authors: Heike Adel, Hinrich Schütze

    Abstract: Neural networks with attention have proven effective for many natural language processing tasks. In this paper, we develop attention mechanisms for uncertainty detection. In particular, we generalize standardly used attention mechanisms by introducing external attention and sequence-preserving attention. These novel architectures differ from standard approaches in that they use external resources… ▽ More

    Submitted 10 January, 2017; v1 submitted 20 December, 2016; originally announced December 2016.

    Comments: accepted at EACL 2017

  40. arXiv:1610.00479  [pdf, ps, other

    cs.CL

    Nonsymbolic Text Representation

    Authors: Hinrich Schuetze, Heike Adel, Ehsaneddin Asgari

    Abstract: We introduce the first generic text representation model that is completely nonsymbolic, i.e., it does not require the availability of a segmentation or tokenization method that attempts to identify words or other symbolic units in text. This applies to training the parameters of the model on a training corpus as well as to applying it when computing the representation of a new text. We show that… ▽ More

    Submitted 1 May, 2017; v1 submitted 3 October, 2016; originally announced October 2016.

  41. arXiv:1605.07333  [pdf, other

    cs.CL

    Combining Recurrent and Convolutional Neural Networks for Relation Classification

    Authors: Ngoc Thang Vu, Heike Adel, Pankaj Gupta, Hinrich Schütze

    Abstract: This paper investigates two different neural architectures for the task of relation classification: convolutional neural networks and recurrent neural networks. For both models, we demonstrate the effect of different architectural choices. We present a new context representation for convolutional neural networks for relation classification (extended middle context). Furthermore, we propose connect… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

    Comments: NAACL 2016

  42. arXiv:1603.05157  [pdf, other

    cs.CL

    Comparing Convolutional Neural Networks to Traditional Models for Slot Filling

    Authors: Heike Adel, Benjamin Roth, Hinrich Schütze

    Abstract: We address relation classification in the context of slot filling, the task of finding and evaluating fillers like "Steve Jobs" for the slot X in "X founded Apple". We propose a convolutional neural network which splits the input sentence into three parts according to the relation arguments and compare it to state-of-the-art and traditional approaches of relation classification. Finally, we combin… ▽ More

    Submitted 4 April, 2016; v1 submitted 16 March, 2016; originally announced March 2016.

    Comments: NAACL 2016

  43. arXiv:1212.6080  [pdf

    cs.OH

    Beamforming Techniques for Multichannel audio Signal Separation

    Authors: Hidri Adel, Meddeb Souad, Abdulqadir Alaqeeli, Amiri Hamid

    Abstract: Beamforming is a signal processing technique. It has been studied in many areas such as radar, sonar, seismology and wireless communications, to name but a few. It can be used for a myriad of purposes, such as detecting the presence of a signal, estimating the direction of arrival, and enhancing a desired signal from its measurements corrupted by noise, competing sources and reverberation. Actuall… ▽ More

    Submitted 25 December, 2012; originally announced December 2012.

    Comments: 9 pages, 7 Figures

    Journal ref: JDCTA: International Journal of Digital Content Technology and its Applications, Vol. 6, No. 20, pp. 659-667, 2012