Skip to main content

Showing 1–7 of 7 results for author: Yaseen, U

.
  1. arXiv:2206.15221  [pdf, other

    cs.CL

    Domain Adaptive Pretraining for Multilingual Acronym Extraction

    Authors: Usama Yaseen, Stefan Langer

    Abstract: This paper presents our findings from participating in the multilingual acronym extraction shared task SDU@AAAI-22. The task consists of acronym extraction from documents in 6 languages within scientific and legal domains. To address multilingual acronym extraction we employed BiLSTM-CRF with multilingual XLM-RoBERTa embeddings. We pretrained the XLM-RoBERTa model on the shared task corpus to furt… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: SDU@AAAI-22

  2. arXiv:2112.02721  [pdf, other

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, **ho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo , et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split… ▽ More

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  3. arXiv:2108.11703  [pdf, other

    cs.CL

    Data Augmentation for Low-Resource Named Entity Recognition Using Backtranslation

    Authors: Usama Yaseen, Stefan Langer

    Abstract: The state of art natural language processing systems relies on sizable training datasets to achieve high performance. Lack of such datasets in the specialized low resource domains lead to suboptimal performance. In this work, we adapt backtranslation to generate high quality and linguistically diverse synthetic data for low-resource named entity recognition. We perform experiments on two datasets… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

  4. arXiv:2106.15329  [pdf, other

    cs.CV

    Cloud based Scalable Object Recognition from Video Streams using Orientation Fusion and Convolutional Neural Networks

    Authors: Muhammad Usman Yaseen, Ashiq Anjum, Giancarlo Fortino, Antonio Liotta, Amir Hussain

    Abstract: Object recognition from live video streams comes with numerous challenges such as the variation in illumination conditions and poses. Convolutional neural networks (CNNs) have been widely used to perform intelligent visual object recognition. Yet, CNNs still suffer from severe accuracy degradation, particularly on illumination-variant datasets. To address this problem, we propose a new CNN method… ▽ More

    Submitted 19 June, 2021; originally announced June 2021.

  5. arXiv:2106.05823  [pdf, other

    cs.CL

    Neural Text Classification and Stacked Heterogeneous Embeddings for Named Entity Recognition in SMM4H 2021

    Authors: Usama Yaseen, Stefan Langer

    Abstract: This paper presents our findings from participating in the SMM4H Shared Task 2021. We addressed Named Entity Recognition (NER) and Text Classification. To address NER we explored BiLSTM-CRF with Stacked Heterogeneous Embeddings and linguistic features. We investigated various machine learning algorithms (logistic regression, Support Vector Machine (SVM) and Neural Networks) to address text classif… ▽ More

    Submitted 11 June, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: NAACL 2021

  6. arXiv:1910.03385  [pdf, other

    cs.CL

    Linguistically Informed Relation Extraction and Neural Architectures for Nested Named Entity Recognition in BioNLP-OST 2019

    Authors: Usama Yaseen, Pankaj Gupta, Hinrich Schütze

    Abstract: Named Entity Recognition (NER) and Relation Extraction (RE) are essential tools in distilling knowledge from biomedical literature. This paper presents our findings from participating in BioNLP Shared Tasks 2019. We addressed Named Entity Recognition including nested entities extraction, Entity Normalization and Relation Extraction. Our proposed approach of Named Entities can be generalized to dif… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: EMNLP 2019, 11 pages, 4 figures, 8 tables

  7. arXiv:1909.06162  [pdf, other

    cs.CL cs.IR cs.LG

    Neural Architectures for Fine-Grained Propaganda Detection in News

    Authors: Pankaj Gupta, Khushbu Saxena, Usama Yaseen, Thomas Runkler, Hinrich Schütze

    Abstract: This paper describes our system (MIC-CIS) details and results of participation in the fine-grained propaganda detection shared task 2019. To address the tasks of sentence (SLC) and fragment level (FLC) propaganda detection, we explore different neural architectures (e.g., CNN, LSTM-CRF and BERT) and extract linguistic (e.g., part-of-speech, named entity, readability, sentiment, emotion, etc.), lay… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: EMNLP2019: Fine-grained propaganda detection shared task at NLP4IF workshop (EMNLP2019)