Skip to main content

Showing 1–20 of 20 results for author: Al-Onaizan, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2203.08985  [pdf, other

    cs.CL

    Label Semantics for Few Shot Named Entity Recognition

    Authors: Jie Ma, Miguel Ballesteros, Srikanth Doss, Rishita Anubhai, Sunil Mallya, Yaser Al-Onaizan, Dan Roth

    Abstract: We study the problem of few shot learning for named entity recognition. Specifically, we leverage the semantic information in the names of the labels as a way of giving the model additional signal and enriched priors. We propose a neural architecture that consists of two BERT encoders, one to encode the document and its tokens and another one to encode each of the labels in natural language format… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Findings of ACL 2022

  2. arXiv:2106.09790  [pdf, other

    cs.CL cs.AI

    Multi-Task Learning and Adapted Knowledge Models for Emotion-Cause Extraction

    Authors: Elsbeth Turcan, Shuai Wang, Rishita Anubhai, Kasturi Bhattacharjee, Yaser Al-Onaizan, Smaranda Muresan

    Abstract: Detecting what emotions are expressed in text is a well-studied problem in natural language processing. However, research on finer grained emotion analysis such as what causes an emotion is still in its infancy. We present solutions that tackle both emotion recognition and emotion cause detection in a joint fashion. Considering that common-sense knowledge plays an important role in understanding i… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 15 pages, 6 figures. Findings of ACL 2021

  3. arXiv:2012.07516  [pdf, other

    cs.CL

    Meta learning to classify intent and slot labels with noisy few shot examples

    Authors: Shang-Wen Li, Jason Krone, Shuyan Dong, Yi Zhang, Yaser Al-onaizan

    Abstract: Recently deep learning has dominated many machine learning areas, including spoken language understanding (SLU). However, deep learning models are notorious for being data-hungry, and the heavily optimized models are usually sensitive to the quality of the training examples provided and the consistency between training and inference conditions. To improve the performance of SLU models on tasks wit… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: accepted by IEEE Spoken Language Technology Workshop, 2021

  4. arXiv:2010.14042  [pdf, other

    cs.CL

    To BERT or Not to BERT: Comparing Task-specific and Task-agnostic Semi-Supervised Approaches for Sequence Tagging

    Authors: Kasturi Bhattacharjee, Miguel Ballesteros, Rishita Anubhai, Smaranda Muresan, Jie Ma, Faisal Ladhak, Yaser Al-Onaizan

    Abstract: Leveraging large amounts of unlabeled data using Transformer-like architectures, like BERT, has gained popularity in recent times owing to their effectiveness in learning general representations that can then be further fine-tuned for downstream tasks to much success. However, training these models can be costly both from an economic and environmental standpoint. In this work, we investigate how t… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: Accepted in the Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)(https://2020.emnlp.org/papers/main)

  5. arXiv:2010.03022  [pdf, other

    cs.CL

    Resource-Enhanced Neural Model for Event Argument Extraction

    Authors: Jie Ma, Shuai Wang, Rishita Anubhai, Miguel Ballesteros, Yaser Al-Onaizan

    Abstract: Event argument extraction (EAE) aims to identify the arguments of an event and classify the roles that those arguments play. Despite great efforts made in prior work, there remain many challenges: (1) Data scarcity. (2) Capturing the long-range dependency, specifically, the connection between an event trigger and a distant event argument. (3) Integrating event trigger information into candidate ar… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020

  6. arXiv:2005.01840  [pdf, other

    cs.CL

    Exploring Content Selection in Summarization of Novel Chapters

    Authors: Faisal Ladhak, Bryan Li, Yaser Al-Onaizan, Kathleen McKeown

    Abstract: We present a new summarization task, generating summaries of novel chapters using summary/chapter pairs from online study guides. This is a harder task than the news summarization task, given the chapter length as well as the extreme paraphrasing and generalization found in the summaries. We focus on extractive summarization, which requires the creation of a gold-standard set of extractive summari… ▽ More

    Submitted 29 March, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: ACL 2020

  7. arXiv:2005.01655  [pdf, other

    cs.CL cs.CV

    Words aren't enough, their order matters: On the Robustness of Grounding Visual Referring Expressions

    Authors: Arjun R Akula, Spandana Gella, Yaser Al-Onaizan, Song-Chun Zhu, Siva Reddy

    Abstract: Visual referring expression recognition is a challenging task that requires natural language understanding in the context of an image. We critically examine RefCOCOg, a standard benchmark for this task, using a human study and show that 83.7% of test instances do not require reasoning on linguistic structure, i.e., words are enough to identify the target object, the word order doesn't matter. To m… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: ACL 2020

  8. arXiv:2005.00580  [pdf, other

    cs.CL

    Evaluating Robustness to Input Perturbations for Neural Machine Translation

    Authors: Xing Niu, Prashant Mathur, Georgiana Dinu, Yaser Al-Onaizan

    Abstract: Neural Machine Translation (NMT) models are sensitive to small perturbations in the input. Robustness to such perturbations is typically measured using translation quality metrics such as BLEU on the noisy input. This paper proposes additional metrics which measure the relative degradation and changes in translation when small perturbations are added to the input. We focus on a class of models emp… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Accepted at ACL 2020

  9. arXiv:2004.05219  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Joint translation and unit conversion for end-to-end localization

    Authors: Georgiana Dinu, Prashant Mathur, Marcello Federico, Stanislas Lauly, Yaser Al-Onaizan

    Abstract: A variety of natural language tasks require processing of textual data which contains a mix of natural language and formal languages such as mathematical expressions. In this paper, we take unit conversions as an example and propose a data augmentation technique which leads to models learning both translation and conversion tasks as well as how to adequately switch between them for end-to-end loca… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

  10. arXiv:2004.04295  [pdf, ps, other

    cs.CL

    Severing the Edge Between Before and After: Neural Architectures for Temporal Ordering of Events

    Authors: Miguel Ballesteros, Rishita Anubhai, Shuai Wang, Nima Pourdamghani, Yogarshi Vyas, Jie Ma, Parminder Bhatia, Kathleen McKeown, Yaser Al-Onaizan

    Abstract: In this paper, we propose a neural architecture and a set of training methods for ordering events by predicting temporal relations. Our proposed models receive a pair of events within a span of text as input and they identify temporal relations (Before, After, Equal, Vague) between them. Given that a key challenge with this task is the scarcity of annotated data, our models rely on either pretrain… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

  11. arXiv:1911.05241  [pdf, ps, other

    cs.CL cs.AI

    Robustness to Capitalization Errors in Named Entity Recognition

    Authors: Sravan Bodapati, Hyokun Yun, Yaser Al-Onaizan

    Abstract: Robustness to capitalization errors is a highly desirable characteristic of named entity recognizers, yet we find standard models for the task are surprisingly brittle to such noise. Existing methods to improve robustness to the noise completely discard given orthographic information, mwhich significantly degrades their performance on well-formed text. We propose a simple alternative approach base… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: Accepted to EMNLP 2019 Workshop : W-NUT 2019 5th Workshop on Noisy User Generated Text

    Journal ref: http://noisy-text.github.io/2019/

  12. arXiv:1910.01043  [pdf, other

    cs.CL

    Neural Word Decomposition Models for Abusive Language Detection

    Authors: Sravan Babu Bodapati, Spandana Gella, Kasturi Bhattacharjee, Yaser Al-Onaizan

    Abstract: User generated text on social media often suffers from a lot of undesired characteristics including hatespeech, abusive language, insults etc. that are targeted to attack or abuse a specific group of people. Often such text is written differently compared to traditional text such as news involving either explicit mention of abusive words, obfuscated words and typological errors or implicit abuse i… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted at ALW Workshop at ACL2019, Florence; BERT has a WordPiece model and it enhances performance of word based models in noisy settings

    Journal ref: https://www.aclweb.org/anthology/events/acl-2019/

  13. arXiv:1906.01105  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Training Neural Machine Translation To Apply Terminology Constraints

    Authors: Georgiana Dinu, Prashant Mathur, Marcello Federico, Yaser Al-Onaizan

    Abstract: This paper proposes a novel method to inject custom terminology into neural machine translation at run time. Previous works have mainly proposed modifications to the decoding algorithm in order to constrain the output to include run-time-provided target terms. While being effective, these constrained decoding methods add, however, significant computational overhead to the inference step, and, as w… ▽ More

    Submitted 24 June, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Accepted as a short paper at ACL 2019

  14. arXiv:1707.07755  [pdf, ps, other

    cs.CL

    AMR Parsing using Stack-LSTMs

    Authors: Miguel Ballesteros, Yaser Al-Onaizan

    Abstract: We present a transition-based AMR parser that directly generates AMR parses from plain text. We use Stack-LSTMs to represent our parser state and make decisions greedily. In our experiments, we show that our parser achieves very competitive scores on English using only AMR training data. Adding additional information, such as POS tags and dependency trees, improves the results further.

    Submitted 2 August, 2017; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: EMNLP 2017

  15. arXiv:1706.03824  [pdf, other

    cs.CL

    Attention-based Vocabulary Selection for NMT Decoding

    Authors: Baskaran Sankaran, Markus Freitag, Yaser Al-Onaizan

    Abstract: Neural Machine Translation (NMT) models usually use large target vocabulary sizes to capture most of the words in the target language. The vocabulary size is a big factor when decoding new sentences as the final softmax layer normalizes over all possible target words. To address this problem, it is widely common to restrict the target vocabulary with candidate lists based on the source sentence. U… ▽ More

    Submitted 12 June, 2017; originally announced June 2017.

    Comments: Submitted to Second Conference on Machine Translation (WMT-17); 7 pages

  16. Beam Search Strategies for Neural Machine Translation

    Authors: Markus Freitag, Yaser Al-Onaizan

    Abstract: The basic concept in Neural Machine Translation (NMT) is to train a large Neural Network that maximizes the translation performance on a given parallel corpus. NMT is then using a simple left-to-right beam-search decoder to generate new translations that approximately maximize the trained conditional probability. The current beam search strategy generates the target sentence word by word from left… ▽ More

    Submitted 13 June, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

    Comments: First Workshop on Neural Machine Translation, 2017

    Journal ref: Proceedings of the First Workshop on Neural Machine Translation, 2017

  17. arXiv:1702.01802  [pdf, ps, other

    cs.CL

    Ensemble Distillation for Neural Machine Translation

    Authors: Markus Freitag, Yaser Al-Onaizan, Baskaran Sankaran

    Abstract: Knowledge distillation describes a method for training a student network to perform better by learning from a stronger teacher network. Translating a sentence with an Neural Machine Translation (NMT) engine is time expensive and having a smaller model speeds up this process. We demonstrate how to transfer the translation quality of an ensemble and an oracle BLEU teacher network into a single NMT s… ▽ More

    Submitted 7 August, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

  18. arXiv:1612.06897  [pdf, ps, other

    cs.CL

    Fast Domain Adaptation for Neural Machine Translation

    Authors: Markus Freitag, Yaser Al-Onaizan

    Abstract: Neural Machine Translation (NMT) is a new approach for automatic translation of text from one human language into another. The basic concept in NMT is to train a large Neural Network that maximizes the translation performance on a given parallel corpus. NMT is gaining popularity in the research community because it outperformed traditional SMT approaches in several translation tasks at WMT and oth… ▽ More

    Submitted 20 December, 2016; originally announced December 2016.

  19. arXiv:1608.02927  [pdf, other

    cs.CL

    Temporal Attention Model for Neural Machine Translation

    Authors: Baskaran Sankaran, Haitao Mi, Yaser Al-Onaizan, Abe Ittycheriah

    Abstract: Attention-based Neural Machine Translation (NMT) models suffer from attention deficiency issues as has been observed in recent research. We propose a novel mechanism to address some of these limitations and improve the NMT attention. Specifically, our approach memorizes the alignments temporally (within each sentence) and modulates the attention with the accumulated temporal memory, as the decoder… ▽ More

    Submitted 9 August, 2016; originally announced August 2016.

    Comments: 8 pages

  20. arXiv:1606.04164  [pdf, ps, other

    cs.CL

    Zero-Resource Translation with Multi-Lingual Neural Machine Translation

    Authors: Orhan Firat, Baskaran Sankaran, Yaser Al-Onaizan, Fatos T. Yarman Vural, Kyunghyun Cho

    Abstract: In this paper, we propose a novel finetuning algorithm for the recently introduced multi-way, mulitlingual neural machine translate that enables zero-resource machine translation. When used together with novel many-to-one translation strategies, we empirically show that this finetuning algorithm allows the multi-way, multilingual model to translate a zero-resource language pair (1) as well as a si… ▽ More

    Submitted 13 June, 2016; originally announced June 2016.