Skip to main content

Showing 1–1 of 1 results for author: Sundararaman, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:1911.06156  [pdf, other

    cs.CL cs.LG stat.ML

    Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding

    Authors: Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Shi**g Si, Dinghan Shen, Dong Wang, Lawrence Carin

    Abstract: Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks. The Transformer, for instance, is an illustrative example that generates abstract representations of tokens inputted to an encoder based on their relationships to all tokens in a sequence. Recent studies have shown that although such models are capable of learning syntactic features purely b… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.