Showing 1–4 of 4 results for author: Shapiro, P

  1. arXiv:1905.05816  [pdf, other]

    cs.CL

    Curriculum Learning for Domain Adaptation in Neural Machine Translation

    Authors: Xuan Zhang, Pamela Shapiro, Gaurav Kumar, Paul McNamee, Marine Carpuat, Kevin Duh

    Abstract: We introduce a curriculum learning approach to adapt generic neural machine translation models to a specific domain. Samples are grouped by their similarities to the domain of interest and each group is fed to the training algorithm with a particular schedule. This approach is simple to implement on top of any neural framework or architecture, and consistently outperforms both unadapted and adapte…

    Submitted 14 May, 2019; originally announced May 2019.

  2. arXiv:1809.02223  [pdf, other]

    cs.CL

    Character-Aware Decoder for Translation into Morphologically Rich Languages

    Authors: Adithya Renduchintala, Pamela Shapiro, Kevin Duh, Philipp Koehn

    Abstract: Neural machine translation (NMT) systems operate primarily on words (or sub-words), ignoring lower-level patterns of morphology. We present a character-aware decoder designed to capture such patterns when translating into morphologically rich languages. We achieve character-awareness by augmenting both the softmax and embedding layers of an attention-based encoder-decoder model with convolutional…

    Submitted 18 June, 2019; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: 9 pages (12 including Appendix), 5 figures, Accepted at MT Summit 2019

  3. arXiv:1809.01301  [pdf, other]

    cs.CL

    BPE and CharCNNs for Translation of Morphology: A Cross-Lingual Comparison and Analysis

    Authors: Pamela Shapiro, Kevin Duh

    Abstract: Neural Machine Translation (NMT) in low-resource settings and of morphologically rich languages is made difficult in part by data sparsity of vocabulary words. Several methods have been used to help reduce this sparsity, notably Byte-Pair Encoding (BPE) and a character-based CNN layer (charCNN). However, the charCNN has largely been neglected, possibly because it has only been compared to BPE rath…

    Submitted 8 September, 2018; v1 submitted 4 September, 2018; originally announced September 2018.

  4. arXiv:1808.10024  [pdf, other]

    cs.CL

    Hard Non-Monotonic Attention for Character-Level Transduction

    Authors: Shijie Wu, Pamela Shapiro, Ryan Cotterell

    Abstract: Character-level string-to-string transduction is an important component of various NLP tasks. The goal is to map an input string to an output string, where the strings may be of different lengths and have characters taken from different alphabets. Recent approaches have used sequence-to-sequence models with an attention mechanism to learn which parts of the input string the model should focus on d…

    Submitted 20 February, 2024; v1 submitted 29 August, 2018; originally announced August 2018.

    Comments: Published in EMNLP 2018