Showing 1–1 of 1 results for author: Yamanouchi, K

  1. LCP-dropout: Compression-based Multiple Subword Segmentation for Neural Machine Translation

    Authors: Keita Nonaka, Kazutaka Yamanouchi, Tomohiro I, Tsuyoshi Okita, Kazutaka Shimada, Hiroshi Sakamoto

    Abstract: In this study, we propose a simple and effective preprocessing method for subword segmentation based on a data compression algorithm. Compression-based subword segmentation has recently attracted significant attention as a preprocessing method for training data in Neural Machine Translation. Among these methods, BPE/BPE-dropout is one of the fastest and most effective methods compared to conventional approa…

    Submitted 19 March, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: 12 pages

    Journal ref: Electronics 11(7), Article number 1014, 2022