Showing 1–4 of 4 results for author: Park, E L

Searching in archive cs.
  1. arXiv:2008.02878  [pdf, ps, other]

    cs.CL cs.LG

    A Multilingual Neural Machine Translation Model for Biomedical Data

    Authors: Alexandre Bérard, Zae Myung Kim, Vassilina Nikoulina, Eunjeong L. Park, Matthias Gallé

    Abstract: We release a multilingual neural machine translation model, which can be used to translate text in the biomedical domain. The model can translate from 5 languages (French, German, Italian, Korean and Spanish) into English. It is trained with large amounts of generic and biomedical data, using domain tags. Our benchmarks show that it performs near state-of-the-art both on news (generic domain) and…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: https://github.com/naver/covid19-nmt

  2. arXiv:2004.13937  [pdf, other]

    cs.CL cs.LG

    Revisiting Round-Trip Translation for Quality Estimation

    Authors: Jihyung Moon, Hyunchang Cho, Eunjeong L. Park

    Abstract: Quality estimation (QE) is the task of automatically evaluating the quality of translations without human-translated references. Calculating BLEU between the input sentence and round-trip translation (RTT) was once considered as a metric for QE, however, it was found to be a poor predictor of translation quality. Recently, various pre-trained language models have made breakthroughs in NLP tasks by…

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: To be published in EAMT 2020

  3. arXiv:1903.05823  [pdf, other]

    cs.CL cs.DL

    Deep Patent Landscaping Model Using Transformer and Graph Embedding

    Authors: Seokkyu Choi, Hyeonju Lee, Eunjeong Lucy Park, Sungchul Choi

    Abstract: Patent landscaping is a method used for searching related patents during a research and development (R&D) project. To avoid the risk of patent infringement and to follow current trends in technology, patent landscaping is a crucial task required during the early stages of an R&D project. As the process of patent landscaping requires advanced resources and can be tedious, the demand for automated p…

    Submitted 21 November, 2019; v1 submitted 14 March, 2019; originally announced March 2019.

  4. arXiv:1901.09613  [pdf, other]

    cs.LG cs.IR stat.ML

    Hybrid Machine Learning Approach to Popularity Prediction of Newly Released Contents for Online Video Streaming Service

    Authors: Hongjun Jeon, Wonchul Seo, Eunjeong Lucy Park, Sungchul Choi

    Abstract: In the industry of video content providers such as VOD and IPTV, predicting the popularity of video contents in advance is critical not only from a marketing perspective but also from a network optimization perspective. By predicting whether the content will be successful or not in advance, the content file, which is large, is efficiently deployed in the proper service providing server, leading to…

    Submitted 28 January, 2019; originally announced January 2019.