Showing 1–6 of 6 results for author: Trinh, T H

  1. arXiv:2210.05610  [pdf, other]

    cs.CL cs.AI

    MTet: Multi-domain Translation for English and Vietnamese

    Authors: Chinh Ngo, Trieu H. Trinh, Long Phan, Hieu Tran, Tai Dang, Hieu Nguyen, Minh Nguyen, Minh-Thang Luong

    Abstract: We introduce MTet, the largest publicly available parallel corpus for English-Vietnamese translation. MTet consists of 4.2M high-quality training sentence pairs and a multi-domain test set refined by the Vietnamese research community. Combining with previous works on English-Vietnamese translation, we grow the existing parallel dataset to 6.2M sentence pairs. We also release the first pretrained m…

    Submitted 19 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

  2. arXiv:2210.05598  [pdf, other]

    cs.CL cs.AI

    Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation

    Authors: Long Phan, Tai Dang, Hieu Tran, Trieu H. Trinh, Vy Phan, Lam D. Chau, Minh-Thang Luong

    Abstract: Biomedical data and benchmarks are highly valuable yet very limited in low-resource languages other than English such as Vietnamese. In this paper, we make use of a state-of-the-art translation model in English-Vietnamese to translate and produce both pretrained as well as supervised data in the biomedical domains. Thanks to such large-scale translation, we introduce ViPubmedT5, a pretrained Encod…

    Submitted 29 January, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  3. arXiv:2205.06457  [pdf, ps, other]

    cs.CL cs.AI

    ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation

    Authors: Long Phan, Hieu Tran, Hieu Nguyen, Trieu H. Trinh

    Abstract: We present ViT5, a pretrained Transformer-based encoder-decoder model for the Vietnamese language. With T5-style self-supervised pretraining, ViT5 is trained on a large corpus of high-quality and diverse Vietnamese texts. We benchmark ViT5 on two downstream text generation tasks, Abstractive Text Summarization and Named Entity Recognition. Although Abstractive Text Summarization has been widely st…

    Submitted 26 May, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: NAACL SRW 2022. arXiv admin note: text overlap with arXiv:2110.04257

  4. arXiv:1906.02940  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Selfie: Self-supervised Pretraining for Image Embedding

    Authors: Trieu H. Trinh, Minh-Thang Luong, Quoc V. Le

    Abstract: We introduce a pretraining technique called Selfie, which stands for SELFie supervised Image Embedding. Selfie generalizes the concept of masked language modeling of BERT (Devlin et al., 2019) to continuous data, such as images, by making use of the Contrastive Predictive Coding loss (Oord et al., 2018). Given masked-out patches in an input image, our method learns to select the correct patch, amo…

    Submitted 27 July, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  5. arXiv:1806.02847  [pdf, other]

    cs.AI cs.CL cs.LG

    A Simple Method for Commonsense Reasoning

    Authors: Trieu H. Trinh, Quoc V. Le

    Abstract: Commonsense reasoning is a long-standing challenge for deep learning. For example, it is difficult to use neural networks to tackle the Winograd Schema dataset (Levesque et al., 2011). In this paper, we present a simple method for commonsense reasoning with neural networks, using unsupervised learning. Key to our method is the use of language models, trained on a massive amount of unlabeled data, t…

    Submitted 26 September, 2019; v1 submitted 7 June, 2018; originally announced June 2018.

  6. arXiv:1803.00144  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Learning Longer-term Dependencies in RNNs with Auxiliary Losses

    Authors: Trieu H. Trinh, Andrew M. Dai, Minh-Thang Luong, Quoc V. Le

    Abstract: Despite recent advances in training recurrent neural networks (RNNs), capturing long-term dependencies in sequences remains a fundamental challenge. Most approaches use backpropagation through time (BPTT), which is difficult to scale to very long sequences. This paper proposes a simple method that improves the ability to capture long-term dependencies in RNNs by adding an unsupervised auxiliary lo…

    Submitted 13 June, 2018; v1 submitted 28 February, 2018; originally announced March 2018.

    Comments: ICML 2018