Skip to main content

Showing 1–4 of 4 results for author: Vateekul, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.12778  [pdf, other

    cs.CL cs.LG

    Label-Aware Automatic Verbalizer for Few-Shot Text Classification

    Authors: Thanakorn Thaminkaew, Piyawat Lertvittayakumjorn, Peerapon Vateekul

    Abstract: Prompt-based learning has shown its effectiveness in few-shot text classification. One important factor in its success is a verbalizer, which translates output from a language model into a predicted class. Notably, the simplest and widely acknowledged verbalizer employs manual labels to represent the classes. However, manual selection does not guarantee the optimality of the selected words when co… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  2. A Summary of the ALQAC 2021 Competition

    Authors: Nguyen Ha Thanh, Bui Minh Quan, Chau Nguyen, Tung Le, Nguyen Minh Phuong, Dang Tran Binh, Vuong Thi Hai Yen, Teeradaj Racharak, Nguyen Le Minh, Tran Duc Vu, Phan Viet Anh, Nguyen Truong Son, Huy Tien Nguyen, Bhumindr Butr-indr, Peerapon Vateekul, Prachya Boonkwan

    Abstract: We summarize the evaluation of the first Automated Legal Question Answering Competition (ALQAC 2021). The competition this year contains three tasks, which aims at processing the statute law document, which are Legal Text Information Retrieval (Task 1), Legal Text Entailment Prediction (Task 2), and Legal Text Question Answering (Task 3). The final goal of these tasks is to build a system that can… ▽ More

    Submitted 24 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

  3. arXiv:1912.01580  [pdf, ps, other

    cs.LG cs.CL stat.ML

    A Comparative Study of Pretrained Language Models on Thai Social Text Categorization

    Authors: Thanapapas Horsuwan, Kasidis Kanwatchara, Peerapon Vateekul, Boonserm Kijsirikul

    Abstract: The ever-growing volume of data of user-generated content on social media provides a nearly unlimited corpus of unlabeled data even in languages where resources are scarce. In this paper, we demonstrate that state-of-the-art results on two Thai social text categorization tasks can be realized by pretraining a language model on a large noisy Thai social media corpus of over 1.26 billion tokens and… ▽ More

    Submitted 17 December, 2019; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: 12 pages, conference

  4. arXiv:1908.01294  [pdf, ps, other

    cs.CL

    Semi-supervised Thai Sentence Segmentation Using Local and Distant Word Representations

    Authors: Chanatip Saetia, Ekapol Chuangsuwanich, Tawunrat Chalothorn, Peerapon Vateekul

    Abstract: A sentence is typically treated as the minimal syntactic unit used for extracting valuable information from a longer piece of text. However, in written Thai, there are no explicit sentence markers. We proposed a deep learning model for the task of sentence segmentation that includes three main contributions. First, we integrate n-gram embedding as a local representation to capture word groups near… ▽ More

    Submitted 25 August, 2019; v1 submitted 4 August, 2019; originally announced August 2019.

    Comments: 19 pages, 6 figures