Showing 1–19 of 19 results for author: Chuangsuwanich, E

  1. arXiv:2406.08055  [pdf, other]

    cs.CL

    Learning Job Title Representation from Job Description Aggregation Network

    Authors: Napat Laosaengpha, Thanit Tativannarat, Chawan Piansaddhayanon, Attapol Rutherford, Ekapol Chuangsuwanich

    Abstract: Learning job title representation is a vital process for developing automatic human resource tools. To do so, existing methods primarily rely on learning the title representation through skills extracted from the job description, neglecting the rich and diverse content within. Thus, we propose an alternative framework for learning job titles through their respective job description (JD) and utiliz…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: To be published in Findings of the Association for Computational Linguistics: ACL 2024

  2. arXiv:2406.06139  [pdf, other]

    cs.SD cs.AI cs.CL eess.AS

    Thunder: Unified Regression-Diffusion Speech Enhancement with a Single Reverse Step using Brownian Bridge

    Authors: Thanapat Trachu, Chawan Piansaddhayanon, Ekapol Chuangsuwanich

    Abstract: Diffusion-based speech enhancement has shown promising results, but can suffer from a slower inference time. Initializing the diffusion process with the enhanced audio generated by a regression-based model can be used to reduce the computational steps required. However, these approaches often necessitate a regression model, further increasing the system's complexity. We propose Thunder, a unified…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures, 4 tables. This paper will be submitted to the Interspeech conference

  3. arXiv:2406.05733  [pdf, other]

    cs.CL

    MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking Model

    Authors: Danupat Khamnuansin, Tawunrat Chalothorn, Ekapol Chuangsuwanich

    Abstract: Large Language Models (LLMs) often struggle with hallucinations and outdated information. To address this, Information Retrieval (IR) systems can be employed to augment LLMs with up-to-date knowledge. However, existing IR techniques contain deficiencies, posing a performance bottleneck. Given the extensive array of IR systems, combining diverse approaches presents a viable strategy. Nevertheless,…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: To be published in Findings of ACL 2024

  4. arXiv:2406.03125  [pdf, other]

    cs.CL

    Space Decomposition for Sentence Embedding

    Authors: Wuttikorn Ponwitayarat, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong

    Abstract: Determining sentence pair similarity is crucial for various NLP tasks. A common technique to address this is typically evaluated on a continuous semantic textual similarity (STS) scale from 0 to 5. However, based on a linguistic observation in STS annotation guidelines, we found that the score in the range [4,5] indicates an upper-range sample, while the rest are lower-range samples. This necessitates a…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024. The code and pre-trained models are available at https://github.com/KornWtp/MixSP

  5. arXiv:2403.16127  [pdf, other]

    cs.CL cs.AI

    WangchanLion and WangchanX MRC Eval

    Authors: Wannaphong Phatthiyaphaibun, Surapon Nonesung, Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Can Udomcharoenchaikit, Jitkapat Sawatphol, Chompakorn Chaksangchaichot, Ekapol Chuangsuwanich, Sarana Nutanong

    Abstract: This technical report describes the development of WangchanLion, an instruction fine-tuned model focusing on Machine Reading Comprehension (MRC) in the Thai language. Our model is based on SEA-LION and a collection of instruction following datasets. To promote open research and reproducibility, we publicly release all training data, code, and the final model weights under the Apache-2 license. To…

    Submitted 23 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  6. arXiv:2311.03228  [pdf, other]

    cs.CL cs.AI

    An Efficient Self-Supervised Cross-View Training For Sentence Embedding

    Authors: Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Lalita Lowphansirikul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong

    Abstract: Self-supervised sentence representation learning is the task of constructing an embedding space for sentences without relying on human annotation efforts. One straightforward approach is to finetune a pretrained language model (PLM) with a representation learning method such as contrastive learning. While this approach achieves impressive performance on larger PLMs, the performance rapidly degrade…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted to TACL. The code and pre-trained models are available at https://github.com/mrpeerat/SCT

  7. arXiv:2306.10348  [pdf, other]

    cs.IR cs.CL

    Typo-Robust Representation Learning for Dense Retrieval

    Authors: Panuthep Tasawong, Wuttikorn Ponwitayarat, Peerat Limkonchotiwat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong

    Abstract: Dense retrieval is a basic building block of information retrieval applications. One of the main challenges of dense retrieval in real-world settings is the handling of queries containing misspelled words. A popular approach for handling misspelled queries is minimizing the representation discrepancy between misspelled queries and their pristine ones. Unlike the existing approaches, which only fo…

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: 5 pages, 2 figures

    ACM Class: I.2.7

  8. arXiv:2303.13396  [pdf, other]

    cs.CV

    Zero-guidance Segmentation Using Zero Segment Labels

    Authors: Pitchaporn Rewatbowornwong, Nattanat Chatthee, Ekapol Chuangsuwanich, Supasorn Suwajanakorn

    Abstract: CLIP has enabled new and exciting joint vision-language applications, one of which is open-vocabulary segmentation, which can locate any segment given an arbitrary text query. In our research, we ask whether it is possible to discover semantic segments without any user guidance in the form of text queries or predefined classes, and to label them automatically using natural language. We propose a nove…

    Submitted 4 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

  9. arXiv:2208.04799  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Thai Wav2Vec2.0 with CommonVoice V8

    Authors: Wannaphong Phatthiyaphaibun, Chompakorn Chaksangchaichot, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong

    Abstract: Recently, Automatic Speech Recognition (ASR), a system that converts audio into text, has caught a lot of attention in the machine learning community. Thus, a lot of publicly available models were released on HuggingFace. However, most of these ASR models are available in English; only a minority of the models are available in Thai. Additionally, most of the Thai ASR models are closed-source, and…

    Submitted 9 August, 2022; originally announced August 2022.

  10. arXiv:2202.13912  [pdf, other]

    cs.CV q-bio.TO

    ReCasNet: Improving consistency within the two-stage mitosis detection framework

    Authors: Chawan Piansaddhayanon, Sakun Santisukwongchote, Shanop Shuangshoti, Qingyi Tao, Sira Sriswasdi, Ekapol Chuangsuwanich

    Abstract: Mitotic count (MC) is an important histological parameter for cancer diagnosis and grading, but the manual process for obtaining MC from whole-slide histopathological images is very time-consuming and prone to error. Therefore, deep learning models have been proposed to facilitate this process. Existing approaches utilize a two-stage pipeline: the detection stage for identifying the locations of p…

    Submitted 28 February, 2022; originally announced February 2022.

  11. arXiv:2010.11475  [pdf, other]

    cs.CV

    High resolution weakly supervised localization architectures for medical images

    Authors: Konpat Preechakul, Sira Sriswasdi, Boonserm Kijsirikul, Ekapol Chuangsuwanich

    Abstract: In medical imaging, Class-Activation Map (CAM) serves as the main explainability tool by pointing to the region of interest. Since the localization accuracy from CAM is constrained by the resolution of the model's feature map, one may expect that segmentation models, which generally have large feature maps, would produce more accurate CAMs. However, we have found that this is not the case due to t…

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: submitted to ICASSP 2021

  12. arXiv:2005.07920  [pdf, other]

    eess.AS cs.CL cs.SD

    Reducing Spelling Inconsistencies in Code-Switching ASR using Contextualized CTC Loss

    Authors: Burin Naowarat, Thananchai Kongthaworn, Korrawe Karunratanakul, Sheng Hui Wu, Ekapol Chuangsuwanich

    Abstract: Code-Switching (CS) remains a challenge for Automatic Speech Recognition (ASR), especially character-based models. With the combined choice of characters from multiple languages, the outcome from character-based models suffers from phoneme duplication, resulting in language-inconsistent spellings. We propose Contextualized Connectionist Temporal Classification (CCTC) loss to encourage spelling con…

    Submitted 22 June, 2021; v1 submitted 16 May, 2020; originally announced May 2020.

    Comments: ICASSP 2021

  13. arXiv:2004.04157  [pdf, other]

    q-bio.NC cs.LG eess.SP

    MetaSleepLearner: A Pilot Study on Fast Adaptation of Bio-signals-Based Sleep Stage Classifier to New Individual Subject Using Meta-Learning

    Authors: Nannapas Banluesombatkul, Pichayoot Ouppaphan, Pitshaporn Leelaarporn, Payongkit Lakhan, Busarakum Chaitusaney, Nattapong Jaimchariyatam, Ekapol Chuangsuwanich, Wei Chen, Huy Phan, Nat Dilokthanakul, Theerawit Wilaiprasitporn

    Abstract: Identifying sleep stages from bio-signals requires time-consuming and tedious labor by skilled clinicians. Deep learning approaches have been introduced to tackle automatic sleep stage classification. However, replacing clinicians with an automatic system is difficult because individual bio-signals differ in many aspects, causing…

    Submitted 10 November, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: IEEE Journal of Biomedical and Health Informatics (Accepted) (source code is available at https://github.com/IoBT-VISTEC/MetaSleepLearner)

    Journal ref: IEEE Journal of Biomedical and Health Informatics (2020)

  14. arXiv:1908.01294  [pdf, ps, other]

    cs.CL

    Semi-supervised Thai Sentence Segmentation Using Local and Distant Word Representations

    Authors: Chanatip Saetia, Ekapol Chuangsuwanich, Tawunrat Chalothorn, Peerapon Vateekul

    Abstract: A sentence is typically treated as the minimal syntactic unit used for extracting valuable information from a longer piece of text. However, in written Thai, there are no explicit sentence markers. We propose a deep learning model for the task of sentence segmentation that includes three main contributions. First, we integrate n-gram embedding as a local representation to capture word groups near…

    Submitted 25 August, 2019; v1 submitted 4 August, 2019; originally announced August 2019.

    Comments: 19 pages, 6 figures

  15. arXiv:1808.10852  [pdf, other]

    eess.SP cs.HC q-bio.NC

    Towards Asynchronous Motor Imagery-Based Brain-Computer Interfaces: a joint training scheme using deep learning

    Authors: Patcharin Cheng, Phairot Autthasan, Boriwat Pijarana, Ekapol Chuangsuwanich, Theerawit Wilaiprasitporn

    Abstract: In this paper, the deep learning (DL) approach is applied to a joint training scheme for asynchronous motor imagery-based Brain-Computer Interface (BCI). The proposed DL approach is a cascade of one-dimensional convolutional neural networks and fully-connected neural networks (CNN-FC). The focus is mainly on three types of brain responses: non-imagery EEG (background EEG), (pure i…

    Submitted 31 August, 2018; originally announced August 2018.

    Journal ref: TENCON 2018 - 2018 IEEE Region 10 Conference

  16. Universal Joint Feature Extraction for P300 EEG Classification using Multi-task Autoencoder

    Authors: Apiwat Ditthapron, Nannapas Banluesombatkul, Sombat Ketrat, Ekapol Chuangsuwanich, Theerawit Wilaiprasitporn

    Abstract: The process of recording Electroencephalography (EEG) signals is onerous and requires massive storage to store signals at an applicable frequency rate. In this work, we propose the Event-Related Potential Encoder Network (ERPENet), a multi-task autoencoder-based model that can be applied to any ERP-related task. The strength of ERPENet lies in its capability to handle various kinds of ERP dataset…

    Submitted 30 April, 2019; v1 submitted 30 July, 2018; originally announced August 2018.

    Journal ref: IEEE Access 2019

  17. Affective EEG-Based Person Identification Using the Deep Learning Approach

    Authors: Theerawit Wilaiprasitporn, Apiwat Ditthapron, Karis Matchaparn, Tanaboon Tongbuasirilai, Nannapas Banluesombatkul, Ekapol Chuangsuwanich

    Abstract: Electroencephalography (EEG) is another mode for performing Person Identification (PI). Due to the nature of the EEG signals, EEG-based PI is typically done while the person is performing some kind of mental task, such as motor control. However, few works have considered EEG-based PI while the person is in different mental states (affective EEG). The aim of this paper is to improve the performance…

    Submitted 29 April, 2019; v1 submitted 5 July, 2018; originally announced July 2018.

    Comments: 10 pages

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems (2019)

  18. arXiv:1805.11491  [pdf]

    cs.CV

    Rice Classification Using Spatio-Spectral Deep Convolutional Neural Network

    Authors: Itthi Chatnuntawech, Kittipong Tantisantisom, Paisan Khanchaitit, Thitikorn Boonkoom, Berkin Bilgic, Ekapol Chuangsuwanich

    Abstract: Rice has been one of the staple foods that contribute significantly to human food supplies. Numerous rice varieties have been cultivated, imported, and exported worldwide. Different rice varieties could be mixed during rice production and trading. Rice impurities could damage the trust between rice importers and exporters, calling for the need to develop a rice variety inspection system. In this w…

    Submitted 25 June, 2019; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: 22 pages, 10 figures, 6 tables; more methods and experiments included with references; link to github included; article restructured for clarity; typos fixed

  19. arXiv:1510.08985  [pdf, other]

    cs.CL cs.LG cs.NE eess.AS

    Prediction-Adaptation-Correction Recurrent Neural Networks for Low-Resource Language Speech Recognition

    Authors: Yu Zhang, Ekapol Chuangsuwanich, James Glass, Dong Yu

    Abstract: In this paper, we investigate the use of prediction-adaptation-correction recurrent neural networks (PAC-RNNs) for low-resource speech recognition. A PAC-RNN is comprised of a pair of neural networks in which a correction network uses auxiliary information given by a prediction network to help estimate the state probability. The information from the correction network is also used by t…

    Submitted 30 October, 2015; originally announced October 2015.