Skip to main content

Showing 1–13 of 13 results for author: Nguyen, A G

.
  1. Detecting Spam Reviews on Vietnamese E-commerce Websites

    Authors: Co Van Dinh, Son T. Luu, Anh Gia-Tuan Nguyen

    Abstract: The reviews of customers play an essential role in online shop**. People often refer to reviews or comments of previous customers to decide whether to buy a new product. Catching up with this behavior, some people create untruths and illegitimate reviews to hoax customers about the fake quality of products. These are called spam reviews, confusing consumers on online shop** platforms and negat… ▽ More

    Submitted 8 December, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: Published at The 14th Asian Conference on Intelligent Information and Database Systems (ACIIDS 2022). The dataset is available at https://github.com/sonlam1102/vispamdetection

  2. arXiv:2204.07002  [pdf, other

    cs.CL

    XLMRQA: Open-Domain Question Answering on Vietnamese Wikipedia-based Textual Knowledge Source

    Authors: Kiet Van Nguyen, Phong Nguyen-Thuan Do, Nhat Duy Nguyen, Tin Van Huynh, Anh Gia-Tuan Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Question answering (QA) is a natural language understanding task within the fields of information retrieval and information extraction that has attracted much attention from the computational linguistics and artificial intelligence research community in recent years because of the strong development of machine reading comprehension-based models. A reader-based QA system is a high-level search engi… ▽ More

    Submitted 13 August, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted by ACIIDS 2022

  3. B-DAC: A Decentralized Access Control Framework on Northbound Interface for Securing SDN Using Blockchain

    Authors: Phan The Duy, Hien Do Hoang, Do Thi Thu Hien, Anh Gia-Tuan Nguyen, Van-Hau Pham

    Abstract: Software-Defined Network (SDN) is a new arising terminology of network architecture with outstanding features of orchestration by decoupling the control plane and the data plane in each network element. Even though it brings several benefits, SDN is vulnerable to a diversity of attacks. Abusing the single point of failure in the SDN controller component, hackers can shut down all network operation… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 23 pages, 14 figures, 14 tables

    Report number: Volume 64, February 2022

    Journal ref: Journal of Information Security and Applications, 2022

  4. arXiv:2108.13741  [pdf, other

    cs.CL cs.AI

    Monolingual versus Multilingual BERTology for Vietnamese Extractive Multi-Document Summarization

    Authors: Huy Quoc To, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen, Anh Gia-Tuan Nguyen

    Abstract: Recent researches have demonstrated that BERT shows potential in a wide range of natural language processing tasks. It is adopted as an encoder for many state-of-the-art automatic summarizing systems, which achieve excellent performance. However, so far, there is not much work done for Vietnamese. In this paper, we showcase how BERT can be implemented for extractive text summarization in Vietnames… ▽ More

    Submitted 16 October, 2021; v1 submitted 31 August, 2021; originally announced August 2021.

  5. arXiv:2105.09043  [pdf, other

    cs.CL

    Sentence Extraction-Based Machine Reading Comprehension for Vietnamese

    Authors: Phong Nguyen-Thuan Do, Nhat Duy Nguyen, Tin Van Huynh, Kiet Van Nguyen, Anh Gia-Tuan Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: The development of natural language processing (NLP) in general and machine reading comprehension in particular has attracted the great attention of the research community. In recent years, there are a few datasets for machine reading comprehension tasks in Vietnamese with large sizes, such as UIT-ViQuAD and UIT-ViNewsQA. However, the datasets are not diverse in answers to serve the research. In t… ▽ More

    Submitted 11 June, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: Accepted by KSEM 2021 (International Conference on Knowledge Science, Engineering and Management)

  6. Gender Prediction Based on Vietnamese Names with Machine Learning Techniques

    Authors: Huy Quoc To, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen, Anh Gia-Tuan Nguyen

    Abstract: As biological gender is one of the aspects of presenting individual human, much work has been done on gender classification based on people names. The proposals for English and Chinese languages are tremendous; still, there have been few works done for Vietnamese so far. We propose a new dataset for gender prediction based on Vietnamese names. This dataset comprises over 26,000 full names annotate… ▽ More

    Submitted 23 March, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: 6 pages, 6 figures. NLPIR 2020: 4th International Conference on Natural Language Processing and Information Retrieval

  7. arXiv:2009.14725  [pdf, other

    cs.CL

    A Vietnamese Dataset for Evaluating Machine Reading Comprehension

    Authors: Kiet Van Nguyen, Duc-Vu Nguyen, Anh Gia-Tuan Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Over 97 million people speak Vietnamese as their native language in the world. However, there are few research studies on machine reading comprehension (MRC) for Vietnamese, the task of understanding a text and answering questions related to it. Due to the lack of benchmark datasets for Vietnamese, we present the Vietnamese Question Answering Dataset (UIT-ViQuAD), a new dataset for the low-resourc… ▽ More

    Submitted 7 November, 2020; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: Accepted by The 28th International Conference on Computational Linguistics (COLING 2020)

  8. An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension

    Authors: Son T. Luu, Kiet Van Nguyen, Anh Gia-Tuan Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Machine reading comprehension (MRC) is a challenging task in natural language processing that makes computers understanding natural language texts and answer questions based on those texts. There are many techniques for solving this problems, and word representation is a very important technique that impact most to the accuracy of machine reading comprehension problem in the popular languages like… ▽ More

    Submitted 18 February, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: Published in the 2020 IEEE Eighth International Conference on Communications and Electronics (ICCE)

  9. arXiv:2006.11138  [pdf, other

    cs.CL

    New Vietnamese Corpus for Machine Reading Comprehension of Health News Articles

    Authors: Kiet Van Nguyen, Tin Van Huynh, Duc-Vu Nguyen, Anh Gia-Tuan Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Large-scale and high-quality corpora are necessary for evaluating machine reading comprehension models on a low-resource language like Vietnamese. Besides, machine reading comprehension (MRC) for the health domain offers great potential for practical applications; however, there is still very little MRC research in this domain. This paper presents ViNewsQA as a new corpus for the Vietnamese langua… ▽ More

    Submitted 11 February, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

  10. Enhancing lexical-based approach with external knowledge for Vietnamese multiple-choice machine reading comprehension

    Authors: Kiet Van Nguyen, Khiem Vinh Tran, Son T. Luu, Anh Gia-Tuan Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: Although Vietnamese is the 17th most popular native-speaker language in the world, there are not many research studies on Vietnamese machine reading comprehension (MRC), the task of understanding a text and answering questions about it. One of the reasons is because of the lack of high-quality benchmark datasets for this task. In this work, we construct a dataset which consists of 2,783 pairs of m… ▽ More

    Submitted 1 November, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

    Journal ref: IEEE Access, 2020

  11. arXiv:1912.12214  [pdf, other

    cs.CL

    Job Prediction: From Deep Neural Network Models to Applications

    Authors: Tin Van Huynh, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen, Anh Gia-Tuan Nguyen

    Abstract: Determining the job is suitable for a student or a person looking for work based on their job's descriptions such as knowledge and skills that are difficult, as well as how employers must find ways to choose the candidates that match the job they require. In this paper, we focus on studying the job prediction using different deep neural network models including TextCNN, Bi-GRU-LSTM-CNN, and Bi-GRU… ▽ More

    Submitted 31 January, 2020; v1 submitted 27 December, 2019; originally announced December 2019.

    Comments: Accepted by IEEE RIVF 2020 Conference

  12. arXiv:1911.03648  [pdf, other

    cs.CL cs.LG

    Hate Speech Detection on Vietnamese Social Media Text using the Bidirectional-LSTM Model

    Authors: Hang Thi-Thuy Do, Huy Duc Huynh, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen, Anh Gia-Tuan Nguyen

    Abstract: In this paper, we describe our system which participates in the shared task of Hate Speech Detection on Social Networks of VLSP 2019 evaluation campaign. We are provided with the pre-labeled dataset and an unlabeled dataset for social media comments or posts. Our mission is to pre-process and build machine learning models to classify comments/posts. In this report, we use Bidirectional Long Short-… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

    Journal ref: VLSP Workshop 2019

  13. arXiv:1911.03644  [pdf, other

    cs.CL

    Hate Speech Detection on Vietnamese Social Media Text using the Bi-GRU-LSTM-CNN Model

    Authors: Tin Van Huynh, Vu Duc Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen, Anh Gia-Tuan Nguyen

    Abstract: In recent years, Hate Speech Detection has become one of the interesting fields in natural language processing or computational linguistics. In this paper, we present the description of our system to solve this problem at the VLSP shared task 2019: Hate Speech Detection on Social Networks with the corpus which contains 20,345 human-labeled comments/posts for training and 5,086 for public-testing.… ▽ More

    Submitted 21 December, 2019; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: Technical Report, VLSP Workshop 2019

    Journal ref: VLSP Workshop 2019