Showing 1–7 of 7 results for author: Vo, D T

  1. arXiv:2405.03411  [pdf, other]

    cs.RO

    Greedy Heuristics for Sampling-based Motion Planning in High-Dimensional State Spaces

    Authors: Phone Thiha Kyaw, Anh Vu Le, Lim Yi, Prabakaran Veerajagadheswar, Mohan Rajesh Elara, Dinh Tung Vo, Minh Bui Vu

    Abstract: Sampling-based motion planning algorithms are very effective at finding solutions in high-dimensional continuous state spaces as they do not require prior approximations of the problem domain compared to traditional discrete graph-based searches. The anytime version of the Rapidly-exploring Random Trees (RRT) algorithm, denoted as RRT*, often finds high-quality solutions by incrementally approxima…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: To be published at the International Journal of Robotics Research (IJRR)

  2. OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese

    Authors: Nghia Hieu Nguyen, Duong T. D. Vo, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

    Abstract: In recent years, visual question answering (VQA) has attracted attention from the research community because of its highly potential applications (such as virtual assistance on intelligent cars, assistant devices for blind people, or information retrieval from document images using natural language as queries) and challenge. The VQA task requires methods that have the ability to fuse the informati…

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: submitted to Elsevier

  3. EVJVQA Challenge: Multilingual Visual Question Answering

    Authors: Ngan Luu-Thuy Nguyen, Nghia Hieu Nguyen, Duong T. D. Vo, Khanh Quoc Tran, Kiet Van Nguyen

    Abstract: Visual Question Answering (VQA) is a challenging task of natural language processing (NLP) and computer vision (CV), attracting significant attention from researchers. English is a resource-rich language that has witnessed various developments in datasets and models for visual question answering. Visual question answering in other languages also would be developed for resources and models. In addi…

    Submitted 17 April, 2024; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: VLSP2022 EVJVQA challenge

  4. arXiv:2212.14353  [pdf, other]

    cs.DC eess.SP

    Sheaf-theoretic self-filtering network of low-cost sensors for local air quality monitoring: A causal approach

    Authors: Anh-Duy Pham, Chuong Dinh Le, Hoang Viet Pham, Thinh Gia Tran, Dat Thanh Vo, Chau Long Tran, An Dinh Le, Hien Bich Vo

    Abstract: Sheaf theory, which is a complex but powerful tool supported by topological theory, offers more flexibility and precision than traditional graph theory when it comes to modeling relationships between multiple features. In the realm of air quality monitoring, this can be incredibly useful in detecting sudden changes in local dust particle density, which can be difficult to accurately measure using…

    Submitted 29 December, 2022; originally announced December 2022.

  5. UIT-HWDB: Using Transferring Method to Construct A Novel Benchmark for Evaluating Unconstrained Handwriting Image Recognition in Vietnamese

    Authors: Nghia Hieu Nguyen, Duong T. D. Vo, Kiet Van Nguyen

    Abstract: Recognizing handwriting images is challenging due to the vast variation in writing style across many people and distinct linguistic aspects of writing languages. In Vietnamese, besides the modern Latin characters, there are accent and letter marks together with characters that draw confusion to state-of-the-art handwriting recognition methods. Moreover, as a low-resource language, there are not ma…

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted for publishing at the 16th International Conference on Computing and Communication Technologies (RIVF)

  6. arXiv:2211.05405  [pdf, other]

    cs.CV cs.CL

    VieCap4H-VLSP 2021: ObjectAoA-Enhancing performance of Object Relation Transformer with Attention on Attention for Vietnamese image captioning

    Authors: Nghia Hieu Nguyen, Duong T. D. Vo, Minh-Quan Ha

    Abstract: Image captioning is currently a challenging task that requires the ability to both understand visual information and use human language to describe this visual information in the image. In this paper, we propose an efficient way to improve the image understanding ability of transformer-based method by extending Object Relation Transformer architecture with Attention on Attention mechanism. Experim…

    Submitted 20 March, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted for publishing at the VNU Journal of Science: Computer Science and Communication Engineering

  7. arXiv:1910.06748  [pdf, other]

    cs.CL cs.SI

    Language Identification on Massive Datasets of Short Message using an Attention Mechanism CNN

    Authors: Duy Tin Vo, Richard Khoury

    Abstract: Language Identification (LID) is a challenging task, especially when the input texts are short and noisy such as posts and statuses on social media or chat logs on gaming forums. The task has been tackled by either designing a feature set for a traditional classifier (e.g. Naive Bayes) or applying a deep neural network classifier (e.g. Bi-directional Gated Recurrent Unit, Encoder-Decoder). These m…

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: 9 pages, 5 tables, 1 figure