Skip to main content

Showing 1–9 of 9 results for author: Phuong, T M

.
  1. arXiv:2407.01983  [pdf, other

    cs.CV

    SADL: An Effective In-Context Learning Method for Compositional Visual QA

    Authors: Long Hoang Dang, Thao Minh Le, Vuong Le, Tu Minh Phuong, Truyen Tran

    Abstract: Large vision-language models (LVLMs) offer a novel capability for performing in-context learning (ICL) in Visual QA. When prompted with a few demonstrations of image-question-answer triplets, LVLMs have demonstrated the ability to discern underlying patterns and transfer this latent knowledge to answer new questions about unseen images without the need for expensive supervised fine-tuning. However… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2308.14654  [pdf, other

    cs.CL cs.AI

    Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-Distillation

    Authors: Nguyen Anh Tu, Hoang Thi Thu Uyen, Tu Minh Phuong, Ngo Xuan Bach

    Abstract: Multiple intent detection and slot filling are two fundamental and crucial tasks in spoken language understanding. Motivated by the fact that the two tasks are closely related, joint models that can detect intents and extract slots simultaneously are preferred to individual models that perform each task independently. The accuracy of a joint model depends heavily on the ability of the model to tra… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted at ECAI 2023

  3. Analyzing Vietnamese Legal Questions Using Deep Neural Networks with Biaffine Classifiers

    Authors: Nguyen Anh Tu, Hoang Thi Thu Uyen, Tu Minh Phuong, Ngo Xuan Bach

    Abstract: In this paper, we propose using deep neural networks to extract important information from Vietnamese legal questions, a fundamental task towards building a question answering system in the legal domain. Given a legal question in natural language, the goal is to extract all the segments that contain the needed information to answer the question. We introduce a deep model that solves the task in th… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: accepted as the oral presentation at ICONIP 2021

  4. arXiv:2207.03656  [pdf, other

    cs.CV cs.LG

    Video Dialog as Conversation about Objects Living in Space-Time

    Authors: Hoang-Anh Pham, Thao Minh Le, Vuong Le, Tu Minh Phuong, Truyen Tran

    Abstract: It would be a technological feat to be able to create a system that can hold a meaningful conversation with humans about what they watch. A setup toward that goal is presented as a video dialog task, where the system is asked to generate natural utterances in response to a question in an ongoing dialog. The task poses great visual, linguistic, and reasoning challenges that cannot be easily overcom… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022, code will be available at https://github.com/hoanganhpham1006/COST

  5. arXiv:2204.03324  [pdf, other

    cs.CL cs.AI

    Autoencoding Language Model Based Ensemble Learning for Commonsense Validation and Explanation

    Authors: Ngo Quang Huy, Tu Minh Phuong, Ngo Xuan Bach

    Abstract: An ultimate goal of artificial intelligence is to build computer systems that can understand human languages. Understanding commonsense knowledge about the world expressed in text is one of the foundational and challenging problems to create such intelligent systems. As a step towards this goal, we present in this paper ALMEn, an Autoencoding Language Model based Ensemble learning method for commo… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

  6. arXiv:2003.06858  [pdf

    cs.CL cs.AI cs.IR

    Leveraging Foreign Language Labeled Data for Aspect-Based Opinion Mining

    Authors: Nguyen Thi Thanh Thuy, Ngo Xuan Bach, Tu Minh Phuong

    Abstract: Aspect-based opinion mining is the task of identifying sentiment at the aspect level in opinionated text, which consists of two subtasks: aspect category extraction and sentiment polarity classification. While aspect category extraction aims to detect and categorize opinion targets such as product features, sentiment polarity classification assigns a sentiment label, i.e. positive, negative, or ne… ▽ More

    Submitted 15 March, 2020; originally announced March 2020.

  7. Classifying Vietnamese Disease Outbreak Reports with Important Sentences and Rich Features

    Authors: Son Doan, Nguyen Thi Ngoc Vinh, Tu Minh Phuong

    Abstract: Text classification is an important field of research from mid 90s up to now. It has many applications, one of them is in Web-based biosurveillance systems which identify and summarize online disease outbreak reports. In this paper we focus on classifying Vietnamese disease outbreak reports. We investigate important properties of disease outbreak reports, e.g., sentences containing names of outbre… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: 5 pages, 2 tables

    Journal ref: Proc. of the Third Symposium on Information and Communication Technology (SoICT), pages 260-265, 2012

  8. arXiv:1703.09296  [pdf, other

    cs.CV

    Femoral ROIs and Entropy for Texture-based Detection of Osteoarthritis from High-Resolution Knee Radiographs

    Authors: Jiří Hladůvka, Bui Thi Mai Phuong, Richard Ljuhar, Davul Ljuhar, Ana M Rodrigues, Jaime C Branco, Helena Canhão

    Abstract: The relationship between knee osteoarthritis progression and changes in tibial bone structure has long been recognized and various texture descriptors have been proposed to detect early osteoarthritis (OA) from radiographs. This work aims to investigate (1) femoral textures as an OA indicator and (2) the potential of entropy as a computationally efficient alternative to established texture descrip… ▽ More

    Submitted 27 March, 2017; originally announced March 2017.

  9. Natural Language Processing in Biomedicine: A Unified System Architecture Overview

    Authors: Son Doan, Mike Conway, Tu Minh Phuong, Lucila Ohno-Machado

    Abstract: In modern electronic medical records (EMR) much of the clinically important data - signs and symptoms, symptom severity, disease status, etc. - are not provided in structured data fields, but rather are encoded in clinician generated narrative text. Natural language processing (NLP) provides a means of "unlocking" this important data source for applications in clinical decision support, quality… ▽ More

    Submitted 8 January, 2014; v1 submitted 2 January, 2014; originally announced January 2014.

    Comments: 25 pages, 5 figures, book chapter in Clinical Bioinformatics, 2014, edited by Ronand Trent