Showing 1–3 of 3 results for author: Thieu, T

  1. arXiv:2406.09637

    cs.CV

    Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings

    Authors: Keno Moenck, Duc Trung Thieu, Julian Koch, Thorsten Schüppstuhl

    Abstract: In recent years, the upstream of Large Language Models (LLM) has also encouraged the computer vision community to work on substantial multimodal datasets and train models on a scale in a self-/semi-supervised manner, resulting in Vision Foundation Models (VFM), as, e.g., Contrastive Language-Image Pre-training (CLIP). The models generalize well and perform outstandingly on everyday objects or scen…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Dataset at https://github.com/kenomo/ilid; training- and evaluation-related code at https://github.com/kenomo/industrial-clip

  2. arXiv:2312.10202

    cs.CL

    Low-resource classification of mobility functioning information in clinical sentences using large language models

    Authors: Tuan Dung Le, Thanh Duong, Thanh Thieu

    Abstract: Objective: Function is increasingly recognized as an important indicator of whole-person health. This study evaluates the ability of publicly available large language models (LLMs) to accurately identify the presence of functioning information from clinical notes. We explore various strategies to improve the performance on this task. Materials and Methods: We collect a balanced binary classificati…

    Submitted 15 December, 2023; originally announced December 2023.

  3. arXiv:2311.15946

    cs.CL

    Leveraging deep active learning to identify low-resource mobility functioning information in public clinical notes

    Authors: Tuan-Dung Le, Zhuqi Miao, Samuel Alvarado, Brittany Smith, William Paiva, Thanh Thieu

    Abstract: Function is increasingly recognized as an important indicator of whole-person health, although it receives little attention in clinical natural language processing research. We introduce the first public annotated dataset specifically on the Mobility domain of the International Classification of Functioning, Disability and Health (ICF), aiming to facilitate automatic extraction and analysis of fun…

    Submitted 27 November, 2023; originally announced November 2023.