Skip to main content

Showing 1–2 of 2 results for author: Luong, T S

.
  1. arXiv:2406.14835  [pdf, other

    cs.CL cs.LG

    ToVo: Toxicity Taxonomy via Voting

    Authors: Tinh Son Luong, Thanh-Thien Le, Thang Viet Doan, Linh Ngo Van, Thien Huu Nguyen, Diep Thi-Ngoc Nguyen

    Abstract: Existing toxic detection models face significant limitations, such as lack of transparency, customization, and reproducibility. These challenges stem from the closed-source nature of their training data and the paucity of explanations for their evaluation mechanism. To address these issues, we propose a dataset creation mechanism that integrates voting and chain-of-thought processes, producing a h… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2405.10659  [pdf, other

    cs.CL cs.AI

    Realistic Evaluation of Toxicity in Large Language Models

    Authors: Tinh Son Luong, Thanh-Thien Le, Linh Ngo Van, Thien Huu Nguyen

    Abstract: Large language models (LLMs) have become integral to our professional workflows and daily lives. Nevertheless, these machine companions of ours have a critical flaw: the huge amount of data which endows them with vast and diverse knowledge, also exposes them to the inevitable toxicity and bias. While most LLMs incorporate defense mechanisms to prevent the generation of harmful content, these safeg… ▽ More

    Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Findings of ACL 2024