Showing 1–15 of 15 results for author: Le, D D

  1. arXiv:2405.19612

    cs.IR

    Keyword-driven Retrieval-Augmented Large Language Models for Cold-start User Recommendations

    Authors: Hai-Dang Kieu, Minh Duc Nguyen, Thanh-Son Nguyen, Dung D. Le

    Abstract: Recent advancements in Large Language Models (LLMs) have shown significant potential in enhancing recommender systems. However, addressing the cold-start recommendation problem, where users lack historical data, remains a considerable challenge. In this paper, we introduce KALM4Rec (Keyword-driven Retrieval-Augmented Large Language Models for Cold-start User Recommendations), a novel framework spe…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 10 pages, 10 figures, 4 tables

  2. arXiv:2403.19161

    cs.CL

    Improving Vietnamese-English Medical Machine Translation

    Authors: Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi, Wray Buntine

    Abstract: Machine translation for Vietnamese-English in the medical domain is still an under-explored research area. In this paper, we introduce MedEV -- a high-quality Vietnamese-English parallel dataset constructed specifically for the medical domain, comprising approximately 360K sentence pairs. We conduct extensive experiments comparing Google Translate, ChatGPT (gpt-3.5-turbo), state-of-the-art Vietnam…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: To appear in Proceedings of LREC-COLING 2024

  3. arXiv:2403.02715

    cs.CL cs.AI

    Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

    Authors: Sang T. Truong, Duc Q. Nguyen, Toan Nguyen, Dung D. Le, Nhi N. Truong, Tho Quan, Sanmi Koyejo

    Abstract: Recent advancements in large language models (LLMs) have underscored their importance in the evolution of artificial intelligence. However, despite extensive pretraining on multilingual datasets, available open-sourced LLMs exhibit limited effectiveness in processing Vietnamese. The challenge is exacerbated by the absence of systematic benchmark datasets and metrics tailored for Vietnamese LLM eva…

    Submitted 26 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 51 pages

    MSC Class: 68T50

  4. arXiv:2402.14305

    cs.IR cs.LG

    Towards Efficient Pareto-optimal Utility-Fairness between Groups in Repeated Rankings

    Authors: Phuong Dinh Mai, Duc-Trong Le, Tuan-Anh Hoang, Dung D. Le

    Abstract: In this paper, we tackle the problem of computing a sequence of rankings with the guarantee of the Pareto-optimal balance between (1) maximizing the utility of the consumers and (2) minimizing unfairness between producers of the items. Such a multi-objective optimization problem is typically solved using a combination of a scalarization method and linear programming on bi-stochastic matrices, repr…

    Submitted 22 February, 2024; originally announced February 2024.

  5. arXiv:2402.11469

    cs.LG cs.CL cs.CR

    A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models

    Authors: Cuong Dang, Dung D. Le, Thai Le

    Abstract: Existing works have shown that fine-tuned textual transformer models achieve state-of-the-art prediction performances but are also vulnerable to adversarial text perturbations. Traditional adversarial evaluation is often done only after fine-tuning the models and ignoring the training data. In this paper, we want to prove that there is also a strong correlation between training data and m…

    Submitted 1 July, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL Findings 2024

  6. arXiv:2402.03292

    cs.LG cs.CV

    Zero-shot Object-Level OOD Detection with Context-Aware Inpainting

    Authors: Quang-Huy Nguyen, Jin Peng Zhou, Zhenzhen Liu, Khanh-Huyen Bui, Kilian Q. Weinberger, Dung D. Le

    Abstract: Machine learning algorithms are increasingly provided as black-box cloud services or pre-trained models, without access to their training data. This motivates the problem of zero-shot out-of-distribution (OOD) detection. Concretely, we aim to detect OOD objects that do not belong to the classifier's label set but are erroneously classified as in-distribution (ID) objects. Our approach, RONIN, uses…

    Submitted 6 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  7. arXiv:2401.03748

    cs.LG cs.CR cs.DC cs.IR

    Towards Efficient Communication and Secure Federated Recommendation System via Low-rank Training

    Authors: Ngoc-Hieu Nguyen, Tuan-Anh Nguyen, Tuan Nguyen, Vu Tien Hoang, Dung D. Le, Kok-Seng Wong

    Abstract: Federated Recommendation (FedRec) systems have emerged as a solution to safeguard users' data in response to growing regulatory concerns. However, one of the major challenges in these systems lies in the communication costs that arise from the need to transmit neural network models between user devices and a central server. Prior approaches to these challenges often lead to issues such as computat…

    Submitted 28 February, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: 12 pages, 6 figures, 4 tables

  8. arXiv:2311.15297

    cs.LG math.OC

    Controllable Expensive Multi-objective Learning with Warm-starting Bayesian Optimization

    Authors: Quang-Huy Nguyen, Long P. Hoang, Hoang V. Viet, Dung D. Le

    Abstract: Pareto Set Learning (PSL) is a promising approach for approximating the entire Pareto front in multi-objective optimization (MOO) problems. However, existing derivative-free PSL methods are often unstable and inefficient, especially for expensive black-box MOO problems where objective function evaluations are costly. In this work, we propose to address the instability and inefficiency of existing…

    Submitted 9 February, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

  9. arXiv:2307.04514

    cs.LG cs.AI

    Improving Heterogeneous Graph Learning with Weighted Mixed-Curvature Product Manifold

    Authors: Tuc Nguyen-Van, Dung D. Le, The-Anh Ta

    Abstract: In graph representation learning, it is important that the complex geometric structure of the input graph, e.g. hidden relations among nodes, is well captured in embedding space. However, standard Euclidean embedding spaces have a limited capacity in representing graphs of varying structures. A promising candidate for the faithful embedding of data with varying structure is product manifolds of co…

    Submitted 10 July, 2023; originally announced July 2023.

  10. arXiv:2304.09093

    cs.IR cs.CL cs.LG

    Improving Items and Contexts Understanding with Descriptive Graph for Conversational Recommendation

    Authors: Huy Dao, Dung D. Le, Cuong Chu

    Abstract: State-of-the-art methods on conversational recommender systems (CRS) leverage external knowledge to enhance both items' and contextual words' representations to achieve high quality recommendations and responses generation. However, the representations of the items and words are usually modeled in two separated semantic spaces, which leads to misalignment issue between them. Consequently, this wil…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: 14 pages, 3 figures, 9 tables

  11. arXiv:2302.12487

    math.OC

    A Framework for Controllable Pareto Front Learning with Completed Scalarization Functions and its Applications

    Authors: Tran Anh Tuan, Long P. Hoang, Dung D. Le, Tran Ngoc Thang

    Abstract: Pareto Front Learning (PFL) was recently introduced as an efficient method for approximating the entire Pareto front, the set of all optimal solutions to a Multi-Objective Optimization (MOO) problem. In the previous work, the mapping between a preference vector and a Pareto optimal solution is still ambiguous, rendering its results. This study demonstrates the convergence and completion aspects of…

    Submitted 13 August, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Under Review at Neural Networks Journal

  12. arXiv:2212.01130

    cs.LG

    Improving Pareto Front Learning via Multi-Sample Hypernetworks

    Authors: Long P. Hoang, Dung D. Le, Tran Anh Tuan, Tran Ngoc Thang

    Abstract: Pareto Front Learning (PFL) was recently introduced as an effective approach to obtain a mapping function from a given trade-off vector to a solution on the Pareto front, which solves the multi-objective optimization (MOO) problem. Due to the inherent trade-off between conflicting objectives, PFL offers a flexible approach in many scenarios in which the decision makers can not specify the preferen…

    Submitted 28 April, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI-23

  13. Enhancing Few-shot Image Classification with Cosine Transformer

    Authors: Quang-Huy Nguyen, Cuong Q. Nguyen, Dung D. Le, Hieu H. Pham

    Abstract: This paper addresses the few-shot image classification problem, where the classification task is performed on unlabeled query samples given a small amount of labeled support samples only. One major challenge of the few-shot learning problem is the large variety of object visual appearances that prevents the support samples to represent that object comprehensively. This might result in a significan…

    Submitted 21 July, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Journal ref: IEEE Access (2023)

  14. arXiv:2110.08678

    cs.LG cs.CL stat.ML

    Improving Transformers with Probabilistic Attention Keys

    Authors: Tam Nguyen, Tan M. Nguyen, Dung D. Le, Duy Khuong Nguyen, Viet-Anh Tran, Richard G. Baraniuk, Nhat Ho, Stanley J. Osher

    Abstract: Multi-head attention is a driving force behind state-of-the-art transformers, which achieve remarkable performance across a variety of natural language processing (NLP) and computer vision tasks. It has been observed that for many applications, those attention heads learn redundant embedding, and most of them can be removed without degrading the performance of the model. Inspired by this observati…

    Submitted 12 June, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: 27 pages, 16 figures, 10 tables

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

  15. arXiv:2012.15029

    eess.IV

    VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations

    Authors: Ha Q. Nguyen, Khanh Lam, Linh T. Le, Hieu H. Pham, Dat Q. Tran, Dung B. Nguyen, Dung D. Le, Chi M. Pham, Hang T. T. Tong, Diep H. Dinh, Cuong D. Do, Luu T. Doan, Cuong N. Nguyen, Binh T. Nguyen, Que V. Nguyen, Au D. Hoang, Hien N. Phan, Anh T. Nguyen, Phuong H. Ho, Dat T. Ngo, Nghia T. Nguyen, Nhan T. Nguyen, Minh Dao, Van Vu

    Abstract: Most of the existing chest X-ray datasets include labels from a list of findings without specifying their locations on the radiographs. This limits the development of machine learning algorithms for the detection and localization of chest abnormalities. In this work, we describe a dataset of more than 100,000 chest X-ray scans that were retrospectively collected from two major hospitals in Vietnam…

    Submitted 20 March, 2022; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: 11 pages, under review by Nature Scientific Data