Skip to main content

Showing 1–12 of 12 results for author: Le, L T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16008  [pdf, other

    cs.CL cs.AI cs.LG

    Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization

    Authors: Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister

    Abstract: Large language models (LLMs), even when specifically trained to process long input contexts, struggle to capture relevant information located in the middle of their input. This phenomenon has been known as the lost-in-the-middle problem. In this work, we make three contributions. First, we set out to understand the factors that cause this phenomenon. In doing so, we establish a connection between… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: ACL Findings 2024

  2. arXiv:2406.05365  [pdf, other

    cs.CL cs.AI cs.LG

    CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation

    Authors: I-Hung Hsu, Zifeng Wang, Long T. Le, Lesly Miculicich, Nanyun Peng, Chen-Yu Lee, Tomas Pfister

    Abstract: Grounded generation aims to equip language models (LMs) with the ability to produce more credible and accountable responses by accurately citing verifiable sources. However, existing methods, by either feeding LMs with raw or preprocessed materials, remain prone to errors. To address this, we introduce CaLM, a novel verification framework. CaLM leverages the insight that a robust grounded response… ▽ More

    Submitted 24 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Camera Ready Version

  3. arXiv:2405.15230  [pdf, other

    cs.AI cs.LG

    $i$REPO: $i$mplicit Reward Pairwise Difference based Empirical Preference Optimization

    Authors: Long Tan Le, Han Shu, Tung-Anh Nguyen, Choong Seon Hong, Nguyen H. Tran

    Abstract: While astonishingly capable, large Language Models (LLM) can sometimes produce outputs that deviate from human expectations. Such deviations necessitate an alignment phase to prevent disseminating untruthful, toxic, or biased information. Traditional alignment methods based on reinforcement learning often struggle with the identified instability, whereas preference optimization methods are limited… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Under Review

  4. arXiv:2404.05875  [pdf, other

    cs.CL cs.AI cs.LG

    CodecLM: Aligning Language Models with Tailored Synthetic Data

    Authors: Zifeng Wang, Chun-Liang Li, Vincent Perot, Long T. Le, ** Miao, Zizhao Zhang, Chen-Yu Lee, Tomas Pfister

    Abstract: Instruction tuning has emerged as the key in aligning large language models (LLMs) with specific task instructions, thereby mitigating the discrepancy between the next-token prediction objective and users' actual goals. To reduce the labor and time cost to collect or annotate data by humans, researchers start to explore the use of LLMs to generate instruction-aligned synthetic data. Recent works f… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted to Findings of NAACL 2024

  5. arXiv:2309.15659  [pdf, other

    cs.LG cs.DC

    Federated Deep Equilibrium Learning: A Compact Shared Representation for Edge Communication Efficiency

    Authors: Long Tan Le, Tuan Dung Nguyen, Tung-Anh Nguyen, Choong Seon Hong, Nguyen H. Tran

    Abstract: Federated Learning (FL) is a prominent distributed learning paradigm facilitating collaboration among nodes within an edge network to co-train a global model without centralizing data. By shifting computation to the network edge, FL offers robust and responsive edge-AI solutions and enhance privacy-preservation. However, deploying deep FL models within edge environments is often hindered by commun… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  6. arXiv:2304.01220  [pdf, other

    eess.IV cs.CV

    Evaluating the impact of an explainable machine learning system on the interobserver agreement in chest radiograph interpretation

    Authors: Hieu H. Pham, Ha Q. Nguyen, Hieu T. Nguyen, Linh T. Le, Khanh Lam

    Abstract: We conducted a prospective study to measure the clinical impact of an explainable machine learning system on interobserver agreement in chest radiograph interpretation. The AI system, which we call as it VinDr-CXR when used as a diagnosis-supporting tool, significantly improved the agreement between six radiologists with an increase of 1.5% in mean Fleiss' Kappa. In addition, we also observed that… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: This work has been accepted for publication in IEEE Access. This is a short version submitted to the Midwest Machine Learning Symposium (MMLS 2023), Chicago, IL, USA

  7. arXiv:2212.12121  [pdf, other

    cs.LG

    Federated PCA on Grassmann Manifold for Anomaly Detection in IoT Networks

    Authors: Tung-Anh Nguyen, Jiayu He, Long Tan Le, Wei Bao, Nguyen H. Tran

    Abstract: In the era of Internet of Things (IoT), network-wide anomaly detection is a crucial part of monitoring IoT networks due to the inherent security vulnerabilities of most IoT devices. Principal Components Analysis (PCA) has been proposed to separate network traffics into two disjoint subspaces corresponding to normal and malicious behaviors for anomaly detection. However, the privacy concerns and li… ▽ More

    Submitted 10 January, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: accepted at IEEE INFOCOM 2023

  8. arXiv:2208.03545  [pdf, other

    eess.IV cs.CV

    An Accurate and Explainable Deep Learning System Improves Interobserver Agreement in the Interpretation of Chest Radiograph

    Authors: Hieu H. Pham, Ha Q. Nguyen, Hieu T. Nguyen, Linh T. Le, Lam Khanh

    Abstract: Recent artificial intelligence (AI) algorithms have achieved radiologist-level performance on various medical classification tasks. However, only a few studies addressed the localization of abnormal findings from CXR scans, which is essential in explaining the image-level classification to radiologists. We introduce in this paper an explainable deep learning system called VinDr-CXR that can classi… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

  9. arXiv:2206.01432  [pdf, other

    cs.LG cs.DC

    On the Generalization of Wasserstein Robust Federated Learning

    Authors: Tung-Anh Nguyen, Tuan Dung Nguyen, Long Tan Le, Canh T. Dinh, Nguyen H. Tran

    Abstract: In federated learning, participating clients typically possess non-i.i.d. data, posing a significant challenge to generalization to unseen distributions. To address this, we propose a Wasserstein distributionally robust optimization scheme called WAFL. Leveraging its duality, we frame WAFL as an empirical surrogate risk minimization problem, and solve it using a local SGD-based algorithm with conv… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  10. arXiv:2203.11205  [pdf, other

    eess.IV cs.CV

    VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography

    Authors: Hieu T. Nguyen, Ha Q. Nguyen, Hieu H. Pham, Khanh Lam, Linh T. Le, Minh Dao, Van Vu

    Abstract: Mammography, or breast X-ray, is the most widely used imaging modality to detect cancer and other breast diseases. Recent studies have shown that deep learning-based computer-assisted detection and diagnosis (CADe or CADx) tools have been developed to support physicians and improve the accuracy of interpreting mammography. However, most published datasets of mammography are either limited on sampl… ▽ More

    Submitted 16 March, 2023; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: The manuscript is accepted for publication by Scientific Data (Nature)

  11. arXiv:2008.00646  [pdf, other

    cs.LG stat.ML

    Interpretable Sequence Learning for COVID-19 Forecasting

    Authors: Sercan O. Arik, Chun-Liang Li, **sung Yoon, Rajarishi Sinha, Arkady Epshteyn, Long T. Le, Vikas Menon, Shashank Singh, Leyou Zhang, Nate Yoder, Martin Nikoltchev, Yash Sonthalia, Hootan Nakhost, Elli Kanal, Tomas Pfister

    Abstract: We propose a novel approach that integrates machine learning into compartmental disease modeling to predict the progression of COVID-19. Our model is explainable by design as it explicitly shows how different compartments evolve and it uses interpretable encoders to incorporate covariates and improve performance. Explainability is valuable to ensure that the model's forecasts are credible to epide… ▽ More

    Submitted 13 January, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

  12. arXiv:1905.06647  [pdf

    cs.CL

    Using Entity Relations for Opinion Mining of Vietnamese Comments

    Authors: P. T. Nguyen, L. T. Le, V. M. Ngo, P. M. Nguyen

    Abstract: In this paper, we propose several novel techniques to extract and mining opinions of Vietnamese reviews of customers about a number of products traded on e-commerce in Vietnam. The assessment is based on the emotional level of customers on a specific product such as mobile and laptop. We exploit the features of the products because they are much interested by customers and have many products in th… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: 14 pages, in Vietnamese

    Journal ref: Journal of Science and Technology, Vietnam Academy of Science and Technology, Vol. 52, No. 4D, pp. 120-132 (2014)