Skip to main content

Showing 1–50 of 81 results for author: Nguyen, H T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.20077  [pdf, other

    cs.CV

    HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model

    Authors: Hieu T. Nguyen, Yiwen Chen, Vikram Voleti, Varun Jampani, Huaizu Jiang

    Abstract: We introduce HouseCrafter, a novel approach that can lift a floorplan into a complete large 3D indoor scene (e.g., a house). Our key insight is to adapt a 2D diffusion model, which is trained on web-scale images, to generate consistent multi-view color (RGB) and depth (D) images across different locations of the scene. Specifically, the RGB-D images are generated autoregressively in a batch-wise m… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.17381  [pdf, other

    cs.LG cs.CV

    Forget but Recall: Incremental Latent Rectification in Continual Learning

    Authors: Nghia D. Nguyen, Hieu Trung Nguyen, Ang Li, Hoang Pham, Viet Anh Nguyen, Khoa D. Doan

    Abstract: Intrinsic capability to continuously learn a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which hinders remembering past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches either retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper in… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.02317  [pdf, other

    cs.LG cs.AI stat.ML

    Generative Conditional Distributions by Neural (Entropic) Optimal Transport

    Authors: Bao Nguyen, Binh Nguyen, Hieu Trung Nguyen, Viet Anh Nguyen

    Abstract: Learning conditional distributions is challenging because the desired outcome is not a single distribution but multiple distributions that correspond to multiple instances of the covariates. We introduce a novel neural entropic optimal transport method designed to effectively learn generative models of conditional distributions, particularly in scenarios characterized by limited sample sizes. Our… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 15 pages, 8 figures

  4. arXiv:2406.00973  [pdf, other

    cs.IR cs.LG

    Cold-start Recommendation by Personalized Embedding Region Elicitation

    Authors: Hieu Trung Nguyen, Duy Nguyen, Khoa Doan, Viet Anh Nguyen

    Abstract: Rating elicitation is a success element for recommender systems to perform well at cold-starting, in which the systems need to recommend items to a newly arrived user with no prior knowledge about the user's preference. Existing elicitation methods employ a fixed set of items to learn the user's preference and then infer the users' preferences on the remaining items. Using a fixed seed set can lim… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted at UAI 2024

  5. arXiv:2405.14352  [pdf, other

    cs.LG

    Explaining Graph Neural Networks via Structure-aware Interaction Index

    Authors: Ngoc Bui, Hieu Trung Nguyen, Viet Anh Nguyen, Rex Ying

    Abstract: The Shapley value is a prominent tool for interpreting black-box machine learning models thanks to its strong theoretical foundation. However, for models with structured inputs, such as graph neural networks, existing Shapley-based explainability approaches either focus solely on node-wise importance or neglect the graph structure when perturbing the input instance. This paper introduces the Myers… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 30 pages, ICML'24

  6. arXiv:2405.01021  [pdf, other

    cs.ET quant-ph

    QSimPy: A Learning-centric Simulation Framework for Quantum Cloud Resource Management

    Authors: Hoa T. Nguyen, Muhammad Usman, Rajkumar Buyya

    Abstract: Quantum cloud computing is an emerging computing paradigm that allows seamless access to quantum hardware as cloud-based services. However, effective use of quantum resources is challenging and necessitates robust simulation frameworks for effective resource management design and evaluation. To address this need, we proposed QSimPy, a novel discrete-event simulation framework designed with the mai… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  7. arXiv:2404.11420  [pdf, other

    cs.ET cs.DC

    Quantum Cloud Computing: A Review, Open Problems, and Future Directions

    Authors: Hoa T. Nguyen, Prabhakar Krishnan, Dilip Krishnaswamy, Muhammad Usman, Rajkumar Buyya

    Abstract: Quantum cloud computing is an emerging paradigm of computing that empowers quantum applications and their deployment on quantum computing resources without the need for a specialized environment to host and operate physical quantum computers. This paper reviews recent advances, identifies open problems, and proposes future directions in quantum cloud computing. It discusses the state-of-the-art qu… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  8. arXiv:2402.13822  [pdf, other

    cs.CV

    MSTAR: Multi-Scale Backbone Architecture Search for Timeseries Classification

    Authors: Tue M. Cao, Nhat H. Tran, Hieu H. Pham, Hung T. Nguyen, Le P. Nguyen

    Abstract: Most of the previous approaches to Time Series Classification (TSC) highlight the significance of receptive fields and frequencies while overlooking the time resolution. Hence, unavoidably suffered from scalability issues as they integrated an extensive range of receptive fields into classification models. Other methods, while having a better adaptation for large datasets, require manual design an… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  9. arXiv:2402.04982  [pdf, other

    cs.LG cs.DB

    Beyond explaining: XAI-based Adaptive Learning with SHAP Clustering for Energy Consumption Prediction

    Authors: Tobias Clement, Hung Truong Thanh Nguyen, Nils Kemmerzell, Mohamed Abdelaal, Davor Stjelja

    Abstract: This paper presents an approach integrating explainable artificial intelligence (XAI) techniques with adaptive learning to enhance energy consumption prediction models, with a focus on handling data distribution shifts. Leveraging SHAP clustering, our method provides interpretable explanations for model predictions and uses these insights to adaptively refine the model, balancing model complexity… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: A short version of this paper was published at the Australasian Joint Conference on Artificial Intelligence in 2023

  10. arXiv:2401.07278  [pdf, other

    cs.CV cs.AI

    Semi-Supervised Semantic Segmentation using Redesigned Self-Training for White Blood Cells

    Authors: Vinh Quoc Luu, Duy Khanh Le, Huy Thanh Nguyen, Minh Thanh Nguyen, Thinh Tien Nguyen, Vinh Quang Dinh

    Abstract: Artificial Intelligence (AI) in healthcare, especially in white blood cell cancer diagnosis, is hindered by two primary challenges: the lack of large-scale labeled datasets for white blood cell (WBC) segmentation and outdated segmentation methods. These challenges inhibit the development of more accurate and modern techniques to diagnose cancer relating to white blood cells. To address the first c… ▽ More

    Submitted 23 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  11. arXiv:2312.09445  [pdf, other

    eess.SP cs.CV cs.LG

    IncepSE: Leveraging InceptionTime's performance with Squeeze and Excitation mechanism in ECG analysis

    Authors: Tue Minh Cao, Nhat Hong Tran, Le Phi Nguyen, Hieu Huy Pham, Hung Thanh Nguyen

    Abstract: Our study focuses on the potential for modifications of Inception-like architecture within the electrocardiogram (ECG) domain. To this end, we introduce IncepSE, a novel network characterized by strategic architectural incorporation that leverages the strengths of both InceptionTime and channel attention mechanisms. Furthermore, we propose a training setup that employs stabilization techniques tha… ▽ More

    Submitted 16 November, 2023; originally announced December 2023.

  12. arXiv:2312.01384  [pdf, other

    cs.DS cs.DC

    A Tight Lower Bound for 3-Coloring Grids in the Online-LOCAL Model

    Authors: Yi-Jun Chang, Gopinath Mishra, Hung Thuan Nguyen, Mingyang Yang, Yu-Cheng Yeh

    Abstract: Recently, \citeauthor*{akbari2021locality}~(ICALP 2023) studied the locality of graph problems in distributed, sequential, dynamic, and online settings from a {unified} point of view. They designed a novel $O(\log n)$-locality deterministic algorithm for proper 3-coloring bipartite graphs in the $\mathsf{Online}$-$\mathsf{LOCAL}$ model. In this work, we establish the optimality of the algorithm by… ▽ More

    Submitted 1 May, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

  13. Utilizing Model Residuals to Identify Rental Properties of Interest: The Price Anomaly Score (PAS) and Its Application to Real-time Data in Manhattan

    Authors: Youssef Sultan, Jackson C. Rafter, Huyen T. Nguyen

    Abstract: Understanding whether a property is priced fairly hinders buyers and sellers since they usually do not have an objective viewpoint of the price distribution for the overall market of their interest. Drawing from data collected of all possible available properties for rent in Manhattan as of September 2023, this paper aims to strengthen our understanding of model residuals; specifically on machine… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 8 pages, 8 figures, dataset is available with DOI

    Journal ref: Vol. 4, No. 2, December 2023, pp. 97-106

  14. arXiv:2311.03383  [pdf, other

    cs.LG cs.AI cs.AR cs.HC

    Toward Reinforcement Learning-based Rectilinear Macro Placement Under Human Constraints

    Authors: Tuyen P. Le, Hieu T. Nguyen, Seungyeol Baek, Taeyoun Kim, Jungwoo Lee, Seongjung Kim, Hyun** Kim, Misu Jung, Daehoon Kim, Seokyong Lee, Daewoo Choi

    Abstract: Macro placement is a critical phase in chip design, which becomes more intricate when involving general rectilinear macros and layout areas. Furthermore, macro placement that incorporates human-like constraints, such as design hierarchy and peripheral bias, has the potential to significantly reduce the amount of additional manual labor required from designers. This study proposes a methodology tha… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Fast ML for Science @ ICCAD 2023

  15. arXiv:2310.10822  [pdf, other

    cs.RO cs.CV eess.SY

    Vision and Language Navigation in the Real World via Online Visual Language Map**

    Authors: Chengguang Xu, Hieu T. Nguyen, Christopher Amato, Lawson L. S. Wong

    Abstract: Navigating in unseen environments is crucial for mobile robots. Enhancing them with the ability to follow instructions in natural language will further improve navigation efficiency in unseen cases. However, state-of-the-art (SOTA) vision-and-language navigation (VLN) methods are mainly evaluated in simulation, neglecting the complex and noisy real world. Directly transferring SOTA navigation poli… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  16. Distributionally Robust Safety Filter for Learning-Based Control in Active Distribution Systems

    Authors: Hoang Tien Nguyen, Dae-Hyun Choi

    Abstract: Operational constraint violations may occur when deep reinforcement learning (DRL) agents interact with real-world active distribution systems to learn their optimal policies during training. This letter presents a universal distributionally robust safety filter (DRSF) using which any DRL agent can reduce the constraint violations of distribution systems significantly during training while maintai… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  17. arXiv:2306.16638  [pdf, other

    cs.CL

    A negation detection assessment of GPTs: analysis with the xNot360 dataset

    Authors: Ha Thanh Nguyen, Randy Goebel, Francesca Toni, Kostas Stathis, Ken Satoh

    Abstract: Negation is a fundamental aspect of natural language, playing a critical role in communication and comprehension. Our study assesses the negation detection performance of Generative Pre-trained Transformer (GPT) models, specifically GPT-2, GPT-3, GPT-3.5, and GPT-4. We focus on the identification of negation in natural language using a zero-shot prediction approach applied to our custom xNot360 da… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  18. arXiv:2306.09266  [pdf, other

    cs.CV

    A9 Intersection Dataset: All You Need for Urban 3D Camera-LiDAR Roadside Perception

    Authors: Walter Zimmer, Christian Creß, Huu Tung Nguyen, Alois C. Knoll

    Abstract: Intelligent Transportation Systems (ITS) allow a drastic expansion of the visibility range and decrease occlusions for autonomous driving. To obtain accurate detections, detailed labeled sensor data for training is required. Unfortunately, high-quality 3D labels of LiDAR point clouds from the infrastructure perspective of an intersection are still rare. Therefore, we provide the A9 Intersection Da… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 8 pages, 6 figures, 3 tables

  19. arXiv:2306.06893  [pdf, other

    cs.CV cs.AI

    In-context Cross-Density Adaptation on Noisy Mammogram Abnormalities Detection

    Authors: Huy T. Nguyen, Thinh B. Lam, Quan D. D. Tran, Minh T. Nguyen, Dat T. Chung, Vinh Q. Dinh

    Abstract: This paper investigates the impact of breast density distribution on the generalization performance of deep-learning models on mammography images using the VinDr-Mammo dataset. We explore the use of domain adaptation techniques, specifically Domain Adaptive Object Detection (DAOD) with the Noise Latent Transferability Exploration (NLTE) framework, to improve model performance across breast densiti… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  20. arXiv:2305.00314  [pdf, other

    cs.CV

    InfraDet3D: Multi-Modal 3D Object Detection based on Roadside Infrastructure Camera and LiDAR Sensors

    Authors: Walter Zimmer, Joseph Birkner, Marcel Brucker, Huu Tung Nguyen, Stefan Petrovski, Bohan Wang, Alois C. Knoll

    Abstract: Current multi-modal object detection approaches focus on the vehicle domain and are limited in the perception range and the processing capabilities. Roadside sensor units (RSUs) introduce a new domain for perception systems and leverage altitude to observe traffic. Cameras and LiDARs mounted on gantry bridges increase the perception range and produce a full digital twin of the traffic. In this wor… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  21. arXiv:2304.07513  [pdf, other

    eess.SY cs.CR

    Experimental Impact Analysis of Cyberattacks in Power Systems using Digital Real-Time Testbeds

    Authors: Kalinath Katuri, Ioannis Zografopoulos, Ha Thi Nguyen, Charalambos Konstantinou

    Abstract: Smart grid advancements and the increased integration of digital devices have transformed the existing power grid into a cyber-physical energy system. This resha** of the current power system can make it vulnerable to cyberattacks, which could cause irreversible damage to the energy infrastructure resulting in the loss of power, equipment damage, etc. Constant threats emphasize the importance of… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: 2023 IEEE Belgrade PowerTech

  22. arXiv:2304.01220  [pdf, other

    eess.IV cs.CV

    Evaluating the impact of an explainable machine learning system on the interobserver agreement in chest radiograph interpretation

    Authors: Hieu H. Pham, Ha Q. Nguyen, Hieu T. Nguyen, Linh T. Le, Khanh Lam

    Abstract: We conducted a prospective study to measure the clinical impact of an explainable machine learning system on interobserver agreement in chest radiograph interpretation. The AI system, which we call as it VinDr-CXR when used as a diagnosis-supporting tool, significantly improved the agreement between six radiologists with an increase of 1.5% in mean Fleiss' Kappa. In addition, we also observed that… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: This work has been accepted for publication in IEEE Access. This is a short version submitted to the Midwest Machine Learning Symposium (MMLS 2023), Chicago, IL, USA

  23. iQuantum: A Case for Modeling and Simulation of Quantum Computing Environments

    Authors: Hoa T. Nguyen, Muhammad Usman, Rajkumar Buyya

    Abstract: Today's quantum computers are primarily accessible through the cloud and potentially shifting to the edge network in the future. With the rapid advancement and proliferation of quantum computing research worldwide, there has been a considerable increase in demand for using cloud-based quantum computation resources. This demand has highlighted the need for designing efficient and adaptable resource… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: 10 pages, 8 figures

  24. arXiv:2302.02547  [pdf, other

    eess.SY cs.LG

    A Quantum Neural Network Regression for Modeling Lithium-ion Battery Capacity Degradation

    Authors: Anh Phuong Ngo, Nhat Le, Hieu T. Nguyen, Abdullah Eroglu, Duong T. Nguyen

    Abstract: Given the high power density low discharge rate and decreasing cost rechargeable lithium-ion batteries LiBs have found a wide range of applications such as power grid level storage systems electric vehicles and mobile devices. Develo** a framework to accurately model the nonlinear degradation process of LiBs which is indeed a supervised learning problem becomes an important research topic. This… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

    Comments: Accepted for 2023 IEEE Green Technology Conference, Denver, Colorado, USA

  25. Improving performance of real-time full-band blind packet-loss concealment with predictive network

    Authors: Viet-Anh Nguyen, Anh H. T. Nguyen, Andy W. H. Khong

    Abstract: Packet loss concealment (PLC) is a tool for enhancing speech degradation caused by poor network conditions or underflow/overflow in audio processing pipelines. We propose a real-time recurrent method that leverages previous outputs to mitigate artefact of lost packets without the prior knowledge of loss mask. The proposed full-band recurrent network (FRN) model operates at 48 kHz, which is suitabl… ▽ More

    Submitted 12 May, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: In Proceedings ICASSP 2023, 5 pages, 1 figure, 4 tables

  26. arXiv:2208.08019  [pdf, other

    cs.LG cs.AI cs.NI

    Interference Cancellation GAN Framework for Dynamic Channels

    Authors: Hung T. Nguyen, Steven Bottone, Kwang Taik Kim, Mung Chiang, H. Vincent Poor

    Abstract: Symbol detection is a fundamental and challenging problem in modern communication systems, e.g., multiuser multiple-input multiple-output (MIMO) setting. Iterative Soft Interference Cancellation (SIC) is a state-of-the-art method for this task and recently motivated data-driven neural network models, e.g. DeepSIC, that can deal with unknown non-linear channels. However, these neural network models… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  27. arXiv:2208.03545  [pdf, other

    eess.IV cs.CV

    An Accurate and Explainable Deep Learning System Improves Interobserver Agreement in the Interpretation of Chest Radiograph

    Authors: Hieu H. Pham, Ha Q. Nguyen, Hieu T. Nguyen, Linh T. Le, Lam Khanh

    Abstract: Recent artificial intelligence (AI) algorithms have achieved radiologist-level performance on various medical classification tasks. However, only a few studies addressed the localization of abnormal findings from CXR scans, which is essential in explaining the image-level classification to radiologists. We introduce in this paper an explainable deep learning system called VinDr-CXR that can classi… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

  28. arXiv:2208.03403  [pdf, other

    cs.CV

    Slice-level Detection of Intracranial Hemorrhage on CT Using Deep Descriptors of Adjacent Slices

    Authors: Dat T. Ngo, Thao T. B. Nguyen, Hieu T. Nguyen, Dung B. Nguyen, Ha Q. Nguyen, Hieu H. Pham

    Abstract: The rapid development in representation learning techniques such as deep neural networks and the availability of large-scale, well-annotated medical imaging datasets have to a rapid increase in the use of supervised machine learning in the 3D medical image analysis and diagnosis. In particular, deep convolutional neural networks (D-CNNs) have been key players and were adopted by the medical imagin… ▽ More

    Submitted 17 April, 2023; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: Accepted for presentation at the 22nd IEEE Statistical Signal Processing (SSP) workshop

  29. arXiv:2207.04186  [pdf, other

    cs.CV

    A Study on Self-Supervised Object Detection Pretraining

    Authors: Trung Dang, Simon Kornblith, Huy Thong Nguyen, Peter Chin, Maryam Khademi

    Abstract: In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and projecting boxes to each augmented view and maximizing the similarity between corresponding box features. We study existing design choices in the literature, such as bo… ▽ More

    Submitted 10 August, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

  30. arXiv:2205.14845  [pdf, other

    quant-ph cs.DC cs.ET

    QFaaS: A Serverless Function-as-a-Service Framework for Quantum Computing

    Authors: Hoa T. Nguyen, Muhammad Usman, Rajkumar Buyya

    Abstract: Recent breakthroughs in quantum hardware are creating opportunities for its use in many applications. However, quantum software engineering is still in its infancy with many challenges, especially dealing with the diversity of quantum programming languages and hardware platforms. To alleviate these challenges, we propose QFaaS, a novel Quantum Function-as-a-Service framework, which leverages the a… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: 35 pages, 15 figures

    Journal ref: Future Generation Computer Systems (FGCS) 154 (2024) 281-300

  31. A Summary of the ALQAC 2021 Competition

    Authors: Nguyen Ha Thanh, Bui Minh Quan, Chau Nguyen, Tung Le, Nguyen Minh Phuong, Dang Tran Binh, Vuong Thi Hai Yen, Teeradaj Racharak, Nguyen Le Minh, Tran Duc Vu, Phan Viet Anh, Nguyen Truong Son, Huy Tien Nguyen, Bhumindr Butr-indr, Peerapon Vateekul, Prachya Boonkwan

    Abstract: We summarize the evaluation of the first Automated Legal Question Answering Competition (ALQAC 2021). The competition this year contains three tasks, which aims at processing the statute law document, which are Legal Text Information Retrieval (Task 1), Legal Text Entailment Prediction (Task 2), and Legal Text Question Answering (Task 3). The final goal of these tasks is to build a system that can… ▽ More

    Submitted 24 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

  32. arXiv:2203.12738  [pdf, other

    cs.LG cs.AI cs.DC

    Contextual Model Aggregation for Fast and Robust Federated Learning in Edge Computing

    Authors: Hung T. Nguyen, H. Vincent Poor, Mung Chiang

    Abstract: Federated learning is a prime candidate for distributed machine learning at the network edge due to the low communication complexity and privacy protection among other attractive properties. However, existing algorithms face issues with slow convergence and/or robustness of performance due to the considerable heterogeneity of data distribution, computation and communication capability at the edge.… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: 10 pages

  33. arXiv:2203.11205  [pdf, other

    eess.IV cs.CV

    VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography

    Authors: Hieu T. Nguyen, Ha Q. Nguyen, Hieu H. Pham, Khanh Lam, Linh T. Le, Minh Dao, Van Vu

    Abstract: Mammography, or breast X-ray, is the most widely used imaging modality to detect cancer and other breast diseases. Recent studies have shown that deep learning-based computer-assisted detection and diagnosis (CADe or CADx) tools have been developed to support physicians and improve the accuracy of interpreting mammography. However, most published datasets of mammography are either limited on sampl… ▽ More

    Submitted 16 March, 2023; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: The manuscript is accepted for publication by Scientific Data (Nature)

  34. arXiv:2203.10611  [pdf, other

    cs.CV

    Learning from Multiple Expert Annotators for Enhancing Anomaly Detection in Medical Image Analysis

    Authors: Khiem H. Le, Tuan V. Tran, Hieu H. Pham, Hieu T. Nguyen, Tung T. Le, Ha Q. Nguyen

    Abstract: Building an accurate computer-aided diagnosis system based on data-driven approaches requires a large amount of high-quality labeled data. In medical imaging analysis, multiple expert annotators often produce subjective estimates about "ground truth labels" during the annotation process, depending on their expertise and experience. As a result, the labeled data may contain a variety of human biase… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Under review by Neurocomputing

  35. arXiv:2203.10609  [pdf, other

    cs.CV

    A Novel Transparency Strategy-based Data Augmentation Approach for BI-RADS Classification of Mammograms

    Authors: Sam B. Tran, Huyen T. X. Nguyen, Chi Phan, Hieu H. Pham, Ha Q. Nguyen

    Abstract: Image augmentation techniques have been widely investigated to improve the performance of deep learning (DL) algorithms on mammography classification tasks. Recent methods have proved the efficiency of image augmentation on data deficiency or data imbalance issues. In this paper, we propose a novel transparency strategy to boost the Breast Imaging Reporting and Data System (BI-RADS) scores of mamm… ▽ More

    Submitted 17 April, 2023; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted for presentation at the 22nd IEEE Statistical Signal Processing (SSP) workshop

  36. Handwriting recognition and automatic scoring for descriptive answers in Japanese language tests

    Authors: Hung Tuan Nguyen, Cuong Tuan Nguyen, Haruki Oka, Tsunenori Ishioka, Masaki Nakagawa

    Abstract: This paper presents an experiment of automatically scoring handwritten descriptive answers in the trial tests for the new Japanese university entrance examination, which were made for about 120,000 examinees in 2017 and 2018. There are about 400,000 answers with more than 20 million characters. Although all answers have been scored by human examiners, handwritten characters are not labeled. We pre… ▽ More

    Submitted 30 November, 2023; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: Keywords: handwritten Japanese answers, handwriting recognition, automatic scoring, ensemble recognition, deep neural networks; Reported in IEICE technical report, PRMU2021-32, pp.45-50 (2021.12) Published after peer review and Presented in ICFHR2022, Lecture Notes in Computer Science, vol. 13639, pp. 274-284 (2022.11)

  37. arXiv:2112.11491  [pdf, other

    cs.LG cs.IT

    Adversarial Neural Networks for Error Correcting Codes

    Authors: Hung T. Nguyen, Steven Bottone, Kwang Taik Kim, Mung Chiang, H. Vincent Poor

    Abstract: Error correcting codes are a fundamental component in modern day communication systems, demanding extremely high throughput, ultra-reliability and low latency. Recent approaches using machine learning (ML) models as the decoders offer both improved performance and great adaptability to unknown environments, where traditional decoders struggle. We introduce a general framework to further boost the… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: 6 pages, accepted to GLOBECOM 2021

  38. arXiv:2112.11485  [pdf, other

    cs.LG cs.DC

    On-the-fly Resource-Aware Model Aggregation for Federated Learning in Heterogeneous Edge

    Authors: Hung T. Nguyen, Roberto Morabito, Kwang Taik Kim, Mung Chiang

    Abstract: Edge computing has revolutionized the world of mobile and wireless networks world thanks to its flexible, secure, and performing characteristics. Lately, we have witnessed the increasing use of it to make more performing the deployment of machine learning (ML) techniques such as federated learning (FL). FL was debuted to improve communication efficiency compared to conventional distributed machine… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: 6 pages, accepted to GLOBECOM 2021

  39. arXiv:2112.04490  [pdf, other

    eess.IV cs.CV

    A novel multi-view deep learning approach for BI-RADS and density assessment of mammograms

    Authors: Huyen T. X. Nguyen, Sam B. Tran, Dung B. Nguyen, Hieu H. Pham, Ha Q. Nguyen

    Abstract: Advanced deep learning (DL) algorithms may predict the patient's risk of develo** breast cancer based on the Breast Imaging Reporting and Data System (BI-RADS) and density standards. Recent studies have suggested that the combination of multi-view analysis improved the overall breast exam classification. In this paper, we propose a novel multi-view DL approach for BI-RADS and density assessment… ▽ More

    Submitted 17 April, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: This paper has been accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (2022 IEEE EMBC)

  40. arXiv:2111.08956  [pdf, other

    cs.NI

    Intelligence Reflecting Surface-Aided Integrated Data and Energy Networking Coexisting D2D Communications

    Authors: Nguyen Thi Thanh Van, Huy T. Nguyen, Nguyen Cong Luong, Ngo Manh Tien, Dusit Niyato, Dong In Kim

    Abstract: In this paper, we consider an integrated data and energy network and D2D communication coexistence (DED2D) system. The DED2D system allows a base station (BS) to transfer data to information-demanded users (IUs) and energy to energy-demanded users (EUs), i.e., using a time-fraction-based information and energy transfer (TFIET) scheme. Furthermore, the DED2D system enables D2D communications to sha… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

  41. arXiv:2111.00640  [pdf, other

    cs.CL

    VSEC: Transformer-based Model for Vietnamese Spelling Correction

    Authors: Dinh-Truong Do, Ha Thanh Nguyen, Thang Ngoc Bui, Dinh Hieu Vo

    Abstract: Spelling error correction is one of topics which have a long history in natural language processing. Although previous studies have achieved remarkable results, challenges still exist. In the Vietnamese language, a state-of-the-art method for the task infers a syllable's context from its adjacent syllables. The method's accuracy can be unsatisfactory, however, because the model may lose the contex… ▽ More

    Submitted 8 November, 2021; v1 submitted 31 October, 2021; originally announced November 2021.

  42. TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised Pretraining

    Authors: Viet-Anh Nguyen, Anh H. T. Nguyen, Andy W. H. Khong

    Abstract: We introduce a block-online variant of the temporal feature-wise linear modulation (TFiLM) model to achieve bandwidth extension. The proposed architecture simplifies the UNet backbone of the TFiLM to reduce inference time and employs an efficient transformer at the bottleneck to alleviate performance degradation. We also utilize self-supervised pretraining and data augmentation to enhance the qual… ▽ More

    Submitted 7 June, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at ICASSP 2022, 5 pages, 4 figures, 3 tables

  43. arXiv:2108.12126  [pdf, other

    cs.CL

    Automated Generation of Accurate \& Fluent Medical X-ray Reports

    Authors: Hoang T. N. Nguyen, Dong Nie, Taivanbat Badamdorj, Yujie Liu, Yingying Zhu, Jason Truong, Li Cheng

    Abstract: Our paper focuses on automating the generation of medical reports from chest X-ray image inputs, a critical yet time-consuming task for radiologists. Unlike existing medical re-port generation efforts that tend to produce human-readable reports, we aim to generate medical reports that are both fluent and clinically accurate. This is achieved by our fully differentiable and end-to-end paradigm cont… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: accepted in emnlp

  44. arXiv:2108.06486  [pdf, other

    eess.IV cs.CV

    Learning to Automatically Diagnose Multiple Diseases in Pediatric Chest Radiographs Using Deep Convolutional Neural Networks

    Authors: Thanh T. Tran, Hieu H. Pham, Thang V. Nguyen, Tung T. Le, Hieu T. Nguyen, Ha Q. Nguyen

    Abstract: Chest radiograph (CXR) interpretation in pediatric patients is error-prone and requires a high level of understanding of radiologic expertise. Recently, deep convolutional neural networks (D-CNNs) have shown remarkable performance in interpreting CXR in adults. However, there is a lack of evidence indicating that D-CNNs can recognize accurately multiple lung pathologies from pediatric CXR scans. I… ▽ More

    Submitted 14 August, 2021; originally announced August 2021.

    Comments: This is a preprint of our paper which was accepted for publication to ICCV Workshop 2021

  45. arXiv:2108.05002  [pdf

    cs.CL cs.CV

    A Transformer-based Math Language Model for Handwritten Math Expression Recognition

    Authors: Huy Quang Ung, Cuong Tuan Nguyen, Hung Tuan Nguyen, Thanh-Nghia Truong, Masaki Nakagawa

    Abstract: Handwritten mathematical expressions (HMEs) contain ambiguities in their interpretations, even for humans sometimes. Several math symbols are very similar in the writing style, such as dot and comma or 0, O, and o, which is a challenge for HME recognition systems to handle without using contextual information. To address this problem, this paper presents a Transformer-based Math Language Model (TM… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: 14 pages, accepted in ICDAR-DIL 2021

  46. arXiv:2106.14459  [pdf

    cs.CV

    Recurrent neural network transducer for Japanese and Chinese offline handwritten text recognition

    Authors: Trung Tan Ngo, Hung Tuan Nguyen, Nam Tuan Ly, Masaki Nakagawa

    Abstract: In this paper, we propose an RNN-Transducer model for recognizing Japanese and Chinese offline handwritten text line images. As far as we know, it is the first approach that adopts the RNN-Transducer model for offline handwritten text recognition. The proposed model consists of three main components: a visual feature encoder that extracts visual features from an input image by CNN and then encodes… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

  47. arXiv:2106.12930  [pdf, other

    eess.IV cs.CV cs.LG

    VinDr-SpineXR: A deep learning framework for spinal lesions detection and classification from radiographs

    Authors: Hieu T. Nguyen, Hieu H. Pham, Nghia T. Nguyen, Ha Q. Nguyen, Thang Q. Huynh, Minh Dao, Van Vu

    Abstract: Radiographs are used as the most important imaging tool for identifying spine anomalies in clinical practice. The evaluation of spinal bone lesions, however, is a challenging task for radiologists. This work aims at develo** and evaluating a deep learning-based framework, named VinDr-SpineXR, for the classification and localization of abnormalities from spine X-rays. First, we build a large data… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: This is a preprint of our paper which was accepted for publication by the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2021)

  48. arXiv:2105.10159  [pdf

    cs.CV

    GSSF: A Generative Sequence Similarity Function based on a Seq2Seq model for clustering online handwritten mathematical answers

    Authors: Huy Quang Ung, Cuong Tuan Nguyen, Hung Tuan Nguyen, Masaki Nakagawa

    Abstract: Toward a computer-assisted marking for descriptive math questions,this paper presents clustering of online handwritten mathematical expressions (OnHMEs) to help human markers to mark them efficiently and reliably. We propose a generative sequence similarity function for computing a similarity score of two OnHMEs based on a sequence-to-sequence OnHME recognizer. Each OnHME is represented by a simil… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: 16 pages, ICDAR2021

  49. arXiv:2105.10156  [pdf

    cs.CV

    Global Context for improving recognition of Online Handwritten Mathematical Expressions

    Authors: Cuong Tuan Nguyen, Thanh-Nghia Truong, Hung Tuan Nguyen, Masaki Nakagawa

    Abstract: This paper presents a temporal classification method for all three subtasks of symbol segmentation, symbol recognition and relation classification in online handwritten mathematical expressions (HMEs). The classification model is trained by multiple paths of symbols and spatial relations derived from the Symbol Relation Tree (SRT) representation of HMEs. The method benefits from global context of… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: 16 pages, ICDAR2021

  50. arXiv:2105.06084  [pdf

    cs.CV cs.LG

    Learning symbol relation tree for online mathematical expression recognition

    Authors: Thanh-Nghia Truong, Hung Tuan Nguyen, Cuong Tuan Nguyen, Masaki Nakagawa

    Abstract: This paper proposes a method for recognizing online handwritten mathematical expressions (OnHME) by building a symbol relation tree (SRT) directly from a sequence of strokes. A bidirectional recurrent neural network learns from multiple derived paths of SRT to predict both symbols and spatial relations between symbols using global context. The recognition system has two parts: a temporal classifie… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

    Comments: 13 pages, conference

    ACM Class: I.5.1