Search | arXiv e-print repository

Forget but Recall: Incremental Latent Rectification in Continual Learning

Authors: Nghia D. Nguyen, Hieu Trung Nguyen, Ang Li, Hoang Pham, Viet Anh Nguyen, Khoa D. Doan

Abstract: Intrinsic capability to continuously learn a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which hinders remembering past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches either retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper in… ▽ More Intrinsic capability to continuously learn a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which hinders remembering past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches either retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper investigates an unexplored CL direction for incremental learning called Incremental Latent Rectification or ILR. In a nutshell, ILR learns to propagate with correction (or rectify) the representation from the current trained DNN backward to the representation space of the old task, where performing predictive decisions is easier. This rectification process only employs a chain of small representation map** networks, called rectifier units. Empirical experiments on several continual learning benchmarks, including CIFAR10, CIFAR100, and Tiny ImageNet, demonstrate the effectiveness and potential of this novel CL direction compared to existing representative CL methods. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2405.00291 [pdf, other]

How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses

Authors: Jionghao Lin, Eason Chen, Zeifei Han, Ashish Gurung, Danielle R. Thomas, Wei Tan, Ngoc Dang Nguyen, Kenneth R. Koedinger

Abstract: Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study… ▽ More Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study leverages the capabilities of large language models, specifically Generative Pre-Trained Transformers (GPT), to explore a sequence labeling approach focused on identifying components of desired and less desired praise for providing explanatory feedback within a tutor training dataset. Our aim is to equip tutors with actionable, explanatory feedback during online training lessons. To investigate the potential of GPT models for providing the explanatory feedback, we employed two commonly-used approaches: prompting and fine-tuning. To quantify the quality of highlighted praise components identified by GPT models, we introduced a Modified Intersection over Union (M-IoU) score. Our findings demonstrate that: (1) the M-IoU score effectively correlates with human judgment in evaluating sequence quality; (2) using two-shot prompting on GPT-3.5 resulted in decent performance in recognizing effort-based (M-IoU of 0.46) and outcome-based praise (M-IoU of 0.68); and (3) our optimally fine-tuned GPT-3.5 model achieved M-IoU scores of 0.64 for effort-based praise and 0.84 for outcome-based praise, aligning with the satisfaction levels evaluated by human coders. Our results show promise for using GPT models to provide feedback that focuses on specific elements in their open-ended responses that are desirable or could use improvement. △ Less

Submitted 30 April, 2024; originally announced May 2024.

Comments: 11 pages, full research paper, EDM 2024

arXiv:2401.07395 [pdf, other]

Harnessing the Power of Beta Scoring in Deep Active Learning for Multi-Label Text Classification

Authors: Wei Tan, Ngoc Dang Nguyen, Lan Du, Wray Buntine

Abstract: Within the scope of natural language processing, the domain of multi-label text classification is uniquely challenging due to its expansive and uneven label distribution. The complexity deepens due to the demand for an extensive set of annotated data for training an advanced deep learning model, especially in specialized fields where the labeling task can be labor-intensive and often requires doma… ▽ More Within the scope of natural language processing, the domain of multi-label text classification is uniquely challenging due to its expansive and uneven label distribution. The complexity deepens due to the demand for an extensive set of annotated data for training an advanced deep learning model, especially in specialized fields where the labeling task can be labor-intensive and often requires domain-specific knowledge. Addressing these challenges, our study introduces a novel deep active learning strategy, capitalizing on the Beta family of proper scoring rules within the Expected Loss Reduction framework. It computes the expected increase in scores using the Beta Scoring Rules, which are then transformed into sample vector representations. These vector representations guide the diverse selection of informative samples, directly linking this process to the model's expected proper score. Comprehensive evaluations across both synthetic and real datasets reveal our method's capability to often outperform established acquisition techniques in multi-label text classification, presenting encouraging outcomes across various architectural and dataset scenarios. △ Less

Submitted 14 January, 2024; originally announced January 2024.

Comments: 7 pages AAAI 2024

arXiv:2312.10543 [pdf, other]

Study of cognitive component of auditory attention to natural speech events

Authors: Nhan D. T. Nguyen, Kaare Mikkelsen, Preben Kidmose

Abstract: Event-related potentials (ERP) have been used to address a wide range of research questions in neuroscience and cognitive psychology including selective auditory attention. The recent progress in auditory attention decoding (AAD) methods is based on algorithms that find a relation between the audio envelope and the neurophysiological response. The most popular approach is based on the reconstructi… ▽ More Event-related potentials (ERP) have been used to address a wide range of research questions in neuroscience and cognitive psychology including selective auditory attention. The recent progress in auditory attention decoding (AAD) methods is based on algorithms that find a relation between the audio envelope and the neurophysiological response. The most popular approach is based on the reconstruction of the audio envelope based on EEG signals. However, these methods are mainly based on the neurophysiological entrainment to physical attributes of the sensory stimulus and are generally limited by a long detection window. This study proposes a novel approach to auditory attention decoding by looking at higher-level cognitive responses to natural speech. To investigate if natural speech events elicit cognitive ERP components and how these components are affected by attention mechanisms, we designed a series of four experimental paradigms with increasing complexity: a word category oddball paradigm, a word category oddball paradigm with competing speakers, and competing speech streams with and without specific targets. We recorded the electroencephalogram (EEG) from 32 scalp electrodes and 12 in-ear electrodes (ear-EEG) from 24 participants. A cognitive ERP component, which we believe is related to the well-known P3b component, was observed at parietal electrode sites with a latency of approximately 620 ms. The component is statistically most significant for the simplest paradigm and gradually decreases in strength with increasing complexity of the paradigm. We also show that the component can be observed in the in-ear EEG signals by using spatial filtering. The cognitive component elicited by auditory attention may contribute to decoding auditory attention from electrophysiological recordings and its presence in the ear-EEG signals is promising for future applications within hearing aids. △ Less

Submitted 19 December, 2023; v1 submitted 16 December, 2023; originally announced December 2023.

Comments: 15 pages, 11 figures

arXiv:2311.04918 [pdf, other]

Low-Resource Named Entity Recognition: Can One-vs-All AUC Maximization Help?

Authors: Ngoc Dang Nguyen, Wei Tan, Lan Du, Wray Buntine, Richard Beare, Changyou Chen

Abstract: Named entity recognition (NER), a task that identifies and categorizes named entities such as persons or organizations from text, is traditionally framed as a multi-class classification problem. However, this approach often overlooks the issues of imbalanced label distributions, particularly in low-resource settings, which is common in certain NER contexts, like biomedical NER (bioNER). To address… ▽ More Named entity recognition (NER), a task that identifies and categorizes named entities such as persons or organizations from text, is traditionally framed as a multi-class classification problem. However, this approach often overlooks the issues of imbalanced label distributions, particularly in low-resource settings, which is common in certain NER contexts, like biomedical NER (bioNER). To address these issues, we propose an innovative reformulation of the multi-class problem as a one-vs-all (OVA) learning problem and introduce a loss function based on the area under the receiver operating characteristic curve (AUC). To enhance the efficiency of our OVA-based approach, we propose two training strategies: one groups labels with similar linguistic characteristics, and another employs meta-learning. The superiority of our approach is confirmed by its performance, which surpasses traditional NER learning in varying NER settings. △ Less

Submitted 2 November, 2023; originally announced November 2023.

Comments: 6 pages, 3 figures, ICDM 2023

arXiv:2311.00906 [pdf, other]

Re-weighting Tokens: A Simple and Effective Active Learning Strategy for Named Entity Recognition

Authors: Haocheng Luo, Wei Tan, Ngoc Dang Nguyen, Lan Du

Abstract: Active learning, a widely adopted technique for enhancing machine learning models in text and image classification tasks with limited annotation resources, has received relatively little attention in the domain of Named Entity Recognition (NER). The challenge of data imbalance in NER has hindered the effectiveness of active learning, as sequence labellers lack sufficient learning signals. To addre… ▽ More Active learning, a widely adopted technique for enhancing machine learning models in text and image classification tasks with limited annotation resources, has received relatively little attention in the domain of Named Entity Recognition (NER). The challenge of data imbalance in NER has hindered the effectiveness of active learning, as sequence labellers lack sufficient learning signals. To address these challenges, this paper presents a novel reweighting-based active learning strategy that assigns dynamic smoothed weights to individual tokens. This adaptable strategy is compatible with various token-level acquisition functions and contributes to the development of robust active learners. Experimental results on multiple corpora demonstrate the substantial performance improvement achieved by incorporating our re-weighting strategy into existing acquisition functions, validating its practical efficacy. △ Less

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2306.15498 [pdf, other]

Using Large Language Models to Provide Explanatory Feedback to Human Tutors

Authors: Jionghao Lin, Danielle R. Thomas, Feifei Han, Shivang Gupta, Wei Tan, Ngoc Dang Nguyen, Kenneth R. Koedinger

Abstract: Research demonstrates learners engaging in the process of producing explanations to support their reasoning, can have a positive impact on learning. However, providing learners real-time explanatory feedback often presents challenges related to classification accuracy, particularly in domain-specific environments, containing situationally complex and nuanced responses. We present two approaches fo… ▽ More Research demonstrates learners engaging in the process of producing explanations to support their reasoning, can have a positive impact on learning. However, providing learners real-time explanatory feedback often presents challenges related to classification accuracy, particularly in domain-specific environments, containing situationally complex and nuanced responses. We present two approaches for supplying tutors real-time feedback within an online lesson on how to give students effective praise. This work-in-progress demonstrates considerable accuracy in binary classification for corrective feedback of effective, or effort-based (F1 score = 0.811), and ineffective, or outcome-based (F1 score = 0.350), praise responses. More notably, we introduce progress towards an enhanced approach of providing explanatory feedback using large language model-facilitated named entity recognition, which can provide tutors feedback, not only while engaging in lessons, but can potentially suggest real-time tutor moves. Future work involves leveraging large language models for data augmentation to improve accuracy, while also develo** an explanatory feedback interface. △ Less

Submitted 27 June, 2023; originally announced June 2023.

Comments: 12 pages Workshop paper, The 24th International Conference on Artificial Intelligence in Education, AIED 2023 Educational Dialogue Act Classification, Large Language Models, Named Entity Recognition, Tutor Training, Explanatory Feedback, Natural Language Processing

arXiv:2304.07499 [pdf, other]

Robust Educational Dialogue Act Classifiers with Low-Resource and Imbalanced Datasets

Authors: Jionghao Lin, Wei Tan, Ngoc Dang Nguyen, David Lang, Lan Du, Wray Buntine, Richard Beare, Guanliang Chen, Dragan Gasevic

Abstract: Dialogue acts (DAs) can represent conversational actions of tutors or students that take place during tutoring dialogues. Automating the identification of DAs in tutoring dialogues is significant to the design of dialogue-based intelligent tutoring systems. Many prior studies employ machine learning models to classify DAs in tutoring dialogues and invest much effort to optimize the classification… ▽ More Dialogue acts (DAs) can represent conversational actions of tutors or students that take place during tutoring dialogues. Automating the identification of DAs in tutoring dialogues is significant to the design of dialogue-based intelligent tutoring systems. Many prior studies employ machine learning models to classify DAs in tutoring dialogues and invest much effort to optimize the classification accuracy by using limited amounts of training data (i.e., low-resource data scenario). However, beyond the classification accuracy, the robustness of the classifier is also important, which can reflect the capability of the classifier on learning the patterns from different class distributions. We note that many prior studies on classifying educational DAs employ cross entropy (CE) loss to optimize DA classifiers on low-resource data with imbalanced DA distribution. The DA classifiers in these studies tend to prioritize accuracy on the majority class at the expense of the minority class which might not be robust to the data with imbalanced ratios of different DA classes. To optimize the robustness of classifiers on imbalanced class distributions, we propose to optimize the performance of the DA classifier by maximizing the area under the ROC curve (AUC) score (i.e., AUC maximization). Through extensive experiments, our study provides evidence that (i) by maximizing AUC in the training process, the DA classifier achieves significant performance improvement compared to the CE approach under low-resource data, and (ii) AUC maximization approaches can improve the robustness of the DA classifier under different class imbalance ratios. △ Less

Submitted 15 April, 2023; originally announced April 2023.

Comments: 12 pages full paper, The 24th International Conference on Artificial Intelligence in Education, AIED 2023 Educational Dialogue Act Classification, Model Robustness, Low-Resource Data, Imbalanced Data, Large Language Models

arXiv:2303.05478 [pdf, other]

Real roots of random polynomials: asymptotics of the variance

Authors: Yen Q. Do, Nhan D. V. Nguyen

Abstract: We compute the precise leading asymptotics of the variance of the number of real roots for a large class of random polynomials, where the random coefficients have polynomial growth. Our results apply to many classical ensembles, including the Kac polynomials, hyperbolic polynomials, their derivatives, and any linear combinations of these polynomials. Prior to this paper, such asymptotics was only… ▽ More We compute the precise leading asymptotics of the variance of the number of real roots for a large class of random polynomials, where the random coefficients have polynomial growth. Our results apply to many classical ensembles, including the Kac polynomials, hyperbolic polynomials, their derivatives, and any linear combinations of these polynomials. Prior to this paper, such asymptotics was only established for the Kac polynomials in the 1970s, with the seminal contribution of Maslova. The main ingredients of the proof are new asymptotic estimates for the two-point correlation function of the real roots, revealing geometric structures in the distribution of the real roots of these random polynomials. As a corollary, we obtain asymptotic normality for the real roots for these random polynomials, extending and strengthening a related result of O. Nguyen and V. Vu. △ Less

Submitted 7 May, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

Comments: 41 pages, 6 figures, intro rewritten, main results unchanged, new references added

MSC Class: 60G50; 60F05; 41A60

arXiv:2302.09151 [pdf]

doi 10.1016/j.biosystems.2023.105001

SBcoyote: An Extensible Python-Based Reaction Editor and Viewer

Authors: ** Xu, Gary Geng, Nhan D. Nguyen, Carmen Perena-Cortes, Claire Samuels, Herbert M. Sauro

Abstract: SBcoyote is an open-source cross-platform biochemical reaction viewer and editor released under the liberal MIT license. It is written in Python and uses wxPython to implement the GUI and the drawing canvas. It supports the visualization and editing of compartments, species, and reactions. It includes many options to stylize each of these components. For instance, species can be in different color… ▽ More SBcoyote is an open-source cross-platform biochemical reaction viewer and editor released under the liberal MIT license. It is written in Python and uses wxPython to implement the GUI and the drawing canvas. It supports the visualization and editing of compartments, species, and reactions. It includes many options to stylize each of these components. For instance, species can be in different colors and shapes. Other core features include the ability to create alias nodes, alignment of groups of nodes, network zooming, as well as an interactive bird-eye view of the network to allow easy navigation on large networks. A unique feature of the tool is the extensive Python plugin API, where third-party developers can include new functionality. To assist third-party plugin developers, we provide a variety of sample plugins, including, random network generation, a simple auto layout tool, export to Antimony, export SBML, import SBML, etc. Of particular interest are the export and import SBML plugins since these support the SBML level 3 layout and render standard, which is exchangeable with other software packages. Plugins are stored in a GitHub repository, and an included plugin manager can retrieve and install new plugins from the repository on demand. Plugins have version metadata associated with them to make it install plugin updates. Availability: https://github.com/sys-bio/SBcoyote. △ Less

Submitted 14 August, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

arXiv:2301.05938 [pdf]

Deep Learning Provides Rapid Screen for Breast Cancer Metastasis with Sentinel Lymph Nodes

Authors: Kareem Allam, Xiaohong Iris Wang, Songlin Zhang, Jianmin Ding, Kevin Chiu, Karan Saluja, Amer Wahed, Hongxia Sun, Andy N. D. Nguyen

Abstract: Deep learning has been shown to be useful to detect breast cancer metastases by analyzing whole slide images of sentinel lymph nodes. However, it requires extensive scanning and analysis of all the lymph nodes slides for each case. Our deep learning study focuses on breast cancer screening with only a small set of image patches from any sentinel lymph node, positive or negative for metastasis, to… ▽ More Deep learning has been shown to be useful to detect breast cancer metastases by analyzing whole slide images of sentinel lymph nodes. However, it requires extensive scanning and analysis of all the lymph nodes slides for each case. Our deep learning study focuses on breast cancer screening with only a small set of image patches from any sentinel lymph node, positive or negative for metastasis, to detect changes in tumor environment and not in the tumor itself. We design a convolutional neural network in the Python language to build a diagnostic model for this purpose. The excellent results from this preliminary study provided a proof of concept for incorporating automated metastatic screen into the digital pathology workflow to augment the pathologists' productivity. Our approach is unique since it provides a very rapid screen rather than an exhaustive search for tumor in all fields of all sentinel lymph nodes. △ Less

Submitted 14 January, 2023; originally announced January 2023.

Comments: 9 pages, 3 figures, 5 tables

arXiv:2212.04800 [pdf, other]

AUC Maximization for Low-Resource Named Entity Recognition

Authors: Ngoc Dang Nguyen, Wei Tan, Wray Buntine, Richard Beare, Changyou Chen, Lan Du

Abstract: Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize the underlying NER model. Both of these traditional objective functions for the NER problem generally produce adequate performance when the data distribution is balanced and there are sufficient annotated training examples. But since NER is in… ▽ More Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize the underlying NER model. Both of these traditional objective functions for the NER problem generally produce adequate performance when the data distribution is balanced and there are sufficient annotated training examples. But since NER is inherently an imbalanced tagging problem, the model performance under the low-resource settings could suffer using these standard objective functions. Based on recent advances in area under the ROC curve (AUC) maximization, we propose to optimize the NER model by maximizing the AUC score. We give evidence that by simply combining two binary-classifiers that maximize the AUC score, significant performance improvement over traditional loss functions is achieved under low-resource NER settings. We also conduct extensive experiments to demonstrate the advantages of our method under the low-resource and highly-imbalanced data distribution settings. To the best of our knowledge, this is the first work that brings AUC maximization to the NER setting. Furthermore, we show that our method is agnostic to different types of NER embeddings, models and domains. The code to replicate this work will be provided upon request. △ Less

Submitted 13 April, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

Comments: 10 pages, 4 figures, AAAI 2023

arXiv:2211.05980 [pdf, other]

Hardness-guided domain adaptation to recognise biomedical named entities under low-resource scenarios

Authors: Ngoc Dang Nguyen, Lan Du, Wray Buntine, Changyou Chen, Richard Beare

Abstract: Domain adaptation is an effective solution to data scarcity in low-resource scenarios. However, when applied to token-level tasks such as bioNER, domain adaptation methods often suffer from the challenging linguistic characteristics that clinical narratives possess, which leads to unsatisfactory performance. In this paper, we present a simple yet effective hardness-guided domain adaptation (HGDA)… ▽ More Domain adaptation is an effective solution to data scarcity in low-resource scenarios. However, when applied to token-level tasks such as bioNER, domain adaptation methods often suffer from the challenging linguistic characteristics that clinical narratives possess, which leads to unsatisfactory performance. In this paper, we present a simple yet effective hardness-guided domain adaptation (HGDA) framework for bioNER tasks that can effectively leverage the domain hardness information to improve the adaptability of the learnt model in low-resource scenarios. Experimental results on biomedical datasets show that our model can achieve significant performance improvement over the recently published state-of-the-art (SOTA) MetaNER model △ Less

Submitted 10 November, 2022; originally announced November 2022.

arXiv:2209.01304 [pdf, other]

doi 10.25073/2588-1086/vnucsce.369

vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM

Authors: Thanh Tin Nguyen, Long H. Nguyen, Nhat Truong Pham, Liu Tai Nguyen, Van Huong Do, Hai Nguyen, Ngoc Duy Nguyen

Abstract: This study presents our approach on the automatic Vietnamese image captioning for healthcare domain in text processing tasks of Vietnamese Language and Speech Processing (VLSP) Challenge 2021, as shown in Figure 1. In recent years, image captioning often employs a convolutional neural network-based architecture as an encoder and a long short-term memory (LSTM) as a decoder to generate sentences. T… ▽ More This study presents our approach on the automatic Vietnamese image captioning for healthcare domain in text processing tasks of Vietnamese Language and Speech Processing (VLSP) Challenge 2021, as shown in Figure 1. In recent years, image captioning often employs a convolutional neural network-based architecture as an encoder and a long short-term memory (LSTM) as a decoder to generate sentences. These models perform remarkably well in different datasets. Our proposed model also has an encoder and a decoder, but we instead use a Swin Transformer in the encoder, and a LSTM combined with an attention module in the decoder. The study presents our training experiments and techniques used during the competition. Our model achieves a BLEU4 score of 0.293 on the vietCap4H dataset, and the score is ranked the 3$^{rd}$ place on the private leaderboard. Our code can be found at \url{https://git.io/JDdJm}. △ Less

Submitted 2 September, 2022; originally announced September 2022.

Comments: Accepted for publication in the VNU Journal of Science: Computer Science and Communication Engineering

Journal ref: VNU Journal of Science: Computer Science and Communication Engineering, 38(2), 2022

arXiv:2204.07002 [pdf, other]

XLMRQA: Open-Domain Question Answering on Vietnamese Wikipedia-based Textual Knowledge Source

Authors: Kiet Van Nguyen, Phong Nguyen-Thuan Do, Nhat Duy Nguyen, Tin Van Huynh, Anh Gia-Tuan Nguyen, Ngan Luu-Thuy Nguyen

Abstract: Question answering (QA) is a natural language understanding task within the fields of information retrieval and information extraction that has attracted much attention from the computational linguistics and artificial intelligence research community in recent years because of the strong development of machine reading comprehension-based models. A reader-based QA system is a high-level search engi… ▽ More Question answering (QA) is a natural language understanding task within the fields of information retrieval and information extraction that has attracted much attention from the computational linguistics and artificial intelligence research community in recent years because of the strong development of machine reading comprehension-based models. A reader-based QA system is a high-level search engine that can find correct answers to queries or questions in open-domain or domain-specific texts using machine reading comprehension (MRC) techniques. The majority of advancements in data resources and machine-learning approaches in the MRC and QA systems especially are developed significantly in two resource-rich languages such as English and Chinese. A low-resource language like Vietnamese has witnessed a scarcity of research on QA systems. This paper presents XLMRQA, the first Vietnamese QA system using a supervised transformer-based reader on the Wikipedia-based textual knowledge source (using the UIT-ViQuAD corpus), outperforming the two robust QA systems using deep neural network models: DrQA and BERTserini with 24.46% and 6.28%, respectively. From the results obtained on the three systems, we analyze the influence of question types on the performance of the QA systems. △ Less

Submitted 13 August, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

Comments: Accepted by ACIIDS 2022

arXiv:2111.10875 [pdf, other]

doi 10.1214/24-EJP1142

The number of real zeros of elliptic polynomials

Authors: Nhan D. V. Nguyen

Abstract: Let $N_n(a, b)$ denote the number of real zeros of Gaussian elliptic polynomials of degree $n$ on the interval $(a, b)$, where $a$ and $b$ may vary with $n$. We obtain a precise formula for the variance of $N_n(a, b)$ and utilize this expression to derive an asymptotic expansion for large values of $n$. Furthermore, we provide sharp estimates for the cumulants and central moments of $N_n(a, b)$. T… ▽ More Let $N_n(a, b)$ denote the number of real zeros of Gaussian elliptic polynomials of degree $n$ on the interval $(a, b)$, where $a$ and $b$ may vary with $n$. We obtain a precise formula for the variance of $N_n(a, b)$ and utilize this expression to derive an asymptotic expansion for large values of $n$. Furthermore, we provide sharp estimates for the cumulants and central moments of $N_n(a, b)$. These estimates are instrumental in establishing sufficient conditions on the interval $(a, b)$ for $N_n(a, b)$ to satisfy both a central limit theorem and a strong law of large numbers. In the second part of the paper, we extend our analysis to nondegenerate Gaussian analytic functions, including well-known examples such as the Gaussian Weyl series and Weyl polynomials. △ Less

Submitted 7 May, 2024; v1 submitted 21 November, 2021; originally announced November 2021.

Comments: 49 pages, 2 figures, 1 table, final version

MSC Class: 60G15; 60G50; 60F05; 41A60

Journal ref: Electron. J. Probab. 29 (2024), 1-49

arXiv:2111.01450 [pdf, other]

doi 10.1039/D1TA09620F

Relevance of Ge incorporation to control the physical behaviour of point defects in kesterite

Authors: Thomas Ratz, Ngoc Duy Nguyen, Guy Brammertz, Bart Vermang, Jean-Yves Raty

Abstract: To reduce the prominent VOC-deficit that limits kesterite-based solar cells efficiencies, Ge has been proposed over the recent years with encouraging results, as the reduction of the non-radiative recombination rate is considered as a way to improve the well-known Sn-kesterite world record efficiency. To gain further insight into this mechanism, we investigate the physical behaviour of intrinsic p… ▽ More To reduce the prominent VOC-deficit that limits kesterite-based solar cells efficiencies, Ge has been proposed over the recent years with encouraging results, as the reduction of the non-radiative recombination rate is considered as a way to improve the well-known Sn-kesterite world record efficiency. To gain further insight into this mechanism, we investigate the physical behaviour of intrinsic point defects both upon Ge do** and alloying of Cu2ZnSnS4 kesterite. Using a first-principles approach, we confirm the p-type conductivity of both Cu2ZnSnS4 and Cu2ZnGeS4, attributed to the low formation energies of the VCu and CuZn acceptor defects within the whole stable phase diagram range. Via do** of the Sn-kesterite matrix, we report the lowest formation energy for the substitutional defect GeSn. We also confirm the detrimental role of the substitutional defects XZn (X=Sn,Ge) acting as recombination centres within the Sn-based, the Ge-doped and the Ge-based kesterite. Finally, we highlight the reduction of the lattice distortion upon Ge incorporation resulting in a reduction of the carrier capture cross section and consequently a decrease of the non-radiative recombination rate within the bulk material. △ Less

Submitted 2 November, 2021; originally announced November 2021.

Comments: 14 pages, 6 figures, Journal of Materials Chemistry A (2022)

arXiv:2109.03219 [pdf, other]

Fruit-CoV: An Efficient Vision-based Framework for Speedy Detection and Diagnosis of SARS-CoV-2 Infections Through Recorded Cough Sounds

Authors: Long H. Nguyen, Nhat Truong Pham, Van Huong Do, Liu Tai Nguyen, Thanh Tin Nguyen, Van Dung Do, Hai Nguyen, Ngoc Duy Nguyen

Abstract: SARS-CoV-2 is colloquially known as COVID-19 that had an initial outbreak in December 2019. The deadly virus has spread across the world, taking part in the global pandemic disease since March 2020. In addition, a recent variant of SARS-CoV-2 named Delta is intractably contagious and responsible for more than four million deaths over the world. Therefore, it is vital to possess a self-testing serv… ▽ More SARS-CoV-2 is colloquially known as COVID-19 that had an initial outbreak in December 2019. The deadly virus has spread across the world, taking part in the global pandemic disease since March 2020. In addition, a recent variant of SARS-CoV-2 named Delta is intractably contagious and responsible for more than four million deaths over the world. Therefore, it is vital to possess a self-testing service of SARS-CoV-2 at home. In this study, we introduce Fruit-CoV, a two-stage vision framework, which is capable of detecting SARS-CoV-2 infections through recorded cough sounds. Specifically, we convert sounds into Log-Mel Spectrograms and use the EfficientNet-V2 network to extract its visual features in the first stage. In the second stage, we use 14 convolutional layers extracted from the large-scale Pretrained Audio Neural Networks for audio pattern recognition (PANNs) and the Wavegram-Log-Mel-CNN to aggregate feature representations of the Log-Mel Spectrograms. Finally, we use the combined features to train a binary classifier. In this study, we use a dataset provided by the AICovidVN 115M Challenge, which includes a total of 7371 recorded cough sounds collected throughout Vietnam, India, and Switzerland. Experimental results show that our proposed model achieves an AUC score of 92.8% and ranks the 1st place on the leaderboard of the AICovidVN Challenge. More importantly, our proposed framework can be integrated into a call center or a VoIP system to speed up detecting SARS-CoV-2 infections through online/recorded cough sounds. △ Less

Submitted 6 September, 2021; originally announced September 2021.

Comments: 4 pages

arXiv:2105.09043 [pdf, other]

Sentence Extraction-Based Machine Reading Comprehension for Vietnamese

Authors: Phong Nguyen-Thuan Do, Nhat Duy Nguyen, Tin Van Huynh, Kiet Van Nguyen, Anh Gia-Tuan Nguyen, Ngan Luu-Thuy Nguyen

Abstract: The development of natural language processing (NLP) in general and machine reading comprehension in particular has attracted the great attention of the research community. In recent years, there are a few datasets for machine reading comprehension tasks in Vietnamese with large sizes, such as UIT-ViQuAD and UIT-ViNewsQA. However, the datasets are not diverse in answers to serve the research. In t… ▽ More The development of natural language processing (NLP) in general and machine reading comprehension in particular has attracted the great attention of the research community. In recent years, there are a few datasets for machine reading comprehension tasks in Vietnamese with large sizes, such as UIT-ViQuAD and UIT-ViNewsQA. However, the datasets are not diverse in answers to serve the research. In this paper, we introduce UIT-ViWikiQA, the first dataset for evaluating sentence extraction-based machine reading comprehension in the Vietnamese language. The UIT-ViWikiQA dataset is converted from the UIT-ViQuAD dataset, consisting of comprises 23.074 question-answers based on 5.109 passages of 174 Wikipedia Vietnamese articles. We propose a conversion algorithm to create the dataset for sentence extraction-based machine reading comprehension and three types of approaches for sentence extraction-based machine reading comprehension in Vietnamese. Our experiments show that the best machine model is XLM-R_Large, which achieves an exact match (EM) of 85.97% and an F1-score of 88.77% on our dataset. Besides, we analyze experimental results in terms of the question type in Vietnamese and the effect of context on the performance of the MRC models, thereby showing the challenges from the UIT-ViWikiQA dataset that we propose to the language processing community. △ Less

Submitted 11 June, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

Comments: Accepted by KSEM 2021 (International Conference on Knowledge Science, Engineering and Management)

arXiv:2011.12764 [pdf, other]

doi 10.1088/2515-7655/abefbe

Opto-electronic properties and solar cell efficiency modelling of Cu$_2$ZnXS$_4$ (X=Sn,Ge,Si) kesterites

Authors: Thomas Ratz, Jean-Yves Raty, Guy Brammertz, Bart Vermang, Ngoc Duy Nguyen

Abstract: In this work, first principle calculations of Cu$_2$ZnSnS$_4$ (CZTS), Cu$_2$ZnGeS$_4$ (CZGS) and Cu$_2$ZnSiS$_4$ (CZSS) are performed to highlight the impact of the cationic substitution on the structural, electronic and optical properties of kesterite compounds. Direct bandgaps are reported with values of 1.32, 1.89 and 3.06 eV respectively for CZTS, CZGS and CZSS. In addition, absorption coeffic… ▽ More In this work, first principle calculations of Cu$_2$ZnSnS$_4$ (CZTS), Cu$_2$ZnGeS$_4$ (CZGS) and Cu$_2$ZnSiS$_4$ (CZSS) are performed to highlight the impact of the cationic substitution on the structural, electronic and optical properties of kesterite compounds. Direct bandgaps are reported with values of 1.32, 1.89 and 3.06 eV respectively for CZTS, CZGS and CZSS. In addition, absorption coefficient values of the order of $10^4$ cm$^{-1}$ are obtained, indicating the applicability of these materials as absorber layer for solar cell applications. In the second part of this study, ab initio results are used as input data to model the electrical power conversion efficiency of kesterite-based solar cell. In that perspective, we used an improved version of the Shockley-Queisser theoretical model including non-radiative recombination via an external parameter defined as the internal quantum efficiency. Based on predicted optimal absorber layer thicknesses, the variation of the solar cell maximal efficiency is studied as a function of the non-radiative recombination rate. Maximal efficiencies of 25.88, 19.94 and 3.11% are reported respectively for CZTS, CZGS and CZSS for vanishing non-radiative recombination rate. Using an internal quantum efficiency providing $V_{OC}$ values comparable to experimental measurements, solar cell efficiencies of 15.88, 14.98 and 2.66% are reported respectively for CZTS, CZGS and CZSS (for an optimal thickness of 1.15 $μ$m). With this methodology, we confirm the suitability of CZTS in single junction solar cells, with a possible efficiency improvement of 10% enabled through the reduction of the non-radiative recombination rate. In addition, CZGS appears to be an interesting candidate as top cell absorber layer for tandem approaches whereas CZSS might be interesting for transparent PV windows. △ Less

Submitted 25 November, 2020; originally announced November 2020.

Comments: 11 pages, 6 figures and 3 tables

Report number: ULG-CESAM-SPIN-2020-03

Journal ref: Journal of Physics: Energy 3.3 (2021): 035005

arXiv:2010.00669 [pdf, other]

doi 10.1103/PhysRevApplied.15.034058

A roadmap for the design of four-terminal spin valves and the extraction of spin diffusion length

Authors: Emile Fourneau, Alejandro V. Silhanek, Ngoc D. Nguyen

Abstract: Graphene is a promising substrate for future spintronics devices owing to its remarkable electronic mobility and low spin-orbit coupling. Hanle precession in spin valve devices is commonly used to evaluate the spin diffusion and spin lifetime properties. In this work, we demonstrate that this method is no longer accurate when the distance between inner and outer electrodes is smaller than six time… ▽ More Graphene is a promising substrate for future spintronics devices owing to its remarkable electronic mobility and low spin-orbit coupling. Hanle precession in spin valve devices is commonly used to evaluate the spin diffusion and spin lifetime properties. In this work, we demonstrate that this method is no longer accurate when the distance between inner and outer electrodes is smaller than six times the spin diffusion length, leading to errors as large as 50% for the calculations of the spin figures of merit of graphene. We suggest simple but efficient approaches to circumvent this limitation by addressing a revised version of the Hanle fit function. Complementarily, we provide clear guidelines for the design of four-terminal spin valves able to yield flawless estimations of the spin lifetime and the spin diffusion coefficient. △ Less

Submitted 1 October, 2020; originally announced October 2020.

Comments: 7 pages, 5 figures

Journal ref: Phys. Rev. Applied 15, 034058 (2021)

arXiv:2003.10150 [pdf, other]

doi 10.1103/PhysRevApplied.14.024020

On the origin of the giant spin detection efficiency in tunnel barrier based electrical spin detector

Authors: Emile Fourneau, Alejandro V. Silhanek, Ngoc Duy Nguyen

Abstract: Efficient conversion of a spin signal into an electric voltage in mainstream semiconductors is one of the grand challenges of spintronics. This process is commonly achieved via a ferromagnetic tunnel barrier where non-linear electric transport occurs. In this work, we demonstrate that non-linearity may lead to a spin-to-charge conversion efficiency larger than 10 times the spin polarization of the… ▽ More Efficient conversion of a spin signal into an electric voltage in mainstream semiconductors is one of the grand challenges of spintronics. This process is commonly achieved via a ferromagnetic tunnel barrier where non-linear electric transport occurs. In this work, we demonstrate that non-linearity may lead to a spin-to-charge conversion efficiency larger than 10 times the spin polarization of the tunnel barrier when the latter is under bias of a few mV. We identify the underlying mechanisms responsible for this remarkably efficient spin detection as the tunnel barrier deformation and the conduction band shift resulting from a change of applied voltage. In addition, we derive an approximate analytical expression for the detector spin sensitivity $P_{\textrm{det}}(V)$. Calculations performed for different barrier shapes show that this enhancement is present in oxide barriers as well as in Schottky tunnel barriers even if the dominant mechanisms differs with the barrier type. Moreover, although the spin signal is reduced at high temperatures, it remains superior to the value predicted by the linear model. Our findings shed light into the interpretation and understanding of electrical spin detection experiments and open new paths to optimize the performance of spin transport devices. △ Less

Submitted 23 March, 2020; originally announced March 2020.

Comments: 11 pages, 9 figures

Report number: ULG-CESAM-SPIN-2020-01

Journal ref: Phys. Rev. Applied 14, 024020 (2020)

arXiv:2002.11883 [pdf, other]

doi 10.13140/RG.2.2.16789.06883

Review, Analysis and Design of a Comprehensive Deep Reinforcement Learning Framework

Authors: Ngoc Duy Nguyen, Thanh Thi Nguyen, Hai Nguyen, Doug Creighton, Saeid Nahavandi

Abstract: The integration of deep learning to reinforcement learning (RL) has enabled RL to perform efficiently in high-dimensional environments. Deep RL methods have been applied to solve many complex real-world problems in recent years. However, development of a deep RL-based system is challenging because of various issues such as the selection of a suitable deep RL algorithm, its network configuration, t… ▽ More The integration of deep learning to reinforcement learning (RL) has enabled RL to perform efficiently in high-dimensional environments. Deep RL methods have been applied to solve many complex real-world problems in recent years. However, development of a deep RL-based system is challenging because of various issues such as the selection of a suitable deep RL algorithm, its network configuration, training time, training methods, and so on. This paper proposes a comprehensive software framework that not only plays a vital role in designing a connect-the-dots deep RL architecture but also provides a guideline to develop a realistic RL application in a short time span. We have designed and developed a deep RL-based software framework that strictly ensures flexibility, robustness, and scalability. By inheriting the proposed architecture, software managers can foresee any challenges when designing a deep RL-based system. As a result, they can expedite the design process and actively control every stage of software development, which is especially critical in agile development environments. To enforce generalization, the proposed architecture does not depend on a specific RL algorithm, a network configuration, the number of agents, or the type of agents. Using our framework, software developers can develop and integrate new RL algorithms or new types of agents, and can flexibly change network configuration or the number of agents. △ Less

Submitted 23 February, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

arXiv:2002.11882 [pdf, other]

doi 10.13140/RG.2.2.13433.62563

A Visual Communication Map for Multi-Agent Deep Reinforcement Learning

Authors: Ngoc Duy Nguyen, Thanh Thi Nguyen, Doug Creighton, Saeid Nahavandi

Abstract: Deep reinforcement learning has been applied successfully to solve various real-world problems and the number of its applications in the multi-agent settings has been increasing. Multi-agent learning distinctly poses significant challenges in the effort to allocate a concealed communication medium. Agents receive thorough knowledge from the medium to determine subsequent actions in a distributed n… ▽ More Deep reinforcement learning has been applied successfully to solve various real-world problems and the number of its applications in the multi-agent settings has been increasing. Multi-agent learning distinctly poses significant challenges in the effort to allocate a concealed communication medium. Agents receive thorough knowledge from the medium to determine subsequent actions in a distributed nature. Apparently, the goal is to leverage the cooperation of multiple agents to achieve a designated objective efficiently. Recent studies typically combine a specialized neural network with reinforcement learning to enable communication between agents. This approach, however, limits the number of agents or necessitates the homogeneity of the system. In this paper, we have proposed a more scalable approach that not only deals with a great number of agents but also enables collaboration between dissimilar functional agents and compatibly combined with any deep reinforcement learning methods. Specifically, we create a global communication map to represent the status of each agent in the system visually. The visual map and the environmental state are fed to a shared-parameter network to train multiple agents concurrently. Finally, we select the Asynchronous Advantage Actor-Critic (A3C) algorithm to demonstrate our proposed scheme, namely Visual communication map for Multi-agent A3C (VMA3C). Simulation results show that the use of visual communication map improves the performance of A3C regarding learning speed, reward achievement, and robustness in multi-agent problems. △ Less

Submitted 23 February, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

arXiv:1911.03720 [pdf, other]

Trap** of electrons around nanoscale metallic wires embedded in a semiconductor medium

Authors: Chi Cuong Huynh, R. Evrard, Ngoc Duy Nguyen

Abstract: We predict that conduction electrons in a semiconductor film containing a centered square array of metal nanowires normal to its plane are bound in quantum states around the central wires, if a positive bias voltage is applied between the wires at the square vertices and these latter. We obtain and discuss the eigenenergies and eigenfunctions of two models with different dimensions. The results sh… ▽ More We predict that conduction electrons in a semiconductor film containing a centered square array of metal nanowires normal to its plane are bound in quantum states around the central wires, if a positive bias voltage is applied between the wires at the square vertices and these latter. We obtain and discuss the eigenenergies and eigenfunctions of two models with different dimensions. The results show that the eigenstates can be grouped into different shells. The energy differences between the shells is typically a few tens of meV, which corresponds to frequencies of emitted or absorbed photons in a range of 3 THz to 20 THz approximately. These energy differences strongly depend on the bias voltage. We calculate the linear response of individual electrons on the ground level of our models to large-wavelength electromagnetic waves whose electric field is in the plane of the semiconductor film. The computed oscillator strengths are dominated by the transitions to the states in each shell whose wave function has a single radial node line normal to the wave electric field. We include the effect of the image charge induced on the central metal wires and show that it modifies the oscillator strengths so that their sum deviates from the value given by the Thomas-Reiche-Kuhn rule. We report the linear response, or polarizability, versus photon energy, of the studied models and their absorption spectra. These latter show well-defined peaks as expected from the study of the oscillator strengths. We show that the position of these absorption peaks is strongly dependent on the bias voltage so that the frequency of photon absorption or emission in the systems described here is easily tunable. This makes them good candidates for the development of novel infrared devices. △ Less

Submitted 9 November, 2019; originally announced November 2019.

Comments: 15 pages, 15 figures, 23 references

Report number: ULG-CESAM-SPIN-2019-01

arXiv:1902.05183 [pdf, other]

Manipulating Soft Tissues by Deep Reinforcement Learning for Autonomous Robotic Surgery

Authors: Ngoc Duy Nguyen, Thanh Nguyen, Saeid Nahavandi, Asim Bhatti, Glenn Guest

Abstract: In robotic surgery, pattern cutting through a deformable material is a challenging research field. The cutting procedure requires a robot to concurrently manipulate a scissor and a gripper to cut through a predefined contour trajectory on the deformable sheet. The gripper ensures the cutting accuracy by nailing a point on the sheet and continuously tensioning the pinch point to different direction… ▽ More In robotic surgery, pattern cutting through a deformable material is a challenging research field. The cutting procedure requires a robot to concurrently manipulate a scissor and a gripper to cut through a predefined contour trajectory on the deformable sheet. The gripper ensures the cutting accuracy by nailing a point on the sheet and continuously tensioning the pinch point to different directions while the scissor is in action. The goal is to find a pinch point and a corresponding tensioning policy to minimize damage to the material and increase cutting accuracy measured by the symmetric difference between the predefined contour and the cut contour. Previous study considers finding one fixed pinch point during the course of cutting, which is inaccurate and unsafe when the contour trajectory is complex. In this paper, we examine the soft tissue cutting task by using multiple pinch points, which imitates human operations while cutting. This approach, however, does not require the use of a multi-gripper robot. We use a deep reinforcement learning algorithm to find an optimal tensioning policy of a pinch point. Simulation results show that the multi-point approach outperforms the state-of-the-art method in soft pattern cutting task with respect to both accuracy and reliability. △ Less

Submitted 13 February, 2019; originally announced February 2019.

arXiv:1901.03327 [pdf, other]

doi 10.1109/ICIT.2019.8755235

A New Tensioning Method using Deep Reinforcement Learning for Surgical Pattern Cutting

Authors: Thanh Thi Nguyen, Ngoc Duy Nguyen, Fernando Bello, Saeid Nahavandi

Abstract: Surgeons normally need surgical scissors and tissue grippers to cut through a deformable surgical tissue. The cutting accuracy depends on the skills to manipulate these two tools. Such skills are part of basic surgical skills training as in the Fundamentals of Laparoscopic Surgery. The gripper is used to pinch a point on the surgical sheet and pull the tissue to a certain direction to maintain the… ▽ More Surgeons normally need surgical scissors and tissue grippers to cut through a deformable surgical tissue. The cutting accuracy depends on the skills to manipulate these two tools. Such skills are part of basic surgical skills training as in the Fundamentals of Laparoscopic Surgery. The gripper is used to pinch a point on the surgical sheet and pull the tissue to a certain direction to maintain the tension while the scissors cut through a trajectory. As the surgical materials are deformable, it requires a comprehensive tensioning policy to yield appropriate tensioning direction at each step of the cutting process. Automating a tensioning policy for a given cutting trajectory will support not only the human surgeons but also the surgical robots to improve the cutting accuracy and reliability. This paper presents a multiple pinch point approach to modelling an autonomous tensioning planner based on a deep reinforcement learning algorithm. Experiments on a simulator show that the proposed method is superior to existing methods in terms of both performance and robustness. △ Less

Submitted 10 January, 2019; originally announced January 2019.

Comments: 2019 IEEE International Conference on Industrial Technology (ICIT), Melbourne, Australia (to appear)

Journal ref: 2019 IEEE International Conference on Industrial Technology (ICIT)

arXiv:1812.11794 [pdf, other]

doi 10.1109/TCYB.2020.2977374

Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications

Authors: Thanh Thi Nguyen, Ngoc Duy Nguyen, Saeid Nahavandi

Abstract: Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms however have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in the… ▽ More Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms however have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This paper addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multi-agent deep RL (MADRL) is presented, including non-stationarity, partial observability, continuous state and action spaces, multi-agent training schemes, multi-agent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed, with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to future development of more robust and highly useful multi-agent learning methods for solving real-world problems. △ Less

Submitted 6 February, 2019; v1 submitted 31 December, 2018; originally announced December 2018.

Report number: https://ieeexplore.ieee.org/document/9043893

Journal ref: IEEE Transactions on Cybernetics, 20 March 2020

arXiv:1812.06582

Quasi one-shot full-field surface profilometry using digital diffractive-confocal imaging correlation microscope

Authors: Duc Trung Nguyen, Liang-Chia Chen, Nguyen Dinh Nguyen

Abstract: One-shot full-field surface profilometry using digital diffractive-confocal imaging correlation microscope based on digital micromirror device is developed for one-shot microscopic 3D surface measurement. Optical configuration applies confocal microscope setup and was building on DMD to generate specific pinhole array arrangement for minimizing cross talk effect. An innovative method was invented… ▽ More One-shot full-field surface profilometry using digital diffractive-confocal imaging correlation microscope based on digital micromirror device is developed for one-shot microscopic 3D surface measurement. Optical configuration applies confocal microscope setup and was building on DMD to generate specific pinhole array arrangement for minimizing cross talk effect. An innovative method was invented to create normalized cross correlation depth response curve from diffraction patterns of the pinhole. Using this approach, the sub-micrometer scale depth can be detected with high accuracy and precision. △ Less

Submitted 11 April, 2019; v1 submitted 16 December, 2018; originally announced December 2018.

Comments: Conflict of interest between authors

arXiv:1812.06573

Innovative full-field chromatic confocal microscopy using multispectral sensors

Authors: Liang-Chia Chen, Pei-Ju Tan, Chih-Jer Lin, Duc Trung Nguyen, Yu-Shuan Chou, Nguyen Dinh Nguyen, Nguyen Thanh Trung

Abstract: A full-field chromatic confocal microscopy using a multispectral sensor was developed for quasi-one-shot microscopic 3D surface measurement. An innovative optical configuration employs a digital micromirror device (DMD) and a multispectral sensor is used to realize chromatic confocal microscopy with full-field area scanning. In the optical design, an area-scan type chromatic dispersive objective i… ▽ More A full-field chromatic confocal microscopy using a multispectral sensor was developed for quasi-one-shot microscopic 3D surface measurement. An innovative optical configuration employs a digital micromirror device (DMD) and a multispectral sensor is used to realize chromatic confocal microscopy with full-field area scanning. In the optical design, an area-scan type chromatic dispersive objective is specially designed to achieve measuring specification. Based on an 8x chromatic dispersive objective, the FOV for one shot measurement can be reached to 1.8mm*1.3mm which is immersive to microscopic profilometry. The spectral image captured by the multispectral sensor at each pinhole position has a unique spectrum pattern corresponding to its conjugate measured depth. A normalized cross-correlation (NCC) algorithm is developed to establish a spectrum-depth response curve with its corresponding spectrum pattern sets for accurate reconstruction of the tested 3D surface profile. With real test on standard targets, the measurement repeatability for a single surface depth is less than 0.6 micrometer. △ Less

Submitted 11 April, 2019; v1 submitted 16 December, 2018; originally announced December 2018.

Comments: Conflict of interest between co-authors

arXiv:1811.02668 [pdf]

Automated Diagnosis of Lymphoma with Digital Pathology Images Using Deep Learning

Authors: Hanadi El Achi, Tatiana Belousova, Lei Chen, Amer Wahed, Iris Wang, Zhihong Hu, Zeyad Kanaan, Adan Rios, Andy N. D. Nguyen

Abstract: Recent studies have shown promising results in using Deep Learning to detect malignancy in whole slide imaging. However, they were limited to just predicting positive or negative finding for a specific neoplasm. We attempted to use Deep Learning with a convolutional neural network algorithm to build a lymphoma diagnostic model for four diagnostic categories: benign lymph node, diffuse large B cell… ▽ More Recent studies have shown promising results in using Deep Learning to detect malignancy in whole slide imaging. However, they were limited to just predicting positive or negative finding for a specific neoplasm. We attempted to use Deep Learning with a convolutional neural network algorithm to build a lymphoma diagnostic model for four diagnostic categories: benign lymph node, diffuse large B cell lymphoma, Burkitt lymphoma, and small lymphocytic lymphoma. Our software was written in Python language. We obtained digital whole slide images of Hematoxylin and Eosin stained slides of 128 cases including 32 cases for each diagnostic category. Four sets of 5 representative images, 40x40 pixels in dimension, were taken for each case. A total of 2,560 images were obtained from which 1,856 were used for training, 464 for validation and 240 for testing. For each test set of 5 images, the predicted diagnosis was combined from prediction of 5 images. The test results showed excellent diagnostic accuracy at 95% for image-by-image prediction and at 10% for set-by-set prediction. This preliminary study provided a proof of concept for incorporating automated lymphoma diagnostic screen into future pathology workflow to augment the pathologists' productivity. △ Less

Submitted 30 October, 2018; originally announced November 2018.

Comments: 13 pages, 2 figures, 2 tables

arXiv:1810.13247 [pdf]

Application of Deep Learning on Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations

Authors: Mei Lin, Vanya Jaitly, Iris Wang, Zhihong Hu, Lei Chen, Md. Amer Wahed, Zeyad Kanaan, Adan Rios, Andy N. D. Nguyen

Abstract: We explore how Deep Learning (DL) can be utilized to predict prognosis of acute myeloid leukemia (AML). Out of TCGA (The Cancer Genome Atlas) database, 94 AML cases are used in this study. Input data include age, 10 common cytogenetic and 23 most common mutation results; output is the prognosis (diagnosis to death, DTD). In our DL network, autoencoders are stacked to form a hierarchical DL model f… ▽ More We explore how Deep Learning (DL) can be utilized to predict prognosis of acute myeloid leukemia (AML). Out of TCGA (The Cancer Genome Atlas) database, 94 AML cases are used in this study. Input data include age, 10 common cytogenetic and 23 most common mutation results; output is the prognosis (diagnosis to death, DTD). In our DL network, autoencoders are stacked to form a hierarchical DL model from which raw data are compressed and organized and high-level features are extracted. The network is written in R language and is designed to predict prognosis of AML for a given case (DTD of more than or less than 730 days). The DL network achieves an excellent accuracy of 83% in predicting prognosis. As a proof-of-concept study, our preliminary results demonstrate a practical application of DL in future practice of prognostic prediction using next-gen sequencing (NGS) data. △ Less

Submitted 30 October, 2018; originally announced October 2018.

Comments: 11 pages, 1 table, 1 figure. arXiv admin note: substantial text overlap with arXiv:1801.01019

arXiv:1806.04562 [pdf, other]

doi 10.1109/ICIT.2019.8755032

Multi-Agent Deep Reinforcement Learning with Human Strategies

Authors: Thanh Nguyen, Ngoc Duy Nguyen, Saeid Nahavandi

Abstract: Deep learning has enabled traditional reinforcement learning methods to deal with high-dimensional problems. However, one of the disadvantages of deep reinforcement learning methods is the limited exploration capacity of learning agents. In this paper, we introduce an approach that integrates human strategies to increase the exploration capacity of multiple deep reinforcement learning agents. We a… ▽ More Deep learning has enabled traditional reinforcement learning methods to deal with high-dimensional problems. However, one of the disadvantages of deep reinforcement learning methods is the limited exploration capacity of learning agents. In this paper, we introduce an approach that integrates human strategies to increase the exploration capacity of multiple deep reinforcement learning agents. We also report the development of our own multi-agent environment called Multiple Tank Defence to simulate the proposed approach. The results show the significant performance improvement of multiple agents that have learned cooperatively with human strategies. This implies that there is a critical need for human intellect teamed with machines to solve complex problems. In addition, the success of this simulation indicates that our multi-agent environment can be used as a testbed platform to develop and validate other multi-agent control algorithms. △ Less

Submitted 30 May, 2019; v1 submitted 12 June, 2018; originally announced June 2018.

Comments: 2019 IEEE International Conference on Industrial Technology (ICIT), Melbourne, Australia

Journal ref: 2019 IEEE International Conference on Industrial Technology (ICIT)

arXiv:1804.01874 [pdf, other]

doi 10.1109/SMC.2018.00682

A Human Mixed Strategy Approach to Deep Reinforcement Learning

Authors: Ngoc Duy Nguyen, Saeid Nahavandi, Thanh Nguyen

Abstract: In 2015, Google's DeepMind announced an advancement in creating an autonomous agent based on deep reinforcement learning (DRL) that could beat a professional player in a series of 49 Atari games. However, the current manifestation of DRL is still immature, and has significant drawbacks. One of DRL's imperfections is its lack of "exploration" during the training process, especially when working wit… ▽ More In 2015, Google's DeepMind announced an advancement in creating an autonomous agent based on deep reinforcement learning (DRL) that could beat a professional player in a series of 49 Atari games. However, the current manifestation of DRL is still immature, and has significant drawbacks. One of DRL's imperfections is its lack of "exploration" during the training process, especially when working with high-dimensional problems. In this paper, we propose a mixed strategy approach that mimics behaviors of human when interacting with environment, and create a "thinking" agent that allows for more efficient exploration in the DRL training process. The simulation results based on the Breakout game show that our scheme achieves a higher probability of obtaining a maximum score than does the baseline DRL algorithm, i.e., the asynchronous advantage actor-critic method. The proposed scheme therefore can be applied effectively to solving a complicated task in a real-world application. △ Less

Submitted 5 April, 2018; originally announced April 2018.

Journal ref: 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

arXiv:1803.08067 [pdf]

doi 10.1109/JSYST.2019.2918283

A Review of Situation Awareness Assessment Approaches in Aviation Environments

Authors: Thanh Nguyen, Chee Peng Lim, Ngoc Duy Nguyen, Lee Gordon-Brown, Saeid Nahavandi

Abstract: Situation awareness (SA) is an important constituent in human information processing and essential in pilots' decision-making processes. Acquiring and maintaining appropriate levels of SA is critical in aviation environments as it affects all decisions and actions taking place in flights and air traffic control. This paper provides an overview of recent measurement models and approaches to establi… ▽ More Situation awareness (SA) is an important constituent in human information processing and essential in pilots' decision-making processes. Acquiring and maintaining appropriate levels of SA is critical in aviation environments as it affects all decisions and actions taking place in flights and air traffic control. This paper provides an overview of recent measurement models and approaches to establishing and enhancing SA in aviation environments. Many aspects of SA are examined including the classification of SA techniques into six categories, and different theoretical SA models from individual, to shared or team, and to distributed or system levels. Quantitative and qualitative perspectives pertaining to SA methods and issues of SA for unmanned vehicles are also addressed. Furthermore, future research directions regarding SA assessment approaches are raised to deal with shortcomings of the existing state-of-the-art methods in the literature. △ Less

Submitted 7 June, 2019; v1 submitted 6 March, 2018; originally announced March 2018.

Comments: IEEE Systems Journal, https://ieeexplore.ieee.org/document/8732669

arXiv:1803.02965 [pdf]

doi 10.1016/j.engappai.2020.103915

A Multi-Objective Deep Reinforcement Learning Framework

Authors: Thanh Thi Nguyen, Ngoc Duy Nguyen, Peter Vamplew, Saeid Nahavandi, Richard Dazeley, Chee Peng Lim

Abstract: This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We develop a high-performance MODRL framework that supports both single-policy and multi-policy strategies, as well as both linear and non-linear approaches to action selection. The experimental results on two benchmark problems (two-objective deep sea treasure environment a… ▽ More This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We develop a high-performance MODRL framework that supports both single-policy and multi-policy strategies, as well as both linear and non-linear approaches to action selection. The experimental results on two benchmark problems (two-objective deep sea treasure environment and three-objective Mountain Car problem) indicate that the proposed framework is able to find the Pareto-optimal solutions effectively. The proposed framework is generic and highly modularized, which allows the integration of different deep reinforcement learning algorithms in different complex problem domains. This therefore overcomes many disadvantages involved with standard multi-objective reinforcement learning methods in the current literature. The proposed framework acts as a testbed platform that accelerates the development of MODRL for solving increasingly complicated multi-objective problems. △ Less

Submitted 19 June, 2020; v1 submitted 7 March, 2018; originally announced March 2018.

Comments: 21 pages

Report number: Volume 96, November 2020, 103915

Journal ref: Engineering Applications of Artificial Intelligence, 2020

arXiv:1801.01019 [pdf]

Proteomics Analysis of FLT3-ITD Mutation in Acute Myeloid Leukemia Using Deep Learning Neural Network

Authors: Christine A. Liang, Lei Chen, Amer Wahed, Andy N. D. Nguyen

Abstract: Deep Learning can significantly benefit cancer proteomics and genomics. In this study, we attempt to determine a set of critical proteins that are associated with the FLT3-ITD mutation in newly-diagnosed acute myeloid leukemia patients. A Deep Learning network consisting of autoencoders forming a hierarchical model from which high-level features are extracted without labeled training data. Dimensi… ▽ More Deep Learning can significantly benefit cancer proteomics and genomics. In this study, we attempt to determine a set of critical proteins that are associated with the FLT3-ITD mutation in newly-diagnosed acute myeloid leukemia patients. A Deep Learning network consisting of autoencoders forming a hierarchical model from which high-level features are extracted without labeled training data. Dimensional reduction reduced the number of critical proteins from 231 to 20. Deep Learning found an excellent correlation between FLT3-ITD mutation with the levels of these 20 critical proteins (accuracy 97%, sensitivity 90%, specificity 100%). Our Deep Learning network could hone in on 20 proteins with the strongest association with FLT3-ITD. The results of this study allow a novel approach to determine critical protein pathways in the FLT3-ITD mutation, and provide proof-of-concept for an accurate approach to model big data in cancer proteomics and genomics. △ Less

Submitted 29 December, 2017; originally announced January 2018.

Comments: 12 pages, 4 figures, 2 tables

arXiv:1709.03355 [pdf, ps, other]

doi 10.1038/s41467-018-04024-y

Evidence for Z=6 `magic number' in neutron-rich carbon isotopes

Authors: D. T. Tran, H. J. Ong, G. Hagen, T. D. Morris, N. Aoi, T. Suzuki, Y. Kanada-En'yo, L. S. Geng, S. Terashima, I. Tanihata, T. T. Nguyen, Y. Ayyad, P. Y. Chan, M. Fukuda, H. Geissel, M. N. Harakeh, T. Hashimoto, T. H. Hoang, E. Ideguchi, A. Inoue, G. R. Jansen, R. Kanungo, T. Kawabata, L. H. Khiem, W. P. Lin , et al. (15 additional authors not shown)

Abstract: The nuclear shell structure, which originates in the nearly independent motion of nucleons in an average potential, provides an important guide for our understanding of nuclear structure and the underlying nuclear forces. Its most remarkable fingerprint is the existence of the so-called `magic numbers' of protons and neutrons associated with extra stability. Although the introduction of a phenomen… ▽ More The nuclear shell structure, which originates in the nearly independent motion of nucleons in an average potential, provides an important guide for our understanding of nuclear structure and the underlying nuclear forces. Its most remarkable fingerprint is the existence of the so-called `magic numbers' of protons and neutrons associated with extra stability. Although the introduction of a phenomenological spin-orbit (SO) coupling force in 1949 helped explain the nuclear magic numbers, its origins are still open questions. Here, we present experimental evidence for the smallest SO-originated magic number (subshell closure) at the proton number 6 in 13-20C obtained from systematic analysis of point-proton distribution radii, electromagnetic transition rates and atomic masses of light nuclei. Performing ab initio calculations on 14,15C, we show that the observed proton distribution radii and subshell closure can be explained by the state-of-the-art nuclear theory with chiral nucleon-nucleon and three-nucleon forces, which are rooted in the quantum chromodynamics. △ Less

Submitted 11 September, 2017; originally announced September 2017.

Comments: 7 pages, 5 figures

arXiv:1606.08575 [pdf, ps, other]

doi 10.1103/PhysRevC.94.064604

Charge-changing-cross-section measurements of $^{12-16}$C at around $45A$ MeV and development of a Glauber model for incident energies $10A-2100A$ MeV

Authors: D. T. Tran, H. J. Ong, T. T. Nguyen, I. Tanihata, N. Aoi, Y. Ayyad, P. Y. Chan, M. Fukuda, T. Hashimoto, T. H. Hoang, E. Ideguchi, A. Inoue, T. Kawabata, L. H. Khiem, W. P. Lin, K. Matsuta, M. Mihara, S. Momota, D. Nagae, N. D. Nguyen, D. Nishimura, A. Ozawa, P. P. Ren, H. Sakaguchi, J. Tanaka , et al. (4 additional authors not shown)

Abstract: We have measured for the first time the charge-changing cross sections ($σ_{\text{CC}}$) of $^{12-16}$C on a $^{12}$C target at energies below $100A$ MeV. To analyze these low-energy data, we have developed a finite-range Glauber model with a global parameter set within the optical-limit approximation which is applicable to reaction cross section ($σ_{\text{R}}$) and $σ_{\text{CC}}$ measurements a… ▽ More We have measured for the first time the charge-changing cross sections ($σ_{\text{CC}}$) of $^{12-16}$C on a $^{12}$C target at energies below $100A$ MeV. To analyze these low-energy data, we have developed a finite-range Glauber model with a global parameter set within the optical-limit approximation which is applicable to reaction cross section ($σ_{\text{R}}$) and $σ_{\text{CC}}$ measurements at incident energies from 10$A$ to $2100A$ MeV. Adopting the proton-density distribution of $^{12}$C known from the electron-scattering data, as well as the bare total nucleon-nucleon cross sections, and the real-to-imaginary-part ratios of the forward proton-proton elastic scattering amplitude available in the literatures, we determine the energy-dependent slope parameter $β_{\rm pn}$ of the proton-neutron elastic differential cross section so as to reproduce the existing $σ_{\text{R}}$ and interaction-cross-section data for $^{12}$C+$^{12}$C over a wide range of incident energies. The Glauber model thus formulated is applied to calculate the $σ_{\text{\tiny R}}$'s of $^{12}$C on a $^9$Be and $^{27}$Al targets at various incident energies. Our calculations show excellent agreement with the experimental data. Applying our model to the $σ_{\text{\tiny R}}$ and $σ_{\text{\tiny CC}}$ for the "neutron-skin" $^{16}$C nucleus, we reconfirm the importance of measurements at incident energies below $100A$ MeV. The proton root-mean-square radii of $^{12-16}$C are extracted using the measured $σ_{\text{CC}}$'s and the existing $σ_{\text{R}}$ data. The results for $^{12-14}$C are consistent with the values from the electron scatterings, demonstrating the feasibility, usefulness of the $σ_{\text{CC}}$ measurement and the present Glauber model. △ Less

Submitted 28 June, 2016; originally announced June 2016.

Comments: 8 pages, 4 figures

Journal ref: Phys. Rev. C 94, 064604 (2016)

arXiv:1603.06213 [pdf, other]

Note on the numerical solution of the scalar Helmholtz equation in a nanotorus with uniform Dirichlet boundary conditions

Authors: N. D. Nguyen, R. Evrard, Michael A. Stroscio

Abstract: This note describes the solution of the Helmholtz equation inside a nanotorus with uniform Dirichlet boundary conditions. The eigenfunction symmetry is discussed and the lower-order eigenvalues and eigenfunctions are shown. The similarity with the case of a long cylinder and with that of the vibrations of a circular elastic membrane is discussed. This similarity is used to propose a classification… ▽ More This note describes the solution of the Helmholtz equation inside a nanotorus with uniform Dirichlet boundary conditions. The eigenfunction symmetry is discussed and the lower-order eigenvalues and eigenfunctions are shown. The similarity with the case of a long cylinder and with that of the vibrations of a circular elastic membrane is discussed. This similarity is used to propose a classification scheme of the eigenfunctions based on three indices. △ Less

Submitted 20 March, 2016; originally announced March 2016.

Report number: ULG-CESAM-SPIN-2016-01

arXiv:1408.2420 [pdf, other]

doi 10.1088/1367-2630/16/10/103003

Classical analogy for the deflection of flux avalanches by a metallic layer

Authors: J. Brisbois, B. Vanderheyden, F. Colauto, M. Motta, W. A. Ortiz, J. Fritzsche, N. D. Nguyen, B. Hackens, O. -A. Adami, A. V. Silhanek

Abstract: Sudden avalanches of magnetic flux bursting into a superconducting sample undergo deflections of their trajectories when encountering a conductive layer deposited on top of the superconductor. Remarkably, in some cases flux is totally excluded from the area covered by the conductive layer. We present a simple classical model that accounts for this behaviour and considers a magnetic monopole approa… ▽ More Sudden avalanches of magnetic flux bursting into a superconducting sample undergo deflections of their trajectories when encountering a conductive layer deposited on top of the superconductor. Remarkably, in some cases flux is totally excluded from the area covered by the conductive layer. We present a simple classical model that accounts for this behaviour and considers a magnetic monopole approaching a semi-infinite conductive plane. This model suggests that magnetic braking is an important mechanism responsible for avalanche deflection. △ Less

Submitted 11 August, 2014; originally announced August 2014.

Comments: 14 pages, 5 figures

arXiv:1111.4052 [pdf]

A Facial Expression Classification System Integrating Canny, Principal Component Analysis and Artificial Neural Network

Authors: Le Hoang Thai, Nguyen Do Thai Nguyen, Tran Son Hai

Abstract: Facial Expression Classification is an interesting research problem in recent years. There are a lot of methods to solve this problem. In this research, we propose a novel approach using Canny, Principal Component Analysis (PCA) and Artificial Neural Network. Firstly, in preprocessing phase, we use Canny for local region detection of facial images. Then each of local region's features will be pres… ▽ More Facial Expression Classification is an interesting research problem in recent years. There are a lot of methods to solve this problem. In this research, we propose a novel approach using Canny, Principal Component Analysis (PCA) and Artificial Neural Network. Firstly, in preprocessing phase, we use Canny for local region detection of facial images. Then each of local region's features will be presented based on Principal Component Analysis (PCA). Finally, using Artificial Neural Network (ANN)applies for Facial Expression Classification. We apply our proposal method (Canny_PCA_ANN) for recognition of six basic facial expressions on JAFFE database consisting 213 images posed by 10 Japanese female models. The experimental result shows the feasibility of our proposal method. △ Less

Submitted 17 November, 2011; originally announced November 2011.

Comments: 6 pages, 10 figures, International Journal of Machine Learning and Computing, Vol. 1, No. 4, October 2011, ISSN (Online): 2010-3700, http://www.ijmlc.org/

Journal ref: International Journal of Machine Learning and Computing, Vol. 1, No. 4, 2011, 388-393

Showing 1–42 of 42 results for author: Nguyen, N D