-
Forget but Recall: Incremental Latent Rectification in Continual Learning
Authors:
Nghia D. Nguyen,
Hieu Trung Nguyen,
Ang Li,
Hoang Pham,
Viet Anh Nguyen,
Khoa D. Doan
Abstract:
Intrinsic capability to continuously learn a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which hinders remembering past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches either retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper in…
▽ More
Intrinsic capability to continuously learn a changing data stream is a desideratum of deep neural networks (DNNs). However, current DNNs suffer from catastrophic forgetting, which hinders remembering past knowledge. To mitigate this issue, existing Continual Learning (CL) approaches either retain exemplars for replay, regularize learning, or allocate dedicated capacity for new tasks. This paper investigates an unexplored CL direction for incremental learning called Incremental Latent Rectification or ILR. In a nutshell, ILR learns to propagate with correction (or rectify) the representation from the current trained DNN backward to the representation space of the old task, where performing predictive decisions is easier. This rectification process only employs a chain of small representation map** networks, called rectifier units. Empirical experiments on several continual learning benchmarks, including CIFAR10, CIFAR100, and Tiny ImageNet, demonstrate the effectiveness and potential of this novel CL direction compared to existing representative CL methods.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses
Authors:
Jionghao Lin,
Eason Chen,
Zeifei Han,
Ashish Gurung,
Danielle R. Thomas,
Wei Tan,
Ngoc Dang Nguyen,
Kenneth R. Koedinger
Abstract:
Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study…
▽ More
Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study leverages the capabilities of large language models, specifically Generative Pre-Trained Transformers (GPT), to explore a sequence labeling approach focused on identifying components of desired and less desired praise for providing explanatory feedback within a tutor training dataset. Our aim is to equip tutors with actionable, explanatory feedback during online training lessons. To investigate the potential of GPT models for providing the explanatory feedback, we employed two commonly-used approaches: prompting and fine-tuning. To quantify the quality of highlighted praise components identified by GPT models, we introduced a Modified Intersection over Union (M-IoU) score. Our findings demonstrate that: (1) the M-IoU score effectively correlates with human judgment in evaluating sequence quality; (2) using two-shot prompting on GPT-3.5 resulted in decent performance in recognizing effort-based (M-IoU of 0.46) and outcome-based praise (M-IoU of 0.68); and (3) our optimally fine-tuned GPT-3.5 model achieved M-IoU scores of 0.64 for effort-based praise and 0.84 for outcome-based praise, aligning with the satisfaction levels evaluated by human coders. Our results show promise for using GPT models to provide feedback that focuses on specific elements in their open-ended responses that are desirable or could use improvement.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
Harnessing the Power of Beta Scoring in Deep Active Learning for Multi-Label Text Classification
Authors:
Wei Tan,
Ngoc Dang Nguyen,
Lan Du,
Wray Buntine
Abstract:
Within the scope of natural language processing, the domain of multi-label text classification is uniquely challenging due to its expansive and uneven label distribution. The complexity deepens due to the demand for an extensive set of annotated data for training an advanced deep learning model, especially in specialized fields where the labeling task can be labor-intensive and often requires doma…
▽ More
Within the scope of natural language processing, the domain of multi-label text classification is uniquely challenging due to its expansive and uneven label distribution. The complexity deepens due to the demand for an extensive set of annotated data for training an advanced deep learning model, especially in specialized fields where the labeling task can be labor-intensive and often requires domain-specific knowledge. Addressing these challenges, our study introduces a novel deep active learning strategy, capitalizing on the Beta family of proper scoring rules within the Expected Loss Reduction framework. It computes the expected increase in scores using the Beta Scoring Rules, which are then transformed into sample vector representations. These vector representations guide the diverse selection of informative samples, directly linking this process to the model's expected proper score. Comprehensive evaluations across both synthetic and real datasets reveal our method's capability to often outperform established acquisition techniques in multi-label text classification, presenting encouraging outcomes across various architectural and dataset scenarios.
△ Less
Submitted 14 January, 2024;
originally announced January 2024.
-
Study of cognitive component of auditory attention to natural speech events
Authors:
Nhan D. T. Nguyen,
Kaare Mikkelsen,
Preben Kidmose
Abstract:
Event-related potentials (ERP) have been used to address a wide range of research questions in neuroscience and cognitive psychology including selective auditory attention. The recent progress in auditory attention decoding (AAD) methods is based on algorithms that find a relation between the audio envelope and the neurophysiological response. The most popular approach is based on the reconstructi…
▽ More
Event-related potentials (ERP) have been used to address a wide range of research questions in neuroscience and cognitive psychology including selective auditory attention. The recent progress in auditory attention decoding (AAD) methods is based on algorithms that find a relation between the audio envelope and the neurophysiological response. The most popular approach is based on the reconstruction of the audio envelope based on EEG signals. However, these methods are mainly based on the neurophysiological entrainment to physical attributes of the sensory stimulus and are generally limited by a long detection window. This study proposes a novel approach to auditory attention decoding by looking at higher-level cognitive responses to natural speech. To investigate if natural speech events elicit cognitive ERP components and how these components are affected by attention mechanisms, we designed a series of four experimental paradigms with increasing complexity: a word category oddball paradigm, a word category oddball paradigm with competing speakers, and competing speech streams with and without specific targets. We recorded the electroencephalogram (EEG) from 32 scalp electrodes and 12 in-ear electrodes (ear-EEG) from 24 participants. A cognitive ERP component, which we believe is related to the well-known P3b component, was observed at parietal electrode sites with a latency of approximately 620 ms. The component is statistically most significant for the simplest paradigm and gradually decreases in strength with increasing complexity of the paradigm. We also show that the component can be observed in the in-ear EEG signals by using spatial filtering. The cognitive component elicited by auditory attention may contribute to decoding auditory attention from electrophysiological recordings and its presence in the ear-EEG signals is promising for future applications within hearing aids.
△ Less
Submitted 19 December, 2023; v1 submitted 16 December, 2023;
originally announced December 2023.
-
Low-Resource Named Entity Recognition: Can One-vs-All AUC Maximization Help?
Authors:
Ngoc Dang Nguyen,
Wei Tan,
Lan Du,
Wray Buntine,
Richard Beare,
Changyou Chen
Abstract:
Named entity recognition (NER), a task that identifies and categorizes named entities such as persons or organizations from text, is traditionally framed as a multi-class classification problem. However, this approach often overlooks the issues of imbalanced label distributions, particularly in low-resource settings, which is common in certain NER contexts, like biomedical NER (bioNER). To address…
▽ More
Named entity recognition (NER), a task that identifies and categorizes named entities such as persons or organizations from text, is traditionally framed as a multi-class classification problem. However, this approach often overlooks the issues of imbalanced label distributions, particularly in low-resource settings, which is common in certain NER contexts, like biomedical NER (bioNER). To address these issues, we propose an innovative reformulation of the multi-class problem as a one-vs-all (OVA) learning problem and introduce a loss function based on the area under the receiver operating characteristic curve (AUC). To enhance the efficiency of our OVA-based approach, we propose two training strategies: one groups labels with similar linguistic characteristics, and another employs meta-learning. The superiority of our approach is confirmed by its performance, which surpasses traditional NER learning in varying NER settings.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Re-weighting Tokens: A Simple and Effective Active Learning Strategy for Named Entity Recognition
Authors:
Haocheng Luo,
Wei Tan,
Ngoc Dang Nguyen,
Lan Du
Abstract:
Active learning, a widely adopted technique for enhancing machine learning models in text and image classification tasks with limited annotation resources, has received relatively little attention in the domain of Named Entity Recognition (NER). The challenge of data imbalance in NER has hindered the effectiveness of active learning, as sequence labellers lack sufficient learning signals. To addre…
▽ More
Active learning, a widely adopted technique for enhancing machine learning models in text and image classification tasks with limited annotation resources, has received relatively little attention in the domain of Named Entity Recognition (NER). The challenge of data imbalance in NER has hindered the effectiveness of active learning, as sequence labellers lack sufficient learning signals. To address these challenges, this paper presents a novel reweighting-based active learning strategy that assigns dynamic smoothed weights to individual tokens. This adaptable strategy is compatible with various token-level acquisition functions and contributes to the development of robust active learners. Experimental results on multiple corpora demonstrate the substantial performance improvement achieved by incorporating our re-weighting strategy into existing acquisition functions, validating its practical efficacy.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
Using Large Language Models to Provide Explanatory Feedback to Human Tutors
Authors:
Jionghao Lin,
Danielle R. Thomas,
Feifei Han,
Shivang Gupta,
Wei Tan,
Ngoc Dang Nguyen,
Kenneth R. Koedinger
Abstract:
Research demonstrates learners engaging in the process of producing explanations to support their reasoning, can have a positive impact on learning. However, providing learners real-time explanatory feedback often presents challenges related to classification accuracy, particularly in domain-specific environments, containing situationally complex and nuanced responses. We present two approaches fo…
▽ More
Research demonstrates learners engaging in the process of producing explanations to support their reasoning, can have a positive impact on learning. However, providing learners real-time explanatory feedback often presents challenges related to classification accuracy, particularly in domain-specific environments, containing situationally complex and nuanced responses. We present two approaches for supplying tutors real-time feedback within an online lesson on how to give students effective praise. This work-in-progress demonstrates considerable accuracy in binary classification for corrective feedback of effective, or effort-based (F1 score = 0.811), and ineffective, or outcome-based (F1 score = 0.350), praise responses. More notably, we introduce progress towards an enhanced approach of providing explanatory feedback using large language model-facilitated named entity recognition, which can provide tutors feedback, not only while engaging in lessons, but can potentially suggest real-time tutor moves. Future work involves leveraging large language models for data augmentation to improve accuracy, while also develo** an explanatory feedback interface.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
Robust Educational Dialogue Act Classifiers with Low-Resource and Imbalanced Datasets
Authors:
Jionghao Lin,
Wei Tan,
Ngoc Dang Nguyen,
David Lang,
Lan Du,
Wray Buntine,
Richard Beare,
Guanliang Chen,
Dragan Gasevic
Abstract:
Dialogue acts (DAs) can represent conversational actions of tutors or students that take place during tutoring dialogues. Automating the identification of DAs in tutoring dialogues is significant to the design of dialogue-based intelligent tutoring systems. Many prior studies employ machine learning models to classify DAs in tutoring dialogues and invest much effort to optimize the classification…
▽ More
Dialogue acts (DAs) can represent conversational actions of tutors or students that take place during tutoring dialogues. Automating the identification of DAs in tutoring dialogues is significant to the design of dialogue-based intelligent tutoring systems. Many prior studies employ machine learning models to classify DAs in tutoring dialogues and invest much effort to optimize the classification accuracy by using limited amounts of training data (i.e., low-resource data scenario). However, beyond the classification accuracy, the robustness of the classifier is also important, which can reflect the capability of the classifier on learning the patterns from different class distributions. We note that many prior studies on classifying educational DAs employ cross entropy (CE) loss to optimize DA classifiers on low-resource data with imbalanced DA distribution. The DA classifiers in these studies tend to prioritize accuracy on the majority class at the expense of the minority class which might not be robust to the data with imbalanced ratios of different DA classes. To optimize the robustness of classifiers on imbalanced class distributions, we propose to optimize the performance of the DA classifier by maximizing the area under the ROC curve (AUC) score (i.e., AUC maximization). Through extensive experiments, our study provides evidence that (i) by maximizing AUC in the training process, the DA classifier achieves significant performance improvement compared to the CE approach under low-resource data, and (ii) AUC maximization approaches can improve the robustness of the DA classifier under different class imbalance ratios.
△ Less
Submitted 15 April, 2023;
originally announced April 2023.
-
Real roots of random polynomials: asymptotics of the variance
Authors:
Yen Q. Do,
Nhan D. V. Nguyen
Abstract:
We compute the precise leading asymptotics of the variance of the number of real roots for a large class of random polynomials, where the random coefficients have polynomial growth. Our results apply to many classical ensembles, including the Kac polynomials, hyperbolic polynomials, their derivatives, and any linear combinations of these polynomials. Prior to this paper, such asymptotics was only…
▽ More
We compute the precise leading asymptotics of the variance of the number of real roots for a large class of random polynomials, where the random coefficients have polynomial growth. Our results apply to many classical ensembles, including the Kac polynomials, hyperbolic polynomials, their derivatives, and any linear combinations of these polynomials. Prior to this paper, such asymptotics was only established for the Kac polynomials in the 1970s, with the seminal contribution of Maslova. The main ingredients of the proof are new asymptotic estimates for the two-point correlation function of the real roots, revealing geometric structures in the distribution of the real roots of these random polynomials. As a corollary, we obtain asymptotic normality for the real roots for these random polynomials, extending and strengthening a related result of O. Nguyen and V. Vu.
△ Less
Submitted 7 May, 2024; v1 submitted 9 March, 2023;
originally announced March 2023.
-
SBcoyote: An Extensible Python-Based Reaction Editor and Viewer
Authors:
** Xu,
Gary Geng,
Nhan D. Nguyen,
Carmen Perena-Cortes,
Claire Samuels,
Herbert M. Sauro
Abstract:
SBcoyote is an open-source cross-platform biochemical reaction viewer and editor released under the liberal MIT license. It is written in Python and uses wxPython to implement the GUI and the drawing canvas. It supports the visualization and editing of compartments, species, and reactions. It includes many options to stylize each of these components. For instance, species can be in different color…
▽ More
SBcoyote is an open-source cross-platform biochemical reaction viewer and editor released under the liberal MIT license. It is written in Python and uses wxPython to implement the GUI and the drawing canvas. It supports the visualization and editing of compartments, species, and reactions. It includes many options to stylize each of these components. For instance, species can be in different colors and shapes. Other core features include the ability to create alias nodes, alignment of groups of nodes, network zooming, as well as an interactive bird-eye view of the network to allow easy navigation on large networks. A unique feature of the tool is the extensive Python plugin API, where third-party developers can include new functionality. To assist third-party plugin developers, we provide a variety of sample plugins, including, random network generation, a simple auto layout tool, export to Antimony, export SBML, import SBML, etc. Of particular interest are the export and import SBML plugins since these support the SBML level 3 layout and render standard, which is exchangeable with other software packages. Plugins are stored in a GitHub repository, and an included plugin manager can retrieve and install new plugins from the repository on demand. Plugins have version metadata associated with them to make it install plugin updates. Availability: https://github.com/sys-bio/SBcoyote.
△ Less
Submitted 14 August, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
Deep Learning Provides Rapid Screen for Breast Cancer Metastasis with Sentinel Lymph Nodes
Authors:
Kareem Allam,
Xiaohong Iris Wang,
Songlin Zhang,
Jianmin Ding,
Kevin Chiu,
Karan Saluja,
Amer Wahed,
Hongxia Sun,
Andy N. D. Nguyen
Abstract:
Deep learning has been shown to be useful to detect breast cancer metastases by analyzing whole slide images of sentinel lymph nodes. However, it requires extensive scanning and analysis of all the lymph nodes slides for each case. Our deep learning study focuses on breast cancer screening with only a small set of image patches from any sentinel lymph node, positive or negative for metastasis, to…
▽ More
Deep learning has been shown to be useful to detect breast cancer metastases by analyzing whole slide images of sentinel lymph nodes. However, it requires extensive scanning and analysis of all the lymph nodes slides for each case. Our deep learning study focuses on breast cancer screening with only a small set of image patches from any sentinel lymph node, positive or negative for metastasis, to detect changes in tumor environment and not in the tumor itself. We design a convolutional neural network in the Python language to build a diagnostic model for this purpose. The excellent results from this preliminary study provided a proof of concept for incorporating automated metastatic screen into the digital pathology workflow to augment the pathologists' productivity. Our approach is unique since it provides a very rapid screen rather than an exhaustive search for tumor in all fields of all sentinel lymph nodes.
△ Less
Submitted 14 January, 2023;
originally announced January 2023.
-
AUC Maximization for Low-Resource Named Entity Recognition
Authors:
Ngoc Dang Nguyen,
Wei Tan,
Wray Buntine,
Richard Beare,
Changyou Chen,
Lan Du
Abstract:
Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize the underlying NER model. Both of these traditional objective functions for the NER problem generally produce adequate performance when the data distribution is balanced and there are sufficient annotated training examples. But since NER is in…
▽ More
Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize the underlying NER model. Both of these traditional objective functions for the NER problem generally produce adequate performance when the data distribution is balanced and there are sufficient annotated training examples. But since NER is inherently an imbalanced tagging problem, the model performance under the low-resource settings could suffer using these standard objective functions. Based on recent advances in area under the ROC curve (AUC) maximization, we propose to optimize the NER model by maximizing the AUC score. We give evidence that by simply combining two binary-classifiers that maximize the AUC score, significant performance improvement over traditional loss functions is achieved under low-resource NER settings. We also conduct extensive experiments to demonstrate the advantages of our method under the low-resource and highly-imbalanced data distribution settings. To the best of our knowledge, this is the first work that brings AUC maximization to the NER setting. Furthermore, we show that our method is agnostic to different types of NER embeddings, models and domains. The code to replicate this work will be provided upon request.
△ Less
Submitted 13 April, 2023; v1 submitted 9 December, 2022;
originally announced December 2022.
-
Hardness-guided domain adaptation to recognise biomedical named entities under low-resource scenarios
Authors:
Ngoc Dang Nguyen,
Lan Du,
Wray Buntine,
Changyou Chen,
Richard Beare
Abstract:
Domain adaptation is an effective solution to data scarcity in low-resource scenarios. However, when applied to token-level tasks such as bioNER, domain adaptation methods often suffer from the challenging linguistic characteristics that clinical narratives possess, which leads to unsatisfactory performance. In this paper, we present a simple yet effective hardness-guided domain adaptation (HGDA)…
▽ More
Domain adaptation is an effective solution to data scarcity in low-resource scenarios. However, when applied to token-level tasks such as bioNER, domain adaptation methods often suffer from the challenging linguistic characteristics that clinical narratives possess, which leads to unsatisfactory performance. In this paper, we present a simple yet effective hardness-guided domain adaptation (HGDA) framework for bioNER tasks that can effectively leverage the domain hardness information to improve the adaptability of the learnt model in low-resource scenarios. Experimental results on biomedical datasets show that our model can achieve significant performance improvement over the recently published state-of-the-art (SOTA) MetaNER model
△ Less
Submitted 10 November, 2022;
originally announced November 2022.
-
vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM
Authors:
Thanh Tin Nguyen,
Long H. Nguyen,
Nhat Truong Pham,
Liu Tai Nguyen,
Van Huong Do,
Hai Nguyen,
Ngoc Duy Nguyen
Abstract:
This study presents our approach on the automatic Vietnamese image captioning for healthcare domain in text processing tasks of Vietnamese Language and Speech Processing (VLSP) Challenge 2021, as shown in Figure 1. In recent years, image captioning often employs a convolutional neural network-based architecture as an encoder and a long short-term memory (LSTM) as a decoder to generate sentences. T…
▽ More
This study presents our approach on the automatic Vietnamese image captioning for healthcare domain in text processing tasks of Vietnamese Language and Speech Processing (VLSP) Challenge 2021, as shown in Figure 1. In recent years, image captioning often employs a convolutional neural network-based architecture as an encoder and a long short-term memory (LSTM) as a decoder to generate sentences. These models perform remarkably well in different datasets. Our proposed model also has an encoder and a decoder, but we instead use a Swin Transformer in the encoder, and a LSTM combined with an attention module in the decoder. The study presents our training experiments and techniques used during the competition. Our model achieves a BLEU4 score of 0.293 on the vietCap4H dataset, and the score is ranked the 3$^{rd}$ place on the private leaderboard. Our code can be found at \url{https://git.io/JDdJm}.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
XLMRQA: Open-Domain Question Answering on Vietnamese Wikipedia-based Textual Knowledge Source
Authors:
Kiet Van Nguyen,
Phong Nguyen-Thuan Do,
Nhat Duy Nguyen,
Tin Van Huynh,
Anh Gia-Tuan Nguyen,
Ngan Luu-Thuy Nguyen
Abstract:
Question answering (QA) is a natural language understanding task within the fields of information retrieval and information extraction that has attracted much attention from the computational linguistics and artificial intelligence research community in recent years because of the strong development of machine reading comprehension-based models. A reader-based QA system is a high-level search engi…
▽ More
Question answering (QA) is a natural language understanding task within the fields of information retrieval and information extraction that has attracted much attention from the computational linguistics and artificial intelligence research community in recent years because of the strong development of machine reading comprehension-based models. A reader-based QA system is a high-level search engine that can find correct answers to queries or questions in open-domain or domain-specific texts using machine reading comprehension (MRC) techniques. The majority of advancements in data resources and machine-learning approaches in the MRC and QA systems especially are developed significantly in two resource-rich languages such as English and Chinese. A low-resource language like Vietnamese has witnessed a scarcity of research on QA systems. This paper presents XLMRQA, the first Vietnamese QA system using a supervised transformer-based reader on the Wikipedia-based textual knowledge source (using the UIT-ViQuAD corpus), outperforming the two robust QA systems using deep neural network models: DrQA and BERTserini with 24.46% and 6.28%, respectively. From the results obtained on the three systems, we analyze the influence of question types on the performance of the QA systems.
△ Less
Submitted 13 August, 2022; v1 submitted 14 April, 2022;
originally announced April 2022.
-
The number of real zeros of elliptic polynomials
Authors:
Nhan D. V. Nguyen
Abstract:
Let $N_n(a, b)$ denote the number of real zeros of Gaussian elliptic polynomials of degree $n$ on the interval $(a, b)$, where $a$ and $b$ may vary with $n$. We obtain a precise formula for the variance of $N_n(a, b)$ and utilize this expression to derive an asymptotic expansion for large values of $n$. Furthermore, we provide sharp estimates for the cumulants and central moments of $N_n(a, b)$. T…
▽ More
Let $N_n(a, b)$ denote the number of real zeros of Gaussian elliptic polynomials of degree $n$ on the interval $(a, b)$, where $a$ and $b$ may vary with $n$. We obtain a precise formula for the variance of $N_n(a, b)$ and utilize this expression to derive an asymptotic expansion for large values of $n$. Furthermore, we provide sharp estimates for the cumulants and central moments of $N_n(a, b)$. These estimates are instrumental in establishing sufficient conditions on the interval $(a, b)$ for $N_n(a, b)$ to satisfy both a central limit theorem and a strong law of large numbers. In the second part of the paper, we extend our analysis to nondegenerate Gaussian analytic functions, including well-known examples such as the Gaussian Weyl series and Weyl polynomials.
△ Less
Submitted 7 May, 2024; v1 submitted 21 November, 2021;
originally announced November 2021.
-
Relevance of Ge incorporation to control the physical behaviour of point defects in kesterite
Authors:
Thomas Ratz,
Ngoc Duy Nguyen,
Guy Brammertz,
Bart Vermang,
Jean-Yves Raty
Abstract:
To reduce the prominent VOC-deficit that limits kesterite-based solar cells efficiencies, Ge has been proposed over the recent years with encouraging results, as the reduction of the non-radiative recombination rate is considered as a way to improve the well-known Sn-kesterite world record efficiency. To gain further insight into this mechanism, we investigate the physical behaviour of intrinsic p…
▽ More
To reduce the prominent VOC-deficit that limits kesterite-based solar cells efficiencies, Ge has been proposed over the recent years with encouraging results, as the reduction of the non-radiative recombination rate is considered as a way to improve the well-known Sn-kesterite world record efficiency. To gain further insight into this mechanism, we investigate the physical behaviour of intrinsic point defects both upon Ge do** and alloying of Cu2ZnSnS4 kesterite. Using a first-principles approach, we confirm the p-type conductivity of both Cu2ZnSnS4 and Cu2ZnGeS4, attributed to the low formation energies of the VCu and CuZn acceptor defects within the whole stable phase diagram range. Via do** of the Sn-kesterite matrix, we report the lowest formation energy for the substitutional defect GeSn. We also confirm the detrimental role of the substitutional defects XZn (X=Sn,Ge) acting as recombination centres within the Sn-based, the Ge-doped and the Ge-based kesterite. Finally, we highlight the reduction of the lattice distortion upon Ge incorporation resulting in a reduction of the carrier capture cross section and consequently a decrease of the non-radiative recombination rate within the bulk material.
△ Less
Submitted 2 November, 2021;
originally announced November 2021.
-
Fruit-CoV: An Efficient Vision-based Framework for Speedy Detection and Diagnosis of SARS-CoV-2 Infections Through Recorded Cough Sounds
Authors:
Long H. Nguyen,
Nhat Truong Pham,
Van Huong Do,
Liu Tai Nguyen,
Thanh Tin Nguyen,
Van Dung Do,
Hai Nguyen,
Ngoc Duy Nguyen
Abstract:
SARS-CoV-2 is colloquially known as COVID-19 that had an initial outbreak in December 2019. The deadly virus has spread across the world, taking part in the global pandemic disease since March 2020. In addition, a recent variant of SARS-CoV-2 named Delta is intractably contagious and responsible for more than four million deaths over the world. Therefore, it is vital to possess a self-testing serv…
▽ More
SARS-CoV-2 is colloquially known as COVID-19 that had an initial outbreak in December 2019. The deadly virus has spread across the world, taking part in the global pandemic disease since March 2020. In addition, a recent variant of SARS-CoV-2 named Delta is intractably contagious and responsible for more than four million deaths over the world. Therefore, it is vital to possess a self-testing service of SARS-CoV-2 at home. In this study, we introduce Fruit-CoV, a two-stage vision framework, which is capable of detecting SARS-CoV-2 infections through recorded cough sounds. Specifically, we convert sounds into Log-Mel Spectrograms and use the EfficientNet-V2 network to extract its visual features in the first stage. In the second stage, we use 14 convolutional layers extracted from the large-scale Pretrained Audio Neural Networks for audio pattern recognition (PANNs) and the Wavegram-Log-Mel-CNN to aggregate feature representations of the Log-Mel Spectrograms. Finally, we use the combined features to train a binary classifier. In this study, we use a dataset provided by the AICovidVN 115M Challenge, which includes a total of 7371 recorded cough sounds collected throughout Vietnam, India, and Switzerland. Experimental results show that our proposed model achieves an AUC score of 92.8% and ranks the 1st place on the leaderboard of the AICovidVN Challenge. More importantly, our proposed framework can be integrated into a call center or a VoIP system to speed up detecting SARS-CoV-2 infections through online/recorded cough sounds.
△ Less
Submitted 6 September, 2021;
originally announced September 2021.
-
Sentence Extraction-Based Machine Reading Comprehension for Vietnamese
Authors:
Phong Nguyen-Thuan Do,
Nhat Duy Nguyen,
Tin Van Huynh,
Kiet Van Nguyen,
Anh Gia-Tuan Nguyen,
Ngan Luu-Thuy Nguyen
Abstract:
The development of natural language processing (NLP) in general and machine reading comprehension in particular has attracted the great attention of the research community. In recent years, there are a few datasets for machine reading comprehension tasks in Vietnamese with large sizes, such as UIT-ViQuAD and UIT-ViNewsQA. However, the datasets are not diverse in answers to serve the research. In t…
▽ More
The development of natural language processing (NLP) in general and machine reading comprehension in particular has attracted the great attention of the research community. In recent years, there are a few datasets for machine reading comprehension tasks in Vietnamese with large sizes, such as UIT-ViQuAD and UIT-ViNewsQA. However, the datasets are not diverse in answers to serve the research. In this paper, we introduce UIT-ViWikiQA, the first dataset for evaluating sentence extraction-based machine reading comprehension in the Vietnamese language. The UIT-ViWikiQA dataset is converted from the UIT-ViQuAD dataset, consisting of comprises 23.074 question-answers based on 5.109 passages of 174 Wikipedia Vietnamese articles. We propose a conversion algorithm to create the dataset for sentence extraction-based machine reading comprehension and three types of approaches for sentence extraction-based machine reading comprehension in Vietnamese. Our experiments show that the best machine model is XLM-R_Large, which achieves an exact match (EM) of 85.97% and an F1-score of 88.77% on our dataset. Besides, we analyze experimental results in terms of the question type in Vietnamese and the effect of context on the performance of the MRC models, thereby showing the challenges from the UIT-ViWikiQA dataset that we propose to the language processing community.
△ Less
Submitted 11 June, 2021; v1 submitted 19 May, 2021;
originally announced May 2021.
-
Opto-electronic properties and solar cell efficiency modelling of Cu$_2$ZnXS$_4$ (X=Sn,Ge,Si) kesterites
Authors:
Thomas Ratz,
Jean-Yves Raty,
Guy Brammertz,
Bart Vermang,
Ngoc Duy Nguyen
Abstract:
In this work, first principle calculations of Cu$_2$ZnSnS$_4$ (CZTS), Cu$_2$ZnGeS$_4$ (CZGS) and Cu$_2$ZnSiS$_4$ (CZSS) are performed to highlight the impact of the cationic substitution on the structural, electronic and optical properties of kesterite compounds. Direct bandgaps are reported with values of 1.32, 1.89 and 3.06 eV respectively for CZTS, CZGS and CZSS. In addition, absorption coeffic…
▽ More
In this work, first principle calculations of Cu$_2$ZnSnS$_4$ (CZTS), Cu$_2$ZnGeS$_4$ (CZGS) and Cu$_2$ZnSiS$_4$ (CZSS) are performed to highlight the impact of the cationic substitution on the structural, electronic and optical properties of kesterite compounds. Direct bandgaps are reported with values of 1.32, 1.89 and 3.06 eV respectively for CZTS, CZGS and CZSS. In addition, absorption coefficient values of the order of $10^4$ cm$^{-1}$ are obtained, indicating the applicability of these materials as absorber layer for solar cell applications. In the second part of this study, ab initio results are used as input data to model the electrical power conversion efficiency of kesterite-based solar cell. In that perspective, we used an improved version of the Shockley-Queisser theoretical model including non-radiative recombination via an external parameter defined as the internal quantum efficiency. Based on predicted optimal absorber layer thicknesses, the variation of the solar cell maximal efficiency is studied as a function of the non-radiative recombination rate. Maximal efficiencies of 25.88, 19.94 and 3.11% are reported respectively for CZTS, CZGS and CZSS for vanishing non-radiative recombination rate. Using an internal quantum efficiency providing $V_{OC}$ values comparable to experimental measurements, solar cell efficiencies of 15.88, 14.98 and 2.66% are reported respectively for CZTS, CZGS and CZSS (for an optimal thickness of 1.15 $μ$m). With this methodology, we confirm the suitability of CZTS in single junction solar cells, with a possible efficiency improvement of 10% enabled through the reduction of the non-radiative recombination rate. In addition, CZGS appears to be an interesting candidate as top cell absorber layer for tandem approaches whereas CZSS might be interesting for transparent PV windows.
△ Less
Submitted 25 November, 2020;
originally announced November 2020.
-
A roadmap for the design of four-terminal spin valves and the extraction of spin diffusion length
Authors:
Emile Fourneau,
Alejandro V. Silhanek,
Ngoc D. Nguyen
Abstract:
Graphene is a promising substrate for future spintronics devices owing to its remarkable electronic mobility and low spin-orbit coupling. Hanle precession in spin valve devices is commonly used to evaluate the spin diffusion and spin lifetime properties. In this work, we demonstrate that this method is no longer accurate when the distance between inner and outer electrodes is smaller than six time…
▽ More
Graphene is a promising substrate for future spintronics devices owing to its remarkable electronic mobility and low spin-orbit coupling. Hanle precession in spin valve devices is commonly used to evaluate the spin diffusion and spin lifetime properties. In this work, we demonstrate that this method is no longer accurate when the distance between inner and outer electrodes is smaller than six times the spin diffusion length, leading to errors as large as 50% for the calculations of the spin figures of merit of graphene. We suggest simple but efficient approaches to circumvent this limitation by addressing a revised version of the Hanle fit function. Complementarily, we provide clear guidelines for the design of four-terminal spin valves able to yield flawless estimations of the spin lifetime and the spin diffusion coefficient.
△ Less
Submitted 1 October, 2020;
originally announced October 2020.
-
On the origin of the giant spin detection efficiency in tunnel barrier based electrical spin detector
Authors:
Emile Fourneau,
Alejandro V. Silhanek,
Ngoc Duy Nguyen
Abstract:
Efficient conversion of a spin signal into an electric voltage in mainstream semiconductors is one of the grand challenges of spintronics. This process is commonly achieved via a ferromagnetic tunnel barrier where non-linear electric transport occurs. In this work, we demonstrate that non-linearity may lead to a spin-to-charge conversion efficiency larger than 10 times the spin polarization of the…
▽ More
Efficient conversion of a spin signal into an electric voltage in mainstream semiconductors is one of the grand challenges of spintronics. This process is commonly achieved via a ferromagnetic tunnel barrier where non-linear electric transport occurs. In this work, we demonstrate that non-linearity may lead to a spin-to-charge conversion efficiency larger than 10 times the spin polarization of the tunnel barrier when the latter is under bias of a few mV. We identify the underlying mechanisms responsible for this remarkably efficient spin detection as the tunnel barrier deformation and the conduction band shift resulting from a change of applied voltage. In addition, we derive an approximate analytical expression for the detector spin sensitivity $P_{\textrm{det}}(V)$. Calculations performed for different barrier shapes show that this enhancement is present in oxide barriers as well as in Schottky tunnel barriers even if the dominant mechanisms differs with the barrier type. Moreover, although the spin signal is reduced at high temperatures, it remains superior to the value predicted by the linear model. Our findings shed light into the interpretation and understanding of electrical spin detection experiments and open new paths to optimize the performance of spin transport devices.
△ Less
Submitted 23 March, 2020;
originally announced March 2020.
-
Review, Analysis and Design of a Comprehensive Deep Reinforcement Learning Framework
Authors:
Ngoc Duy Nguyen,
Thanh Thi Nguyen,
Hai Nguyen,
Doug Creighton,
Saeid Nahavandi
Abstract:
The integration of deep learning to reinforcement learning (RL) has enabled RL to perform efficiently in high-dimensional environments. Deep RL methods have been applied to solve many complex real-world problems in recent years. However, development of a deep RL-based system is challenging because of various issues such as the selection of a suitable deep RL algorithm, its network configuration, t…
▽ More
The integration of deep learning to reinforcement learning (RL) has enabled RL to perform efficiently in high-dimensional environments. Deep RL methods have been applied to solve many complex real-world problems in recent years. However, development of a deep RL-based system is challenging because of various issues such as the selection of a suitable deep RL algorithm, its network configuration, training time, training methods, and so on. This paper proposes a comprehensive software framework that not only plays a vital role in designing a connect-the-dots deep RL architecture but also provides a guideline to develop a realistic RL application in a short time span. We have designed and developed a deep RL-based software framework that strictly ensures flexibility, robustness, and scalability. By inheriting the proposed architecture, software managers can foresee any challenges when designing a deep RL-based system. As a result, they can expedite the design process and actively control every stage of software development, which is especially critical in agile development environments. To enforce generalization, the proposed architecture does not depend on a specific RL algorithm, a network configuration, the number of agents, or the type of agents. Using our framework, software developers can develop and integrate new RL algorithms or new types of agents, and can flexibly change network configuration or the number of agents.
△ Less
Submitted 23 February, 2021; v1 submitted 26 February, 2020;
originally announced February 2020.
-
A Visual Communication Map for Multi-Agent Deep Reinforcement Learning
Authors:
Ngoc Duy Nguyen,
Thanh Thi Nguyen,
Doug Creighton,
Saeid Nahavandi
Abstract:
Deep reinforcement learning has been applied successfully to solve various real-world problems and the number of its applications in the multi-agent settings has been increasing. Multi-agent learning distinctly poses significant challenges in the effort to allocate a concealed communication medium. Agents receive thorough knowledge from the medium to determine subsequent actions in a distributed n…
▽ More
Deep reinforcement learning has been applied successfully to solve various real-world problems and the number of its applications in the multi-agent settings has been increasing. Multi-agent learning distinctly poses significant challenges in the effort to allocate a concealed communication medium. Agents receive thorough knowledge from the medium to determine subsequent actions in a distributed nature. Apparently, the goal is to leverage the cooperation of multiple agents to achieve a designated objective efficiently. Recent studies typically combine a specialized neural network with reinforcement learning to enable communication between agents. This approach, however, limits the number of agents or necessitates the homogeneity of the system. In this paper, we have proposed a more scalable approach that not only deals with a great number of agents but also enables collaboration between dissimilar functional agents and compatibly combined with any deep reinforcement learning methods. Specifically, we create a global communication map to represent the status of each agent in the system visually. The visual map and the environmental state are fed to a shared-parameter network to train multiple agents concurrently. Finally, we select the Asynchronous Advantage Actor-Critic (A3C) algorithm to demonstrate our proposed scheme, namely Visual communication map for Multi-agent A3C (VMA3C). Simulation results show that the use of visual communication map improves the performance of A3C regarding learning speed, reward achievement, and robustness in multi-agent problems.
△ Less
Submitted 23 February, 2021; v1 submitted 26 February, 2020;
originally announced February 2020.
-
Trap** of electrons around nanoscale metallic wires embedded in a semiconductor medium
Authors:
Chi Cuong Huynh,
R. Evrard,
Ngoc Duy Nguyen
Abstract:
We predict that conduction electrons in a semiconductor film containing a centered square array of metal nanowires normal to its plane are bound in quantum states around the central wires, if a positive bias voltage is applied between the wires at the square vertices and these latter. We obtain and discuss the eigenenergies and eigenfunctions of two models with different dimensions. The results sh…
▽ More
We predict that conduction electrons in a semiconductor film containing a centered square array of metal nanowires normal to its plane are bound in quantum states around the central wires, if a positive bias voltage is applied between the wires at the square vertices and these latter. We obtain and discuss the eigenenergies and eigenfunctions of two models with different dimensions. The results show that the eigenstates can be grouped into different shells. The energy differences between the shells is typically a few tens of meV, which corresponds to frequencies of emitted or absorbed photons in a range of 3 THz to 20 THz approximately. These energy differences strongly depend on the bias voltage. We calculate the linear response of individual electrons on the ground level of our models to large-wavelength electromagnetic waves whose electric field is in the plane of the semiconductor film. The computed oscillator strengths are dominated by the transitions to the states in each shell whose wave function has a single radial node line normal to the wave electric field. We include the effect of the image charge induced on the central metal wires and show that it modifies the oscillator strengths so that their sum deviates from the value given by the Thomas-Reiche-Kuhn rule. We report the linear response, or polarizability, versus photon energy, of the studied models and their absorption spectra. These latter show well-defined peaks as expected from the study of the oscillator strengths. We show that the position of these absorption peaks is strongly dependent on the bias voltage so that the frequency of photon absorption or emission in the systems described here is easily tunable. This makes them good candidates for the development of novel infrared devices.
△ Less
Submitted 9 November, 2019;
originally announced November 2019.
-
Manipulating Soft Tissues by Deep Reinforcement Learning for Autonomous Robotic Surgery
Authors:
Ngoc Duy Nguyen,
Thanh Nguyen,
Saeid Nahavandi,
Asim Bhatti,
Glenn Guest
Abstract:
In robotic surgery, pattern cutting through a deformable material is a challenging research field. The cutting procedure requires a robot to concurrently manipulate a scissor and a gripper to cut through a predefined contour trajectory on the deformable sheet. The gripper ensures the cutting accuracy by nailing a point on the sheet and continuously tensioning the pinch point to different direction…
▽ More
In robotic surgery, pattern cutting through a deformable material is a challenging research field. The cutting procedure requires a robot to concurrently manipulate a scissor and a gripper to cut through a predefined contour trajectory on the deformable sheet. The gripper ensures the cutting accuracy by nailing a point on the sheet and continuously tensioning the pinch point to different directions while the scissor is in action. The goal is to find a pinch point and a corresponding tensioning policy to minimize damage to the material and increase cutting accuracy measured by the symmetric difference between the predefined contour and the cut contour. Previous study considers finding one fixed pinch point during the course of cutting, which is inaccurate and unsafe when the contour trajectory is complex. In this paper, we examine the soft tissue cutting task by using multiple pinch points, which imitates human operations while cutting. This approach, however, does not require the use of a multi-gripper robot. We use a deep reinforcement learning algorithm to find an optimal tensioning policy of a pinch point. Simulation results show that the multi-point approach outperforms the state-of-the-art method in soft pattern cutting task with respect to both accuracy and reliability.
△ Less
Submitted 13 February, 2019;
originally announced February 2019.
-
A New Tensioning Method using Deep Reinforcement Learning for Surgical Pattern Cutting
Authors:
Thanh Thi Nguyen,
Ngoc Duy Nguyen,
Fernando Bello,
Saeid Nahavandi
Abstract:
Surgeons normally need surgical scissors and tissue grippers to cut through a deformable surgical tissue. The cutting accuracy depends on the skills to manipulate these two tools. Such skills are part of basic surgical skills training as in the Fundamentals of Laparoscopic Surgery. The gripper is used to pinch a point on the surgical sheet and pull the tissue to a certain direction to maintain the…
▽ More
Surgeons normally need surgical scissors and tissue grippers to cut through a deformable surgical tissue. The cutting accuracy depends on the skills to manipulate these two tools. Such skills are part of basic surgical skills training as in the Fundamentals of Laparoscopic Surgery. The gripper is used to pinch a point on the surgical sheet and pull the tissue to a certain direction to maintain the tension while the scissors cut through a trajectory. As the surgical materials are deformable, it requires a comprehensive tensioning policy to yield appropriate tensioning direction at each step of the cutting process. Automating a tensioning policy for a given cutting trajectory will support not only the human surgeons but also the surgical robots to improve the cutting accuracy and reliability. This paper presents a multiple pinch point approach to modelling an autonomous tensioning planner based on a deep reinforcement learning algorithm. Experiments on a simulator show that the proposed method is superior to existing methods in terms of both performance and robustness.
△ Less
Submitted 10 January, 2019;
originally announced January 2019.
-
Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications
Authors:
Thanh Thi Nguyen,
Ngoc Duy Nguyen,
Saeid Nahavandi
Abstract:
Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms however have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in the…
▽ More
Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms however have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This paper addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multi-agent deep RL (MADRL) is presented, including non-stationarity, partial observability, continuous state and action spaces, multi-agent training schemes, multi-agent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed, with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to future development of more robust and highly useful multi-agent learning methods for solving real-world problems.
△ Less
Submitted 6 February, 2019; v1 submitted 31 December, 2018;
originally announced December 2018.
-
Quasi one-shot full-field surface profilometry using digital diffractive-confocal imaging correlation microscope
Authors:
Duc Trung Nguyen,
Liang-Chia Chen,
Nguyen Dinh Nguyen
Abstract:
One-shot full-field surface profilometry using digital diffractive-confocal imaging correlation microscope based on digital micromirror device is developed for one-shot microscopic 3D surface measurement. Optical configuration applies confocal microscope setup and was building on DMD to generate specific pinhole array arrangement for minimizing cross talk effect. An innovative method was invented…
▽ More
One-shot full-field surface profilometry using digital diffractive-confocal imaging correlation microscope based on digital micromirror device is developed for one-shot microscopic 3D surface measurement. Optical configuration applies confocal microscope setup and was building on DMD to generate specific pinhole array arrangement for minimizing cross talk effect. An innovative method was invented to create normalized cross correlation depth response curve from diffraction patterns of the pinhole. Using this approach, the sub-micrometer scale depth can be detected with high accuracy and precision.
△ Less
Submitted 11 April, 2019; v1 submitted 16 December, 2018;
originally announced December 2018.
-
Innovative full-field chromatic confocal microscopy using multispectral sensors
Authors:
Liang-Chia Chen,
Pei-Ju Tan,
Chih-Jer Lin,
Duc Trung Nguyen,
Yu-Shuan Chou,
Nguyen Dinh Nguyen,
Nguyen Thanh Trung
Abstract:
A full-field chromatic confocal microscopy using a multispectral sensor was developed for quasi-one-shot microscopic 3D surface measurement. An innovative optical configuration employs a digital micromirror device (DMD) and a multispectral sensor is used to realize chromatic confocal microscopy with full-field area scanning. In the optical design, an area-scan type chromatic dispersive objective i…
▽ More
A full-field chromatic confocal microscopy using a multispectral sensor was developed for quasi-one-shot microscopic 3D surface measurement. An innovative optical configuration employs a digital micromirror device (DMD) and a multispectral sensor is used to realize chromatic confocal microscopy with full-field area scanning. In the optical design, an area-scan type chromatic dispersive objective is specially designed to achieve measuring specification. Based on an 8x chromatic dispersive objective, the FOV for one shot measurement can be reached to 1.8mm*1.3mm which is immersive to microscopic profilometry. The spectral image captured by the multispectral sensor at each pinhole position has a unique spectrum pattern corresponding to its conjugate measured depth. A normalized cross-correlation (NCC) algorithm is developed to establish a spectrum-depth response curve with its corresponding spectrum pattern sets for accurate reconstruction of the tested 3D surface profile. With real test on standard targets, the measurement repeatability for a single surface depth is less than 0.6 micrometer.
△ Less
Submitted 11 April, 2019; v1 submitted 16 December, 2018;
originally announced December 2018.
-
Automated Diagnosis of Lymphoma with Digital Pathology Images Using Deep Learning
Authors:
Hanadi El Achi,
Tatiana Belousova,
Lei Chen,
Amer Wahed,
Iris Wang,
Zhihong Hu,
Zeyad Kanaan,
Adan Rios,
Andy N. D. Nguyen
Abstract:
Recent studies have shown promising results in using Deep Learning to detect malignancy in whole slide imaging. However, they were limited to just predicting positive or negative finding for a specific neoplasm. We attempted to use Deep Learning with a convolutional neural network algorithm to build a lymphoma diagnostic model for four diagnostic categories: benign lymph node, diffuse large B cell…
▽ More
Recent studies have shown promising results in using Deep Learning to detect malignancy in whole slide imaging. However, they were limited to just predicting positive or negative finding for a specific neoplasm. We attempted to use Deep Learning with a convolutional neural network algorithm to build a lymphoma diagnostic model for four diagnostic categories: benign lymph node, diffuse large B cell lymphoma, Burkitt lymphoma, and small lymphocytic lymphoma. Our software was written in Python language. We obtained digital whole slide images of Hematoxylin and Eosin stained slides of 128 cases including 32 cases for each diagnostic category. Four sets of 5 representative images, 40x40 pixels in dimension, were taken for each case. A total of 2,560 images were obtained from which 1,856 were used for training, 464 for validation and 240 for testing. For each test set of 5 images, the predicted diagnosis was combined from prediction of 5 images. The test results showed excellent diagnostic accuracy at 95% for image-by-image prediction and at 10% for set-by-set prediction. This preliminary study provided a proof of concept for incorporating automated lymphoma diagnostic screen into future pathology workflow to augment the pathologists' productivity.
△ Less
Submitted 30 October, 2018;
originally announced November 2018.
-
Application of Deep Learning on Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations
Authors:
Mei Lin,
Vanya Jaitly,
Iris Wang,
Zhihong Hu,
Lei Chen,
Md. Amer Wahed,
Zeyad Kanaan,
Adan Rios,
Andy N. D. Nguyen
Abstract:
We explore how Deep Learning (DL) can be utilized to predict prognosis of acute myeloid leukemia (AML). Out of TCGA (The Cancer Genome Atlas) database, 94 AML cases are used in this study. Input data include age, 10 common cytogenetic and 23 most common mutation results; output is the prognosis (diagnosis to death, DTD). In our DL network, autoencoders are stacked to form a hierarchical DL model f…
▽ More
We explore how Deep Learning (DL) can be utilized to predict prognosis of acute myeloid leukemia (AML). Out of TCGA (The Cancer Genome Atlas) database, 94 AML cases are used in this study. Input data include age, 10 common cytogenetic and 23 most common mutation results; output is the prognosis (diagnosis to death, DTD). In our DL network, autoencoders are stacked to form a hierarchical DL model from which raw data are compressed and organized and high-level features are extracted. The network is written in R language and is designed to predict prognosis of AML for a given case (DTD of more than or less than 730 days). The DL network achieves an excellent accuracy of 83% in predicting prognosis. As a proof-of-concept study, our preliminary results demonstrate a practical application of DL in future practice of prognostic prediction using next-gen sequencing (NGS) data.
△ Less
Submitted 30 October, 2018;
originally announced October 2018.
-
Multi-Agent Deep Reinforcement Learning with Human Strategies
Authors:
Thanh Nguyen,
Ngoc Duy Nguyen,
Saeid Nahavandi
Abstract:
Deep learning has enabled traditional reinforcement learning methods to deal with high-dimensional problems. However, one of the disadvantages of deep reinforcement learning methods is the limited exploration capacity of learning agents. In this paper, we introduce an approach that integrates human strategies to increase the exploration capacity of multiple deep reinforcement learning agents. We a…
▽ More
Deep learning has enabled traditional reinforcement learning methods to deal with high-dimensional problems. However, one of the disadvantages of deep reinforcement learning methods is the limited exploration capacity of learning agents. In this paper, we introduce an approach that integrates human strategies to increase the exploration capacity of multiple deep reinforcement learning agents. We also report the development of our own multi-agent environment called Multiple Tank Defence to simulate the proposed approach. The results show the significant performance improvement of multiple agents that have learned cooperatively with human strategies. This implies that there is a critical need for human intellect teamed with machines to solve complex problems. In addition, the success of this simulation indicates that our multi-agent environment can be used as a testbed platform to develop and validate other multi-agent control algorithms.
△ Less
Submitted 30 May, 2019; v1 submitted 12 June, 2018;
originally announced June 2018.
-
A Human Mixed Strategy Approach to Deep Reinforcement Learning
Authors:
Ngoc Duy Nguyen,
Saeid Nahavandi,
Thanh Nguyen
Abstract:
In 2015, Google's DeepMind announced an advancement in creating an autonomous agent based on deep reinforcement learning (DRL) that could beat a professional player in a series of 49 Atari games. However, the current manifestation of DRL is still immature, and has significant drawbacks. One of DRL's imperfections is its lack of "exploration" during the training process, especially when working wit…
▽ More
In 2015, Google's DeepMind announced an advancement in creating an autonomous agent based on deep reinforcement learning (DRL) that could beat a professional player in a series of 49 Atari games. However, the current manifestation of DRL is still immature, and has significant drawbacks. One of DRL's imperfections is its lack of "exploration" during the training process, especially when working with high-dimensional problems. In this paper, we propose a mixed strategy approach that mimics behaviors of human when interacting with environment, and create a "thinking" agent that allows for more efficient exploration in the DRL training process. The simulation results based on the Breakout game show that our scheme achieves a higher probability of obtaining a maximum score than does the baseline DRL algorithm, i.e., the asynchronous advantage actor-critic method. The proposed scheme therefore can be applied effectively to solving a complicated task in a real-world application.
△ Less
Submitted 5 April, 2018;
originally announced April 2018.
-
A Review of Situation Awareness Assessment Approaches in Aviation Environments
Authors:
Thanh Nguyen,
Chee Peng Lim,
Ngoc Duy Nguyen,
Lee Gordon-Brown,
Saeid Nahavandi
Abstract:
Situation awareness (SA) is an important constituent in human information processing and essential in pilots' decision-making processes. Acquiring and maintaining appropriate levels of SA is critical in aviation environments as it affects all decisions and actions taking place in flights and air traffic control. This paper provides an overview of recent measurement models and approaches to establi…
▽ More
Situation awareness (SA) is an important constituent in human information processing and essential in pilots' decision-making processes. Acquiring and maintaining appropriate levels of SA is critical in aviation environments as it affects all decisions and actions taking place in flights and air traffic control. This paper provides an overview of recent measurement models and approaches to establishing and enhancing SA in aviation environments. Many aspects of SA are examined including the classification of SA techniques into six categories, and different theoretical SA models from individual, to shared or team, and to distributed or system levels. Quantitative and qualitative perspectives pertaining to SA methods and issues of SA for unmanned vehicles are also addressed. Furthermore, future research directions regarding SA assessment approaches are raised to deal with shortcomings of the existing state-of-the-art methods in the literature.
△ Less
Submitted 7 June, 2019; v1 submitted 6 March, 2018;
originally announced March 2018.
-
A Multi-Objective Deep Reinforcement Learning Framework
Authors:
Thanh Thi Nguyen,
Ngoc Duy Nguyen,
Peter Vamplew,
Saeid Nahavandi,
Richard Dazeley,
Chee Peng Lim
Abstract:
This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We develop a high-performance MODRL framework that supports both single-policy and multi-policy strategies, as well as both linear and non-linear approaches to action selection. The experimental results on two benchmark problems (two-objective deep sea treasure environment a…
▽ More
This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We develop a high-performance MODRL framework that supports both single-policy and multi-policy strategies, as well as both linear and non-linear approaches to action selection. The experimental results on two benchmark problems (two-objective deep sea treasure environment and three-objective Mountain Car problem) indicate that the proposed framework is able to find the Pareto-optimal solutions effectively. The proposed framework is generic and highly modularized, which allows the integration of different deep reinforcement learning algorithms in different complex problem domains. This therefore overcomes many disadvantages involved with standard multi-objective reinforcement learning methods in the current literature. The proposed framework acts as a testbed platform that accelerates the development of MODRL for solving increasingly complicated multi-objective problems.
△ Less
Submitted 19 June, 2020; v1 submitted 7 March, 2018;
originally announced March 2018.
-
Proteomics Analysis of FLT3-ITD Mutation in Acute Myeloid Leukemia Using Deep Learning Neural Network
Authors:
Christine A. Liang,
Lei Chen,
Amer Wahed,
Andy N. D. Nguyen
Abstract:
Deep Learning can significantly benefit cancer proteomics and genomics. In this study, we attempt to determine a set of critical proteins that are associated with the FLT3-ITD mutation in newly-diagnosed acute myeloid leukemia patients. A Deep Learning network consisting of autoencoders forming a hierarchical model from which high-level features are extracted without labeled training data. Dimensi…
▽ More
Deep Learning can significantly benefit cancer proteomics and genomics. In this study, we attempt to determine a set of critical proteins that are associated with the FLT3-ITD mutation in newly-diagnosed acute myeloid leukemia patients. A Deep Learning network consisting of autoencoders forming a hierarchical model from which high-level features are extracted without labeled training data. Dimensional reduction reduced the number of critical proteins from 231 to 20. Deep Learning found an excellent correlation between FLT3-ITD mutation with the levels of these 20 critical proteins (accuracy 97%, sensitivity 90%, specificity 100%). Our Deep Learning network could hone in on 20 proteins with the strongest association with FLT3-ITD. The results of this study allow a novel approach to determine critical protein pathways in the FLT3-ITD mutation, and provide proof-of-concept for an accurate approach to model big data in cancer proteomics and genomics.
△ Less
Submitted 29 December, 2017;
originally announced January 2018.
-
Evidence for Z=6 `magic number' in neutron-rich carbon isotopes
Authors:
D. T. Tran,
H. J. Ong,
G. Hagen,
T. D. Morris,
N. Aoi,
T. Suzuki,
Y. Kanada-En'yo,
L. S. Geng,
S. Terashima,
I. Tanihata,
T. T. Nguyen,
Y. Ayyad,
P. Y. Chan,
M. Fukuda,
H. Geissel,
M. N. Harakeh,
T. Hashimoto,
T. H. Hoang,
E. Ideguchi,
A. Inoue,
G. R. Jansen,
R. Kanungo,
T. Kawabata,
L. H. Khiem,
W. P. Lin
, et al. (15 additional authors not shown)
Abstract:
The nuclear shell structure, which originates in the nearly independent motion of nucleons in an average potential, provides an important guide for our understanding of nuclear structure and the underlying nuclear forces. Its most remarkable fingerprint is the existence of the so-called `magic numbers' of protons and neutrons associated with extra stability. Although the introduction of a phenomen…
▽ More
The nuclear shell structure, which originates in the nearly independent motion of nucleons in an average potential, provides an important guide for our understanding of nuclear structure and the underlying nuclear forces. Its most remarkable fingerprint is the existence of the so-called `magic numbers' of protons and neutrons associated with extra stability. Although the introduction of a phenomenological spin-orbit (SO) coupling force in 1949 helped explain the nuclear magic numbers, its origins are still open questions. Here, we present experimental evidence for the smallest SO-originated magic number (subshell closure) at the proton number 6 in 13-20C obtained from systematic analysis of point-proton distribution radii, electromagnetic transition rates and atomic masses of light nuclei. Performing ab initio calculations on 14,15C, we show that the observed proton distribution radii and subshell closure can be explained by the state-of-the-art nuclear theory with chiral nucleon-nucleon and three-nucleon forces, which are rooted in the quantum chromodynamics.
△ Less
Submitted 11 September, 2017;
originally announced September 2017.
-
Charge-changing-cross-section measurements of $^{12-16}$C at around $45A$ MeV and development of a Glauber model for incident energies $10A-2100A$ MeV
Authors:
D. T. Tran,
H. J. Ong,
T. T. Nguyen,
I. Tanihata,
N. Aoi,
Y. Ayyad,
P. Y. Chan,
M. Fukuda,
T. Hashimoto,
T. H. Hoang,
E. Ideguchi,
A. Inoue,
T. Kawabata,
L. H. Khiem,
W. P. Lin,
K. Matsuta,
M. Mihara,
S. Momota,
D. Nagae,
N. D. Nguyen,
D. Nishimura,
A. Ozawa,
P. P. Ren,
H. Sakaguchi,
J. Tanaka
, et al. (4 additional authors not shown)
Abstract:
We have measured for the first time the charge-changing cross sections ($σ_{\text{CC}}$) of $^{12-16}$C on a $^{12}$C target at energies below $100A$ MeV. To analyze these low-energy data, we have developed a finite-range Glauber model with a global parameter set within the optical-limit approximation which is applicable to reaction cross section ($σ_{\text{R}}$) and $σ_{\text{CC}}$ measurements a…
▽ More
We have measured for the first time the charge-changing cross sections ($σ_{\text{CC}}$) of $^{12-16}$C on a $^{12}$C target at energies below $100A$ MeV. To analyze these low-energy data, we have developed a finite-range Glauber model with a global parameter set within the optical-limit approximation which is applicable to reaction cross section ($σ_{\text{R}}$) and $σ_{\text{CC}}$ measurements at incident energies from 10$A$ to $2100A$ MeV. Adopting the proton-density distribution of $^{12}$C known from the electron-scattering data, as well as the bare total nucleon-nucleon cross sections, and the real-to-imaginary-part ratios of the forward proton-proton elastic scattering amplitude available in the literatures, we determine the energy-dependent slope parameter $β_{\rm pn}$ of the proton-neutron elastic differential cross section so as to reproduce the existing $σ_{\text{R}}$ and interaction-cross-section data for $^{12}$C+$^{12}$C over a wide range of incident energies. The Glauber model thus formulated is applied to calculate the $σ_{\text{\tiny R}}$'s of $^{12}$C on a $^9$Be and $^{27}$Al targets at various incident energies. Our calculations show excellent agreement with the experimental data. Applying our model to the $σ_{\text{\tiny R}}$ and $σ_{\text{\tiny CC}}$ for the "neutron-skin" $^{16}$C nucleus, we reconfirm the importance of measurements at incident energies below $100A$ MeV. The proton root-mean-square radii of $^{12-16}$C are extracted using the measured $σ_{\text{CC}}$'s and the existing $σ_{\text{R}}$ data. The results for $^{12-14}$C are consistent with the values from the electron scatterings, demonstrating the feasibility, usefulness of the $σ_{\text{CC}}$ measurement and the present Glauber model.
△ Less
Submitted 28 June, 2016;
originally announced June 2016.
-
Note on the numerical solution of the scalar Helmholtz equation in a nanotorus with uniform Dirichlet boundary conditions
Authors:
N. D. Nguyen,
R. Evrard,
Michael A. Stroscio
Abstract:
This note describes the solution of the Helmholtz equation inside a nanotorus with uniform Dirichlet boundary conditions. The eigenfunction symmetry is discussed and the lower-order eigenvalues and eigenfunctions are shown. The similarity with the case of a long cylinder and with that of the vibrations of a circular elastic membrane is discussed. This similarity is used to propose a classification…
▽ More
This note describes the solution of the Helmholtz equation inside a nanotorus with uniform Dirichlet boundary conditions. The eigenfunction symmetry is discussed and the lower-order eigenvalues and eigenfunctions are shown. The similarity with the case of a long cylinder and with that of the vibrations of a circular elastic membrane is discussed. This similarity is used to propose a classification scheme of the eigenfunctions based on three indices.
△ Less
Submitted 20 March, 2016;
originally announced March 2016.
-
Classical analogy for the deflection of flux avalanches by a metallic layer
Authors:
J. Brisbois,
B. Vanderheyden,
F. Colauto,
M. Motta,
W. A. Ortiz,
J. Fritzsche,
N. D. Nguyen,
B. Hackens,
O. -A. Adami,
A. V. Silhanek
Abstract:
Sudden avalanches of magnetic flux bursting into a superconducting sample undergo deflections of their trajectories when encountering a conductive layer deposited on top of the superconductor. Remarkably, in some cases flux is totally excluded from the area covered by the conductive layer. We present a simple classical model that accounts for this behaviour and considers a magnetic monopole approa…
▽ More
Sudden avalanches of magnetic flux bursting into a superconducting sample undergo deflections of their trajectories when encountering a conductive layer deposited on top of the superconductor. Remarkably, in some cases flux is totally excluded from the area covered by the conductive layer. We present a simple classical model that accounts for this behaviour and considers a magnetic monopole approaching a semi-infinite conductive plane. This model suggests that magnetic braking is an important mechanism responsible for avalanche deflection.
△ Less
Submitted 11 August, 2014;
originally announced August 2014.
-
A Facial Expression Classification System Integrating Canny, Principal Component Analysis and Artificial Neural Network
Authors:
Le Hoang Thai,
Nguyen Do Thai Nguyen,
Tran Son Hai
Abstract:
Facial Expression Classification is an interesting research problem in recent years. There are a lot of methods to solve this problem. In this research, we propose a novel approach using Canny, Principal Component Analysis (PCA) and Artificial Neural Network. Firstly, in preprocessing phase, we use Canny for local region detection of facial images. Then each of local region's features will be pres…
▽ More
Facial Expression Classification is an interesting research problem in recent years. There are a lot of methods to solve this problem. In this research, we propose a novel approach using Canny, Principal Component Analysis (PCA) and Artificial Neural Network. Firstly, in preprocessing phase, we use Canny for local region detection of facial images. Then each of local region's features will be presented based on Principal Component Analysis (PCA). Finally, using Artificial Neural Network (ANN)applies for Facial Expression Classification. We apply our proposal method (Canny_PCA_ANN) for recognition of six basic facial expressions on JAFFE database consisting 213 images posed by 10 Japanese female models. The experimental result shows the feasibility of our proposal method.
△ Less
Submitted 17 November, 2011;
originally announced November 2011.