A Data-driven Latent Semantic Analysis for Automatic Text Summarization using LDA Topic Modelling
Authors:
Daniel F. O. Onah,
Elaine L. L. Pang,
Mahmoud El-Haj
Abstract:
With the advent and popularity of big data mining and huge text analysis in modern times, automated text summarization became prominent for extracting and retrieving important information from documents. This research investigates aspects of automatic text summarization from the perspectives of single and multiple documents. Summarization is a task of condensing huge text articles into short, summ…
▽ More
With the advent and popularity of big data mining and huge text analysis in modern times, automated text summarization became prominent for extracting and retrieving important information from documents. This research investigates aspects of automatic text summarization from the perspectives of single and multiple documents. Summarization is a task of condensing huge text articles into short, summarized versions. The text is reduced in size for summarization purpose but preserving key vital information and retaining the meaning of the original document. This study presents the Latent Dirichlet Allocation (LDA) approach used to perform topic modelling from summarised medical science journal articles with topics related to genes and diseases. In this study, PyLDAvis web-based interactive visualization tool was used to visualise the selected topics. The visualisation provides an overarching view of the main topics while allowing and attributing deep meaning to the prevalence individual topic. This study presents a novel approach to summarization of single and multiple documents. The results suggest the terms ranked purely by considering their probability of the topic prevalence within the processed document using extractive summarization technique. PyLDAvis visualization describes the flexibility of exploring the terms of the topics' association to the fitted LDA model. The topic modelling result shows prevalence within topics 1 and 2. This association reveals that there is similarity between the terms in topic 1 and 2 in this study. The efficacy of the LDA and the extractive summarization methods were measured using Latent Semantic Analysis (LSA) and Recall-Oriented Understudy for Gisting Evaluation (ROUGE) metrics to evaluate the reliability and validity of the model.
△ Less
Submitted 29 May, 2023; v1 submitted 23 July, 2022;
originally announced July 2022.
Study of helium irradiation induced hardening in MNHS steel
Authors:
J. Wang,
Z. G. Wang,
E. Q. Xie,
N. Gao,
M. H. Cui,
T. L. Shen,
K. F. Wei,
C. F. Yao,
J. R. Sun,
Y. B. Zhu,
L. L. Pang,
D. Wang,
H. P. Zhu,
Y. Y. Du
Abstract:
A recently developed reduced activation ferritic/martensitic steel MNHS was irradiated with 200keV He ions to a fluence of 1E21ions/m^2 at 450 celsius degree and 1E20ions/m^2 at 300 celsius degree and 450 celsius degree, respectively. The irradiation hardening of the steel was investigated by nanoindentation measurements combined with transmission electron microscopy (TEM) analysis. Dispersed barr…
▽ More
A recently developed reduced activation ferritic/martensitic steel MNHS was irradiated with 200keV He ions to a fluence of 1E21ions/m^2 at 450 celsius degree and 1E20ions/m^2 at 300 celsius degree and 450 celsius degree, respectively. The irradiation hardening of the steel was investigated by nanoindentation measurements combined with transmission electron microscopy (TEM) analysis. Dispersed barrier-hardening (DBH) model was applied to predict the hardness increments based on TEM analysis. The predicted hardness increments are consistent with the values obtained by nanoindentation tests. It is found that dislocation loops and He bubbles are hard barriers against dislocation motion and they are the main contributions to He irradiation-induced hardening of MNHS steel. The obstacle strength of He bubbles is stronger than the obstacle strength of dislocation loops.
△ Less
Submitted 24 March, 2015;
originally announced March 2015.