-
Text mining in education
Authors:
R. Ferreira-Mello,
M. Andre,
A. Pinheiro,
E. Costa,
C. Romero
Abstract:
The explosive growth of online education environments is generating a massive volume of data, especially in text format from forums, chats, social networks, assessments, essays, among others. It produces exciting challenges on how to mine text data in order to find useful knowledge for educational stakeholders. Despite the increasing number of educational applications of text mining published recently, we have not found any paper surveying them. In this line, this work presents a systematic overview of the current status of the Educational Text Mining field. Our final goal is to answer three main research questions: Which are the text mining techniques most used in educational environments? Which are the most used educational resources? And which are the main applications or educational goals? Finally, we outline the conclusions and the most interesting future trends.
Submitted 11 February, 2024;
originally announced March 2024.
-
Machine learning in the prediction of cardiac epicardial and mediastinal fat volumes
Authors:
É. O. Rodrigues,
V. H. A. Pinheiro,
P. Liatsis,
A. Conci
Abstract:
We propose a methodology to predict the cardiac epicardial and mediastinal fat volumes in computed tomography images using regression algorithms. The obtained results indicate that it is feasible to predict these fats with a high degree of correlation, thus alleviating the requirement for manual or automatic segmentation of both fat volumes. Instead, segmenting just one of them suffices, while the volume of the other may be predicted fairly precisely. The correlation coefficient obtained by the Rotation Forest algorithm using MLP Regressor for predicting the mediastinal fat based on the epicardial fat was 0.9876, with a relative absolute error of 14.4% and a root relative squared error of 15.7%. The best correlation coefficient obtained in the prediction of the epicardial fat based on the mediastinal was 0.9683 with a relative absolute error of 19.6% and a relative squared error of 24.9%. Moreover, we analysed the feasibility of using linear regressors, which provide an intuitive interpretation of the underlying approximations. In this case, the obtained correlation coefficient was 0.9534 for predicting the mediastinal fat based on the epicardial, with a relative absolute error of 31.6% and a root relative squared error of 30.1%. On the prediction of the epicardial fat based on the mediastinal fat, the correlation coefficient was 0.8531, with a relative absolute error of 50.43% and a root relative squared error of 52.06%. In summary, it is possible to speed up general medical analyses and some segmentation and quantification methods that are currently employed in the state-of-the-art by using this prediction approach, which consequently reduces costs and therefore enables preventive treatments that may lead to a reduction of health problems.
Submitted 30 August, 2022;
originally announced August 2022.
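The three figures quoted in the abstract above (correlation coefficient, relative absolute error, root relative squared error) can be illustrated with a minimal sketch. It assumes the common definitions in which both relative errors are normalised by the error of a predictor that always outputs the mean of the actual values; the abstract does not state the formulas, and the function name `regression_metrics` and the sample data are hypothetical.

```python
import math


def regression_metrics(actual, predicted):
    """Return (correlation, relative absolute error, root relative squared error).

    Assumed definitions (not taken from the paper):
      RAE  = sum|p - a| / sum|a - mean(a)|
      RRSE = sqrt( sum (p - a)^2 / sum (a - mean(a))^2 )
    Both are fractions; multiply by 100 for percentages.
    """
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n

    # Pearson correlation between actual and predicted values.
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    corr = cov / math.sqrt(var_a * var_p)

    # Errors relative to the "always predict the mean" baseline.
    abs_err = sum(abs(p - a) for a, p in zip(actual, predicted))
    sq_err = sum((p - a) ** 2 for a, p in zip(actual, predicted))
    rae = abs_err / sum(abs(a - mean_a) for a in actual)
    rrse = math.sqrt(sq_err / var_a)
    return corr, rae, rrse


# Hypothetical volumes: predictions are perfectly correlated but offset by 1.
corr, rae, rrse = regression_metrics([1, 2, 3, 4], [2, 3, 4, 5])
```

The offset example shows why a high correlation coefficient can coexist with large relative errors: correlation ignores a constant bias, while the two relative errors do not.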
-
Severity classification in cases of Collagen VI-related myopathy with Convolutional Neural Networks and handcrafted texture features
Authors:
Rafael Rodrigues,
Susana Quijano-Roy,
Robert-Yves Carlier,
Antonio M. G. Pinheiro
Abstract:
Magnetic Resonance Imaging (MRI) is a non-invasive tool for the clinical assessment of low-prevalence neuromuscular disorders. Automated diagnosis methods might reduce the need for biopsies and provide valuable information on disease follow-up. In this paper, three methods are proposed to classify target muscles in Collagen VI-related myopathy cases, based on their degree of involvement, notably a Convolutional Neural Network, a Fully Connected Network to classify texture features, and a hybrid method combining the two feature sets. The proposed methods were evaluated on axial T1-weighted Turbo Spin-Echo MRI from 26 subjects, including Ullrich Congenital Muscular Dystrophy and Bethlem Myopathy patients at different evolution stages. The hybrid model achieved the best cross-validation results, with a global accuracy of 93.8%, and F-scores of 0.99, 0.82, and 0.95, for healthy, mild and moderate/severe cases, respectively.
Submitted 4 July, 2022; v1 submitted 28 February, 2022;
originally announced February 2022.
-
QUALINET White Paper on Definitions of Immersive Media Experience (IMEx)
Authors:
Andrew Perkis,
Christian Timmerer,
Sabina Baraković,
Jasmina Baraković Husić,
Søren Bech,
Sebastian Bosse,
Jean Botev,
Kjell Brunnström,
Luis Cruz,
Katrien De Moor,
Andrea de Polo Saibanti,
Wouter Durnez,
Sebastian Egger-Lampl,
Ulrich Engelke,
Tiago H. Falk,
Jesús Gutiérrez,
Asim Hameed,
Andrew Hines,
Tanja Kojic,
Dragan Kukolj,
Eirini Liotou,
Dragorad Milovanovic,
Sebastian Möller,
Niall Murray,
Babak Naderi, et al. (19 additional authors not shown)
Abstract:
With the coming of age of virtual/augmented reality and interactive media, numerous definitions, frameworks, and models of immersion have emerged across different fields ranging from computer graphics to literary works. Immersion is oftentimes used interchangeably with presence as both concepts are closely related. However, there are noticeable interdisciplinary differences regarding definitions, scope, and constituents that are required to be addressed so that a coherent understanding of the concepts can be achieved. Such consensus is vital for paving the directionality of the future of immersive media experiences (IMEx) and all related matters. The aim of this white paper is to provide a survey of definitions of immersion and presence which leads to a definition of immersive media experience (IMEx). The Quality of Experience (QoE) for immersive media is described by establishing a relationship between the concepts of QoE and IMEx followed by application areas of immersive media experience. Influencing factors on immersive media experience are elaborated as well as the assessment of immersive media experience. Finally, standardization activities related to IMEx are highlighted and the white paper is concluded with an outlook related to future developments.
Submitted 24 November, 2020; v1 submitted 10 June, 2020;
originally announced July 2020.
-
vSDNEmul: A Software-Defined Network Emulator Based on Container Virtualization
Authors:
Fernando N. N. Farias,
Antônio de O. Junior,
Leonardo B. da Costa,
Billy A. Pinheiro,
Antônio J. G. Abelém
Abstract:
The main issue related to Software-Defined Network emulators is how to replicate real behavior in experiments. Mininet and other SDN emulators have an architecture that limits both the scope of experiments and the fidelity of networking tests. Consequently, the serialization, contention, and load of background processes may produce delays that compromise the operation of events such as transmitting a packet or completing a computation, possibly invalidating the performance evaluation of a network emulation. To address these problems, this paper presents vSDNEmul, a network emulator based on Docker container virtualization. Unlike Mininet, vSDNEmul isolates each node in a container and interconnects the nodes through virtual or tunnel links. By using containers, vSDNEmul allows autonomous and flexible creation of independent network elements, resulting in more realistic emulations. This paper reports performance evaluations comparing vSDNEmul and Mininet. The results obtained with the vSDNEmul emulator are more realistic and present higher accuracy.
Submitted 28 August, 2019;
originally announced August 2019.
-
Segmentation of Skeletal Muscle in Thigh Dixon MRI Based on Texture Analysis
Authors:
Rafael Rodrigues,
Antonio M. G. Pinheiro
Abstract:
Segmentation of skeletal muscles in Magnetic Resonance Images (MRI) is essential for the study of muscle physiology and diagnosis of muscular pathologies. However, manual segmentation of large MRI volumes is a time-consuming task. The state-of-the-art on algorithms for muscle segmentation in MRI is still not very extensive and is somewhat database-dependent. In this paper, an automated segmentation method based on AdaBoost classification of local texture features is presented. The texture descriptor consists of the Histogram of Oriented Gradients (HOG), Wavelet-based features, and a set of statistical measures computed from both the original and the Laplacian of Gaussian filtering of the grayscale MRI. The classifier performance suggests that texture analysis may be a helpful tool for designing a generalized and automated MRI muscle segmentation framework. Furthermore, an atlas-based approach to individual muscle segmentation is also described in this paper. The atlas is obtained by overlaying the muscle segmentation ground truth, provided by a radiologist, after image alignment using an appropriate affine transformation. Then, it is used to define the muscle labels upon the AdaBoost binary segmentation. The developed atlas method provides reasonable results when an accurate muscle tissue segmentation was obtained.
Submitted 9 April, 2019;
originally announced April 2019.
-
Fast and Efficient Lenslet Image Compression
Authors:
Hadi Amirpour,
Antonio Pinheiro,
Manuela Pereira,
Mohammad Ghanbari
Abstract:
Light field imaging is characterized by capturing brightness, color, and directional information of light rays in a scene. This leads to image representations with a huge amount of data that require efficient coding schemes. In this paper, lenslet images are rendered into sub-aperture images. These images are organized as a pseudo-sequence input for the HEVC video codec. To better exploit redundancy among the neighboring sub-aperture images and consequently decrease the distances between a sub-aperture image and its references used for prediction, sub-aperture images are divided into four smaller groups that are scanned in a serpentine order. The most central sub-aperture image, which has the highest similarity to all the other images, is used as the initial reference image for each of the four regions. Furthermore, a structure is defined that selects spatially adjacent sub-aperture images as prediction references with the highest similarity to the current image. In this way, encoding efficiency increases, and furthermore it leads to a higher similarity among the co-located Coding Tree Units (CTUs). The similarities among the co-located CTUs are exploited to predict Coding Unit depths. Moreover, independent encoding of each group division enables parallel processing, which, along with the proposed coding unit depth prediction, decreases the encoding execution time by almost 80% on average. Simulation results show that Rate-Distortion performance of the proposed method has higher compression gain than the other state-of-the-art lenslet compression methods with lower computational complexity.
Submitted 27 January, 2019;
originally announced January 2019.
-
Canonical form of linear subspaces and coding invariants: the poset metric point of view
Authors:
Jerry Anderson Pinheiro,
Marcelo Firer
Abstract:
In this work we introduce the concept of a sub-space decomposition, subject to a partition of the coordinates. Considering metrics determined by partial orders in the set of coordinates, the so-called poset metrics, we show the existence of maximal decompositions according to the metric. These decompositions turn out to be an important tool to obtain the canonical form for codes over any poset metric and to obtain bounds for important invariants such as the packing radius of a linear subspace. Furthermore, using maximal decompositions, we are able to reduce and optimize the full lookup table algorithm for the syndrome decoding process.
Submitted 29 June, 2017;
originally announced June 2017.
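For readers unfamiliar with the poset metrics mentioned in the abstract above, here is a minimal sketch of the standard definition: the weight of a vector is the size of the order ideal generated by its support, and the distance between two vectors is the weight of their difference. The representation of the partial order as a dictionary of "smaller" coordinates, the function names, and the examples are assumptions made for illustration and are not taken from the paper.

```python
def poset_weight(x, smaller):
    """Size of the order ideal generated by the support of x.

    smaller[i] is a set of coordinates known to lie below i in the
    partial order (cover relations are enough; the closure is computed here).
    """
    ideal = set()
    stack = [i for i, v in enumerate(x) if v != 0]  # support of x
    while stack:
        i = stack.pop()
        if i not in ideal:
            ideal.add(i)
            stack.extend(smaller.get(i, ()))  # pull in everything below i
    return len(ideal)


def poset_distance(x, y, smaller, q=2):
    """Poset distance over Z_q: the weight of the coordinate-wise difference."""
    diff = [(a - b) % q for a, b in zip(x, y)]
    return poset_weight(diff, smaller)


# Hypothetical chain order 0 < 1 < 2 on three coordinates.
chain = {1: {0}, 2: {1}}
w_top = poset_weight([0, 0, 1], chain)     # ideal {0, 1, 2}
w_bottom = poset_weight([1, 0, 0], chain)  # ideal {0}

# With no order relations (an antichain) the weight is the Hamming weight.
w_hamming = poset_weight([1, 0, 1], {})
```

Under the chain order a single nonzero entry at the top coordinate already has full weight, which is why the choice of partial order changes invariants such as the packing radius.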
-
Combinatorial metrics: MacWilliams-type identities, isometries and extension property
Authors:
Jerry Anderson Pinheiro,
Roberto Assis Machado,
Marcelo Firer
Abstract:
In this work we characterize the combinatorial metrics admitting a MacWilliams-type identity and describe the group of linear isometries of such metrics. Considering coverings that are not connected, we classify the metrics satisfying the MacWilliams extension property.
Submitted 23 March, 2017;
originally announced March 2017.
-
Characterization of metrics induced by hierarchical posets
Authors:
Roberto Assis Machado,
Jerry Anderson Pinheiro,
Marcelo Firer
Abstract:
In this paper we consider metrics determined by hierarchical posets and give explicit formulae for the main parameters of a linear code: the minimum distance and the packing, covering and Chebyshev radii of a code. We also present ten characterizations of hierarchical poset metrics, including new characterizations and simple new proofs of the known ones.
Submitted 23 March, 2017; v1 submitted 4 August, 2015;
originally announced August 2015.
-
Coding and Decoding Schemes for MSE and Image Transmission
Authors:
Marcelo Firer,
Luciano Panek,
Jerry Anderson Pinheiro
Abstract:
In this work we explore possibilities for coding and decoding tailor-made for mean squared error evaluation of error in contexts such as image transmission. To do so, we introduce a loss function that expresses the overall performance of a coding and decoding scheme for discrete channels and that exchanges the usual goal of minimizing the error probability for that of minimizing the expected loss. In this environment we explore the possibilities of using ordered decoders to create a message-wise unequal error protection (UEP), where the most valuable information is protected by placing in its proximity information words that differ by a small-valued error. We give explicit examples, using grayscale images, including small-scale performance analysis and visual simulations for the BSMC.
Submitted 4 November, 2014;
originally announced November 2014.
-
Bounds for complexity of syndrome decoding for poset metrics
Authors:
Marcelo Firer,
Jerry Anderson Pinheiro
Abstract:
In this work we show how to decompose a linear code relative to any given poset metric. We prove that the complexity of syndrome decoding is determined by a maximal (primary) such decomposition and then show that a refinement of a partial order leads to a refinement of the primary decomposition. Using this and considering already known results about hierarchical posets, we can establish upper and lower bounds for the complexity of syndrome decoding relative to a poset metric.
Submitted 17 February, 2015; v1 submitted 3 November, 2014;
originally announced November 2014.
-
Classification of poset-block spaces admitting MacWilliams-type identity
Authors:
Jerry Anderson Pinheiro,
Marcelo Firer
Abstract:
In this work we prove that a poset-block space admits a MacWilliams-type identity if and only if the poset is hierarchical and, at any level of the poset, all the blocks have the same dimension. When the poset-block space admits the MacWilliams-type identity, we make explicit the relation between the weight enumerators of a code and its dual.
Submitted 28 February, 2012;
originally announced February 2012.