-
Assessment of text coherence based on the cohesion estimation
Authors:
S. D. Pogorilyy,
A. A. Kramov
Abstract:
This paper proposes a graph-based method that estimates text coherence from an estimate of its cohesion. The graph-based approach lets a user follow how the evaluation is carried out. Because the method can be applied to different languages, its effectiveness is examined on a set of English, Chinese, and Arabic texts.
Submitted 11 November, 2020;
originally announced November 2020.
-
Method of the coherence evaluation of Ukrainian text
Authors:
S. D. Pogorilyy,
A. A. Kramov
Abstract:
Given the growing role of SEO technologies, automated analysis of article quality is needed. Such an approach helps both to return the most intelligible pages for a user's query and to raise a website's position in the query results. Automated coherence assessment is one part of a comprehensive text analysis. This article analyzes the main methods for measuring text coherence for the Ukrainian language. It explains why the semantic similarity graph method is preferable to the other methods and proposes improving that method by pre-training a neural network that produces vector representations of sentences. The original method and its modifications are examined experimentally. Training and evaluation are carried out on a corpus of Ukrainian texts retrieved from the abstracts and full texts of Ukrainian scientific articles. Testing is performed on two typical text coherence assessment tasks: the document discrimination task and the insertion task. Based on this analysis, the most effective combination of method modification and parameter value for measuring text coherence is identified.
Submitted 31 October, 2020;
originally announced November 2020.
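The semantic similarity graph idea and the document discrimination task named in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the bag-of-words `sentence_vector` is a toy stand-in for the pre-trained neural sentence representations, and the scoring rule (average cosine similarity between every sentence and every later sentence, divided by their distance) is one simple variant assumed here for illustration. All function names are hypothetical.

```python
import math
import random
from collections import Counter


def sentence_vector(sentence):
    """Toy stand-in for a sentence embedding: a bag-of-words count vector."""
    return Counter(sentence.lower().split())


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[w] * v[w] for w in u.keys() & v.keys())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def coherence(sentences):
    """Average edge weight of the similarity graph over the sentences.

    Assumed variant: each sentence is linked to every later sentence and
    the edge weight is the cosine similarity divided by the distance
    between the two sentences in the text.
    """
    vectors = [sentence_vector(s) for s in sentences]
    n = len(vectors)
    if n < 2:
        return 0.0
    weights = [
        cosine(vectors[i], vectors[j]) / (j - i)
        for i in range(n)
        for j in range(i + 1, n)
    ]
    return sum(weights) / len(weights)


def discrimination_accuracy(sentences, trials=20, seed=0):
    """Share of shuffled copies that score lower than the original order."""
    rng = random.Random(seed)
    original = coherence(sentences)
    wins = 0
    total = 0
    for _ in range(trials):
        shuffled = sentences[:]
        rng.shuffle(shuffled)
        if shuffled == sentences:
            continue  # an unchanged order is not a valid permutation here
        total += 1
        wins += original > coherence(shuffled)
    return wins / total if total else 0.0
```

In the document discrimination task, a method is judged by how often it ranks the original sentence order above a permuted one; `discrimination_accuracy` mirrors that comparison for a single document. Replacing `sentence_vector` with a real sentence encoder would be the natural next step.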
-
Development of the complex system for the remote monitoring of the human heart rate
Authors:
Artem Kramov,
Olexandr Bauzha
Abstract:
This work presents an implementation of a remote pulse monitoring system that allows a patient's pulse to be observed in real time through a browser. The result is a complex system that comprises hardware components for pulse measurement and a software component for data processing and visualization in a web interface. The web interface visualizes the heart rate in real time and notifies the appropriate person when the pulse deviates from its limits. The monitoring system can detect two conditions: tachycardia and bradycardia. A pulse sensor detects the moment of each heartbeat and functions like a plethysmograph. An ATmega8 microcontroller reads data from the sensor, analyzes it, and passes it to the next hardware block. An Arduino Uno and an ENC28J60 Ethernet module transmit the heartbeat event information to the web interface. The ENC28J60 module is connected to the Arduino Uno over the SPI interface. A pair of HC-05 Bluetooth modules connects the ATmega8 and the Arduino Uno; each HC-05 module is connected to its microcontroller over the UART interface. The WebSocket protocol is used to display the data in real time in the web interface. The web interface is adapted to mobile devices, so it can be viewed on smartphones and tablets. The complex can be used both by a qualified specialist for remote monitoring of a patient's state and as a personal prophylactic
Submitted 23 October, 2020;
originally announced October 2020.
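The pulse-limit check described in the abstract above can be sketched as follows. This is an illustrative outline, written in Python for readability rather than as microcontroller firmware, and it is not taken from the authors' system. The function names are hypothetical, and the default limits of 60 and 100 beats per minute are an assumption based on commonly cited resting thresholds for adults; the abstract does not state which limits the system uses.

```python
def beats_per_minute(beat_times):
    """Estimate heart rate from heartbeat timestamps given in seconds."""
    if len(beat_times) < 2:
        raise ValueError("need at least two heartbeat timestamps")
    intervals = [b - a for a, b in zip(beat_times, beat_times[1:])]
    if any(interval <= 0 for interval in intervals):
        raise ValueError("timestamps must be strictly increasing")
    mean_interval = sum(intervals) / len(intervals)
    return 60.0 / mean_interval


def classify_pulse(bpm, low=60.0, high=100.0):
    """Label a heart rate against assumed lower and upper pulse limits."""
    if bpm < low:
        return "bradycardia"
    if bpm > high:
        return "tachycardia"
    return "normal"
```

For example, heartbeats arriving every half second give `beats_per_minute([0.0, 0.5, 1.0, 1.5])` equal to 120.0, which `classify_pulse` labels as tachycardia under the assumed limits.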
-
Method of noun phrase detection in Ukrainian texts
Authors:
S. D. Pogorilyy,
A. A. Kramov
Abstract:
Introduction. Natural language processing deals with AI-complete tasks that cannot be solved with traditional algorithmic approaches. Such tasks are commonly addressed using machine learning and the tools of computational linguistics. One text preprocessing task is the detection of noun phrases. Its accuracy affects the effectiveness of many other natural language processing tasks. Despite active research in natural language processing, the study of noun phrase detection in Ukrainian texts is still at an early stage. Results. Different methods of noun phrase detection have been analyzed. The expediency of representing sentences as tree structures has been justified. The key disadvantage of many noun phrase detection methods is that their effectiveness depends heavily on the features of a particular language. Given its unified format for sentence processing and the availability of a trained model for building sentence trees for Ukrainian texts, the Universal Dependencies model has been chosen. A combined method of noun phrase detection in Ukrainian texts that uses Universal Dependencies tools and a named-entity recognition model has been proposed. The effectiveness of the proposed method has been verified experimentally on a corpus of Ukrainian news. Different accuracy metrics have been calculated. Conclusions. The results indicate that the proposed method can be used to find noun phrases in Ukrainian texts. Its accuracy can be increased by using named-entity recognition models suited to the subject area.
Submitted 22 October, 2020;
originally announced October 2020.
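How noun phrases might be read off a dependency tree, as the abstract above describes, can be shown with a small sketch. It assumes the sentence has already been parsed into CoNLL-U-style tuples `(id, form, upos, head, deprel)`; the set of modifier relations, the rule for skipping nested phrases, and the function names are illustrative assumptions, not the authors' method. The named-entity recognition step mentioned in the abstract is not covered, and the hand-written English parse below is only an example.

```python
from collections import defaultdict

# Assumed illustrative choices, not taken from the paper.
NOMINAL_HEADS = {"NOUN", "PROPN"}
MODIFIER_RELATIONS = {"det", "amod", "nummod", "nmod", "compound", "flat", "case"}


def base_relation(deprel):
    """Strip a relation subtype, e.g. 'nmod:poss' -> 'nmod'."""
    return deprel.split(":")[0]


def noun_phrases(tokens):
    """Collect maximal noun phrases from (id, form, upos, head, deprel) tuples."""
    by_id = {tok[0]: tok for tok in tokens}
    children = defaultdict(list)
    for tok_id, _form, _upos, head, deprel in tokens:
        if base_relation(deprel) in MODIFIER_RELATIONS:
            children[head].append(tok_id)

    phrases = []
    for tok_id, _form, upos, head, deprel in tokens:
        if upos not in NOMINAL_HEADS:
            continue
        parent = by_id.get(head)
        nested = (
            parent is not None
            and parent[2] in NOMINAL_HEADS
            and base_relation(deprel) in MODIFIER_RELATIONS
        )
        if nested:
            continue  # already covered by the phrase of its nominal parent
        span, stack = [], [tok_id]
        while stack:  # walk down the tree through modifier relations only
            current = stack.pop()
            span.append(current)
            stack.extend(children[current])
        phrases.append(" ".join(by_id[i][1] for i in sorted(span)))
    return phrases


# Hand-written parse of "The old house of my friend collapsed".
example = [
    (1, "The", "DET", 3, "det"),
    (2, "old", "ADJ", 3, "amod"),
    (3, "house", "NOUN", 7, "nsubj"),
    (4, "of", "ADP", 6, "case"),
    (5, "my", "PRON", 6, "nmod:poss"),
    (6, "friend", "NOUN", 3, "nmod"),
    (7, "collapsed", "VERB", 0, "root"),
]
print(noun_phrases(example))  # ['The old house of my friend']
```

Because the rules operate on Universal Dependencies labels rather than on language-specific word order, the same traversal applies to any language for which a parser is available, which matches the motivation given in the abstract.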
-
Evaluating text coherence based on the graph of the consistency of phrases to identify symptoms of schizophrenia
Authors:
Artem Kramov
Abstract:
State-of-the-art methods for detecting schizophrenia symptoms based on the estimation of text coherence have been analyzed. Analysis of a text at the level of phrases has been suggested. A method based on the graph of the consistency of phrases has been proposed to evaluate the semantic coherence and cohesion of a text. Semantic coherence, cohesion, and other linguistic features (lexical diversity, lexical density) have been used to form feature vectors for training a classifier model. The classifier has been trained on a set of English-language interviews. Based on the results, the impact of each feature on the model's output has been analyzed. The results indicate that the proposed method based on the graph of the consistency of phrases may be used in different mental illness detection tasks.
Submitted 6 May, 2020;
originally announced May 2020.
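The two auxiliary features named in the abstract above, lexical diversity and lexical density, can be computed roughly as in the sketch below. The abstract does not give their formulas, so the definitions used here are assumptions: diversity as the type-token ratio, and density as the share of content words. The short `FUNCTION_WORDS` list is a hypothetical stand-in for a part-of-speech tagger, and the coherence and cohesion features from the phrase graph are not reproduced.

```python
import re

# Hypothetical, deliberately small list of English function words; a real
# system would more likely rely on part-of-speech tags.
FUNCTION_WORDS = {
    "a", "an", "the", "and", "or", "but", "of", "in", "on", "at", "to",
    "is", "are", "was", "were", "be", "it", "this", "that", "i", "you",
    "he", "she", "we", "they", "my", "your", "with", "for", "as", "not",
}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens (with apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def lexical_diversity(tokens):
    """Type-token ratio: distinct words divided by all words."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def lexical_density(tokens):
    """Share of tokens that are not in the function-word list."""
    if not tokens:
        return 0.0
    content = [t for t in tokens if t not in FUNCTION_WORDS]
    return len(content) / len(tokens)


def feature_vector(text):
    """Assemble the two features for one text, in a fixed order."""
    tokens = tokenize(text)
    return [lexical_diversity(tokens), lexical_density(tokens)]
```

A vector such as `feature_vector("The cat saw the dog")` could then be concatenated with coherence and cohesion scores before being passed to a classifier.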