Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network
Authors:
Mariana Rodrigues Makiuchi,
Tifani Warnita,
Nakamasa Inoue,
Koichi Shinoda,
Michitaka Yoshimura,
Momoko Kitazawa,
Kei Funaki,
Yoko Eguchi,
Taishiro Kishimoto
Abstract:
We propose a non-invasive and cost-effective method to automatically detect dementia by utilizing solely speech audio data. We extract paralinguistic features for a short speech segment and use Gated Convolutional Neural Networks (GCNN) to classify it into dementia or healthy. We evaluate our method on the Pitt Corpus and on our own dataset, the PROMPT Database. Our method yields the accuracy of 7…
▽ More
We propose a non-invasive and cost-effective method to automatically detect dementia by utilizing solely speech audio data. We extract paralinguistic features for a short speech segment and use Gated Convolutional Neural Networks (GCNN) to classify it into dementia or healthy. We evaluate our method on the Pitt Corpus and on our own dataset, the PROMPT Database. Our method yields the accuracy of 73.1% on the Pitt Corpus using an average of 114 seconds of speech data. In the PROMPT Database, our method yields the accuracy of 74.7% using 4 seconds of speech data and it improves to 80.8% when we use all the patient's speech data. Furthermore, we evaluate our method on a three-class classification problem in which we included the Mild Cognitive Impairment (MCI) class and achieved the accuracy of 60.6% with 40 seconds of speech data.
△ Less
Submitted 6 October, 2020; v1 submitted 16 April, 2020;
originally announced April 2020.
Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data
Authors:
Tifani Warnita,
Nakamasa Inoue,
Koichi Shinoda
Abstract:
We propose an automatic detection method of Alzheimer's diseases using a gated convolutional neural network (GCNN) from speech data. This GCNN can be trained with a relatively small amount of data and can capture the temporal information in audio paralinguistic features. Since it does not utilize any linguistic features, it can be easily applied to any languages. We evaluated our method using Pitt…
▽ More
We propose an automatic detection method of Alzheimer's diseases using a gated convolutional neural network (GCNN) from speech data. This GCNN can be trained with a relatively small amount of data and can capture the temporal information in audio paralinguistic features. Since it does not utilize any linguistic features, it can be easily applied to any languages. We evaluated our method using Pitt Corpus. The proposed method achieved the accuracy of 73.6%, which is better than the conventional sequential minimal optimization (SMO) by 7.6 points.
△ Less
Submitted 30 March, 2018;
originally announced March 2018.