-
Analysis of Fatigue-Induced Compensatory Movements in Bicep Curls: Gaining Insights for the Deployment of Wearable Sensors
Authors:
Ming Xuan Chua,
Yoshiro Okubo,
Shuhua Peng,
Thanh Nho Do,
Chun Hui Wang,
Liao Wu
Abstract:
A common challenge in Bicep Curls rehabilitation is muscle compensation, where patients adopt alternative movement patterns when the primary muscle group cannot act due to injury or fatigue, significantly decreasing the effectiveness of rehabilitation efforts. The problem is exacerbated by the growing trend toward transitioning from in-clinic to home-based rehabilitation, where constant monitoring and correction by physiotherapists are limited. Developing wearable sensors capable of detecting muscle compensation becomes crucial to address this challenge. This study aims to gain insights into the optimal deployment of wearable sensors through a comprehensive study of muscle compensation in Bicep Curls. We collect upper limb joint kinematics and surface electromyography (sEMG) signals from eight muscles in 12 healthy subjects during standard and fatigue stages. Two muscle synergies are derived from sEMG signals and are analyzed comprehensively along with joint kinematics. Our findings reveal a shift in the relative contribution of forearm muscles to shoulder muscles, accompanied by a significant increase in activation amplitude for both synergies. Additionally, more pronounced movement was observed at the shoulder joint during fatigue. These results suggest focusing on the shoulder muscle activities and joint motions when deploying wearable sensors to effectively detect compensatory movements.
Submitted 25 May, 2024; v1 submitted 17 February, 2024;
originally announced February 2024.
-
Improving Speech Recognition for African American English With Audio Classification
Authors:
Shefali Garg,
Zhouyuan Huo,
Khe Chai Sim,
Suzan Schwartz,
Mason Chua,
Alëna Aksënova,
Tsendsuren Munkhdalai,
Levi King,
Darryl Wright,
Zion Mengesha,
Dongseong Hwang,
Tara Sainath,
Françoise Beaufays,
Pedro Moreno Mengibar
Abstract:
Automatic speech recognition (ASR) systems have been shown to have large quality disparities between the language varieties they are intended or expected to recognize. One way to mitigate this is to train or fine-tune models with more representative datasets. But this approach can be hindered by limited in-domain data for training and evaluation. We propose a new way to improve the robustness of a US English short-form speech recognizer using a small amount of out-of-domain (long-form) African American English (AAE) data. We use CORAAL, YouTube and Mozilla Common Voice to train an audio classifier to approximately output whether an utterance is AAE or some other variety including Mainstream American English (MAE). By combining the classifier output with coarse geographic information, we can select a subset of utterances from a large corpus of untranscribed short-form queries for semi-supervised learning at scale. Fine-tuning on this data results in a 38.5% relative word error rate disparity reduction between AAE and MAE without reducing MAE quality.
Submitted 16 September, 2023;
originally announced September 2023.
-
Extending the symmetry of the massless Klein-Gordon equation under the general disformal transformation
Authors:
Allan L. Alinea,
Mark Ruel D. Chua
Abstract:
The Klein-Gordon equation, one of the most fundamental equations in field theory, is known not to be invariant under conformal transformation. However, its massless limit exhibits symmetry under Bekenstein's disformal transformation, subject to some conditions on the disformal part of the metric variation. In this study, we explore the symmetry of the Klein-Gordon equation under the general disformal transformation encompassing that of Bekenstein and a hierarchy of 'sub-generalisations' explored in the literature (within the context of inflationary cosmology and scalar-tensor theories). We find that the symmetry in the massless limit can be extended under this generalisation provided that the disformal factors take a special form in relation to the conformal factor. Upon settling the effective extension of symmetry, we investigate the invertibility of the general disformal transformation to avoid propagating non-physical degrees of freedom upon changing the metric. We derive the inverse transformation and the accompanying restrictions that make this inverse possible.
Submitted 4 January, 2023; v1 submitted 23 August, 2022;
originally announced August 2022.
-
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition
Authors:
Tsendsuren Munkhdalai,
Khe Chai Sim,
Angad Chandorkar,
Fan Gao,
Mason Chua,
Trevor Strohman,
Françoise Beaufays
Abstract:
Fast contextual adaptation has been shown to be effective in improving Automatic Speech Recognition (ASR) of rare words, and when combined with on-device personalized training, it can yield an even better recognition result. However, the traditional re-scoring approaches based on an external language model are prone to diverge during the personalized training. In this work, we introduce a model-based end-to-end contextual adaptation approach that is decoder-agnostic and amenable to on-device personalization. Our on-device simulation experiments demonstrate that the proposed approach outperforms the traditional re-scoring technique by 12% relative WER and 15.7% entity mention specific F1-score in a continuous personalization scenario.
Submitted 6 October, 2021; v1 submitted 4 October, 2021;
originally announced October 2021.
-
Audio Adversarial Examples: Attacks Using Vocal Masks
Authors:
Kai Yuan Tay,
Lynnette Ng,
Wei Han Chua,
Lucerne Loke,
Danqi Ye,
Melissa Chua
Abstract:
We construct audio adversarial examples on automatic Speech-To-Text systems. Given any audio waveform, we produce another by overlaying an audio vocal mask generated from the original audio. We apply our audio adversarial attack to five SOTA STT systems: DeepSpeech, Julius, Kaldi, wav2letter@anywhere and CMUSphinx. In addition, we engaged human annotators to transcribe the adversarial audio. Our experiments show that these adversarial examples fool State-Of-The-Art Speech-To-Text systems, yet humans are able to consistently pick out the speech. The feasibility of this attack introduces a new domain to study machine and human perception of speech.
Submitted 5 February, 2021; v1 submitted 4 February, 2021;
originally announced February 2021.
-
Semi-supervised and Unsupervised Methods for Heart Sounds Classification in Restricted Data Environments
Authors:
Balagopal Unnikrishnan,
Pranshu Ranjan Singh,
Xulei Yang,
Matthew Chin Heng Chua
Abstract:
Automated heart sounds classification is a much-required diagnostic tool in view of the increasing incidence of heart-related diseases worldwide. In this study, we conduct a comprehensive study of heart sounds classification by using various supervised, semi-supervised and unsupervised approaches on the PhysioNet/CinC 2016 Challenge dataset. Supervised approaches, including deep learning and machine learning methods, require large amounts of labelled data to train the models, which are challenging to obtain in most practical scenarios. In view of the need to reduce the labelling burden for clinical practices, where human labelling is both expensive and time-consuming, semi-supervised or even unsupervised approaches in restricted data settings are desirable. A GAN-based semi-supervised method is therefore proposed, which allows the usage of unlabelled data samples to boost the learning of data distribution. It achieves a better performance in terms of AUROC over the supervised baseline when limited data samples exist. Furthermore, several unsupervised methods are explored as an alternative approach by considering the given problem as an anomaly detection scenario. In particular, the unsupervised feature extraction using 1D CNN Autoencoder coupled with one-class SVM obtains good performance without any data labelling. The potential of the proposed semi-supervised and unsupervised methods may lead to a workflow tool in the future for the creation of higher quality datasets.
Submitted 3 June, 2020;
originally announced June 2020.
-
BiRA-Net: Bilinear Attention Net for Diabetic Retinopathy Grading
Authors:
Ziyuan Zhao,
Kerui Zhang,
Xuejie Hao,
Jing Tian,
Matthew Chin Heng Chua,
Li Chen,
Xin Xu
Abstract:
Diabetic retinopathy (DR) is a common retinal disease that leads to blindness. For diagnosis purposes, DR image grading aims to provide automatic DR grade classification, which is not addressed in conventional research methods of binary DR image classification. Small objects in the eye images, like lesions and microaneurysms, are essential to DR grading in medical imaging, but they could easily be influenced by other objects. To address these challenges, we propose a new deep learning architecture, called BiRA-Net, which combines an attention model for feature extraction and a bilinear model for fine-grained classification. Furthermore, considering the distance between different grades of different DR categories, we propose a new loss function, called grading loss, which leads to improved training convergence of the proposed approach. Experimental results are provided to demonstrate the superior performance of the proposed approach.
Submitted 1 July, 2019; v1 submitted 15 May, 2019;
originally announced May 2019.