MindArm: Mechanized Intelligent Non-Invasive Neuro-Driven Prosthetic Arm System
Abstract
Currently, people with disability or difficulty to move their arms (referred to as “patients”) have very limited technological solutions to efficiently address their physiological limitations. It is mainly due to two reasons: (1) the non-invasive solutions like mind-controlled prosthetic devices are typically very costly and require expensive maintenance; and (2) other solutions require costly invasive brain surgery, which is high risk to perform, expensive, and difficult to maintain. Therefore, current technological solutions are not accessible for all patients with different financial backgrounds. Toward this, we propose a low-cost technological solution called MindArm, a mechanized intelligent non-invasive neuro-driven prosthetic arm system. Our MindArm system employs a deep neural network (DNN) engine to translate brain signals into the intended prosthetic arm motion, thereby hel** patients to perform many activities despite their physiological limitations. Here, our MindArm system utilizes widely accessible and low-cost surface electroencephalogram (EEG) electrodes coupled with an Open Brain Computer Interface and UDP networking for acquiring brain signals and transmitting them to the compute module for signal processing. In the compute module, we run a trained DNN model to interpret normalized micro-voltage of the brain signals, and then translate them into a prosthetic arm action via serial communication seamlessly. The experimental results on a fully working prototype demonstrate that, from the three defined actions, our MindArm system achieves positive success rates, i.e., 91% for idle/stationary, 85% for shake hand, and 84% for pick-up cup. This demonstrates that our MindArm provides a novel approach for an alternate low-cost mind-controlled prosthetic devices for all patients.
Index Terms:
EEG, Myoelectric, RPNI, BCI, Artefacts, Artificial Intelligence, Fast Fourier Transform.I Introduction
In recent years, approximately 5.4 million people live with paralysis in the United States alone [1] and 57.7 million people live with limb amputations globally [2]; see the global distribution of age-standardized amputation rates in Fig. 1. Among them, there are people with disability or difficulty to move their arms, which we refer to as “patients” in this paper for brevity. These patients have very limited technological solutions for efficiently addressing their physiological limitations as current technological solutions are not easily accessible for all patients with different financial backgrounds, because the existing solutions are typically very costly. For instance, the non-invasive solutions such as mind-controlled prosthetic devices are typically very costly (i.e., over $100K), and hence expensive to maintain [3][4]. Other solutions require invasive brain surgery which is very expensive, high risk, and difficult to maintain, thereby hindering its widespread adoption [5][6]. Hence, there is a significant need for an alternative low-cost solution that help the patients to perform diverse activities.
Targeted Research Problem: How can we develop a low-cost solution that can help the patients to move their arms for performing desired actions? An efficient solution to this problem will help the patients from different financial backgrounds to access an alternate low-cost solution for addressing their physiological limitations (i.e., unable to move hands) and performing diverse activities, thereby improving their quality of life.
I-A State-of-the-Art Solutions and Their Limitations
To address the physiological limitations of the patients, the existing solutions can be loosely classified into two categories: (1) non-invasive solutions, such as mind-controlled prosthetic devices; (2) and invasive solutions, such as brain-computer interface (BCI).
Non-invasive Solutions: These solutions do not require invasive devices inside the patients’ body to obtain high quality information signals. For instance, prosthetic-based solutions like “DEKA bionic arm” [8] use electromyography (EMG) signals to capture the muscle signals and then translate them into the desired action. Another non-invasive solutions (e.g., gesture control armband) focus on capturing and interpreting muscle signals without physical actuator attached [9], thereby making them lightweight. In such EMG-based solutions, when the patients contracts their muscles, the electrodes detect the muscle signals, then send them to the compute module for further processing [10] [11]. However, EMG signals may not be present in amputated patients, or cases where physical conditions are severely compromised (e.g., patients with severe disability, such as paraplegia and tetraplegia/quadriplegia). Furthermore, the other non-invasive solutions like mind-controlled prosthetic devices are typically very costly (i.e., over $100K) and hence expensive to maintain [3][4]. It is partially due to the effective yet expensive materials such as titanium alloys [12].
Invasive Solutions: To improve the quality of information signals, some solutions employ invasive devices to access the signal sources. Consequently, this device needs to be physically implanted inside the patients’ body through invasive surgery. However, such an approach is a high risk procedure to perform and typically very expensive. In addition, most of such technologies are not wireless, thereby making them difficult to maintain. For instance, to acquire electroencephalogram (EEG) waves, ones need to put electrodes in the patients’ head (skull), and attach the plug and connecting cables to the compute module. Furthermore, such invasive-based solutions are not commercially available, and it is estimated that when it becomes available, it may cost hundreds of thousands of dollars [13]. Hence, not all patients with different financial backgrounds can afford that.
In summary, the existing technological solutions are still very expensive as well as difficult to maintain. Moreover, in the invasive solution cases, there is a high risk to the patients’ body, which may cause other negative side effects.
Given the benefits and weaknesses of the state-of-the-art, we identify that the potential solution is to consider a non-invasive solution with low-cost technology, while ensuring high accuracy of signal processing that can correctly interpret the input signals into actions. To fulfill such requirements, we opt to develop a non-invasive EEG-based prosthetic arm.
I-B Scientific Challenges
Our non-invasive EEG-based prosthetic arm solution bears potentials to address the existing weaknesses in the state-of-the-art, but it also imposes several challenges, as discussed in the following.
-
•
To reduce the design cost, one of the main challenge is to design a system with an effective signal processing pipeline, that can be implemented using a low-cost off-the-shelf devices and modules.
-
•
There is environmental noise which pollutes the EEG signals. Such artefacts need to be removed, and thereby requiring an effective denoising process.
-
•
It requires an effective algorithm to learn and extract information from EEG signals. Once the system learns the EEG features, it should be able to correlate these features to the corresponding prosthetic arm action.
I-C Our Novel Contributions
To address the targeted problem and scientific challenges, we propose MindArm, a mechanized intelligent non-invasive neuro-driven prosthetic arm system; see an overview in Fig. 2. It employs a deep neural network (DNN) engine to extract information from EEG signals for identifying the given instruction, then translate it into a prosthetic arm action. To achieve this, our MindArm system makes the following novel contributions.
-
•
Removing Noise in the EEG Signals (Section II-A): We reduce noise in the EEG signals by extracting band power from each channel with respective frequencies such that residual noise is filtered, as well as employing metal insulation to reduce eddy currents and domestic alternate current noise.
-
•
Learning EEG Signal Features using DNN Training (Section II-B): We employ DNN training to effectively learn EEG signal features that are obtained from a low-cost off-the-shelf Ganglion board. We employ a window buffer size to overcome the shortcomings of a small number of EEG channels on the Ganglion board.
-
•
Low-Cost Prosthetic Arm Design (Section II-C): We design the prosthetic arm structure in Fusion360, and build it using 3D printer and Prusa MK3 with a combination of ABS111ABS: Acrylonitrile Butadiene Styrene, PETG222PETG: Poly-Ethylene Terephthalate Glyco, and PLA333PLA: Poly-Lactic Acid filaments to provide a low weight yet strong structure. Lastly, the design of the prosthetic is modular, allowing all parts can be easily replaced if required, thereby reducing the maintenance cost.
Key Results: To evaluate our MindArm, we build a complete setup encompassing EEG acquisition module, compute module (i.e., DNN engine), and actuator module (e.g., servo motor and prosthetic arm), whose total cost is $450.
Our fully functional prototype of the MindArm system demonstrates promising success rates in performing three predefined actions, showcasing the efficacy of MindArm as an affordable solution for a mind-controlled prosthetic arm.
II Methodology
In this section we will describe the MindArm system in detail, along with the dataset generation, refinement and training workflow. We also elaborate the prosthetic design and label feedback selection along with system integration; see an overview in Fig. 3.
II-A EEG Data Collection & Extraction
The off-the-shelf state-of-the-art devices on the market for collecting EEG signals include the OpenBCI complete Ultracortex [14], Emotiv EPOC X, and Flex kit. However, in addition to the price of the Emotiv kits [15], they also require preparation of either saline soaked felt or gel coating further decreasing practical effectiveness compared to the dry electrodes utilized with the Ganglion brain computer interface [14]. Therefore, to maintain a low price for the prosthetic and maintain practicality of dry electrodes, the Ganglion board is utilized.
The OpenBCI Ganglion board features 4 EEG channels and 2 references, facilitating the use of both dry comb electrodes and flat electrodes with a snap connection interface coated in silver-silver chloride [16]. Flat electrodes are positioned at Fp1 and Fp2 locations in the nasion, and comb electrodes at T3 and T4, with references placed at A1 and A2. The placement at Fp1 and Fp2 aims to enhance alpha wave detection. Although alpha waves are more prominent in the occipital lobe, placement near the inion (back of the head) often encounters greater noise. Consequently, while O1 and O2 locations yield stronger alpha and theta wave signals, achieving stable amplitudes across different frequencies using Fast Fourier Transform (FFT) analysis proves more challenging. Evidently, a scientific challenge arises when attempting to interpret data from only 4 EEG channels compared to numerous EEG channels in non-invasive state of the art devices.
Device Name | Price | Channel/Sensors | Requirements |
---|---|---|---|
Ganglion BCI | $400 | 4 channels | None |
Emotiv EPOC X | $800 | 14 sensors | Saline/Gel |
Emotiv Flex Kit | $1700 | 32 sensors | Saline/Gel |
openBCI Ultracortex | $2400 | 16 channels | None |
In the initial phase of the study, rather than directly decomposing the EEG signal into specific frequency bands, a more rudimentary threshold-based system was employed to ascertain the subject’s state of relaxation or concentration. The determination of the subject being relaxed or focused was then mapped to corresponding actions to be executed by the prosthetic device.
The results indicate promising trends; however, the current method presents a notable limitation: the states of concentration and relaxation are not distinctly categorized by Graphical User Interface (GUI). Consequently, the algorithm fails to differentiate between ‘relaxed’ and ‘concentrated’ states, only recognizing ‘relaxed and stationary’ as well as ‘concentrated and stationary’ states. This issue stems from an observable overlap in the metric thresholds for relaxation and concentration within the GUI. Therefore, it is imperative to develop an alternative approach that enables accurate prediction across all three desired states: handshaking, cup gras**, and remaining stationary. Notably, a parallel in amplitude across various frequencies is observed when comparing the thought of handshaking with the relaxed state, and the thought of cup gras** with the concentrated state.
![Refer to caption](extracted/2403.19992v1/Figures/Result_3.png)
![Refer to caption](extracted/2403.19992v1/Figures/Result_4.png)
To enhance the accuracy of state prediction, the proposed methodology involves adopting a strategy of associating relaxation with the intent to shake hands and concentration with the intent to pick up a cup.
To validate the efficacy of this approach, a novel signal processing technique is employed, which decomposes EEG signals into five distinct brain wave categories: delta, theta, alpha, beta, and gamma, each within their characteristic frequency bands: delta (1–4 Hz), theta (4–8 Hz), alpha (8–13 Hz, with sub-bands alpha-1 8–10 Hz and alpha-2 11–13 Hz), beta (13–30 Hz), and gamma (above 30 Hz). This spectral decomposition allows for more precise extraction of features from each channel, thereby reducing noise transmission through the User Datagram Protocol (UDP) network and enhancing algorithmic accuracy. The choice of UDP is driven by its expedited data transfer capabilities, absence of connection establishment procedures, and greater efficiency via lower bandwidth usage and overhead, thus compensating for potential latency and ensuring a robust set of features for the neural network classifier, contributing to a low-maintenance system [17].
Moreover, the system capitalizes on the ganglion board’s capability to process data at a sampling rate of 200Hz by transmitting only the most significant bit from each EEG sample [18]. This transmission approach results in slightly lower sample resolution, which is considered a negligible trade-off. Consequently, the neural network receives a substantial, numerically labeled dataset every second for each classification category. This enhances the learning rate of the prosthetic actions, offering a cost-effective and faster training methodology when compared to other BCI devices within the same price bracket of $400, such as the ganglion.
A significant impediment encountered with non-invasive mind-controlled prosthetics is environmental electrical noise interference at the user’s location. To mitigate this, data was sampled across various settings with differing levels of domestic alternating current (AC). Areas with minimal AC interference exhibited reduced noise in the EEG channels. Despite this, the reduction was not adequate to yield clean channel features suitable for neural network input. Moreover, this approach does not align with the practical need for portability, as the prosthetic should function optimally in diverse locations irrespective of AC noise levels. The results of this filtering process are shown in Fig. 10.
With artifacts minimized, it is essential to transmit the band power data to the Python IDE in real-time. This transmission is facilitated through UDP networking, enabling data transfer via a designated port and socket. The maximum buffer size is set to 1024 bytes, which sufficiently accommodates the data payload. For each sample frame per second, the data is converted from binary to decimal format, flattened, and then written to a designated file, where all the data within that file is categorized under the same class.
The flattening process is represented as: flattened_data = [i] + [item for sublist in data_list for item in sublist]
Here, ‘i’ represents the input number, facilitating continuous tracking of the dataset size.
II-B Neural Network Training
Currently, the algorithm processes three datasets, each corresponding to a distinct action executed by the prosthetic:
-
•
Shaking hands with an individual.
-
•
Remaining stationary or in a resting position.
-
•
Picking up a cup.
During the data collection phase, the user is instructed to concentrate on one of three predetermined actions for a fixed duration. Concurrently, the acquired data undergoes cleaning procedures before being stored in a CSV file, with each entry tagged with a numerical label corresponding to the envisioned action. For instance, data associated with the act of shaking hands is recorded in ‘shakehands.csv’ and labeled as ‘0’. Although the intended sampling rate is 200Hz, practical limitations due to latency in data processing result in an effective rate of approximately 50Hz. This discrepancy necessitates an extended duration for the training phase. The resultant datasets comprise between 10,000 and 20,000 sampled pieces, employed for training the neural network. The data structure includes 20 columns, reflecting the aggregation of five normalized brainwave metrics across four distinct EEG channels.
In the exploration of optimal neural network architectures for our dataset, a diverse range of models was assessed. These included simple Feedforward Neural Networks (FFNNs), Recurrent Neural Networks (RNNs), Long Short-Term Memory networks (LSTMs), CNN-LSTM hybrids, and Transformer-based networks. Among these, the Transformer-based network outperformed other models achieving an impressive validation accuracy of 97.1%. This superior performance can be attributed to the Transformer’s ability to process sequential data in parallel and its efficient handling of long-range dependencies, which are critical for understanding complex patterns within the EEG dataset. However, the trade-off for this high level of accuracy involves increased computational resources and training time, compared to simpler models like FFNNs or RNNs. The decision to employ a Transformer-based model thus reflects a strategic balance between seeking optimal performance and managing the computational costs associated with more sophisticated architectures. The transformer architecture is illustrated in Fig. 14. The training accuracy of these networks are depicted in Fig. 15 showcasing the comparative training performance across epochs for the various network architectures.
Initial training sessions revealed challenges with label volatility, as the model output action labels at a rate of approximately 40 times per second over UDP. This high frequency of label generation led to instances where transient thought patterns inadvertently triggered unintended actions. For example, a brief, unintended contemplation of an action could result in the erroneous activation of the prosthetic arm.
To address this, the model’s training and input collection were adapted to include a larger window size, enhancing data stability and output accuracy. It is critical for the input dataset to accurately reflect sustained thought patterns associated with specific actions, which typically last more than a fortieth of a second. In practice, a thought duration of at least 2 seconds is necessary for consistent brain wave intensity.
Accordingly, an optimized window size of 100 was implemented. This setup accumulates 100 rows of CSV data—each row representing a 20-dimensional vector from the EEG—into a single tensor. This tensor is then reshaped into a 1x2000 matrix (20x100), serving as the input for the neural network. This approach ensures that the input data effectively represents approximately two seconds of EEG data, allowing for more accurate and representative model outputs as live EEG data is streamed.
This approach necessitates that the neural network receives a singular, consolidated input for processing. Consequently, the total number of potential inputs from each file was determined, followed by the resha** of matrices to conform to the neural network’s specified input dimensions
Action | Original Dimension | Windowed Dimension |
---|---|---|
Shake Hands | ||
Stay Stationary | ||
Pick Up Cup |
With the refactored algorithm, an output of the intended user action is transmitted to the Arduino during inference every 2 seconds. This decreases volatility of the actions as a substantial greater number of features is taking into consideration for a sustained time period, thereby enhancing the robustness of the model.
Additionally, the neural network undergoes pre-training prior to its deployment and is subsequently stored on a cloud service. This strategy ensures that the duration for outputting a label is shorter than the interval between consecutive inputs, thereby reducing the potential for data loss during the wireless EEG streaming process.
II-C Prosthetic Design and Label Feedback
The prosthetic control system transmits the output label to the Arduino via serial communication every 2 seconds. At a standardized baud rate, the Arduino interprets the received number ‘0, 1, or 2’ and initiates the corresponding action. To prevent potential damage to the servos and artificial tendons due to rapid oscillation between positions, the algorithm is designed to pause reading incoming data until the current action is fully executed.
Initially, the prosthetic’s actuation mechanism relied on the contraction and relaxation of tendons, facilitated by a servo horn. However, this design was found impractical due to the persistent friction between the tendons, made of braided fishing line, and the joint pins. Consequently, this friction led to the tendons’ degradation over time, resulting in both wear and tear of the prosthetic components and a decline in performance.
![Refer to caption](extracted/2403.19992v1/Figures/hand_model.png)
20 kg torque servos are utilized at the elbow joint to ensure the load borne by the prosthetic is adequately supported, thereby guaranteeing system durability and practicality. A modular design was developed for each servo compartment, allowing users to easily replace servos without the need for specialized tools.
In the current market, commercially available realistic prosthetic gloves, exemplified by products from companies like Ottobock, typically start at a price point exceeding 250 USD. In contrast, this study utilized a silicone mold technique to fabricate a comparable realistic prosthetic glove at a material cost of just 15 USD. This substantial reduction in cost represents a significant stride toward democratizing access to prosthetic technology, markedly lowering the financial barrier for potential users.
III Results and Discussion
III-A Experimental Setup
The experimental setup for the prosthetic system comprises an array of specialized tools and technologies. The neural signal acquisition is managed using the Ganglion board and accompanying GUI from OpenBCI. Data communication is facilitated via UDP networking. For the development and training of the neural network, the PyTorch framework is employed. The system’s commands are transmitted through serial communication with the Arduino C++ IDE. The design and prototy** phase utilizes Tinkercad and Fusion360 for 3D modeling, alongside custom settings in Prusa Slicer for 3D printing and circuit schematic refinement.
III-B Analysis
The Transformer network’s was analysed by interfacing the model output with an Arduino board using serial communication protocols. The Arduino was programmed to translate the neural network’s output into actionable commands for the prosthetic hand. Each predicted action from the network triggered the corresponding movement in the prosthetic hand, showcasing the potential of this system in real-world applications. The deployment of this system demonstrated not only the high accuracy of the Transformer network, as reflected in the classification report and the confusion matrix but also its capability to operate in real time with the physical hardware, offering a seamless transition from prediction to action execution.
Class | Precision | Recall | F1-Score |
---|---|---|---|
pickUpCup | 0.86 | 0.82 | 0.84 |
shakeHands | 0.83 | 0.86 | 0.85 |
stayStationary | 0.90 | 0.92 | 0.91 |
Accuracy | 0.86 | ||
Macro Avg | 0.86 | 0.87 | 0.86 |
Weighted Avg | 0.86 | 0.86 | 0.86 |
IV Conclusion
In this paper, we present a MindArm methodology to realize a low-cost mind-controlled prosthetic arm solution for people of determination, so that they can move their hands for doing activities. It translates brain signals into the intended arm motion by utilizing EEG technologies and employing DNN model to interpret brain signals into a prosthetic arm action. The experimental results show that, our MindArm system achieves positive success rates in three different actions, i.e., 90% for idle/stationary, 80% for shake hand, and 80% for pick-up cup. This demonstrates that our MindArm provides a novel approach for an alternate low-cost mind-controlled prosthetic devices for all people.
References
- [1] B. Armour, E. Courtney-Long, M. Fox, H. Fredine, and A. Cahill, “Prevalence and causes of paralysis-united states, 2013,” American journal of public health, vol. 106, pp. e1–e3, 08 2016.
- [2] C. McDonald, S. Westcott-McCoy, M. Weaver, J. Haagsma, and D. Kartin, “Global prevalence of traumatic non-fatal limb amputation,” Prosthetics and Orthotics International, vol. Publish Ahead of Print, 12 2020.
- [3] E. Kwek and M. Choi, “Is a prosthetic arm customized prada? a critical perspective on the social aspects of prosthetic arms,” Disability & Society, vol. 31, pp. 1144 – 1147, 2016. [Online]. Available: https://api.semanticscholar.org/CorpusID:152081983
- [4] T. Beyrouthy, S. A. Kork, J. A. Korbane, and A. Abdulmonem, “Eeg mind controlled smart prosthetic arm,” 2016 IEEE International Conference on Emerging Technologies and Innovative Business Practices for the Transformation of Societies (EmergiTech), pp. 404–409, 2016. [Online]. Available: https://api.semanticscholar.org/CorpusID:34212152
- [5] L. H. B. Huinink, H. Bouwsema, D. H. Plettenburg, C. K. van der Sluis, and R. M. Bongers, “Learning to use a body-powered prosthesis: changes in functionality and kinematics,” Journal of NeuroEngineering and Rehabilitation, vol. 13, 2016. [Online]. Available: https://api.semanticscholar.org/CorpusID:1815633
- [6] J. Collinger, S. Foldes, T. Bruns, B. Wodlinger, R. Gaunt, and D. Weber, “Neuroprosthetic technology for individuals with spinal cord injury,” The journal of spinal cord medicine, vol. 36, pp. 258–272, 07 2013.
- [7] B. Yuan, D. Hu, S. Gu, S. Xiao, and F. Song, “The global burden of traumatic amputation in 204 countries and territories,” Frontiers in Public Health, vol. 11, 2023. [Online]. Available: https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2023.1258853
- [8] C. Bloomer and K. L. Kontson, “Comparison of deka arm and body-powered upper limb prosthesis joint kinematics,” Archives of Rehabilitation Research and Clinical Translation, vol. 2, no. 3, p. 100057, 2020.
- [9] S. M. M. Rahman, H. Mattila, M. Janka, and J. Virkki, “Impedance evaluation of textile electrodes for eeg measurements,” Textile Research Journal, vol. 93, no. 7-8, pp. 1878–1888, 2023. [Online]. Available: https://doi.org/10.1177/00405175221135131
- [10] K. R. K. Kadir A. Yildiz, Alexander Y. Shin, “Interfaces with the peripheral nervous system for the control of a neuroprosthetic limb: a review,” Journal of NeuroEngineering and Rehabilitation, vol. Publish Ahead of Print, March 2020.
- [11] P. Visconti, F. Gaetani, G. Zappatore, and P. Primiceri, “Technical features and functionalities of myo armband: An overview on related literature and advanced applications of myoelectric armbands mainly focused on arm prostheses,” International Journal on Smart Sensing and Intelligent Systems, vol. 11, pp. 1–25, 06 2018.
- [12] M. Sarraf, E. Rezvani Ghomi, S. Alipour, S. Ramakrishna, and N. Liana Sukiman, “A state-of-the-art review of the fabrication and characteristics of titanium and its alloys for biomedical applications,” Bio-design and Manufacturing, pp. 1–25, 2021.
- [13] D. R. Sandra V A, “Brain gate technology,” International Journal of Engineering Research & Technology (IJERT), vol. Publish Ahead of Print, 2015.
- [14] OPENBCI. (2024) The complete ultracortex. [Online]. Available: https://shop.openbci.com/products/the-complete-headset-eeg
- [15] Emotiv. (2024) Epoc. [Online]. Available: https://www.emotiv.com/epoc
- [16] OPENBCI. (2024) Ganglion board (4-channels). [Online]. Available: https://shop.openbci.com/products/ganglion-board
- [17] A. Roshdy, S. Al Kork, S. Said, and T. Beyrouthy, “A wearable exoskeleton rehabilitation device for paralysis-a comprehensive study,” vol. 4, pp. 17–26, 01 2019.
- [18] OpenBCI. (2024) Ganglion data format. [Online]. Available: https://docs.openbci.com/Ganglion/GanglionDataFormat/