Full text

Turn on search term navigation

1. Introduction

In recent decades, biomedical signals have been used for communication in Human–Computer Interfaces (HCI) for medical applications; an instance of these signals are the myoelectric signals (MES), which are generated in the muscles of the human body as unidimensional patterns. Because of this, the methods and algorithms developed for pattern recognition in signals can be applied for their analyses once these signals have been sampled and turned into electromyographic (EMG) signals. Additionally, in recent years, many researchers have dedicated their efforts to studying prosthetic control by means of EMG signal classification, that is, by logging a set of MES in a proper range of frequencies to classify the corresponding EMG signals.

The EMG signals are obtained from sensors placed on the skin surface and can help retrieve muscular information during contractions when flexing or extending an articulation. There are also implants placed under the skin that facilitate the signal acquisition, but these are not commonly used.

Regarding the pattern recognition problem for myoelectric control systems, its success depends mostly on the classification accuracy [1] because myoelectric control algorithms are capable of detecting movement intention; therefore, they are mainly used to actuate prostheses for amputees [2].

With the aim of carrying out the pattern recognition for myoelectric applications, a series of features is extracted from the myoelectric signal for classification purposes. The feature classification can be carried out on the time domain or by using other domains such as the frequency domain (also known as the spectral domain), time scale, and time–frequency, amongst others [3].

One of the main methods used for pattern recognition in myoelectric signals is the Support Vector Machines (SVM) technique whose primary function is to identify an n-dimensional hyperplane to separate a set of input feature points into different classes. This technique has the potential to recognize complex patterns [4] and on several occasions it has proven its worth when compared to other classifiers such as Artificial Neural Network (ANN), Linear Discriminant Analysis (LDA) and Particle Swarm Optimization (PSO) [5,6,7,8]. The key concepts underlying the SVM are: (a) the hyperplane separator; (b) the kernel function; (c) the optimal separation hyperplane; and (d) a soft margin (hyperplane tolerance).

A compilation of the most outstanding works that combine different techniques based on SVM is presented in this paper. It also includes a list of those features most commonly used in the time, frequency, time–frequency, and spatial domains for pattern recognition. Finally, other applications of the SVM-based classifier are included in the last section.

2. EMG Signals

Pattern recognition-based myoelectric signal classification consists of logging a specific time interval of EMG signals coming from the muscles and performing many repetitions with different movements, to segment them later. The classification is performed by extracting the features in each interval of the signal to recognize the characteristic information of each movement. From these data, the training is accomplished—which varies according to the method—and, in this way, it is possible to classify the type of movement.

However, the myoelectric signal acquisition is not a simple procedure since the EMG signals have a high noise content since they are not extracted directly from the muscles, but they need to go through the different layers of the skin between the electrode and the pulse generated by the muscle. Besides, the signal acquisition instruments introduce noise themselves by the parasitic frequencies on the power line.

2.1. Signal Acquisition

Due to the acquisition process, it is necessary to make a signal pre-processing step before performing the sample segmentation. This pre-processing stage consists of applying different classes of filters, such as a notch filter to eliminate the power line noise. The noise generated by the skin layers is at frequencies above 500 Hz [1,2,8,9,10,11,12,13,14,15,16,17,18,19,20,21] and at those below 10 Hz [1,9,11,12,14,18,20,22]. Some authors consider that there is noise also in higher frequencies, and thus they use filters that cancel up to 20 Hz [2,6,8,13,15,16,17,19,21,23,24]. Nevertheless, in [7], the authors suppressed frequencies between 90 and 250 Hz and, in [25], the authors removed frequencies lower than 5 Hz and higher than 375 Hz. Table 1 summarizes the band-pass frequency allowed by each study.

Another issue found in the study of myoelectric signals is the frequency at which EMG signals should be sampled—a high frequency could give excess noise, and a lower one could lose a vast amount of information. The sampled frequency most commonly used is 1 kHz [1,9,10,12,13,18,19,27,28,29]. Other authors (e.g., [2,11,22,26,30,31]) use a higher frequency of 1.5, 2, 3, 4, or 10 kHz. In addition, some authors use lower frequencies, such as 500 Hz [15,32]. Table 2 exhibits a recap of the sampling frequencies used by different studies.

Generally, for signal classification, more than one signal is required, because every movement is originated from different parts of the muscle and depends on a number of different muscles; therefore, the use of different channels helps to extract as much information as possible from the action(s) performed by the muscle(s). Among the various studies that have been done, it is common to work with four [1,9,13,23,29,38,39], six [19,40,41], or eight [2,7,11,22,30] channels for the acquisition of the signal; some research papers even work with a smaller number of channels [26,42]. Table 3 depicts an abridgement of the number of channels used by different studies and Table 4 summarizes the electrode type used and the place of electrode placement body. On the other hand, Doulah et al. [33] used a device that includes potentiometers, accelerometers, gyroscopes and force sensors and Lin et al. [43] only used potentiometers to perform their calculations.

The separation between electrode locations is also vital. Most authors recommend a 20 mm distance; however, other authors differ from this opinion since their studies have yielded different results. There are those who indicate that the optimal distance is 10 mm [12] while others point out the optimal distance as 40 mm [55]. At the other extreme, certain authors remark that, when there is more than one channel available to read EMG signals, using a longitudinal channel as well as a transversal one is recommendable.

2.2. Feature extraction from an EMG signal

A parameter of an EMG signal is a stable variable or a value from a mathematical or physical model ideally associated with the generation or detection of an MES process, such as length and depth of a fiber, electrode surface or distance between them, coefficients of the auto-regressive model, etc. [28]. After the signal acquisition stage, a processing stage extracts a series of parameters for the analysis of the EMG signal.

After filtering the signal and digitalizing it, since the vector components do not have any meaning individually but only as a whole, it is necessary to characterize the vector representation of the signal. As a result, it is required to extract the features from the vector that represents the signal, and, from it, implement the vector classification.

A feature of an EMG signal is a unique property, which can be observed or described qualitatively, such as being big or small, fast or slow, and sharp or smooth. An EMG variable is a physical amount that can be computed, reported and transmitted in a numeric form, and that can change as a function of time, such as voltage, frequency, velocity, and delay, amongst others. The variable is estimated during a finite time interval known as an epoch [28].

Nevertheless, when the purpose of extracting the signal features is to control a certain device, it is necessary to obtain more information from each channel of the EMG signal or to assign a control function to a specific combination from the multi-channel system, which is the particular purpose of extracting characteristics from the signals [27].

For the extraction of features, the signals can be processed in the time domain; they can also be transformed into the frequency domain, or represented in the time–frequency space or in time scale. This process consists in assembling a feature vector with different parameters of the signal. Choosing the proper parameters that will form the feature vector correctly is of vital importance since this is the starting point from which classification is made.

That is, an MES is a time function, and, thus, it can be described in terms of its amplitude, frequency, or phase. Hence, for its study, the extracted features lie in different domains, such as time or frequency, and some of their variants. A description of the most common features is given in Appendix A.

2.2.1. Time Domain (TD)

MESs have a very particular structure during muscle contraction, which varies according to the movement performed by the extremity. For this reason, MES classification can be used to actuate a prosthesis. By processing signals in the time domain, there is an increase in the available time for analysis since there is no need for the time-consuming task of transforming the signal to a different domain [27].

Since the signals are usually sampled in the time domain, it is more common to extract features in that domain, since they do not need to be converted and can be processed directly. These time-domain signals are studied in depth and used by researchers from the medical and engineering fields.

Time-domain features are more natural and simpler to extract since they are calculated from the sampled MES time series (the EMG signal) without any intermediate transformation [23,27]. Notwithstanding, time-domain EMG signals also present some disadvantages, which come from the non-stationary properties of the MES, with time-varying statistical properties. Nonetheless, features in this domain are highly used, because of their performance during classification presents a very reduced amount of noise and their processing time is lower compared with those features found in the frequency domain and timescale.

2.2.2. Frequency Domain (FD)

Spectral analysis, also known as representation in the frequency domain, is instrumental in studying muscle fatigue and it is influenced by the firing rate of the motor unit in frequencies lower than 40 Hz and for the morphology of the action potential in muscle fiber in frequencies higher than it [56].

2.2.3. Time–Frequency Domain (TFD)

TFD features are more sophisticated computationally than time-domain features. However, there are fast algorithms with which the characteristics can be implemented in TFD in order that real-time requirements necessary for MES classification are still met [9,11,50,57].

2.2.4. Spatial Domain (SD)

Spatial Domain features allow finding an improvement in the difference between postures and MES signal force levels, which provide information about the spatial distribution of the motor unit action potential (MUAP) and load between muscles [58].

In theory, a classifier must be able to differentiate, according to the input values, to which class it belongs. An MES is, in essence, a one-dimensional pattern, so that the methods and algorithms developed for pattern recognition can be applied to its analysis. The information extracted from an MES, represented in a feature vector, is chosen to minimize the control error. The feature set should be selected as the one that separates as much as possible the desired output classes [27].

3. Myoelectric Signal Classification

There are many classifiers in the literature, such as Simple Logistic Regression (SLR), Artificial Neural Networks (ANN), Linear Discriminant Analysis (LDA), Naïve Bayes (NB), K-nearest neighbor (KNN), Nonlinear Logistic Regression (NLR), Multi-Layer Perceptron (MLP), and Support Vector Machines (SVM), among others. However, in some cases, such as the ones shown below, the classification of MES with SVM has demonstrated improved performance in terms of accuracy.

Recently, Dhindsa et al. [8] made a performance evaluation of several classifiers, using EMG signals to predict five levels of knee angles. They used 15 features per each of the four measured muscles, combining time and frequency features with four auto-regressive (AR) coefficients. The evaluated classifiers were LDA, NB, KNN, and SVM with different kernels, of which the quadratic kernel of SVM performed the best classification accuracy of $93.07 \pm 3.84$ %. In addition, in the study, the EMG signals were segmented in five different window sizes with various overlapped window schemes, and the best result was achieved with the overlapping of 500 ms and 250 ms window sizes.

Furthermore, EMG signals can be used for finger movement recognition. Purushothaman and Vikas [6] compared SVM against LDA and NB in the classification of 15 different finger movements from 15 subjects. They utilized Particle Swarm Optimization (PSO) and Ant Colony Optimization (ACO) as feature selection algorithms. MAV, ZC, SSC, and WL were the features extracted (see Appendix A). Remarkably, they achieved more than 95% effectiveness without feature selection using the SVM and 4% less with 16 features considered using PSO and ACO.

EMG signals also allow the diagnosis of muscular dystrophy disorder. Kehri and Awale [25] compared ANN with SVM in the classification of EMG signals to identify this disorder by using a wavelet-based decomposition technique. The results show the classification accuracy of an available clinical EMG database, of 140 samples, with 95% effectiveness using a polynomial kernel of fifth order in an SVM.

A low-cost mechatronics platform for the design and development of robotic hands was proposed by Geethanjali [14], by comparing the SVM with other classifiers, such as ANN, LDA, and SLR. For the database, the subjects performed six different hand movements, and from those signals, the time-domain features were extracted, such as MAV, ZC, SSC, WL, MAVS, VAR, RMS, WAMP, and fourth-order AR coefficients (see Appendix A). With these features, they ensembled five groups to demonstrate their influence. Finally, they applied different kernel functions in the classification with SVM and reached the best result with a linear kernel and normalized data of 92.8% in the group with MAV, SSV, WL, ZC and fourth-order AR coefficients.

3.1. Support Vector Machines

SVM are used quite often as classification algorithms, of body movements, images, sound, etc. SVM construct an optimal separation hyperplane into a feature space that is of high dimension, due to the entries that are mapped using non-linear functions, to distinguish between two (as depicted in Figure 1) or more types of objects. This theory was introduced by Vapnik and Corina in 1995 [59].

For the nonlinear separable problem, the input space is mapped into a high-dimensional feature space and the separation hyperplane is found in this new space. The optimal hyperplane needs to discriminate different categories correctly, and so the hyperplane with maximum clearance between classes them should be found, i.e., the hyperplane that best separates the classes.

In SVM, the training algorithm is reformulated as a problem to solve by Quadratic Programming (QP), whose solution is global and unique. Considering input training data $(x_{1}, y_{1})$ , …, $(x_{m}, y_{m}) \in R^{N} \times {- 1, + 1}$ , where $x_{i}$ corresponds to the input value and $y_{i}$ to the assigned class ( $- 1$ or $+ 1$ ) to which it belongs. If these data are not linearly separable, they are mapped by a non-linear transformation $ϕ : R^{N} \to R^{M}$ inside of a new feature space $R^{M}$ where the transformed data will be linearly separable. In this way, the obtained hyperplane that separates object types can be seen as

(1) $ω \cdot ϕ (x) + b = 0,$

where

ω \in R^{M}

and

b \in R

The QP problem is supposed to build an optimal hyperplane with a maximum value of separation and a closed error $ξ = (ξ_{1}, \dots, ξ_{m})$ in the training algorithm, that is, we aim to

(2) $\min_{ω, b} \frac{1}{2} {∥ ω ∥}^{2} + C \sum_{i = 1}^{m} ξ_{i} .$

subject to

$y_{i} (ω \cdot ϕ (x_{i}) + b) \geq 1 - ξ_{i}, i = 1, \dots, m .$

If the data points are too close, indeed, if it is difficult to separate them directly, it is possible to use a kernel function K to separate them. That is,

(3) $F (α) = \sum_{i = 1}^{m} α_{i} - \frac{1}{2} \sum_{j, k = 1}^{m} α_{j} α_{k} y_{j} y_{k} K (x_{j}, x_{k}),$

subject to

$\sum_{i = 1}^{m} y_{i} α_{i} = 0, C \geq α_{i} \geq 0, i = 1, \dots, m .$

where

K (x_{j}, x_{k})

is the kernel function, which can be a Radial Basis Function (RBF), a Gaussian, a polynomial, etc. A polynomial kernel can be linear, quadratic, cubic or of any degree d [6], which can be described as

(4) $K_{P} (x_{j}, x_{k}) = {(1 + x_{j} \cdot x_{k})}^{d} .$

The RBF of two samples, which are feature vectors, is defined by [60]:

(5) $K_{R} (x_{j}, x_{k}) = \exp (- γ {∥ x_{j} - x_{k} ∥}^{2}) .$

The Gaussian function is written as [6]

(6) $K_{G} (x, μ, σ) = \frac{1}{2 π σ} \exp [- \frac{{(x - μ)}^{2}}{2 σ^{2}}],$

where

γ = 1 / 2 σ^{2}

and

σ

is the standard deviation.

When SVM are used to classify more than two classes, two strategies can be adopted: One Against One (OAO) and One Against All (OAA). The first one discriminates between classes, one by one, that is, the first category compared only against another category and so on, while the second separates each class from the rest.

3.2. SVM-Based Myoelectric Signal Classification

Many researchers have extensively discussed resolution methods for pattern-based classification for control applications. This review only includes works related with SVM, since this method is widely recommended by several authors; this is largely because it is very flexible and can be combined with other methods, which allows improving the accuracy of classification.

For example, in [61], forms of Motor Unit Potentials (MUP) in a Motor Unit Potential Train (MUPT) are evaluated to determinate if they represent a single motor unit They authors obtained 95.6% accuracy with this method.

By taking advantage of technological advances, in [32], the Myo Armband device from Thalmic Lab was placed on 26 subjects to perform a series of four hand gestures, and a linear kernel was used for obtaining an average accuracy of 94.9% with eight electrodes and 72% with four.

Similarly, in [54], a DL-3100 system was used to measure MES signals as a new user authentication method for mobile devices. SVM were trained under four features values (max, min, time of max and min value) and it is possible to choose between five hand gestures to unlock the mobile device. However, in [20], only two channels were used to classify five classes of hand movements, and a method for normalize the signals was implemented, reaching 95% accuracy precision with an expert user.

In [13], two different kernels (Gaussian and RBF) were used to classify five different leg movements, through four MES channels, by combining MAV, WL, ZC and, SSC in such a way that in total 16 different vectors are introduced to SVM. With this method (MKL-SVM), the authors obtained more than 90% of accuracy. In the same manner, in [12,51], an RBF kernel was used for the SVM classifier to perform the error estimation using Leave-One-Out Cross Validation (LOOCV) to separate between six different walking movements; however, the former used 31 electrodes placed on different leg and buttocks muscles to form 16 bipolar signals, while the latter only used 9 electrodes. Both extracted MAV, ZC, WL and, SSC feature signals, but, in [12], the authors also extracted RMS, AR1, AR2 and, AR3. In both studies, 95% precision was obtained.

In time domain, Alkan and Günay [49] combined discriminant analysis with SVM to distinguish between four arm movements. When extracting MAV and AR from windows formed by 32 samples at a sampling frequency of 1 kHz, they made the features vector, with which they obtained an average accuracy of 99%.

Additionally, Liu [22] used AR6, MAV, ZC, WL and SSC to classify six different movements, taking the signals readings in the forearm, by means an incremental learning adaptive algorithm to SVM, which incorporated useful information in tests to a self-correction mechanism to suppress erroneous classifications, with 96.6% average accuracy.

In addition, some studies use MES signals focused on support for people with disabilities. For example, Ishii et al. [45] studied the navigation of an Electric WheelChair (EWC). Four channels of MES signals, placed on cheek, neck, and shoulder, control the seven EWC movements, the iMES of each channel was calculated to form the feature vector. The average classification accuracy was 89.7%.

In the same manner, Rossi et al. [53] took advantage of signals in time domain, therefore they did not use any method for feature extraction. Besides, they combined the information about signal history through the HMM (Hidden Markov Models) with the advantages of time-independent SVM classification, forming an HMM-SVM classification algorithm with 91.8% accuracy to distinguish six different arm movements. Wang et al. [24] implemented visual feedback from the virtual prosthetic hand system to improve classification accuracy and achieved a mean of 98.79%. The authors distinguished between eight movements, with three pairs of sensors, by the SVM classifier and compared the obtained results by using RMS, MAV, VAR, WL, WAM, IAV and SCC as a different features vector.

Sometimes, authors combine features in the time domain with those in the frequency domain. Sasaki et al. [47] developed and tested a tongue interface to detect six motions, including saliva swallowing, from the surface of suprahyoid muscles at the underside of the jaw. They combined RMS from time domain with CC features from frequency domain and achieved $95.1 \pm 1.9$ % classification accuracy. In addition, Cai et al. [36] performed a classification of eight facial expressions. They used 74 features for each expression using only six channels, and achieved 99.6% of overall accuracy with a cubic kernel in SVM classifier. Among the features used, they included mean value and RMS value of all channels mean values.

Nevertheless, other authors prefer to work in time–frequency domain. Lucas et al. [11] used the representation space of characteristic vector based on DWT (Discrete Wavelet Transform), by using an unrestricted parametrization of Wavelet mother. With this method, they obtained an average classification error of six hand movements, through eight electrodes, of $4.7 \pm 3.7$ %. Using the same method, Lin et al. [43] utilized a pair of features after applying DWT in their MES raw data, from eight subjects, to identify the movement intention of the patients. They found as the best choice the fifth step in DWT decomposition combined with MAV and Max features, which achieved 100% accuracy with SVM classifier.

In addition, Too et al. [21] classified 17 hand and wrist movements from MES signals acquired from NinaPro database. The feature vector was composed of RMS extracted from DWT and the average energy of spectrogram at each frequency bin, after having applied a Principal Component Analysis (PCA) and conserving the first three. By applying SVM, the highest classification accuracy was 95% and 71.3% for normally-limbed and amputee subjects, respectively. Moreover, Ahlawat et al. [17] used PCA for dimensionality reduction with a kernel quadratic in SVM classifier, where kurtosis, skewness, SSC, MAV and AR1 in TD were the features extracted. The overall mean classification accuracy was 99.04% for the two activities performed.

At the other extreme, Omari and Liu [39] proposed an algorithm called GAPSO-SVM that combines Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) combined with SVM. The algorithm selected optimal parameters for RBF and an optimal decomposition level also making use wavelet mother function to classify energy of wavelet coefficient obtained from forearm signals. In addition, they implemented PCA, and obtained 98.7% accuracy in the classification. As in the previous study, Sui et al. [46] used PSO but they combined it with an improved SVM (by removing slack variables, as the bias value from the decision function), calling their method PSO-ISVM, which effectively identified six types of upper limb movements with an average recognition rate of 90.66%. As a feature vector, the WPT was used to extract the variance and energy of the wavelet packet coefficients and, as a kernel, the RBF.

In the same manner, Xing et al. [50] used the energy node of WPT coefficients as MES signal characteristic. They also used the Non-parametric Weighted Feature Extraction (NWFE) to reduce the dimensionality of features vector that enters the SVM, from which they obtained 98.39% precision in OAA mode, and 98.214% in OAO mode. After the analysis and comparison between different mother wavelet functions in discrete and continuous time, Too et al. [37] achieved 98.74% using MAV feature in Symlet 4 function in level 2, 98.49% by applying WL feature in Coif 3 function in level 4, both from DWT. For CWT, 98.56% was achieved with MAV by employing Sym 6, scale 16 and 98.64% with WL by using Mexh, scale 32. The authors classified 10 different hand movements with four channels.

Another method that is used to reduce the dimensionality problem is that used by Erkilinc and Sahin [48], which, in addition to applying FFT to four MES signals for the camera control, whose movements are up, downright, left and neutral, performs a component reduction by SPCA (Sparse Principal Component Analysis), before entering SVM values. The authors implemented a not widely used technique, namely the data division in Kaiser windows, obtaining 81% accuracy. In the same manner, Goen and Tiwari [5] employed SPCA with the lasso to produce modified principal components with scattered loads. In addition, they used the SVM ensemble for classification of seven different arm movements, combined with a window length of 256 ms and a 50% overlapping. The classification accuracy obtained by the authors reached 98%.

Unlike other authors, Kouchaki et al. [62] simulated, in the laboratory of the University of Waterloo, MES signals by incorporating statistical and morphological properties. They utilized SVM to discriminate between different neuromuscular diseases (neuropathy and myopathy), and, after simulating signals, decomposed them by using an Empirical Mode Decomposition (EMD) as well as the Kolmogorov complexity and other informative features to reveal the number of irregularities within each subspace formed by EMD. The accuracy obtained was 91.11%.

Combining time and frequency domain features, Yoshikawa et al. [35] used ZC in time domain and mean MES, CC and, DCC (Delta Cepstrum Coefficients) in frequency domain. The last one is defined as a characteristic of the dynamic type, and it is the difference between two CC. The features were used to classify seven hand movements, and they obtained a precision of 91–94.4%, depending on the test subject. This classification was used for digital robotic arm control, with 62.5 Hz delay. In addition, Doulah et al. [33] presented a method for automatic detection of posture transition and used it for a knee-ankle-foot orthosis. Furthermore, they used a PCA for dimensionality reduction from the eleven extracted features of ten subjects with 14 sensors (MAV, SSC, STD, entropy, coefficient of variation, maximum, minimum, median, maximum to RMS ratio, RMS to mean ratio, and fractal dimension). The obtained precision was 92.94% for the detection of the sit-to-stand posture transition.

In the same way, Bian et al. [15] compounded IEMG, STD and RMS features in time domain and MPF and MNF in frequency domain. Besides using a linear kernel, the dimensionality reduction was done by PCA. Starting from seven vectors, the obtained results were higher than 92.25% for classifying eight movements, whose signals were extracted using the eight channels from Myo armband device. Furthermore, Roldan-Vasco et al. [7] combined five time domain features (LOG, DASDV, VAR, ZC and MYOP) with two in frequency domain (MNF and FR) to record the activity from 47 healthy subjects when swallowing water, yogurt and saliva, using for classification SVM with RBF kernel, and obtained 92.03% accuracy.

Moreover, without feature extraction tools, Luo et al. [19] extracted the synergistic patterns of myoelectrical activities by a non-negative matrix factorization (NMF), with five healthy subjects, to classify five different movements (hand open and close, key pinch, palm valgus and grasp cylindrical tool). They implemented two different filters, one analog and one digital, within the recorded signals from six muscles. By their method, the muscle synergy patterns as a feature vector matrix could achieve the mean classification rate of 96.08%.

Table 5 lists classification accuracy according to each one of the authors mentioned above, who worked with time-domain features. Number of classes and channels are in the second and third column, respectively. The features extracted from EMG signals are in the third column and, in the last column, the classification accuracy obtained is presented. Table 6 summarizes those who worked in time–frequency domain, while Table 7 presents the authors who combined features in different domains.

3.3. Other Applications of SVM-Based Classifiers

To identify abnormal changes in Mental Workload (MWL) and thus prevent accidents due to work overload, Yin and Zhang [63] classified overload levels (low, medium and high) using EEG-PSD to form the features vector. To reduce it, they used the Locally Linear Embedding (LLE) technique, and subsequently classified with combined techniques of Support Vector Clustering (SVC) and Support Vector Data Description (SVDD), obtaining 79.54% accuracy. In the same year, Yin and Zhang [64] used LS-SVM to differentiate between the state of high mental load and fatigue, by combining EEG, ECG, and Electrooculogram (EOG) signals, with a feature reduction made with a Recursive Feature Elimination (FRE) and keeping RBF kernel. This procedure improved the accuracy to 92.67% compared against their previous study.

The classification of other types of signals has also been utilized as a guide for some doctors and therapists for the detection of different diseases, or the diagnosis of disorders in the motor system. In [65], an adaptive system for SVM, called ASVM, is proposed to diagnose diseases through the blood, using data on diabetes and breast cancer. In this method, the bias value of SVM is adjusted by a feedback mechanism, which allows the classification to be done more quickly and with higher precision than in its different evaluation, obtaining 67.22–97.39% accuracy.

Similarly, features in temporal and frequency space were utilized in [66] to perform the detection of muscular fatigue of the lower extremities to prevent falls and injuries; with the aid of six cameras, the authors differentiated between the state of fatigue and without fatigue using SVM with linear and RBF kernel, obtaining 96% accuracy with both kernels. In the same manner, in [67], six cameras with a 200 Hz sample rate were used to differentiate between assisted walking of patients with arm support and unassisted walking. The authors also used temporal space features, combining Non-dominated Sorting Genetic Algorithm II (NSGAII) and Genetic Algorithms (GA) with SVM to choose from among 30 marching parameters and conducted the classification, obtaining 99.31% precision.

Other authors (e.g., ) combined different signal types, such as EEG, ECG, EOG, EGM and videotaping for sleep evaluation, i.e., distinguishing between the state of waking and sleep of different people. The features used by Park et al. [68] were Proportional Integration Mode (PIM), Zero Crossings Mode (ZCM) and FFT, to which RBF is applied and then classified with the SVM, obtaining a precision of 88.94%.

More techniques utilized by some authors also reduced the number of features to improve the classification accuracy and to increase the processing speed by reducing the dimensionality of the features vector; for instance, in [69], an approach of random forest classification to the diagnosis of lymphatic diseases is proposed. In the first stage, the authors performed a features reduction of different methods, obtaining as best result 92.2% accuracy in the distinction of four states of the patient, including normal, malignant lymph or fibrosis, with the reduction from 18 to 6 features using genetic algorithms.

Khazaee and Ebrahimzadeh [70] used ECG signals to differentiate between five kinds of arrhythmias. They used a database offered by MIT-BIH. The procedure was performed in three different stages. The first one consisted in feature extraction by Non-Parametric Power Spectral Density (NPPSD). In the second, the classification using SVM with Gaussian Radial Basis Function was performed. Finally, they accomplished the SVM parameter optimization by a GA. The classification accuracy obtained was 96%.

Raj and Ray [42] differentiated arrhythmias using PCA to reduce features in time–frequency space. These features were obtained from ECG by means of the Discrete Orthonormal Stockwell Transform (DOST) concatenated with morphological features. They also employed the PSO technique to adjust the SVM parameters with an RBF kernel. These combined methods, PSO and SVM, reached 98.82% classification accuracy to differentiate among 16 types of arrhythmic events that are produced more frequently in the heart.

Dobrowolski et al. [71] used the SVM for neuromuscular disorder diagnosis based on the analysis of scalograms formed from MES extracted from the deltoid muscle. Then, the SVM analysis was implemented to subsequently reduce to a single decision parameter. The error probability of this method was 0.5%.

Another medical application where an SVM is used for classification is in the differentiation of four main classes in which a protein is composed. In [72], a method is proposed to discriminate between both classes and protein structures by the incorporation of pseudo average chemical shift along with an SVM. This method was used onin four different databases, obtaining 84.2%, 85%, 86.4%, and 89.2%, respectively, in classification accuracy.

In the case of hyperspectral image classification, in [73], a guided filter is incorporated into the SVM classifier. This originates from the fusion of spectral and spatial features with the help of the PCA method. The authors classified more than nine classes with the spatial features of the SVM and achieved an average 98.92% classification accuracy.

Other images that also have been classified are digital mammograms, in search of microcalcifications for diseases prevention. For instance, El-Naqa et al. [74] used a database of 76 digital mammograms, and the obtained accuracy with a polynomial kernel in SVM was 94%.

The authors of [39,60] used GA combined with SVM to classify images obtained by fusing multifrequency RADARSAT-2 synthetic aperture radar and Thaichote multispectral images. The results provided high classification accuracy at over 95%.

The Content-Based Image Retrieval (CBIR) technique, which is a developing trend in digital image processing, aims at recovering a queried image from a large database. In this field, Sugamya et al. [75], in the search for an image, extracted color, form and texture from an image, and later they used SVM for the classification, obtaining 76.6% accuracy with the help of the standardized Euclidean metric.

Moreover, in addition to extracting voltage signals from the brain, images can also be extracted. Alam et al. [76] differentiated between healthy individuals and those who suffer from Alzheimer’s disease. They used structural Magnetic Resonance Imaging (sMRI) data, from which a features extraction was done with the aid of Voxel-Based Morphometry (VBM), and the features reduction with the PCA. Finally, the classification accuracy was 84.17%.

Sharma and Srivastava [77] used SVM to classify characters strings, i.e., text classification. The fact that such character strings did not have the proper format to be classified was solved with the help of the Stemmers–Stemming algorithm. For classification, they uses RBF kernel, with LibSVM to distinguish between related phrases with shopping and food, obtaining a correct classification of 64.86%.

Other signals frequently used are those of digital modulation. For example, Zhou [78] used second-, fourth- and sixth-order cumulants as signal features, and also employed an RBF kernel combined with a method of cross-validation grid parameters selection to improve the SVM-based classification accuracy, reaching 92.2%.

4. Conclusions

Pattern classification is used in certain areas of knowledge. Depending on the needs and characteristics of each system, a specific tool may be selected. SVM offer a high classification accuracy since they allow the combination with other pattern classification methods to reach different objectives that are taken in the classification, besides a high accuracy percentage. In other words, it allows the incorporation of tools that transform the input data to the SVM or that solve the same.

The mentioned authors combined methods to improve the classification accuracy in different applications, although the main purpose of the article is making a compilation of those studies which used myoelectric signals as input vector. In addition, other applications of classification based on SVM are listed to give the reader a broader idea of different fields of study in which the SVM can be applied. Then, four points can be concluded:

The most common kernel used was RBF, followed by linear and Gaussian.
PCA is the most common tool for dimensionality reduction.
MAV, SSC, ZC and WL are the most utilized time-domain features.
Many channels are not necessary to obtain good precision.

Most of the published papers seek advantage in feature extraction area, but only certain researchers reported an algorithm to combine directly with the SVM. An example of this is the combination with GA [39,67,70,79]. Therefore, this is an area of opportunity, as there are several algorithms in artificial intelligence that can be tested and combined with SVM.

Another small studied tool is, instead of using feature reduction algorithms such as PCA or SPCA, decreasing the number of input vectors when one has many features extracted; in such case, feature extraction algorithms can be used. Currently, these algorithms are mainly used in data mining.

Author Contributions

Conceptualization, D.C.T.-P. and J.R.-R.; Methodology, D.C.T.-P.; Software, D.C.T.-P.; Validation, D.C.T.-P. and J.C.J.-C.; Formal analysis, D.C.T.-P.; Investigation and Visualization, D.C.T.-P. and J.R.-R.; Data curation, D.C.T.-P., and J.C.J.-C.; Writing—original draft preparation, Writing—original draft, review & editing, all the authors.

Funding

CONACYT paid for scholarship 561144 for this research.

Acknowledgments

We would like to thank the Graduate Studies Division from the Faculty of Informatics at Universidad Autónoma de Querétaro by enabling C.L. to carry out this Ph.D. research.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AAC	Average Amplitude Change
AAV	Average Absolute Value
ACO	Ant Colony Optimization
AFB	Amplitude of the First Burst
ANN	Artificial Neural Network
AR	Auto-Regressive
CC	Cepstrum Coefficients
CBIR	Content-Based Image Retrieval
COF	Coefficient of variation
Com1	Mean value of all channels’ mean values considered.
Com2	RMS value of all channels’ mean values considered.
CWT	Continuous Wavelet Transform
DASDV	Difference of Absolute Standard Deviation
DCC	Delta Cepstrum Coefficients
DFT	Discrete Fourier Transform
DWT	Discrete Wavelet Transform
ECG	Electrocardiogram
EEG	Electroencephalogram
EGM	Electrogram
EMD	Empirical Mode Decomposition
EMG	Electromyography
ENT	Entropy
EOG	Electrooculogram
EWC	Electric WheelChair
FD	Frequency Domain
FFT	Fast Fourier Transform
FMD	Frequency Median Density
FMN	Frequency Mean Density
FRD	Fractal Dimension
GA	Genetic Algorithm
HCI	Human-Computer Interface
HIST	Histogram
HMM	Hidden Markov Models
FR	Frequency Ratio
IAV	Integrated Absolute Value
IEMG	Integrated Electromyography Signal
ISVM	Improved Support Vector Machine
KNN	K-Nearest Neighbour
LDA	Linear Discriminant Analysis
LLE	Locally Linear Embedding
LOG	Log-Detector
LOOCV	Leave-One-Out Cross Validation
MAV	Mean Absolute Value
MAVS	Mean Absolute Value Slope
MAVSLP	Differences between Mean Absolute Value
MAX	Maximum Value
MDV	Median Differenital Value
MDF	Median Frequency
MED	Median
MES	Myoelectric Signals
MHW	Multiple Hamming Windows
MIN	Minimum Value
MKL	Multiple Kernel Learning
MLP	Multi-Layer Perceptron
MMAV1	Modified Mean Absolute Valye type 1
MMAV1	Modified Mean Absolute Valye type 2
MNF	Mean Frequency
MNP	Mean Power
MTW	Multiple Trapezoidal Windows
MUP	Motor Unit Potentials
MUPT	Motor Unit Potential Train
MWL	Mental WorkLoad
MYOP	Myopulse Percentage rate
NB	Naive Bayes
NLR	Non-linear Logistic Regression
NMF	Non-negative Matrix Factorization
NOR	Norm
NPPSD	Non-Parametric Power Spectral Density
NSGAII	Nondominated Sorting Genetic Algorithm II
NWFE	Non-parametric Weighted Feature Extraction
OAA	One Against All
OAO	One Against One
PCA	Principal Component Analysis
PIM	Proportional Integration Mode
PKF	Peak Frequency
PSO	Particle Swarm Optimization
PSD	Power Spectral Density
PSR	Power Spectrum Ratio
QP	Quadratic Programming
RAN	Range
RBF	Radial Basis Function
RFE	Recursive Feature Elimination
RMS	Root Mean Square Value
SD	Spatial Domain
SLR	Simple Logistic Regression
sMRI	structural Magnetic Resonance Imaging
STD	Standard Deviation
STFT	Short Time Fourier Density
SPCA	Sparse Principal Component Analysis
SSC	Slope Sign Changes
SSI	Simple Square Integral
SVC	Support Vector Clustering
SVDD	Support Vector Data Description
SVM	Support Vector Machines
SUM	Summation
SWT	Stationary Wavelet Transform
TD	Time Domain
TFD	Time–Frequency Domain
TTP	Total Power
TVAR	Time Varying Auto Regressive
VAR	Variance
VBM	Voxel-Based Morphometry
v-Order	V
WAMP	Willson Amplitude
WL	Waveform Length
WPT	Wavelet Packet Transform
ZC	Zero Crossings
ZCM	Zero Crossings Mode

Appendix A. Common Features

Figure and Tables

Figure 1. SVM geometric definition.

Table 1

Reported band-pass frequency.

Reference	Hz
Reference	10–500	20–450	20–500	25–500
[1]	X
[9]	X
[11]	X
[26]				X
[23]		X
[12]	X
[13]			X
[2]			X
[14]	X
[15]			X
[16]		X
[24]		X
[17]			X
[18]	X
[19]			X
[6]		X
[20]	X
[21]			X
[8]			X

Table 2

Reported sampling frequency.

Reference	Sampling Frequency
[15,32,33,34]	500 Hz
[1,9,10,12,13,14,17,18,19,20,21,27,28,29]	1 kHz
[2]	1.5 kHz
[7,8,11,16,30,35,36,37]	2 kHz
[22]	3 kHz
[6,25,26]	4 kHz
[31]	10 kHz

Table 3

Reported number of channels.

Reference	Number of Channels
[25]	1
[16,17,20,26]	2
[24,31]	3
[1,8,9,10,13,21,23,29,35,39,44,45,46]	4
[19,36,40,41]	6
[2,7,11,15,22,30,32,34]	8
[37]	12
[33]	14
[12,14]	16
[47]	22

Table 4

Electrodes type and place of electrode placement body.

Reference	Electrode Type and Body Region
[5,14,16,18,19,37,39,40,48,49,50]	Monopolar and Upper limb
[2,12,26,51]	Monopolar and Lower limb
[36,47]	Monopolar and Facial muscles
[4,6,9,10,11,15,20,22,24,27,29,31,32,34,35,44,52,53,54]	Bipolar and Upper limb
[13,30]	Bipolar and Lower limb
[45]	Bipolar and Cheek
[7]	Bipolar and Facial muscles

Table 5

EMG signals classification methods with time-domain features.

Ref.	Class	Chan	Features	Accuracy (%)
[12]	6	16	MAV, ZC, WFL, SSC, RMS, AR1, AR2, and AR3	95.00
[13]	5	4	MAV, WL, ZC, and SSC	90.00
[51]	5	16	MAV, SSC, ZC, and WL	95.00
[49]	4		MAV	99.00
[2]	7	8	MAV, VAR, WL, SSC, and ZC	94.7
[22]	6	8	MAV, ZC, WL, SSC, and AR	96.60
[39]	8	4	WL	98.70
[14]	6	16	MAV, ZC, SSC, WL, MAVS, VAR, RMS, WAMP, and AR4	92.80
[47]	6	22	RMS and CC	95.10
[34]	6	8	MAV, ZC, WL, SSC, AR1, AR2, AR3, AR4, AR5, and AR6	99.00
[32]	4	4	WL	94.90
[43]	2		RMS, MAV, NOR, SUM, MAX, MIN, and RAN	100.00
[24]	8	3	RMS, MAV, VAR, WL, WAMP, IAC, and SSC	98.79
[17]	2	2	SSC, MAV, Kurtosis, Skewness, and AR1	99.04
[18]	5	96	WAMP, RMS, WL, and AR1	99.10
[45]	7	4	iEMG	89.70
[6]	15	8	MAV, WL, SSC, and ZC	95.00
[20]	5	2	MAV	95.00

Table 6

EMG signals classification methods with time–frequency domain features.

Ref.	Class	Chan	Features	Accuracy (%)
[50]	7	4	Energy of the WPT coefficients	98.39
[37]	17	12	8 RMS of DWT coefficients	95.00
[21]	10	4	MAV and WL of CWT and DWT	98.74
[46]	6	4	Energy and variance of WPT	90.66

Table 7

EMG signals classification methods with features in two or more domains.

Ref.	Class	Chan	Features	Accuracy (%)
[35]	7	4	ZC, CC, DCC, EMG, and iEMG	90.00
[33]	10	4	STD, ENT, COF, MAV, MAX, MIN, MED, SSC, MAX of RMS, RMS to mean ratio, and FRD	92.94
[5]	7	8	MAV, RMS, MNF, MDF, TVAR, STFT, and DWT	98.00
[15]	8	8	iEMG, STD, RMS, MNP, and MDF	92.25
[36]	8	6	iEMG, VAR, MAV, SSI, MDV, RMS, WL, MNF, MDF, FMD, FMN, Com1, and Com2	99.60
[25]	2	1	Mean, STD, RMS, ENT, energy of WT	95.00
[7]	3	8	LOG, DASDV, MYOP, and MNF	92.03
[8]	5	4	IEMG, SSI, RMS, ZC, WL, WAMP, AR1, AR2, AR3, AR4, MNF, MDF, PKF, MNP, and SM1	93.07

Word count: 6999

Show less

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

This paper gives an overview of the different research works related to electromyographic signals (EMG) classification based on Support Vector Machines (SVM). The article summarizes the techniques used to make the classification in each reference. Furthermore, it includes the obtained accuracy, the number of signals or channels used, the way the authors made the feature vector, and the type of kernels used. Hence, this article also includes a compilation about the bands used to filter signals, the number of signals recommended, the most commonly used sampling frequencies, and certain features that can create the characteristics of the vector. This research gathers articles related to different kinds of SVM-based classification and other tools for signal processing in the field.

Details

Title

Support Vector Machine-Based EMG Signal Classification Techniques: A Review

Author

Toledo-Pérez, Diana C¹; Rodríguez-Reséndiz, Juvenal²

; Gómez-Loenzo, Roberto A²

; Jauregui-Correa, J C²

¹ Facultad de Informática, Universidad Autónoma de Querétaro, 76010 Querétaro, Mexico; [email protected]
² Facultad de Ingeniería, Universidad Autónoma de Querétaro, 76010 Querétaro, Mexico; [email protected] (R.A.G.-L.); [email protected] (J.C.J.-C.)

First page

4402

Publication year

2019

Publication date

2019

Publisher

MDPI AG

e-ISSN

20763417

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.3390/app9204402

ProQuest document ID

2533676400

Support Vector Machine-Based EMG Signal Classification Techniques: A Review

Jump to:

Full text

Abstract

Details

Suggested sources