Data synthesis with dual-stage sample grouping for electromyography signals. (1st March 2023)
- Record Type:
- Journal Article
- Title:
- Data synthesis with dual-stage sample grouping for electromyography signals. (1st March 2023)
- Main Title:
- Data synthesis with dual-stage sample grouping for electromyography signals
- Authors:
- Lee, Donghee
Yang, Wonseok
Cho, Gyoungryul
You, Dayoung
Nam, Woochul - Abstract:
- Highlights: New EMG data synthesis model was developed with clustering and random sampling. Dual-stage grouping enables synthesized datasets with increased diversity. Sample quality verified by probability distribution, JS divergence, and t -SNE plots. The accuracy of various models increased when trained with the synthesized data. The proposed model can be used for other non-stationary and stochastic datasets. Abstract: The correlation-based data synthesis (CDS) model can improve the accuracy of machine learning models via dataset enrichment. However, the performance of the classical CDS model is unsatisfactory when applied to electromyography (EMG) signals because the signal is stochastic and nonstationary. To overcome this problem, this study proposes a new CDS model integrated with dual-stage sample grouping: intra-class clustering followed by intra-cluster random selection. This sample grouping method not only enables the CDS model to create signals similar to the original EMG signal but also increases the diversity of the synthesized dataset. The synthesized sample quality was verified using the sample probability distribution, the Jensen-Shannon divergence, and t -distributed stochastic neighbor embedding plots. Furthermore, the classification accuracy of various machine learning models increased when the synthesized dataset was used for training. Specifically, accuracy improvements of 6.29%, 5.09%, 9.26%, and 3.69% were observed for the multi-layer perceptron,Highlights: New EMG data synthesis model was developed with clustering and random sampling. Dual-stage grouping enables synthesized datasets with increased diversity. Sample quality verified by probability distribution, JS divergence, and t -SNE plots. The accuracy of various models increased when trained with the synthesized data. The proposed model can be used for other non-stationary and stochastic datasets. Abstract: The correlation-based data synthesis (CDS) model can improve the accuracy of machine learning models via dataset enrichment. However, the performance of the classical CDS model is unsatisfactory when applied to electromyography (EMG) signals because the signal is stochastic and nonstationary. To overcome this problem, this study proposes a new CDS model integrated with dual-stage sample grouping: intra-class clustering followed by intra-cluster random selection. This sample grouping method not only enables the CDS model to create signals similar to the original EMG signal but also increases the diversity of the synthesized dataset. The synthesized sample quality was verified using the sample probability distribution, the Jensen-Shannon divergence, and t -distributed stochastic neighbor embedding plots. Furthermore, the classification accuracy of various machine learning models increased when the synthesized dataset was used for training. Specifically, accuracy improvements of 6.29%, 5.09%, 9.26%, and 3.69% were observed for the multi-layer perceptron, support vector machine (SVM) with linear kernel, SVM with radial basis function kernel, and k -nearest neighbor models, respectively. As the EMG signals considerably vary over subjects, the classifiers need to be optimized for individual subjects. The proposed model is useful to create personalized classifiers with a small number of original samples. … (more)
- Is Part Of:
- Expert systems with applications. Volume 213:Part B(2023)
- Journal:
- Expert systems with applications
- Issue:
- Volume 213:Part B(2023)
- Issue Display:
- Volume 213, Issue 2 (2023)
- Year:
- 2023
- Volume:
- 213
- Issue:
- 2
- Issue Sort Value:
- 2023-0213-0002-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-03-01
- Subjects:
- EMG -- Data synthesis -- Correlation -- Nonstationary signal -- Clustering -- Random selection
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2022.119059 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24510.xml