Assessing the signal quality of electrocardiograms from varied acquisition sources: A generic machine learning pipeline for model generation. (March 2021)
- Record Type:
- Journal Article
- Title:
- Assessing the signal quality of electrocardiograms from varied acquisition sources: A generic machine learning pipeline for model generation. (March 2021)
- Main Title:
- Assessing the signal quality of electrocardiograms from varied acquisition sources: A generic machine learning pipeline for model generation
- Authors:
- Albaba, Adnan
Simões-Capela, Neide
Wang, Yuyang
Hendriks, Richard C.
De Raedt, Walter
Van Hoof, Chris - Abstract:
- Abstract: Background and objective: Long-term electrocardiogram monitoring comes at the expense of signal quality. During unconstrained movements, the electrocardiogram is often corrupted by motion artefacts, which can lead to inaccurate physiological information. In this situation, automated quality assessment methods are useful to increase the reliability of the measurements. A generic machine learning pipeline that generates classification models for electrocardiogram quality assessment is presented in this article. The presented pipeline is tested on signals from varied acquisition sources, towards selecting segments that can be used for heart rate analysis in lifestyle applications. Methods: Electrocardiogram recordings from traditional, wearable and ubiquitous devices, are segmented in 10 s windows and manually labeled by experienced researchers into two quality classes. To capture the electrocardiogram dynamics, a comprehensive set of 43 features is extracted from each segment, based on the time-domain signal, its Fast Fourier Transform, the Autocorrelation function and the Stationary Wavelet Transform. To select the most relevant features for each acquisition source we employ both a customized hybrid approach and the state-of-the-art Neighborhood Component Analysis method and compare them. Support Vector Machines (SVM), Decision Trees, K-Nearest-Neighbors and supervised ensemble methods are tested as possible binary classifiers. Results: The results for the bestAbstract: Background and objective: Long-term electrocardiogram monitoring comes at the expense of signal quality. During unconstrained movements, the electrocardiogram is often corrupted by motion artefacts, which can lead to inaccurate physiological information. In this situation, automated quality assessment methods are useful to increase the reliability of the measurements. A generic machine learning pipeline that generates classification models for electrocardiogram quality assessment is presented in this article. The presented pipeline is tested on signals from varied acquisition sources, towards selecting segments that can be used for heart rate analysis in lifestyle applications. Methods: Electrocardiogram recordings from traditional, wearable and ubiquitous devices, are segmented in 10 s windows and manually labeled by experienced researchers into two quality classes. To capture the electrocardiogram dynamics, a comprehensive set of 43 features is extracted from each segment, based on the time-domain signal, its Fast Fourier Transform, the Autocorrelation function and the Stationary Wavelet Transform. To select the most relevant features for each acquisition source we employ both a customized hybrid approach and the state-of-the-art Neighborhood Component Analysis method and compare them. Support Vector Machines (SVM), Decision Trees, K-Nearest-Neighbors and supervised ensemble methods are tested as possible binary classifiers. Results: The results for the best performing models on traditional, wearable and ubiquitous electrocardiogram datasets are, respectively: balanced-accuracy: 89%, F1-score: 93% with the Fine Gaussian SVM model and 10 features; balanced-accuracy: 93%, F1-score: 93% with the Fine Gaussian SVM model and 11 features; balanced-accuracy: 95%, F1-score: 86%, with the Fine Gaussian SVM model and 8 features. Conclusions: According to the results, our generic pipeline can generate classification models tailored to individual acquisition sources, provided that a standard Lead I or Lead II is available. Such models accurately establish whether the electrocardiogram quality is good or bad for heart rate analysis. Furthermore, removing bad quality segments decreases errors in heart rate calculation. Highlights: Long-term electrocardiogram monitoring comes at the expense of signal quality. Automated quality assessment methods are necessary to filter out the unreliable segments. A generic machine learning pipeline for generating ECG signal quality classifiers is presented along with 43 descriptive features. The pipeline is tested on signals from traditional, wearable, and ubiquitous devices. High performances are achieved, with balanced accuracy reaching up to 95% when using a Fine Gaussian SVM model with 8 selected features. … (more)
- Is Part Of:
- Computers in biology and medicine. Volume 130(2021)
- Journal:
- Computers in biology and medicine
- Issue:
- Volume 130(2021)
- Issue Display:
- Volume 130, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 130
- Issue:
- 2021
- Issue Sort Value:
- 2021-0130-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-03
- Subjects:
- Electrocardiogram -- Wearables -- Ubiquitous -- Non-contact -- Classification -- Feature selection -- Motion artefact -- Signal quality
Medicine -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00104825/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiomed.2020.104164 ↗
- Languages:
- English
- ISSNs:
- 0010-4825
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.880000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15790.xml