Applying density-based outlier identifications using multiple datasets for validation of stroke clinical outcomes. (December 2019)
- Record Type:
- Journal Article
- Title:
- Applying density-based outlier identifications using multiple datasets for validation of stroke clinical outcomes. (December 2019)
- Main Title:
- Applying density-based outlier identifications using multiple datasets for validation of stroke clinical outcomes
- Authors:
- Lin, Ching-Heng
Hsu, Kai-Cheng
Johnson, Kory R.
Luby, Marie
Fann, Yang C. - Abstract:
- Graphical abstract: Highlights: Using a nationwide prospective stroke registry database to develop density-based outlier detection models. Using four independent NINDS stroke datasets to evaluate models. Density-based outlier detection methods are promising for identifying incorrect stroke outcome assessments. Abstract: Introduction: Clinicians commonly use the modified Rankin Scale (mRS) and the Barthel Index (BI) to measure clinical outcome after stroke. These are potential targets in machine learning models for stroke outcome prediction. Therefore, the quality of the measurements is crucial for training and validation of these models. The objective of this study was to apply and evaluate density-based outlier detection methods for identifying potentially incorrect measurements in multiple large stroke datasets to assess the measurement quality. Method: We applied three density-based outlier detection methods including density-based spatial clustering of applications (DBSCAN), hierarchical DBSCAN (HDBSCAN) and local outlier factor (LOF) based on a large dataset obtained from a nationwide prospective stroke registry in Taiwan. The testing of each method was done by using four different NINDS funded stroke datasets. Result: The DBSCAN achieved a high performance across all mRS values where the highest average accuracy was 99.2 ± 0.7 at mRS of 4 and the lowest average accuracy was 92.0 ± 4.6 at mRS of 3. The LOF also achieved similar performance, however, the HDBSCAN withGraphical abstract: Highlights: Using a nationwide prospective stroke registry database to develop density-based outlier detection models. Using four independent NINDS stroke datasets to evaluate models. Density-based outlier detection methods are promising for identifying incorrect stroke outcome assessments. Abstract: Introduction: Clinicians commonly use the modified Rankin Scale (mRS) and the Barthel Index (BI) to measure clinical outcome after stroke. These are potential targets in machine learning models for stroke outcome prediction. Therefore, the quality of the measurements is crucial for training and validation of these models. The objective of this study was to apply and evaluate density-based outlier detection methods for identifying potentially incorrect measurements in multiple large stroke datasets to assess the measurement quality. Method: We applied three density-based outlier detection methods including density-based spatial clustering of applications (DBSCAN), hierarchical DBSCAN (HDBSCAN) and local outlier factor (LOF) based on a large dataset obtained from a nationwide prospective stroke registry in Taiwan. The testing of each method was done by using four different NINDS funded stroke datasets. Result: The DBSCAN achieved a high performance across all mRS values where the highest average accuracy was 99.2 ± 0.7 at mRS of 4 and the lowest average accuracy was 92.0 ± 4.6 at mRS of 3. The LOF also achieved similar performance, however, the HDBSCAN with default parameters setting required further tuning improvement. Conclusion: The density-based outlier detection methods were proven to be promising for validation of stroke outcome measures. The outlier detection algorithm developed from a large prospective registry dataset was effectively applied in four different NINDS stroke datasets with high performance results. The tool developed from this detection algorithm can be further applied to real world datasets to increase the data quality in stroke outcome measures. … (more)
- Is Part Of:
- International journal of medical informatics. Volume 132(2019)
- Journal:
- International journal of medical informatics
- Issue:
- Volume 132(2019)
- Issue Display:
- Volume 132, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 132
- Issue:
- 2019
- Issue Sort Value:
- 2019-0132-2019-0000
- Page Start:
- Page End:
- Publication Date:
- 2019-12
- Subjects:
- Stroke outcome -- modified Rankin Scale -- Barthel Index -- Outlier detection
Medical informatics -- Periodicals
Information science -- Periodicals
Computers -- Periodicals
Medical technology -- Periodicals
Medical Informatics -- Periodicals
Technology, Medical -- Periodicals
Computers
Information science
Medical informatics
Medical technology
Electronic journals
Periodicals
Electronic journals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13865056 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/13865056 ↗
http://www.clinicalkey.com.au/dura/browse/journalIssue/13865056 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ijmedinf.2019.103988 ↗
- Languages:
- English
- ISSNs:
- 1386-5056
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.345250
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17144.xml