Data‐driven machine‐learning analysis of potential embolic sources in embolic stroke of undetermined source. (7th October 2020)
- Record Type:
- Journal Article
- Title:
- Data‐driven machine‐learning analysis of potential embolic sources in embolic stroke of undetermined source. (7th October 2020)
- Main Title:
- Data‐driven machine‐learning analysis of potential embolic sources in embolic stroke of undetermined source
- Authors:
- Ntaios, G.
Weng, S. F.
Perlepe, K.
Akyea, R.
Condon, L.
Lambrou, D.
Sirimarco, G.
Strambo, D.
Eskandari, A.
Karagkiozi, E.
Vemmou, A.
Korompoki, E.
Manios, E.
Makaritsis, K.
Vemmos, K.
Michel, P. - Abstract:
- Abstract : Background and purpose: Hierarchical clustering, a common 'unsupervised' machine‐learning algorithm, is advantageous for exploring potential underlying aetiology in particularly heterogeneous diseases. We investigated potential embolic sources in embolic stroke of undetermined source (ESUS) using a data‐driven machine‐learning method, and explored variation in stroke recurrence between clusters. Methods: We used a hierarchical k‐means clustering algorithm on patients' baseline data, which assigned each individual into a unique clustering group, using a minimum‐variance method to calculate the similarity between ESUS patients based on all baseline features. Potential embolic sources were categorised into atrial cardiopathy, atrial fibrillation, arterial disease, left ventricular disease, cardiac valvulopathy, patent foramen ovale (PFO) and cancer. Results: Among 800 consecutive ESUS patients (43.3% women, median age 67 years), the optimal number of clusters was four. Left ventricular disease was most prevalent in cluster 1 (present in all patients) and perfectly associated with cluster 1. PFO was most prevalent in cluster 2 (38.9% of patients) and associated significantly with increased likelihood of cluster 2 [adjusted odds ratio: 2.69, 95% confidence interval (CI): 1.64–4.41]. Arterial disease was most prevalent in cluster 3 (57.7%) and associated with increased likelihood of cluster 3 (adjusted odds ratio: 2.21, 95% CI: 1.43–3.13). Atrial cardiopathy was mostAbstract : Background and purpose: Hierarchical clustering, a common 'unsupervised' machine‐learning algorithm, is advantageous for exploring potential underlying aetiology in particularly heterogeneous diseases. We investigated potential embolic sources in embolic stroke of undetermined source (ESUS) using a data‐driven machine‐learning method, and explored variation in stroke recurrence between clusters. Methods: We used a hierarchical k‐means clustering algorithm on patients' baseline data, which assigned each individual into a unique clustering group, using a minimum‐variance method to calculate the similarity between ESUS patients based on all baseline features. Potential embolic sources were categorised into atrial cardiopathy, atrial fibrillation, arterial disease, left ventricular disease, cardiac valvulopathy, patent foramen ovale (PFO) and cancer. Results: Among 800 consecutive ESUS patients (43.3% women, median age 67 years), the optimal number of clusters was four. Left ventricular disease was most prevalent in cluster 1 (present in all patients) and perfectly associated with cluster 1. PFO was most prevalent in cluster 2 (38.9% of patients) and associated significantly with increased likelihood of cluster 2 [adjusted odds ratio: 2.69, 95% confidence interval (CI): 1.64–4.41]. Arterial disease was most prevalent in cluster 3 (57.7%) and associated with increased likelihood of cluster 3 (adjusted odds ratio: 2.21, 95% CI: 1.43–3.13). Atrial cardiopathy was most prevalent in cluster 4 (100%) and perfectly associated with cluster 4. Cluster 3 was the largest cluster involving 53.7% of patients. Atrial fibrillation was not significantly associated with any cluster. Conclusions: This data‐driven machine‐learning analysis identified four clusters of ESUS that were strongly associated with arterial disease, atrial cardiopathy, PFO and left ventricular disease, respectively. More than half of the patients were assigned to the cluster associated with arterial disease. Abstract : A data‐driven machine‐learning analysis in 800 consecutive patients with emboic stroke of undetermined source (ESUS) identified 4 clusters of ESUS which were strongly associated with arterial disease, atrial cardiopathy, patent foramen ovale and left ventricular disease respectively. More than half of patients were assigned to the cluster associated with arterial disease. … (more)
- Is Part Of:
- European journal of neurology. Volume 28:Number 1(2021)
- Journal:
- European journal of neurology
- Issue:
- Volume 28:Number 1(2021)
- Issue Display:
- Volume 28, Issue 1 (2021)
- Year:
- 2021
- Volume:
- 28
- Issue:
- 1
- Issue Sort Value:
- 2021-0028-0001-0000
- Page Start:
- 192
- Page End:
- 201
- Publication Date:
- 2020-10-07
- Subjects:
- embolic stroke of undetermined source -- hierarchical clustering -- machine learning -- potential embolic source -- stroke
Neurology -- Periodicals
Nervous system -- Diseases -- Periodicals
616.8 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1468-1331 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/ene.14524 ↗
- Languages:
- English
- ISSNs:
- 1351-5101
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3829.731680
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 23506.xml