Selecting Appropriate Clustering Methods for Materials Science Applications of Machine Learning. Issue 12 (9th October 2019)
- Record Type:
- Journal Article
- Title:
- Selecting Appropriate Clustering Methods for Materials Science Applications of Machine Learning. Issue 12 (9th October 2019)
- Main Title:
- Selecting Appropriate Clustering Methods for Materials Science Applications of Machine Learning
- Authors:
- Parker, Amanda J.
Barnard, Amanda S. - Abstract:
- Abstract: Based on a general definition of a cluster and the quality of a clustering result, a new method for evaluating existing clustering algorithms, or undertaking clustering, capable of predicting the number and type of clusters and outliers present in a data set, regardless of the complexity of the distribution of points, is presented. This algorithm, referred to as iterative label spreading, can recognize the characteristics expected of a successful clustering result before any clustering algorithm is applied, providing a type of hyper‐parameter optimization for clustering. The efficacy of the algorithm, and the assessment of clustering result, are both confirmed using large benchmark two dimensional synthetic data sets, and small multidimensional data describing a set of silver nanoparticles. It is shown that the method is ideal for studying noisy data with high dimensionality and high variance, typical of data captured in materials and nanoscience. Abstract : A new clustering method is developed that is ideally suited to small data sets with high dimensionality, as commonly found in materials informatics. The method, iterative label spreading outperforms popular methods such as k ‐Means, Ward agglomerative clustering, and density‐based spatial clustering of applications with noise, and is used to identify clusters in a diverse set of 425 silver nanoparticles.
- Is Part Of:
- Advanced theory and simulations. Volume 2:Issue 12(2019)
- Journal:
- Advanced theory and simulations
- Issue:
- Volume 2:Issue 12(2019)
- Issue Display:
- Volume 2, Issue 12 (2019)
- Year:
- 2019
- Volume:
- 2
- Issue:
- 12
- Issue Sort Value:
- 2019-0002-0012-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2019-10-09
- Subjects:
- materials classification -- materials clustering -- machine learning -- materials design -- nanoparticles
Science -- Simulation methods -- Periodicals
Science -- Methodology -- Periodicals
Engineering -- Simulation methods -- Periodicals
Engineering -- Methodology -- Periodicals
507.21 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/adts.201900145 ↗
- Languages:
- English
- ISSNs:
- 2513-0390
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0696.935575
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12461.xml