Will it crystallise? Predicting crystallinity of molecular materials. Issue 9 (11th November 2014)
- Record Type:
- Journal Article
- Title:
- Will it crystallise? Predicting crystallinity of molecular materials. Issue 9 (11th November 2014)
- Main Title:
- Will it crystallise? Predicting crystallinity of molecular materials
- Authors:
- Wicker, Jerome G. P.
Cooper, Richard I. - Abstract:
- Abstract : Machine learning algorithms can be used to create models which separate molecular materials which will form good-quality crystals from those that will not, and predict how synthetic modifications will change the crystallinity. Abstract : Predicting and controlling crystallinity of molecular materials has applications in a crystal engineering context, as well as process control and formulation in the pharmaceutical industry. Here, we present a machine learning approach to this problem which uses a large input training set which is classified on a single measurable outcome: does a substance have a reasonable probability of forming good quality crystals. While the related problem of crystal structure prediction requires reliable calculation of three dimensional molecular conformations, the method employed here for predicting crystallisation propensity uses only "two dimensional" information consisting of atom types and connectivity. We show that an error rate lower than 10% can be achieved against unseen test data. The predictive model was also tested in a blind screen of a set of compounds which do not have crystal structures reported in the literature, and we found it to have a 79% classification accuracy. Analysis of the most significant descriptors used in the classification shows that the number of rotatable bonds and a molecular connectivity index are key in determining crystallisation propensity and using these two measures alone can give 80% accurateAbstract : Machine learning algorithms can be used to create models which separate molecular materials which will form good-quality crystals from those that will not, and predict how synthetic modifications will change the crystallinity. Abstract : Predicting and controlling crystallinity of molecular materials has applications in a crystal engineering context, as well as process control and formulation in the pharmaceutical industry. Here, we present a machine learning approach to this problem which uses a large input training set which is classified on a single measurable outcome: does a substance have a reasonable probability of forming good quality crystals. While the related problem of crystal structure prediction requires reliable calculation of three dimensional molecular conformations, the method employed here for predicting crystallisation propensity uses only "two dimensional" information consisting of atom types and connectivity. We show that an error rate lower than 10% can be achieved against unseen test data. The predictive model was also tested in a blind screen of a set of compounds which do not have crystal structures reported in the literature, and we found it to have a 79% classification accuracy. Analysis of the most significant descriptors used in the classification shows that the number of rotatable bonds and a molecular connectivity index are key in determining crystallisation propensity and using these two measures alone can give 80% accurate classification of unseen test data. … (more)
- Is Part Of:
- CrystEngComm. Volume 17:Issue 9(2015)
- Journal:
- CrystEngComm
- Issue:
- Volume 17:Issue 9(2015)
- Issue Display:
- Volume 17, Issue 9 (2015)
- Year:
- 2015
- Volume:
- 17
- Issue:
- 9
- Issue Sort Value:
- 2015-0017-0009-0000
- Page Start:
- 1927
- Page End:
- 1934
- Publication Date:
- 2014-11-11
- Subjects:
- Crystals -- Periodicals
Crystal growth -- Periodicals
Crystallography -- Periodicals
Cristaux -- Périodiques
Cristaux -- Croissance -- Périodiques
Cristallographie -- Périodiques
548 - Journal URLs:
- http://pubs.rsc.org/en/journals/journalissues/ce#!issueid=ce016040&type=current ↗
http://www.rsc.org/ ↗ - DOI:
- 10.1039/c4ce01912a ↗
- Languages:
- English
- ISSNs:
- 1466-8033
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3490.168000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 2002.xml