Deep transfer learning for predicting frontier orbital energies of organic materials using small data and its application to porphyrin photocatalysts. Issue 15 (29th March 2023)
- Record Type:
- Journal Article
- Title:
- Deep transfer learning for predicting frontier orbital energies of organic materials using small data and its application to porphyrin photocatalysts. Issue 15 (29th March 2023)
- Main Title:
- Deep transfer learning for predicting frontier orbital energies of organic materials using small data and its application to porphyrin photocatalysts
- Authors:
- Su, An
Zhang, Xin
Zhang, Chengwei
Ding, Debo
Yang, Yun-Fang
Wang, Keke
She, Yuan-Bin
- Abstract:
- A deep transfer learning approach is used to predict HOMO/LUMO energies of organic materials with a small amount of training data.
Machine learning (ML) models have received increasing attention as a new approach for the virtual screening of organic materials. Although some ML models trained on large databases have achieved high prediction accuracy, the application of ML to certain types of organic materials is limited by the small amount of available data. On the other hand, metalloporphyrins and porphyrins (MpPs) have received increasing attention as potential photocatalysts, and recent studies have found that both HOMO/LUMO energy levels and energy gaps are important factors controlling the MpP photocatalysts. Since the training data of MpPs are insufficient and limited to porphyrin-based dyes, in this study, we proposed a deep transfer learning approach to rapidly predict the HOMO/LUMO energy levels and energy gaps of MpPs. To complement the open-source Porphyrin-based Dyes Database (PBDD), we curated a new database, the Metalloporphyrins and Porphyrins Database (MpPD), in which MpPs were specifically designed as potential photocatalysts and the HOMO/LUMO energies were calculated by advanced DFT functionals. We proposed PorphyBERT, a BERT-based regression model that was pre-trained with PBDD and fine-tuned with MpPD. The model performed satisfactorily in predicting HOMO and LUMO energies and energy gap with RMSEs of 0.0955, 0.0988, and 0.0787 eV and MAEs of 0.0774, 0.0824, and 0.0549 eV. Furthermore, due to its unique unsupervised pre-training phase, the model is not affected by the difference in computational functionals between pre-training and fine-tuning databases. Finally, we recommended 12 MpPs as potential photocatalysts for CO2 reduction with out-of-sample model predictions of energy gaps close to the values calculated by DFT.
- Is Part Of:
- Physical chemistry chemical physics. Volume 25:Issue 15(2023)
- Journal:
- Physical chemistry chemical physics
- Issue:
- Volume 25:Issue 15(2023)
- Issue Display:
- Volume 25, Issue 15 (2023)
- Year:
- 2023
- Volume:
- 25
- Issue:
- 15
- Issue Sort Value:
- 2023-0025-0015-0000
- Page Start:
- 10536
- Page End:
- 10549
- Publication Date:
- 2023-03-29
- Subjects:
- Chemistry, Physical and theoretical -- Periodicals
541.3
- Journal URLs:
- http://pubs.rsc.org/en/journals/journalissues/cp#!issueid=cp016040&type=current&issnprint=1463-9076
http://www.rsc.org/
- DOI:
- 10.1039/d3cp00917c
- Languages:
- English
- ISSNs:
- 1463-9076
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 6475.306000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 26922.xml
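
The abstract above reports model quality as RMSE and MAE in eV for the HOMO energy, the LUMO energy, and the energy gap. As a rough illustration of what those two metrics measure, the following minimal Python sketch computes them from paired reference and predicted values. It is not the authors' code: the function names `rmse`, `mae`, and `energy_gap` are hypothetical, the gap is assumed to be the LUMO energy minus the HOMO energy, and the numbers in the usage section are placeholders rather than data from the paper.

```python
import math
from typing import Sequence


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root-mean-square error between reference and predicted values."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def mae(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute error between reference and predicted values."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def energy_gap(homo_ev: float, lumo_ev: float) -> float:
    """Energy gap in eV, assumed here to be LUMO minus HOMO."""
    return lumo_ev - homo_ev


if __name__ == "__main__":
    # Placeholder values in eV, NOT taken from the paper or its databases.
    dft_homo = [-5.10, -5.32, -4.98]
    model_homo = [-5.02, -5.40, -5.05]
    print("HOMO RMSE (eV):", rmse(dft_homo, model_homo))
    print("HOMO MAE  (eV):", mae(dft_homo, model_homo))
    print("Example gap (eV):", energy_gap(homo_ev=-5.10, lumo_ev=-2.95))
```

RMSE weights large individual errors more heavily than MAE, so for any set of predictions the RMSE is at least as large as the MAE, which is consistent with the paired values quoted in the abstract.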