Convolutional neural network prediction of molecular properties for aerosol chemistry and health effects. Issue 4 (18th August 2022)
- Record Type:
- Journal Article
- Title:
- Convolutional neural network prediction of molecular properties for aerosol chemistry and health effects. Issue 4 (18th August 2022)
- Main Title:
- Convolutional neural network prediction of molecular properties for aerosol chemistry and health effects
- Authors:
- Krüger, Matteo
Wilson, Jake
Wietzoreck, Marco
Bandowe, Benjamin A. Musa
Lammel, Gerhard
Schmidt, Bertil
Pöschl, Ulrich
Berkemeier, Thomas - Abstract:
- Abstract: Quinones are chemical compounds commonly found in air particulate matter (PM). Their redox activity can generate reactive oxygen species (ROS) and contribute to the oxidative potential (OP) of PM leading to adverse health effects of aerosols. The quinones' OP and ability to form ROS are linked to their reduction potential (RP, measured in volts), a metric for the tendency to lose electrons in redox reactions. Here, we use convolutional neural networks (CNN) as quantitative structure‐activity relationship (QSAR) models to relate the one‐electron RP of quinones to their molecular structure. For CNN training and testing, a data set of more than 100, 000 quinones with associated RP values derived from density functional theory calculations was encoded in simplified molecular input line entry system (SMILES). The best performing CNN model achieved a root mean square error (RMSE) of 0.115 V for an independent test data set and outperformed linear regression models fitted on common molecular descriptors (≥ 0.140 V RMSE). Augmentation methods were newly adapted or applied to support CNN training with smaller data sets, improving RMSE by up to approximately 37% for a data set of 321 molecules. Adjusted for solvent effects, the CNN‐derived RP predictions showed good agreement with experimental data. Using the newly developed method, we identified a subset of atmospherically relevant quinones that are likely to have a high OP and play a role in aerosol health effects, whichAbstract: Quinones are chemical compounds commonly found in air particulate matter (PM). Their redox activity can generate reactive oxygen species (ROS) and contribute to the oxidative potential (OP) of PM leading to adverse health effects of aerosols. The quinones' OP and ability to form ROS are linked to their reduction potential (RP, measured in volts), a metric for the tendency to lose electrons in redox reactions. Here, we use convolutional neural networks (CNN) as quantitative structure‐activity relationship (QSAR) models to relate the one‐electron RP of quinones to their molecular structure. For CNN training and testing, a data set of more than 100, 000 quinones with associated RP values derived from density functional theory calculations was encoded in simplified molecular input line entry system (SMILES). The best performing CNN model achieved a root mean square error (RMSE) of 0.115 V for an independent test data set and outperformed linear regression models fitted on common molecular descriptors (≥ 0.140 V RMSE). Augmentation methods were newly adapted or applied to support CNN training with smaller data sets, improving RMSE by up to approximately 37% for a data set of 321 molecules. Adjusted for solvent effects, the CNN‐derived RP predictions showed good agreement with experimental data. Using the newly developed method, we identified a subset of atmospherically relevant quinones that are likely to have a high OP and play a role in aerosol health effects, which remains to be further elucidated by experimental studies. We suggest to use the presented machine learning approach in further investigations of atmospheric aerosol chemistry and health effects as well as other studies that require a target‐oriented screening of the properties and effects of large classes of substances. Key Points: 1. Convolutional neural networks can be used to estimate unknown physical and chemical properties of chemical substances and outperform additive group contribution methods. 2. Augmentation methods aid in the prevalent problem of data availability. 3. Quinone species detected in the environment are screened for potential relevance in atmospheric chemistry and public health and presented in this study. Abstract : … (more)
- Is Part Of:
- Natural sciences. Volume 2:Issue 4(2022)
- Journal:
- Natural sciences
- Issue:
- Volume 2:Issue 4(2022)
- Issue Display:
- Volume 2, Issue 4 (2022)
- Year:
- 2022
- Volume:
- 2
- Issue:
- 4
- Issue Sort Value:
- 2022-0002-0004-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2022-08-18
- Subjects:
- quinone -- redox chemistry -- environmental chemistry -- redox potential -- oxidative potential -- convolutional neural network -- machine learning -- QSAR
Science -- Periodicals
505 - Journal URLs:
- https://onlinelibrary.wiley.com/journal/26986248 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/ntls.20220016 ↗
- Languages:
- English
- ISSNs:
- 2698-6248
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24813.xml