A conceptual study of transfer learning with linear models for data-driven property prediction. (January 2022)
- Record Type:
- Journal Article
- Title:
- A conceptual study of transfer learning with linear models for data-driven property prediction. (January 2022)
- Main Title:
- A conceptual study of transfer learning with linear models for data-driven property prediction
- Authors:
- Li, Bowen
Rangarajan, Srinivas - Abstract:
- Highlights: A conceptual study of when transfer learning improves data-driven models. Using an interpretable model to infer how shared features affect transfer learning. Illustration of transfer learning in multi-fidelity predictive models. Illustration of transfer learning for molecular property models. Example of detrimental effect of transfer learning between disparate properties. Abstract: Transfer learning is a concept whereby data-driven models can be developed for tasks (e.g. molecular properties) with limited data availability (target task) by sharing information from a related task. In the context of chemical engineering, the two tasks can either pertain to related properties or to the same property calculated or measured in two different ways (with differing accuracies or resolution). Using an ensemble of linear and interpretable models, in this work, we present a conceptual study to explicate when transfer learning can be beneficial. We show that a large overlap of the underlying features of the two tasks (specifically greater than 50%) is required for transfer learning to improve the model for the target task. On the other hand, transferring information (in particular, information regarding salient features) from an uncorrelated task can be detrimental to train a model for the target task. Subsequently, we present three illustrative examples of transfer learning for molecular property prediction and rationalize the usefulness of transferred information based onHighlights: A conceptual study of when transfer learning improves data-driven models. Using an interpretable model to infer how shared features affect transfer learning. Illustration of transfer learning in multi-fidelity predictive models. Illustration of transfer learning for molecular property models. Example of detrimental effect of transfer learning between disparate properties. Abstract: Transfer learning is a concept whereby data-driven models can be developed for tasks (e.g. molecular properties) with limited data availability (target task) by sharing information from a related task. In the context of chemical engineering, the two tasks can either pertain to related properties or to the same property calculated or measured in two different ways (with differing accuracies or resolution). Using an ensemble of linear and interpretable models, in this work, we present a conceptual study to explicate when transfer learning can be beneficial. We show that a large overlap of the underlying features of the two tasks (specifically greater than 50%) is required for transfer learning to improve the model for the target task. On the other hand, transferring information (in particular, information regarding salient features) from an uncorrelated task can be detrimental to train a model for the target task. Subsequently, we present three illustrative examples of transfer learning for molecular property prediction and rationalize the usefulness of transferred information based on the inferences from our conceptual studies. This work, thus, provides a simplified analysis of the concept of transfer learning for building molecular property models. … (more)
- Is Part Of:
- Computers & chemical engineering. Volume 157(2022)
- Journal:
- Computers & chemical engineering
- Issue:
- Volume 157(2022)
- Issue Display:
- Volume 157, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 157
- Issue:
- 2022
- Issue Sort Value:
- 2022-0157-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-01
- Subjects:
- Transfer learning -- Machine learning -- Molecular property
Chemical engineering -- Data processing -- Periodicals
660.0285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00981354 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compchemeng.2021.107599 ↗
- Languages:
- English
- ISSNs:
- 0098-1354
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.664000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20420.xml