A random forest with multi-fidelity Gaussian process leaves for modeling multi-fidelity data with heterogeneity. (December 2022)
- Record Type:
- Journal Article
- Title:
- A random forest with multi-fidelity Gaussian process leaves for modeling multi-fidelity data with heterogeneity. (December 2022)
- Main Title:
- A random forest with multi-fidelity Gaussian process leaves for modeling multi-fidelity data with heterogeneity
- Authors:
- Ghosh, Mithun
Wu, Lang
Hao, Qing
Zhou, Qiang - Abstract:
- Abstract: Modeling multi-fidelity datasets has been widely used recently. High-fidelity data often suffer from scarcity. Low-fidelity models have abundant observations where information from low-fidelity models can be transferred to high-fidelity models. However, the modeling performance for the multi-fidelity models is below par in most cases due to the heterogeneity of the data. Modeling time is also a critical issue for MF datasets due to high dimension of the data. We propose to frame a multi-fidelity Gaussian process model into a random forest framework to incorporate its flexibility and improve the prediction performance when there are a limited amount of high-fidelity data and the data exhibit heterogeneity in the space of interest. Information extracted from the low-fidelity model can be borrowed for the high-fidelity model by capturing cross-level data correlations. The multi-fidelity model is extended to a tree ensemble structure with an efficient partitioning criterion to tackle data heterogeneity. The proposed method is able to provide uncertainty quantification for predicted values. Numerical examples and case studies are conducted to show the efficacy of our method for the heterogeneous behaviors of the responses across the input space. Highlights: A Novel Material Oxide Discovery Problem is studied. Clustering of materials based on physical and chemical properties is considered. An ensemble of random forest and multi-fidelity Gaussian process algorithms isAbstract: Modeling multi-fidelity datasets has been widely used recently. High-fidelity data often suffer from scarcity. Low-fidelity models have abundant observations where information from low-fidelity models can be transferred to high-fidelity models. However, the modeling performance for the multi-fidelity models is below par in most cases due to the heterogeneity of the data. Modeling time is also a critical issue for MF datasets due to high dimension of the data. We propose to frame a multi-fidelity Gaussian process model into a random forest framework to incorporate its flexibility and improve the prediction performance when there are a limited amount of high-fidelity data and the data exhibit heterogeneity in the space of interest. Information extracted from the low-fidelity model can be borrowed for the high-fidelity model by capturing cross-level data correlations. The multi-fidelity model is extended to a tree ensemble structure with an efficient partitioning criterion to tackle data heterogeneity. The proposed method is able to provide uncertainty quantification for predicted values. Numerical examples and case studies are conducted to show the efficacy of our method for the heterogeneous behaviors of the responses across the input space. Highlights: A Novel Material Oxide Discovery Problem is studied. Clustering of materials based on physical and chemical properties is considered. An ensemble of random forest and multi-fidelity Gaussian process algorithms is proposed. A tailored computation method to deal with heterogeneity and data scarcity is devised. Computational experiments verify the effectiveness of the proposed algorithm. … (more)
- Is Part Of:
- Computers & industrial engineering. Volume 174(2022)
- Journal:
- Computers & industrial engineering
- Issue:
- Volume 174(2022)
- Issue Display:
- Volume 174, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 174
- Issue:
- 2022
- Issue Sort Value:
- 2022-0174-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-12
- Subjects:
- Multi-fidelity model -- Gaussian process -- Random forest -- Product of experts
Engineering -- Data processing -- Periodicals
Industrial engineering -- Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03608352 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cie.2022.108746 ↗
- Languages:
- English
- ISSNs:
- 0360-8352
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.713000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24449.xml