Using high-fidelity meta-models to improve performance of small dataset trained Bayesian Networks. (January 2020)
- Record Type:
- Journal Article
- Title:
- Using high-fidelity meta-models to improve performance of small dataset trained Bayesian Networks. (January 2020)
- Main Title:
- Using high-fidelity meta-models to improve performance of small dataset trained Bayesian Networks
- Authors:
- MacAllister, Anastacia
Kohl, Adam
Winer, Eliot - Abstract:
- Highlights: Bayesian Networks often cannot be used with small datasets due to accuracy concerns. Kriging and radial-basis function meta-models are viable options for augmenting datasets. Bayesian network accuracy increases when using meta-model generated data. Abstract: Machine Learning (ML) is increasingly being used by companies like Google, Amazon and Apple to help identify market trends and predict customer behavior. Continuous improvement and maturing of these ML tools will help improve decision making across a number of industries. Unfortunately, before many ML strategies can be utilized the methods often require large amounts of data. For a number of realistic situations, however, only smaller subsets of data are available (i.e. hundreds to thousands of points). This work explores this problem by investigating the feasibility of using meta-models, specifically Kriging and Radial Basis Functions, to generate data for training a BN when only small amounts of original data are available. This paper details the meta-model creation process and the results of using Particle Swarm Optimization (PSO) for tuning parameters for four network structures trained using three relatively small data sets. Additionally, a series of experiments augment these small datasets by generating ten thousand, one-hundred thousand, and a million synthetic data points using the Kriging and RBF meta-models as well as intelligently establishing prior probabilities using PSO. Results show thatHighlights: Bayesian Networks often cannot be used with small datasets due to accuracy concerns. Kriging and radial-basis function meta-models are viable options for augmenting datasets. Bayesian network accuracy increases when using meta-model generated data. Abstract: Machine Learning (ML) is increasingly being used by companies like Google, Amazon and Apple to help identify market trends and predict customer behavior. Continuous improvement and maturing of these ML tools will help improve decision making across a number of industries. Unfortunately, before many ML strategies can be utilized the methods often require large amounts of data. For a number of realistic situations, however, only smaller subsets of data are available (i.e. hundreds to thousands of points). This work explores this problem by investigating the feasibility of using meta-models, specifically Kriging and Radial Basis Functions, to generate data for training a BN when only small amounts of original data are available. This paper details the meta-model creation process and the results of using Particle Swarm Optimization (PSO) for tuning parameters for four network structures trained using three relatively small data sets. Additionally, a series of experiments augment these small datasets by generating ten thousand, one-hundred thousand, and a million synthetic data points using the Kriging and RBF meta-models as well as intelligently establishing prior probabilities using PSO. Results show that augmenting limited existing datasets with meta-model generated data can dramatically affect network accuracy. Overall, the exploratory results presented in this paper demonstrate the feasibility of using meta-model generated data to increase the accuracy of small sample set trained BN. Further developing this method will help underserved areas with access to only small datasets make use of the powerful predictive analytics of ML. … (more)
- Is Part Of:
- Expert systems with applications. Volume 139(2020)
- Journal:
- Expert systems with applications
- Issue:
- Volume 139(2020)
- Issue Display:
- Volume 139, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 139
- Issue:
- 2020
- Issue Sort Value:
- 2020-0139-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-01
- Subjects:
- Machine learning -- Bayesian Networks -- Small datasets -- Meta-models -- Data generation
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2019.112830 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12031.xml