Time-series clustering approach for training data selection of a data-driven predictive model: Application to an industrial bio 2, 3-butanediol distillation process. (May 2022)
- Record Type:
- Journal Article
- Title:
- Time-series clustering approach for training data selection of a data-driven predictive model: Application to an industrial bio 2, 3-butanediol distillation process. (May 2022)
- Main Title:
- Time-series clustering approach for training data selection of a data-driven predictive model: Application to an industrial bio 2, 3-butanediol distillation process
- Authors:
- Choi, Yeongryeol
An, Nahyeon
Hong, Seokyoung
Cho, Hyungtae
Lim, Jongkoo
Han, In-Su
Moon, Il
Kim, Junghwan - Abstract:
- Highlights: The training data selection method using time-series clustering is proposed. The proposed method is applied to commercial 2, 3-BDO distillation process. The number and ratio of training data are optimized by mathematical model. Abstract: In this study, we propose a time-series clustering approach that selects optimal training data for the development of predictive models. The optimal number of clusters was set based on the variation of within-cluster sums of squares. A predictive model was developed with the selection ratio of training data from each of those clusters. Based on the results, a regression model was developed to predict the performance of the model. The search space was applied to the regression model, and the optimal training data ratio were selected satisfying the objective function and constraints. The effectiveness of the method is demonstrated by addressing a commercial bio 2, 3-butanediol distillation process. As a result, the number of data for model training was reduced by 49.20% compared to the base case without clustering. The coefficient of determination (R 2 ) showed the same level of performance, and the root-mean-square error was improved up to 14.07%.
- Is Part Of:
- Computers & chemical engineering. Volume 161(2022)
- Journal:
- Computers & chemical engineering
- Issue:
- Volume 161(2022)
- Issue Display:
- Volume 161, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 161
- Issue:
- 2022
- Issue Sort Value:
- 2022-0161-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-05
- Subjects:
- Training data selection -- Data-driven predictive model -- Time-series clustering -- Bio 2, 3-BDO
Chemical engineering -- Data processing -- Periodicals
660.0285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00981354 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compchemeng.2022.107758 ↗
- Languages:
- English
- ISSNs:
- 0098-1354
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.664000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21651.xml