A novel imputation based predictive algorithm for reducing common cause variation from small and mixed datasets with missing values. (May 2023)
- Record Type:
- Journal Article
- Title:
- A novel imputation based predictive algorithm for reducing common cause variation from small and mixed datasets with missing values. (May 2023)
- Main Title:
- A novel imputation based predictive algorithm for reducing common cause variation from small and mixed datasets with missing values
- Authors:
- Batbooti, Raed S.
Ransing, Rajesh S. - Abstract:
- Abstract: Most process control algorithms need a predetermined target value as an input for a process variable so that the deviation is observed and minimized. In this paper, a novel machine learning algorithm is proposed that has an ability to not only suggest new target values for both categorical and continuous variables to minimize process output variation but also predict the extent to which the variation can be minimized. In foundry processes, an average rejection rate of 3%–5% within batches of castings produced is considered as acceptable and is considered as an effect of the common cause variation. As a result, the operating range for process input values is often not changed during the root cause analysis. The relevant available historical process data is normally limited with missing values and it combines both categorical and continuous variables (mixed dataset). However, technological advancements manufacturing processes provide opportunities to further refine process inputs in order to minimize undesired variation in process outputs. A new linear regression based algorithm is proposed to achieve lower prediction error in comparison to the commonly used linear factor analysis for mixed data (FAMD) method. This algorithm is further coupled with a novel missing data algorithm to predict the process response values corresponding to a given set of values for process inputs. This enabled the novel imputation based predictive algorithm to quantify the effect of aAbstract: Most process control algorithms need a predetermined target value as an input for a process variable so that the deviation is observed and minimized. In this paper, a novel machine learning algorithm is proposed that has an ability to not only suggest new target values for both categorical and continuous variables to minimize process output variation but also predict the extent to which the variation can be minimized. In foundry processes, an average rejection rate of 3%–5% within batches of castings produced is considered as acceptable and is considered as an effect of the common cause variation. As a result, the operating range for process input values is often not changed during the root cause analysis. The relevant available historical process data is normally limited with missing values and it combines both categorical and continuous variables (mixed dataset). However, technological advancements manufacturing processes provide opportunities to further refine process inputs in order to minimize undesired variation in process outputs. A new linear regression based algorithm is proposed to achieve lower prediction error in comparison to the commonly used linear factor analysis for mixed data (FAMD) method. This algorithm is further coupled with a novel missing data algorithm to predict the process response values corresponding to a given set of values for process inputs. This enabled the novel imputation based predictive algorithm to quantify the effect of a confirmation trial based on the proposed changes in the operating ranges of one or more process inputs. A set of values for optimal process inputs is generated from operating ranges discovered by a recently proposed quality correlation algorithm (QCA) using a Bootstrap sampling method. The odds ratio, which represents a ratio between the probability of occurrence of desired and undesired process output values, is used to quantify the effect of a confirmation trial. The limitations of the underlying PCA based linear model have been discussed and the future research areas have been identified. Highlights: A missing data algorithm for mixed dataset is also used as a predictive tool. The effect of optimal and avoid ranges within the process variation is quantified. Process interactions among categorical and continuous variables are considered. The algorithm is useful for tolerance synthesis of any process. … (more)
- Is Part Of:
- Computers & industrial engineering. Volume 179(2023)
- Journal:
- Computers & industrial engineering
- Issue:
- Volume 179(2023)
- Issue Display:
- Volume 179, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 179
- Issue:
- 2023
- Issue Sort Value:
- 2023-0179-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-05
- Subjects:
- Common cause variation -- Missing data -- Predictive analytics -- Quality improvement -- Tolerance limit optimization -- 7Epsilon
Engineering -- Data processing -- Periodicals
Industrial engineering -- Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03608352 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cie.2023.109230 ↗
- Languages:
- English
- ISSNs:
- 0360-8352
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.713000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 27042.xml