Colorectal Cancer Detected by Machine Learning Models Using Conventional Laboratory Test Data. (November 2021)
- Record Type:
- Journal Article
- Title:
- Colorectal Cancer Detected by Machine Learning Models Using Conventional Laboratory Test Data. (November 2021)
- Main Title:
- Colorectal Cancer Detected by Machine Learning Models Using Conventional Laboratory Test Data
- Authors:
- Li, Hui
Lin, Jianmei
Xiao, Yanhong
Zheng, Wenwen
Zhao, Lu
Yang, Xiangling
Zhong, Minsheng
Liu, Huanliang - Abstract:
- Background: Current diagnostic methods for colorectal cancer (CRC) are colonoscopy and sigmoidoscopy, which are invasive and complex procedures with possible complications. This study aimed to determine models for CRC identification that involve minimally invasive, affordable, portable, and accurate screening variables.Methods: This was a retrospective study that used data from electronic medical records of patients with CRC and healthy individuals between July 2017 and June 2018. Laboratory data, including liver enzymes, lipid profiles, complete blood counts, and tumor biomarkers, were extracted from the electronic medical records. Five machine learning models (logistic regression, random forest, k-nearest neighbors, support vector machine [SVM], and naïve Bayes) were used to identify CRC. The performances were evaluated using the areas under the curve (AUCs), sensitivity, specificity, positive predictive values (PPV), and negative predictive values (NPV).Results: A total of 1164 electronic medical records (CRC patients: 582; healthy controls: 582) were included. The logistic regression model achieved the highest performance in identifying CRC (AUC: 0.865, sensitivity: 89.5%, specificity: 83.5%, PPV: 84.4%, NPV: 88.9%). The first four weighted features in the model were carcinoembryonic antigen (CEA), hemoglobin (HGB), lipoprotein (a) (Lp(a)), and high-density lipoprotein (HDL). A diagnostic model for CRC was established based on the four indicators, with an AUC of 0.849Background: Current diagnostic methods for colorectal cancer (CRC) are colonoscopy and sigmoidoscopy, which are invasive and complex procedures with possible complications. This study aimed to determine models for CRC identification that involve minimally invasive, affordable, portable, and accurate screening variables.Methods: This was a retrospective study that used data from electronic medical records of patients with CRC and healthy individuals between July 2017 and June 2018. Laboratory data, including liver enzymes, lipid profiles, complete blood counts, and tumor biomarkers, were extracted from the electronic medical records. Five machine learning models (logistic regression, random forest, k-nearest neighbors, support vector machine [SVM], and naïve Bayes) were used to identify CRC. The performances were evaluated using the areas under the curve (AUCs), sensitivity, specificity, positive predictive values (PPV), and negative predictive values (NPV).Results: A total of 1164 electronic medical records (CRC patients: 582; healthy controls: 582) were included. The logistic regression model achieved the highest performance in identifying CRC (AUC: 0.865, sensitivity: 89.5%, specificity: 83.5%, PPV: 84.4%, NPV: 88.9%). The first four weighted features in the model were carcinoembryonic antigen (CEA), hemoglobin (HGB), lipoprotein (a) (Lp(a)), and high-density lipoprotein (HDL). A diagnostic model for CRC was established based on the four indicators, with an AUC of 0.849 (0.840-0.860) for identifying all CRC patients, and it performed best in discriminating patients with late colon cancer from healthy individuals with an AUC of 0.905 (0.889-0.929).Conclusions: The logistic regression model based on CEA, HGB, Lp(a), and HDL might be a powerful, noninvasive, and cost-effective method to identify CRC. … (more)
- Is Part Of:
- Technology in cancer research & treatment. Volume 20(2021)
- Journal:
- Technology in cancer research & treatment
- Issue:
- Volume 20(2021)
- Issue Display:
- Volume 20, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 20
- Issue:
- 2021
- Issue Sort Value:
- 2021-0020-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-11
- Subjects:
- diagnosis -- colorectal cancer -- machine learning -- logistic regression -- clinical laboratory techniques
Oncology -- Periodicals
Cancer -- Diagnosis -- Periodicals
Cancer -- Treatment -- Technological innovations -- Periodicals
616.994 - Journal URLs:
- http://tct.sagepub.com/ ↗
http://www.tcrt.org ↗
http://www.sagepub.com ↗ - DOI:
- 10.1177/15330338211058352 ↗
- Languages:
- English
- ISSNs:
- 1533-0346
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19940.xml