Accurate Classification of Pediatric Colonic Inflammatory Bowel Disease Subtype Using a Random Forest Machine Learning Classifier. Issue 2 (February 2021)
- Record Type:
- Journal Article
- Title:
- Accurate Classification of Pediatric Colonic Inflammatory Bowel Disease Subtype Using a Random Forest Machine Learning Classifier. Issue 2 (February 2021)
- Main Title:
- Accurate Classification of Pediatric Colonic Inflammatory Bowel Disease Subtype Using a Random Forest Machine Learning Classifier
- Authors:
- Dhaliwal, Jasbir
Erdman, Lauren
Drysdale, Erik
Rinawi, Firas
Muir, Jennifer
Walters, Thomas D.
Siddiqui, Iram
Griffiths, Anne M.
Church, Peter C. - Abstract:
- Abstract : Supplemental Digital Content is available in the text ABSTRACT: Background: The pediatric inflammatory bowel disease (PIBD) classes algorithm was developed to bring consistency to labelling of colonic IBD, but labels are exclusively based on features atypical for ulcerative colitis (UC). Aim: The aim of the study was to develop an algorithm and identify features that discriminate between pediatric UC and colonic Crohn disease (CD). Methods: Baseline clinical, endoscopic, radiologic, and histologic data, including the PIBD class features in 74 colonic IBD (56: UC, 18: colonic CD) patients were collected. The PIBD class features and additional features common to UC were used to perform initial clustering, using similarity network fusion (SNF). We trained a Random Forest (RF) classifier on the full dataset and used a leave-one-out approach to evaluate model accuracy. The top-features were used to build a new classifier, which we tested on 15 previously unused patients. We then performed clustering with SNF on the top RF features and assessed ability to discriminate between UC and colonic-CD independent of a supervised model. Results: The initial SNF clustering with 58 patients demonstrated 2 groups: group 1 (n = 39, 90% UC) and group 2 (n = 19, 68% colonic-CD). Our RF classifier correctly labelled 97% of the 58 patients based on leave-one-out cross validation and identified the 7 most important features (3 histological and 4 endoscopic) to clinically distinguishAbstract : Supplemental Digital Content is available in the text ABSTRACT: Background: The pediatric inflammatory bowel disease (PIBD) classes algorithm was developed to bring consistency to labelling of colonic IBD, but labels are exclusively based on features atypical for ulcerative colitis (UC). Aim: The aim of the study was to develop an algorithm and identify features that discriminate between pediatric UC and colonic Crohn disease (CD). Methods: Baseline clinical, endoscopic, radiologic, and histologic data, including the PIBD class features in 74 colonic IBD (56: UC, 18: colonic CD) patients were collected. The PIBD class features and additional features common to UC were used to perform initial clustering, using similarity network fusion (SNF). We trained a Random Forest (RF) classifier on the full dataset and used a leave-one-out approach to evaluate model accuracy. The top-features were used to build a new classifier, which we tested on 15 previously unused patients. We then performed clustering with SNF on the top RF features and assessed ability to discriminate between UC and colonic-CD independent of a supervised model. Results: The initial SNF clustering with 58 patients demonstrated 2 groups: group 1 (n = 39, 90% UC) and group 2 (n = 19, 68% colonic-CD). Our RF classifier correctly labelled 97% of the 58 patients based on leave-one-out cross validation and identified the 7 most important features (3 histological and 4 endoscopic) to clinically distinguish these groups. We trained a new RF classifier with the top 7 features and found 100% accuracy in a set of 15 held-out patients. Finally, post hoc clustering with these 7 features revealed 2 groups of patients: group 1 (n = 55, 98% UC) and group 2 (n = 18, 94% colonic-CD). Conclusions: A combination of supervised and unsupervised analyses identified a short list of features, which consistently distinguish UC from colonic CD. Future directions include validation in other populations. … (more)
- Is Part Of:
- Journal of pediatric gastroenterology and nutrition. Volume 72:Issue 2(2021)
- Journal:
- Journal of pediatric gastroenterology and nutrition
- Issue:
- Volume 72:Issue 2(2021)
- Issue Display:
- Volume 72, Issue 2 (2021)
- Year:
- 2021
- Volume:
- 72
- Issue:
- 2
- Issue Sort Value:
- 2021-0072-0002-0000
- Page Start:
- 262
- Page End:
- 269
- Publication Date:
- 2021-02
- Subjects:
- classification -- Crohn disease -- machine learning -- Random Forest -- ulcerative Colitis
Children -- Nutrition -- Periodicals
Pediatric gastroenterology -- Periodicals
Infants -- Nutrition -- Periodicals
Nutrition disorders in children -- Periodicals
Child Nutrition -- Periodicals
Digestive System -- growth & development -- Periodicals
Gastrointestinal Diseases -- Periodicals
Infant Nutrition -- Periodicals
Nutrition Disorders -- Periodicals
Child
618.923 - Journal URLs:
- http://www.jpgn.org ↗
http://ovidsp.ovid.com/ovidweb.cgi?T=JS&NEWS=n&CSC=Y&PAGE=toc&D=yrovft&AN=00005176-000000000-00000 ↗
http://journals.lww.com ↗ - DOI:
- 10.1097/MPG.0000000000002956 ↗
- Languages:
- English
- ISSNs:
- 0277-2116
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5030.175000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 22251.xml