Pediatric population health analysis of southern and central Illinois region: A cross sectional retrospective study using association rule mining and multiple logistic regression. (September 2019)
- Record Type:
- Journal Article
- Title:
- Pediatric population health analysis of southern and central Illinois region: A cross sectional retrospective study using association rule mining and multiple logistic regression. (September 2019)
- Main Title:
- Pediatric population health analysis of southern and central Illinois region: A cross sectional retrospective study using association rule mining and multiple logistic regression
- Authors:
- Buxton, Elham Khorasani
Vohra, Sameer
Guo, Yanhui
Fogleman, Amanda
Patel, Rushabh - Abstract:
- Highlights: Class Association Rule Mining (CARM) is an effective method to discover unknown associations and generate hypotheses which correlate patient demographics to diagnosis of pediatric diseases in a large medical administrative database. The clinically relevant associations found by CARM must be further validated and adjusted using multiple logistic regression. CARM was applied to a large cross-sectional data of pediatric patients served by Southern Illinois University School of Medicine (SIUSOM) to find clinically relevant correlations between patients' demographics and diagnosis of pediatric diseases. Several associations were found correlating specific SIUSOM pediatric patients' residential zip codes with the diagnosis codes for viral hepatitis carrier, exposure to communicable diseases, screening for mental and developmental disorder in childhood, history allergy to medications, disturbance of emotions specific to childhood, and acute sinusitis. SIUSOM African American pediatric patients were more likely to be screened for mental and developmental disorders compared to White patients for SIUSOM pediatric population. Abstract: Background: Southern Illinois University School of Medicine (SIUSOM) collects large amounts of data every day. SIUSOM and other similar healthcare systems are always looking for better ways to use the data to understand and address population level problems. The purpose of this study is to analyze the administrative dataset for pediatricHighlights: Class Association Rule Mining (CARM) is an effective method to discover unknown associations and generate hypotheses which correlate patient demographics to diagnosis of pediatric diseases in a large medical administrative database. The clinically relevant associations found by CARM must be further validated and adjusted using multiple logistic regression. CARM was applied to a large cross-sectional data of pediatric patients served by Southern Illinois University School of Medicine (SIUSOM) to find clinically relevant correlations between patients' demographics and diagnosis of pediatric diseases. Several associations were found correlating specific SIUSOM pediatric patients' residential zip codes with the diagnosis codes for viral hepatitis carrier, exposure to communicable diseases, screening for mental and developmental disorder in childhood, history allergy to medications, disturbance of emotions specific to childhood, and acute sinusitis. SIUSOM African American pediatric patients were more likely to be screened for mental and developmental disorders compared to White patients for SIUSOM pediatric population. Abstract: Background: Southern Illinois University School of Medicine (SIUSOM) collects large amounts of data every day. SIUSOM and other similar healthcare systems are always looking for better ways to use the data to understand and address population level problems. The purpose of this study is to analyze the administrative dataset for pediatric patients served by Southern Illinois University School of Medicine (SIUSOM) to uncover patterns that correlate specific demographic information to diagnoses of pediatric diseases. The study uses a cross-sectional database of medical billing information for all pediatric patients served by SIUSOM between June 2013 and December 2016. The dataset consists of about 980.9K clinical visits for 65.4K unique patients and includes patient demographic identifiers such as their sex, date of birth, race, anonymous zipcode and primary and secondary insurance plan as well as the related pediatric diagnosis codes. The goal is to find unknown correlations in this database. Method: We proposed a two step methodology to derive unknown correlations in SIUSOM administrative database. First, Class association rule mining was used as a well-established data mining method to generate hypothesis and derive associations of the form D → M, where D is diagnosis code of a pediatric disease and M is a patient demographic identifier (age, sex, anonymous zipcode, insurance plan, or race). The resulting associations were pruned and filtered using measures such as lift, odds ratio, relative risk, and confidence. The final associations were selected by a pediatric doctor based on their clinical significance. Second, each association rule in the final set was further validated and adjusted odds ratios were obtained using multiple logistic regression. Results: Several associations were found correlating specific patients' residential zip codes with the diagnosis codes for viral hepatitis carrier, exposure to communicable diseases, screening for mental and developmental disorder in childhood, history allergy to medications, disturbance of emotions specific to childhood, and acute sinusitis. In addition, the results show that African American patients are more likely to be screened for mental and developmental disorders compared to White patients for SIUSOM pediatric population (Odds Ratio (OR):3.56, 95% Confidence Interval (CI):[3.29, 3.85]). Conclusion: Class association rule mining is an effective method for detecting signals in a large patient administrative database and generating hypotheses which correlate patients' demographics with diagnosis of pediatric diseases. A post processing of the hypotheses generated by this method is necessary to prune spurious associations and select a set of clinically relevant hypotheses. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 178(2019)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 178(2019)
- Issue Display:
- Volume 178, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 178
- Issue:
- 2019
- Issue Sort Value:
- 2019-0178-2019-0000
- Page Start:
- 145
- Page End:
- 153
- Publication Date:
- 2019-09
- Subjects:
- Population health analysis -- Data mining -- Medical billing database -- Pediatric diseases risk factors
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2019.06.020 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11355.xml