Data quality and autism: Issues and potential impacts. (February 2023)
- Record Type:
- Journal Article
- Title:
- Data quality and autism: Issues and potential impacts. (February 2023)
- Main Title:
- Data quality and autism: Issues and potential impacts
- Authors:
- Heyl, Johannes
Hardy, Flavien
Tucker, Katie
Hopper, Adrian
Marchã, Maria J.
Liew, Ashley
Reep, Judith
Harwood, Kerry-Anne
Roberts, Luke
Yates, Jeremy
Day, Jamie
Wheeler, Andrew
Eve-Jones, Sue
Briggs, Tim W.R.
Gray, William K. - Abstract:
- Highlights: Patient features relating strongly with autism coding inconsistencies were found. Hospital spell features were identified relating to autism coding inconsistencies. Nephrology spells and spells involving dialysis had high rates of inconsistencies. Abstract: Introduction: Large healthcare datasets can provide insight that has the potential to improve outcomes for patients. However, it is important to understand the strengths and limitations of such datasets so that the insights they provide are accurate and useful. The aim of this study was to identify data inconsistencies within the Hospital Episodes Statistics (HES) dataset for autistic patients and assess potential biases introduced through these inconsistencies and their impact on patient outcomes. The study can only identify inconsistencies in recording of autism diagnosis and not whether the inclusion or exclusion of the autism diagnosis is the error. Methods: Data were extracted from the HES database for the period 1st April 2013 to 31st March 2021 for patients with a diagnosis of autism. First spells in hospital during the study period were identified for each patient and these were linked to any subsequent spell in hospital for the same patient. Data inconsistencies were recorded where autism was not recorded as a diagnosis in a subsequent spell. Features associated with data inconsistencies were identified using a random forest classifiers and regression modelling. Results: Data were available for 172,Highlights: Patient features relating strongly with autism coding inconsistencies were found. Hospital spell features were identified relating to autism coding inconsistencies. Nephrology spells and spells involving dialysis had high rates of inconsistencies. Abstract: Introduction: Large healthcare datasets can provide insight that has the potential to improve outcomes for patients. However, it is important to understand the strengths and limitations of such datasets so that the insights they provide are accurate and useful. The aim of this study was to identify data inconsistencies within the Hospital Episodes Statistics (HES) dataset for autistic patients and assess potential biases introduced through these inconsistencies and their impact on patient outcomes. The study can only identify inconsistencies in recording of autism diagnosis and not whether the inclusion or exclusion of the autism diagnosis is the error. Methods: Data were extracted from the HES database for the period 1st April 2013 to 31st March 2021 for patients with a diagnosis of autism. First spells in hospital during the study period were identified for each patient and these were linked to any subsequent spell in hospital for the same patient. Data inconsistencies were recorded where autism was not recorded as a diagnosis in a subsequent spell. Features associated with data inconsistencies were identified using a random forest classifiers and regression modelling. Results: Data were available for 172, 324 unique patients who had been recorded as having an autism diagnosis on first admission. In total, 43.7 % of subsequent spells were found to have inconsistencies. The features most strongly associated with inconsistencies included greater age, greater deprivation, longer time since the first spell, change in provider, shorter length of stay, being female and a change in the main specialty description. The random forest algorithm had an area under the receiver operating characteristic curve of 0.864 (95 % CI [0.862 – 0.866]) in predicting a data inconsistency. For patients who died in hospital, inconsistencies in their final spell were significantly associated with being 80 years and over, being female, greater deprivation and use of a palliative care code in the death spell. Conclusions: Data inconsistencies in the HES database were relatively common in autistic patients and were associated a number of patient and hospital admission characteristics. Such inconsistencies have the potential to distort our understanding of service use in key demographic groups. … (more)
- Is Part Of:
- International journal of medical informatics. Volume 170(2023)
- Journal:
- International journal of medical informatics
- Issue:
- Volume 170(2023)
- Issue Display:
- Volume 170, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 170
- Issue:
- 2023
- Issue Sort Value:
- 2023-0170-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-02
- Subjects:
- Healthcare data -- Data consistency -- Autism -- Data accuracy
Medical informatics -- Periodicals
Information science -- Periodicals
Computers -- Periodicals
Medical technology -- Periodicals
Medical Informatics -- Periodicals
Technology, Medical -- Periodicals
Computers
Information science
Medical informatics
Medical technology
Electronic journals
Periodicals
Electronic journals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13865056 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/13865056 ↗
http://www.clinicalkey.com.au/dura/browse/journalIssue/13865056 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ijmedinf.2022.104938 ↗
- Languages:
- English
- ISSNs:
- 1386-5056
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.345250
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25645.xml