A comparison of accuracy and computational feasibility of two record linkage algorithms in retrieving vital status information from HIV/AIDS patients registered in Brazilian public databases. (June 2018)
- Record Type:
- Journal Article
- Title:
- A comparison of accuracy and computational feasibility of two record linkage algorithms in retrieving vital status information from HIV/AIDS patients registered in Brazilian public databases. (June 2018)
- Main Title:
- A comparison of accuracy and computational feasibility of two record linkage algorithms in retrieving vital status information from HIV/AIDS patients registered in Brazilian public databases
- Authors:
- de Paula, Adelzon Assis
Pires, Denise Franqueira
Filho, Pedro Alves
de Lemos, Kátia Regina Valente
Barçante, Eduardo
Pacheco, Antonio Guilherme - Abstract:
- Highlights: Two free record linkage algorithms employed to retrieve vital status from people living with HIV/AIDS were compared. Accuracy parameters and computational feasibility were comparatively discussed. Fake test datasets containing different proportions of records from patients known to be alive and from patients actually dead were assembled in order to determine accuracy parameters. Both algorithms proved to be accurate enough in cross-referencing information from PLWHA to the mortality database. While accuracy measures were interchangeable, a clear trade-off between processing time and the need of manual reviewing is presented. Abstract: Background and objective: While cross-referencing information from people living with HIV/AIDS (PLWHA) to the official mortality database is a critical step in monitoring the HIV/AIDS epidemic in Brazil, the accuracy of the linkage routine may compromise the validity of the final database, yielding to biased epidemiological estimates. We compared the accuracy and the total runtime of two linkage algorithms applied to retrieve vital status information from PLWHA in Brazilian public databases. Methods: Nominally identified records from PLWHA were obtained from three distinct government databases. Linkage routines included an algorithm in Python language (PLA) and Reclink software (RlS), a probabilistic software largely utilized in Brazil. Records from PLWHA 1 known to be alive were added to those from patients reported as deceased.Highlights: Two free record linkage algorithms employed to retrieve vital status from people living with HIV/AIDS were compared. Accuracy parameters and computational feasibility were comparatively discussed. Fake test datasets containing different proportions of records from patients known to be alive and from patients actually dead were assembled in order to determine accuracy parameters. Both algorithms proved to be accurate enough in cross-referencing information from PLWHA to the mortality database. While accuracy measures were interchangeable, a clear trade-off between processing time and the need of manual reviewing is presented. Abstract: Background and objective: While cross-referencing information from people living with HIV/AIDS (PLWHA) to the official mortality database is a critical step in monitoring the HIV/AIDS epidemic in Brazil, the accuracy of the linkage routine may compromise the validity of the final database, yielding to biased epidemiological estimates. We compared the accuracy and the total runtime of two linkage algorithms applied to retrieve vital status information from PLWHA in Brazilian public databases. Methods: Nominally identified records from PLWHA were obtained from three distinct government databases. Linkage routines included an algorithm in Python language (PLA) and Reclink software (RlS), a probabilistic software largely utilized in Brazil. Records from PLWHA 1 known to be alive were added to those from patients reported as deceased. Data were then searched into the mortality system. Scenarios where 5% and 50% of patients actually dead were simulated, considering both complete cases and 20% missing maternal names. Results: When complete information was available both algorithms had comparable accuracies. In the scenario of 20% missing maternal names, PLA 2 and RlS 3 had sensitivities of 94.5% and 94.6% (p > 0.5), respectively; after manual reviewing, PLA sensitivity increased to 98.4% (96.6–100.0) exceeding that for RlS (p < 0.01). PLA had higher positive predictive value in 5% death proportion. Manual reviewing was intrinsically required by RlS in up to 14% register for people actually dead, whereas the corresponding proportion ranged from 1.5% to 2% for PLA. The lack of manual inspection did not alter PLA sensitivity when complete information was available. When incomplete data was available PLA sensitivity increased from 94.5% to 98.4%, thus exceeding that presented by RlS (94.6%, p < 0.05). RlS spanned considerably less processing time compared to PLA. Conclusion: Both linkage algorithms presented interchangeable accuracies in retrieving vital status data from PLWHA. RlS had a considerably lesser runtime but intrinsically required manually reviewing a fastidious proportion of the matched registries. On the other hand, PLA spent quite more runtime but spared manual reviewing at no expense of accuracy. … (more)
- Is Part Of:
- International journal of medical informatics. Volume 114(2018)
- Journal:
- International journal of medical informatics
- Issue:
- Volume 114(2018)
- Issue Display:
- Volume 114, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 114
- Issue:
- 2018
- Issue Sort Value:
- 2018-0114-2018-0000
- Page Start:
- 45
- Page End:
- 51
- Publication Date:
- 2018-06
- Subjects:
- Record linkage -- Deterministic linkage -- Probabilistic linkage -- HIV -- Public datasets -- Mortality
Medical informatics -- Periodicals
Information science -- Periodicals
Computers -- Periodicals
Medical technology -- Periodicals
Medical Informatics -- Periodicals
Technology, Medical -- Periodicals
Computers
Information science
Medical informatics
Medical technology
Electronic journals
Periodicals
Electronic journals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13865056 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/13865056 ↗
http://www.clinicalkey.com.au/dura/browse/journalIssue/13865056 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ijmedinf.2018.03.005 ↗
- Languages:
- English
- ISSNs:
- 1386-5056
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.345250
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9188.xml