Application of recommender systems and time series models to monitor quality at HIV/AIDS health facilities. (11th July 2022)
- Record Type:
- Journal Article
- Title:
- Application of recommender systems and time series models to monitor quality at HIV/AIDS health facilities. (11th July 2022)
- Main Title:
- Application of recommender systems and time series models to monitor quality at HIV/AIDS health facilities
- Authors:
- Friedman, Jonathan
Allen, Zola
Fox, Allison
Webert, Jose
Devlin, Andrew - Abstract:
- Abstract: The US government invests substantial sums to control the HIV/AIDS epidemic. To monitor progress toward epidemic control, PEPFAR, or the President's Emergency Plan for AIDS Relief, oversees a data reporting system that includes standard indicators, reporting formats, information systems, and data warehouses. These data, reported quarterly, inform understanding of the global epidemic, resource allocation, and identification of trouble spots. PEPFAR has developed tools to assess the quality of data reported. These tools made important contributions but are limited in the methods used to identify anomalous data points. The most advanced consider univariate probability distributions, whereas correlations between indicators suggest a multivariate approach is better suited. For temporal analysis, the same tool compares values to the averages of preceding periods, though does not consider underlying trends and seasonal factors. To that end, we apply two methods to identify anomalous data points among routinely collected facility-level HIV/AIDS data. One approach is Recommender Systems, an unsupervised machine learning method that captures relationships between users and items. We apply the approach in a novel way by predicting reported values, comparing predicted to reported values, and identifying the greatest deviations. For a temporal perspective, we apply time series models that are flexible to include trend and seasonality. Results of these methods were validatedAbstract: The US government invests substantial sums to control the HIV/AIDS epidemic. To monitor progress toward epidemic control, PEPFAR, or the President's Emergency Plan for AIDS Relief, oversees a data reporting system that includes standard indicators, reporting formats, information systems, and data warehouses. These data, reported quarterly, inform understanding of the global epidemic, resource allocation, and identification of trouble spots. PEPFAR has developed tools to assess the quality of data reported. These tools made important contributions but are limited in the methods used to identify anomalous data points. The most advanced consider univariate probability distributions, whereas correlations between indicators suggest a multivariate approach is better suited. For temporal analysis, the same tool compares values to the averages of preceding periods, though does not consider underlying trends and seasonal factors. To that end, we apply two methods to identify anomalous data points among routinely collected facility-level HIV/AIDS data. One approach is Recommender Systems, an unsupervised machine learning method that captures relationships between users and items. We apply the approach in a novel way by predicting reported values, comparing predicted to reported values, and identifying the greatest deviations. For a temporal perspective, we apply time series models that are flexible to include trend and seasonality. Results of these methods were validated against manual review (95% agreement on non-anomalies, 56% agreement on anomalies for recommender systems; 96% agreement on non-anomalies, 91% agreement on anomalies for time series). This tool will apply greater methodological sophistication to monitoring data quality in an accelerated and standardized manner. … (more)
- Is Part Of:
- Data & policy. Volume 4(2022)
- Journal:
- Data & policy
- Issue:
- Volume 4(2022)
- Issue Display:
- Volume 4, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 4
- Issue:
- 2022
- Issue Sort Value:
- 2022-0004-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-07-11
- Subjects:
- anomaly detection -- data quality -- machine learning -- recommender systems -- time series
Policy sciences -- Periodicals
Policy sciences -- Statistical methods -- Periodicals
Policy sciences -- Data processing -- Periodicals
Decision making -- Data processing -- Periodicals
320.60727 - Journal URLs:
- https://www.cambridge.org/core/journals/data-and-policy ↗
- DOI:
- 10.1017/dap.2022.15 ↗
- Languages:
- English
- ISSNs:
- 2632-3249
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 22283.xml