Improved Secondary Analysis of Linked Data: A Framework and an Illustration. (7th June 2019)
- Record Type:
- Journal Article
- Title:
- Improved Secondary Analysis of Linked Data: A Framework and an Illustration. (7th June 2019)
- Main Title:
- Improved Secondary Analysis of Linked Data: A Framework and an Illustration
- Authors:
- Chambers, Ray
Diniz da Silva, Andrea - Abstract:
- Summary: Applications that use linked data are now part of mainstream social science research, though they generally do not take linkage error into consideration. Solutions that correct for the bias caused by these errors have been proposed but are not yet embedded in the various analysis procedures in common use. Secondary analyses based on linked data can therefore be potentially misleading. We review some recent approaches to non-deterministic data linkage together with a framework for secondary analysis of the linked data which makes use of paradata produced by the linkage process to correct this bias. We also describe a new method for secondary analysis of linked data that builds on this framework and show how it can be used for estimation of a set of domain means based on linked data. We then illustrate this approach via an empirical study based on record linkage of agricultural producers in four states of Brazil aimed at producing estimates of agricultural output by industry. Our study considers register-to-register linkage as well as sample-to-register linkage, and we show results for the traditional Fellegi–Sunter approach to record linkage as well as for a newer linkage procedure based on the use of classification trees and bagging.
- Is Part Of:
- Journal of the Royal Statistical Society. Volume 183:Number 1(2020)
- Journal:
- Journal of the Royal Statistical Society
- Issue:
- Volume 183:Number 1(2020)
- Issue Display:
- Volume 183, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 183
- Issue:
- 1
- Issue Sort Value:
- 2020-0183-0001-0000
- Page Start:
- 37
- Page End:
- 59
- Publication Date:
- 2019-06-07
- Subjects:
- Bias correction -- Classification analysis -- Linkage error -- Paradata -- Probability linkage
Social sciences -- Statistical methods -- Periodicals
Statistics -- Periodicals
300.15195 - Journal URLs:
- http://rss.onlinelibrary.wiley.com/hub/journal/10.1111/(ISSN)1467-985X/ ↗
https://academic.oup.com/jrsssa ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/rssa.12477 ↗
- Languages:
- English
- ISSNs:
- 0964-1998
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4866.000000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 27093.xml