Comparison of non-parametric methods for ungrouping coarsely aggregated data. Issue 1 (December 2016)
- Record Type:
- Journal Article
- Title:
- Comparison of non-parametric methods for ungrouping coarsely aggregated data. Issue 1 (December 2016)
- Main Title:
- Comparison of non-parametric methods for ungrouping coarsely aggregated data
- Authors:
- Rizzi, Silvia
Thinggaard, Mikael
Engholm, Gerda
Christensen, Niels
Johannesen, Tom
Vaupel, James
Lindahl-Jacobsen, Rune - Abstract:
- Abstract Background Histograms are a common tool to estimate densities non-parametrically. They are extensively encountered in health sciences to summarize data in a compact format. Examples are age-specific distributions of death or onset of diseases grouped in 5-years age classes with an open-ended age group at the highest ages. When histogram intervals are too coarse, information is lost and comparison between histograms with different boundaries is arduous. In these cases it is useful to estimate detailed distributions from grouped data. Methods From an extensive literature search we identify five methods for ungrouping count data. We compare the performance of two spline interpolation methods, two kernel density estimators and a penalized composite link model first via a simulation study and then with empirical data obtained from the NORDCAN Database. All methods analyzed can be used to estimate differently shaped distributions; can handle unequal interval length; and allow stretches of 0 counts. Results The methods show similar performance when the grouping scheme is relatively narrow, i.e. 5-years age classes. With coarser age intervals, i.e. in the presence of open-ended age groups, the penalized composite link model performs the best. Conclusion We give an overview and test different methods to estimate detailed distributions from grouped count data. Health researchers can benefit from these versatile methods, which are ready for use in the statistical software R.Abstract Background Histograms are a common tool to estimate densities non-parametrically. They are extensively encountered in health sciences to summarize data in a compact format. Examples are age-specific distributions of death or onset of diseases grouped in 5-years age classes with an open-ended age group at the highest ages. When histogram intervals are too coarse, information is lost and comparison between histograms with different boundaries is arduous. In these cases it is useful to estimate detailed distributions from grouped data. Methods From an extensive literature search we identify five methods for ungrouping count data. We compare the performance of two spline interpolation methods, two kernel density estimators and a penalized composite link model first via a simulation study and then with empirical data obtained from the NORDCAN Database. All methods analyzed can be used to estimate differently shaped distributions; can handle unequal interval length; and allow stretches of 0 counts. Results The methods show similar performance when the grouping scheme is relatively narrow, i.e. 5-years age classes. With coarser age intervals, i.e. in the presence of open-ended age groups, the penalized composite link model performs the best. Conclusion We give an overview and test different methods to estimate detailed distributions from grouped count data. Health researchers can benefit from these versatile methods, which are ready for use in the statistical software R. We recommend using the penalized composite link model when data are grouped in wide age classes. … (more)
- Is Part Of:
- BMC medical research methodology. Volume 16:Issue 1(2016)
- Journal:
- BMC medical research methodology
- Issue:
- Volume 16:Issue 1(2016)
- Issue Display:
- Volume 16, Issue 1 (2016)
- Year:
- 2016
- Volume:
- 16
- Issue:
- 1
- Issue Sort Value:
- 2016-0016-0001-0000
- Page Start:
- 1
- Page End:
- 12
- Publication Date:
- 2016-12
- Subjects:
- Aggregated count data -- Ungrouping methods -- Smoothing
Medicine -- Research -- Methodology -- Periodicals
610.72 - Journal URLs:
- http://www.biomedcentral.com/bmcmedresmethodol/ ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=43 ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s12874-016-0157-8 ↗
- Languages:
- English
- ISSNs:
- 1471-2288
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 12399.xml