Cargando…
Efficient Estimation of Smooth Distributions From Coarsely Grouped Data
Ungrouping binned data can be desirable for many reasons: Bins can be too coarse to allow for accurate analysis; comparisons can be hindered when different grouping approaches are used in different histograms; and the last interval is often wide and open-ended and, thus, covers a lot of information...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4493979/ https://www.ncbi.nlm.nih.gov/pubmed/26081676 http://dx.doi.org/10.1093/aje/kwv020 |
_version_ | 1782380008783740928 |
---|---|
author | Rizzi, Silvia Gampe, Jutta Eilers, Paul H. C. |
author_facet | Rizzi, Silvia Gampe, Jutta Eilers, Paul H. C. |
author_sort | Rizzi, Silvia |
collection | PubMed |
description | Ungrouping binned data can be desirable for many reasons: Bins can be too coarse to allow for accurate analysis; comparisons can be hindered when different grouping approaches are used in different histograms; and the last interval is often wide and open-ended and, thus, covers a lot of information in the tail area. Age group–specific disease incidence rates and abridged life tables are examples of binned data. We propose a versatile method for ungrouping histograms that assumes that only the underlying distribution is smooth. Because of this modest assumption, the approach is suitable for most applications. The method is based on the composite link model, with a penalty added to ensure the smoothness of the target distribution. Estimates are obtained by maximizing a penalized likelihood. This maximization is performed efficiently by a version of the iteratively reweighted least-squares algorithm. Optimal values of the smoothing parameter are chosen by minimizing Akaike's Information Criterion. We demonstrate the performance of this method in a simulation study and provide several examples that illustrate the approach. Wide, open-ended intervals can be handled properly. The method can be extended to the estimation of rates when both the event counts and the exposures to risk are grouped. |
format | Online Article Text |
id | pubmed-4493979 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-44939792015-07-09 Efficient Estimation of Smooth Distributions From Coarsely Grouped Data Rizzi, Silvia Gampe, Jutta Eilers, Paul H. C. Am J Epidemiol Practice of Epidemiology Ungrouping binned data can be desirable for many reasons: Bins can be too coarse to allow for accurate analysis; comparisons can be hindered when different grouping approaches are used in different histograms; and the last interval is often wide and open-ended and, thus, covers a lot of information in the tail area. Age group–specific disease incidence rates and abridged life tables are examples of binned data. We propose a versatile method for ungrouping histograms that assumes that only the underlying distribution is smooth. Because of this modest assumption, the approach is suitable for most applications. The method is based on the composite link model, with a penalty added to ensure the smoothness of the target distribution. Estimates are obtained by maximizing a penalized likelihood. This maximization is performed efficiently by a version of the iteratively reweighted least-squares algorithm. Optimal values of the smoothing parameter are chosen by minimizing Akaike's Information Criterion. We demonstrate the performance of this method in a simulation study and provide several examples that illustrate the approach. Wide, open-ended intervals can be handled properly. The method can be extended to the estimation of rates when both the event counts and the exposures to risk are grouped. Oxford University Press 2015-07-15 2015-06-16 /pmc/articles/PMC4493979/ /pubmed/26081676 http://dx.doi.org/10.1093/aje/kwv020 Text en © The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Practice of Epidemiology Rizzi, Silvia Gampe, Jutta Eilers, Paul H. C. Efficient Estimation of Smooth Distributions From Coarsely Grouped Data |
title | Efficient Estimation of Smooth Distributions From Coarsely Grouped Data |
title_full | Efficient Estimation of Smooth Distributions From Coarsely Grouped Data |
title_fullStr | Efficient Estimation of Smooth Distributions From Coarsely Grouped Data |
title_full_unstemmed | Efficient Estimation of Smooth Distributions From Coarsely Grouped Data |
title_short | Efficient Estimation of Smooth Distributions From Coarsely Grouped Data |
title_sort | efficient estimation of smooth distributions from coarsely grouped data |
topic | Practice of Epidemiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4493979/ https://www.ncbi.nlm.nih.gov/pubmed/26081676 http://dx.doi.org/10.1093/aje/kwv020 |
work_keys_str_mv | AT rizzisilvia efficientestimationofsmoothdistributionsfromcoarselygroupeddata AT gampejutta efficientestimationofsmoothdistributionsfromcoarselygroupeddata AT eilerspaulhc efficientestimationofsmoothdistributionsfromcoarselygroupeddata |