Cargando…

Efficient Estimation of Smooth Distributions From Coarsely Grouped Data

Ungrouping binned data can be desirable for many reasons: Bins can be too coarse to allow for accurate analysis; comparisons can be hindered when different grouping approaches are used in different histograms; and the last interval is often wide and open-ended and, thus, covers a lot of information...

Descripción completa

Detalles Bibliográficos
Autores principales: Rizzi, Silvia, Gampe, Jutta, Eilers, Paul H. C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4493979/
https://www.ncbi.nlm.nih.gov/pubmed/26081676
http://dx.doi.org/10.1093/aje/kwv020
_version_ 1782380008783740928
author Rizzi, Silvia
Gampe, Jutta
Eilers, Paul H. C.
author_facet Rizzi, Silvia
Gampe, Jutta
Eilers, Paul H. C.
author_sort Rizzi, Silvia
collection PubMed
description Ungrouping binned data can be desirable for many reasons: Bins can be too coarse to allow for accurate analysis; comparisons can be hindered when different grouping approaches are used in different histograms; and the last interval is often wide and open-ended and, thus, covers a lot of information in the tail area. Age group–specific disease incidence rates and abridged life tables are examples of binned data. We propose a versatile method for ungrouping histograms that assumes that only the underlying distribution is smooth. Because of this modest assumption, the approach is suitable for most applications. The method is based on the composite link model, with a penalty added to ensure the smoothness of the target distribution. Estimates are obtained by maximizing a penalized likelihood. This maximization is performed efficiently by a version of the iteratively reweighted least-squares algorithm. Optimal values of the smoothing parameter are chosen by minimizing Akaike's Information Criterion. We demonstrate the performance of this method in a simulation study and provide several examples that illustrate the approach. Wide, open-ended intervals can be handled properly. The method can be extended to the estimation of rates when both the event counts and the exposures to risk are grouped.
format Online
Article
Text
id pubmed-4493979
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44939792015-07-09 Efficient Estimation of Smooth Distributions From Coarsely Grouped Data Rizzi, Silvia Gampe, Jutta Eilers, Paul H. C. Am J Epidemiol Practice of Epidemiology Ungrouping binned data can be desirable for many reasons: Bins can be too coarse to allow for accurate analysis; comparisons can be hindered when different grouping approaches are used in different histograms; and the last interval is often wide and open-ended and, thus, covers a lot of information in the tail area. Age group–specific disease incidence rates and abridged life tables are examples of binned data. We propose a versatile method for ungrouping histograms that assumes that only the underlying distribution is smooth. Because of this modest assumption, the approach is suitable for most applications. The method is based on the composite link model, with a penalty added to ensure the smoothness of the target distribution. Estimates are obtained by maximizing a penalized likelihood. This maximization is performed efficiently by a version of the iteratively reweighted least-squares algorithm. Optimal values of the smoothing parameter are chosen by minimizing Akaike's Information Criterion. We demonstrate the performance of this method in a simulation study and provide several examples that illustrate the approach. Wide, open-ended intervals can be handled properly. The method can be extended to the estimation of rates when both the event counts and the exposures to risk are grouped. Oxford University Press 2015-07-15 2015-06-16 /pmc/articles/PMC4493979/ /pubmed/26081676 http://dx.doi.org/10.1093/aje/kwv020 Text en © The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Practice of Epidemiology
Rizzi, Silvia
Gampe, Jutta
Eilers, Paul H. C.
Efficient Estimation of Smooth Distributions From Coarsely Grouped Data
title Efficient Estimation of Smooth Distributions From Coarsely Grouped Data
title_full Efficient Estimation of Smooth Distributions From Coarsely Grouped Data
title_fullStr Efficient Estimation of Smooth Distributions From Coarsely Grouped Data
title_full_unstemmed Efficient Estimation of Smooth Distributions From Coarsely Grouped Data
title_short Efficient Estimation of Smooth Distributions From Coarsely Grouped Data
title_sort efficient estimation of smooth distributions from coarsely grouped data
topic Practice of Epidemiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4493979/
https://www.ncbi.nlm.nih.gov/pubmed/26081676
http://dx.doi.org/10.1093/aje/kwv020
work_keys_str_mv AT rizzisilvia efficientestimationofsmoothdistributionsfromcoarselygroupeddata
AT gampejutta efficientestimationofsmoothdistributionsfromcoarselygroupeddata
AT eilerspaulhc efficientestimationofsmoothdistributionsfromcoarselygroupeddata