Cargando…
Improving the estimation of educational attainment: New methods for assessing average years of schooling from binned data
BACKGROUND: The accurate measurement of educational attainment is of great importance for population research. Past studies measuring average years of schooling rely on strong assumptions to incorporate binned data. These assumptions, which we refer to as the standard duration method, have not been...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6264843/ https://www.ncbi.nlm.nih.gov/pubmed/30496312 http://dx.doi.org/10.1371/journal.pone.0208019 |
_version_ | 1783375577687785472 |
---|---|
author | Friedman, Joseph Graetz, Nicholas Gakidou, Emmanuela |
author_facet | Friedman, Joseph Graetz, Nicholas Gakidou, Emmanuela |
author_sort | Friedman, Joseph |
collection | PubMed |
description | BACKGROUND: The accurate measurement of educational attainment is of great importance for population research. Past studies measuring average years of schooling rely on strong assumptions to incorporate binned data. These assumptions, which we refer to as the standard duration method, have not been previously evaluated for bias or accuracy. METHODS: We assembled a database of 1,680 survey and census datasets, representing both binned and single-year education data. We developed two models that split bins of education into single year values. We evaluate our models, and compare them to the standard duration method, using out-of-sample predictive validity. RESULTS: Our results indicate that typical methods used to split bins of educational attainment introduce substantial error and bias into estimates of average years of schooling, as compared to new approaches. Globally, the standard duration method underestimates average years of schooling, with a median error of -0.47 years. This effect is especially pronounced in datasets with a smaller number of bins or higher true average attainment, leading to irregular error patterns between geographies and time periods. Both models we developed resulted in unbiased predictions of average years of schooling, with smaller average error than previous methods. We find that one approach using a metric of distance in space and time to identify training data, had the best performance, with a root mean squared error of mean attainment of 0.26 years, compared to 0.92 years for the standard duration algorithm. CONCLUSIONS: Education is a key social indicator and its accurate estimation should be a population research priority. The use of a space-time distance bin-splitting model drastically improved the estimation of average years of schooling from binned education data. We provide a detailed description of how to use the method and recommend that future studies estimating educational attainment across time or geographies use a similar approach. |
format | Online Article Text |
id | pubmed-6264843 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-62648432018-12-19 Improving the estimation of educational attainment: New methods for assessing average years of schooling from binned data Friedman, Joseph Graetz, Nicholas Gakidou, Emmanuela PLoS One Research Article BACKGROUND: The accurate measurement of educational attainment is of great importance for population research. Past studies measuring average years of schooling rely on strong assumptions to incorporate binned data. These assumptions, which we refer to as the standard duration method, have not been previously evaluated for bias or accuracy. METHODS: We assembled a database of 1,680 survey and census datasets, representing both binned and single-year education data. We developed two models that split bins of education into single year values. We evaluate our models, and compare them to the standard duration method, using out-of-sample predictive validity. RESULTS: Our results indicate that typical methods used to split bins of educational attainment introduce substantial error and bias into estimates of average years of schooling, as compared to new approaches. Globally, the standard duration method underestimates average years of schooling, with a median error of -0.47 years. This effect is especially pronounced in datasets with a smaller number of bins or higher true average attainment, leading to irregular error patterns between geographies and time periods. Both models we developed resulted in unbiased predictions of average years of schooling, with smaller average error than previous methods. We find that one approach using a metric of distance in space and time to identify training data, had the best performance, with a root mean squared error of mean attainment of 0.26 years, compared to 0.92 years for the standard duration algorithm. CONCLUSIONS: Education is a key social indicator and its accurate estimation should be a population research priority. The use of a space-time distance bin-splitting model drastically improved the estimation of average years of schooling from binned education data. We provide a detailed description of how to use the method and recommend that future studies estimating educational attainment across time or geographies use a similar approach. Public Library of Science 2018-11-29 /pmc/articles/PMC6264843/ /pubmed/30496312 http://dx.doi.org/10.1371/journal.pone.0208019 Text en © 2018 Friedman et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Friedman, Joseph Graetz, Nicholas Gakidou, Emmanuela Improving the estimation of educational attainment: New methods for assessing average years of schooling from binned data |
title | Improving the estimation of educational attainment: New methods for assessing average years of schooling from binned data |
title_full | Improving the estimation of educational attainment: New methods for assessing average years of schooling from binned data |
title_fullStr | Improving the estimation of educational attainment: New methods for assessing average years of schooling from binned data |
title_full_unstemmed | Improving the estimation of educational attainment: New methods for assessing average years of schooling from binned data |
title_short | Improving the estimation of educational attainment: New methods for assessing average years of schooling from binned data |
title_sort | improving the estimation of educational attainment: new methods for assessing average years of schooling from binned data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6264843/ https://www.ncbi.nlm.nih.gov/pubmed/30496312 http://dx.doi.org/10.1371/journal.pone.0208019 |
work_keys_str_mv | AT friedmanjoseph improvingtheestimationofeducationalattainmentnewmethodsforassessingaverageyearsofschoolingfrombinneddata AT graetznicholas improvingtheestimationofeducationalattainmentnewmethodsforassessingaverageyearsofschoolingfrombinneddata AT gakidouemmanuela improvingtheestimationofeducationalattainmentnewmethodsforassessingaverageyearsofschoolingfrombinneddata |