Cargando…

Linking Data for Mothers and Babies in De-Identified Electronic Health Data

OBJECTIVE: Linkage of longitudinal administrative data for mothers and babies supports research and service evaluation in several populations around the world. We established a linked mother-baby cohort using pseudonymised, population-level data for England. DESIGN AND SETTING: Retrospective linkage...

Descripción completa

Detalles Bibliográficos
Autores principales: Harron, Katie, Gilbert, Ruth, Cromwell, David, van der Meulen, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5072610/
https://www.ncbi.nlm.nih.gov/pubmed/27764135
http://dx.doi.org/10.1371/journal.pone.0164667
_version_ 1782461423566192640
author Harron, Katie
Gilbert, Ruth
Cromwell, David
van der Meulen, Jan
author_facet Harron, Katie
Gilbert, Ruth
Cromwell, David
van der Meulen, Jan
author_sort Harron, Katie
collection PubMed
description OBJECTIVE: Linkage of longitudinal administrative data for mothers and babies supports research and service evaluation in several populations around the world. We established a linked mother-baby cohort using pseudonymised, population-level data for England. DESIGN AND SETTING: Retrospective linkage study using electronic hospital records of mothers and babies admitted to NHS hospitals in England, captured in Hospital Episode Statistics between April 2001 and March 2013. RESULTS: Of 672,955 baby records in 2012/13, 280,470 (42%) linked deterministically to a maternal record using hospital, GP practice, maternal age, birthweight, gestation, birth order and sex. A further 380,164 (56%) records linked using probabilistic methods incorporating additional variables that could differ between mother/baby records (admission dates, ethnicity, 3/4-character postcode district) or that include missing values (delivery variables). The false-match rate was estimated at 0.15% using synthetic data. Data quality improved over time: for 2001/02, 91% of baby records were linked (holding the estimated false-match rate at 0.15%). The linked cohort was representative of national distributions of gender, gestation, birth weight and maternal age, and captured approximately 97% of births in England. CONCLUSION: Probabilistic linkage of maternal and baby healthcare characteristics offers an efficient way to enrich maternity data, improve data quality, and create longitudinal cohorts for research and service evaluation. This approach could be extended to linkage of other datasets that have non-disclosive characteristics in common.
format Online
Article
Text
id pubmed-5072610
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-50726102016-10-27 Linking Data for Mothers and Babies in De-Identified Electronic Health Data Harron, Katie Gilbert, Ruth Cromwell, David van der Meulen, Jan PLoS One Research Article OBJECTIVE: Linkage of longitudinal administrative data for mothers and babies supports research and service evaluation in several populations around the world. We established a linked mother-baby cohort using pseudonymised, population-level data for England. DESIGN AND SETTING: Retrospective linkage study using electronic hospital records of mothers and babies admitted to NHS hospitals in England, captured in Hospital Episode Statistics between April 2001 and March 2013. RESULTS: Of 672,955 baby records in 2012/13, 280,470 (42%) linked deterministically to a maternal record using hospital, GP practice, maternal age, birthweight, gestation, birth order and sex. A further 380,164 (56%) records linked using probabilistic methods incorporating additional variables that could differ between mother/baby records (admission dates, ethnicity, 3/4-character postcode district) or that include missing values (delivery variables). The false-match rate was estimated at 0.15% using synthetic data. Data quality improved over time: for 2001/02, 91% of baby records were linked (holding the estimated false-match rate at 0.15%). The linked cohort was representative of national distributions of gender, gestation, birth weight and maternal age, and captured approximately 97% of births in England. CONCLUSION: Probabilistic linkage of maternal and baby healthcare characteristics offers an efficient way to enrich maternity data, improve data quality, and create longitudinal cohorts for research and service evaluation. This approach could be extended to linkage of other datasets that have non-disclosive characteristics in common. Public Library of Science 2016-10-20 /pmc/articles/PMC5072610/ /pubmed/27764135 http://dx.doi.org/10.1371/journal.pone.0164667 Text en © 2016 Harron et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Harron, Katie
Gilbert, Ruth
Cromwell, David
van der Meulen, Jan
Linking Data for Mothers and Babies in De-Identified Electronic Health Data
title Linking Data for Mothers and Babies in De-Identified Electronic Health Data
title_full Linking Data for Mothers and Babies in De-Identified Electronic Health Data
title_fullStr Linking Data for Mothers and Babies in De-Identified Electronic Health Data
title_full_unstemmed Linking Data for Mothers and Babies in De-Identified Electronic Health Data
title_short Linking Data for Mothers and Babies in De-Identified Electronic Health Data
title_sort linking data for mothers and babies in de-identified electronic health data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5072610/
https://www.ncbi.nlm.nih.gov/pubmed/27764135
http://dx.doi.org/10.1371/journal.pone.0164667
work_keys_str_mv AT harronkatie linkingdataformothersandbabiesindeidentifiedelectronichealthdata
AT gilbertruth linkingdataformothersandbabiesindeidentifiedelectronichealthdata
AT cromwelldavid linkingdataformothersandbabiesindeidentifiedelectronichealthdata
AT vandermeulenjan linkingdataformothersandbabiesindeidentifiedelectronichealthdata