Cargando…
Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information
Linking very large, consented birth cohorts to birthing hospitals clinical data could elucidate the lifecourse outcomes of health care and exposures during the pregnancy, birth and newborn periods. Unfortunately, cohort personally identifiable information (PII) often does not include unique identifi...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776599/ https://www.ncbi.nlm.nih.gov/pubmed/36553359 http://dx.doi.org/10.3390/children9121916 |
_version_ | 1784855905318207488 |
---|---|
author | Hu, Yanhong Jessika Fedyukova, Anna Wang, Jing Said, Joanne M. Thomas, Niranjan Noble, Elizabeth Cheong, Jeanie L. Y. Karanatsios, Bill Goldfeld, Sharon Wake, Melissa |
author_facet | Hu, Yanhong Jessika Fedyukova, Anna Wang, Jing Said, Joanne M. Thomas, Niranjan Noble, Elizabeth Cheong, Jeanie L. Y. Karanatsios, Bill Goldfeld, Sharon Wake, Melissa |
author_sort | Hu, Yanhong Jessika |
collection | PubMed |
description | Linking very large, consented birth cohorts to birthing hospitals clinical data could elucidate the lifecourse outcomes of health care and exposures during the pregnancy, birth and newborn periods. Unfortunately, cohort personally identifiable information (PII) often does not include unique identifier numbers, presenting matching challenges. To develop optimized cohort matching to birthing hospital clinical records, this pilot drew on a one-year (December 2020–December 2021) cohort for a single Australian birthing hospital participating in the whole-of-state Generation Victoria (GenV) study. For 1819 consented mother-baby pairs and 58 additional babies (whose mothers were not themselves participating), we tested the accuracy and effort of various approaches to matching. We selected demographic variables drawn from names, DOB, sex, telephone, address (and birth order for multiple births). After variable standardization and validation, accuracy rose from 10% to 99% using a deterministic-rule-based approach in 10 steps. Using cohort-specific modifications of the Australian Statistical Linkage Key (SLK-581), it took only 3 steps to reach 97% (SLK-5881) and 98% (SLK-5881.1) accuracy. We conclude that our SLK-5881 process could safely and efficiently achieve high accuracy at the population level for future birth cohort-birth hospital matching in the absence of unique identifier numbers. |
format | Online Article Text |
id | pubmed-9776599 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-97765992022-12-23 Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information Hu, Yanhong Jessika Fedyukova, Anna Wang, Jing Said, Joanne M. Thomas, Niranjan Noble, Elizabeth Cheong, Jeanie L. Y. Karanatsios, Bill Goldfeld, Sharon Wake, Melissa Children (Basel) Article Linking very large, consented birth cohorts to birthing hospitals clinical data could elucidate the lifecourse outcomes of health care and exposures during the pregnancy, birth and newborn periods. Unfortunately, cohort personally identifiable information (PII) often does not include unique identifier numbers, presenting matching challenges. To develop optimized cohort matching to birthing hospital clinical records, this pilot drew on a one-year (December 2020–December 2021) cohort for a single Australian birthing hospital participating in the whole-of-state Generation Victoria (GenV) study. For 1819 consented mother-baby pairs and 58 additional babies (whose mothers were not themselves participating), we tested the accuracy and effort of various approaches to matching. We selected demographic variables drawn from names, DOB, sex, telephone, address (and birth order for multiple births). After variable standardization and validation, accuracy rose from 10% to 99% using a deterministic-rule-based approach in 10 steps. Using cohort-specific modifications of the Australian Statistical Linkage Key (SLK-581), it took only 3 steps to reach 97% (SLK-5881) and 98% (SLK-5881.1) accuracy. We conclude that our SLK-5881 process could safely and efficiently achieve high accuracy at the population level for future birth cohort-birth hospital matching in the absence of unique identifier numbers. MDPI 2022-12-07 /pmc/articles/PMC9776599/ /pubmed/36553359 http://dx.doi.org/10.3390/children9121916 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Hu, Yanhong Jessika Fedyukova, Anna Wang, Jing Said, Joanne M. Thomas, Niranjan Noble, Elizabeth Cheong, Jeanie L. Y. Karanatsios, Bill Goldfeld, Sharon Wake, Melissa Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information |
title | Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information |
title_full | Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information |
title_fullStr | Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information |
title_full_unstemmed | Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information |
title_short | Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information |
title_sort | improving cohort-hospital matching accuracy through standardization and validation of participant identifiable information |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776599/ https://www.ncbi.nlm.nih.gov/pubmed/36553359 http://dx.doi.org/10.3390/children9121916 |
work_keys_str_mv | AT huyanhongjessika improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT fedyukovaanna improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT wangjing improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT saidjoannem improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT thomasniranjan improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT nobleelizabeth improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT cheongjeaniely improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT karanatsiosbill improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT goldfeldsharon improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation AT wakemelissa improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation |