Cargando…

Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information

Linking very large, consented birth cohorts to birthing hospitals clinical data could elucidate the lifecourse outcomes of health care and exposures during the pregnancy, birth and newborn periods. Unfortunately, cohort personally identifiable information (PII) often does not include unique identifi...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Yanhong Jessika, Fedyukova, Anna, Wang, Jing, Said, Joanne M., Thomas, Niranjan, Noble, Elizabeth, Cheong, Jeanie L. Y., Karanatsios, Bill, Goldfeld, Sharon, Wake, Melissa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776599/
https://www.ncbi.nlm.nih.gov/pubmed/36553359
http://dx.doi.org/10.3390/children9121916
_version_ 1784855905318207488
author Hu, Yanhong Jessika
Fedyukova, Anna
Wang, Jing
Said, Joanne M.
Thomas, Niranjan
Noble, Elizabeth
Cheong, Jeanie L. Y.
Karanatsios, Bill
Goldfeld, Sharon
Wake, Melissa
author_facet Hu, Yanhong Jessika
Fedyukova, Anna
Wang, Jing
Said, Joanne M.
Thomas, Niranjan
Noble, Elizabeth
Cheong, Jeanie L. Y.
Karanatsios, Bill
Goldfeld, Sharon
Wake, Melissa
author_sort Hu, Yanhong Jessika
collection PubMed
description Linking very large, consented birth cohorts to birthing hospitals clinical data could elucidate the lifecourse outcomes of health care and exposures during the pregnancy, birth and newborn periods. Unfortunately, cohort personally identifiable information (PII) often does not include unique identifier numbers, presenting matching challenges. To develop optimized cohort matching to birthing hospital clinical records, this pilot drew on a one-year (December 2020–December 2021) cohort for a single Australian birthing hospital participating in the whole-of-state Generation Victoria (GenV) study. For 1819 consented mother-baby pairs and 58 additional babies (whose mothers were not themselves participating), we tested the accuracy and effort of various approaches to matching. We selected demographic variables drawn from names, DOB, sex, telephone, address (and birth order for multiple births). After variable standardization and validation, accuracy rose from 10% to 99% using a deterministic-rule-based approach in 10 steps. Using cohort-specific modifications of the Australian Statistical Linkage Key (SLK-581), it took only 3 steps to reach 97% (SLK-5881) and 98% (SLK-5881.1) accuracy. We conclude that our SLK-5881 process could safely and efficiently achieve high accuracy at the population level for future birth cohort-birth hospital matching in the absence of unique identifier numbers.
format Online
Article
Text
id pubmed-9776599
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97765992022-12-23 Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information Hu, Yanhong Jessika Fedyukova, Anna Wang, Jing Said, Joanne M. Thomas, Niranjan Noble, Elizabeth Cheong, Jeanie L. Y. Karanatsios, Bill Goldfeld, Sharon Wake, Melissa Children (Basel) Article Linking very large, consented birth cohorts to birthing hospitals clinical data could elucidate the lifecourse outcomes of health care and exposures during the pregnancy, birth and newborn periods. Unfortunately, cohort personally identifiable information (PII) often does not include unique identifier numbers, presenting matching challenges. To develop optimized cohort matching to birthing hospital clinical records, this pilot drew on a one-year (December 2020–December 2021) cohort for a single Australian birthing hospital participating in the whole-of-state Generation Victoria (GenV) study. For 1819 consented mother-baby pairs and 58 additional babies (whose mothers were not themselves participating), we tested the accuracy and effort of various approaches to matching. We selected demographic variables drawn from names, DOB, sex, telephone, address (and birth order for multiple births). After variable standardization and validation, accuracy rose from 10% to 99% using a deterministic-rule-based approach in 10 steps. Using cohort-specific modifications of the Australian Statistical Linkage Key (SLK-581), it took only 3 steps to reach 97% (SLK-5881) and 98% (SLK-5881.1) accuracy. We conclude that our SLK-5881 process could safely and efficiently achieve high accuracy at the population level for future birth cohort-birth hospital matching in the absence of unique identifier numbers. MDPI 2022-12-07 /pmc/articles/PMC9776599/ /pubmed/36553359 http://dx.doi.org/10.3390/children9121916 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hu, Yanhong Jessika
Fedyukova, Anna
Wang, Jing
Said, Joanne M.
Thomas, Niranjan
Noble, Elizabeth
Cheong, Jeanie L. Y.
Karanatsios, Bill
Goldfeld, Sharon
Wake, Melissa
Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information
title Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information
title_full Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information
title_fullStr Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information
title_full_unstemmed Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information
title_short Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information
title_sort improving cohort-hospital matching accuracy through standardization and validation of participant identifiable information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776599/
https://www.ncbi.nlm.nih.gov/pubmed/36553359
http://dx.doi.org/10.3390/children9121916
work_keys_str_mv AT huyanhongjessika improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT fedyukovaanna improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT wangjing improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT saidjoannem improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT thomasniranjan improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT nobleelizabeth improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT cheongjeaniely improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT karanatsiosbill improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT goldfeldsharon improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation
AT wakemelissa improvingcohorthospitalmatchingaccuracythroughstandardizationandvalidationofparticipantidentifiableinformation