The development of a mobile app‐focused deduplication strategy for the Apple Heart Study that informs recommendations for future digital trials
An app‐based clinical trial enrolment process can contribute to duplicated records, carrying data management implications. Our objective was to identify duplicated records in real time in the Apple Heart Study (AHS). We leveraged personal identifiable information (PII) to develop a dissimilarity sco...
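Main Authors: | Garcia, Ariadna; Lee, Justin; Balasubramanian, Vidhya; Gardner, Rebecca; Gummidipundi, Santosh E.; Hung, Grace; Ferris, Todd; Cheung, Lauren; Desai, Sumbul; Granger, Christopher B.; Hills, Mellanie True; Kowey, Peter; Nag, Divya; Rumsfeld, John S.; Russo, Andrea M.; Stein, Jeffrey W.; Talati, Nisha; Tsay, David; Mahaffey, Kenneth W.; Perez, Marco V.; Turakhia, Mintu P.; Hedlin, Haley; Desai, Manisha |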
---|---|
Format: | Online Article Text |
Language: | English |
Published: | John Wiley and Sons Inc. 2022 |
Subjects: | Special Issue Articles |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9787886/ https://www.ncbi.nlm.nih.gov/pubmed/36589778 http://dx.doi.org/10.1002/sta4.470 |
_version_ | 1784858620691742720 |
---|---|
author | Garcia, Ariadna Lee, Justin Balasubramanian, Vidhya Gardner, Rebecca Gummidipundi, Santosh E. Hung, Grace Ferris, Todd Cheung, Lauren Desai, Sumbul Granger, Christopher B. Hills, Mellanie True Kowey, Peter Nag, Divya Rumsfeld, John S. Russo, Andrea M. Stein, Jeffrey W. Talati, Nisha Tsay, David Mahaffey, Kenneth W. Perez, Marco V. Turakhia, Mintu P. Hedlin, Haley Desai, Manisha |
author_facet | Garcia, Ariadna Lee, Justin Balasubramanian, Vidhya Gardner, Rebecca Gummidipundi, Santosh E. Hung, Grace Ferris, Todd Cheung, Lauren Desai, Sumbul Granger, Christopher B. Hills, Mellanie True Kowey, Peter Nag, Divya Rumsfeld, John S. Russo, Andrea M. Stein, Jeffrey W. Talati, Nisha Tsay, David Mahaffey, Kenneth W. Perez, Marco V. Turakhia, Mintu P. Hedlin, Haley Desai, Manisha |
author_sort | Garcia, Ariadna |
collection | PubMed |
description | An app‐based clinical trial enrolment process can contribute to duplicated records, carrying data management implications. Our objective was to identify duplicated records in real time in the Apple Heart Study (AHS). We leveraged personal identifiable information (PII) to develop a dissimilarity score (DS) using the Damerau–Levenshtein distance. For computational efficiency, we focused on four types of records at the highest risk of duplication. We used the receiver operating curve (ROC) and resampling methods to derive and validate a decision rule to classify duplicated records. We identified 16,398 (4%) duplicated participants, resulting in 419,297 unique participants out of a total of 438,435 possible. Our decision rule yielded a high positive predictive value (96%) with negligible impact on the trial's original findings. Our findings provide principled solutions for future digital trials. When establishing deduplication procedures for digital trials, we recommend collecting device identifiers in addition to participant identifiers; collecting and ensuring secure access to PII; conducting a pilot study to identify reasons for duplicated records; establishing an initial deduplication algorithm that can be refined; creating a data quality plan that informs refinement; and embedding the initial deduplication algorithm in the enrolment platform to ensure unique enrolment and linkage to previous records. |
format | Online Article Text |
id | pubmed-9787886 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97878862022-12-28 The development of a mobile app‐focused deduplication strategy for the Apple Heart Study that informs recommendations for future digital trials Garcia, Ariadna Lee, Justin Balasubramanian, Vidhya Gardner, Rebecca Gummidipundi, Santosh E. Hung, Grace Ferris, Todd Cheung, Lauren Desai, Sumbul Granger, Christopher B. Hills, Mellanie True Kowey, Peter Nag, Divya Rumsfeld, John S. Russo, Andrea M. Stein, Jeffrey W. Talati, Nisha Tsay, David Mahaffey, Kenneth W. Perez, Marco V. Turakhia, Mintu P. Hedlin, Haley Desai, Manisha Stat (Int Stat Inst) Special Issue Articles An app‐based clinical trial enrolment process can contribute to duplicated records, carrying data management implications. Our objective was to identify duplicated records in real time in the Apple Heart Study (AHS). We leveraged personal identifiable information (PII) to develop a dissimilarity score (DS) using the Damerau–Levenshtein distance. For computational efficiency, we focused on four types of records at the highest risk of duplication. We used the receiver operating curve (ROC) and resampling methods to derive and validate a decision rule to classify duplicated records. We identified 16,398 (4%) duplicated participants, resulting in 419,297 unique participants out of a total of 438,435 possible. Our decision rule yielded a high positive predictive value (96%) with negligible impact on the trial's original findings. Our findings provide principled solutions for future digital trials. 
When establishing deduplication procedures for digital trials, we recommend collecting device identifiers in addition to participant identifiers; collecting and ensuring secure access to PII; conducting a pilot study to identify reasons for duplicated records; establishing an initial deduplication algorithm that can be refined; creating a data quality plan that informs refinement; and embedding the initial deduplication algorithm in the enrolment platform to ensure unique enrolment and linkage to previous records. John Wiley and Sons Inc. 2022-11-18 2022-12 /pmc/articles/PMC9787886/ /pubmed/36589778 http://dx.doi.org/10.1002/sta4.470 Text en © 2022 The Authors. Stat published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Special Issue Articles Garcia, Ariadna Lee, Justin Balasubramanian, Vidhya Gardner, Rebecca Gummidipundi, Santosh E. Hung, Grace Ferris, Todd Cheung, Lauren Desai, Sumbul Granger, Christopher B. Hills, Mellanie True Kowey, Peter Nag, Divya Rumsfeld, John S. Russo, Andrea M. Stein, Jeffrey W. Talati, Nisha Tsay, David Mahaffey, Kenneth W. Perez, Marco V. Turakhia, Mintu P. Hedlin, Haley Desai, Manisha The development of a mobile app‐focused deduplication strategy for the Apple Heart Study that informs recommendations for future digital trials |
title | The development of a mobile app‐focused deduplication strategy for the Apple Heart Study that informs recommendations for future digital trials |
title_full | The development of a mobile app‐focused deduplication strategy for the Apple Heart Study that informs recommendations for future digital trials |
title_fullStr | The development of a mobile app‐focused deduplication strategy for the Apple Heart Study that informs recommendations for future digital trials |
title_full_unstemmed | The development of a mobile app‐focused deduplication strategy for the Apple Heart Study that informs recommendations for future digital trials |
title_short | The development of a mobile app‐focused deduplication strategy for the Apple Heart Study that informs recommendations for future digital trials |
title_sort | development of a mobile app‐focused deduplication strategy for the apple heart study that informs recommendations for future digital trials |
topic | Special Issue Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9787886/ https://www.ncbi.nlm.nih.gov/pubmed/36589778 http://dx.doi.org/10.1002/sta4.470 |
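
The description above mentions a dissimilarity score (DS) built from the Damerau–Levenshtein distance over PII fields, followed by a decision rule that classifies record pairs as duplicates. This record does not give the score's formula, the fields used, or the cut-off, so the following is only a minimal sketch under stated assumptions: it uses the restricted (optimal string alignment) variant of the distance, averages length-normalised distances over hypothetical fields (`first`, `last`, `email`), and applies an illustrative threshold of 0.2. None of these choices should be read as the authors' method.

```python
from typing import Dict, Sequence


def osa_distance(a: str, b: str) -> int:
    """Restricted Damerau-Levenshtein (optimal string alignment) distance.

    Counts insertions, deletions, substitutions and transpositions of two
    adjacent characters, each at cost 1.
    """
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all i characters of a
    for j in range(cols):
        d[0][j] = j  # insert all j characters of b
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # match or substitution
            )
            # Adjacent transposition, e.g. "nh" <-> "hn".
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


def dissimilarity_score(
    rec_a: Dict[str, str], rec_b: Dict[str, str], fields: Sequence[str]
) -> float:
    """Assumed DS: mean of per-field distances, each divided by the longer length."""
    total = 0.0
    for field in fields:
        x = rec_a[field].strip().lower()
        y = rec_b[field].strip().lower()
        longest = max(len(x), len(y))
        total += osa_distance(x, y) / longest if longest else 0.0
    return total / len(fields)


def is_duplicate(
    rec_a: Dict[str, str],
    rec_b: Dict[str, str],
    fields: Sequence[str],
    threshold: float = 0.2,  # illustrative cut-off, not taken from the study
) -> bool:
    """Hypothetical decision rule: flag the pair when the DS is at or below the threshold."""
    return dissimilarity_score(rec_a, rec_b, fields) <= threshold


# Made-up records for illustration only.
FIELDS = ("first", "last", "email")
record_1 = {"first": "Jonh", "last": "Smith", "email": "jsmith@example.com"}
record_2 = {"first": "John", "last": "Smith", "email": "jsmith@example.com"}
print(is_duplicate(record_1, record_2, FIELDS))
```

In practice a threshold like this would be tuned against labelled pairs, for example by inspecting a ROC curve as the description indicates, rather than fixed in advance.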