Cargando…

Statistical Power for Postlicensure Medical Product Safety Data Mining

OBJECTIVE: To perform sample size calculations when using tree-based scan statistics in longitudinal observational databases. METHODS: Tree-based scan statistics enable data mining on epidemiologic datasets where thousands of disease outcomes are organized into hierarchical tree structures with auto...

Descripción completa

Detalles Bibliográficos
Autores principales: Maro, Judith C., Nguyen, Michael D., Dashevsky, Inna, Baker, Meghan A., Kulldorff, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Ubiquity Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5982804/
https://www.ncbi.nlm.nih.gov/pubmed/29881732
http://dx.doi.org/10.5334/egems.225
_version_ 1783328313806159872
author Maro, Judith C.
Nguyen, Michael D.
Dashevsky, Inna
Baker, Meghan A.
Kulldorff, Martin
author_facet Maro, Judith C.
Nguyen, Michael D.
Dashevsky, Inna
Baker, Meghan A.
Kulldorff, Martin
author_sort Maro, Judith C.
collection PubMed
description OBJECTIVE: To perform sample size calculations when using tree-based scan statistics in longitudinal observational databases. METHODS: Tree-based scan statistics enable data mining on epidemiologic datasets where thousands of disease outcomes are organized into hierarchical tree structures with automatic adjustment for multiple testing. We show how to evaluate the statistical power of the unconditional and conditional Poisson versions. The null hypothesis is that there is no increase in the risk for any of the outcomes. The alternative is that one or more outcomes have an excess risk. We varied the excess risk, total sample size, frequency of the underlying event rate, and the level of across-the-board health care utilization. We also quantified the reduction in statistical power resulting from specifying a risk window that was too long or too short. RESULTS: For 500,000 exposed people, we had at least 98 percent power to detect an excess risk of 1 event per 10,000 exposed for all outcomes. In the presence of potential temporal confounding due to across-the-board elevations of health care utilization in the risk window, the conditional tree-based scan statistic controlled type I error well, while the unconditional version did not. DISCUSSION: Data mining analyses using tree-based scan statistics expand the pharmacovigilance toolbox, ensuring adequate monitoring of thousands of outcomes of interest while controlling for multiple hypothesis testing. These power evaluations enable investigators to design and optimize implementation of retrospective data mining analyses.
format Online
Article
Text
id pubmed-5982804
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Ubiquity Press
record_format MEDLINE/PubMed
spelling pubmed-59828042018-06-07 Statistical Power for Postlicensure Medical Product Safety Data Mining Maro, Judith C. Nguyen, Michael D. Dashevsky, Inna Baker, Meghan A. Kulldorff, Martin EGEMS (Wash DC) Research OBJECTIVE: To perform sample size calculations when using tree-based scan statistics in longitudinal observational databases. METHODS: Tree-based scan statistics enable data mining on epidemiologic datasets where thousands of disease outcomes are organized into hierarchical tree structures with automatic adjustment for multiple testing. We show how to evaluate the statistical power of the unconditional and conditional Poisson versions. The null hypothesis is that there is no increase in the risk for any of the outcomes. The alternative is that one or more outcomes have an excess risk. We varied the excess risk, total sample size, frequency of the underlying event rate, and the level of across-the-board health care utilization. We also quantified the reduction in statistical power resulting from specifying a risk window that was too long or too short. RESULTS: For 500,000 exposed people, we had at least 98 percent power to detect an excess risk of 1 event per 10,000 exposed for all outcomes. In the presence of potential temporal confounding due to across-the-board elevations of health care utilization in the risk window, the conditional tree-based scan statistic controlled type I error well, while the unconditional version did not. DISCUSSION: Data mining analyses using tree-based scan statistics expand the pharmacovigilance toolbox, ensuring adequate monitoring of thousands of outcomes of interest while controlling for multiple hypothesis testing. These power evaluations enable investigators to design and optimize implementation of retrospective data mining analyses. Ubiquity Press 2017-06-12 /pmc/articles/PMC5982804/ /pubmed/29881732 http://dx.doi.org/10.5334/egems.225 Text en Copyright: © 2018 The Author(s) https://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0), which permits unrestricted use and distribution, for non-commercial purposes, as long as the original material has not been modified, and provided the original author and source are credited. See https://creativecommons.org/licenses/by-nc-nd/3.0/.
spellingShingle Research
Maro, Judith C.
Nguyen, Michael D.
Dashevsky, Inna
Baker, Meghan A.
Kulldorff, Martin
Statistical Power for Postlicensure Medical Product Safety Data Mining
title Statistical Power for Postlicensure Medical Product Safety Data Mining
title_full Statistical Power for Postlicensure Medical Product Safety Data Mining
title_fullStr Statistical Power for Postlicensure Medical Product Safety Data Mining
title_full_unstemmed Statistical Power for Postlicensure Medical Product Safety Data Mining
title_short Statistical Power for Postlicensure Medical Product Safety Data Mining
title_sort statistical power for postlicensure medical product safety data mining
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5982804/
https://www.ncbi.nlm.nih.gov/pubmed/29881732
http://dx.doi.org/10.5334/egems.225
work_keys_str_mv AT marojudithc statisticalpowerforpostlicensuremedicalproductsafetydatamining
AT nguyenmichaeld statisticalpowerforpostlicensuremedicalproductsafetydatamining
AT dashevskyinna statisticalpowerforpostlicensuremedicalproductsafetydatamining
AT bakermeghana statisticalpowerforpostlicensuremedicalproductsafetydatamining
AT kulldorffmartin statisticalpowerforpostlicensuremedicalproductsafetydatamining