Cargando…

Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project

There is a strong and continuously growing interest in using large electronic healthcare databases to study health outcomes and the effects of pharmaceutical products. However, concerns regarding disease misclassification (i.e. classification errors of the disease status) and its impact on the study...

Descripción completa

Detalles Bibliográficos
Autores principales: Bollaerts, Kaatje, Rekkas, Alexandros, De Smedt, Tom, Dodd, Caitlin, Andrews, Nick, Gini, Rosa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176121/
https://www.ncbi.nlm.nih.gov/pubmed/32320422
http://dx.doi.org/10.1371/journal.pone.0231333
_version_ 1783524957028876288
author Bollaerts, Kaatje
Rekkas, Alexandros
De Smedt, Tom
Dodd, Caitlin
Andrews, Nick
Gini, Rosa
author_facet Bollaerts, Kaatje
Rekkas, Alexandros
De Smedt, Tom
Dodd, Caitlin
Andrews, Nick
Gini, Rosa
author_sort Bollaerts, Kaatje
collection PubMed
description There is a strong and continuously growing interest in using large electronic healthcare databases to study health outcomes and the effects of pharmaceutical products. However, concerns regarding disease misclassification (i.e. classification errors of the disease status) and its impact on the study results are legitimate. Validation is therefore increasingly recognized as an essential component of database research. In this work, we elucidate the interrelations between the true prevalence of a disease in a database population (i.e. prevalence assuming no disease misclassification), the observed prevalence subject to disease misclassification, and the most common validity indices: sensitivity, specificity, positive and negative predictive value. Based on this, we obtained analytical expressions to derive all the validity indices and true prevalence from the observed prevalence and any combination of two other parameters. The analytical expressions can be used for various purposes. Most notably, they can be used to obtain an estimate of the observed prevalence adjusted for outcome misclassification from any combination of two validity indices and to derive validity indices from each other which would otherwise be difficult to obtain. To allow researchers to easily use the analytical expressions, we additionally developed a user-friendly and freely available web-application.
format Online
Article
Text
id pubmed-7176121
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71761212020-05-12 Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project Bollaerts, Kaatje Rekkas, Alexandros De Smedt, Tom Dodd, Caitlin Andrews, Nick Gini, Rosa PLoS One Research Article There is a strong and continuously growing interest in using large electronic healthcare databases to study health outcomes and the effects of pharmaceutical products. However, concerns regarding disease misclassification (i.e. classification errors of the disease status) and its impact on the study results are legitimate. Validation is therefore increasingly recognized as an essential component of database research. In this work, we elucidate the interrelations between the true prevalence of a disease in a database population (i.e. prevalence assuming no disease misclassification), the observed prevalence subject to disease misclassification, and the most common validity indices: sensitivity, specificity, positive and negative predictive value. Based on this, we obtained analytical expressions to derive all the validity indices and true prevalence from the observed prevalence and any combination of two other parameters. The analytical expressions can be used for various purposes. Most notably, they can be used to obtain an estimate of the observed prevalence adjusted for outcome misclassification from any combination of two validity indices and to derive validity indices from each other which would otherwise be difficult to obtain. To allow researchers to easily use the analytical expressions, we additionally developed a user-friendly and freely available web-application. Public Library of Science 2020-04-22 /pmc/articles/PMC7176121/ /pubmed/32320422 http://dx.doi.org/10.1371/journal.pone.0231333 Text en © 2020 Bollaerts et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bollaerts, Kaatje
Rekkas, Alexandros
De Smedt, Tom
Dodd, Caitlin
Andrews, Nick
Gini, Rosa
Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project
title Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project
title_full Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project
title_fullStr Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project
title_full_unstemmed Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project
title_short Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project
title_sort disease misclassification in electronic healthcare database studies: deriving validity indices—a contribution from the advance project
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176121/
https://www.ncbi.nlm.nih.gov/pubmed/32320422
http://dx.doi.org/10.1371/journal.pone.0231333
work_keys_str_mv AT bollaertskaatje diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject
AT rekkasalexandros diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject
AT desmedttom diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject
AT doddcaitlin diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject
AT andrewsnick diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject
AT ginirosa diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject