Cargando…
Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project
There is a strong and continuously growing interest in using large electronic healthcare databases to study health outcomes and the effects of pharmaceutical products. However, concerns regarding disease misclassification (i.e. classification errors of the disease status) and its impact on the study...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176121/ https://www.ncbi.nlm.nih.gov/pubmed/32320422 http://dx.doi.org/10.1371/journal.pone.0231333 |
_version_ | 1783524957028876288 |
---|---|
author | Bollaerts, Kaatje Rekkas, Alexandros De Smedt, Tom Dodd, Caitlin Andrews, Nick Gini, Rosa |
author_facet | Bollaerts, Kaatje Rekkas, Alexandros De Smedt, Tom Dodd, Caitlin Andrews, Nick Gini, Rosa |
author_sort | Bollaerts, Kaatje |
collection | PubMed |
description | There is a strong and continuously growing interest in using large electronic healthcare databases to study health outcomes and the effects of pharmaceutical products. However, concerns regarding disease misclassification (i.e. classification errors of the disease status) and its impact on the study results are legitimate. Validation is therefore increasingly recognized as an essential component of database research. In this work, we elucidate the interrelations between the true prevalence of a disease in a database population (i.e. prevalence assuming no disease misclassification), the observed prevalence subject to disease misclassification, and the most common validity indices: sensitivity, specificity, positive and negative predictive value. Based on this, we obtained analytical expressions to derive all the validity indices and true prevalence from the observed prevalence and any combination of two other parameters. The analytical expressions can be used for various purposes. Most notably, they can be used to obtain an estimate of the observed prevalence adjusted for outcome misclassification from any combination of two validity indices and to derive validity indices from each other which would otherwise be difficult to obtain. To allow researchers to easily use the analytical expressions, we additionally developed a user-friendly and freely available web-application. |
format | Online Article Text |
id | pubmed-7176121 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-71761212020-05-12 Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project Bollaerts, Kaatje Rekkas, Alexandros De Smedt, Tom Dodd, Caitlin Andrews, Nick Gini, Rosa PLoS One Research Article There is a strong and continuously growing interest in using large electronic healthcare databases to study health outcomes and the effects of pharmaceutical products. However, concerns regarding disease misclassification (i.e. classification errors of the disease status) and its impact on the study results are legitimate. Validation is therefore increasingly recognized as an essential component of database research. In this work, we elucidate the interrelations between the true prevalence of a disease in a database population (i.e. prevalence assuming no disease misclassification), the observed prevalence subject to disease misclassification, and the most common validity indices: sensitivity, specificity, positive and negative predictive value. Based on this, we obtained analytical expressions to derive all the validity indices and true prevalence from the observed prevalence and any combination of two other parameters. The analytical expressions can be used for various purposes. Most notably, they can be used to obtain an estimate of the observed prevalence adjusted for outcome misclassification from any combination of two validity indices and to derive validity indices from each other which would otherwise be difficult to obtain. To allow researchers to easily use the analytical expressions, we additionally developed a user-friendly and freely available web-application. Public Library of Science 2020-04-22 /pmc/articles/PMC7176121/ /pubmed/32320422 http://dx.doi.org/10.1371/journal.pone.0231333 Text en © 2020 Bollaerts et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Bollaerts, Kaatje Rekkas, Alexandros De Smedt, Tom Dodd, Caitlin Andrews, Nick Gini, Rosa Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project |
title | Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project |
title_full | Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project |
title_fullStr | Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project |
title_full_unstemmed | Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project |
title_short | Disease misclassification in electronic healthcare database studies: Deriving validity indices—A contribution from the ADVANCE project |
title_sort | disease misclassification in electronic healthcare database studies: deriving validity indices—a contribution from the advance project |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176121/ https://www.ncbi.nlm.nih.gov/pubmed/32320422 http://dx.doi.org/10.1371/journal.pone.0231333 |
work_keys_str_mv | AT bollaertskaatje diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject AT rekkasalexandros diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject AT desmedttom diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject AT doddcaitlin diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject AT andrewsnick diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject AT ginirosa diseasemisclassificationinelectronichealthcaredatabasestudiesderivingvalidityindicesacontributionfromtheadvanceproject |