Cargando…

Validation of UK Biobank data for mental health outcomes: A pilot study using secondary care electronic health records

UK Biobank (UKB) is widely employed to investigate mental health disorders and related exposures; however, its applicability and relevance in a clinical setting and the assumptions required have not been sufficiently and systematically investigated. Here, we present the first validation study using...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhenpeng, Kormilitzin, Andrey, Fernandes, Marco, Vaci, Nemanja, Liu, Qiang, Newby, Danielle, Goodday, Sarah, Smith, Tanya, Nevado-Holgado, Alejo J, Winchester, Laura
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Science Ireland Ltd 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8889024/
https://www.ncbi.nlm.nih.gov/pubmed/35168089
http://dx.doi.org/10.1016/j.ijmedinf.2022.104704
_version_ 1784661300178059264
author Li, Zhenpeng
Kormilitzin, Andrey
Fernandes, Marco
Vaci, Nemanja
Liu, Qiang
Newby, Danielle
Goodday, Sarah
Smith, Tanya
Nevado-Holgado, Alejo J
Winchester, Laura
author_facet Li, Zhenpeng
Kormilitzin, Andrey
Fernandes, Marco
Vaci, Nemanja
Liu, Qiang
Newby, Danielle
Goodday, Sarah
Smith, Tanya
Nevado-Holgado, Alejo J
Winchester, Laura
author_sort Li, Zhenpeng
collection PubMed
description UK Biobank (UKB) is widely employed to investigate mental health disorders and related exposures; however, its applicability and relevance in a clinical setting and the assumptions required have not been sufficiently and systematically investigated. Here, we present the first validation study using secondary care mental health data with linkage to UKB from Oxford - Clinical Record Interactive Search (CRIS) focusing on comparison of demographic information, diagnostic outcome, medication record and cognitive test results, with missing data and the implied bias from both resources depicted. We applied a natural language processing model to extract information embedded in unstructured text from clinical notes and attachments. Using a contingency table we compared the demographic information recorded in UKB and CRIS. We calculated the positive predictive value (PPV, proportion of true positives cases detected) for mental health diagnosis and relevant medication. Amongst the cohort of 854 subjects, PPVs for any mental health diagnosis for dementia, depression, bipolar disorder and schizophrenia were 41.6%, and were 59.5%, 12.5%, 50.0% and 52.6%, respectively. Self-reported medication records in UKB had general PPV of 47.0%, with the prevalence of frequently prescribed medicines to each typical mental health disorder considerably different from the information provided by CRIS. UKB is highly multimodal, but with limited follow-up records, whereas CRIS offers a longitudinal high-resolution clinical picture with more than ten years of observations. The linkage of both datasets will reduce the self-report bias and synergistically augment diverse modalities into a unified resource to facilitate more robust research in mental health.
format Online
Article
Text
id pubmed-8889024
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier Science Ireland Ltd
record_format MEDLINE/PubMed
spelling pubmed-88890242022-04-01 Validation of UK Biobank data for mental health outcomes: A pilot study using secondary care electronic health records Li, Zhenpeng Kormilitzin, Andrey Fernandes, Marco Vaci, Nemanja Liu, Qiang Newby, Danielle Goodday, Sarah Smith, Tanya Nevado-Holgado, Alejo J Winchester, Laura Int J Med Inform Article UK Biobank (UKB) is widely employed to investigate mental health disorders and related exposures; however, its applicability and relevance in a clinical setting and the assumptions required have not been sufficiently and systematically investigated. Here, we present the first validation study using secondary care mental health data with linkage to UKB from Oxford - Clinical Record Interactive Search (CRIS) focusing on comparison of demographic information, diagnostic outcome, medication record and cognitive test results, with missing data and the implied bias from both resources depicted. We applied a natural language processing model to extract information embedded in unstructured text from clinical notes and attachments. Using a contingency table we compared the demographic information recorded in UKB and CRIS. We calculated the positive predictive value (PPV, proportion of true positives cases detected) for mental health diagnosis and relevant medication. Amongst the cohort of 854 subjects, PPVs for any mental health diagnosis for dementia, depression, bipolar disorder and schizophrenia were 41.6%, and were 59.5%, 12.5%, 50.0% and 52.6%, respectively. Self-reported medication records in UKB had general PPV of 47.0%, with the prevalence of frequently prescribed medicines to each typical mental health disorder considerably different from the information provided by CRIS. UKB is highly multimodal, but with limited follow-up records, whereas CRIS offers a longitudinal high-resolution clinical picture with more than ten years of observations. The linkage of both datasets will reduce the self-report bias and synergistically augment diverse modalities into a unified resource to facilitate more robust research in mental health. Elsevier Science Ireland Ltd 2022-04 /pmc/articles/PMC8889024/ /pubmed/35168089 http://dx.doi.org/10.1016/j.ijmedinf.2022.104704 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Zhenpeng
Kormilitzin, Andrey
Fernandes, Marco
Vaci, Nemanja
Liu, Qiang
Newby, Danielle
Goodday, Sarah
Smith, Tanya
Nevado-Holgado, Alejo J
Winchester, Laura
Validation of UK Biobank data for mental health outcomes: A pilot study using secondary care electronic health records
title Validation of UK Biobank data for mental health outcomes: A pilot study using secondary care electronic health records
title_full Validation of UK Biobank data for mental health outcomes: A pilot study using secondary care electronic health records
title_fullStr Validation of UK Biobank data for mental health outcomes: A pilot study using secondary care electronic health records
title_full_unstemmed Validation of UK Biobank data for mental health outcomes: A pilot study using secondary care electronic health records
title_short Validation of UK Biobank data for mental health outcomes: A pilot study using secondary care electronic health records
title_sort validation of uk biobank data for mental health outcomes: a pilot study using secondary care electronic health records
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8889024/
https://www.ncbi.nlm.nih.gov/pubmed/35168089
http://dx.doi.org/10.1016/j.ijmedinf.2022.104704
work_keys_str_mv AT lizhenpeng validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT kormilitzinandrey validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT fernandesmarco validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT vacinemanja validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT liuqiang validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT newbydanielle validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT gooddaysarah validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT smithtanya validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT nevadoholgadoalejoj validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords
AT winchesterlaura validationofukbiobankdataformentalhealthoutcomesapilotstudyusingsecondarycareelectronichealthrecords