Cargando…

Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies

The use of primary care electronic health records for research is abundant. The benefits gained from utilising such records lies in their size, longitudinal data collection and data quality. However, the use of such data to undertake high quality epidemiological studies, can lead to significant chal...

Descripción completa

Detalles Bibliográficos
Autores principales: Gokhale, Krishna Margadhamane, Chandan, Joht Singh, Toulis, Konstantinos, Gkoutos, Georgios, Tino, Peter, Nirantharakumar, Krishnarajah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7987616/
https://www.ncbi.nlm.nih.gov/pubmed/32856160
http://dx.doi.org/10.1007/s10654-020-00677-6
_version_ 1783668648457535488
author Gokhale, Krishna Margadhamane
Chandan, Joht Singh
Toulis, Konstantinos
Gkoutos, Georgios
Tino, Peter
Nirantharakumar, Krishnarajah
author_facet Gokhale, Krishna Margadhamane
Chandan, Joht Singh
Toulis, Konstantinos
Gkoutos, Georgios
Tino, Peter
Nirantharakumar, Krishnarajah
author_sort Gokhale, Krishna Margadhamane
collection PubMed
description The use of primary care electronic health records for research is abundant. The benefits gained from utilising such records lies in their size, longitudinal data collection and data quality. However, the use of such data to undertake high quality epidemiological studies, can lead to significant challenges particularly in dealing with misclassification, variation in coding and the significant effort required to pre-process the data in a meaningful format for statistical analysis. In this paper, we describe a methodology to aid with the extraction and processing of such databases, delivered by a novel software programme; the “Data extraction for epidemiological research” (DExtER). The basis of DExtER relies on principles of extract, transform and load processes. The tool initially provides the ability for the healthcare dataset to be extracted, then transformed in a format whereby data is normalised, converted and reformatted. DExtER has a user interface designed to obtain data extracts specific to each research question and observational study design. There are facilities to input the requirements for; eligible study period, definition of exposed and unexposed groups, outcome measures and important baseline covariates. To date the tool has been utilised and validated in a multitude of settings. There have been over 35 peer-reviewed publications using the tool, and DExtER has been implemented as a validated public health surveillance tool for obtaining accurate statistics on epidemiology of key morbidities. Future direction of this work will be the application of the framework to linked as well as international datasets and the development of standardised methods for conducting electronic pre-processing and extraction from datasets for research purposes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s10654-020-00677-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-7987616
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-79876162021-04-12 Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies Gokhale, Krishna Margadhamane Chandan, Joht Singh Toulis, Konstantinos Gkoutos, Georgios Tino, Peter Nirantharakumar, Krishnarajah Eur J Epidemiol Methods The use of primary care electronic health records for research is abundant. The benefits gained from utilising such records lies in their size, longitudinal data collection and data quality. However, the use of such data to undertake high quality epidemiological studies, can lead to significant challenges particularly in dealing with misclassification, variation in coding and the significant effort required to pre-process the data in a meaningful format for statistical analysis. In this paper, we describe a methodology to aid with the extraction and processing of such databases, delivered by a novel software programme; the “Data extraction for epidemiological research” (DExtER). The basis of DExtER relies on principles of extract, transform and load processes. The tool initially provides the ability for the healthcare dataset to be extracted, then transformed in a format whereby data is normalised, converted and reformatted. DExtER has a user interface designed to obtain data extracts specific to each research question and observational study design. There are facilities to input the requirements for; eligible study period, definition of exposed and unexposed groups, outcome measures and important baseline covariates. To date the tool has been utilised and validated in a multitude of settings. There have been over 35 peer-reviewed publications using the tool, and DExtER has been implemented as a validated public health surveillance tool for obtaining accurate statistics on epidemiology of key morbidities. Future direction of this work will be the application of the framework to linked as well as international datasets and the development of standardised methods for conducting electronic pre-processing and extraction from datasets for research purposes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s10654-020-00677-6) contains supplementary material, which is available to authorized users. Springer Netherlands 2020-08-27 2021 /pmc/articles/PMC7987616/ /pubmed/32856160 http://dx.doi.org/10.1007/s10654-020-00677-6 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Methods
Gokhale, Krishna Margadhamane
Chandan, Joht Singh
Toulis, Konstantinos
Gkoutos, Georgios
Tino, Peter
Nirantharakumar, Krishnarajah
Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies
title Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies
title_full Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies
title_fullStr Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies
title_full_unstemmed Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies
title_short Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies
title_sort data extraction for epidemiological research (dexter): a novel tool for automated clinical epidemiology studies
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7987616/
https://www.ncbi.nlm.nih.gov/pubmed/32856160
http://dx.doi.org/10.1007/s10654-020-00677-6
work_keys_str_mv AT gokhalekrishnamargadhamane dataextractionforepidemiologicalresearchdexteranoveltoolforautomatedclinicalepidemiologystudies
AT chandanjohtsingh dataextractionforepidemiologicalresearchdexteranoveltoolforautomatedclinicalepidemiologystudies
AT touliskonstantinos dataextractionforepidemiologicalresearchdexteranoveltoolforautomatedclinicalepidemiologystudies
AT gkoutosgeorgios dataextractionforepidemiologicalresearchdexteranoveltoolforautomatedclinicalepidemiologystudies
AT tinopeter dataextractionforepidemiologicalresearchdexteranoveltoolforautomatedclinicalepidemiologystudies
AT nirantharakumarkrishnarajah dataextractionforepidemiologicalresearchdexteranoveltoolforautomatedclinicalepidemiologystudies