Cargando…
Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research
BACKGROUND: Using machine-learning techniques, clinical diagnostic model research extracts diagnostic models from patient data. Traditionally, patient data are often collected using electronic Case Report Form (eCRF) systems, while mathematical software is used for analyzing these data using machine...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Gunther Eysenbach
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288112/ https://www.ncbi.nlm.nih.gov/pubmed/25600863 http://dx.doi.org/10.2196/medinform.3251 |
_version_ | 1782351912467693568 |
---|---|
author | Installé, Arnaud JF Van den Bosch, Thierry De Moor, Bart Timmerman, Dirk |
author_facet | Installé, Arnaud JF Van den Bosch, Thierry De Moor, Bart Timmerman, Dirk |
author_sort | Installé, Arnaud JF |
collection | PubMed |
description | BACKGROUND: Using machine-learning techniques, clinical diagnostic model research extracts diagnostic models from patient data. Traditionally, patient data are often collected using electronic Case Report Form (eCRF) systems, while mathematical software is used for analyzing these data using machine-learning techniques. Due to the lack of integration between eCRF systems and mathematical software, extracting diagnostic models is a complex, error-prone process. Moreover, due to the complexity of this process, it is usually only performed once, after a predetermined number of data points have been collected, without insight into the predictive performance of the resulting models. OBJECTIVE: The objective of the study of Clinical Data Miner (CDM) software framework is to offer an eCRF system with integrated data preprocessing and machine-learning libraries, improving efficiency of the clinical diagnostic model research workflow, and to enable optimization of patient inclusion numbers through study performance monitoring. METHODS: The CDM software framework was developed using a test-driven development (TDD) approach, to ensure high software quality. Architecturally, CDM’s design is split over a number of modules, to ensure future extendability. RESULTS: The TDD approach has enabled us to deliver high software quality. CDM’s eCRF Web interface is in active use by the studies of the International Endometrial Tumor Analysis consortium, with over 4000 enrolled patients, and more studies planned. Additionally, a derived user interface has been used in six separate interrater agreement studies. CDM's integrated data preprocessing and machine-learning libraries simplify some otherwise manual and error-prone steps in the clinical diagnostic model research workflow. Furthermore, CDM's libraries provide study coordinators with a method to monitor a study's predictive performance as patient inclusions increase. CONCLUSIONS: To our knowledge, CDM is the only eCRF system integrating data preprocessing and machine-learning libraries. This integration improves the efficiency of the clinical diagnostic model research workflow. Moreover, by simplifying the generation of learning curves, CDM enables study coordinators to assess more accurately when data collection can be terminated, resulting in better models or lower patient recruitment costs. |
format | Online Article Text |
id | pubmed-4288112 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Gunther Eysenbach |
record_format | MEDLINE/PubMed |
spelling | pubmed-42881122015-01-15 Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research Installé, Arnaud JF Van den Bosch, Thierry De Moor, Bart Timmerman, Dirk JMIR Med Inform Original Paper BACKGROUND: Using machine-learning techniques, clinical diagnostic model research extracts diagnostic models from patient data. Traditionally, patient data are often collected using electronic Case Report Form (eCRF) systems, while mathematical software is used for analyzing these data using machine-learning techniques. Due to the lack of integration between eCRF systems and mathematical software, extracting diagnostic models is a complex, error-prone process. Moreover, due to the complexity of this process, it is usually only performed once, after a predetermined number of data points have been collected, without insight into the predictive performance of the resulting models. OBJECTIVE: The objective of the study of Clinical Data Miner (CDM) software framework is to offer an eCRF system with integrated data preprocessing and machine-learning libraries, improving efficiency of the clinical diagnostic model research workflow, and to enable optimization of patient inclusion numbers through study performance monitoring. METHODS: The CDM software framework was developed using a test-driven development (TDD) approach, to ensure high software quality. Architecturally, CDM’s design is split over a number of modules, to ensure future extendability. RESULTS: The TDD approach has enabled us to deliver high software quality. CDM’s eCRF Web interface is in active use by the studies of the International Endometrial Tumor Analysis consortium, with over 4000 enrolled patients, and more studies planned. Additionally, a derived user interface has been used in six separate interrater agreement studies. CDM's integrated data preprocessing and machine-learning libraries simplify some otherwise manual and error-prone steps in the clinical diagnostic model research workflow. Furthermore, CDM's libraries provide study coordinators with a method to monitor a study's predictive performance as patient inclusions increase. CONCLUSIONS: To our knowledge, CDM is the only eCRF system integrating data preprocessing and machine-learning libraries. This integration improves the efficiency of the clinical diagnostic model research workflow. Moreover, by simplifying the generation of learning curves, CDM enables study coordinators to assess more accurately when data collection can be terminated, resulting in better models or lower patient recruitment costs. Gunther Eysenbach 2014-10-20 /pmc/articles/PMC4288112/ /pubmed/25600863 http://dx.doi.org/10.2196/medinform.3251 Text en ©Arnaud JF Installé, Thierry Van den Bosch, Bart De Moor, Dirk Timmerman. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 20.10.2014. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Installé, Arnaud JF Van den Bosch, Thierry De Moor, Bart Timmerman, Dirk Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research |
title | Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research |
title_full | Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research |
title_fullStr | Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research |
title_full_unstemmed | Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research |
title_short | Clinical Data Miner: An Electronic Case Report Form System With Integrated Data Preprocessing and Machine-Learning Libraries Supporting Clinical Diagnostic Model Research |
title_sort | clinical data miner: an electronic case report form system with integrated data preprocessing and machine-learning libraries supporting clinical diagnostic model research |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288112/ https://www.ncbi.nlm.nih.gov/pubmed/25600863 http://dx.doi.org/10.2196/medinform.3251 |
work_keys_str_mv | AT installearnaudjf clinicaldatamineranelectroniccasereportformsystemwithintegrateddatapreprocessingandmachinelearninglibrariessupportingclinicaldiagnosticmodelresearch AT vandenboschthierry clinicaldatamineranelectroniccasereportformsystemwithintegrateddatapreprocessingandmachinelearninglibrariessupportingclinicaldiagnosticmodelresearch AT demoorbart clinicaldatamineranelectroniccasereportformsystemwithintegrateddatapreprocessingandmachinelearninglibrariessupportingclinicaldiagnosticmodelresearch AT timmermandirk clinicaldatamineranelectroniccasereportformsystemwithintegrateddatapreprocessingandmachinelearninglibrariessupportingclinicaldiagnosticmodelresearch |