Cargando…

Integrating Genomics and Clinical Data for Statistical Analysis by Using GEnome MINIng (GEMINI) and Fast Healthcare Interoperability Resources (FHIR): System Design and Implementation

BACKGROUND: The introduction of next-generation sequencing (NGS) into molecular cancer diagnostics has led to an increase in the data available for the identification and evaluation of driver mutations and for defining personalized cancer treatment regimens. The meaningful combination of omics data,...

Descripción completa

Detalles Bibliográficos
Autores principales: Gruendner, Julian, Wolf, Nicolas, Tögel, Lars, Haller, Florian, Prokosch, Hans-Ulrich, Christoph, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7578821/
https://www.ncbi.nlm.nih.gov/pubmed/33026356
http://dx.doi.org/10.2196/19879
_version_ 1783598449161142272
author Gruendner, Julian
Wolf, Nicolas
Tögel, Lars
Haller, Florian
Prokosch, Hans-Ulrich
Christoph, Jan
author_facet Gruendner, Julian
Wolf, Nicolas
Tögel, Lars
Haller, Florian
Prokosch, Hans-Ulrich
Christoph, Jan
author_sort Gruendner, Julian
collection PubMed
description BACKGROUND: The introduction of next-generation sequencing (NGS) into molecular cancer diagnostics has led to an increase in the data available for the identification and evaluation of driver mutations and for defining personalized cancer treatment regimens. The meaningful combination of omics data, ie, pathogenic gene variants and alterations with other patient data, to understand the full picture of malignancy has been challenging. OBJECTIVE: This study describes the implementation of a system capable of processing, analyzing, and subsequently combining NGS data with other clinical patient data for analysis within and across institutions. METHODS: On the basis of the already existing NGS analysis workflows for the identification of malignant gene variants at the Institute of Pathology of the University Hospital Erlangen, we defined basic requirements on an NGS processing and analysis pipeline and implemented a pipeline based on the GEMINI (GEnome MINIng) open source genetic variation database. For the purpose of validation, this pipeline was applied to data from the 1000 Genomes Project and subsequently to NGS data derived from 206 patients of a local hospital. We further integrated the pipeline into existing structures of data integration centers at the University Hospital Erlangen and combined NGS data with local nongenomic patient-derived data available in Fast Healthcare Interoperability Resources format. RESULTS: Using data from the 1000 Genomes Project and from the patient cohort as input, the implemented system produced the same results as already established methodologies. Further, it satisfied all our identified requirements and was successfully integrated into the existing infrastructure. Finally, we showed in an exemplary analysis how the data could be quickly loaded into and analyzed in KETOS, a web-based analysis platform for statistical analysis and clinical decision support. CONCLUSIONS: This study demonstrates that the GEMINI open source database can be augmented to create an NGS analysis pipeline. The pipeline generates high-quality results consistent with the already established workflows for gene variant annotation and pathological evaluation. We further demonstrate how NGS-derived genomic and other clinical data can be combined for further statistical analysis, thereby providing for data integration using standardized vocabularies and methods. Finally, we demonstrate the feasibility of the pipeline integration into hospital workflows by providing an exemplary integration into the data integration center infrastructure, which is currently being established across Germany.
format Online
Article
Text
id pubmed-7578821
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-75788212020-10-27 Integrating Genomics and Clinical Data for Statistical Analysis by Using GEnome MINIng (GEMINI) and Fast Healthcare Interoperability Resources (FHIR): System Design and Implementation Gruendner, Julian Wolf, Nicolas Tögel, Lars Haller, Florian Prokosch, Hans-Ulrich Christoph, Jan J Med Internet Res Original Paper BACKGROUND: The introduction of next-generation sequencing (NGS) into molecular cancer diagnostics has led to an increase in the data available for the identification and evaluation of driver mutations and for defining personalized cancer treatment regimens. The meaningful combination of omics data, ie, pathogenic gene variants and alterations with other patient data, to understand the full picture of malignancy has been challenging. OBJECTIVE: This study describes the implementation of a system capable of processing, analyzing, and subsequently combining NGS data with other clinical patient data for analysis within and across institutions. METHODS: On the basis of the already existing NGS analysis workflows for the identification of malignant gene variants at the Institute of Pathology of the University Hospital Erlangen, we defined basic requirements on an NGS processing and analysis pipeline and implemented a pipeline based on the GEMINI (GEnome MINIng) open source genetic variation database. For the purpose of validation, this pipeline was applied to data from the 1000 Genomes Project and subsequently to NGS data derived from 206 patients of a local hospital. We further integrated the pipeline into existing structures of data integration centers at the University Hospital Erlangen and combined NGS data with local nongenomic patient-derived data available in Fast Healthcare Interoperability Resources format. RESULTS: Using data from the 1000 Genomes Project and from the patient cohort as input, the implemented system produced the same results as already established methodologies. Further, it satisfied all our identified requirements and was successfully integrated into the existing infrastructure. Finally, we showed in an exemplary analysis how the data could be quickly loaded into and analyzed in KETOS, a web-based analysis platform for statistical analysis and clinical decision support. CONCLUSIONS: This study demonstrates that the GEMINI open source database can be augmented to create an NGS analysis pipeline. The pipeline generates high-quality results consistent with the already established workflows for gene variant annotation and pathological evaluation. We further demonstrate how NGS-derived genomic and other clinical data can be combined for further statistical analysis, thereby providing for data integration using standardized vocabularies and methods. Finally, we demonstrate the feasibility of the pipeline integration into hospital workflows by providing an exemplary integration into the data integration center infrastructure, which is currently being established across Germany. JMIR Publications 2020-10-07 /pmc/articles/PMC7578821/ /pubmed/33026356 http://dx.doi.org/10.2196/19879 Text en ©Julian Gruendner, Nicolas Wolf, Lars Tögel, Florian Haller, Hans-Ulrich Prokosch, Jan Christoph. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 07.10.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Gruendner, Julian
Wolf, Nicolas
Tögel, Lars
Haller, Florian
Prokosch, Hans-Ulrich
Christoph, Jan
Integrating Genomics and Clinical Data for Statistical Analysis by Using GEnome MINIng (GEMINI) and Fast Healthcare Interoperability Resources (FHIR): System Design and Implementation
title Integrating Genomics and Clinical Data for Statistical Analysis by Using GEnome MINIng (GEMINI) and Fast Healthcare Interoperability Resources (FHIR): System Design and Implementation
title_full Integrating Genomics and Clinical Data for Statistical Analysis by Using GEnome MINIng (GEMINI) and Fast Healthcare Interoperability Resources (FHIR): System Design and Implementation
title_fullStr Integrating Genomics and Clinical Data for Statistical Analysis by Using GEnome MINIng (GEMINI) and Fast Healthcare Interoperability Resources (FHIR): System Design and Implementation
title_full_unstemmed Integrating Genomics and Clinical Data for Statistical Analysis by Using GEnome MINIng (GEMINI) and Fast Healthcare Interoperability Resources (FHIR): System Design and Implementation
title_short Integrating Genomics and Clinical Data for Statistical Analysis by Using GEnome MINIng (GEMINI) and Fast Healthcare Interoperability Resources (FHIR): System Design and Implementation
title_sort integrating genomics and clinical data for statistical analysis by using genome mining (gemini) and fast healthcare interoperability resources (fhir): system design and implementation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7578821/
https://www.ncbi.nlm.nih.gov/pubmed/33026356
http://dx.doi.org/10.2196/19879
work_keys_str_mv AT gruendnerjulian integratinggenomicsandclinicaldataforstatisticalanalysisbyusinggenomemininggeminiandfasthealthcareinteroperabilityresourcesfhirsystemdesignandimplementation
AT wolfnicolas integratinggenomicsandclinicaldataforstatisticalanalysisbyusinggenomemininggeminiandfasthealthcareinteroperabilityresourcesfhirsystemdesignandimplementation
AT togellars integratinggenomicsandclinicaldataforstatisticalanalysisbyusinggenomemininggeminiandfasthealthcareinteroperabilityresourcesfhirsystemdesignandimplementation
AT hallerflorian integratinggenomicsandclinicaldataforstatisticalanalysisbyusinggenomemininggeminiandfasthealthcareinteroperabilityresourcesfhirsystemdesignandimplementation
AT prokoschhansulrich integratinggenomicsandclinicaldataforstatisticalanalysisbyusinggenomemininggeminiandfasthealthcareinteroperabilityresourcesfhirsystemdesignandimplementation
AT christophjan integratinggenomicsandclinicaldataforstatisticalanalysisbyusinggenomemininggeminiandfasthealthcareinteroperabilityresourcesfhirsystemdesignandimplementation