Cargando…

Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory

BACKGROUND: The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Onsongo, Getiria, Erdmann, Jesse, Spears, Michael D, Chilton, John, Beckman, Kenneth B, Hauge, Adam, Yohe, Sophia, Schomaker, Matthew, Bower, Matthew, Silverstein, Kevin A T, Thyagarajan, Bharat
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4036707/
https://www.ncbi.nlm.nih.gov/pubmed/24885806
http://dx.doi.org/10.1186/1756-0500-7-314
_version_ 1782318188522897408
author Onsongo, Getiria
Erdmann, Jesse
Spears, Michael D
Chilton, John
Beckman, Kenneth B
Hauge, Adam
Yohe, Sophia
Schomaker, Matthew
Bower, Matthew
Silverstein, Kevin A T
Thyagarajan, Bharat
author_facet Onsongo, Getiria
Erdmann, Jesse
Spears, Michael D
Chilton, John
Beckman, Kenneth B
Hauge, Adam
Yohe, Sophia
Schomaker, Matthew
Bower, Matthew
Silverstein, Kevin A T
Thyagarajan, Bharat
author_sort Onsongo, Getiria
collection PubMed
description BACKGROUND: The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. FINDINGS: To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. CONCLUSIONS: We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software.
format Online
Article
Text
id pubmed-4036707
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40367072014-05-29 Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory Onsongo, Getiria Erdmann, Jesse Spears, Michael D Chilton, John Beckman, Kenneth B Hauge, Adam Yohe, Sophia Schomaker, Matthew Bower, Matthew Silverstein, Kevin A T Thyagarajan, Bharat BMC Res Notes Technical Note BACKGROUND: The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. FINDINGS: To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. CONCLUSIONS: We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software. BioMed Central 2014-05-23 /pmc/articles/PMC4036707/ /pubmed/24885806 http://dx.doi.org/10.1186/1756-0500-7-314 Text en Copyright © 2014 Onsongo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Note
Onsongo, Getiria
Erdmann, Jesse
Spears, Michael D
Chilton, John
Beckman, Kenneth B
Hauge, Adam
Yohe, Sophia
Schomaker, Matthew
Bower, Matthew
Silverstein, Kevin A T
Thyagarajan, Bharat
Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory
title Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory
title_full Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory
title_fullStr Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory
title_full_unstemmed Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory
title_short Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory
title_sort implementation of cloud based next generation sequencing data analysis in a clinical laboratory
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4036707/
https://www.ncbi.nlm.nih.gov/pubmed/24885806
http://dx.doi.org/10.1186/1756-0500-7-314
work_keys_str_mv AT onsongogetiria implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT erdmannjesse implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT spearsmichaeld implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT chiltonjohn implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT beckmankennethb implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT haugeadam implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT yohesophia implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT schomakermatthew implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT bowermatthew implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT silversteinkevinat implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory
AT thyagarajanbharat implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory