Cargando…
Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory
BACKGROUND: The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that ca...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4036707/ https://www.ncbi.nlm.nih.gov/pubmed/24885806 http://dx.doi.org/10.1186/1756-0500-7-314 |
_version_ | 1782318188522897408 |
---|---|
author | Onsongo, Getiria Erdmann, Jesse Spears, Michael D Chilton, John Beckman, Kenneth B Hauge, Adam Yohe, Sophia Schomaker, Matthew Bower, Matthew Silverstein, Kevin A T Thyagarajan, Bharat |
author_facet | Onsongo, Getiria Erdmann, Jesse Spears, Michael D Chilton, John Beckman, Kenneth B Hauge, Adam Yohe, Sophia Schomaker, Matthew Bower, Matthew Silverstein, Kevin A T Thyagarajan, Bharat |
author_sort | Onsongo, Getiria |
collection | PubMed |
description | BACKGROUND: The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. FINDINGS: To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. CONCLUSIONS: We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software. |
format | Online Article Text |
id | pubmed-4036707 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40367072014-05-29 Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory Onsongo, Getiria Erdmann, Jesse Spears, Michael D Chilton, John Beckman, Kenneth B Hauge, Adam Yohe, Sophia Schomaker, Matthew Bower, Matthew Silverstein, Kevin A T Thyagarajan, Bharat BMC Res Notes Technical Note BACKGROUND: The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. FINDINGS: To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. CONCLUSIONS: We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software. BioMed Central 2014-05-23 /pmc/articles/PMC4036707/ /pubmed/24885806 http://dx.doi.org/10.1186/1756-0500-7-314 Text en Copyright © 2014 Onsongo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Technical Note Onsongo, Getiria Erdmann, Jesse Spears, Michael D Chilton, John Beckman, Kenneth B Hauge, Adam Yohe, Sophia Schomaker, Matthew Bower, Matthew Silverstein, Kevin A T Thyagarajan, Bharat Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory |
title | Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory |
title_full | Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory |
title_fullStr | Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory |
title_full_unstemmed | Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory |
title_short | Implementation of Cloud based Next Generation Sequencing data analysis in a clinical laboratory |
title_sort | implementation of cloud based next generation sequencing data analysis in a clinical laboratory |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4036707/ https://www.ncbi.nlm.nih.gov/pubmed/24885806 http://dx.doi.org/10.1186/1756-0500-7-314 |
work_keys_str_mv | AT onsongogetiria implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT erdmannjesse implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT spearsmichaeld implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT chiltonjohn implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT beckmankennethb implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT haugeadam implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT yohesophia implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT schomakermatthew implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT bowermatthew implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT silversteinkevinat implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory AT thyagarajanbharat implementationofcloudbasednextgenerationsequencingdataanalysisinaclinicallaboratory |