Cargando…
Distributed gene clinical decision support system based on cloud computing
BACKGROUND: The clinical decision support system can effectively break the limitations of doctors’ knowledge and reduce the possibility of misdiagnosis to enhance health care. The traditional genetic data storage and analysis methods based on stand-alone environment are hard to meet the computationa...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245588/ https://www.ncbi.nlm.nih.gov/pubmed/30454054 http://dx.doi.org/10.1186/s12920-018-0415-1 |
_version_ | 1783372269684260864 |
---|---|
author | Xu, Bo Li, Changlong Zhuang, Hang Wang, Jiali Wang, Qingfeng Wang, Chao Zhou, Xuehai |
author_facet | Xu, Bo Li, Changlong Zhuang, Hang Wang, Jiali Wang, Qingfeng Wang, Chao Zhou, Xuehai |
author_sort | Xu, Bo |
collection | PubMed |
description | BACKGROUND: The clinical decision support system can effectively break the limitations of doctors’ knowledge and reduce the possibility of misdiagnosis to enhance health care. The traditional genetic data storage and analysis methods based on stand-alone environment are hard to meet the computational requirements with the rapid genetic data growth for the limited scalability. METHODS: In this paper, we propose a distributed gene clinical decision support system, which is named GCDSS. And a prototype is implemented based on cloud computing technology. At the same time, we present CloudBWA which is a novel distributed read mapping algorithm leveraging batch processing strategy to map reads on Apache Spark. RESULTS: Experiments show that the distributed gene clinical decision support system GCDSS and the distributed read mapping algorithm CloudBWA have outstanding performance and excellent scalability. Compared with state-of-the-art distributed algorithms, CloudBWA achieves up to 2.63 times speedup over SparkBWA. Compared with stand-alone algorithms, CloudBWA with 16 cores achieves up to 11.59 times speedup over BWA-MEM with 1 core. CONCLUSIONS: GCDSS is a distributed gene clinical decision support system based on cloud computing techniques. In particular, we incorporated a distributed genetic data analysis pipeline framework in the proposed GCDSS system. To boost the data processing of GCDSS, we propose CloudBWA, which is a novel distributed read mapping algorithm to leverage batch processing technique in mapping stage using Apache Spark platform. |
format | Online Article Text |
id | pubmed-6245588 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62455882018-11-26 Distributed gene clinical decision support system based on cloud computing Xu, Bo Li, Changlong Zhuang, Hang Wang, Jiali Wang, Qingfeng Wang, Chao Zhou, Xuehai BMC Med Genomics Research BACKGROUND: The clinical decision support system can effectively break the limitations of doctors’ knowledge and reduce the possibility of misdiagnosis to enhance health care. The traditional genetic data storage and analysis methods based on stand-alone environment are hard to meet the computational requirements with the rapid genetic data growth for the limited scalability. METHODS: In this paper, we propose a distributed gene clinical decision support system, which is named GCDSS. And a prototype is implemented based on cloud computing technology. At the same time, we present CloudBWA which is a novel distributed read mapping algorithm leveraging batch processing strategy to map reads on Apache Spark. RESULTS: Experiments show that the distributed gene clinical decision support system GCDSS and the distributed read mapping algorithm CloudBWA have outstanding performance and excellent scalability. Compared with state-of-the-art distributed algorithms, CloudBWA achieves up to 2.63 times speedup over SparkBWA. Compared with stand-alone algorithms, CloudBWA with 16 cores achieves up to 11.59 times speedup over BWA-MEM with 1 core. CONCLUSIONS: GCDSS is a distributed gene clinical decision support system based on cloud computing techniques. In particular, we incorporated a distributed genetic data analysis pipeline framework in the proposed GCDSS system. To boost the data processing of GCDSS, we propose CloudBWA, which is a novel distributed read mapping algorithm to leverage batch processing technique in mapping stage using Apache Spark platform. BioMed Central 2018-11-20 /pmc/articles/PMC6245588/ /pubmed/30454054 http://dx.doi.org/10.1186/s12920-018-0415-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Xu, Bo Li, Changlong Zhuang, Hang Wang, Jiali Wang, Qingfeng Wang, Chao Zhou, Xuehai Distributed gene clinical decision support system based on cloud computing |
title | Distributed gene clinical decision support system based on cloud computing |
title_full | Distributed gene clinical decision support system based on cloud computing |
title_fullStr | Distributed gene clinical decision support system based on cloud computing |
title_full_unstemmed | Distributed gene clinical decision support system based on cloud computing |
title_short | Distributed gene clinical decision support system based on cloud computing |
title_sort | distributed gene clinical decision support system based on cloud computing |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245588/ https://www.ncbi.nlm.nih.gov/pubmed/30454054 http://dx.doi.org/10.1186/s12920-018-0415-1 |
work_keys_str_mv | AT xubo distributedgeneclinicaldecisionsupportsystembasedoncloudcomputing AT lichanglong distributedgeneclinicaldecisionsupportsystembasedoncloudcomputing AT zhuanghang distributedgeneclinicaldecisionsupportsystembasedoncloudcomputing AT wangjiali distributedgeneclinicaldecisionsupportsystembasedoncloudcomputing AT wangqingfeng distributedgeneclinicaldecisionsupportsystembasedoncloudcomputing AT wangchao distributedgeneclinicaldecisionsupportsystembasedoncloudcomputing AT zhouxuehai distributedgeneclinicaldecisionsupportsystembasedoncloudcomputing |