Cargando…
A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective
BACKGROUND: Current advancements in next-generation sequencing technology have made possible to sequence whole genome but assembling a large number of short sequence reads is still a big challenge. In this article, we present the comparative study of seven assemblers, namely, ABySS, Velvet, Edena, S...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5826002/ https://www.ncbi.nlm.nih.gov/pubmed/29511353 http://dx.doi.org/10.1177/1176934318758650 |
_version_ | 1783302273639645184 |
---|---|
author | Khan, Abdul Rafay Pervez, Muhammad Tariq Babar, Masroor Ellahi Naveed, Nasir Shoaib, Muhammad |
author_facet | Khan, Abdul Rafay Pervez, Muhammad Tariq Babar, Masroor Ellahi Naveed, Nasir Shoaib, Muhammad |
author_sort | Khan, Abdul Rafay |
collection | PubMed |
description | BACKGROUND: Current advancements in next-generation sequencing technology have made possible to sequence whole genome but assembling a large number of short sequence reads is still a big challenge. In this article, we present the comparative study of seven assemblers, namely, ABySS, Velvet, Edena, SGA, Ray, SSAKE, and Perga, using prokaryotic and eukaryotic paired-end as well as single-end data sets from Illumina platform. RESULTS: Results showed that in case of single-end data sets, Velvet and ABySS outperformed in all the seven assemblers with comparatively low assembling time and high genome fraction. Velvet consumed the least amount of memory than any other assembler. In case of paired-end data sets, Velvet consumed least amount of time and produced high genome fraction after ABySS and Ray. In terms of low memory usage, SGA and Edena outperformed in all the assemblers. Ray also showed good genome fraction; however, extremely high assembling time consumed by the Ray might make it prohibitively slow on larger data sets of single and paired-end data. CONCLUSIONS: Our comparison study will provide assistance to the scientists for selecting the suitable assembler according to their data sets and will also assist the developers to upgrade or develop a new assembler for de novo assembling. |
format | Online Article Text |
id | pubmed-5826002 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-58260022018-03-06 A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective Khan, Abdul Rafay Pervez, Muhammad Tariq Babar, Masroor Ellahi Naveed, Nasir Shoaib, Muhammad Evol Bioinform Online Original Research BACKGROUND: Current advancements in next-generation sequencing technology have made possible to sequence whole genome but assembling a large number of short sequence reads is still a big challenge. In this article, we present the comparative study of seven assemblers, namely, ABySS, Velvet, Edena, SGA, Ray, SSAKE, and Perga, using prokaryotic and eukaryotic paired-end as well as single-end data sets from Illumina platform. RESULTS: Results showed that in case of single-end data sets, Velvet and ABySS outperformed in all the seven assemblers with comparatively low assembling time and high genome fraction. Velvet consumed the least amount of memory than any other assembler. In case of paired-end data sets, Velvet consumed least amount of time and produced high genome fraction after ABySS and Ray. In terms of low memory usage, SGA and Edena outperformed in all the assemblers. Ray also showed good genome fraction; however, extremely high assembling time consumed by the Ray might make it prohibitively slow on larger data sets of single and paired-end data. CONCLUSIONS: Our comparison study will provide assistance to the scientists for selecting the suitable assembler according to their data sets and will also assist the developers to upgrade or develop a new assembler for de novo assembling. SAGE Publications 2018-02-20 /pmc/articles/PMC5826002/ /pubmed/29511353 http://dx.doi.org/10.1177/1176934318758650 Text en © The Author(s) 2018 http://www.creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Khan, Abdul Rafay Pervez, Muhammad Tariq Babar, Masroor Ellahi Naveed, Nasir Shoaib, Muhammad A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective |
title | A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective |
title_full | A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective |
title_fullStr | A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective |
title_full_unstemmed | A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective |
title_short | A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective |
title_sort | comprehensive study of de novo genome assemblers: current challenges and future prospective |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5826002/ https://www.ncbi.nlm.nih.gov/pubmed/29511353 http://dx.doi.org/10.1177/1176934318758650 |
work_keys_str_mv | AT khanabdulrafay acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT pervezmuhammadtariq acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT babarmasroorellahi acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT naveednasir acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT shoaibmuhammad acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT khanabdulrafay comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT pervezmuhammadtariq comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT babarmasroorellahi comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT naveednasir comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective AT shoaibmuhammad comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective |