Cargando…

A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective

BACKGROUND: Current advancements in next-generation sequencing technology have made possible to sequence whole genome but assembling a large number of short sequence reads is still a big challenge. In this article, we present the comparative study of seven assemblers, namely, ABySS, Velvet, Edena, S...

Descripción completa

Detalles Bibliográficos
Autores principales: Khan, Abdul Rafay, Pervez, Muhammad Tariq, Babar, Masroor Ellahi, Naveed, Nasir, Shoaib, Muhammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5826002/
https://www.ncbi.nlm.nih.gov/pubmed/29511353
http://dx.doi.org/10.1177/1176934318758650
_version_ 1783302273639645184
author Khan, Abdul Rafay
Pervez, Muhammad Tariq
Babar, Masroor Ellahi
Naveed, Nasir
Shoaib, Muhammad
author_facet Khan, Abdul Rafay
Pervez, Muhammad Tariq
Babar, Masroor Ellahi
Naveed, Nasir
Shoaib, Muhammad
author_sort Khan, Abdul Rafay
collection PubMed
description BACKGROUND: Current advancements in next-generation sequencing technology have made possible to sequence whole genome but assembling a large number of short sequence reads is still a big challenge. In this article, we present the comparative study of seven assemblers, namely, ABySS, Velvet, Edena, SGA, Ray, SSAKE, and Perga, using prokaryotic and eukaryotic paired-end as well as single-end data sets from Illumina platform. RESULTS: Results showed that in case of single-end data sets, Velvet and ABySS outperformed in all the seven assemblers with comparatively low assembling time and high genome fraction. Velvet consumed the least amount of memory than any other assembler. In case of paired-end data sets, Velvet consumed least amount of time and produced high genome fraction after ABySS and Ray. In terms of low memory usage, SGA and Edena outperformed in all the assemblers. Ray also showed good genome fraction; however, extremely high assembling time consumed by the Ray might make it prohibitively slow on larger data sets of single and paired-end data. CONCLUSIONS: Our comparison study will provide assistance to the scientists for selecting the suitable assembler according to their data sets and will also assist the developers to upgrade or develop a new assembler for de novo assembling.
format Online
Article
Text
id pubmed-5826002
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-58260022018-03-06 A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective Khan, Abdul Rafay Pervez, Muhammad Tariq Babar, Masroor Ellahi Naveed, Nasir Shoaib, Muhammad Evol Bioinform Online Original Research BACKGROUND: Current advancements in next-generation sequencing technology have made possible to sequence whole genome but assembling a large number of short sequence reads is still a big challenge. In this article, we present the comparative study of seven assemblers, namely, ABySS, Velvet, Edena, SGA, Ray, SSAKE, and Perga, using prokaryotic and eukaryotic paired-end as well as single-end data sets from Illumina platform. RESULTS: Results showed that in case of single-end data sets, Velvet and ABySS outperformed in all the seven assemblers with comparatively low assembling time and high genome fraction. Velvet consumed the least amount of memory than any other assembler. In case of paired-end data sets, Velvet consumed least amount of time and produced high genome fraction after ABySS and Ray. In terms of low memory usage, SGA and Edena outperformed in all the assemblers. Ray also showed good genome fraction; however, extremely high assembling time consumed by the Ray might make it prohibitively slow on larger data sets of single and paired-end data. CONCLUSIONS: Our comparison study will provide assistance to the scientists for selecting the suitable assembler according to their data sets and will also assist the developers to upgrade or develop a new assembler for de novo assembling. SAGE Publications 2018-02-20 /pmc/articles/PMC5826002/ /pubmed/29511353 http://dx.doi.org/10.1177/1176934318758650 Text en © The Author(s) 2018 http://www.creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Khan, Abdul Rafay
Pervez, Muhammad Tariq
Babar, Masroor Ellahi
Naveed, Nasir
Shoaib, Muhammad
A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective
title A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective
title_full A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective
title_fullStr A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective
title_full_unstemmed A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective
title_short A Comprehensive Study of De Novo Genome Assemblers: Current Challenges and Future Prospective
title_sort comprehensive study of de novo genome assemblers: current challenges and future prospective
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5826002/
https://www.ncbi.nlm.nih.gov/pubmed/29511353
http://dx.doi.org/10.1177/1176934318758650
work_keys_str_mv AT khanabdulrafay acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT pervezmuhammadtariq acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT babarmasroorellahi acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT naveednasir acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT shoaibmuhammad acomprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT khanabdulrafay comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT pervezmuhammadtariq comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT babarmasroorellahi comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT naveednasir comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective
AT shoaibmuhammad comprehensivestudyofdenovogenomeassemblerscurrentchallengesandfutureprospective